C
ChaoBro

CLI-Anything: HKUDS Wants to Make All Software Agent-Native, the Architecture Revolution Behind 35K Stars

CLI-Anything: HKUDS Wants to Make All Software Agent-Native, the Architecture Revolution Behind 35K Stars

Most AI Agent tools face the same problem: every software needs custom adaptation.

Want your Agent to operate Photoshop? Write a dedicated plugin. Excel? Write another. An internal system? Write yet another.

CLI-Anything's approach: stop writing plugins.

It lets AI Agents directly operate any software through the command line interface (CLI). No API needed. No SDK. No custom adapter layer. As long as a software has a CLI—or can be wrapped with one—the Agent can operate it.

35,515 stars. 3,473 forks. From the Hong Kong University Data Science Lab (HKUDS).

What "Agent-Native" Means

First, let's be clear about the concept.

"Agent-Native" doesn't mean "AI-supported." Many software products claim "AI support" by adding a chatbox or an AI feature.

Agent-Native means: the software's architecture is designed from the ground up so that AI Agents can fully and autonomously operate it—understanding its state, calling its functions, handling its output.

CLI-Anything's core argument is: CLI is the most natural Agent-Native interface.

Why?

  • CLI has clear input/output formats
  • CLI operations are composable (pipes, scripts)
  • CLI doesn't require GUI parsing—100x simpler for AI Agents than graphical interfaces
  • CLI is standardized (flags, parameters, stdin/stdout/stderr)

GUIs are designed for humans. Command lines are designed for machines. AI Agents are essentially machines, so CLI is their most natural interaction method.

How the Architecture Works

CLI-Anything's core is a CLI Hub—a centralized layer for managing CLI tool registration and dispatch.

The flow is roughly:

  1. Registration: Register any CLI tool into the Hub, describing its functions, parameters, output format
  2. Understanding: AI Agent receives natural language instructions, translates them into CLI call sequences
  3. Execution: Hub dispatches CLI execution, captures output
  4. Feedback: Results return to the Agent, which decides the next step

The key innovation is automated registration and description generation. You don't need to manually write documentation for each CLI tool—CLI-Anything can automatically analyze the CLI's help output, man pages, and even actual execution samples to generate structured function descriptions.

This solves the biggest pain point in the Agent tool ecosystem: adaptation cost.

Comparison with Existing Solutions

MCP (Model Context Protocol): Anthropic's standard for letting AI models call external tools. But MCP requires each tool developer to write an MCP Server. CLI-Anything doesn't need that—any existing CLI tool works out of the box.

LangChain Tools: Requires Python code to define tools. CLI-Anything is language-agnostic—CLI itself is cross-language.

Function Calling: Requires API endpoints. CLI-Anything doesn't need APIs—local CLI is enough.

RPA Tools: Operate GUIs, slow and fragile. CLI-Anything operates command lines directly, fast and stable.

In short: MCP/LangChain/Function Calling require "custom adapter development," CLI-Anything says "stop developing, your existing CLI is enough."

The Numbers

35,515 stars puts it in the top tier of GitHub AI tools. But more importantly, look at forks and contributor activity.

3,473 forks means many people are building their own integrations on top of it. Commits are driven by multiple contributors (yuh-yang, AiMiDi, omerarslan0, etc.), not a single maintainer.

The HKUDS (Hong Kong University Data Science Lab) background also adds academic credibility. This isn't a weekend project—it's backed by a research team.

Limitations and Risks

First, CLI isn't universal. Many software products lack CLI interfaces, or have limited CLI functionality. Graphical software like Figma and Photoshop have weak CLI support. CLI-Anything can't help there.

Second, security. Giving AI Agents direct CLI execution is essentially giving them shell access. If an Agent gets injected with malicious commands, the consequences are serious. The project needs robust sandboxing mechanisms.

Third, complex operation orchestration. Simple git commit is fine, but operations involving multiple steps, conditional branching, and human confirmation—can CLI-Anything's orchestration handle them? This needs real-world testing.

My Take

CLI-Anything makes a long-overlooked observation: AI Agents don't need perfect APIs—they can understand CLI just fine.

The value of this approach is that it lowers the barrier to entry for the Agent ecosystem. Not every software needs to develop an AI interface—CLI is enough. And CLI has existed for decades; almost every dev tool, ops tool, and data processing tool has one.

If this paradigm holds, AI Agents' applicability expands from "specifically adapted software" to "almost anything with a CLI."

That's not a linear improvement. It's exponential.


Main sources: