C
ChaoBro

CLI-Anything: The HKU Team Wants to Make All Software Agent-Native, and It's Not Just Hype

"Making ALL Software Agent-Native."

When you see this written in a GitHub project's README, your first reaction might be: just another overpromising project.

But CLI-Anything has already earned 38,429 stars on GitHub, and its approach is highly pragmatic—it's not reinventing software, but rather wrapping existing applications in an "Agent-understandable shell."

What is Agent-Native

Let's clarify this term first.

Agent-Native doesn't mean rewriting the software itself. Rather, it means any software should be directly operable by AI agents, without humans acting as intermediaries or translators.

Currently, if you want an AI agent to edit photos in Photoshop, perform map analysis in QGIS, or edit audio in Audacity—it's basically impossible. That's because these software interfaces are designed for human fingers and eyes, not for agent API calls.

CLI-Anything's approach is to write an agent-harness (agent adapter) for each application. This adapter standardizes the software's CLI interface, allowing AI agents to invoke any software in a unified manner.

What They've Already Built

Looking at the repository's directory structure, CLI-Anything already has a substantial number of ready-to-use agent-harnesses:

  • 3MF —— 3D model processing
  • QGIS —— Geographic Information System
  • AdGuard Home —— Network ad filtering
  • Audacity —— Audio editing
  • Blender —— 3D modeling and rendering

Each harness does one thing: wraps the software's command-line interface into a standard format that agents can understand and invoke.

For example, Audacity's agent-harness enables an AI agent to execute a sequence like "open this audio file → apply noise reduction → export as WAV" without requiring manual menu clicks.

Why CLI is the Key

You might ask: Why CLI? Why not automate the GUI?

The reason is simple: CLI is structured, deterministic, and scriptable. GUI automation (relying on image recognition or coordinate clicking) is too fragile—move a button and it breaks, update the UI and it fails.

CLI is the backdoor software leaves for machines. CLI-Anything simply standardizes this backdoor.

CLI-Hub: Ecosystem Ambitions

The project also features a companion website clianything.cc, called CLI-Hub. It's a shared platform for agent-harnesses—similar to Docker Hub, but instead of containers, it hosts agent adapters for software.

If this idea takes off, it will create an interesting ecosystem loop:

  1. Someone writes an agent-harness for a new software
  2. Uploads it to CLI-Hub
  3. Others install it, enabling their AI agents to operate the software
  4. More users → more contributors → more software supported

This closely mirrors the rise of npm/Docker Hub. The difference is that this time, it's not about managing code/containers, but managing software operability.

Academic Background

CLI-Anything comes from the Data Science Research Team at The University of Hong Kong (HKUDS). This isn't a weekend side project, but a research initiative backed by academic papers.

The advantage of academic teams building open-source projects is rigorous methodology and standardized documentation. The downside is that the update pace sometimes lags behind commercial projects.

Judging by the commit history, the project is quite active (687 commits, with updates as recent as 3 hours ago), indicating sustained team commitment.

Limitations and Challenges

CLI-Anything is not a silver bullet:

  • Only covers software with a CLI——many consumer-grade applications lack command-line interfaces entirely
  • Harnesses require manual development——although the framework provides assistance, CLI usage varies greatly across software, making automated harness generation unrealistic for now
  • Complex operations still face limitations——simple tasks (format conversion, batch processing) work well, but agents may still struggle with software features requiring complex interactions

Why It's Worth Watching

What makes CLI-Anything most noteworthy isn't what it can do today, but the direction it points toward: software "agentability" is becoming a new dimension in software design.

In the future, evaluating whether software is "good" may no longer just depend on UI aesthetics or feature count, but on how efficiently it can be operated by AI agents.

This is similar to how "API completeness" became a standard for evaluating SaaS products years ago. CLI-Anything is building the infrastructure for this emerging trend.


Primary Sources: