OpenClaw 4.24 Update: DeepSeek V4 Flash as Default Model, Real-Time Voice Fully Enabled

In late April, OpenClaw (NousResearch’s open-source AI Agent platform, ~105k GitHub Stars) released its 2026.4.24 version update. Three changes combined signal a redefinition of the Agent platform competitive landscape.

What Changed

1. DeepSeek V4 Flash as Default Model

New OpenClaw installations now default to DeepSeek V4 Flash instead of Claude Sonnet. V4 Pro is also available in the built-in model directory. This means new users can start using Agent capabilities at near-zero cost — community testing confirms V4 Flash’s cost-effectiveness makes it “fully sufficient” for daily Agent tasks.

2. Real-Time Voice Fully打通

The largest upgrade in 4.24: Talk, Voice Call, and Google Meet now support realtime voice loops with full OpenClaw Agent invocation during calls. AI no longer just “listens then speaks” — it can think and operate tools while listening.

Voice calls now include full Agent mid-call access: the Agent can query data, browse the web, or call external APIs without hanging up or switching interfaces.

3. Browser Automation: Pixel-Level Clicking

OpenClaw now supports viewport-coordinate-based clicking, enabling precise coordinate interaction on web pages. This covers scenarios where DOM element targeting falls short (Canvas, custom UI components).

4. WhatsApp Voice Message Transcription

The new version supports WhatsApp voice message transcription, expanding OpenClaw’s messaging integration capabilities.

Comparison

Capability	Before 4.24	After 4.24
Default model	Claude Sonnet	DeepSeek V4 Flash
Real-time voice	Limited support	All channels (Talk/Voice Call/Meet)
Agent in voice calls	Not supported	Real-time mid-call access
Browser clicking	DOM element targeting	New viewport pixel-level clicking
WhatsApp voice	Not supported	Transcription supported

Quick Start

If you already have OpenClaw installed:

# Update to latest version
openclaw update

# Check current version
openclaw --version  # Should show 2026.4.24

New users can install directly — DeepSeek V4 Flash is pre-configured as the default:

# Fresh install
openclaw install

Observations

Setting DeepSeek V4 Flash as the default model is a notable strategy: it dramatically lowers entry costs while reducing dependency on a single model provider. However, V4 Flash’s stability in complex Agent workflows still needs more real-world testing.

The integration of real-time voice with full Agent capabilities moves “natural language dialogue to control Agent workflows” from concept to usable. Whether this becomes mainstream in Agent interaction paradigms depends on further optimization of response latency and accuracy.

What Changed

1. DeepSeek V4 Flash as Default Model

2. Real-Time Voice Fully打通

3. Browser Automation: Pixel-Level Clicking

4. WhatsApp Voice Message Transcription

Comparison

Quick Start

Observations

Primary Sources

Related

awesome-codex-skills Gains 1,177 Stars in One Day, Codex Skills Ecosystem Is Exploding

AgentField: Managing AI Agents Like Pods — A New Player in AI-Native Infrastructure

Microsoft Open-Sources Agent Lightning: Zero-Intrusion RL Training Framework for Any AI Agent