C
ChaoBro

Grok 4.3 Silent Launch: AA Intelligence Index Score of 53, Input Price Slashed 40%

Grok 4.3 Silent Launch: AA Intelligence Index Score of 53, Input Price Slashed 40%

Core Conclusion

xAI released a heavyweight model in the most xAI way possible: no press conference, no blog post, just dropped it directly in the API.

Grok 4.3 has quietly gone live on platforms like Venice, supporting 1 million token context, function calling, multimodal input, and native X search. It achieved a score of 53 on the Artificial Analysis Intelligence Index, surpassing Muse Spark, Claude Sonnet 4.6, and previous Grok iterations. API pricing was adjusted simultaneously: input dropped from $2.10 to $1.25/M tokens (40% cut), output cut by 60%.

Benchmark Performance

Artificial Analysis Intelligence Index

ModelAA IndexNotes
GPT-5.5 Pro~60+Current leader
Grok 4.353Surpassed Muse Spark, Sonnet 4.6
Muse Spark<53Surpassed by Grok 4.3
Claude Sonnet 4.6<53Surpassed by Grok 4.3
Gemini 3.1 Pro~50Close to Grok 4.3

Vals Index Rankings

BenchmarkGrok 4.3 RankNotes
Overall#13Above average
CaseLaw#1Top-tier legal reasoning
CorpFin#1Top-tier corporate finance analysis
General CodingWeakNot a strength

GDPval-AA Benchmark

Grok 4.3’s most significant improvement is in real-world Agent tasks — on the GDPval-AA benchmark, Grok 4.3’s agentic capability score increased substantially. This is the core metric for measuring “can AI complete tasks independently.”

Pricing Strategy Analysis

ItemGrok 4.3Change
Input Price$1.25/M tokens↓ 40%
Output PriceSignificantly reduced↓ 60%
Context Window1M tokensSame as previous

This pricing strategy is extremely aggressive. The $1.25/M token input price is already lower than most mid-tier models, yet Grok 4.3’s performance sits in the top tier. xAI is clearly pursuing a “cost-performance route” — delivering near Claude Opus 4.7 performance at prices approaching DeepSeek V4.

Horizontal Comparison with Competitors

DimensionGrok 4.3Claude Sonnet 4.6GPT-5.5DeepSeek V4
AA Index53<53~60+N/A
Input Price$1.25/M~$3/M~$5/M~$0.15/M
Legal Reasoning#1StrongStrongMedium
Financial Analysis#1StrongStrongMedium
General CodingWeakStrongStrongStrong
Agent CapabilitySignificantly improvedStrongStrongStrong

Landscape Assessment

Grok 4.3’s release signals several things:

  1. xAI is transitioning from “chaser” to “cost-performance leader”: An AA index of 53 with $1.25 pricing delivers far better value than Claude and GPT
  2. Clear advantage in specialized domains: #1 rankings in CaseLaw and CorpFin indicate Grok 4.3 has unique advantages in legal and finance verticals
  3. Silent launch shows xAI prioritizes product over marketing: This is both a strength (pragmatic) and weakness (low visibility)

How to Use This

  • Legal/Finance professionals: Grok 4.3’s #1 rankings in CaseLaw and CorpFin are worth attention — potentially the most cost-effective specialized model choice
  • API users: $1.25/M input pricing + 53-point performance makes this the cheapest option among first-tier models
  • Agent developers: The substantial improvement on GDPval-AA means Grok 4.3’s reliability in Agent scenarios has increased significantly — worth testing