Qwen3.6-Plus Officially Available on Together AI, Accelerating Globalization of Tongyi Qianwen Ecosystem

The deployment of Chinese-developed large language models on overseas inference platforms continues to accelerate. In late April 2026, Qwen3.6-Plus officially went live on the Together AI platform, enabling developers to call the model directly via a standard OpenAI-compatible API without the need for self-deployment.

What Happened

Together AI is currently one of the largest third-party model inference aggregation platforms, providing developers with a unified API interface to access models from multiple vendors. The availability of Qwen3.6-Plus on the platform means:

Ready-to-Use: No GPUs or weight file configurations required; the model can be called directly via a standard API
Automatic Scaling: Together AI’s inference infrastructure automatically handles concurrency and load balancing
OpenAI-Compatible Interface: Existing code requires no modifications; simply switch the base_url and model name to integrate

Current Ecosystem Positioning of the Qwen3.6 Series

The Qwen3.6 series is the flagship model family released by Alibaba’s Tongyi Qianwen team in April 2026, encompassing multiple specifications:

Model Version	Parameters	Positioning	Key Features
Qwen3.6-35B-A3B	35B Total / 3B Active	Efficient Inference	MoE architecture, extremely low inference costs
Qwen3.6-27B	27B	Mid-Range All-Rounder	Optimal price-to-performance ratio
Qwen3.6-Plus	Undisclosed	Flagship	Comprehensive capabilities benchmarked against top-tier models

As the flagship model in the series, Qwen3.6-Plus has ranked in the global top ten in public evaluations such as the LMSys Chatbot Arena, demonstrating particularly outstanding performance in Chinese language understanding, code generation, and mathematical reasoning.

Why Deployment on Together AI Matters

Compared to previous distribution channels for the Qwen3.6 series, which primarily relied on Alibaba Cloud’s Bailian platform and Hugging Face, this launch on Together AI holds several key implications:

Lowered Access Barrier for Overseas Users: Together AI’s primary user base is concentrated in North America and Europe. The launch of Qwen3.6-Plus allows these developers to experience Chinese-developed models with zero friction
API Ecosystem Integration: Together AI supports hybrid orchestration of Qwen3.6-Plus alongside other models like Claude and GPT, facilitating multi-model workflow development
Commercial Signal: The willingness of a third-party inference platform to integrate and promote Qwen models indicates their competitive performance and cost-effectiveness

Competitive Landscape Assessment

Current deployment landscape of Chinese-developed models across mainstream inference platforms:

Platform	Integrated Chinese Models
Together AI	Qwen3.6-Plus, Qwen3.6-27B, DeepSeek V4
OpenRouter	Full Qwen3.6 series, DeepSeek V4, MiniMax
Groq	Qwen3.6-27B (Ultra-fast inference)
Alibaba Cloud Bailian	Full Qwen series (Exclusive access to latest)

While the Qwen series already boasts extensive coverage across third-party platforms, its availability on Together AI remains a landmark event—signifying recognition from a leading global inference aggregation platform regarding the capabilities of Chinese-developed models.

Actionable Recommendations

Existing Together AI Users: Directly call the Qwen/Qwen3.6-Plus model. Streaming outputs and tool calling are supported
Evaluating Model Selection: Consider adding Qwen3.6-Plus to your A/B testing candidate pool, especially for Chinese-language or multilingual tasks
Cost-Sensitive Scenarios: Test both Qwen3.6-27B and Qwen3.6-35B-A3B simultaneously. The latter may offer lower inference costs thanks to its MoE architecture

What Happened

Current Ecosystem Positioning of the Qwen3.6 Series

Why Deployment on Together AI Matters

Competitive Landscape Assessment

Actionable Recommendations

Related

MiniMax M2.7 Deep Dive: The Model That Trains Itself

DeepSeek V4 Pro API 75% Off, Unlocks 1M Context in Claude Code / OpenClaw

Moonshot AI Announces Kimi K3: 2.5 Trillion Parameters, Targeting Global Top-Tier Models