Qwen3.6-Plus Officially Available on Together AI, Accelerating Globalization of Tongyi Qianwen Ecosystem

Qwen3.6-Plus Officially Available on Together AI, Accelerating Globalization of Tongyi Qianwen Ecosystem

The deployment of Chinese-developed large language models on overseas inference platforms continues to accelerate. In late April 2026, Qwen3.6-Plus officially went live on the Together AI platform, enabling developers to call the model directly via a standard OpenAI-compatible API without the need for self-deployment.

What Happened

Together AI is currently one of the largest third-party model inference aggregation platforms, providing developers with a unified API interface to access models from multiple vendors. The availability of Qwen3.6-Plus on the platform means:

  • Ready-to-Use: No GPUs or weight file configurations required; the model can be called directly via a standard API
  • Automatic Scaling: Together AI’s inference infrastructure automatically handles concurrency and load balancing
  • OpenAI-Compatible Interface: Existing code requires no modifications; simply switch the base_url and model name to integrate

Current Ecosystem Positioning of the Qwen3.6 Series

The Qwen3.6 series is the flagship model family released by Alibaba’s Tongyi Qianwen team in April 2026, encompassing multiple specifications:

Model VersionParametersPositioningKey Features
Qwen3.6-35B-A3B35B Total / 3B ActiveEfficient InferenceMoE architecture, extremely low inference costs
Qwen3.6-27B27BMid-Range All-RounderOptimal price-to-performance ratio
Qwen3.6-PlusUndisclosedFlagshipComprehensive capabilities benchmarked against top-tier models

As the flagship model in the series, Qwen3.6-Plus has ranked in the global top ten in public evaluations such as the LMSys Chatbot Arena, demonstrating particularly outstanding performance in Chinese language understanding, code generation, and mathematical reasoning.

Why Deployment on Together AI Matters

Compared to previous distribution channels for the Qwen3.6 series, which primarily relied on Alibaba Cloud’s Bailian platform and Hugging Face, this launch on Together AI holds several key implications:

  1. Lowered Access Barrier for Overseas Users: Together AI’s primary user base is concentrated in North America and Europe. The launch of Qwen3.6-Plus allows these developers to experience Chinese-developed models with zero friction
  2. API Ecosystem Integration: Together AI supports hybrid orchestration of Qwen3.6-Plus alongside other models like Claude and GPT, facilitating multi-model workflow development
  3. Commercial Signal: The willingness of a third-party inference platform to integrate and promote Qwen models indicates their competitive performance and cost-effectiveness

Competitive Landscape Assessment

Current deployment landscape of Chinese-developed models across mainstream inference platforms:

PlatformIntegrated Chinese Models
Together AIQwen3.6-Plus, Qwen3.6-27B, DeepSeek V4
OpenRouterFull Qwen3.6 series, DeepSeek V4, MiniMax
GroqQwen3.6-27B (Ultra-fast inference)
Alibaba Cloud BailianFull Qwen series (Exclusive access to latest)

While the Qwen series already boasts extensive coverage across third-party platforms, its availability on Together AI remains a landmark event—signifying recognition from a leading global inference aggregation platform regarding the capabilities of Chinese-developed models.

Actionable Recommendations

  • Existing Together AI Users: Directly call the Qwen/Qwen3.6-Plus model. Streaming outputs and tool calling are supported
  • Evaluating Model Selection: Consider adding Qwen3.6-Plus to your A/B testing candidate pool, especially for Chinese-language or multilingual tasks
  • Cost-Sensitive Scenarios: Test both Qwen3.6-27B and Qwen3.6-35B-A3B simultaneously. The latter may offer lower inference costs thanks to its MoE architecture