Alibaba’s Qwen-3 Coder ‑480B‑A35B‑Instruct is a state-of-the-art open-source AI coding model that combines 480 billion parameters (35 B active) with extended context (up to 1 million tokens), outperforming rivals like GPT‑4o and DeepSeek in benchmark tasks. It is Alibaba’s most advanced AI coding model to date and a key asset in its broader push for open, agentic, enterprise-grade AI infrastructure.
Qwen-3 Coder – Key Points
Open-source launch
Released on July 23, 2025 under the Apache 2.0 license, Qwen3-Coder is available on HuggingFace, GitHub, Qwen Chat, Alibaba Cloud API, and other channels. Alibaba describes it as its most powerful agentic open-source coding model to date (VentureBeat, SCMP, Mint).
Model architecture
- Qwen3-Coder is a Mixture-of-Experts (MoE) model with 480B total parameters, 35B active per query, and 8 experts selected from 160.
- Features include 62 transformer layers, 96 attention heads, 8 key-value heads, and native support for 256K token context, extended to 1M tokens via YaRN positional extrapolation (VentureBeat, Mint).
Training details
The model is pretrained on 7.5 trillion tokens, ~70% of which are code, and further refined with:
Code RL (Reinforcement Learning) for verifiable execution tasks
Long-Horizon Agent RL for planning and adapting over multi-step interactions
Training involved a 20,000-environment simulation system on Alibaba Cloud (VentureBeat).
Performance and benchmarks
- On SWE-bench Verified, it scored 67.0% (standard) and 69.6% (500-turn).
- The related Qwen3-235B-A22B-2507-FP8 scored 70.3 on the 2025 American Invitational Mathematics Examination, beating DeepSeek‑V3 (46.6) and GPT‑4o (26.7).
- On the MultiPL-E coding benchmark, Qwen scored 87.9, ahead of DeepSeek (82.2) and GPT‑4o (82.7), trailing only Claude Opus 4 (88.5) (SCMP, Mint).
Tooling & integration
- Qwen Code CLI (open-sourced alongside the model) is forked from Gemini Code. It includes structured prompting, custom function call protocols, and Node.js support.
- Integrates with: Claude Code (DashScope), Cline, Ollama, LMStudio, MLX‑LM, llama.cpp, and KTransformers.
- Works via OpenAI-compatible APIs and can be deployed locally or via Alibaba Cloud (VentureBeat, Mint).
Enterprise adoption & cost
- As a fully open-source model, Qwen3-Coder can be self-hosted with no license fees, enabling vendor-neutral deployments.
- Alternatively, use via Alibaba Cloud costs:
$1/$5 per million tokens (up to 32K)
$1.8/$9 (128K), $3/$15 (256K), $6/$60 (1M)
(VentureBeat, Medium).
Early feedback
- Sebastian Raschka: “Best coding model yet.”
- Wolfram Ravenwolf: “This is surely the best one currently.”
- Kevin Nelson: “Qwen3 Coder is on another level.”
- Jack Dorsey: “Goose + qwen3-coder = wow,” referring to integration with his open-source agent framework Goose (VentureBeat).
Future developments
- Smaller Qwen3-Coder variants are in development to reduce inference costs without sacrificing performance.
- 3B parameter variant will power HP’s smart assistant ‘Xiaowei Hui’ in China, handling document generation and meeting summarization tasks (SCMP).
- Qwen team is also exploring self-improvement, aiming to evolve coding agents into autonomous, adaptive systems (Mint).
Why This Matters
- Enterprise-ready: Qwen3-Coder provides powerful long-context and coding-focused capabilities without the costs or restrictions of closed-source competitors.
- Benchmark leader: Outperforming GPT-4o and DeepSeek in rigorous mathematical and coding tests underscores Alibaba’s growing AI dominance.
- Productivity gains: Agentic capabilities and tool integration offer hands-free, intelligent handling of real-world software development tasks.
- Open-source edge: Companies can fully control infrastructure, scale usage, and customize the model—key advantages in regulated or security-sensitive industries.
Claude Artifacts let anyone build interactive AI apps without coding. Now shareable, responsive, and cross-platform—perfect for non-developers.
AWS is developing Kiro, a sophisticated AI coding tool aimed at transforming software development by integrating real-time code generation, multimodal interfaces, and AI agent collaboration.
Read a comprehensive monthly roundup of the latest AI news!






