Alibaba Releases Trillion-Parameter Flagship Model Qwen3-Max-Preview: Ushering in a New Era for China's AI Technology
News Summary
On September 5, 2025, Alibaba released its first trillion-parameter large language model, Qwen3-Max-Preview. This flagship model, the largest in the Qwen series by parameter scale, marks a significant breakthrough for Chinese AI technology in the field of ultra-large-scale models. The model adopts a non-reasoning architecture, demonstrates excellent performance in multiple benchmark tests, supports over 100 languages, and can be tried for free via the Qwen Chat web interface or called through the Alibaba Cloud Bailian Platform API.
Technical Specifications and Innovative Breakthroughs
As the flagship preview version of the Qwen3 series, Qwen3-Max-Preview boasts over a trillion parameters, making it one of the largest known models with an open API. The model employs a Mixture-of-Experts (MoE) architecture: only a subset of expert modules is activated for each input, which significantly reduces inference cost while preserving the capacity of the full parameter count.
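The MoE routing idea can be illustrated with a minimal sketch. Note that everything below (expert count, top-k value, dimensions, and the routing scheme itself) is a generic textbook-style example, not Alibaba's actual implementation, which has not been publicly detailed.

```python
import numpy as np

def moe_forward(x, gate_w, expert_ws, k=2):
    """Route one token vector to its top-k experts (illustrative only).

    x:         (d,) token hidden state
    gate_w:    (d, n_experts) router weights
    expert_ws: list of (d, d) expert weight matrices
    """
    logits = x @ gate_w                      # router score for each expert
    top_k = np.argsort(logits)[-k:]          # indices of the k best experts
    weights = np.exp(logits[top_k])
    weights /= weights.sum()                 # softmax over the selected experts only
    # Only k experts actually run, so compute scales with k, not n_experts.
    return sum(w * (x @ expert_ws[i]) for w, i in zip(weights, top_k))

rng = np.random.default_rng(0)
d, n_experts = 8, 4
x = rng.standard_normal(d)
gate_w = rng.standard_normal((d, n_experts))
expert_ws = [rng.standard_normal((d, d)) for _ in range(n_experts)]
y = moe_forward(x, gate_w, expert_ws, k=2)
print(y.shape)  # (8,)
```

The key point is the last comment: with a trillion total parameters but only a few experts active per token, the per-token compute is a small fraction of what a dense model of the same size would require.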
Notably, Qwen3-Max-Preview uses a non-reasoning architecture, yet through optimized design it still achieves significant gains in reasoning ability. The model supports a context window of over 256K tokens, enabling it to handle long documents, complex conversations, and large code files.
Leading Performance Across the Board
According to officially released benchmark test results, Qwen3-Max-Preview performed exceptionally well in several authoritative evaluations:
- Math Reasoning (AIME25): Scored 80.6%, an outstanding result for a non-reasoning model.
- Programming Ability (LiveCodeBench v6): Scored 57.6%, significantly enhancing programming assistance capabilities.
- General Knowledge (SuperGPQA): Demonstrated strong comprehension in general knowledge Q&A.
- Human Preference Alignment (Arena-Hard v2): Showcased excellent performance in following complex instructions.
Test results indicate that Qwen3-Max-Preview surpassed mainstream domestic and international models such as Claude Opus 4 (non-thinking mode), Kimi K2, and DeepSeek-V3.1 on multiple metrics, demonstrating the continued effectiveness of scaling up model size.
Comprehensive Core Capability Upgrades
Compared to the previous Qwen2.5 series, Qwen3-Max-Preview has achieved substantial improvements in the following dimensions:
Language Understanding and Generation: Supports over 100 languages, with excellent Chinese and English comprehension, and significantly improved multilingual translation quality.
Reasoning and Instruction Execution: Greatly enhanced accuracy in complex logical reasoning, improved understanding and execution of complex instructions, and significantly reduced model hallucination phenomena.
Tool Calling Optimization: Specifically optimized for Retrieval-Augmented Generation (RAG) and tool calling, laying the foundation for building powerful AI Agent applications.
Long-tail Knowledge Coverage: More comprehensive coverage of professional domain knowledge, with simultaneous improvements in knowledge breadth and robustness.
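Since the article later notes that the model is called through an OpenAI-compatible API format, tool calling would presumably follow the OpenAI function-calling schema. The sketch below builds such a request payload; the tool name `get_weather`, its parameters, and the model id are hypothetical examples, not taken from official documentation.

```python
import json

# Hypothetical tool schema in the OpenAI function-calling format.
weather_tool = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}

# Request body: the model sees the tool schema and may respond with a
# structured tool call instead of plain text.
payload = {
    "model": "qwen3-max-preview",  # assumed model id; check the Bailian docs
    "messages": [{"role": "user", "content": "What's the weather in Hangzhou?"}],
    "tools": [weather_tool],
}

print(json.dumps(payload, ensure_ascii=False)[:60])
```

An agent loop would execute the returned tool call locally, append the result as a `tool` message, and ask the model to continue, which is the pattern the RAG and tool-calling optimizations mentioned above are meant to support.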
Application Scenarios and Commercial Value
The release of Qwen3-Max-Preview brings new application possibilities to various industries:
Enterprise Document Processing: Its ultra-long context enables complex tasks such as analyzing lengthy enterprise documents and contracts.
Intelligent Programming Assistant: Powerful code comprehension and generation capabilities can provide high-quality programming assistance for developers.
Multilingual Customer Service: Supports over 100 languages, offering unified multilingual customer service solutions for multinational enterprises.
Content Creation and Strategy Planning: Demonstrates strong potential in creative writing, advertising copywriting, strategic analysis, and other fields.
Business Strategy and Market Positioning
Unlike its earlier open-source releases, Alibaba has chosen to ship Qwen3-Max-Preview as a closed-source commercial model. Its pricing is comparable to that of Claude and GPT-4 but somewhat lower, reflecting Alibaba's confidence in the model's performance.
Users can experience the model in two ways: first, through the Qwen Chat (chat.qwen.ai) web interface for free, and second, through API calls on the Alibaba Cloud Bailian Platform for commercial use. The platform also provides new users with a trial quota of 1 million free tokens per model.
Technical Significance and Industry Impact
The release of Qwen3-Max-Preview holds significant technical importance and industry value:
Technical Breakthrough: The trillion-parameter scale demonstrates the technical strength of Chinese AI enterprises in developing ultra-large-scale models, marking a major breakthrough for China in the field of foundational AI models.
Industry Promotion: Provides world-class foundational model support for domestic AI application developers, helping to drive the overall development of China's AI industry.
Competitive Landscape: In the global AI model competition, Chinese enterprises are narrowing the gap with international leaders and even achieving leadership in certain metrics.
Future Outlook
As a preview version, Qwen3-Max-Preview's capabilities are still being refined. Alibaba states that the official version will bring more surprises, and the philosophy of "Scaling works" will continue to guide the model's development direction.
The model's release comes at a critical time of change in the international AI environment, showcasing China's determination and strength in independent AI innovation. With more user experience and feedback, Qwen3-Max-Preview is expected to play a greater role in enterprise-level applications, injecting new impetus into the development of China's AI industry.
The model's API is currently available on the Alibaba Cloud Bailian Platform. Developers can integrate it quickly via the OpenAI-compatible API format, and it is expected to see wide use in fields such as intelligent customer service, content creation, and programming assistance.
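An OpenAI-format integration might look like the following sketch, which builds (but does not send) a chat-completion request using only the standard library. The base URL shown is the commonly documented DashScope compatible-mode endpoint, and the model id is an assumption; both should be verified against the current Bailian documentation.

```python
import json
import os
import urllib.request

# Assumed compatible-mode endpoint; verify against the Bailian docs.
BASE_URL = "https://dashscope.aliyuncs.com/compatible-mode/v1"

def build_chat_request(prompt: str) -> urllib.request.Request:
    """Build (but do not send) an OpenAI-format chat-completion request."""
    body = json.dumps({
        "model": "qwen3-max-preview",  # assumed model id
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=body,
        headers={
            # API key comes from the environment; "sk-demo" is a placeholder.
            "Authorization": f"Bearer {os.environ.get('DASHSCOPE_API_KEY', 'sk-demo')}",
            "Content-Type": "application/json",
        },
        method="POST",
    )

req = build_chat_request("Hello")
print(req.full_url)
```

In practice, most developers would instead point the `openai` SDK at the same endpoint by passing it as `base_url`, since the request and response shapes follow the OpenAI chat-completions format.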