- Meituan released LongCat-2.0, a 1.6 trillion-parameter Mixture-of-Experts model.
- The model features a native 1-million-token context window powered by LongCat Sparse Attention.
- It is optimized for training and inference on domestic AI ASIC superpods.
- The model activates 48 billion parameters per token, balancing performance with efficiency.
Meituan Unveils LongCat-2.0: A 1.6T Parameter AI Powerhouse
The new Mixture-of-Experts model promises massive context windows and efficient performance on domestic AI hardware.

Key Takeaways
In a significant development for the global artificial intelligence landscape, Chinese tech giant Meituan has unveiled its latest breakthrough: LongCat-2.0. This state-of-the-art model is a 1.6 trillion-parameter Mixture-of-Experts (MoE) system that marks a shift toward more efficient, high-capacity architectures. By leveraging a sparse activation strategy, the model balances massive intellectual breadth with the computational agility required for real-world deployment.
At its core, LongCat-2.0 is designed to handle complex reasoning tasks that require vast amounts of data processing. With the ability to activate approximately 48 billion parameters per token, the model demonstrates a sophisticated approach to computing, ensuring that resources are directed only where they are needed most. This efficiency is critical for modern AI applications that demand high performance without the prohibitive energy costs typically associated with dense models of this size.
The standout feature of the 2.0 release is its native 1-million-token context window. Achieving such a massive context length has historically been a "holy grail" for AI researchers, as the memory requirements for self-attention mechanisms scale quadratically with sequence length. Meituan has bypassed this bottleneck through the implementation of "LongCat Sparse Attention."
This proprietary attention mechanism allows the model to process long documents, extensive codebases, or complex temporal datasets without losing track of the information provided at the start of the input. By selectively attending to relevant segments of the data rather than attempting to compute the entire sequence at once, the model maintains a high degree of accuracy over extreme distances.
A particularly notable aspect of the LongCat-2.0 launch is the focus on hardware-software co-design. The model was trained and is currently served end-to-end on domestic AI ASIC (Application-Specific Integrated Circuit) superpods. This is a strategic move for Meituan, as it demonstrates that high-performance, large-scale AI is possible using localized hardware ecosystems rather than relying exclusively on imported high-end GPUs.
By optimizing the model specifically for these domestic superpods, Meituan has managed to create a vertically integrated AI stack. This reduces latency, lowers the cost of inference, and provides a roadmap for other organizations looking to build sovereign AI capabilities in an era of global semiconductor supply chain volatility.
According to vendor-reported benchmarks, LongCat-2.0 excels in tasks requiring high recall across massive datasets. Whether it is summarizing an entire legal library, debugging massive software repositories, or conducting deep-dive analysis on long-form financial reports, the model provides consistent results.
However, the AI community remains cautious regarding the independent verification of these claims. While the initial technical documentation provided by the Meituan team is comprehensive, third-party testing is still in its infancy. For enterprise users, the ability to integrate LongCat-2.0 via API is a significant step forward, allowing developers to test the model’s capabilities within their own environments to determine if it meets the stringent requirements of their specific use cases.
The release of LongCat-2.0 underscores a broader industry trend: the transition from dense, monolithic models toward sparse, modular architectures. By utilizing the Mixture-of-Experts approach, developers can scale models to trillions of parameters while keeping the actual computational "burn" per query manageable.
As Meituan continues to refine the LongCat series, the industry will be watching closely to see how the model performs under the "stress test" of commercial deployment. If the native 1-million-token context proves as stable as early reports suggest, it could redefine the standard for document processing and AI-assisted research across the globe.
For now, the model stands as a testament to the rapid innovation occurring in the AI sector—a space where domestic innovation is increasingly competing with established global giants to set the next generation of technological benchmarks.
Enjoying this article?
Get the daily AI briefing sent straight to your inbox.
Frequently Asked Questions
What is the parameter size of LongCat-2.0?
LongCat-2.0 is a 1.6 trillion-parameter Mixture-of-Experts (MoE) model.
What is the context window of LongCat-2.0?
The model features a native 1-million-token context window, enabled by the LongCat Sparse Attention mechanism.
What hardware does LongCat-2.0 run on?
The model is designed for and runs on domestic AI ASIC superpods, optimizing both training and serving efficiency.
Comments
0Related articles

Uber’s European Expansion Stalls as Regulatory Hurdles Mount
Uber’s aggressive European expansion has encountered unexpected turbulence, with the company pausing operations in five of its seven target markets due to regulatory scrutiny.

Navigating Complexity: Lessons from Crisis to the Digital Frontier
From the oil fields of Kuwait to the digital landscape of 2026, we explore how resilience and human ingenuity remain the ultimate tools for solving global challenges.

Navigating AI Volatility: Expert Investment Strategies for a Fast-Paced Market
Investors are facing unprecedented speed in the AI sector; here is how experts are adjusting their portfolios to thrive amidst the chaos.