Our belief in Scaling Laws has not only driven continuous breakthroughs in model parameters and data scale, but has also pushed infrastructure engineering toward its limits. This process inevitably comes with growing pains, which we refer…
model
GLM-5
huggingface.co/zai-org/GLM-5 ↗
442542 downloads2078 likestext-generationtransformers
from the model card
GLM-5 👋 Join our WeChat or Discord community. 📖 Check out the GLM-5 technical blog. 📍 Use GLM-5 API services on Z.ai API Platform. 👉 One click to GLM-5. [Paper] [GitHub] Introduction We are launching GLM-5, targeting complex systems engineering and long-horizon agentic tasks. Scaling is still one of the most important ways to improve the intelligence efficiency of Artificial General Intelligence (AGI). Compared to GLM-4.5, GLM-5 scales from 355B parameters (32B active) to 744B parameters (40B active), and increases pre-training data from 23T to 28.5T tokens. GLM-5 also integrates DeepSeek Sparse Attention (DSA), largely reducing deployment cost while preserving long-context capacity. Reinforcement learning aims to bridge the gap between competence and excellence in pre-trained models. However, deploying it at scale for LLMs is a challenge due to the RL training inefficiency. To this end, we developed slime, a novel asynchronous RL infrastructure that substantially improves training throughput and efficiency, enabling more fine-grained post-training iterations. With advances in both pre-training and post-training, GLM-5 delivers significant improvement compared to GLM-4.7 across a wide range of academic benchmarks and achieves best-in-class performance among all open-source models in the world on reasoning, coding, and agentic tasks, closing the gap with frontier models. Benchma…
discussions
- GLM 5 6 2026-04-27 – 2026-05-04
recent items
Scaling Pain of Coding Agent Serving: Lessons from Debugging GLM-5 at Scale (z.ai via hn) Received a message from Z.AI about occasional garbled outputs and unexpected behavior (www.reddit.com) I received this mail: "Hi developers, Some of you flagged occasional garbled outputs and unexpected behavior when building with the GLM-5 series, especially under heavy workloads. We heard you, reproduced the issues, and the fixes are now…
3 of TIME's top 10 AI companies are Chinese and I only knew one by name (www.reddit.com) I code for a living, close to 7 years now, and I read way too much tech news. TIME dropped their 2026 most influential AI companies list and going through it I see OpenAI, Anthropic, Google, Meta, Amazon, then Zhipu AI sitting right there…