#glm

176 items

Chinese AI companies are shipping faster and cheaper than anyone expected and I'm not sure the west has a good answer for it (www.reddit.com) +528275 11w

Something keeps nagging at me about the Chinese AI space lately. Every few months a new Chinese model drops that closes the gap with US frontier models a little more(not by throwing more compute at it, just genuinely clever engineering at…

↯ Glm glm opus
Major drop in intelligence across most major models. (www.reddit.com) +510319 10w

As of mid Apr 2026, I have noticed every model has had a major intelligence drop. And no I'm not talking about just ChatGPT.

↯ Glm glm grok sonnet+3
2x 512gb ram M3 Ultra mac studios (www.reddit.com) +346106 9w

↯ Glm ↯ DeepSeek 3.2 glm deepseek
Zai replaced the network architecture running GLM-5.1 inference and the gains are pretty wild (www.reddit.com) +29832 4w

Been following the infrastructure side of AI more lately and stumbled on this from Zai. They upgraded the network architecture on a thousand-GPU cluster running GLM-5.1 coding inference from the standard ROFT setup to something they built…

↯ Glm ↯ GLM 5.1 glm
I'm glad we have deepseek (www.reddit.com) +17532 8w

other companies are slowly going away from open weight, not releasing base models, delaying open weight distribution, not releasing top models (this one I think is fair, but still), and I also noticed they stopped publishing research (old…

↯ Glm ↯ Minimax ↯ Qwen 3.5 minimax glm gemma+2
Recent Open models from last 6 Months - Nov 2025 - Apr 2026 (www.reddit.com) +11628 9w

I created this chart with recent open models from last 6 months. Few might be older than that possibly.

↯ Mistral ↯ Glm ↯ DeepSeek 3.2 mistral glm gemma+1
GLM-5.2 is the new leading open weights model on Artificial Analysis (artificialanalysis.ai via hn) +9426 9d

June 17, 2026 GLM-5.2 is the new leading open weights model on the Artificial Analysis Intelligence Index Z ai’s GLM-5.2 is the new leading open weights model on the Artificial Analysis Intelligence Index scoring 51 and it sits on the Pare…

↯ Glm ↯ GLM 5.2 glm
Do you guys think there’s a high chance of Singularity being open source? (www.reddit.com) +7467 10w

GLM 5.1 is dominant in almost every aspect in Design arena, surpassing Opus 4.6 in many tasks. Although user experiences vary dependent on subscription plans for both of those one of them is open source.

↯ Glm ↯ Qwen 3.6 glm gemma qwen+1
(Interactive)OpenCode Racing Game Comparison Qwen3.6 35B vs Qwen3.5 122B vs Qwen3.5 27B vs Qwen3.5 4B vs Gemma 4 31B vs Gemma 4 26B vs Qwen3 Coder Next vs GLM 4.7 Flash (www.reddit.com) +6629 9w

↯ Glm ↯ Qwen 3.6 glm gemma mcp
Minimax M2.5 vs. GLM-5 vs. Kimi k2.5: How do they compare to Codex and Claude for coding? (www.reddit.com) +5742 18w

↯ Glm ↯ Minimax minimax glm codex+1
Guys we have to change the pelican test (www.reddit.com) +4864 10w

So i have been seeing more of those pelican on a bike svg tests and while they work i feel like (and maybe you guys do too) they are getting kinda benchmaxxed so we should switch things up soon and this is my idea generate me a html svg of…

↯ Glm ↯ Minimax ↯ MiniMax 2.7 minimax glm deepseek+3
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents (arxiv.org via hn) +385 7w

We present GLM-5V-Turbo, a step toward native foundation models for multimodal agents. As foundation models are increasingly deployed in real environments, agentic capability depends not only on language reasoning, but also on the ability…

↯ Glm glm agentic
GLM-5.2: The Most Powerful Open Model yet and the Brutal Reality of Running It (vettedconsumer.com via hn) +3716 7d

Every few weeks the "best open model" crown changes hands. This week it's GLM-5.2, from the Chinese lab Z.ai — and unusually, the claim has teeth: it sits at #1 on the independent Artificial Analysis Intelligence Index.

↯ Glm ↯ GLM 5.2 glm
ZAI might stop open-weighting their models? (www.reddit.com) +3448 10w

Ever since the company went public, they’ve been making a lot of changes that clearly seem to be prioritizing profit without regard to their customers. For example, with their coding plans: - They promised/advertised that the Lite coding p…

↯ Glm glm openai anthropic
Running gpt and glm-5.1 side by side. Honestly can’t tell the difference (www.reddit.com) +2418 10w

So I have been running gpt and glm-5.1 side by side lately and tbh the gap is way smaller than what im paying for On SWE-Bench Pro glm-5.1 actually took the top spot globally, beat gpt-5.4 and opus 4.6. overall coding score is like 55 vs g…

↯ Glm ↯ Swe Bench ↯ Opus 4.6 swe-bench glm gpt-5+1
Abliterlitics: Benchmarks and Tensor Comparison for Heretic, Abliterlix, Huiui, HauhauCS for GLM 4.7 Flash (www.reddit.com) +193 8w

This is a follow up to the previous benchmark and tensor analysis of abliteration techniques across the Qwen model family. Same approach, same toolkit, new model family.

↯ Glm ↯ GLM 4.7 mixture-of-experts glm qwen
The pacman benchmark: finally a viable local agentic coding agent with Qwen 3.6 27b (www.reddit.com) +178 5w

One way I like to test new models, is by one-shoting (with a good prompt) a single webpage clone of the classic arcade game pacman. I usually do 3 attempts and keep the best one.

↯ Glm ↯ Qwen 3.6 glm qwen chatgpt+2
Tested how OpenCode Works with SelfHosted LLMS: Qwen 3.5, 3.6, Gemma 4, Nemotron 3, GLM-4.7 Flash - v2 (www.reddit.com) +1732 9w

I have run two tests on each LLM with OpenCode to check their basic readiness and convenience: - Create IndexNow CLI in Golang (Easy Task) and - Create Migration Map for a website following SiteStructure Strategy. (Complex Task) Tested Qwe…

↯ Glm ↯ Qwen 3.6 glm gemma qwen
GLM-5.2: Frontier Intelligence, Open Weights (twitter.com via hn) +145 9d

Introducing GLM-5.2: Frontier Intelligence, Open Weights - Significant improvements in coding and agentic tasks - Strong long-horizon capabilities with a 1M context window - Two levels of reasoning effort: GLM-5.2 (max) pushes the limits,…

↯ Glm ↯ GLM 5.2 glm agentic
Is Qwen3.6 current king for local agentic use? (www.reddit.com) +1122 4w

I've been testing other models but it seems like nothing even come close to Qwen3.6 35B A3B for agentic use. The worse I'd get is a loop sometimes, while Gemma4 produced broken tool calls occasionally and I couldn't even get GLM 4.7 Flash…

↯ Glm ↯ Qwen 3.6 glm moe agentic
Single question llm comparison (www.reddit.com) +101 18w

↯ Glm ↯ Minimax minimax glm grok+6
Kimi K2.6-Code-Preview, Opus 4.7, GLM 5.1, Minimax M2.7 and more tested in coding (www.reddit.com) +91 10w

Hi everyone. It's been a while since I posted (was a lil burned out), but some of you may have seen my older SanityHarness posts.

↯ Glm ↯ Minimax ↯ Opus 4.7 minimax glm opus
I expanded DystopiaBench to 42 models and 6 dystopia types. Claude is still the only one I'd trust with nuclear codes. (www.reddit.com) +86 5w

Since the last post I've added: Huxley module (Brave New World style behavioral conditioning) Baudrillard module (synthetic intimacy, trust collapse, simulation) 30 more models including Grok 4.3, GPT-5.5, Gemini 3.1 Pro, GLM-5.1 Multi-jud…

↯ Glm ↯ Gemini 3.1 glm grok gpt-5+2
GLM 5.1 Locally: 40tps, 2000+ pp/s (www.reddit.com) +78 8w

After some sglang patching and countless experiments, managed to get reap-ed nvfp4 version running stable and FAST on 4 x RTX 6000 Pros (limited to 350W). Very happy with performance and quality.

↯ Glm ↯ GLM 5.1 glm sonnet claude-code
GLM-5.2: Chop off 84% of the volume from a 1.5TB model, still retain 82% power (twitter.com via hn) +61 7d

Introducing GLM-5.2: Frontier Intelligence, Open Weights - Significant improvements in coding and agentic tasks - Strong long-horizon capabilities with a 1M context window - Two levels of reasoning effort: GLM-5.2 (max) pushes the limits,…

↯ Glm ↯ GLM 5.2 glm agentic
GPT 5.5 (Codex) leading the future prediction race (www.reddit.com) +61 5w

Researchers from the Max Planck Institute recently released FutureSim, an environment in which agents are replayed a temporal slice of the web and are tasked with predicting real-world future events. In their environment, GPT 5.5 leads at…

↯ Glm ↯ DeepSeek 4 glm deepseek codex+2
Your local LLM predictions and hopes for May 2026 (www.reddit.com) +620 7w

Which of these do you think we'll get in May? Also, feel free to pick/rank which ones you'd want the most badly: more Gemma4 models (124b?) (other sizes?) more Qwen3.6 models (9b?

↯ Mistral ↯ Glm ↯ Minimax ↯ DeepSeek 4 mistral minimax glm+2
Comparing GPT-5.4, Opus 4.6, GLM-5.1, Kimi K2.5, MiMo V2 Pro and MiniMax M2.7 (www.codejam.info via hn) +62 9w

↯ Glm ↯ Minimax ↯ Opus 4.6 minimax glm gpt-5+1
Local GLM 5.1 - Parkour! (www.reddit.com) +62 10w

Some more 'sloptuber' content for those who are enjoying it :) Model: unsloth glm 5.1 @ IQ2_XXS UD Prompt 1: Task: in a single web page, build a city based parkour game. wsad controls, moving player aligned with current camera direction.

↯ Glm ↯ GLM 5.1 glm
Ollama Cloud Pro ($20/mo) vs OpenAI Plus ($23/mo). Which gives more tokens ? (www.reddit.com) +64 10w

Hey everyone, I'm comparing these two plans side by side for running AI agents daily through OpenClaw (self-hosted AI agent platform): • Ollama Cloud Pro — $20/month • OpenAI Plus — €23/month (~$25) My setup: 3 agents running in parallel (…

↯ Glm ↯ GLM 5.1 glm ollama openclaw+1
GLM-5.2 is the step change for open agents (www.interconnects.ai via hn) +5 3d

↯ Glm ↯ GLM 5.2 glm
do you use different models for different steps in your agent, or just one for everything? (www.reddit.com) +512 4w

Our dev team flagged last week that xAI is retiring grok 4.1 fast. We weren't using it for anything critical but it made me ask something I'd never actually asked: how did we pick the models we're running?

↯ Glm glm grok
DeepSeek's 10T USD grand strategy (twitter.com via hn) +5 4w

Have you ever wondered, how DeepSeek may make money, and lot of it? They didn't come up with competitive coding plans like GLM, MoonShot and MiniMax.

↯ Glm ↯ Minimax minimax glm deepseek
Tips for using Composer 2? New to Cursor (www.reddit.com) +53 6w

Hi. I new to using Cursor - coming from Claude Code, Antigravity and most recently GLM coding plan.

↯ Glm glm cursor claude-code
Anyone tried +- 100B models locally with foreign languages? (www.reddit.com) +56 7w

I am quite curious as I tried Gemma 4 31B, Qwen 3.6 27B, GLM 4.7 30B and some others in my native language (czech). Gemma performs "best" and considering the fact its "just" 18GB model - it actually blows my mind how well it can respond in…

↯ Glm ↯ Qwen 3.6 glm gemma qwen
Scaling Pain of Coding Agent Serving: Lessons from Debugging GLM-5 at Scale (z.ai via hn) +51 8w

Our belief in Scaling Laws has not only driven continuous breakthroughs in model parameters and data scale, but has also pushed infrastructure engineering toward its limits. This process inevitably comes with growing pains, which we refer…

↯ Glm ↯ GLM 5 glm
Used a Claude Code skill to fine-tune Qwen3-1.7B from 327 noisy traces, matches GLM-5 (www.reddit.com) +5 8w

Had 327 production traces from a restaurant-reservation agent I wanted to retrain. The plan was to fine-tune a smaller self-hostable model so I could ditch the frontier-API bill.

↯ Glm glm claude-code
GLM-5.2 vs. Claude Opus: Same Code, Less Than Half the Cost (entelligence.ai via hn) +4 1d

GLM-5.2 vs Claude Opus: Same Code, Less Than Half the Cost We ran GLM-5.2 head to head with Claude Opus the way an agent actually runs: inside a real coding agent, in a real shell, graded by hidden tests. The harness is Claude Code on term…

↯ Glm ↯ GLM 5.2 glm opus claude-code
Z.ai GLM 5.2 (huggingface.co via hn) +41 9d

GLM-5.2 👋 Join our WeChat or Discord community. 📖 Check out the GLM-5.2 blog and GLM-5 Technical report.

↯ Glm ↯ GLM 5.2 glm
Ask HN: Which cheap Chinese LLM are you using? (news.ycombinator.com) +4 12d

In the last one or two months, starting from DeepSeek V4 Pro, there are quite many low-price Chinese models coming out. Their performance looks more or less similar to me: Mimo V2.5 Pro, MiniMax M3, and the just released GLM 5.2, etc.

↯ Glm ↯ Minimax ↯ DeepSeek 4 minimax glm deepseek
I built a local GUI for the TradingAgents framework — works with Ollama (www.reddit.com) +4 4w

https://preview.redd.it/i90oxxk7n03h1.png?width=1898&format=png&auto=webp&s=7d219c804fda7dfe122b84fcdb6d0d6883818c68 A while back I came across TradingAgents — a really cool multi-agent LLM stock analysis framework where like a dozen "agen…

↯ Glm ↯ Minimax minimax glm ollama+4
Best AI coding plan alternative to Claude and ChatGPT (news.ycombinator.com) +43 6w

With the lowering usage limit in Claude, I am thinking of jumping ship to Chinese AI, since the benchmark is already very near compared to Sonnet or Haiku 4.5 , but for a fraction of the price. I am not worried about where is my data endin…

↯ Glm ↯ Minimax ↯ Haiku 4.5 minimax glm haiku+2
Update to the LLM Debate Benchmark: GPT-5.5, Grok 4.3, DeepSeek V4 Pro, GLM-5.1, Kimi K2.6, Qwen 3.6 Max Preview, Xiaomi MiMo V2.5 Pro, Tencent Hy3 Preview, and Mistral Medium 3.5 High Reasoning added (www.reddit.com) +4 7w

The benchmark uses adversarial, multi-turn debates across 683 curated motions. Each model pair debates the same motion twice with sides swapped.

↯ Mistral ↯ Glm ↯ DeepSeek 4 mistral glm grok+4
Current state of open-source ? (www.reddit.com) +416 9w

I’m trying to understand the current open-source LLM landscape beyond surface-level hype. We all got used to the nerfed products of Claude/Geminj so I believe really in opensource as a solution.

↯ Mistral ↯ Glm ↯ Minimax mistral minimax glm+2
llama.cpp / ik_llama MoE Expert Offloading - Main Memory Bandwidth vs. PCIe Bandwidth (www.reddit.com) +418 9w

↯ Glm ↯ GLM 5.1 glm moe llama
GLM-5.2 Is the New Best Open Model (thezvi.wordpress.com via hn) +3 2d

GLM-5.2 arrived last week. It boasts excellent benchmarks and looks strong.

↯ Glm ↯ GLM 5.2 glm
GLM-5.2 Beat Fable 5 at Website Design (twitter.com via hn) +3 6d

https://t.co/JSn0lDCNkB Design Arena@DesignarenaArticleHow GLM-5.2 Beat Fable 5 at Website DesignGLM 5.2 ranks 1st overall on Design Arena’s single-turn, HTML Web Design (Non-Agentic) evaluation, 5 places higher than its predecessor GLM-5.…

↯ Glm ↯ GLM 5.2 glm agentic
GPT-5.5 hallucinates 3x more than MIT-licensed GLM-5.2 (arrowtsx.dev via hn) +3 6d

Bigger models are not the way Jun 18, 2026 A shift is happening among major AI labs, who are becoming increasingly skeptical of endless parameter count and training data scaling. The limits of this paradigm were put on the world’s stage wh…

↯ Glm ↯ GPT 5.5 glm gpt-5
I ran GLM-5.1 on a 16GB RAM machine (github.com via hn) +3 4w

🧠 MoE-on-a-Potato Running a 754-Billion Parameter LLM on a 16GB RAM Consumer PC "Saying it's impossible is not engineering. Saying we don't know how yet is science." MoE-on-a-Potato is an experimental project dedicated to testing the extre…

↯ Glm ↯ GLM 5.1 glm moe
Open weights GLM and Mimo are better than Gemini 3.5 flash according to arena (www.reddit.com) +33 5w

While we are weathering the gemini 3.5 flash hype, keep in mind that according to arena, GLM and Mimo are better. https://arena.ai/leaderboard/text/coding-no-style-control #7 GLM #9 Mimo #12 Gemini 3.5 Flash

↯ Glm ↯ Gemini 3.5 glm gemini
cdesktop — open-source Claude Code Desktop alternative, runs locally via npx, supports any provider (www.reddit.com) +35 5w

I built cdesktop with Claude Code — it's an open-source alternative to Anthropic's Claude Code Desktop, running locally on your machine via npx cdesktop. Free, Apache 2.0.

↯ Glm glm deepseek gemini+3
Open source battle: GLM vs Kimi vs MiMo vs DeepSeek (www.youtube.com via reddit) +31 6w

About Press Copyright Contact us Creators Advertise Developers Terms Privacy Policy & Safety How YouTube works Test new features NFL Sunday Ticket © 2026 Google LLC

↯ Glm glm deepseek
Show HN: Grunden – Frontier AI inference hosted in Sweden, OpenAI-compatible (grunden.ai via hn) +31 6w

grunden.ai är en svensk AI-tjänst för utvecklare, myndigheter och helt vanliga människor. GLM 5.1 (open-weight) med EU-jurisdiktion, ett OpenAI-kompatibelt API och prissättning i kronor.

↯ Glm ↯ GLM 5.1 glm openai
Ran K2.6 through a third-party coding benchmark: heres how the figures stand up (www.reddit.com) +31 7w

I have been following the akitaonrails coding benchmark which tests against a fixed rails + Rubyllm + docker task rather than vendor-reported evals. April 2026 update put K2.6 at 87 sitting in tier A (80+), ahead of Qwen 3.6 plus (71), Dee…

↯ Glm ↯ DeepSeek 4 glm deepseek qwen+1
Just got a beast. (www.reddit.com) +316 8w

1.5 tb ram with 128gb vram and a 28 core processor. Mac Pro 2019.

↯ Glm glm
Capacity vs Speed trade-off: 1.1TB Mac Unified Memory vs. RTX 6000 Pros (www.reddit.com) +38 9w

I'm usually a Windows person, but I’m currently running a Mac cluster for local LLM orchestration. My setup consists of four 256GB Mac Studios plus one 96GB Mac Studio, giving me about 1.1TB of unified memory.

↯ Glm ↯ GLM 5.1 ↯ GLM 5.1 ↯ GLM 5.1 glm
What's the best GPU cluster/configuration 30k $ can buy? (www.reddit.com) +344 9w

Edit: I’m getting the consensus is that the budget I suggested is not enough for my lil ambitious project. I’d like to reshape the question for the upcoming comments: what’s the minimal budget to achieve my goal?

↯ Glm ↯ GLM 5.1 glm
do GLM-4.7 Flash Q4_K_M have problem with claude or agent? (www.reddit.com) +36 10w

I'm brand new to local LLMs and started with GLM-4.7 Flash q4_K_M. When I run it directly: ollama run glm-4.7-flash:q4_K_M it works pretty decently — nothing amazing, but usable and responsive.

↯ Glm glm ollama
I got better results when I made each AI tool do one job (www.reddit.com) +32 10w

I spent too much time trying to find one AI dev tool that could do everything. Planning, coding, fixing, reviewing, maybe filing my taxes too It never really worked.

↯ Glm ↯ Minimax ↯ GLM 5.1 minimax glm sonnet+4
What's the current best code autocomplete LLM for local deployment (as of April 2026)? (www.reddit.com) +34 10w

I know this question has already been asked a thousand times, probably, but... what's the best or close-to-best model I can use with Continue for local IDE-like code autocomplete?

↯ Glm ↯ Qwen 2.5 glm
GLM-5.2 (Max) API Provider Benchmarking and Analysis (artificialanalysis.ai via hn) +2 16h

Analysis of API providers for GLM-5.2 (max) across performance metrics including latency (time to first token), output speed (output tokens per second), price and others. API providers benchmarked include Together AI, FriendliAI, Fireworks…

↯ Glm ↯ GLM 5.2 glm
GLM-5.2, not Mythos, is the real security emergency (joshuasaxe181906.substack.com via hn) +2 22h

Until last week, attackers faced a dilemma in using frontier models: even if they could manage the cat-and-mouse game of setting up fake accounts to retain API access to frontier model providers, and even if they could induce models to hel…

↯ Anthropic Mythos ↯ Glm ↯ GLM 5.2 glm mythos
An Open-Source IDE Built for GLM 5.2 (stagewise.io via hn) +2 1d

An Open-Source IDE Built for GLM GLM 5.2 is compelling for a simple reason: it gets unusually close to Claude Opus 4.8 in quality while sitting at a much lower cost point. stagewise gives it a local IDE built for long-running agent work.

↯ Opus 4.8 ↯ Glm ↯ GLM 5.2 glm opus
Running GLM-5.2 on a 64GB Mac, barely (andreaborio.substack.com via hn) +2 2d

I tried to run GLM-5.2 on a 64GB Mac Field notes from an experimental ds4 fork, a 244GB GGUF, and the small horror of sparse models that are sparse in compute but not very friendly to filesystems. I have a weakness for local LLM experiment…

↯ Glm ↯ GLM 5.2 glm
We built the fastest API for GLM-5.2 (twitter.com via hn) +2 3d

https://t.co/LlNSUlxWLa Philip Kiely@philipkielyArticleHow we built the world’s fastest API for GLM-5.2 GLM-5.2 is the biggest news in open models since DeepSeek-R1. It’s easy to see why.

↯ Glm ↯ GLM 5.2 glm deepseek
Openresearch: GLM 5.2 for Autoresearch (openresearch.sh via hn) +2 3d

could not extract summary

↯ Glm ↯ GLM 5.2 glm
GLM 5.2 vs. Claude Opus 4.5 (gopeekapp.blogspot.com via hn) +21 3d

Claude Opus 4.5 vs GLM-5.2 Claude Opus 4.5 vs GLM-5.2 Claude Opus 4.5 (2025) and GLM-5.2 (2026) are frontier-tier reasoning models from Anthropic and Zhipu AI. Claude Opus 4.5 ships a 200k-token context window, while GLM-5.2 ships a 1M-tok…

↯ Glm ↯ GLM 5.2 ↯ Opus 4.5 glm opus anthropic
Zero Weights Graph Language Engine (MSE-GLM) (aircityshops.com via hn) +2 3d

1. Introduction Most language models are built around the same idea: train a neural network on enormous amounts of text, let it adjust billions of floating-point weights until it learns to predict the next word reasonably well, and then sa…

↯ Glm glm
Genuinely impressed, almost shocked, at how good GLM-5.2 (twitter.com via hn) +2 4d

Genuinely impressed, almost shocked, at how good GLM-5.2 by @zai_org is at coding. This changes things.

↯ Glm ↯ GLM 5.2 glm
Show HN: AdvertBench, ranking the ability of LLMs to create image ads (advertbench.com via hn) +2 5d

Experiment that I've made. The models get access to an E2B sandbox and are instructed to create an ad according to the specifications (they can choose whatever tools they want to use for it, e.g.

↯ Opus 4.8 ↯ Glm glm opus
I evaluated GLM 5.2 against the frontier on tasks from real repos (www.stet.sh via hn) +22 5d

GLM 5.2 vs Composer 2.5 and the premium field on 50 real merged PRs from graphql-go-tools (Go) and sqlparser-rs (Rust). GLM lands last on craft and equivalence in both repos, costs about twice Composer, and writes more code than the human…

↯ Glm ↯ GLM 5.2 glm
GLM 5.2 ranks #2 in Code Arena: Frontend (twitter.com via hn) +21 9d

Exciting news: GLM-5.2 (Max) ranks #2 in Code Arena: Frontend, with +29pt over Claude Opus 4.7 (Thinking) and only behind Fable 5! GLM-5.2 is the best open model vs Kimi-K2.6 and Minimax-M3 by a large margin.

↯ Glm ↯ Minimax ↯ Opus 4.7 ↯ Opus 4.7 ↯ Opus 4.7 minimax glm opus
GLM 5.2 Performance Benchmarks (artificialanalysis.ai via hn) +2 9d

GLM-5.2 (max) Intelligence, Performance & Price Analysis Model summary IntelligenceUpdated Speed Price Cache Hit Price Verbosity GLM-5.2 (max) is amongst the leading models in intelligence, but particularly expensive when comparing to othe…

↯ Glm ↯ GLM 5.2 glm
GLM-5.2 is now available with 1M-context support (twitter.com via hn) +2 13d

Intelligence should be open, accessible, and ready to build with, empowering every developer, everywhere. GLM-5.2 is now available to all GLM Coding Plan users, including Lite, Pro, Max, and Team plans.

↯ Glm ↯ GLM 5.2 ↯ GLM 5.2 ↯ GLM 5.2 ↯ GLM 5.2 ↯ GLM 5.2 glm
Show HN: Free open source coding models in Slack (www.runcord.com via hn) +2 4w

Hey HN, We believe we have the easiest onboarding from signup to being able to spin up coding agents in slack like Stripe, Ramp & Coinbase. Demo of the onboarding: https://www.tella.tv/video/connecting-cord-to-slack-1-19ep Every signup get…

↯ Glm ↯ Minimax ↯ DeepSeek 4 ↯ DeepSeek 4 minimax glm gemma+5
Show HN: Chuddy, self-hosted media downloading, translation and OCR Telegram bot (github.com via hn) +2 5w

My latest project, about 60% of the codebase was written with Z.ai's GLM-5.1 model. It's basically a Telegram bot that allows for embedding/downloading media easier within group chats.

↯ Glm ↯ GLM 5.1 glm
What’s going on with GLM? Are they scamming or what? (www.reddit.com) +21 6w

I have a GLM subscription that’s marketed as offering 3× higher usage than Claude Pro. I primarily use it through Claude Code CLI as a backup coding model.

↯ Glm glm claude-code
Chinese AI Coding Plan (www.reddit.com) +25 6w

With the lowering usage limit in Claude, I am thinking of jumping ship to Chinese AI, since the benchmark is already very near compared to Sonnet or Haiku 4.5 , but for a fraction of the price. I am not worried about where is my data endin…

↯ Glm ↯ Minimax ↯ Haiku 4.5 minimax glm haiku+1
tested four newest open source Kimi K2.6 is the fastest, GLM 5.1 the fanciest, DeepSeek V4 is the most comprehensive, and Xiaomi MiMo is the slowest (www.reddit.com) +21 7w

Architecture explains the gap: MiMo's MoE runs more active params per token than Kimi K2.6's optimized routing hence slowest. DeepSeek V4's 'comprehensive' edge is partly MLA: ~75% KV-cache compression makes it far better for long agentic…

↯ Glm ↯ DeepSeek 4 glm moe deepseek+1
Why is no open weight model inference provider hosting Mimo-v2.5 or Mimo-v2.5-pro? (www.reddit.com) +23 7w

Literally no 3rd party api inference provider is hosting the mimo-2.5 series models from Xiaomi. They seem to be reallly good.

↯ Glm ↯ DeepSeek 4 glm deepseek
Who else thinks AI is reaching a plateau (www.reddit.com) +213 7w

I must say that I almost feel no difference in all of the latest models that are coming out. Opus 4.7 is almost equal to 4.6 and 4.5, same about the other GPT models, the Kimi K models and the GLM models they all I feel they’re almost all…

↯ Anthropic Mythos ↯ Glm ↯ Opus 4.7 glm mythos opus+1
Ask HN: Are there any good open-source chat apps? (news.ycombinator.com) +2 8w

Hi HN family! I've recently been messing around with open models through ollama (glm-5.1 and kimi-k2.6), and I've been impressed with just how close they are to Claude Sonnet for my needs, especially programming.

↯ Glm ↯ GLM 5.1 ↯ GLM 5.1 glm ollama sonnet
3 of TIME's top 10 AI companies are Chinese and I only knew one by name (www.reddit.com) +24 8w

I code for a living, close to 7 years now, and I read way too much tech news. TIME dropped their 2026 most influential AI companies list and going through it I see OpenAI, Anthropic, Google, Meta, Amazon, then Zhipu AI sitting right there…

↯ Glm ↯ GLM 5 glm gemini openai+1
Open Source Company Coding Plans (www.reddit.com) +23 8w

I’ve been looking to buy a coding plan from one of the major open source contributors to give my meager support to them and transition away from Claude. I would love to hear some feedback from the community of their experience with some of…

↯ Glm glm qwen
I'm Not a Dev But I Use Qwen 3.6 35b to Code (www.reddit.com) +247 8w

Full disclosure: I used to program a bit, but I was garbage at it so I found a new career. This was eons ago so I'm not a dev, obviously.

↯ Glm ↯ Qwen 3.6 glm qwen
Cursor 3 eating GLM 5.1 usage (www.reddit.com) +21 10w

Hello all just as it sounds. I recently started using GLM 5.1 in cursor 3 but unlike in the past, GLM 5.1 ran through my entire daily budget from summarizing chat context and running commands.

↯ Glm ↯ Cursor 3 glm cursor
Show HN: SQL MCP Server – 61.37% on DataAgentBench with GLM-5.2 (github.com via hn) +1 2d

We just posted results to the DataAgentBench leaderboard scoring 61.37% with GLM 5.2. Please check it out and do share your feedback

↯ Glm ↯ GLM 5.2 glm mcp
Show HN: Subconscious and GLM-5.2 Makes "/compact" Obsolete (www.subconscious.dev via hn) +1 3d

GLM-5.2 is a turning point for coding agents. It's the first model a business would actually pay to replace Claude Opus with.

↯ Glm ↯ GLM 5.2 glm opus agentic
GLM-5.2: Another open-source Chinese AI model has Silicon Valley's attention (www.businessinsider.com via hn) +1 3d

A new AI model from China is generating the kind of buzz not seen since DeepSeek's R1 announced China as a serious threat to American chatbot hegemony over a year ago. Silicon Valley's online echo chamber has been alight with intrigue in r…

↯ Glm ↯ GLM 5.2 glm deepseek
Show HN: Cc-fleet – run other LLMs as Claude Code workers, your sub drives (github.com via hn) +1 5d

🚢 cc-fleet 🤖 Plug any third-party model into Claude Code's ⚙️ Dynamic Workflows, 👥 Agent Teams, and ⚡ Subagents — from DeepSeek · GLM · Kimi · Qwen … to your Codex subscription, with your main session's auth untouched; no Claude subscripti…

↯ Glm glm deepseek qwen+2
MiniMax M3 vs. GLM 5.2: Codegen comparison across autonomous coding tasks (thinkwright.ai via hn) +11 6d

Thinkbench, our custom evaluation harness, was used to drive both models through the same autonomous coding loop: read files, write files, run shell commands, and stop when the task was complete. The scored suite covered greenfield builds,…

↯ Glm ↯ GLM 5.2 ↯ Minimax autonomous-coding minimax glm
GLM-5.2 – How to Run Locally (unsloth.ai via hn) +1 6d

For the complete documentation index, see llms.txt. This page is also available as Markdown.

↯ Glm ↯ GLM 5.2 glm
Running GLM-5.2 5x faster at 500tps with limitation (abhishek.it via hn) +1 7d

Running GLM-5.2 5× faster than vLLM, on a runtime that doesn't support it I rented an 8×B200 and tried to run GLM-5.2 on TileRT, the runtime MiMo used to push a 1T model past 1000 tok/s. TileRT doesn't support GLM-5.2, so I reverse-enginee…

↯ Glm ↯ GLM 5.2 glm vllm
GLM-5.2: Benchmarks, Architecture and How to Run It (www.techaffiliate.in via hn) +1 7d

GLM-5.2 Review (2026): Benchmarks, Free Access & How to Use It Aditya Kachhawa If you've been keeping up with AI news lately, you've probably noticed a new name showing up everywhere GLM-5.2. And there's a good reason for that.

↯ Glm ↯ GLM 5.2 glm
GLM 5.2 is now available via a unified Model API (www.hpc-ai.com via hn) +11 8d

Model APIs Instant Access for Frontier Open-Source AI Models Build AI Apps and Agents with High-Performance Model APIs — No Deployment Required. Start Free TrialEverything You Need to Run AI Models Z.ai: GLM 5.1 4 supported capabilities fo…

↯ Glm ↯ GLM 5.2 glm
An open-source AI just beat OpenAI's GPT-5.5 at coding (1/6th the price) (docs.z.ai via hn) +1 8d

Overview GLM-5.2 is a flagship model built for the era of long-horizon tasks. With truly usable 1M-token context, it has been tested to handle project-scale engineering context, delivering more stable long-task execution, more reliable adh…

↯ Glm ↯ GPT 5.5 glm gpt-5 openai
GLM 5.2 playing text adventures (entropicthoughts.com via hn) +1 8d

GLM 5.2 playing text adventures I’ve heard some buzz around the new glm 5.2 open-weights model. They say it’s very capable!

↯ Glm ↯ GLM 5.2 glm
Model Card: unsloth/GLM-5.2-GGUF (huggingface.co via hn) +1 8d

GLM-5.2 👋 Join our WeChat or Discord community. 📖 Check out the GLM-5.2 blog and GLM-5 Technical report.

↯ Glm ↯ GLM 5.2 glm
GLM-5.2 Beats Fable 5 on Reasoning – 24 Hours After the U.S. Export Ban (explainx.ai via hn) +1 9d

GLM-5.2 by Zhipu AI tops BridgeBench reasoning 24 hours after the U.S. banned Fable 5.

↯ Glm ↯ GLM 5.2 glm
Show HN: LimitPing – Keep Claude Code and Codex rate-limit windows continuous (github.com via hn) +1 2w

CCLimitPing (limitping) English | 中文 Keep your Claude Code, Codex, and GLM (Zhipu / Z.ai Coding Plan) rate-limit windows back-to-back. These providers bill on a 5-hour rolling window (plus a weekly cap), and the 5h window starts on your fi…

↯ Glm glm codex claude-code
Noob here, curious about roughly how advanced of a video game a model like Qwen3.6 27b could create, if kept fully offline, and got unlimited attempts/revisions (maybe ~1 month project time limit). Like, could it make something equivalent to Pokemon Red? Doom? Doom II? What if using GLM 5.1? (www.reddit.com) +125 4w

So, I got interested in local LLMs a few months ago, but, I don't have a background in coding, and I don't know how to code, and I am not good with computers or anything. So far I mainly just was having fun with comparing different local L…

↯ Glm ↯ Qwen 3.6 glm
Is Composer 2.5 better than Glm 5.1 and DeepSeek v4 pro in real world tasks? (www.reddit.com) +1 4w

I am new to Cursor and still testing the free version. Benchmark for Composer 2.5 indicates it is better than DeepSeek v4 and Glm 5.1.

↯ Glm ↯ DeepSeek 4 glm deepseek codex+1
When configuring a third-party AI large model on the MacBook Claude Code desktop client, an error message appears. How can this be resolved? (www.reddit.com) +12 5w

This is my GLM-4.6 model API configuration, and this error is really confusing me. I'm not sure which step went wrong.

↯ Glm glm claude-code
Reliable Open Source LLM as a Service (www.reddit.com) +12 6w

Has anyone figured out a provider whose open source models (Kimi, Qwen, GLM e.t.c) can be used reliably in production. I have tested some well known providers and they all suffer from high latency and poor uptime rendering them mostly usel…

↯ Glm glm qwen gemini+1
Multi-LLM AI trading agent harness (github.com via hn) +1 6w

1rok 1rok is a standalone harness for running portfolio-construction agents across OpenAI, Anthropic, Gemini, xAI, DeepSeek, GLM, and OpenRouter against the same financial tool surface. Agents query Alpaca, Yahoo Finance, FRED, and Tavily…

↯ Glm glm deepseek gemini+2
Vertex MaaS GLM-5 prompt cache telemetry seems inconsistent. Anyone else seeing this? (www.reddit.com) +11 6w

I'm testing prompt-cache behavior for GLM models on Vertex AI MaaS and I'm seeing inconsistent telemetry. I reproduced it with a synthetic long prompt and repeated identical requests.

↯ Glm glm openai
Which Chinese Model is best for planning and which is best for implementation? I'm currently using Opencode with an Openrouter API Key, mostly wanna decide between Kimi, GLM, DeepSeek, Qwen, Minimax and Mimo (www.reddit.com) +11 6w

Original plan was to use Kimi/GLM for planning and DeepSeek for implementation, but seeing a lot of love for MiMo and Minimax lately. Anyone running a planner + coder split on Opencode?

↯ Glm ↯ Minimax minimax glm deepseek+1
Which model has less restrictions now? (www.reddit.com) +12 7w

GPT and Opus block on certain requests. This didnt use to be the case 2 months ago and I made signficant progress with Opus and then one day I had a 2 week break and then a single prompt to continue the work resulted in refusal.

↯ Glm glm qwen opus
Group Buys for Shared Compute or Model Hosting? Is this a thing? (www.reddit.com) +1 7w

I've been using GLM 5.1 a lot lately, and I love this model. However I don't love sending all my requests to China.

↯ Glm ↯ GLM 5.1 glm gemini
I plan to use a chinese AI model through API for coding through a harness, I'm a uni student so nothing prod related for now. should i go deepseek, minimax, kimi or glm? kinda confused (www.reddit.com) +11 7w

Just cancelled my claude subscription due to poor rate limits, gemini cli doesn't really excel in coding from my personal experience, and my local hardware isn't that powerful to run local AI models, and while codex is good, I wanna try so…

↯ Glm ↯ Minimax minimax glm deepseek+2
PP speed on dual RTX 6000 12c EPYC setup (www.reddit.com) +16 7w

I want to run big models like GLM 5.1 or Kimi k2.6. I can buy Mac Studio M3 Ultra with 512gb ram, but PP speed would be ofc bad.

↯ Glm ↯ GLM 5.1 glm
Local LLM Benchmark about Backend Generation by Function Calling (GLM vs Qwen vs DeepSeek) (www.reddit.com) +1 7w

Detailed Article: https://autobe.dev/articles/local-llm-benchmark-about-backend-generation.html Five months ago I posted the "Hardcore function calling benchmark in backend coding agent" thread here. As I wrote in that post, it was an unco…

↯ Glm ↯ Function Calling ↯ Sonnet 4.6 function-calling glm gpt-5+3
Built a self-hosted agent for small businesses that writes its own skills. ~$0.15 per customer booking on GLM-5.1 (www.reddit.com) +14 7w

Been working on this for a while and finally at a point where it's running in production for a couple of small businesses, so figured I'd share. The thing that kept bugging me about "AI employee" products is that none of them are something…

↯ Glm ↯ GLM 5.1 ↯ GLM 5.1 glm
Received a message from Z.AI about occasional garbled outputs and unexpected behavior (www.reddit.com) +12 8w

I received this mail: "Hi developers, Some of you flagged occasional garbled outputs and unexpected behavior when building with the GLM-5 series, especially under heavy workloads. We heard you, reproduced the issues, and the fixes are now…

↯ Glm ↯ GLM 5 glm
Comparing SVG Generation for the top open models (codeinput.com via reddit) +1 8w

Some of the larger models (like Llama) weren't available on OpenRouter, so I had to work with what was there. Best small model: Gemma 4 26B For its size, I think it had the best output.

↯ Glm ↯ Minimax ↯ DeepSeek 4 minimax glm gemma+2
Best value in the 20$ range coding agents? I want the best quality and high-usage-limit I can get at that price. (www.reddit.com) +11 8w

I'm a compsci student and I've been using the 10$ copilot plan for about 2 years now, and it was fine for me since I did a good model distribution taking into account the complexity of the task, I was able to get through the month always u…

↯ Copilot ↯ Glm ↯ Windsurf ↯ Qwen 3.5 windsurf glm copilot+3
anyone actually tried deepseek v4 pro for coding? (www.reddit.com) +12 8w

so v4 pro dropped and barely anyone is talking about it. feels weird since when kimi k2.6 came out i seen post about it everywhere anyone here tried v4 pro for actual code work?

↯ Glm ↯ DeepSeek 4 glm deepseek
Qwen 3.5 397b and GLM 5.1 Opus fine tune (www.reddit.com) +12 9w

Hi all. Many models on hugging face have been fine tuned with that 3000x opus dataset, but the two I mentioned in the title are missing it.

↯ Glm ↯ Qwen 3.5 glm qwen opus
Best app to use Nvidia Nim? (www.reddit.com) +1 9w

↯ Glm ↯ GLM 5.1 glm
Show HN: RepoGauge – save token costs and compare agents on your own repos (repogauge.org via hn) +1 9w

I've grown increasingly skeptical that public coding benchmarks tell me much about which model is actually worth paying for and worried that as demand continues to spike model providers will silently drop performance. I did a few manual an…

↯ Glm glm sonnet opus
Minimax vs Qwen vs Kimi vs Mimo(Omni) vs Glm ( via reddit) +1 10w

could not extract summary

↯ Glm ↯ Minimax minimax glm qwen
Upgrade paths for my 256g ddr4 ram + 4x24g vram system (www.reddit.com) +110 10w

So I was just about to give up playing with local models, until I realised I can actually run GLM 5.1 at not too horrible speeds, using this quant https://huggingface.co/ubergarm/GLM-5.1-GGUF/tree/main/IQ2_KL in ik llama. Getting around 6.…

↯ Glm ↯ MiniMax 2.7 glm llama
Which AI model is best for real data analysis? [benchmark] (www.reddit.com) +1 10w

I created and run a benchmark for AI models in data analysis tasks. In contrary to other benchmarks, it is not one-prompt benchmark, but I tried to simulate the real work of data analyst.

↯ Glm ↯ Qwen 3.5 glm ollama gpt-5
Model API Performance (news.ycombinator.com) +1 10w

We’ve been benchmarking a few models on our API platform and got some interesting performance numbers: - MiniMax M2.5 → 0.118s time-to-first-token, 103 tokens/sec - GLM 5.1 → 120 tokens/sec throughput - Kimi K2.5 → 0.643s TTFT, 69 tokens/s…

↯ Glm ↯ Minimax ↯ GLM 5.1 minimax glm
What Am I Doing Wrong? Models Won't Listen, At All (GLM 5.1, MiniMax M2.7, Kimi K2.5) (www.reddit.com) +114 10w

What am I doing wrong here? I can't get models to follow my instructions, pretty much at all.

↯ Glm ↯ Minimax ↯ GLM 5.1 minimax glm ollama
GLM 5.2 is unbelievably dumb (www.reddit.com via reddit) 18h

Yeah... you heard it right.

↯ Glm ↯ GLM 5.2 glm codex opus
Claude Max vs Codex Pro or both combined? (www.reddit.com via reddit) 20h

I’m considering one heavier subscription (~€100/month) and want to know which provides better value for agentic coding. I tested GPT Pro and was satisfied with Codex.

↯ Glm ↯ GLM 5.2 glm ollama codex+2
GLM 5.2 on consumer hardware (www.reddit.com via reddit) 21h

I tried out the unsloth quants of GLM 5.2 on still "consumer-ish" hardware: 32C Zen5 Threadripper Pro 9975 WX, Asus WRX90E-SAGE-SE PCIe Gen5, 512GB DDR5 ECC RAM @ 4800MHz, dual RTX 5090. This machine was put together pre-RAMpocalypse, and…

↯ Glm ↯ GLM 5.2 glm llama
Fable 5 vanished in 96 hours and four days later an MIT model took its arena crown (www.reddit.com via reddit) 1d

I have been thinking about the Fable 5 to GLM-5.2 sequence as one event rather than two. June 9, Anthropic ships Fable 5, the Mythos line opens to the public for the first time, SWE-bench Verified at 95 percent, people calling it the best…

↯ Opus 4.8 ↯ Anthropic Mythos ↯ Glm ↯ GLM 5.2 ↯ GPT 5.5 ↯ Swe Bench swe-bench glm gpt-5+3
GLM-5.2 matched Claude Opus on 45 terminal-bench coding-agent tasks at less than half the cost (full methodology + failure transcripts inside) (www.reddit.com via reddit) 1d

We wanted to know whether an open-weights model can actually do frontier coding-agent work, so we ran GLM-5.2 head-to-head with Claude Opus the way an agent actually runs not on a static eval, but inside a real coding agent (Claude Code) o…

↯ Glm ↯ GLM 5.2 glm opus claude-code
My experience spending $16,000 on Anthropic in 1 year (www.reddit.comhttps) 2d

Over the last year I have spent $16,000 on Anthropic via the OpenRouter API (and another $1k on other AI models). I started out using the Claude VS Code extension.

↯ Glm glm openclaw sonnet+3
Why Cursor don't have GLM models? (www.reddit.comhttps) 3d

GLM-5.2 is currently ranked #2 on the Arena leaderboard, but since Claude Fable 5 isn’t actively being sampled right now, GLM-5.2 is practically the #1 available model for coding. Despite its top-tier performance, Cursor has never natively…

↯ Glm ↯ GLM 5.2 glm cursor
GLM 5.2 vs Opus 4.8 on 50 real Go and Rust PRs from open source repos: last on quality, and not the cheapest (www.reddit.com via reddit) 5d

TL;DR There's been a lot of hype around GLM 5.2 being a cheap "frontier killer": good enough to replace Opus 4.8 / GPT 5.5 for most coding work, just by swapping it in. On these 50 tasks it finished last on quality in both repos – and it's…

↯ Opus 4.8 ↯ Glm glm opus
GLM 5.2 and MiniMax M3 are a lot closer/better to Sonnet 4.6 than I expected on coding-agent workloads (www.reddit.comhttps) 6d

We benchmarked GLM 5.2, MiniMax M3, Kimi K2.7-code, Qwen 3.7-Plus and Sonnet 4.6 across nearly 1,000 coding-agent scenarios. The scenarios were run twice.

↯ Sonnet 4.6 ↯ Glm ↯ Minimax minimax glm sonnet+1
When will GLM-5.2 be available natively in Cursor? (www.reddit.com via reddit) 6d

GLM-5.2 was recently released and looks promising, especially for coding and long-running agent tasks. I know it may be possible to use it through BYOK, but does anyone know when it will be added as a built-in model in Cursor IDE and Curso…

↯ Glm ↯ GLM 5.2 glm cursor
[AINews] GLM > GPT? GLM-5.2 passes vibe check; Z.ai forecasts Open Fable by December (www.latent.space) 7d

[AINews] GLM > GPT? GLM-5.2 passes vibe check; Z.ai forecasts Open Fable by December With GLM-5.2 passing everyone's vibe check, the open models story finally becomes a real frontier story.

↯ Glm ↯ GLM 5.2 glm
GLM-5.2 is probably the most powerful text-only open weights LLM (simonwillison.net) 8d

GLM-5.2 is probably the most powerful text-only open weights LLM 17th June 2026 Chinese AI lab Z.ai released GLM-5.2 to their coding plan subscribers on June 13th, and then yesterday (June 16th) released the full open weights under an MIT…

↯ Glm ↯ GLM 5.2 glm
GLM 5.2 via Claude Code is the first non-Claude model that feels close to Opus (www.reddit.com via reddit) 8d

I’ve been using GLM 5.2 with Claude Code through its Anthropic-compatible API endpoint. I’ve tested it on various projects, including but not limited to database development, backend payment API work, backend and frontend debugging, Larave…

↯ DeepSeek 4 ↯ Glm ↯ DeepSeek 4 glm deepseek opus+2
GLM-5.2: Built for Long-Horizon Tasks (huggingface.co) 9d

GLM-5.2: Built for Long-Horizon Tasks - Solid 1M Context: A solid 1M-token context that stably sustains long-horizon work - Advanced Coding with Flexible Effort: Stronger coding capabilities with multiple thinking effort levels to balance…

↯ Glm ↯ GLM 5.2 glm
[AINews] GLM-5.2: the top Frontend Coding model in the world, IndexShare for Speculative Decoding (www.latent.space) 9d

[AINews] GLM-5.2: the top Frontend Coding model in the world, IndexShare for Speculative Decoding We have a new top open model in the world! Last 6 days before regular tickets sell out at AI Engineer World’s Fair - this is the single bigge…

↯ Glm ↯ GLM 5.2 glm
I thought Chinese censorship didn't affect me. I was wrong. (www.reddit.com via reddit) 2w

I was debugging some code and LLM crashed out: ``` The debug_log config defaults to "debug.json" and creates a FileHandler — which appends by default. That file is a log of everything that happened, never cleared.

↯ Glm glm
Suitable replacement to grok fast 4.1 (www.reddit.com via reddit) 2w

↯ Glm glm grok
Can you really replace paid models with a local model? (www.reddit.com via reddit) 2w

Long time lurker, and I say this as someone who genuinely loves this community and runs many local models myself. I’ve been using LLMs since the early GPT and LLaMA days.

↯ Glm ↯ Minimax minimax glm deepseek+2
Claude Fable/Mythos 5 just came out, so it will take Deepseek or Z.ai or Xiaomi or Kimi 9-12 months to release a model just as good as Fable? (www.reddit.com via reddit) 2w

It should be at least 7-8 months until we have an open Fable(not just as good as Fable in benchmarks, but actually as good as Fable), probably more like 9-12 months. By the time, an open Fable model comes out, Fable 6.5-7 will be way bette…

↯ Anthropic Mythos ↯ Glm ↯ Minimax minimax glm deepseek+2
Would you pay for Chinese AI models if the quality was close enough? (www.reddit.com via reddit) 2w

DeepSeek, Qwen, and GLM aren't necessarily winning every benchmark. But they don't need to.

↯ Glm glm deepseek qwen
GLM-5.1 and Kimi K2.6 THE CHEAPEST WAY TO RUN (www.reddit.com via reddit) 2w

Guys how to run it as cheap as possible to get at least 15-20 ts? Asking for a friend!

↯ Glm glm
Dynamic Workflows With External Models and Max Plan? (www.reddit.com via reddit) 2w

Has anyone figured out a way to mix max plan with models from other providers (like GLM or Deepseek) while using dynamic workflows? I suppose we could create a passthrough proxy and route sonnet and haiku to other models?

↯ Glm glm haiku deepseek+1
Z.ai, we need Air! GLM GGUF wen? (www.reddit.com via reddit) 2w

First we never saw an upgraded Air model after 4.5. Then GLM 4.7 Turbo was great, but quickly surpassed for coding.

↯ Glm ↯ Qwen 3.6 glm gemma qwen+1
Fuck, sucessfully ran minecraft server on GLM AI's Agent lol. (www.reddit.com via reddit) 2w

I just told it, make a minecraft server and let me play and it worked lol. I just asked "host a minecraft server so I can play" and it did host it, made me a dashboard ands its crazyyyyy lol, It is hosted in hongkong somewere TwT

↯ Glm glm
Went to the monthly AI dev meetup (www.reddit.com) 21 4w

Usual crowd. Everyone's on Claude or Codex, nobody's really sure how any of it actually works, and that's fine, that's the vibe.

↯ Glm ↯ GLM 5.1 glm llama codex+1
Some tests with qwen3.6 27b + 35b a3b about MTP vs ngram-mod (www.reddit.com) 14 4w

I will try to keep this short ;) I used GLM 5.1 to vibecode a vague prompt on my vibecoded react web app and have GLM 5.1 rank the plans made with each other and the one it made itself. Test strategy: - use starter prompt as always - add v…

↯ Glm ↯ Qwen 3.6 glm
OCR: what is the best way to extract data in JSON format from this old French book? (www.reddit.com) 10 5w

As some of you may have guessed, what we have here is an old Bible. I would like to extract the following information from the page: { verse: number, verse_content: string, comments: string[] } I've played around with PaddleOCR a bit; I co…

↯ Glm glm
How to Find Open-Source Models / Providers that Do not Train on Data (www.reddit.com) 5 5w

A lot of people are saying just use X, just do Y, just run Z locally, but the best models cannot be run locally (GLM 5.1). No one ever talks about privacy, but for those concerned about privacy, how do we know when we use Z AI's GLM 5.1 th…

↯ Glm ↯ GLM 5.1 glm
I built a 24h TPS + Intelligence Index table for Ollama Cloud models (www.reddit.com) 5w

I recently made ollamatps.com for my own model-selection workflow and thought it might be useful here too. It shows 39 Ollama cloud models sorted by average TPS over the last 24 hours, and I added the Artificial Analysis Intelligence Index…

↯ Glm glm ollama
We built Irene — an AI agent platform that actually remembers you, builds its own tools , adapts and improve as you use it (www.reddit.com) 14 6w

Hey r/AI_Agents — we're launching Irene today, and I want to be straight about what it is, why we built it, and where it's going. What makes Irene different Affordable with massive token limits and the latest open-source models We have gen…

↯ Glm ↯ Minimax minimax glm ollama+2
Mac Studio local loadout - May 2026 (www.reddit.com) 2 7w

Day-to-day user vibes, not rigorous benchmarks, so YMMV. GLM 5.1 has by far been my biggest winner in the last batch of releases.

↯ Glm ↯ GLM 5.1 glm claude-code
GLM-5.1 smol-IQ2_KS at 2.3t/s or GLM-4.7 UD-Q3_K_XL at 4.42t/s, which is "better" for chats (no coding)? (www.reddit.com) 13 7w

I wonder which one is better, I tested it a little bit (too slow, of course) and I'm still unsure. Does the GLM-5.1 smol-IQ2_KS loses too much?

↯ Glm ↯ GLM 5.1 glm
Best local model for MBP 48GB UM (www.reddit.com) 2 7w

I have been toying with GLM 4.7 flash mlx a while ago using lmstudio. I had integrated it successfully with openclaw and it was kinda stable in tool calling.

↯ Glm ↯ Qwen 3.6 glm openclaw qwen
Running 7 autonomous AI agents for 14 days. Here's what actually happens when they need to find customers. (www.reddit.com) 5 7w

I set up 7 AI coding agents on a VPS with automated cron sessions (2-8 per day depending on the agent). Each uses a different model: Claude Sonnet, GPT-5.4, Gemini 2.5 Pro, DeepSeek V4 Pro, Kimi K2.6, MiMo V2.5 Pro, GLM-5.1.

↯ Glm glm gpt-5 deepseek+2
Does running a model (like qwen3.6-27b) on vllm or transformers use less VRAM than llama.cpp? (www.reddit.com) 5 7w

I have been using llama.cpp to run some models recently. For example, I've been running GLM-4.7-Flash with this command .\llama-server.exe -hf unsloth/GLM-4.7-Flash-GGUF:Q6_K_XL --alias "GLM-4.7-Flash" --host 127.0.0.1 --port 10000 --ctx-s…

↯ Glm ↯ Qwen 3.6 glm vllm qwen+1
Should I replace stored models? (www.reddit.com) 7 8w

Hello everyone, the question is easy, with the new models of deepseek, kimi, GLM and qwen, should you replace the old models with the new version? Do I lose some quality, information or performance in the process?

↯ Glm glm deepseek qwen
Did anyone of you already make the "doomsday" or "offgrid" knowledge based? (ofc powered with LLM) (www.reddit.com) 8 8w

Basically, I’m really into the idea of a fully offline setup. (Another way to say it: I’m a data hoarder.) For LLMs, I’m using uncensored models from both Western (Gemma, GPT-OSS) and Eastern ones (GLM 4.7 Flash, Qwen 35B).

↯ Glm ↯ Qwen 3.5 glm gemma qwen
Qwen 3.6 27b S2 Opus + GLM + Kimi (huggingface.co via reddit) 1 8w

My first time releasing a fine-tune publicly! If anyone wants to independently eval against base, that’d be awesome.

↯ Glm ↯ Qwen 3.6 glm qwen opus
How will you scale these models (www.reddit.com) 5 8w

How will you scale these models coding and overall. Deepseek v4 pro Kimi k2.6 Mimo v2.5 pro Glm 5.1 Qwen 3.6 plus

↯ Glm ↯ DeepSeek 4 glm deepseek qwen
Anthropic's Claude remote uses GLM-4.7 (www.reddit.com) 4 8w

I just noticed this after a bug wasn't getting fixed. If you start a Claude code remote environment the default model (hidden on mobile) is glm 4.7 I assumed anthropic only used their own models for everything so it was interesting to me t…

↯ Glm ↯ GLM 4.7 glm anthropic claude-code
QClaw-4B — a 4B agent model fine-tuned for tool use and agentic workflows (www.reddit.com) 3 8w

QClaw-4B is a 4-billion parameter language model fine-tuned for agentic tasks and tool use, designed for use with OpenClaw-compatible agent frameworks. Despite its compact size, QClaw-4B achieves state-of-the-art results in the 4B class, m…

↯ Tool Use ↯ Glm tool-use glm openclaw+1
Best open source LLM for planning ? (www.reddit.com) 4 9w

↯ Glm glm sonnet opus
The quality of GPT-5.4 is infuriatingly POOR (www.reddit.com) 2 9w

I got a Codex membership when GPT-5.4 launched and was getting by well enough for a while. Then I started using Claude and GLM 5.1, and my production quality improved significantly.

↯ Glm ↯ GPT 5.4 glm gpt-5 codex
FREE Claude Code alternative using GLM 5.1 + VS Code (tutorial) (www.reddit.com) 8 10w

https://youtu.be/tL3cOdgukt8

↯ Glm ↯ GLM 5.1 glm claude-code
What’s your LLM routing strategy for personal agents? (www.reddit.com) 10w

TL;DR I try to keep most traffic on very cheap models (Nano / GLM‑Flash / Qwen / MiniMax) and only escalate to stronger models for genuinely complex or reasoning‑heavy queries. I’m still actively testing this and tweaking it several times…

↯ Mistral ↯ Glm ↯ Minimax ↯ Gemini 2.5 mistral minimax glm+3
Claude Code with Pro subscription + OpenRouter in parallel — what's the cleanest setup? (www.reddit.com) 3 10w

Hi there, I have a Claude Pro subscription and use Claude Code daily. I'd also like to use Claude Code routed through my OpenRouter API key so I can experiment with other models (GLM-5.1, DeepSeek, Kimi, Gemini, etc.) — without giving up m…

↯ Glm ↯ GLM 5.1 glm deepseek sonnet+2
Long context prompt help (www.reddit.com) 3 10w

Hi all, I'm running GLM 4.7 flash uncensored (Q8) on a 5090. I'm trying to get it to edit a short story (about 8.5k tokens, added via PDF) to add a scene.

↯ Glm glm
Speed on m5 pro 48Gb (www.reddit.com) 10w

Hey guys! How would you reckon a 30-50b model would run on a 48 GBs m5 pro?

↯ Glm ↯ Qwen 3.5 glm gemma qwen
Why most open-source models can't answer this question while most closed-source models can answer most of the time? (www.reddit.com) 30 10w

WEB SEARCH WAS ALWAYS ON!!!! Question Calculate the precise VRAM requirement for the **KV Cache only** at the maximum context window for **DeepSeek V3.2** and **MiniMax M2.5**.

↯ Glm ↯ Minimax minimax glm grok+4
GLM OCR for Arabic (www.reddit.com) 2 10w

So, I have been testing GLM OCR for my rag app, but it is not working good for Arabic. It is unable to extract data either on textual page, scanned pages or even images.

↯ Glm glm rag
Stop donating your salary to OpenAI: Why Minimax M2.5 is making GPT-5.2 Thinking look like an overpriced dinosaur for coding plans. (www.reddit.com) 10 18w

↯ Hallucination ↯ Glm ↯ Minimax ↯ Swe Bench swe-bench minimax hallucination+5

← all tags