model roundup

GLM 5.1

3 items · started 2026-05-24 · closed 2026-05-30

Zai replaced the network architecture running GLM-5.1 inference and the gains are pretty wild (www.reddit.com)

+29832 4w glm

Been following the infrastructure side of AI more lately and stumbled on this from Zai. They upgraded the network architecture on a thousand-GPU cluster running GLM-5.1 coding inference from the standard ROFT setup to something they built…

Went to the monthly AI dev meetup (www.reddit.com)

21 4w glm llama codex+1

Usual crowd. Everyone's on Claude or Codex, nobody's really sure how any of it actually works, and that's fine, that's the vibe.

I ran GLM-5.1 on a 16GB RAM machine (github.com via hn)

+3 4w glm moe

🧠 MoE-on-a-Potato: Running a 754-Billion Parameter LLM on a 16GB RAM Consumer PC. "Saying it's impossible is not engineering. Saying we don't know how yet is science." MoE-on-a-Potato is an experimental project dedicated to testing the extre…