model roundup

Qwen 3.5

2 items · started 2026-05-27 · closed 2026-05-30

Inferencing at 10.33 t/s on Qwen 3.5 35B on a $300 laptop (www.reddit.com)

+98 4w qwen

https://preview.redd.it/u8062juegq3h1.png?width=1919&format=png&auto=webp&s=a213f6929c6cad58e92bc1681dac9f0545b04d13 Overview: As the market for consumer computing parts becomes more scarce due to the AI boom, finding ways to use lower-end…
Is a 128 GB MacBook Pro M5 Max actually too slow for large-context local LLM coding workflows? (www.reddit.com)

+114 4w moe rag qwen+2

People are warning me about the prompt-processing speed of a MacBook Pro M5 Max with 128 GB RAM. My main concern is prompt ingestion / prefill latency and large-context handling — not raw token generation speed (which I think is OK).