Gemma4 26b & E4B are crazy good, and replaced Qwen for me!

reddit-localllama · www.reddit.com ·392 pts·100 replies ↗ ·1d

My pre-gemma 4 setup was as follows: Llama-swap, open-webui, and Claude code router on 2 RTX 3090s + 1 P40 (My third 3090 died, RIP) and 128gb of system memory Qwen 3.5 4B for semantic routing to the following models, with n_cpu_moe where…