Hardware needed for Gemma 26B MoE vs Qwen 14B for ~100–300 users (vLLM, single node?)

reddit-localllama · www.reddit.com ·16 replies ↗ ·2d

moevllmqwengemma

open →

← back to top