Qwen3.5 50% expert reduction success

hn · news.ycombinator.com ·4 pts·1 replies ↗ ·6h

We surgically removed half the experts from Qwen3.5-35B-A3B to create 8 memory efficient domain specialists (coding, web, math, physics, biology, engineering, vocational, humanities). A cross-domain test shows a 96-point pass@5 gap between…

moe

open →

← back to top