Recent Open models from last 6 Months - Nov 2025 - Apr 2026 (www.reddit.com)
model roundup
DeepSeek 3.2
-
I created this chart with recent open models from last 6 months. Few might be older than that possibly.
-
More info, including charts, per-case metrics, raw judge outputs, and the parsed answer dump: https://github.com/lechmazur/position_bias This benchmark isolates one basic and frustrating failure mode. The model-average first-shown pick rat…
-
2x 512gb ram M3 Ultra mac studios (www.reddit.com)