model roundup

Gemini 3.1

5 items · started 2026-04-23 · ongoing (last activity 2026-04-25)

  1. Hi folks, I've been benching Kimi K2.6 for the past few days, and I'd like to share my findings. For context, this is based on a benchmark I've created that pits models against each other in autonomous games of Blood on the Clocktower - a…
