A simple test-time method that beats Claude Mythos on Terminal-Bench

hn · llm-as-a-verifier.notion.site ·1 pts·1 replies ↗ ·2d

JavaScript must be enabled in order to use Notion. Please enable JavaScript to continue.

mythosclaude

open →

← back to top