Ask HN: Opus 4.7 – is anyone measuring the real token cost on agentic tasks?

hn · news.ycombinator.com ·1 pts ·3h

Shipped today. The benchmarks are real: 87.6% SWE-bench (from 80.8%), +13% on coding tasks, 3x more resolved production tasks on Rakuten-SWE-Bench.

swe-benchagenticopus

open →

← back to top