Claude Opus 4.6 accuracy on BridgeBench hallucination test drops from 83% to 68%
Anthropic's flagship model just took a pretty significant accuracy hit on one of the most important AI benchmarks out there. So here's the deal: Claude Opus 4.6 was recently tested on BridgeBench, which specifically measures how often AI m…