model roundup

Gemini 3.1

9 items · started 2026-04-23 · closed 2026-05-02

Agent got stuck in a loop and spent over $2000 in less than two hours. (www.reddit.com)

+21 8w gemini cursor

I was trying to find a problem in my math heavy code and asked an agent (Gemini 3.1) to find the issue. Often when I know it’s a hard problem I let it be and go get coffee or lunch.
GPT 5.5 - Strong, not mind-blowing, but very token efficient (www.reddit.com)

+61 8w gpt-5 gemini openai

I've been benching GPT-5.5 for the past couple days and would like to share my findings. This is based on a benchmark I've created that pits models against each other in autonomous games of Blood on the Clocktower - a highly complex social…
Cursor (again) not working with Gemini 3.1 API (www.reddit.com)

+11 8w gemini cursor

Last week it was broken, then they "fixed" smth few days ago. Now again...
The Significance of Google's recent TPU 8t and TPU 8i (www.reddit.com)

+4 8w gemini

Cost & Performance Efficiency Training Cost-Performance (8t): +170% to +180% gain (2.7x–2.8x) Inference Cost-Performance (8i): +80% gain Training Power Efficiency (8t): +124% gain in performance-per-watt Inference Power Efficiency (8i): +1…
Unexpected $50 charge due to hidden model settings — is this intended? (www.reddit.com)

+2 8w gemini cursor

I’ve been using Cursor for ~1.5 years, mainly with Gemini 3.1 Pro. Recently I ran into a serious pricing issue.
Real benchmark breakdown in AI agents (www.reddit.com)

+42 8w gpt-5 gemini opus

I dove deep into the most recent benchmark stats from GPT-5.5, Claude Opus 4.7, and Gemini 3.1 Pro via official reports & third-party evaluations. I found a interesting thing:There’s no such thing as a “one-size-fits-all model.” My finding…
Kimi K2.6 - the mighty turtle that wins the race (www.reddit.com)

+62 8w gemini

Hi folks, I've been benching Kimi K2.6 for the past few days, and I'd like to share my findings. For context, this is based on a benchmark I've created that pits models against each other in autonomous games of Blood on the Clocktower - a…
Deepseek V4 Pro is 15x cost to run Artificial Analysis bench from V3.2, higher than Gemini 3.1 Pro (www.reddit.com)

+8535 8w deepseek gemini

Major performance jump though. Worth it?
Back to the real world.....anyone having problems using Gemini API after the update on the model descriptiont/selection? Gemini 3.1 Pro and Gemini 3 Flash are not working, only Gemini 2.5 flash. Is there an update on the pipeline to fix it? (www.reddit.com)

+44 9w gemini

could not extract summary

← all threads