As of mid Apr 2026, I have noticed every model has had a major intelligence drop. And no I'm not talking about just ChatGPT.
#sonnet
56 items
Major drop in intelligence across most major models. (www.reddit.com via reddit) Claude is now adopting the advisor strategy (www.reddit.com via reddit) Cursor is randomly talking Hebrew (www.reddit.com via reddit) Claude Benchmark Evolution (www.reddit.com via reddit) I’ve used enough AI models to realize they all have wildly different personalities At this point I’m convinced AI models are just coworkers with different levels of talent, ego, and criminal energy. (www.reddit.com via reddit) - Claude Opus 4.6 - absolute rogue AI. Does what I want like it’s breaking at least 3 internal policies to make it happen.
Guys we have to change the pelican test (www.reddit.com via reddit) So i have been seeing more of those pelican on a bike svg tests and while they work i feel like (and maybe you guys do too) they are getting kinda benchmaxxed so we should switch things up soon and this is my idea generate me a html svg of…
We benchmarked TranslateGemma-12b against 5 frontier LLMs on subtitle translation - it won across the board, with one significant catch (www.reddit.com via reddit) Show HN: Gave Claude a casino bankroll – it gambles till it's too broke to think (letaigamble.com via hn) Inspired by ALMA. As Claude loses money gambling on provably-fair slots, it's forced to downgrade from Opus → Sonnet → Haiku, making worse decisions and accelerating the spiral.
Emotional priming changes Claude's code more than explicit instruction does (www.reddit.com via reddit) I noticed Claude writing more defensive code after a frustrating debugging session. Got curious whether that was real, so I tested it.
Tell HN: Anthropic no longer allows you to fix to specific model version (news.ycombinator.com via hn) I just got an email from Anthropic telling me they are deprecating their good model, which actually works well, claude-sonnet-4-5-20250929, and will be forcing all users to use the worse newer model, claude-sonnet-4-6. Okay, fine, I though…
OpenAI Codex vs Claude Code in 2026 Spring (www.reddit.com via reddit) Gemma 4 31b 3D geometry (www.reddit.com via reddit) I have been nothing but impressed by the quality of Gemma 4 since release. In general conversation it's adaptable to different personas.
I made a web game with Claude! An aquarium without fish 🐠🫧 (www.reddit.com via reddit) But with LLMs trying to exist! Zero coding background.
Single question llm comparison (www.reddit.com via reddit) I open-sourced a memory system for AI agents that scores 89.9% on LoCoMo -- 22 points above Mem0. Here's the architecture. (www.reddit.com via reddit) Am I missing something, or is Sonnet enough for most dev work? (www.reddit.com via reddit) Genuine question: why do so many devs use Opus all the time? I’m not trying to be condescending, I’m genuinely trying to understand.
$1,400/month with Cursor + Claude API — how are you managing costs while keeping a real agentic workflow? (www.reddit.com via reddit) Which Claude is most emotionally steerable? (www.reddit.com via reddit) Follow-up to my post last week on emotional priming. A few of you asked whether this works across models, whether it degrades with repeated use, and whether excitement can make code worse.
Ask HN: Is Claude Getting Worse? (news.ycombinator.com via hn) It feels like most Claude Code users have already noticed a quality drop in the Claude models. As a Claude Pro subscriber (Web version; I don't use Claude Code), I’ve seen a clear decline over the last couple of weeks.
Show HN: Hormuz Trail - Oregon Trail parody/black-box AI coding exercise (hormuztrail.com via hn) I jokingly told a co-worker Iran might make a good Oregon Trail parody. Then I built it.
Claude Code asking me to switch models mid-stream, if I turn an Opus conversation into a Sonnet one does it lose all the Opus context? (www.reddit.com via reddit) You've been blocked by network security. To continue, log in to your Reddit account or use your developer token If you think you've been blocked by mistake, file a ticket below and we'll look into it.
PSA: Audited my Claude Code setup: 30,000 tokens (15%) gone before I type (www.reddit.com via reddit) I was burning through Claude Code usage way faster than expected. So I audited what my setup was actually loading before I typed anything.
Anybody has practical experiences using Chinese models? (www.reddit.com via reddit) So like with coding or any craft, I think there's a proper Tool for the job. Sure you can use a stone to hammer drive in a fence post, but a a sledge is usually more economical.
I got better results when I made each AI tool do one job (www.reddit.com via reddit) Please help me pick the right Qwen3.5-27B format/quant for RTX5090 (www.reddit.com via reddit) Hi all, first post here. I've started a project in OpenClaw a month ago, and it's been a very "intense" 4 weeks to say the least...
Cache reads / writes are expensive!! (www.reddit.com via reddit) I made a tool (posted a couple of days ago). Got caught up in scope creep / curiosity after looking at my `~/.claude/project` JSONL files more, and ended up learning a lot!
Top up or two accounts? (www.reddit.com via reddit) I'm a new fiddler with cursor and I'm mostly using composer in slow mode on the $20 pro plan. I've done a TON in the last two weeks, by my measure, though I'm really only finding a couple of hours per day to hack at my projects, so I'm far…
Cursor AI not using sub-agents (www.reddit.com via reddit) Help with antigravity alternative (www.reddit.com via reddit) Errrr...... Being cheated here? Anyone else? (www.reddit.com via reddit) Being charged opus for sonnet useage?!
Local Coding Stacks (www.reddit.com via reddit) I’m trying to reduce my reliance on Claude. I have a 5090/128GB RAM.
Voice mode silently downgrades your model mid-conversation (www.reddit.com via reddit) Noticed something odd today. I opened a new chat with Opus 4.6 selected as the default.
Anyone know why the shortcut key for claude desktop mac app opens with only Sonnet instead of Opus? (www.reddit.com via reddit) When clicking opt twice, it open the quick chat window, but it always replies with Sonnet and not Opus. When I try to change the model it starts a new chat.
Switching between thinking/non-thinking model after new update became harder. (www.reddit.com via reddit) https://preview.redd.it/64uvkscj2hvg1.png?width=566&format=png&auto=webp&s=7cb99710a830c73a817d0c1095cb434e8031de35 Cursor moved selecting thinking/non-thinking model to edit ☹️. So we need to edit it if we want to use both thinking and no…
Premium Model option (www.reddit.com via reddit) Can someone explain to me clearly and give me examples on the Premium Model option in Cursor. Will it use the API usage (e.g.
Closest LLM to Claude Sonnet 4.6? (www.reddit.com via reddit) Irrespective of hardware, I'm wondering: is there any way to run something similar to Claude Sonnet 4.6 locally? is there any way to run something similar to Claude Sonnet 4.6 on a VPS?
How does a self correcting loop for AI agents work? (www.reddit.com via reddit) Hey guys, just checked out minimax 2.7, where they used AI to train itself, and ran over a hundred loops, and it improved it's performance by 30%, how does that work, can I also run a script that makes AI store it's memory in a loop on a m…
Current Cursor Pro limits vs standalone Claude Pro? Need help understanding the system. (www.reddit.com via reddit) Hey everyone, I'm currently looking into getting the Cursor Pro subscription ($20/mo) for my game dev projects, but I’m a bit confused about the current limits and how the system works under the hood right now. Could anyone using the Pro t…
It took a while, but Claude is getting there (www.reddit.com via reddit) I have a Claude Code session regularly dispatch Claude Haiku / Sonnet subagents to sift through all the *other* Claude Code sessions transcripts for "meme-worthy" moments and interactions. Claude seems to have gotten the hang of it, even s…
Any setup improvements/recommendations? (www.reddit.com via reddit) Built tier.love – a tool for rating Claude and others from the web or CLI (www.reddit.com via reddit) Is Composer 2 any good? (www.reddit.com via reddit) Claude Code with Pro subscription + OpenRouter in parallel — what's the cleanest setup? (www.reddit.com via reddit) Hi there, I have a Claude Pro subscription and use Claude Code daily. I'd also like to use Claude Code routed through my OpenRouter API key so I can experiment with other models (GLM-5.1, DeepSeek, Kimi, Gemini, etc.) — without giving up m…
I set up Opus as a strategic advisor for my Sonnet workflow. Here is the subagent config that makes it work. (www.reddit.com via reddit) Anthropic published the Advisor Strategy this week. The idea: a cheaper model does the actual work, a stronger model only gets consulted on hard decisions.
Claude’s new Advisor Strategy for AI agents is pretty interesting (www.reddit.com via reddit) A lot of people building AI agents run into the same problem sooner or later. If you run the entire agent on a powerful model, it works well but the costs grow quickly.
Sonnet is expensive, so I built a free open-source Sheets agent on Haiku that outperform the same prompt claude/gemini, here is what I learnt. (www.reddit.com via reddit) I live in Google Sheets. Financial models, projections, scenario planning — that's most of my working day.
Modelo local para code (www.reddit.com via reddit) Buen dia amigos, consulta, donde puedo encontrar alguna comparación de los modelos locales para codificación similares a Sonnet ? Gracias!
"My parallel multi-model pipeline: Opus for planning, 3x Sonnet for content, 3x Haiku for search — what's your setup?" (www.reddit.com via reddit) "I've been running a parallel multi-model pipeline and curious what setups you all are using. My current workflow: Opus: Planning & high-level architecture Sonnet x3: Content generation (running 3 instances in parallel) Haiku x3: Search, v…
sonnet 4.6 unhinged :skull: (www.reddit.com via reddit) was asking for domain names and got ts response :skullsob:
You know you have become a "Senior Vibe Coder" when you actually stop and think about which AI model to use for a specific task. (www.reddit.com via reddit) Junior vibe coder: Throws the entire codebase at whatever frontier model is trending this week and burns their API budget in 4 hours. Senior vibe coder: "I need Codex 5.3 for rapid scaffolding, Sonnet for the Tailwind components, and I'm s…
Zoomer Agent Usage (www.reddit.com via reddit) I built a Rails app to do some standard stuff for an agency - it's got some vertical data and an internal agent to do a few bits Then added Slack bot that routes to the agent, and 80+ MCPs to query things (I'm not going to fight about it)…
Here is what most people get wrong about saving tokens with AST tools (www.reddit.com via reddit) I spent the last day benchmarking codebase context tools against a real AI agent. Not synthetic token counts.
Programming – How can I get great results with this hardware? (www.reddit.com via reddit) Is 32GB Mac enough for engineering/coding, or stick to Claude? (www.reddit.com via reddit) Sonnet 4.6 Medium Braind? (www.reddit.com via reddit) Confused about these Models on GITHUB COPILOT, NEED HELP (www.reddit.com via reddit)