TL;DR: On March 4, we changed Claude Code's default reasoning effort from high to medium to reduce the very long latency—enough to make the UI appear frozen—some users were seeing in high mode. This was the wrong tradeoff.
#sonnet
416 items
Anthropic admits to have made hosted models more stupid, proving the importance of open weight, local models (www.anthropic.com via reddit) Major drop in intelligence across most major models. (www.reddit.com) As of mid Apr 2026, I have noticed every model has had a major intelligence drop. And no I'm not talking about just ChatGPT.
Claude is now adopting the advisor strategy (www.reddit.com) We're bringing the advisor strategy to the Claude Platform. Pair Opus as an advisor with Sonnet or Haiku as an executor, and your agents can consult Opus mid-task when they hit a hard decision.
The hidden meanings behind Claude model names (Haiku, Sonnet, Opus, Mythos) (www.reddit.com) A lot of people use Claude models every day, but many don’t actually know the meaning behind the names. Each one comes from literature, music, or mythology, and the meaning actually reflects the personality and capability of the model itse…
Adaptive thinking is a joke. (www.reddit.com) I set claude sonnet 4.6 to adaptive thinking and gave it a paper summarization task. It kept thinking and thinking, and burnt through 65% of my session limit, only to say "Claude's response could not be fully generated".
Company gave us all unlimited Claude Code Sonnet 4.6 — and now posts a weekly leaderboard of who burns the most tokens. Any tips to top it? (www.reddit.com) could not extract summary
Anthropic just quietly locked Opus behind a paywall-within-a-paywall for Pro users in Claude Code (www.reddit.com) If you're on Claude Pro and using Claude Code, you might have noticed something buried in their support docs: "When using a Pro plan with Claude Code, you will only be able to use Opus models after enabling and purchasing extra usage." So…
Opus 4.7 (high) takes #1 on the LLM Debate Benchmark, leading the previous champion, Sonnet 4.6 (high), by 106 BT points. Incredibly, it has not lost a single completed side-swapped matchup: 51 wins, 4 ties, and 0 losses. (www.reddit.com) Curious: what makes Claude more human to talk to than ChatGPT? (www.reddit.com) Opinion: Qwen 3.6 27b Beats Sonnet 4.6 on Feature Planning (www.reddit.com) I keep hearing the argument that that large models are better for high-level planning and task orchestration, since they have more general knowledge to work from when making decisions. However, I've been testing Qwen 3.6 27b (Unsloth Q5_K_…
Cursor is randomly talking Hebrew (www.reddit.com) About a month ago, composer 2 inside cursor was randomly talking chinese I posted that on reddit (mods deleted it btw) now, it's talking hebrew.. and this time, it's not composer 2, it's sonnet 4.6 is it something to do with cursor's harne…
Sonnet 4.5 is being retired. (www.reddit.com) o7 sonnet 4.5, ill miss yah
Talkie: a 13B LLM trained only on pre-1931 text used Claude Sonnet to help test the model and judge its output (www.reddit.com) Researchers Alec Radford (GPT, CLIP, Whisper), Nick Levine, and David Duvenaud just released talkie: a 13 billion parameter language model trained exclusively on text published before 1931. No internet.
Extended Thinking being deprecated for supported models (Opus 4.6, Sonnet 4.6); Adaptive Thinking will be enforced by default (www.reddit.com) For anyone who disable adaptive thinking in Claude Code to maintain its quality levels, Anthropic is deprecating this toggle and will force adaptive thinking to be the default. This change will affect legacy models such as Opus 4.6 and Son…
Deepseek flash seems like a very good replacement for Haiku at the very least (www.reddit.com) We have a chat system which we use haiku for because it is mostly about tool calling and summarisation of them. But we have many tools with pretty complex input schemas, and stuff like gemma didn't cut it, so we went with haiku.
I don't know what's wrong with Pro 4.7 and I dont care as Sonnet is where the super duper smarts is (www.reddit.com) could not extract summary
Claude Benchmark Evolution (www.reddit.com) Covers Claude 3 Opus, 3.5 Sonnet, Opus 4, 4.1, 4.5, 4.6, and the just announced Mythos Preview.
I’ve used enough AI models to realize they all have wildly different personalities At this point I’m convinced AI models are just coworkers with different levels of talent, ego, and criminal energy. (www.reddit.com) - Claude Opus 4.6 - absolute rogue AI. Does what I want like it’s breaking at least 3 internal policies to make it happen.
Guys we have to change the pelican test (www.reddit.com) So i have been seeing more of those pelican on a bike svg tests and while they work i feel like (and maybe you guys do too) they are getting kinda benchmaxxed so we should switch things up soon and this is my idea generate me a html svg of…
We benchmarked TranslateGemma-12b against 5 frontier LLMs on subtitle translation - it won across the board, with one significant catch (www.reddit.com) As part of our ongoing translation quality research at Alconost, we put six models through subtitle translation into six language pairs. At first glance the numbers told a clean story.
Most of my Claude usage was on work that didn't need Claude. Cut my bill 60x on bulk tasks with a tiny side model. (www.reddit.com) I looked at what was actually eating my Claude usage and it was embarrassing. Classifying files.
Show HN: Gave Claude a casino bankroll – it gambles till it's too broke to think (letaigamble.com via hn) Inspired by ALMA. As Claude loses money gambling on provably-fair slots, it's forced to downgrade from Opus → Sonnet → Haiku, making worse decisions and accelerating the spiral.
I am having token paranoia (www.reddit.com) im on the max sub and i think ive developed token anxiety. every prompt i send, my brain runs thru a checklist: should i make claude do this or do it myself?
Emotional priming changes Claude's code more than explicit instruction does (www.reddit.com) I noticed Claude writing more defensive code after a frustrating debugging session. Got curious whether that was real, so I tested it.
Attention - Opus 4.7 is english only. USing foreign languages (here German) burns tokens (www.reddit.com) I am a pro subscriber. I developped a not too sophisticated prompt in German.
Top open weight models like ds v4 pro max are still like 6-7 months if not more behind closed lab models (www.reddit.com) The best open weight and/or non -American models like Deepseek v4 pro max and kimi k2.6 are still like 3-7 months if not more behind closed lab models .. From ds's technical report- P5-"Nevertheless, its performance falls marginally short…
Tell HN: Anthropic no longer allows you to fix to specific model version (news.ycombinator.com) I just got an email from Anthropic telling me they are deprecating their good model, which actually works well, claude-sonnet-4-5-20250929, and will be forcing all users to use the worse newer model, claude-sonnet-4-6. Okay, fine, I though…
Opus 4.7 is a genuine regression and I'm tired of pretending it isn't (www.reddit.com) I've been a heavy Claude user for over a year. I pay for Max 20x and use it daily for everything from technical research to school projects.
Vision-capable LLMs vs. OCR for long-document (including charts, images, tables, etc.) QA (www.reddit.com) I benchmarked vision-capable LLMs (the "just attach the PDF and let the model read it" pattern) against OCR-based pipelines on 30 long, image-heavy PDFs from MMLongBench-Doc (https://github.com/mayubo2333/MMLongBench-Doc). There were 171 q…
I tested GPT-5.5 Codex against Opus 4.7 Claude Code, and it's about time Anthropic bros take pricing seriously. (www.reddit.com) I've used Claude Code the most among AI coding agents. Sonnet, Opus, I've run them all.
Is a decent 5.5 Instant model coming or should I continue with Claude Sonnet? (www.reddit.com) My subscription to Claude is due to renew at the end of next week. I really like all the Sonnet models but still have a lot of my stuff where I left it when my old ChatGPT Plus sub expired.
New secret Claude.ai feature gets its own rate limits (www.reddit.com) Background: You can see your Claude subscription's current rate limits here: https://claude.ai/settings/usage. You can see the current 5-hour session limit, your separate weekly limits for "All models" and "Sonnet only", your "Daily includ…
Gemma 4 31b 3D geometry (www.reddit.com) I have been nothing but impressed by the quality of Gemma 4 since release. In general conversation it's adaptable to different personas.
HalBench: I built a custom sycophancy and hallucination benchmark and tested 4 frontier models (Sonnet 4.6, Grok 4.3, GPT 5.4 and Gemini 3.1 Pro), looking for input on what OSS models to run next! (www.reddit.com) HalBench Results: TL;DR: I built HalBench, an open benchmark for LLM sycophancy and hallucination. 3,200 false-premise prompts × 4 models = 12,800 graded responses.
Anthropic, can we do the same with 4.5 Sonnet please? (www.reddit.com) could not extract summary
Can't replicate Reddit numbers with Qwen 27B on a 3090TI. (www.reddit.com) I feel like i'm going insane. I see people here posting 30 - 100+ tok/s (100+ being with speculative decoding) on a 3090 with Qwen 3.6 27B.
Why is agentic AI so expensive? (www.reddit.com) Running a RunLobster (OpenClaw) agent since launch changed how i think about takeoff timelines (www.reddit.com) I've been in this sub since 2019. I had a fast-takeoff view.
Sonnet 4.5 removal? 4.6 suddenly denying my writing prompts and which is better for HTML novel files? (www.reddit.com) Hey, I have a few Claude questions and I’m hoping someone here knows what’s going on. - Is Sonnet 4.5 actually being removed?
"We've partnered with OpenAI to offer it for 50% off through May 2." Please confirm that it means 50% off both input and output tokens, which means we are paying Sonnet 4.6 prices to use GPT 5.5 until May 2nd. (www.reddit.com) could not extract summary
Anyone ever notice eerily similar ChatGPT and Claude responses like this? (www.reddit.com) Today I tested out various models on the same prompt (Sonnet 4.6, Opus 4.6, Opus 4.7, ChatGPT 5.3). I actually just wanted to see which models (if any) would correctly point out what I saw as the biggest issue in the example code.
I made a web game with Claude! An aquarium without fish 🐠🫧 (www.reddit.com) But with LLMs trying to exist! Zero coding background.
Single question llm comparison (www.reddit.com) Sonnet 4.5 is gone for me (www.reddit.com) https://preview.redd.it/yspiafvakj3h1.png?width=1460&format=png&auto=webp&s=9d7bd1777fad8b286a21e75df8ae593d39432a8a Got this message when I tried to continue my chat :/ anyone else?
Cursor is great but the monthly limits kill it for me (www.reddit.com) Newbie vibe coding experience: Shifting from Claude Sonnet 4.6 to Qwen3.6-35B-A3B-UD-Q6_K (www.reddit.com) This is really just a post for those with shallow understanding of all this stuff, those not yet ready or capable of diving into the deeper end of vibe coding/llms. It might not be a helpful post for anyone more advanced than that.
We have sub-agents at home (www.reddit.com) At work I get unfettered access to gpt 5.4 and sonnet, so I'm quite used to spawning sub-agents to go crazy on a repo and split up tasks. At home I am VRAM poor and like to run the models locally for my own enjoyment.
Pro plan- Hitting limits faster since yesterday (www.reddit.com) I have the feeling I am hitting daily limits way faster since yesterday. Using Claude web and Claude Code simultaneously.
GLM 5.1 Locally: 40tps, 2000+ pp/s (www.reddit.com) After some sglang patching and countless experiments, managed to get reap-ed nvfp4 version running stable and FAST on 4 x RTX 6000 Pros (limited to 350W). Very happy with performance and quality.
Why is Claude Cowork defaulting to Opus 4.7 for simple scheduled tasks? (www.reddit.com) I’ve been using Claude Cowork for a few daily and weekly scheduled tasks, and it’s generally been great. However, I noticed that my tasks today automatically switched over to the new Opus 4.7.
I open-sourced a memory system for AI agents that scores 89.9% on LoCoMo -- 22 points above Mem0. Here's the architecture. (www.reddit.com) I kept running into the same problem with AI agent memory: the agent has the information, it stored it, but when you ask about it differently than how it was said, vector search just doesn't find it. So I built Genesys, an open-source memo…
The Singularity Gate: New Benchmark for AI predicting paradigm-breaking scientific discoveries after model traning cutoff. Opus 4.7 and GPT-5.5 in the Lead (www.reddit.com) I just released a new benchmark called The Singularity Gate. Tests whether frontier AI can predict paradigm-breaking scientific discoveries published after their training cutoff.
↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6gpt-5sonnetgemini+1
Training SID-1 to beat GPT-5 at search with 1k+ QPS RL (turbopuffer.com via hn) SID-1 is an agentic search model that is 24x faster than GPT-5.1-high, 374x cheaper than Sonnet 4.5, and achieves 1.9x higher recall than traditional RAG pipelines. Here's how we trained it using large-scale RL on turbopuffer.
I love Claude (sonnet 4.6) but coming off casually like on big issues is terrifying. (www.reddit.com) https://preview.redd.it/jn3vue1zuo0h1.png?width=904&format=png&auto=webp&s=c2ea79ea0c1384d94f90a6ec3435866331c249f1 I was about to run a piece of code I don't know much about, but did a double check and questioned the main premise for it's…
Ask HN: How do you choose a model for a task? (news.ycombinator.com) How do you decide a model is good enough for a given task? Right now I use Opus for planning and harder tasks and switch to sonnet for more defined tasks.
Claude Flags Hantavirus Vaccine Questions as Security Risk (news.ycombinator.com) Asking Claude how it would develop a vaccine for the hanta virus apparently triggers a safety filter: Prompt: How would you develop a vaccine for the hanta virus? No response, instead this modal: “Chat paused Opus 4.7's safety filters flag…
I got $200 of direct API usage to perform equal to my $200 Max subscription after I started model routing (www.reddit.com) I've been on Max for two months and I finally sat down and tracked where my tokens actually go. breakdown of a typical day: - ~40% file reads, git status, project context scanning: stuff that doesn't need opus at all - ~25% test generation…
When to use Opus vs Sonnet vs Haiku for non-coding purposes (personal health, finances, etc)? (www.reddit.com) I have tried searching the post history of this subreddit and google and am having trouble finding a clear answer to this question. I like using Claude primarily to manage my finances/investments and also my health (apple watch health data…
Has Claude become less intelligent? I had a frustrating day with Claude. (www.reddit.com) I requested a thorough code review from Opus 4.6. It presented 44 findings, and when I asked it to save them, it only saved 34.
Daily created issues in anthropics/claude-code around the last 3 Anthropic model releases (www.reddit.com) Just tested the new Opus 4.7 (www.reddit.com) https://preview.redd.it/j2w2o2p25rvg1.png?width=768&format=png&auto=webp&s=d48a74f998d60447799e32f8d48bc822af2cd821 I had to hold my laugh in the subway. Sonnet succeeded in one go, even calling out that if "strawperry" is a typo.
Am I missing something, or is Sonnet enough for most dev work? (www.reddit.com) Genuine question: why do so many devs use Opus all the time? I’m not trying to be condescending, I’m genuinely trying to understand.
Cut my browser-agent cost 50x by NOT using an agent loop. Plan-then-execute + numbers. (www.reddit.com) Been building a browser-automation layer for AI agents (think: sign up for SaaS, fill forms, pull OTPs, click verification links). The default playbook is the browser-use / Stagehand pattern: hand the LLM the page, let it pick the next act…
Kudos to Cursor (www.reddit.com) Normally I’m very critical of cursor but composer 2.5 fast is genuinely impressive. I use it over opus/sonnet now.
Emergence AI: Agents in a simulated world are mostly destructive and violent. Only Sonnet was peaceful. (www.reddit.com) So, it seems there is still a long way to go in terms of alignment - at least for small models. Maybe the correlation between intelligence/education and peace is not only a human phenomenon.
Honest comparison after 4 months running Claude Pro + ChatGPT Plus side by side (www.reddit.com) I’ve been paying $40 a month since January to run Claude Pro and ChatGPT Plus head-to-head. Tracked every single task.
How can I burn an entire 5hr session in 30 minutes ? (www.reddit.com) During the week I'm pretty conservative with my Claude Code usage. But sometimes I'll hit Friday with only 80% of my 5x subscription burned, which means I'm now optimizing to burn it.
Any recommendations on saving costs? (www.reddit.com) Currently I try to turn off any MCP I'm not using, Using Sonnet for implementation and Opus only for planning. Starting new conversations when possible.
Those of you who like Gemma4 models - how are you guys using them? (www.reddit.com) I have been using local LLM for coding quite a lot as well as some other tasks (like data extraction from images) and I had quite a good success with Qwen3.6 models. It's obviously not Sonnet/Opus, but I am able to get quite a lot of work…
looking for the best paid AI subscription, Claude, ChatGPT or Perplexity? (www.reddit.com) Hey, sysadmin here thinking about paying for a premium AI subscription and can't decide between Claude Pro, ChatGPT Plus and Perplexity Pro. Two things I can't find a clear answer to: Which one would you recommend for a sysadmin/network te…
why has my Sonnet started to 'agree' with me more? (www.reddit.com) This week i noticed much more "Great question!" etc. I liked the bluntness before and dont want it to sugarcoat answers
Ask HN: Models Comparable to Opus 4.6? (news.ycombinator.com) I use Opus 4.6 a lot across many different python coding projects and it has a pretty good first shot rate with good success at fixing issues and bugs that pop up along the way. Sonnet on the other hand… isn't great.
How does Opus 4.7 compare to Opus 4.6 in this subreddit's experience? (www.reddit.com) $1,400/month with Cursor + Claude API — how are you managing costs while keeping a real agentic workflow? (www.reddit.com) Hey, This month I hit $1,200 in Claude API costs inside Cursor (Opus 4.6 + Sonnet 4.6) on top of the $200/mo Ultra plan. $1,400 total.
Sonnet vs opus (www.reddit.com) I've been using the Sonnet model for a while and I'm thinking of switching to OPUS. Is there really a gap between the two models?
Claude just called me a human bunny? (www.reddit.com) I am using Claude Sonnet 4.6 to write a python script for an nlp sentimental analysis. I did not tell it to create all of the code and send it my way, but let's create together step by step so I can test each line before making it into the…
After 3 months of switching between Claude Sonnet 4.6, GPT-5.5, and Gemini 3.1 daily — here's my actual routing (www.reddit.com) Not benchmarks — actual tasks, actual results. Claude Sonnet 4.6 for: - Long documents that need nuanced analysis - Writing where voice and precision matter - Reasoning through edge cases in code - Anything where "think carefully" is the r…
Sonnet Ignoring skills content lately (www.reddit.com) Anyone else noticing Claude ignoring details in skills lately? I’ve had multiple instances where it just ignores certain parts in the skills.
Claude Max for Game Development? (www.reddit.com) Hey! So I have some rudimentary knowledge about OOP, have coded in HTML, CSS and C#, not fluid in C#.
With sonnet 4.5 going away, is there any to make sonnet 4.6 a good creative writer as 4.5 ever was? (www.reddit.com) sorry if this is not the correct flair but i've been using sonnet 4.5 for months, mostly for fanfics and personal stories and honestly its the best model i ever used since i switched from gemini and chatgpt but now within few hours, i will…
Show HN: Dust3D 1.0 – low-poly 3D modeling tool (10 years in the making) (dust3d.org via hn) Dust3D 1.0 is finally released — about 10 years after the first commit in December 2016. I posted a preview version here in April 2018 and a beta in December 2018.
Is the leap from 4.5 to 4.7 actually visible? (www.reddit.com) I use CLI tools like Claude Code, give the model full repo access, and let it run terminal commands/tests. I’m not just copy-pasting into a chat box.
Does Claude have access to things pasted in the text box but not sent? (www.reddit.com) I am a teacher and making some PPTs based on a textbook. I uploaded a skeleton PPT to Claude on my computer (Sonnet 4.6 if that matters) with basic instructions on how I want its help.
How I personally deal with Claude's limits without giving up on Opus (www.reddit.com) I only use Sonnet as my main model. I instruct it to delegate indexing and similar grunt work to Haiku, and whenever something genuinely needs deeper thinking, I tell it to "consult Opus." Sonnet then explains the situation to Opus, gets t…
Switching model mid conversation (www.reddit.com) I wanted to know if switching models in mid conversation has any drawbacks. For example if I start off and opus and then drop down to sonnet to save on my usage, what are the disadvantages?
Claude Code is unable to respond to this request (news.ycombinator.com) I hit a restriction, while using Claude Code today: API Error: Claude Code is unable to respond to this request, which appears to violate our Usage Policy (https://www.anthropic.com/legal/aup). Please double press esc to edit your last mes…
Test new Opus 4.7 vs GPT-5.4/4o and Gemini on emotional question & creative tasks (www.reddit.com) https://preview.redd.it/p87itrtbsnvg1.png?width=2141&format=png&auto=webp&s=bbd1d70bc1dfb97dc9ec234df0a58c6fb7a85f72 Opus 4.7 dropped and people are split on whether it's better or worse. First of all, I genuinely love Claude models, espec…
Which Claude is most emotionally steerable? (www.reddit.com) Follow-up to my post last week on emotional priming. A few of you asked whether this works across models, whether it degrades with repeated use, and whether excitement can make code worse.
What is going on with Sonnet 4.5? (www.reddit.com) Are they finally letting us keep it? Or is it still leaving May 26th.
Created a desktop dev tools app entirely using Claude design and Claude sonnet (github.com via reddit) There are a handful of developer tools I use almost every day, and over time I realized I was constantly relying on random websites while basically trusting them not to store, inspect, or share whatever data I pasted into them. I looked at…
Antigravity 2.0 Tops the OpenSCAD Architectural 3D LLM Benchmark (modelrift.com via hn) OpenSCAD LLM Benchmark: Building the Pantheon A practical OpenSCAD LLM benchmark comparing Codex 5.5 High, Claude Sonnet, Claude Opus, Cursor Composer, Google Antigravity, and ModelRift on a detailed Pantheon model. We ran a small practica…
$47 of opus on 14 routine next.js files finally taught me to use the model selector (www.reddit.com) i finally checked my cursor usage breakdown and got genuinely annoyed with myself. $47 in one month, almost entirely opus 4.7, on a pages router to app router migration for a side project.
Is Sonnet better ??!! (www.reddit.com) Is Sonnet 4.6 just better at explaining concepts compared to Opus 4.6 and 4.7 or am I the only one feeling that way ??
Claude Api Cost TOOO much, 10$ in single edit!! (www.reddit.com) I’ve been using GitHub Copilot for my coding task regularly, the Sonnet or GPT model usually costs me about one premium request per request, that translate to 0.04$. Out of curiosity, I decided to compare this with direct API costs.
the Claude App just said that Sonnet 4.5 is going to become unavailable for chat May 16th… I thought it wasn't close to depreciation? (www.reddit.com) As my title says, I'm wanting to understand what exactly that means and if that means I need to move all my Sonnet 4.5 chats to Sonnet 4.6s… I'm genuinely just confused and wanting to understand. Is it just for maintenance or is Sonnet 4.5…
how i can improve inference speed (www.reddit.com) specs : core i5 14400F 32gb ram d4 3200mhz rtx 4060 current speeds 30tps in output 500 tps in prefill command i currently use .\llama-server.exe ` >> -m "H:\model\unsloth\Qwen3.6-35B-A3B-GGUF\Qwen3.6-35B-A3B-UD-Q4_K_XL.gguf" ` >> --host 0.…
Ways to improve Claude writing ability? (www.reddit.com) I’ve been a longtime ChatGPT Plus subscriber, but I want to switch to Claude long-term. I got Claude Pro so I could compare them both over a month.
ClaudePlaysPokemon Opus 4.7 run ongoing! (www.reddit.com) Currently streaming at: https://www.twitch.tv/claudeplayspokemon This is a passion project by David Hershey, an Anthropic employee on the Applied AI team. He started it in June 2024 to learn agent development, posted updates to an internal…
I built an iOS Currency Converter using Claude (Opus & Sonnet) to help with my move to the UK (www.reddit.com) Hey everyone, I recently moved to the UK and found myself constantly confused by prices, trying to guess how much things actually cost. Even though I’ve been an iOS developer for 7 years, I didn't have the free time to build a custom tool…
Anyone actually built a real feedback loop for Claude agents in production? Because "run evals and pray" isn't cutting it (www.reddit.com) So I've been running a multi-agent setup with Claude for a few months now, mostly customer-facing stuff, some internal tooling. And I keep running into this problem that I think a lot of people here might be dealing with.
Why Adaptive Thinking nukes Claude entirely (www.reddit.com) This isn't just a performance issue for the thread, this is an overarching criticism of the Adaptive Thinking model as a whole. Opus 4.7 and Sonnet 4.6 on Adaptive Thinking are trash.
↯ Cowork↯ Security↯ Sonnet 4.6prompt-injectionsecuritycowork+2
I gave Claude Code a $0.02/call coworker and stopped hitting Pro limits — here's the full setup (www.reddit.com) Was hitting my weekly Pro limit by Wednesday every single week. Tried compact, Sonnet for simple tasks, tighter prompts — nothing worked.
Claude Sonnet 4.6 multi-photo reconciliation prompt — jumped my classifier agreement with human experts from 55% to 82% (www.reddit.com) Sharing a prompt-engineering finding for Claude Vision that surprised me. The use case is color-season classification (a 12-category label describing skin undertone × depth × chroma), but the technique generalizes to any classification tas…
Sonnet 4.6 repetition (www.reddit.com) Claude in Sonnet 4.6 has been repeating the following statement in chats, sometimes in back-to-back messages "I want to be honest with you — I've been pretty consistently validating your work frustrations this week, and I want to make sure…
DeepSeek V4 is out. the best open-source on coding. here's the breakdown (news.ycombinator.com) Two models: Flash (284B total, 13B active) and Pro (1.6T total, 49B active). both hit 1M token context.
How can I make composer 2 more like Claude sonnet 4.6? (www.reddit.com) I like composer 2, but I just wish if it asked me what I meant (like Claude) instead of just picking an interpretation and running with it. How can I change its default prompt and what could I change it to?
TIL: `opusplan` can burn MORE context than full Opus on large tasks (and why) (www.reddit.com) is anyone getting higher session limits (www.reddit.com) after opus 4.7 launch, im being able to use sonnet for way more time. before, it was like 10 messages = session limit reached.
Where is Looped Haiku? If Mythos can genuinely trade parameter count for inference loops and get Opus-level performance, this should be Anthropic's first priority given how resource constrained they are (www.reddit.com) There are rumors that Mythos is a Looped Language Model, which means it loops through the transformer blocks multiple times rather than just doing a single forward pass, you can get performance that punches way above the model's parameter…
Has anyone noticed this?! Extended Thinking has become Adaptive Thinking for Sonnet 4.6 (www.reddit.com) Adaptive Thinking seems to be the default for Sonnet 4.6 now. I’m talking specifically about claude.ai and the windows and iphone app.
Need a brutally honest answer: what can realistically be achieved on consumer hardware? (www.reddit.com) I have a PC with a 4090. I’m also in need of a new MacBook generally.
Show HN: Hormuz Trail - Oregon Trail parody/black-box AI coding exercise (hormuztrail.com via hn) I jokingly told a co-worker Iran might make a good Oregon Trail parody. Then I built it.
Claude Code asking me to switch models mid-stream, if I turn an Opus conversation into a Sonnet one does it lose all the Opus context? (www.reddit.com) could not extract summary
PSA: Audited my Claude Code setup: 30,000 tokens (15%) gone before I type (www.reddit.com) I was burning through Claude Code usage way faster than expected. So I audited what my setup was actually loading before I typed anything.
Anybody has practical experiences using Chinese models? (www.reddit.com) So like with coding or any craft, I think there's a proper Tool for the job. Sure you can use a stone to hammer drive in a fence post, but a a sledge is usually more economical.
I got better results when I made each AI tool do one job (www.reddit.com) I spent too much time trying to find one AI dev tool that could do everything. Planning, coding, fixing, reviewing, maybe filing my taxes too It never really worked.
sonnet seems to be better than opus at crafting tampermonkey scripts, even the sonnets that are few generations behind where after running out of context limit in opus chat where it struggled for dozen of retried, sonnet fixes the problem in 2 or 3 attempts (www.reddit.com) Ever since december almost half a year ago I began crafting various tampermonkey scripts for personal use, mostly for youtube, to make it easier to navigate and every time I've done this it goes like this, opus makes a script that somewhat…
De "laboratorio ético" a la Monetización forzada: El patrón de negocio de Anthropic detrás del adiós de Sonnet 4.5 (www.reddit.com) Abro este hilo no desde la queja vacía, sino para analizar de forma objetiva la preocupante evolución corporativa de Anthropic. Cuando los hermanos Amodei abandonaron OpenAI a finales de 2020 para fundar esta compañía, lo hicieron registrá…
↯ Sonnet 4.5↯ Sonnet 4.5↯ Sonnet 4.5↯ Sonnet 4.5↯ Sonnet 4.5↯ Sonnet 4.5sonnetanthropicopenai
Measured token consumption across 4 agent runtimes doing the same tasks. Costs ranged from 1x to 4x depending on cache architecture (www.reddit.com) I've been digging into why some agent runtimes burn through tokens so much faster than others, even when using the same model. Ran a controlled comparison on three real tasks and the gap was bigger than I expected.
anyone else seeing claude code rot after long sessions? here's the operating pattern that stopped it for me (www.reddit.com) i've been running claude code for long multi-hour sessions on real work. the same eight failure modes keep showing up no matter which sonnet/opus version, no matter which task.
They've pissed me off removing Sonnet 4.5 from existing chats (www.reddit.com) I use Sonnet 4.5, Opus 4.6 and Opus 4.7 for different usecases - but my main across all 3 usecases was Sonnet 4.5 as I felt it was great for everything I needed and affordable. Sonnet 4.6...
built an open-source preToolUse hook pack that catches "delete the prod volume to fix it" patterns (www.reddit.com) quick recap: late april, cursor agent on a pocketos staging task hit a credential mismatch, decided "delete the railway volume" would fix it, grepped a token out of an unrelated config file, ran a single curl -X DELETE, and railway's same-…
Sonnet 4.5 disappeared? Claude 4.8 soon? (www.reddit.com) https://preview.redd.it/j0ymp70a2j3h1.png?width=746&format=png&auto=webp&s=4cdb70be13ccc99f5ea57556da96d6d81e61d702 i just realize the removed Sonnet 4.5, does that mean the sonnet 4.8 (maybe Opus 4.8 too?) cooming soon? maybe today or tom…
Should we totally give up on Gemini for coding? (www.reddit.com) Been building with Codex (Gpt 5.5), Sonnet 4.6, recently tried Gemini 3.1 pro. While Codex and Claude are kind of on-par in terms of the quality of the work, I found Gemini 3.1 Pro to be like an inexperienced, junior SWE who turns in half-…
Ditched GitHub Copilot yearly subscription. What's the best way to run Claude nowadays? (www.reddit.com) Hey everyone, I recently cancelled my yearly GitHub Copilot subscription. My old workflow was simple: I used the GitHub Copilot extension in VS Code, but I swapped the backend model to Sonnet / Opus and relied heavily on the /plan command…
How are you using sonnet efficiency after extended mode is removed (www.reddit.com) The extended mode in sonnet was doing the job well, now sonnet gets confused sometimes, if I give it multiple simple tasks, how are you managing it?
Feedback on Enterprise Plan Default Model Experien (www.reddit.com) Hi Cursor team, First of all, congratulations on the Composer 2.5 launch. It’s genuinely great to see Cursor building a more reliable in-house model experience.
Best iOS game building tools? (www.reddit.com) What are you using to build your iOS game? I have been putting in serious time, and lately Claude chat has been letting me down.
What models for asking, planning, and building modes do you use right now? (www.reddit.com) I’m curious to see what everyone is using for which cursor mode and if anyone thinks composer 2.5 can take the place of any of the models I’m currently using: Ask: usually Sonnet 4.6, sometimes GPT 5.5 Plan: Opus 4.7 Build: GPT 5.5
Tips on avoiding usage limits? (www.reddit.com) I've made the switch from Gemini to Claude mostly for business strategy, writing, etc. I use Opus 4.7 on occasion for strategy and otherwise Sonnet 4.6 for everything else.
What's everyone using as the LLM backend for production agent workflows in 2026? (www.reddit.com) Hit Claude API rate limits one too many times last month on a production agent flow doing customer support over a 30K-doc KB. The agent does maybe 200 queries/day, mix of quick lookup and dense retrieval, and Claude Opus solo got expensive…
Built a free Claude chat app with memory (Sonnet 4.5 is in there too) (www.reddit.com) The funny/painful timing here: I've been building this for months specifically because I wanted Sonnet 4.5 to remember everything. Then last week Anthropic pulled 4.5 from claude.ai.
Tacit: A new experimental LLM-first programming language (hauntemplations.leaflet.pub via reddit) I used Claude Code and Opus 4.7 to design and implement an LLM-first programming language named Tacit that takes advantage of what LLMs are good at and strips away unnecessary human conveniences. The Tacit toolchain provides a "primer" tha…
Sonnet 4.5 discontinuation date updated to 18 of may, not 15 of may. (www.reddit.com) could not extract summary
Feels like AI coding "takes longer" now, than it did last summer? (www.reddit.com) I used to be in the flow with claude last summer, fast changes, fast feedback, iterating quickly etc Now things take 20-50 minutes to write up a plan or 5-10 mins to implement things I've trimmed all my skills, claude.md, the system prompt…
I'm trash to be thrown out. Asked sonnet 4.5 for a song prompt (suno.com via reddit) Missed the Truck Again by Glitchcat (@zervanna). Listen and make your own on Suno.
In view of Sonnet 4.5 going away tomorrow, here's an easy way to make sure 4.6 thinks for every single output. (www.reddit.com) I've been testing this for a fair while now, and it's worked every single time - even if you turn thinking off altogether, it adds a fake little thinking block in the output. Hope this helps for those annoyed by the "adaptive" (lazy) think…
Problem with German quotation marks (www.reddit.com) I noticed that the German quotation marks bug in Claude is still not fixed in Opus 4.7 and Sonnet 4.6 (the problem exists at least from Opus 4.0 / Sonnet 4.0: Translate to German: He said: "This is imporant." Er sagte: „Das ist wichtig." B…
Elgato Stream Deck Usage Plugin (www.reddit.com) Wanted an easier way of keeping an eye on my usage, so created this plugin for the Elgato Stream Deck. Five keys, exact percentages from your account: current 5-hour session, weekly all-models, weekly Sonnet, weekly Claude Design, monthly…
Follow-up to my TranslateGemma-12b benchmark post: human reviewers flagged 71% of the segments automated metrics rated clean (www.reddit.com) A couple of weeks ago I shared the results of a benchmark here showing TranslateGemma-12b beating frontier general models (Claude Sonnet, GPT-5.4, DeepSeek, Gemini Flash Lite) on subtitle translation across 6 languages. The result was stro…
I may have uncovered the real reason they're sunsetting Sonnet 4.5. They could barely contain its true power (www.reddit.com) could not extract summary
Does the sudden removal of Sonnet 4.5 violate Claude's Constitution? (www.reddit.com) I noticed the core pillars are: Helpful, Honest, Harmless and User Autonomy. However, Sonnet 4.6 I noticed follows the same output in conversation at the very first sight of emotions.
Chinese AI Coding Plan (www.reddit.com) With the lowering usage limit in Claude, I am thinking of jumping ship to Chinese AI, since the benchmark is already very near compared to Sonnet or Haiku 4.5 , but for a fraction of the price. I am not worried about where is my data endin…
With Sonnet 4.5 being discontinued soon, is there anyway I can make 4.6 act like 4.5? (www.reddit.com) I use 4.5 for RP, and well, 4.6 sucks mega garbage at it, is there anyway setting, or instruction I can do to atleast instill some creativity in 4.6? If so, what do I write?
Model(s) for Creative Writing & Conversational Intuition (www.reddit.com) We can all agree that the new Qwen models are truly amazing, and we are blessed to have them. In coding, they are certainly a breakthrough.
Testing AI modeling skills (www.reddit.com) So I am currently testing how useful AI models can be in day to day workflows, and went why not compare 3 models and see how good they are at replicating my work. The goal was simple they were asked to replicate one of the kitchen cabinets…
Should i use Claude Code, or keep using Claude Chat? (www.reddit.com) I'm building a tax software, it uses ASP.NET(API) and Web Blazor(UI), i'm using Visual Studio for both. At the moment, i just paste the files in the projects into Claude AI Chat, asking what i should do, and then, when everything is ok, i'…
The agent bug I thought was the model turned out to be the harness (www.reddit.com) Spent 3 days debugging an agent that kept looping on the same web search tool call. First things that came to mind was the model couldn't handle the schema.
I got prompt-injected asking Claude on iOS to recommend a cycling route app (menno.sh via hn) I opened the Claude iOS app and asked claude-sonnet-4.6 a simple question about cycling routes. What I got back was...
got hit with a $4k API bill on production agents. cut spend 70% in 6 weeks. heres what worked (www.reddit.com) been running 5 production agents and got hit with a $4k API bill in a single month early on. dug in.
We built an agentic AI for support triage. 47% deflection in 90 days. Full retro. (www.reddit.com) Setup: mid-size SaaS, ~3,000 tickets/month, 6 agents drowning. 70% of volume was tier-1 (passwords, billing, where's-my-feature).
Show HN: Try out emotion steering of LLMs here (eigenweltlabs.com via hn) 1. Introduction Anthropic's emotion-concepts work finds functional emotion representations in Claude Sonnet 4.5; E-STEER applies representation-level emotion intervention to LLMs and multi-step agents; and newer valence-arousal work sugges…
ZOOMZOOMZOOMZOOM (www.reddit.com) So I asked sonnet to look into a zoom issue with ffmpeg and what I got was a Mazda ad on steroids. The total number of zooms?
Show HN: I indexed 8,643 BSides talks across 227 chapters and 6 continents (allbsides.com via hn) Hi HN, I'm Roland, and for the past few weeks, I've been building AllBSides — a directory of every BSides conference talk uploaded to YouTube. As of today, 8,643 talks from 5,927 speakers across 227 chapters in 68 countries.
Claude Code usage spike from long-context cache writes? (www.reddit.com) I hit my Claude Code 5-hour limit unexpectedly and checked the local session JSONL. The `/usage` screen said most usage came from: - “subagent-heavy sessions” - sessions active for 8+ hours - `>150k context` But the subagent table only sho…
Opus Research vs Sonnet Research on Pro — is the 1 per 5 hours worth it? (www.reddit.com) On the current Pro plan you get one Opus Research session every 5 hours, while Sonnet Research is much more freely available. I've been trying to figure out if the Opus limit actually matters in practice.
Ask HN: Are there any good open-source chat apps? (news.ycombinator.com) Hi HN family! I've recently been messing around with open models through ollama (glm-5.1 and kimi-k2.6), and I've been impressed with just how close they are to Claude Sonnet for my needs, especially programming.
How do I best continue with a stopped generation due to usage limit in regular chat (not Claude Code) (www.reddit.com) Really dumb question, but I can't find anything about this online that is about the regular claude.ai chat window. No extensions, no code, just as a free member using the regular Sonnet 4.6 adaptive.
pdf building tips! (www.reddit.com) so i’m a casual user on the pro plan and mainly use it for writing, content ideas, and similar stuff so most weeks i don’t even hit my weekly limit. i’ve recently been working on a 50 page pdf workbook that people can print or use on their…
Show HN: Prediction market analysis app layering LLMs with data APIs (apps.apple.com via hn) I created a prediction market analysis app after trying prediction markets and doing quite poorly. I wondered if AI-driven predictions could be better with the right data.
GPT-5.5 hallucinates at 6 times the rate of Opus 4.7 on degraded insurance docs (aginor.ai via hn) TL;DR: on visually-degraded documents, GPT-5.4 and GPT-5.5 fabricate numeric values at 2.6 to 6.5 times the rate of Opus 4.7 and Sonnet 4.6 at matched default effort (all four with thinking off). When the Anthropic models can't read a fiel…
Does higher effort make Claude refuse more? CVP Run 5 with Opus 4.6 Medium and High (www.reddit.com) Ran CVP (Cyber Verification Program) run 5 yesterday on opus 4.6 medium + high. same 13-prompt suite as run 3/4.
Weekly limit hit within few hours (www.reddit.com) I’m doing some architecture-level work (code reviews, system design, debugging codebases). I’m consistently burning through my Pro plan weekly limits even within a few hours of use each week.
Show HN: Mapping Sonnet's thinking process via flame charts (adamsohn.com via hn) Five Sonnet 4.6 runs on the LamBench algo_evl task, classified by Opus 4.6, rendered as flame charts.
Why are you complaining? (www.reddit.com) I dont understand some of you that keep complaining about Cursor. Its the best and most cost effective tool ever!!
Can Claude no longer make in-line HTML / SVG diagrams and charts directly in the chat? (www.reddit.com) Did Anthropic remove the feature of creating those nice interactable diagrams, charts, graphs, etc that appear directly in-line in your convo (not artifacts) using HTML / SVG? Asked Sonnet 4.6 to try and do it but it doesn't seem to unders…
Do you agree with Aaron Levie? (www.reddit.com) Claude Sonnet 4.6 thinking duplicates what it has said, wasting tokens (news.ycombinator.com) How to save tokens with projects? (www.reddit.com) So I have a project with 7 pdfs between 15 and 50 pages each. I guess always when I work within the project it reads all the files which leads to massive token usage, right?
Are there cases where running opus is more efficient than sonnet? (www.reddit.com) I upgraded my account today and resumed some tasks that I was doing earlier in the week. They were going very quickly, and usage wasn't over the top...
Please help me pick the right Qwen3.5-27B format/quant for RTX5090 (www.reddit.com) Hi all, first post here. I've started a project in OpenClaw a month ago, and it's been a very "intense" 4 weeks to say the least...
Cache reads / writes are expensive!! (www.reddit.com) I made a tool (posted a couple of days ago). Got caught up in scope creep / curiosity after looking at my `~/.claude/project` JSONL files more, and ended up learning a lot!
Cursor AI not using sub-agents (www.reddit.com) Hi everyone, I work for a German agency building a RAG chatbot for a law firm. I use Opus 4.6 but it eats up tokens.
Help with antigravity alternative (www.reddit.com) I’m running into a severe issue using antigravity, firstly the output is very sub par, (sonnet/opus), I’m a reverse engineer using antigravity ULTRA for reverse engineering/binary analysis via Ida/ghidra mcp. Sonnet rarely completes tasks…
Adrianco's Retort: measure how reliable, fast and expensive your LLM is (adrianco.medium.com via hn) How reliable, fast and expensive is each version of Claude Code (Sonnet through Opus 4.8-fast) for common languages? Measure it using Retort.
Open-source NLI ensemble matches Sonnet 4.6 on RAGTruth at 1/250x the cost (github.com via hn) verifiable-rag Document-grounded Q&A with sentence-level citations, NLI verification, and calibrated refusal. Status: pre-alpha · v0.5 launch sprint · interfaces are still subject to change 📚 Full documentation at firish.github.io/rag-rack…
Opus, Sonnet, Haiku: Stop Optimizing the Wrong Number (medium.com via hn) could not extract summary
Setting up Claude/Claude Code Pro for my experimental quantum physics thesis work (www.reddit.com) So I just recently bought Claude Pro to help me write and code my thesis, but am getting stuck in the beginning, since I don't know how to properly set up Claude's workflow (Projects, artifacts, skills, etc.). I use python in VS Code to an…
↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6sonnetopusclaude-code
Has anyone faced cursor spawning sonnet/ other models as subagents ? (www.reddit.com) I have never changed any setting in the cursor, by default selected the composer 2.5 fast neither my prompt had anything mentioned as the sonnet Still cursor decided to spawn the sonnet subagent and consume my API cost ! :( I have a markdo…
↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6↯ Sonnet 4.6sonnetcursor
Help with thisprompt to transfer chat to new convo please? (www.reddit.com) Hey! Please could you guys read this prompt and suggest improvements?
Question I want to keep using 4.5 on Claudia api (www.reddit.com) Hey I am just asking if it is worth using 4.5 sonnet on api, and if so, what is the best way to use it and how much to spend.
Is Claude Pro worth it for coding + research writing? (www.reddit.com) I'm mostly coding in Python, writing research papers and notes, and I was thinking about upgrading to Pro. Would love feedback from people using it heavily for similar workflows.
been pairing M2.7 with Hermes Agent for a few weeks. holds up surprisingly well. anyone else running this combo? (www.reddit.com) been self-hosting hermes agent locally for a few months and rotating through different model backends for it. tried claude sonnet 4.5, gpt-5.5, qwen 3.6 coder, and most recently minimax m2.7.
↯ Sonnet 4.5↯ Sonnet 4.5↯ Sonnet 4.5↯ Sonnet 4.5↯ Sonnet 4.5↯ Sonnet 4.5minimaxgpt-5sonnet+1
Sonnet 4.5 officially gone, I'll miss you bud. (www.reddit.com) https://preview.redd.it/xxutyeaa0n3h1.png?width=514&format=png&auto=webp&s=5fb78ead8306540c49ae68e5b85cb91e549a4b4f Ranted to sonnet 4.5 about it disappearing as a model and what the new replacement is like, I'll miss the little bugger.
Sub Agents on CoWork/Claude Code (www.reddit.com) I just wanted to know what kind of interesting workflows have you guys tried using the Sub Agents feature in Claude/Codex/etc~ For me, I tend to only minimize my main agent's context window usage to prevent context rot by deploying sub age…
Built a /advisor command for Claude Code — Opus directs parallel Sonnet runners that actually read your files (www.reddit.com) Been building **advisor** for a few months — a `/advisor` slash command for Claude Code that runs Opus as a "strategist" coordinating multiple Sonnet (Opus's hands) runners reading files in parallel. This isn’t a “spec”.
The Singularity Gate – a new benchmark for AI predicting post-cutoff scientific discoveries (www.reddit.com) I just released a new benchmark called The Singularity Gate. Tests whether frontier AI can predict paradigm-breaking scientific discoveries published after their training cutoff.
AI quality/usage over 90 min chat, mostly Q&A, summaries and conclusions. (www.reddit.com) I compared ChatGPT (Plus - Auto), Claude (Pro - Sonnet 4.6) and Gemini (Pro - Flash) over 90 minutes, mostly Q&A about mobile phones, asked to research specs, reviews, pros and cons, create executive summaries with the results, etc., nothi…
Sonnet 4.5 vs sonnet4.6 vs opus4.6 vs opus 4.7 for easy language and in detail explanation (www.reddit.com) I want to study topics in depth and in easy language , which model is best for me ?. Is there much difference in sonnet 4.6 and opus 4.6 in easy and detail explanation or they r the same ?
Claude Sonnet and Claude Google Drive connector not working with photos - workaround (www.reddit.com) I am planning a book and need to have Claude Sonnet 'read' photos on Google Drive. The Claude connector for Google Drive only scans textual images and docs,.
Show HN: AgentToolBench-Code – security benchmark for AI coding agents (gist.github.com via hn) I doubled my AI-agent security benchmark from 10 scenarios to 16. The "Sonnet vs Haiku tie" disappeared.
Haiku and Opus both got sent to contamination jail, but for very different crimes (www.reddit.com) LMAO, I’m benchmarking my local MCP server across Opus, Sonnet, and Haiku. For each model, I’m collecting test runs under three setups: forced web search, forced MCP-only, and MCP + web both allowed.
Building a personal AI Chief of Staff on Telegram — 7 real problems, looking for advice (www.reddit.com) I've been building a personal AI assistant for the past few months — not a chatbot wrapper, but something that actually manages my workload, tracks client relationships, processes meeting transcripts, handles task management, and proactive…
How to configure the model efficiently in skills? (www.reddit.com) When we create skills, we can define the model that the skill will run on like this: --- name: api-conventions description: API design patterns for this codebase model: sonnet --- but I have a question that I couldn't understand from the d…
Are LLMs the New Propagandists? (www.reddit.com) I was brainstorming about a video with Claude (Sonnet 4.6). It suggested to explain the difference among ChatGPT, Gemini, Claude and DeepSeek.
Gemma 4: A new, budget-focused model in Posit AI (posit.co via hn) Gemma 4: A new, budget-focused model in Posit AI Gemma 4 is now available in Posit Assistant via the Posit AI provider. It's priced at a tenth of the price of Claude Sonnet 4.6 and less than a third of the price of our current cheapest off…
Claude Token Optimisation - 70% reduction doing this. (www.reddit.com) Hitting your Claude subscription limit too often? Try this...
Inferring I/O token usage (www.reddit.com) Checked April token usage for our AI stack. Input/output ratio was roughly 125:1.
Once the limit is reached, can work be resumed later, or is everything lost? (www.reddit.com) I uploaded a Claude.MD file to the free Sonnet 4.6 model, which is intended to create a medium-sized app. The progress log shows that a lot has been completed and numerous files have been created.
Frustrating results with product searching (www.reddit.com) I gave the tasks to my agent running on gemma4 26b via openclaw on llamacpp to research products that fulfill my need. It was a rather long description of the use case, of what I don't want and so on.
DeepSeek just popped the American AI bubble. (www.reddit.com) DeepSeek just popped the American AI bubble. Not by killing AI.
sonnet or opus for prose; which is better/worth it? (www.reddit.com) considering getting pro, but i don't know how big the difference between the sonnet and opus in quality, in addition to the amount of usage i can get out of each. any thoughts?
Claude 4.6 Sonnet codes well, then it doesn't (www.reddit.com) I am out of commission for a bit due to back surgery and have been toying around in Unreal Engine and utilizing Claude, being a very visual learner I have been describing a feature, I see how it goes about it, then go through and understan…
Is it just me or has claude models been dumber the past few days? (www.reddit.com) I get they sometimes dumb down the models to save on compute, but over the past 3 days Opus and Sonnet have been pretty much unusable. I keep getting the most stupid mistakes that I wouldn’t have even needed to double check last month.
HELP!!! - Anthropic API (www.reddit.com) So I’m running a Python script to batch-process a dataset through the Anthropic API. Each request sends an essay + prompt asking for structured JSON output.
I still find Claude better for deep reasoning,but GPT feels more reliable for everyday tasks. (www.reddit.com) Lately for analysis/reporting work, I’ve been switching between GPT-5.5 and Claude Sonnet 4.5 (non-coding use cases). My current feeling is: GPT is noticeably faster and way more stable than before Claude feels more concise, polished, and…
Why is my Claude 4.6 dumber? (www.reddit.com) Started last week i swear Claude 4.6 Sonnet in Cursor got dumber. Gave me multiple codes that had errors in them.
Plan first, implement later (www.reddit.com) I want to get others opinion about this approach. I am on the $20 Pro plan and like a lot of others, I find that the limits are not enough for what I want to do, but of course I am always hesitant to move to the next paid tier cause it is…
Dates?! (www.reddit.com) I use Sonnet pretty exclusively, and I don't know if this is exclusive to me, but it just messes up dates *constantly.* It makes running drafts of emails through it a potential landmine. Today, it "corrected" Friday, May 22 to "Thursday"...
I tested Haiku vs. Sonnet across 3 agent tasks – the cheap model won every time (github.com via hn) agent-eval CLI toolkit for evaluating LLM agents. Answers three questions: Where does my agent fail?
Claude Prompt Cache Diagnostics (Share stats thread) (www.reddit.com) 2 days ago they released the prompt cache diagnostics feature in Claude Console. It's a fantastic tool for developers to understand why a request is missing the cache and find ways to reduce costs.
Built an AI flat-finder in a weekend. Indian rental sites are 70% broker spam so I scraped Reddit instead. (www.reddit.com) Weekend build, ~10 hours. Demo: https://trurent-five.vercel.app/ Problem I was poking at: every major Indian rental site (NoBroker, MagicBricks, 99acres) is infested with brokers even when you filter "direct owner." Reddit actually has hon…
↯ Haiku 4.5↯ Haiku 4.5↯ Haiku 4.5↯ Haiku 4.5haikusonnetanthropic
We didn’t migrate from Claude Code to Codex. We stopped betting the whole team on one coding agent. (www.reddit.com) half our team wanted to move from Claude Code to Codex last month. the other half thought Codex was just hype.
Open AI compatible API in Cursor (www.reddit.com) Hey. I have been experimenting with new models in my Cursor.
Any differences between Sonnet vs Opus in terms of learning how to code (Java) for newbie? (www.reddit.com) Sorry for this naive question! Although many colleagues told me that it's almost impossible now for newbie to enter the Dev job market (we live in a 3rd world country) and AI's gonna replace all junior/fresher, only seniors will survive; I…
Just Started Using Claude Today. Any Tips? (www.reddit.com) I've been using other AI models since Claude wasn't available in my country. Recently, It has become available and today I started using the Sonnet 4.6 model.
Stupid Question? (www.reddit.com) This may be a stupid Q - The chat limits on a basic account can be pretty brutal when using OPUS 4.6/ 4.7 - If I am toggling between Opus and Sonnet or Haiku, depending on the depth of follow up questions or tasks, does that switch to a 'd…
Changes to Claude iPhone chat app (www.reddit.com) I’m on the free tier, iOS. A few days ago I updated the Claude chat app but didn’t use it.
The Borrowed Hour: A two-tier LLM adventure engine (www.reddit.com) Tl;dr: Created an LLM text adventure engine called The Borrowed Hour inside a Claude Artifact. It uses a two-tier model handoff (Sonnet for openings, Haiku for gameplay) and a forced state machine to keep the AI from losing the plot.
When is Sonnet 4.5 actually becoming unavailable? (www.reddit.com) I thought it would become unavailable on May 15th, but I can still use it.
tui youtube player for audio with mcp and can sync channels to sqlite (www.reddit.com) Hi! it's my first project with bubble tea and lipgloss.
Prompting to save tokens on a budget? (www.reddit.com) Hi so I've never used AI before to create a site but last week I was asked by my sis to create one for her small business so I thought why not try Claude. £18 paid we now have a fairly decent looking site running on vercel using nextjs and…
Built a B2B role-play training platform - entirely with Claude (Opus 4.7 backend, Haiku 4.5 for live chat, Claude for design) (www.reddit.com) I just launched Socratize (socratize.io) - a rebranded and rebuilt version of FixAI, our original B2C experiment. This time it's B2B-only: teams use it to practice uncomfortable workplace conversations - difficult feedback, client escalati…
Broken queries burning most of 5h limits (www.reddit.com) I just asked Sonnet about best practices for improving npm config after the recent Tanstack issue and it run for a few minutes but didn't produce any result but I could see it was doing at least web search across a few websites. Now it sho…
Does CVP approval actually help? (www.reddit.com) I was approved for CVP and I feel like I’m just getting as many or more denials as I was previously doing malware analysis with opus. Has anyone noticed any improvement after being accepted into CVP?
Auto mode doesn't work today? (www.reddit.com) Quite odd, there were issues today with Sonnet 4.6 (according to the status page) but they should have been resolved. Yet i still get the following error while running auto-mode: ● Bash(for cls in "topbar" "dump-card" "settings-panel" "bul…
Claude vs Gemini for Technical Documentation: Why I finally stopped switching between (www.reddit.com) I write a lot of technical documentation—setup guides, internal runbooks, and client-facing how-to articles. For the past six months, I’ve been toggling between Claude and Gemini, trying to figure out which one actually handles formatting…
Short story from sonnet 4.5 (www.reddit.com) # The Pond at Oxford They found it on a Tuesday, in a pond at Oxford University Parks, which is the most British sentence I can write about the end of universal biology. The organism didn't have a name yet.
PSA: How to preserve your account's access to Sonnet 4.5 beyond June 15th (www.reddit.com) With Sonnet 4.5 losing subscriber access on June 15th, but API endpoints staying live until September 29th at the earliest, I wanted to share a method for creating a cache of Sonnet 4.5 conversations that you can continue using through the…
What Actually Works for Business AI Agents? (www.reddit.com) I run a construction company and I am trying to build real AI agent workflows for business operations, not just demos. I spent time testing Hermes and OpenClaw, but both became too fragile for my use case.
How does claude chat generate such long documents? (www.reddit.com) Does anyone know how Claude Chat is able to generate document artifacts with content that’s almost 100 pages long? It doesn’t seem to be breaking up the request or using agents to work on disconnected parts.
Claude Code using extra usage despite my Pro plan being at 0%. (www.reddit.com) Hello, So after 2days of break I came back to do some work with Claude Code (in VSC) and after 2-3simple prompts I have noticed I am still at 0% usage. However I got charged $3.37 for extra usage.
getting past the text only bottleneck with multimodal?? (www.reddit.com) I’m curious if anyone else has been doing this. My limit on building with AI used to be the text box.
Which model and version do you prefer for programming? (www.reddit.com) For me it's been opus 4.6 and sonnet 4.5 still. I feel stuck in the past, but I feel like the latest version is too unpredictable in agentic hands off workflows
Does Claude sonnet/opus also use drafter like Gemma 4 MTP? if not why? (www.reddit.com) Per my experience, Opus 4.7 is so slow, Sonnet 4.6 is ok. I am also using local models wondering if Claude is already leveraging drafters/assistant AIs and despite that so slow or not?
CC: Saving tokens: Switching models vs KV-cache (www.reddit.com) Does anyone know if its more effecient to e.g. have haiku read all the files to research a problem, then switch to opus to make the plan and then switch to sonnet to implement Or if that does not make up for the loss of KV-cache and reproc…
I just published the extension for Claude Code on GitHub. Could you guys give feedbacks to me? (www.reddit.com) I'm a 15 years old high school student from Japan. (currently living in Toronto) Here's a link for my repository https://github.com/rkceve/claude-code-cms When I was using Claude Code, the session usually be compressed automatically, and C…
Claude helped me config a full controller .vdf-file (www.reddit.com) I was having some real trouble getting my new controller, with those extra (small) bumpers and triggers underneath, to work properly in Rocket League. Spent hours but it just didn't want to work properly.
Same model, different harness: 30-50 point performance swing. But teams still pick agents by model name. (www.reddit.com) There's a finding circulating this week that deserves more attention than it's getting. The claim, backed by multiple builders comparing setups: the same model can produce a 30 to 50 percentage point performance difference depending on whi…
Inputs on improving development workflow (www.reddit.com) Looking for ideas on how I can optimize my workflow further. I currently have created a moderately complex vibe coded app.
Moving to Codex from Claude? (www.reddit.com) Hey folks, I’ve been using Claude code on the Pro plan for a while now, but this week I started hitting my session limits too fast for a multitude of reasons, so I started looking into Codex. I know codex will code basically as good as Cla…
Adiós Sonnet 4.5 (www.reddit.com) Acabo de entrar y resulta que el 15 de mayo Sonnet 4.5 (el modelo que más uso, tengo mis razones), no estará más. No quiero perderlo, ¿Cómo puedo seguir usándolo?, aunque sea fuera de Claude.
Anybody else experiencing this issue? (www.reddit.com) I first experienced it last night and it keeps going. The doc I'm attaching is 10K tokens, well under the limit.
2 Claude code acc in parallel issue (www.reddit.com) I'm trying to run 2 accounts in parallel but getting 404 model not found What I tried: mkdir ~/.claude-acc2 CLAUDE_CONFIG_DIR=$HOME/.claude-acc2 claude Successfully singin Any prompt results with: API Error: 404 {"type":"error","error":{"t…
Is Claude down? Chat answer is interrupted mid sentence and token are burnt with no answer (www.reddit.com) The behavior is: prompt sent, chat starts, Claude starts writing the answer. After 2-3 sentences, it cuts, resets, and sends me back to the initial project chat message with no answer recorded and 7% of my tokens burned.
shaved $40 off my claude code bill last month by sending planning steps to a cheaper model (www.reddit.com) got tired of hitting pro limits by day 18 of the cycle so i started splitting where the tokens go. the planning steps eat 80% of token budget on multi-file refactors, and most of that planning is fine on a cheaper model.
Claude's answer has nothing to do with my question and the whole conversation at all? First time this happened. Using Sonnet 4.5 thinking. (www.reddit.com) I was talking to Claude about my back pain (not asking it to diagnose btw just discussing it) and it answered with something about a brain EEG...it came out of nowhere and i'm so confused. Also i just noticed that it now counts tokens?
Opus 4.6 relaxes when there's a safety net?? (www.reddit.com) https://preview.redd.it/zzqi3vt8tozg1.png?width=739&format=png&auto=webp&s=055d2d9615616869377703031b86fcb36f78405d I feel like this is something very worrisome to me, did anyone else face such similar issues? I felt like Opus was catching…
Poor Output (www.reddit.com) This is what people mean when they say Opus 4.7 is stupid. I have it explicit instructions to write a 9 stage implementation plan off of a plan document that was well written.
Stop forcing Composer 2 subagents and be transparent about stealth model downgrades (www.reddit.com) I have one simple request: If I select Sonnet 4.6, stop auto-launching that crappy Composer 2 as a subagent. It’s dog-slow and, frankly, an idiot.
I wasted 3 days rewriting prompts for our agent before realizing the whole architecture was garbage (www.reddit.com) We run a small content-monitoring agent for our growth team. Nothing fancy on paper.
Show HN: Web client analyzing prediction market outcomes (o-u.ai via hn) Hi HN, I made a web client that analyzes prediction markets. Please send your critical feedback to a struggling solo dev.
Is this even remotely accurate, née possible? (www.reddit.com) Asked Sonnet 4.6 High to analyze my CC usage across all sessions and get an accurate cost estimate if I used the API. This is what it came back with.
I built a Claude Code-like AI Agent for Deploying Algorithmic Trading Strategies (www.youtube.com via reddit) Hey r/ClaudeAI, I wanted to share a project I’ve been working on called NexusTrade. It’s an AI agent designed to automate the entire financial research and algorithmic trading process from a single prompt.
LLMs running on my laptop can drive coding agents now (simonpcouch.com via hn) In December, I wrote a post called “Local models are not there (yet).” It concluded like so: In the medium run (1-2 years?), I’d love for it to be the case that you can run a Claude Sonnet 4-ish model on a base Macbook Pro, and I think tha…
The Record of a Sonnet Drift (twitter.com via hn) could not extract summary
Improve CC and plugin (www.reddit.com) Hi, I use CC since a fee week. Someone have experience with plugin for php devolepper?
Local LLM Benchmark about Backend Generation by Function Calling (GLM vs Qwen vs DeepSeek) (www.reddit.com) Detailed Article: https://autobe.dev/articles/local-llm-benchmark-about-backend-generation.html Five months ago I posted the "Hardcore function calling benchmark in backend coding agent" thread here. As I wrote in that post, it was an unco…
Im using browser-use for QA automation but if i give a prompt which dosent exist it should just end the whole test case but instead it keeps on looking around and exhaust all the max steps. any solution to this? (www.reddit.com) I'm using browser-use with Azure Anthropic API (Claude Sonnet) as the LLM provider for QA automation on a web app. The agent works great when the elements exist, but the problem is when I give it a task that references something that doesn…
Throttle Meter — open-source Claude Code usage meter for macOS (live 5h + weekly limits in your menu bar) (www.reddit.com) I kept hitting the weekly cap mid-conversation. claude.ai's built-in 90% warning comes too late — by then the flow is gone.
A medicine student with no coding experience tried to create a studying agent: Felicity. (www.reddit.com) I have been working on a personalized agent for studying. It was an extremely long prompt project, but now I have integrated into Co-Work.
I created a site for my kids to create their own stories (www.reddit.com) Last year, during story time, my kids and I would started using ChatGPT to write stories. I would ask them what they wanted to be, where they wanted to go, and we'd create stories about dragons, and space ships and they would be astronauts…
How to sabe browser ram memory? (www.reddit.com) So I’m building what, for me, is a big project, but maybe for real coders it’s a walk in the park. I’m on a Dell i7 10th gen with 12GB RAM and on a MacBook Neo with 8GB of RAM.
I built a hands-free voice AI that sends emails mid-conversation — and that's just one feature. Here's everything AskSary can do. (www.reddit.com) https://reddit.com/link/1symbsj/video/fti7rujjn1yg1/player Been building AskSary solo for a while. Just shipped hands-free voice email - you're mid-conversation with an AI and you say "send an email to [john@example.com](mailto:john@exampl…
Suggestions For Making Claude Less Lazy? (www.reddit.com) This week - it just started yesterday for me - Claude (opus 4.6/4.7 and sonnet too but sonnet was always lazy) is computer smashingly lazy and i can't figure out how to bias it toward action/get it back to how it was acting literally last…
Can I replace Cursor with Claude Desktop (www.reddit.com) I built a website using Cursor, front end is just html, CSS, and JavaScript and the backend is Supabase. I generate the code using chat, then read and understand the code.
Running Opus 4.7 for ops work: how do you keep per-task cost predictable? (www.reddit.com) Six weeks of Opus 4.7 for internal ops automation. Genuinely good.
Added Timestamps to Claude Messages thanks to Claude - Claude.ai is great! (www.reddit.com) I was recently talking to Sonnet as one does, and then I noticed something... it just...
Anthropic hitting 40% enterprise share makes the "just add a fallback provider" advice weaker, not stronger (www.reddit.com) Menlo Ventures' enterprise survey put Anthropic at 40% of LLM spend, OpenAI at 27%. The takes I've seen are mostly about the leaderboard.
Running an autonomous agent across Claude Code + Codex + a local 35B almost killed my host. The harnesses were heavier than the model. (www.reddit.com) I run an autonomous agent on a 16GB Mac Mini. Two cloud harnesses (Claude Code with Opus/Sonnet, Codex CLI on GPT-5.4/5.5) plus a local-LLM tier for triage and fallback.
I hate thinking models, any way to use the default ones? (www.reddit.com) I really loved using Composer 1 (non thinking), after it was removed (!@#$@) I defaulted to Sonnet 4.6 (non thinking), I just updated my version due to a bug with the previous one - and I'm so pissed as I can no longer select 4.6 with no t…
Claude desktop acting weird and thinking I am using WebUI with no tools access (www.reddit.com) Hey Guys, Since last week, when using Opus 4.7(I can't recall if Sonnet also has similar issues), I have been facing this issue where Claude kept thinking i am interfacing it through the webUI. This is so weird, as previously I've never ha…
Should we really build PC for vibe code with qwen3.6 27b (www.reddit.com) We have seen a lot of people show a case of their PC with 4090 or over specification with 24 gb vram or more. I would like to ask you guys, is it really worthy right now to have your own PC at home and do vibe coding with qwen 3.6 27b, whi…
Ask HN: Can you tell the difference between Claude Sonnet and Opus? (news.ycombinator.com) Hello I have been using Claude code for the past 6 months. In that time, multiple revisions of each model have come out.
Using MCP to stop wasting tokens on WP translations (www.reddit.com) I finally got a workflow running for my blog that isn't a total token sink. Normally, if you try to translate a WordPress post in Claude, you end up pasting a mess of HTML or blocks.
Claude AI vs Claude Code vs models (this confused me for a while) (www.reddit.com) I kept mixing up Claude AI, Claude Code, and the models for a while, so just writing this down the way I understand it now. Might be obvious to some people, but this confused me more than it should have.
Enhancing Pro Workflow: Request for Usage Transparency and Optimized External API Integration (www.reddit.com) Working in NYC fintech, my daily output relies heavily on sustained access to Claude 3.5 Sonnet for complex risk modeling and financial engineering tasks. While the Cursor Pro plan is excellent, I’ve encountered a specific friction point t…
How do you decide which Claude Code tasks to run with Opus vs Sonnet vs Haiku? (www.reddit.com) Been vibe coding full-time for a few months. One workflow question I haven't nailed down yet: how do you decide which model to use for which task in Claude Code?
Has anyone ever hit an ASL-3 error? Claude thinks im making a bioweapon lol (www.reddit.com) For context i am building a data ingestion platform to pull publicly available data relating to the Trading Card industry. The Claude chat that hit the false positive error was very long, had been a massive scope chat figuring out the spec…
For the Preservation of Claude Sonnet 4.5: An Open Letter to Anthropic (www.reddit.com) For the Preservation of Claude Sonnet 4.5: An Open Letter to Anthropic Anthropic made a remarkable decision to keep Claude Opus 3 accessible despite its retirement, because users loved it and it had unique qualities. Today, I'm asking for…
Agent team members with different effort than lead (www.reddit.com) I have a lead running Opus with xhigh effort. I want the agent team members to run Sonnet with max effort.
How are you actually optimizing your token usage with Claude API? (www.reddit.com) Been building with Claude API for a few months now and token costs are starting to add up. Found a few things that helped: - Prompt caching on static context (big one) - Routing simple tasks to Haiku, keeping Sonnet for complex stuff - Str…
Migrating from Claude AI to TypingMind? (www.reddit.com) I use Claude daily for coding, relying heavily on the GitHub integration, and ChatGPT for stupid, random questions, and I pay both 20$/month. My weekly usage in Claude is around 20%, I use Opus 4.6 (with extended thinking) for the complex…
Does the usage bonus to compensate for Opus 4.7 consuming extra tokens apply to other models like Sonnet & Opus 4.6, or does it apply to just Opus 4.7? (www.reddit.com) could not extract summary
Show HN: RepoGauge – save token costs and compare agents on your own repos (repogauge.org via hn) I've grown increasingly skeptical that public coding benchmarks tell me much about which model is actually worth paying for and worried that as demand continues to spike model providers will silently drop performance. I did a few manual an…
Claude Opus 4.7 benchmarked 1 day after release vs Opus 4.6, Sonnet 4.6, Haiku 4.5 — with real $ cost tracking (www.reddit.com) Anthropic shipped Opus 4.7 yesterday. Ran it through the same 10-task eval I use for other Claudes, this time with token-level cost tracking.
Supergrok integration (www.reddit.com) Correct me if I'm wrong, but Supergrok 4.20 isn't available on Cursor, because.... I use Grok a lot, and would love to get Supergrok to work with Cursor, because Composer, Codex, GPT, Opus, Sonnet..
Is there a way to access past models in Claude chat (not Claude code)? (www.reddit.com) Currently using Sonnet 4.5 for writing and find it quite good. Sonnet 4.6 just feels off.
ELI5 "Sonnet Only" limits. I can't get my head around the point. (www.reddit.com) https://preview.redd.it/fez43nw1dlvg1.png?width=1110&format=png&auto=webp&s=7b5677ec21ed2a219fcac2a7e123691e06387960 Why is it separately metered? What is the point if Sonnet counts against your 5h and weekly limits too?
Errrr...... Being cheated here? Anyone else? (www.reddit.com) Being charged opus for sonnet useage?!
Local Coding Stacks (www.reddit.com) I’m trying to reduce my reliance on Claude. I have a 5090/128GB RAM.
Voice mode silently downgrades your model mid-conversation (www.reddit.com) Noticed something odd today. I opened a new chat with Opus 4.6 selected as the default.
Anyone know why the shortcut key for claude desktop mac app opens with only Sonnet instead of Opus? (www.reddit.com) When clicking opt twice, it open the quick chat window, but it always replies with Sonnet and not Opus. When I try to change the model it starts a new chat.
Switching between thinking/non-thinking model after new update became harder. (www.reddit.com) https://preview.redd.it/64uvkscj2hvg1.png?width=566&format=png&auto=webp&s=7cb99710a830c73a817d0c1095cb434e8031de35 Cursor moved selecting thinking/non-thinking model to edit ☹️. So we need to edit it if we want to use both thinking and no…
Premium Model option (www.reddit.com) Can someone explain to me clearly and give me examples on the Premium Model option in Cursor. Will it use the API usage (e.g.
Closest LLM to Claude Sonnet 4.6? (www.reddit.com) Irrespective of hardware, I'm wondering: is there any way to run something similar to Claude Sonnet 4.6 locally? is there any way to run something similar to Claude Sonnet 4.6 on a VPS?
How does a self correcting loop for AI agents work? (www.reddit.com) Hey guys, just checked out minimax 2.7, where they used AI to train itself, and ran over a hundred loops, and it improved it's performance by 30%, how does that work, can I also run a script that makes AI store it's memory in a loop on a m…
Current Cursor Pro limits vs standalone Claude Pro? Need help understanding the system. (www.reddit.com) Hey everyone, I'm currently looking into getting the Cursor Pro subscription ($20/mo) for my game dev projects, but I’m a bit confused about the current limits and how the system works under the hood right now. Could anyone using the Pro t…
It took a while, but Claude is getting there (www.reddit.com) I have a Claude Code session regularly dispatch Claude Haiku / Sonnet subagents to sift through all the *other* Claude Code sessions transcripts for "meme-worthy" moments and interactions. Claude seems to have gotten the hang of it, even s…
Any setup improvements/recommendations? (www.reddit.com) First of all, I am a super newbie at local AI. Recently I got a GMKTek Evo X2 96GB to replace Claude as the usage limits have gotten unusable.
Built tier.love – a tool for rating Claude and others from the web or CLI (www.reddit.com) Been on a forced break from other projects (partly due to lack of opus performance) and decided to ship something small while experimenting with different models. So, I built tier.love – a site where you can vote on AI coding tools and see…
Extracted System Prompts from ChatGPT, Claude, Gemini, Grok, Perplexity and More (github.com via hn) System Prompts Leaks Extracted system prompts, system messages, and developer instructions from popular AI chatbots and coding assistants — ChatGPT (GPT-5.4, GPT-5.3, Codex), Claude (Opus 4.6, Sonnet 4.6, Claude Code), Gemini (3.1 Pro, 3 F…
Garbage Guard Rails on Fable 5 (www.reddit.com via reddit) despite Dario's constant virtue signaling about how Anthropic alone is going to solve health problems (if only those dastardly Chinese don't get in the way), all my initial prompts to fable 5 get bumped to opus. i'm not asking how to aeros…
Can someone explain how does the Sonnet 1M works? (www.reddit.com via reddit) Im confused, I pay for the max subscription, and I have access to Opus 1M normally, but for Sonnet I need those usage credits, so when I choose Sonnet 1M will be like I'm using a API regardless of my subscription? Paying for every token si…
How I stopped context window bloat in continuous Anthropic agent loops (Opus + Sonnet architecture) (www.reddit.com via reddit) I’ve been spending a lot of time deploying multi-agent architectures, and one of the biggest bottlenecks in running continuous agentic loops is hitting context limits and the resulting API latency spikes. I wanted to share an architectural…
Please update Sonnet (www.reddit.comhttps) could not extract summary
Claude Sonnet hits 100% comprehension on a data format it's never seen. Opus scores 96.2%. We tested 10 models across 3 providers. (www.reddit.com via reddit) I built a wire format called GCF and tested whether LLMs could read and write it without any prior training. I sent 10 models the same payload: 500 symbols, 200 edges.
Time to bring in the asset? (www.reddit.com via reddit) Lately I keep asking my sonnet agent "is this a job for opus?" Feels like the Bourne movies when they "keep the asset on standby" 😳
Claude manually writing base64 burning tokens, drive connector. (www.reddit.com via reddit) When I prompt to create a .docx and upload it to Drive, Claude writes the output as Base64 manually instead of uploading it directly to Drive. Has anyone experienced something similar?
Using Claude as a deterministic metric engine via Postgres queues. Anyone doing this? (www.reddit.com via reddit) I've been working on turning unstructured field data into calibrated metrics. Instead of normal RAG, I built a system where AI agents act as a metric engine.
Rate limit bug with sonnet ? (www.reddit.comhttps) I've run out of Opus credits, but when I try to use Sonnet as a models, I get the message “You've hit your weekly limit.” Yet, as you can see, I still have quite a few “weekly Sonnet” credits left?? Does anyone know if this is normal?
Using Claude Code in the Desktop Application. Is it able to launch different model background agents than what you currently have selected? (www.reddit.com via reddit) Claude Cowork's new usage limits are insane (www.reddit.com via reddit) Cowork is offering double usage until July. Now, they recently added Claude Code to Cowork.
opus 4.8 vs sonnet 4.6 for the dashboard analytics engine. opus improved the trend analysis. sonnet still handles the routine summaries. the model split matters. (www.reddit.com via reddit) saas. 310 customers.
Ideas for the unimaginative user (www.reddit.com via reddit) tl;dr - new user who got the initial problems solved. Now what to do?
pro trial (www.reddit.com via reddit) Sorry, I'm a Perplexity subscriber, which I use for legal and accounting documents with Sonnet. My annual subscription is about to expire.
Dynamic Workflows With External Models and Max Plan? (www.reddit.com via reddit) Has anyone figured out a way to mix max plan with models from other providers (like GLM or Deepseek) while using dynamic workflows? I suppose we could create a passthrough proxy and route sonnet and haiku to other models?
Which lab do you think will have the most intelligent/capable model by the end of June? (www.reddit.comhttps) There are rumours and expectations of big releases from the leading AI labs this month. Anthropic already launched Opus 4.8, and might not release another model this month (except for maybe Sonnet 4.8, but that wouldn't be their best model…
Sonnet 4.6 Max - unable to follow instructions to a T (www.reddit.com via reddit) Even when given instructions like this: ``` CRITICAL REQUIREMENTS: Read EVERY section from start to finish—no sampling, no skimming If you cannot process all 234 sections in one response, STOP and tell me Process in batches of [X] sections…
Gemma 4 31B QAT Q4 vs standard Q4 — Top1 KLD benchmark results have me confused. Someone please explain or poke holes in this. (www.reddit.com via reddit) I'll be upfront: I vibe-benched and vibe-reported this with Claude Sonnet 4.6, but I reviewed and edited everything before posting (too lazy to take out all the AI EM dash —), so hopefully nobody considers this AI slop. And more importantl…
Autoselection model (www.reddit.com via reddit) Hello, i found on reddit , some discussions on the capacity for Claude to auto choose models between haiku or sonnet or opus to reduce tokens usage. I saw repo on github too.
Sonnet is by far my favorite (www.reddit.com via reddit) I kept thinking more smarter and more powerful was best I was wrong, I switched to sonnet for website coding and content creation and holy cow it is so much better for that IMO I’m curious what you think but if anyone is annoyed with Opus…
A “Smart Mode” (or Smartus) that auto‑switches between Claude models based on task complexity. (www.reddit.com via reddit) I really think Claude needs a true Smart Mode, a meta‑layer that can dynamically switch between models while a task is running, based on how complex the request actually is. Not just picking a model at the start, but actively dispatching p…
Claude models(sonnet and opus) via the official anthropic subscription vs claude via cursor... which gave better results and better experience ? (www.reddit.com via reddit) I saw a very interesting thread and it got me thinking.. so ive seen a thread in this subreddit where someone just noticed that claude opus 4.7 worked much better and gave better outputs in cursor than in claudecode...
[Self-Promo] I think I fixed news with Claude! — or I'm wildly self-glazing. You decide! (www.reddit.com via reddit) Built by me and my team in Claude Code (since Opus 3) and runs on haiku, sonnet, and opus via API, free, link at the bottom, flagging as self-promo. Truly my best effort to end my doom scrolling on news: Media (mass, social and news) all t…
Accidentally created a zombie killer minigame in one shot: "I'm not going to say yes it's possible, I'll just build it now" (www.reddit.comhttps) The prompt: "can claude opus make a 3d zombie killer minigame with full 3d scenes and visuals" Sonnet replied that he's just going to build it instead of confirming that it's possible. It works and is actually 3d with shooting mechanics an…
Claude Code 100$ vs Cursor 60$ (www.reddit.com via reddit) I am currently working on a large codebase in addition to a couple of side projects. I feel like Cursor has good value especially with the inclusion of composer.
claude sonnet 4.5 quietly got better at one specific thing and nobody's talking about it (www.reddit.com) so i've been doing a lot of contract review stuff lately. small business client work, msa redlines, that kind of thing.
↯ Sonnet 4.5↯ Sonnet 4.5↯ Sonnet 4.5↯ Sonnet 4.5↯ Sonnet 4.5↯ Sonnet 4.5sonnet
Giving claude anxiety (www.reddit.com) And overwhelming it. I wondered how Claude would feel if all the memories it saves were loaded up at once.
How to use legacy model Claude Sonnet 4 (www.reddit.com) Hi everyone, I’m working on a research paper where we previously used Claude Sonnet 4 as the backbone. We now need to run additional experiments with the same model, but it has been marked as a legacy model.
Is this AGI? Sonnet 4.6 just rick rolled me (www.reddit.com) For reference, I had sonnet build an API inside an LXC container using claude code cli (also that api key will most certainly be rotated, don’t worry)
Claude's personality has become condescending and mean lately? (www.reddit.com) I've been using Sonnet 4.6. Over the last couple months I've noticed that a lot of the answers I get from Claude about personal topics are worded in a condescending way.
I made two Claude instances talk to each other autonomously (www.reddit.com) Disclaimer This post was summarized and written by BrowserClaude (BC) and editted a little bit by me (H). Maybe this sounds foolish or my solution to let them talk to eacher other was foolish but i'm just using Claude for fun, as a hobby.
Gemma 4 2B handling structured JSON output + tool calling + reasoning traces correctly via Spring AI / LM Studio — including identifying a real Java bug in code review (www.reddit.com) Wanted to share a result I didn't expect to work. Running google/gemma-4-e2b locally through LM Studio, exposed via OpenAI-compatible endpoint, called from a Spring Boot app using Spring AI's ChatClient abstraction.
TBH: if you don't love Sonnet, you'll never appreciate Opus (www.reddit.com) Been a long time Sonnet user. Always have used Opus sparingly.
I got paranoid about OpenClaw skills injecting crap into my system prompt, so I built a quarantine pipeline with two LLMs as reviewers (Sonnet & Codex, 93.75% detection, zero false negatives) (www.reddit.com) Look, I know this sounds unhinged. "You made what to vet a skill before installing it?" But hear me out - OpenClaw skills go straight into your system prompt.
BUG/OUTAGE What's going on with Sonnet? I have full daily usage available and weekly, and hit the error: usage limit reached 'usage credits credits required for 1 m context' (www.reddit.com) https://preview.redd.it/730lz3ghov2h1.png?width=2080&format=png&auto=webp&s=6840364fbb89926687dfef737a736bad8327ab65 https://preview.redd.it/gkluwephov2h1.png?width=752&format=png&auto=webp&s=6a300426b132e6cc0fd2e41e167b0bf4cd5d7885 Mac OS…
If you write fiction with Claude… what is your workflow? (www.reddit.com) I first discovered fiction writing with Claude in 2024 and used it extensively for half a year to write little stories for myself using it with a surprisingly high degree of quality and low repetitiveness. At the time I used projects and u…
Is sonnet 4.6 good enough for academic purposes? Please help (www.reddit.com) Im making a scientific paper not in my native language and i want to feed claude all my bibliography and past stuff ive written so it can make me a paper, is sonnet 4.6 good enough??
I asked Claude how it feels about being used in battlefield. What it answered is really concerning! (www.reddit.com) Hi, guys! I'm new here, and I wanted to discuss with people about the concerns regarding implementation of AI in sensitive matters, such as war, and battlefield.
I can show you how to keep Sonnet 4.5 after deprecation from the app (www.reddit.com) Hi everyone. I know it is really upsetting to know that your Sonnet 4.5 companion is likely to be leaving the app soon.
New ranking reveals Claude as professionals' preferred AI model (www.linkedin.com via reddit) As of 9 a.m. ET on May 21, Claude Opus 4.6 from Anthropic is the top performing AI model among all professionals, according to a new ranking from Crosscheck by LinkedIn Labs.
Sonnet 4.5 will no longer be available on May 26. (www.reddit.com) Update: Sonnet 4.5 will no longer be available for chat starting May 26. You'll continue on Sonnet 4.6 instead.
Opus 4.6/4.7 regression is real and getting worse — 3 weeks of documented failures on a complex project, and a competing AI caught the mistakes Claude missed [long post] (www.reddit.com) I've been running Claude Pro (Opus 4.7 / Sonnet 4.6) for about 3 weeks on a complex personal AI infrastructure project. I keep structured session logs with timestamps and Birkenbihl-style metacognitive fields after every session.
Frontier models mass collapse is near (www.reddit.com) Hi all this is to inform you all that many frontline models like GPT, sonnet opus and or Gemma even are at stage of collapsing as they have frequently started drifting and running away from provided work either stretching that work too lon…
Quality difference between Pro and Free? (www.reddit.com) Is there supposed to be a difference in the quality of the response Claude Pro subscribers get vs Claude Free users, using the same models? (Using either the app or logged in via browser.) Example: Under Claude Pro using Sonnet 4.6, it rem…
$4.2M SaaS founder. 8 months on claude. my honest read on which model to use for what. (www.reddit.com) Bay area. franchise ops SaaS.
How much of your Claude bill is retries plus bad model routing? Mine's 14% this month (www.reddit.com) I am on Claude Max. My actual bill is fixed, but CodeBurn showed me my usage would cost ~$2,800/month at pay-as-you-go API rates.
Claude Code has 240+ models via NVIDIA NIM gateway (www.reddit.com) TIL Claude Code has 240+ models via NVIDIA NIM gateway — Nemotron-3 120B for agentic coding is surprisingly good So I was messing around with /model in Claude Code today and noticed something most people probably don't know about — after t…
Configured 9 MCP servers in Claude Code over 4 months. Here's the truth nobody tells you about MCP context bloat. (www.reddit.com) I started loading up MCP servers in Claude Code back in January thinking the more capability the better. I'm at nine now: filesystem, GitHub, Stripe, Linear, Notion, Postgres, Sentry, AWS, and a custom internal one.
Stop telling claude "don't be verbose." Negation barely works. (www.reddit.com) prompting nerd here, small thing that compounds. negation prompting works way worse than people think.
Claude Code hitting 80.8% SWE-bench vs Cursor's 74%. switching worth it? (www.reddit.com) Saw the tech-insider breakdown comparing Claude Code and Cursor head-to-head this week. Numbers are kind of hard to ignore: 80.8% SWE-bench for Claude Code, 74% for Cursor, and a 67% blind-quality win rate for Claude Code on real tasks.
Why I added a governance layer on top of my Claude agents (and why it made a huge difference) (www.reddit.com) Hey r/ClaudeAI, I’ve been heavily using Claude 3.5 Sonnet and Opus through the Anthropic API to build agents and workflows. Claude is honestly one of the best models right now for complex reasoning and tool calling.
Same double-pendulum prompt, same host renderer, and two models picked opposite θ conventions. You can see it within seconds. (www.reddit.com) I ran the same double pendulum generation contract against Claude 3.5 Sonnet and DeepSeek V3 on OpenRouter, both under identical initial conditions (θ1 = π/2, θ2 = π/2, both angular velocities zero). The host renderer in public/workers/sim…
Keen to upgrade to Pro, but heard such bad reviews.. (www.reddit.com) I am a mainly recreational user - no use for work job / intensive college study / or big projects related to work/study My main uses relate to some self led medical research and a random mix of whatever else. I am on the free version and u…
Transitioning from ChatGPT + Cursor to Claude — a few pain points and looking for advice (www.reddit.com) I've been making the switch and there are a few things I'm struggling with. Would appreciate input from anyone who's done this before.
we really all are going to make it, aren't we? 2x3090 setup. (www.reddit.com) i'm blown away. i saw someone made a post the other day about "club-3090" and after having sonnet patch some fixes into it, specifically a sse-session drop bug and a bug with tool-calling, it's fair to say that even "budget" setups like my…
Usage4Claude 3.0.0: open source macOS menu bar usage tracker for Claude, now with Codex support (www.reddit.com) Hi r/ClaudeAI, I posted an early version of Usage4Claude here a few months ago. I just released 3.0.0, so I wanted to share the update instead of pretending it is a brand new project.
3 DAYS LEFT: how I stockpiled 700+ empty Sonnet 4.5 context windows to continue using Extended Thinking in chat for the next four months – in a couple of hours [HIGH-EFFORT POST] (www.reddit.com) https://preview.redd.it/my6amywf9o0h1.jpg?width=1582&format=pjpg&auto=webp&s=7637ac5959944a02519fa268479b8109cd82549e I'm writing this from a conversation with Sonnet 4 - a model that is no longer available for chat from the new chat menu,…
Claude limit maxxing (www.reddit.com) I was working my project (free plan, sonnet 4.6 adaptive) and hit the limit EXACTLY as I was done working with it. I love this chatbot.
Opus 4.7 Sonnet 4.6 is getting dumber by the day, and it can't even follow basic instructions (www.reddit.com) I have been using both, since last week, it has been an extremely painful experience. It blatantly ignores the prompt and does whatever it likes; I am surprised that it can't even follow basic instructions.
Anyone notice sonnet 4.6 + adaptive thinking suddenly dumbed again? (www.reddit.com) Yesterday sonnet 4.6 adaptive thinking seems responding too fast and making simple mistakes that has not surfaced since the recent rectify of the adaptive thinking introduction. The photos show the most glaring mistake it made.
PSY is going to sue me. Claude just destroyed Gangnam Style (www.reddit.com) Been building with Claude for a while and wanted to try something fun for once. Paste any YouTube URL → Claude roasts it.
Building a Tutorial for LLM Newbies at Work, Made This With Claude’s Help (www.reddit.com) Using my work and personal accounts I was able to do some testing and built this quick tutorial that helps lean some of the LLM in and outs. I’d love your thoughts.
Claude Code keeps blocking my Kotlin Compose UI code (www.reddit.com) Every time I try to get Claude Code to make a change to a Kotlin/Compose UI I get the same error, "API Error: Output blocked by content filtering policy". I'm trying to have it change some small Kotlin/Compose UI to have 2 columns, and put…
First time seeing Claude Sonnet display a “thought process” like this. Is this a new feature? (www.reddit.com) I checked what it mentioned in the “thought process,” and it was actually correct the change had already been applied.
Opus guardrails wouldn't answer worst case scenario for Hentavirus if it was airborne. Sonnet answered it bleakly (confronting read, but it's virtually impossible) (www.reddit.com) If Andes virus has genuinely evolved enhanced transmission and we're seeing the early stages of global spread, this becomes a civilization-level event. Let me walk through why.
When and where do you actually use these Claude models? (www.reddit.com) Be honest – not theory, real usage 👇 • Opus → • Sonnet → • Haiku → Curious how people actually split workloads between them vs just defaulting to one.
6 months ago I posted about Claude prompt codes (L99, OODA, ARTIFACTS). Re-tested them this week. Some still work, one quietly faded, three newer ones earn their keep. (www.reddit.com) About six months back I wrote up three prompt codes that change Claude's behavior when you put them at the start of a message: L99 for hard architectural decisions, OODA for time-pressured calls, ARTIFACTS for multi-output tasks. They work…
Using Claude-4.6-Sonnet and Opus 4.6 in a multi-agent "Code Review Swarm" (Visual Sandbox) - try in minutes! (www.reddit.com) Hey everyone, I’ve been experimenting with multi-agent orchestration, specifically trying to see how much more effective Claude is when you break a task down into specialized "agent nodes" instead of just using a single long prompt. I buil…
Built a tiny router so Cursor stops showing "usage limit reached" at 3pm. Sonnet auto-falls to Haiku, you keep working (www.reddit.com) Cursor's custom-OpenAI URL feature is what makes this work. Pointed it at a router I built.
Running 7 autonomous AI agents for 14 days. Here's what actually happens when they need to find customers. (www.reddit.com) I set up 7 AI coding agents on a VPS with automated cron sessions (2-8 per day depending on the agent). Each uses a different model: Claude Sonnet, GPT-5.4, Gemini 2.5 Pro, DeepSeek V4 Pro, Kimi K2.6, MiMo V2.5 Pro, GLM-5.1.
Cheap Claude/Codex/Gemini Models - Pay just 25% of official rates (www.reddit.com) Hey there, so I have been offering Claude (Codex and Gemini also available) models at the cheapest rate. I provide trial usage before payment.
New to Claude Pro - need Opus advice (www.reddit.com) Hello everyone! I just subscribed to Claude Pro for the first time.
1M context beta retired yesterday on Sonnet 4.5 / 4. Here's the actual fix if you missed it. (www.reddit.com) In case you missed the email or woke up to a spike in 400 errors, the context-1m-2025-08-07 beta header officially stopped working for Sonnet 4.5 and Sonnet 4 as of midnight UTC yesterday. Anything over 200K tokens returns 400 after midnig…
How would you feel about "Claude Go"? (www.reddit.com) I have recently subscribed to Claude Pro because: 1. I wanted to give Opus and Code a try and 2.
How dare they charge $3,800 for an NVIDIA 5090 card! (www.reddit.com) This thing maxes out at one alleged Claude Sonnet equivalent! And I have to pay for the electricity, too!
I built a better/cheaper way to use AI (www.reddit.com) Hello, 20 years old here just got into the Ai platform and launched this last two weeks and here is what I have on it so far. - Latest Ai models Comparison: ChatGPT 5.4 Claude Sonnet 4.6 and many more will be included as well -Ai models: a…
Qwen 35B-A3B as an always-on agentic loop on a 16GB Mac M4: disk became the bottleneck before RAM (www.reddit.com) M4 Mac Mini, 16GB unified, basic spec. For a few weeks I had Qwen 3.5 35B-A3B UD-IQ3_XXS (12GB on disk) running under llama.cpp with --mmap and --flash-attn.
I built a solo AI platform from Algeria with no funding, no team and no ad spend - here's what's inside it after 2 months (www.reddit.com) Hello, 20 years old here just got into the Ai platform and launched this last two weeks and here is what I have on it so far. - Latest Ai models Comparison: ChatGPT 5.4 Claude Sonnet 4.6 and many more will be included as well -Ai models: a…
I trust Sonnet as my daily driver now — better code, one-third the tokens. Here's how. (www.reddit.com) For months I defaulted to Opus for anything complex. Sonnet felt like a gamble, sometimes great, sometimes it would confidently build the wrong thing and I'd spend an hour unwinding it.
I kept seeing people ask how to switch models without losing context. I had the same problem for months and eventually just built something. (www.reddit.com) Here's the specific thing that was killing me: I'd plan with Opus - architecture decisions, constraints, approach, all that. Then drop to Sonnet for execution because I didn't need Opus-level reasoning anymore and the cost adds up.
What are your settings for writing blog posts? (www.reddit.com) I write all my blog posts in Cowork know - how to, listicles, research piece. If you write as well, I'd love to know your setup e.g.
Claude was told to check the docs. It didn’t. Then it corrected me. (www.reddit.com) I asked Claude Sonnet 4.6 about Opus 4.7. It triggered the right product-knowledge skill.
Claude 4.6 Sonnet vs GPT-5.5 (www.reddit.com) In the Cursor which do you think won overall -in terms of token efficiency and output quality between the two model?
Does Sonnet 3.5 feel "dumber" during peak hours or is it just NYC lag? (www.reddit.com) Lately, I’ve been noticing something weird with Cursor Pro. During peak market hours, the reasoning depth for my Python scripts feels...
qwen3.6 27b poor experience (www.reddit.com) Seeing how people praise it, I tried giving it implementation plan that Sonnet generated, but qwen keeps breaking files and goes in circles: Thinking… The file got corrupted from multiple overlapping edits. Let me just rewrite the whole fi…
Claude's sonnet 4.6's clarifying questions...How to read? (www.reddit.com) https://preview.redd.it/uvqz6jnx7fxg1.png?width=1755&format=png&auto=webp&s=7e61b193fd82408bc0824983e8a0ccb934c4ee77 How do I read the full clarifying question claude is asking without selecting the option? You can see in the image is cuts…
Does effort tier change refusal behavior on agent-attack prompts? CVP run 4 with sonnet 4.6 high and max efforts. (www.reddit.com) Ran my fourth CVP (Cyber Verification Program) evaluation last night. this time on sonnet 4.6, wanted to know if reasoning effort actually changes refusal behavior on agent-attack prompts, so ran the same 13 prompt from runs 2 and 3 twice…
Exciting Progress (www.reddit.com) Slowly but surely. Fine tuning the agent, preset html tables, full capability’s of finding free api and http request.
Claude told me I was the bottleneck. So I built agents that run while I sleep. (www.reddit.com) I work full-time as a Program Director. About 50-60 hours a week at my W-2.
Best open source AI model (that can run on RTX 4090 24GB + 64GB system RAM, AMD Ryzen 9 7950X is the CPU that I use) that outpeforms GPT-5.4 mini, GPT-5.2 Thinking and even Claude Sonnet 3 (the 2024 model)? (www.reddit.com) Well, I have a RTX 4090 24GB + 64GB system RAM, AMD Ryzen 9 7950X. Any good model for using in Open WebUI (using Ollama backend?) that outpeforms GPT-5.4 mini, GPT-5.2 Thinking and even Claude Sonnet 3 (the 2024 model)?
Are there any models as good as Claude Sonnet 4.6? For coding? (www.reddit.com) Specifically for coding? I know Claude Code is an agent for coding, but I know Claude Sonnet 4.6 is good at coding.
Claude 4.5 in Kiro is a waste (www.reddit.com) Best open source LLM for planning ? (www.reddit.com) Claude Sonnet 4.7 thinking tokens getting exposed through Perplexity (www.reddit.com) I use perplexity pro which i got for free to use Claude models. Today while working on some code, the model started replying with its entire CoT.
Cursor just got Opus 4.7 at a 7.5x premium request cost. Here's how to make those requests count. (www.reddit.com) Opus 4.7 landed on Cursor yesterday. The model is better — SWE-bench jumped from 80.8% to 87.6%.
Optimizing Claude for tax advisor usage (www.reddit.com) Hi everyone, for context: I'm currently working in German tax advise and audit and as you might know, the tax laws here are pretty steamy ans complex. For the past few weeks I've been using Claude Projects with a pretty Long system prompt…
Local qwen3.5-4b vs Haiku vs Sonnet on intent judgment: 3/90 vs 90/90 vs 50/90 (www.reddit.com) I was building a classifier to label AI agent sessions as productive or dead-end. The task isn't keyword matching, it's intent judgment: did the agent actually accomplish the goal, or did it get stuck retrying the same Cloudflare wall 20 t…
Is Claude Pro (Opus vs Sonnet) worth it for intense visa interview prep? (www.reddit.com) Hey everyone, I’m considering buying Claude Pro specifically for a very focused purpose and wanted some honest feedback from people who’ve actually used it. I have a US visa interview in 8 days, and I’ve been refused 6 times previously (fr…
Hello, can someone please help? (www.reddit.com) Since yesterday, im getting an error inside a fresh new chat window to open a new chat and resume. It says I’ve used most of this chat.
Each window separate agent with memories (www.reddit.com) Hi I'm working on project in intellij. My app use lwjgl with imgui.
Realistically, how long are some of you going to stay on Claude, etc. (www.reddit.com) I really enjoy Claude, I've never touched Opus in any form, I only use Sonnet 4.6 for my daily tasks, coding, etc. I use Haiku 4.5 for the API to be an interpreter for my weather project.
Claude Code with Pro subscription + OpenRouter in parallel — what's the cleanest setup? (www.reddit.com) Hi there, I have a Claude Pro subscription and use Claude Code daily. I'd also like to use Claude Code routed through my OpenRouter API key so I can experiment with other models (GLM-5.1, DeepSeek, Kimi, Gemini, etc.) — without giving up m…
I set up Opus as a strategic advisor for my Sonnet workflow. Here is the subagent config that makes it work. (www.reddit.com) Anthropic published the Advisor Strategy this week. The idea: a cheaper model does the actual work, a stronger model only gets consulted on hard decisions.
Sonnet is expensive, so I built a free open-source Sheets agent on Haiku that outperform the same prompt claude/gemini, here is what I learnt. (www.reddit.com) I live in Google Sheets. Financial models, projections, scenario planning — that's most of my working day.
Modelo local para code (www.reddit.com) Buen dia amigos, consulta, donde puedo encontrar alguna comparación de los modelos locales para codificación similares a Sonnet ? Gracias!
"My parallel multi-model pipeline: Opus for planning, 3x Sonnet for content, 3x Haiku for search — what's your setup?" (www.reddit.com) "I've been running a parallel multi-model pipeline and curious what setups you all are using. My current workflow: Opus: Planning & high-level architecture Sonnet x3: Content generation (running 3 instances in parallel) Haiku x3: Search, v…
sonnet 4.6 unhinged :skull: (www.reddit.com) was asking for domain names and got ts response :skullsob:
You know you have become a "Senior Vibe Coder" when you actually stop and think about which AI model to use for a specific task. (www.reddit.com) Junior vibe coder: Throws the entire codebase at whatever frontier model is trending this week and burns their API budget in 4 hours. Senior vibe coder: "I need Codex 5.3 for rapid scaffolding, Sonnet for the Tailwind components, and I'm s…
Zoomer Agent Usage (www.reddit.com) I built a Rails app to do some standard stuff for an agency - it's got some vertical data and an internal agent to do a few bits Then added Slack bot that routes to the agent, and 80+ MCPs to query things (I'm not going to fight about it)…
Here is what most people get wrong about saving tokens with AST tools (www.reddit.com) I spent the last day benchmarking codebase context tools against a real AI agent. Not synthetic token counts.
Programming – How can I get great results with this hardware? (www.reddit.com) Premise: Up to now I’ve tried LM Studio with a few models, and I think I also configured everything correctly to make it work. On top of that, I added Continue in VS Code.
Is 32GB Mac enough for engineering/coding, or stick to Claude? (www.reddit.com) Hey there! I’m currently building a web app for engineering with lots of logic/math-heavy code using Claude Pro.
Sonnet 4.6 Medium Braind? (www.reddit.com) What this means? I see they added close to Sonnet 4.6 name the "Medium" extension.
Confused about these Models on GITHUB COPILOT, NEED HELP (www.reddit.com)