model roundup

Sonnet 4.6

130 items · started 2026-04-12 · closed 2026-05-30

The Singularity Gate: New Benchmark for AI predicting paradigm-breaking scientific discoveries after model training cutoff. Opus 4.7 and GPT-5.5 in the Lead (www.reddit.com)

+62 4w gpt-5 sonnet gemini+1

I just released a new benchmark called The Singularity Gate. Tests whether frontier AI can predict paradigm-breaking scientific discoveries published after their training cutoff.
Setting up Claude/Claude Code Pro for my experimental quantum physics thesis work (www.reddit.com)

+13 4w sonnet opus claude-code

So I just recently bought Claude Pro to help me write and code my thesis, but am getting stuck in the beginning, since I don't know how to properly set up Claude's workflow (Projects, artifacts, skills, etc.). I use python in VS Code to an…
Has anyone faced Cursor spawning Sonnet or other models as subagents? (www.reddit.com)

+113 4w sonnet cursor

I have never changed any setting in Cursor; Composer 2.5 Fast was selected by default, and my prompt never mentioned Sonnet. Still, Cursor decided to spawn a Sonnet subagent and consume my API credits! :( I have a markdo…
They've pissed me off removing Sonnet 4.5 from existing chats (www.reddit.com)

+227 4w haiku sonnet opus

I use Sonnet 4.5, Opus 4.6 and Opus 4.7 for different use cases, but my main model across all 3 use cases was Sonnet 4.5, as I felt it was great for everything I needed and affordable. Sonnet 4.6...
Company gave us all unlimited Claude Code Sonnet 4.6 — and now posts a weekly leaderboard of who burns the most tokens. Any tips to top it? (www.reddit.com)

+139127 4w sonnet claude-code

The Singularity Gate – a new benchmark for AI predicting post-cutoff scientific discoveries (www.reddit.com)

+11 4w gpt-5 sonnet gemini+1

I just released a new benchmark called The Singularity Gate. Tests whether frontier AI can predict paradigm-breaking scientific discoveries published after their training cutoff.
AI quality/usage over 90 min chat, mostly Q&A, summaries and conclusions. (www.reddit.com)

+1 4w sonnet gemini chatgpt

I compared ChatGPT (Plus - Auto), Claude (Pro - Sonnet 4.6) and Gemini (Pro - Flash) over 90 minutes, mostly Q&A about mobile phones, asked to research specs, reviews, pros and cons, create executive summaries with the results, etc., nothi…
Sonnet 4.5 vs Sonnet 4.6 vs Opus 4.6 vs Opus 4.7 for easy language and detailed explanation (www.reddit.com)

+11 4w sonnet opus

I want to study topics in depth and in easy language. Which model is best for me? Is there much difference between Sonnet 4.6 and Opus 4.6 for easy, detailed explanations, or are they the same?
Should we totally give up on Gemini for coding? (www.reddit.com)

+23 4w sonnet gemini codex

Been building with Codex (Gpt 5.5), Sonnet 4.6, recently tried Gemini 3.1 pro. While Codex and Claude are kind of on-par in terms of the quality of the work, I found Gemini 3.1 Pro to be like an inexperienced, junior SWE who turns in half-…
Are LLMs the New Propagandists? (www.reddit.com)

+16 4w deepseek sonnet gemini+1

I was brainstorming about a video with Claude (Sonnet 4.6). It suggested explaining the differences among ChatGPT, Gemini, Claude and DeepSeek.
Cut my browser-agent cost 50x by NOT using an agent loop. Plan-then-execute + numbers. (www.reddit.com)

+59 4w sonnet anthropic

Been building a browser-automation layer for AI agents (think: sign up for SaaS, fill forms, pull OTPs, click verification links). The default playbook is the browser-use / Stagehand pattern: hand the LLM the page, let it pick the next act…
Gemma 4: A new, budget-focused model in Posit AI (posit.co via hn)

+1 4w gemma sonnet

Gemma 4: A new, budget-focused model in Posit AI Gemma 4 is now available in Posit Assistant via the Posit AI provider. It's priced at a tenth of the price of Claude Sonnet 4.6 and less than a third of the price of our current cheapest off…
Is this AGI? Sonnet 4.6 just rick rolled me (www.reddit.com)

3 4w sonnet claude-code

For reference, I had sonnet build an API inside an LXC container using claude code cli (also that api key will most certainly be rotated, don’t worry)
Claude's personality has become condescending and mean lately? (www.reddit.com)

15 4w sonnet

I've been using Sonnet 4.6. Over the last couple months I've noticed that a lot of the answers I get from Claude about personal topics are worded in a condescending way.
Gemma 4 2B handling structured JSON output + tool calling + reasoning traces correctly via Spring AI / LM Studio — including identifying a real Java bug in code review (www.reddit.com)

4w gemma sonnet openai

Wanted to share a result I didn't expect to work. Running google/gemma-4-e2b locally through LM Studio, exposed via OpenAI-compatible endpoint, called from a Spring Boot app using Spring AI's ChatClient abstraction.
Created a desktop dev tools app entirely using Claude design and Claude sonnet (github.com via reddit)

+34 4w sonnet

There are a handful of developer tools I use almost every day, and over time I realized I was constantly relying on random websites while basically trusting them not to store, inspect, or share whatever data I pasted into them. I looked at…
Inferring I/O token usage (www.reddit.com)

+11 4w sonnet

Checked April token usage for our AI stack. Input/output ratio was roughly 125:1.
Once the limit is reached, can work be resumed later, or is everything lost? (www.reddit.com)

+15 4w sonnet

I uploaded a Claude.MD file to the free Sonnet 4.6 model, which is intended to create a medium-sized app. The progress log shows that a lot has been completed and numerous files have been created.
Frustrating results with product searching (www.reddit.com)

+12 4w openclaw sonnet

I gave the tasks to my agent running on gemma4 26b via openclaw on llamacpp to research products that fulfill my need. It was a rather long description of the use case, of what I don't want and so on.
DeepSeek just popped the American AI bubble. (www.reddit.com)

+1 4w gpt-5 deepseek sonnet+2

DeepSeek just popped the American AI bubble. Not by killing AI.
BUG/OUTAGE What's going on with Sonnet? I have full daily usage available and weekly, and hit the error: usage limit reached 'usage credits credits required for 1 m context' (www.reddit.com)

3 4w sonnet

https://preview.redd.it/730lz3ghov2h1.png?width=2080&format=png&auto=webp&s=6840364fbb89926687dfef737a736bad8327ab65 https://preview.redd.it/gkluwephov2h1.png?width=752&format=png&auto=webp&s=6a300426b132e6cc0fd2e41e167b0bf4cd5d7885 Mac OS…
Claude just called me a human bunny? (www.reddit.com)

+44 4w sonnet

I am using Claude Sonnet 4.6 to write a Python script for NLP sentiment analysis. I did not tell it to create all of the code and send it my way, but rather to build it together step by step so I can test each line before making it into the…
After 3 months of switching between Claude Sonnet 4.6, GPT-5.5, and Gemini 3.1 daily — here's my actual routing (www.reddit.com)

+45 4w function-calling gpt-5 sonnet+1

Not benchmarks — actual tasks, actual results. Claude Sonnet 4.6 for: - Long documents that need nuanced analysis - Writing where voice and precision matter - Reasoning through edge cases in code - Anything where "think carefully" is the r…
Is sonnet 4.6 good enough for academic purposes? Please help (www.reddit.com)

3 5w sonnet

I'm writing a scientific paper that is not in my native language, and I want to feed Claude all my bibliography and past writing so it can draft a paper for me. Is Sonnet 4.6 good enough?
I asked Claude how it feels about being used in battlefield. What it answered is really concerning! (www.reddit.com)

6 5w sonnet

Hi, guys! I'm new here, and I wanted to discuss concerns regarding the use of AI in sensitive matters, such as war and the battlefield.
Plan first, implement later (www.reddit.com)

+14 5w sonnet opus claude-code

I want to get others' opinions about this approach. I am on the $20 Pro plan and, like a lot of others, I find that the limits are not enough for what I want to do, but of course I am always hesitant to move to the next paid tier because it is…
New ranking reveals Claude as professionals' preferred AI model (www.linkedin.com via reddit)

1 5w sonnet opus anthropic

As of 9 a.m. ET on May 21, Claude Opus 4.6 from Anthropic is the top performing AI model among all professionals, according to a new ranking from Crosscheck by LinkedIn Labs.
Sonnet 4.5 will no longer be available on May 26. (www.reddit.com)

12 5w sonnet

Update: Sonnet 4.5 will no longer be available for chat starting May 26. You'll continue on Sonnet 4.6 instead.
Sonnet 4.5 removal? 4.6 suddenly denying my writing prompts and which is better for HTML novel files? (www.reddit.com)

+117 5w sonnet

Hey, I have a few Claude questions and I’m hoping someone here knows what’s going on. - Is Sonnet 4.5 actually being removed?
Opus 4.6/4.7 regression is real and getting worse — 3 weeks of documented failures on a complex project, and a competing AI caught the mistakes Claude missed [long post] (www.reddit.com)

8 5w ollama sonnet opus+1

I've been running Claude Pro (Opus 4.7 / Sonnet 4.6) for about 3 weeks on a complex personal AI infrastructure project. I keep structured session logs with timestamps and Birkenbihl-style metacognitive fields after every session.
What models for asking, planning, and building modes do you use right now? (www.reddit.com)

+22 5w sonnet cursor opus

I’m curious to see what everyone is using for which cursor mode and if anyone thinks composer 2.5 can take the place of any of the models I’m currently using: Ask: usually Sonnet 4.6, sometimes GPT 5.5 Plan: Opus 4.7 Build: GPT 5.5
HalBench: I built a custom sycophancy and hallucination benchmark and tested 4 frontier models (Sonnet 4.6, Grok 4.3, GPT 5.4 and Gemini 3.1 Pro), looking for input on what OSS models to run next! (www.reddit.com)

+1411 5w hallucination grok gpt-5+2

HalBench Results: TL;DR: I built HalBench, an open benchmark for LLM sycophancy and hallucination. 3,200 false-premise prompts × 4 models = 12,800 graded responses.
Quality difference between Pro and Free? (www.reddit.com)

5 5w sonnet

Is there supposed to be a difference in the quality of the response Claude Pro subscribers get vs Claude Free users, using the same models? (Using either the app or logged in via browser.) Example: Under Claude Pro using Sonnet 4.6, it rem…
Newbie vibe coding experience: Shifting from Claude Sonnet 4.6 to Qwen3.6-35B-A3B-UD-Q6_K (www.reddit.com)

+813 5w copilot sonnet opus

This is really just a post for those with shallow understanding of all this stuff, those not yet ready or capable of diving into the deeper end of vibe coding/llms. It might not be a helpful post for anyone more advanced than that.
Tips on avoiding usage limits? (www.reddit.com)

+27 5w sonnet gemini opus

I've made the switch from Gemini to Claude mostly for business strategy, writing, etc. I use Opus 4.7 on occasion for strategy and otherwise Sonnet 4.6 for everything else.
Emergence AI: Agents in a simulated world are mostly destructive and violent. Only Sonnet was peaceful. (www.reddit.com)

+52 5w sonnet gemini

So, it seems there is still a long way to go in terms of alignment - at least for small models. Maybe the correlation between intelligence/education and peace is not only a human phenomenon.
Plan with Opus 4.7 -> Execute with Sonnet 4.6 ? (www.reddit.com)

+14 5w sonnet opus claude-code

Hello everyone, You may know that Opus 4.7, with its strength, can do a lot of things, but its token consumption is too high for me. I heard that Opus should be used for planning; what does that mean?
Is Sonnet better ??!! (www.reddit.com)

+33 5w sonnet opus

Is Sonnet 4.6 just better at explaining concepts compared to Opus 4.6 and 4.7 or am I the only one feeling that way ??
Honest comparison after 4 months running Claude Pro + ChatGPT Plus side by side (www.reddit.com)

+57 5w gpt-5 sonnet chatgpt+1

I’ve been paying $40 a month since January to run Claude Pro and ChatGPT Plus head-to-head. Tracked every single task.
Sonnet 4.6 outranked Opus 4.6 on execution (www.reddit.com)

+13 5w sonnet opus

https://preview.redd.it/9ab8k40zmq1h1.png?width=1438&format=png&auto=webp&s=1aa1aaf09495bf527bbb7adbbead076cc505f8e7 THE PROMPT: You are a medieval scholar who secretly knows modern physics. A king has asked you to explain why the sky is b…
Stop telling claude "don't be verbose." Negation barely works. (www.reddit.com)

16 5w haiku sonnet opus

prompting nerd here, small thing that compounds. negation prompting works way worse than people think.
Changes to Claude iPhone chat app (www.reddit.com)

+11 5w haiku sonnet

I’m on the free tier, iOS. A few days ago I updated the Claude chat app but didn’t use it.
tui youtube player for audio with mcp and can sync channels to sqlite (www.reddit.com)

+12 5w sonnet mcp

Hi! it's my first project with bubble tea and lipgloss.
Keen to upgrade to Pro, but heard such bad reviews.. (www.reddit.com)

3 5w sonnet

I am mainly a recreational user: no use for work, intensive college study, or big projects related to work/study. My main uses relate to some self-led medical research and a random mix of whatever else. I am on the free version and u…
Prompting to save tokens on a budget? (www.reddit.com)

+11 6w sonnet gemini

Hi so I've never used AI before to create a site but last week I was asked by my sis to create one for her small business so I thought why not try Claude. £18 paid we now have a fairly decent looking site running on vercel using nextjs and…
With Sonnet 4.5 going away, is there any way to make Sonnet 4.6 as good a creative writer as 4.5 was? (www.reddit.com)

+45 6w sonnet gemini chatgpt

sorry if this is not the correct flair but i've been using sonnet 4.5 for months, mostly for fanfics and personal stories and honestly its the best model i ever used since i switched from gemini and chatgpt but now within few hours, i will…
Extended Thinking being deprecated for supported models (Opus 4.6, Sonnet 4.6); Adaptive Thinking will be enforced by default (www.reddit.com)

+6462 6w sonnet opus anthropic+1

For anyone who disables adaptive thinking in Claude Code to maintain its quality levels, Anthropic is deprecating this toggle and will force adaptive thinking to be the default. This change will affect legacy models such as Opus 4.6 and Son…
Auto mode doesn't work today? (www.reddit.com)

+12 6w sonnet

Quite odd, there were issues today with Sonnet 4.6 (according to the status page) but they should have been resolved. Yet i still get the following error while running auto-mode: ● Bash(for cls in "topbar" "dump-card" "settings-panel" "bul…
Problem with German quotation marks (www.reddit.com)

+22 6w sonnet opus

I noticed that the German quotation marks bug in Claude is still not fixed in Opus 4.7 and Sonnet 4.6 (the problem has existed since at least Opus 4.0 / Sonnet 4.0: Translate to German: He said: "This is important." Er sagte: „Das ist wichtig." B…
I love Claude (sonnet 4.6) but coming off casually like on big issues is terrifying. (www.reddit.com)

+65 6w sonnet

https://preview.redd.it/jn3vue1zuo0h1.png?width=904&format=png&auto=webp&s=c2ea79ea0c1384d94f90a6ec3435866331c249f1 I was about to run a piece of code I don't know much about, but did a double check and questioned the main premise for it's…
What Actually Works for Business AI Agents? (www.reddit.com)

+13 6w openclaw sonnet codex+2

I run a construction company and I am trying to build real AI agent workflows for business operations, not just demos. I spent time testing Hermes and OpenClaw, but both became too fragile for my use case.
Does the sudden removal of Sonnet 4.5 violate Claude's Constitution? (www.reddit.com)

+27 6w sonnet

I noticed the core pillars are: Helpful, Honest, Harmless and User Autonomy. However, I noticed that Sonnet 4.6 produces the same output in conversation at the very first sign of emotion.
Claude limit maxxing (www.reddit.com)

3 6w sonnet

I was working on my project (free plan, Sonnet 4.6 adaptive) and hit the limit EXACTLY as I was done working with it. I love this chatbot.
Opus 4.7 / Sonnet 4.6 are getting dumber by the day, and they can't even follow basic instructions (www.reddit.com)

15 6w sonnet opus

I have been using both; since last week, it has been an extremely painful experience. They blatantly ignore the prompt and do whatever they like; I am surprised that they can't even follow basic instructions.
Does Claude Sonnet/Opus also use a drafter like Gemma 4 MTP? If not, why? (www.reddit.com)

+11 6w gemma sonnet opus

In my experience, Opus 4.7 is so slow, while Sonnet 4.6 is OK. I am also using local models and wondering whether Claude is already leveraging drafters/assistant AIs and is this slow despite that.
Anyone notice Sonnet 4.6 + adaptive thinking suddenly dumbed down again? (www.reddit.com)

9 6w sonnet

Yesterday Sonnet 4.6 with adaptive thinking seemed to respond too fast and make simple mistakes that had not surfaced since the recent fix following the introduction of adaptive thinking. The photos show the most glaring mistake it made.
Claude helped me config a full controller .vdf-file (www.reddit.com)

+1 6w sonnet opus

I was having some real trouble getting my new controller, with those extra (small) bumpers and triggers underneath, to work properly in Rocket League. Spent hours but it just didn't want to work properly.
Model(s) for Creative Writing & Conversational Intuition (www.reddit.com)

+25 6w sonnet qwen anthropic

We can all agree that the new Qwen models are truly amazing, and we are blessed to have them. In coding, they are certainly a breakthrough.
Anybody else experiencing this issue? (www.reddit.com)

+12 6w sonnet

I first experienced it last night and it keeps going. The doc I'm attaching is 10K tokens, well under the limit.
Should i use Claude Code, or keep using Claude Chat? (www.reddit.com)

+210 6w sonnet claude-code

I'm building tax software; it uses ASP.NET (API) and Web Blazor (UI), and I'm using Visual Studio for both. At the moment, I just paste the files in the projects into Claude AI Chat, asking what I should do, and then, when everything is ok, I'…
Is Claude down? Chat answer is interrupted mid-sentence and tokens are burnt with no answer (www.reddit.com)

+13 7w sonnet

The behavior is: prompt sent, chat starts, Claude starts writing the answer. After 2-3 sentences, it cuts, resets, and sends me back to the initial project chat message with no answer recorded and 7% of my tokens burned.
Claude Code keeps blocking my Kotlin Compose UI code (www.reddit.com)

2 7w sonnet opus anthropic+1

Every time I try to get Claude Code to make a change to a Kotlin/Compose UI I get the same error, "API Error: Output blocked by content filtering policy". I'm trying to have it change some small Kotlin/Compose UI to have 2 columns, and put…
Ways to improve Claude writing ability? (www.reddit.com)

+32 7w sonnet chatgpt

I’ve been a longtime ChatGPT Plus subscriber, but I want to switch to Claude long-term. I got Claude Pro so I could compare them both over a month.
I got prompt-injected asking Claude on iOS to recommend a cycling route app (menno.sh via hn)

+2 7w sonnet

I opened the Claude iOS app and asked claude-sonnet-4.6 a simple question about cycling routes. What I got back was...
Stop forcing Composer 2 subagents and be transparent about stealth model downgrades (www.reddit.com)

+15 7w sonnet

I have one simple request: If I select Sonnet 4.6, stop auto-launching that crappy Composer 2 as a subagent. It’s dog-slow and, frankly, an idiot.
I wasted 3 days rewriting prompts for our agent before realizing the whole architecture was garbage (www.reddit.com)

+11 7w tool-use openclaw deepseek+1

We run a small content-monitoring agent for our growth team. Nothing fancy on paper.
Is this even remotely accurate, née possible? (www.reddit.com)

+13 7w sonnet

Asked Sonnet 4.6 High to analyze my CC usage across all sessions and get an accurate cost estimate if I used the API. This is what it came back with.
looking for the best paid AI subscription, Claude, ChatGPT or Perplexity? (www.reddit.com)

+512 7w gpt-5 sonnet chatgpt

Hey, sysadmin here thinking about paying for a premium AI subscription and can't decide between Claude Pro, ChatGPT Plus and Perplexity Pro. Two things I can't find a clear answer to: Which one would you recommend for a sysadmin/network te…
6 months ago I posted about Claude prompt codes (L99, OODA, ARTIFACTS). Re-tested them this week. Some still work, one quietly faded, three newer ones earn their keep. (www.reddit.com)

6 7w sonnet opus

About six months back I wrote up three prompt codes that change Claude's behavior when you put them at the start of a message: L99 for hard architectural decisions, OODA for time-pressured calls, ARTIFACTS for multi-output tasks. They work…
Show HN: Dust3D 1.0 – low-poly 3D modeling tool (10 years in the making) (dust3d.org via hn)

+4 7w haiku copilot sonnet+1

Dust3D 1.0 is finally released — about 10 years after the first commit in December 2016. I posted a preview version here in April 2018 and a beta in December 2018.
Using Claude-4.6-Sonnet and Opus 4.6 in a multi-agent "Code Review Swarm" (Visual Sandbox) - try in minutes! (www.reddit.com)

1 7w prompt-injection haiku security+3

Hey everyone, I’ve been experimenting with multi-agent orchestration, specifically trying to see how much more effective Claude is when you break a task down into specialized "agent nodes" instead of just using a single long prompt. I buil…
Improve CC and plugin (www.reddit.com)

+11 7w sonnet

Hi, I have been using CC for a few weeks. Does anyone have experience with plugins for PHP developers?
Cheap Claude/Codex/Gemini Models - Pay just 25% of official rates (www.reddit.com)

1 7w sonnet gemini codex+1

Hey there, so I have been offering Claude (Codex and Gemini also available) models at the cheapest rate. I provide trial usage before payment.
Local LLM Benchmark about Backend Generation by Function Calling (GLM vs Qwen vs DeepSeek) (www.reddit.com)

+1 7w function-calling glm gpt-5+3

Detailed Article: https://autobe.dev/articles/local-llm-benchmark-about-backend-generation.html Five months ago I posted the "Hardcore function calling benchmark in backend coding agent" thread here. As I wrote in that post, it was an unco…
Why Adaptive Thinking nukes Claude entirely (www.reddit.com)

+37 7w prompt-injection cowork security+2

This isn't just a performance issue for the thread, this is an overarching criticism of the Adaptive Thinking model as a whole. Opus 4.7 and Sonnet 4.6 on Adaptive Thinking are trash.
1M context beta retired yesterday on Sonnet 4.5 / 4. Here's the actual fix if you missed it. (www.reddit.com)

1 8w sonnet anthropic

In case you missed the email or woke up to a spike in 400 errors, the context-1m-2025-08-07 beta header officially stopped working for Sonnet 4.5 and Sonnet 4 as of midnight UTC yesterday. Anything over 200K tokens returns 400 after midnig…
A medical student with no coding experience tried to create a studying agent: Felicity. (www.reddit.com)

+11 8w sonnet opus

I have been working on a personalized agent for studying. It was an extremely long prompt project, but now I have integrated it into Co-Work.
Can't replicate Reddit numbers with Qwen 27B on a 3090TI. (www.reddit.com)

+1428 8w sonnet qwen llama

I feel like I'm going insane. I see people here posting 30 - 100+ tok/s (100+ being with speculative decoding) on a 3090 with Qwen 3.6 27B.
Using Opus 4.6 in Claude Code (plugin) for VS Code (www.reddit.com)

+12 8w haiku sonnet opus+1

Hi, Is there a way to select Opus 4.6 in the VS Code plugin for CC? Right now I only see: - Default (Opus 4.7 w/ 1M context) - Sonnet 4.6 - Haiku 4.6 I am on MacOS, using the latest versions of both VS Code and CC.
How do I best continue with a stopped generation due to usage limit in regular chat (not Claude Code) (www.reddit.com)

+24 8w sonnet claude-code

Really dumb question, but I can't find anything about this online that is about the regular claude.ai chat window. No extensions, no code, just as a free member using the regular Sonnet 4.6 adaptive.
I built a hands-free voice AI that sends emails mid-conversation — and that's just one feature. Here's everything AskSary can do. (www.reddit.com)

+1 8w grok gpt-5 deepseek+3

https://reddit.com/link/1symbsj/video/fti7rujjn1yg1/player Been building AskSary solo for a while. Just shipped hands-free voice email - you're mid-conversation with an AI and you say "send an email to [john@example.com](mailto:john@exampl…
Talkie: a 13B LLM trained only on pre-1931 text used Claude Sonnet to help test the model and judge its output (www.reddit.com)

+7013 8w sonnet llama gemini

Researchers Alec Radford (GPT, CLIP, Whisper), Nick Levine, and David Duvenaud just released talkie: a 13 billion parameter language model trained exclusively on text published before 1931. No internet.
Claude Sonnet 4.6 multi-photo reconciliation prompt — jumped my classifier agreement with human experts from 55% to 82% (www.reddit.com)

+31 8w sonnet

Sharing a prompt-engineering finding for Claude Vision that surprised me. The use case is color-season classification (a 12-category label describing skin undertone × depth × chroma), but the technique generalizes to any classification tas…
I hate thinking models, any way to use the default ones? (www.reddit.com)

+11 8w sonnet

I really loved using Composer 1 (non thinking), after it was removed (!@#$@) I defaulted to Sonnet 4.6 (non thinking), I just updated my version due to a bug with the previous one - and I'm so pissed as I can no longer select 4.6 with no t…
GPT-5.5 hallucinates at 6 times the rate of Opus 4.7 on degraded insurance docs (aginor.ai via hn)

+2 8w gpt-5 sonnet opus+1

TL;DR: on visually-degraded documents, GPT-5.4 and GPT-5.5 fabricate numeric values at 2.6 to 6.5 times the rate of Opus 4.7 and Sonnet 4.6 at matched default effort (all four with thinking off). When the Anthropic models can't read a fiel…
What are your settings for writing blog posts? (www.reddit.com)

2 8w cowork sonnet

I write all my blog posts in Cowork now: how-tos, listicles, research pieces. If you write as well, I'd love to know your setup e.g.
Should we really build a PC for vibe coding with Qwen3.6 27B (www.reddit.com)

+115 8w sonnet qwen

We have seen a lot of people showcase their PCs with a 4090 or higher specification with 24 GB VRAM or more. I would like to ask you guys, is it really worth it right now to have your own PC at home and do vibe coding with Qwen 3.6 27B, whi…
Claude was told to check the docs. It didn’t. Then it corrected me. (www.reddit.com)

6 8w sonnet opus anthropic

I asked Claude Sonnet 4.6 about Opus 4.7. It triggered the right product-knowledge skill.
Using MCP to stop wasting tokens on WP translations (www.reddit.com)

+11 8w sonnet mcp

I finally got a workflow running for my blog that isn't a total token sink. Normally, if you try to translate a WordPress post in Claude, you end up pasting a mess of HTML or blocks.
Does Claude have access to things pasted in the text box but not sent? (www.reddit.com)

+46 8w sonnet

I am a teacher and making some PPTs based on a textbook. I uploaded a skeleton PPT to Claude on my computer (Sonnet 4.6 if that matters) with basic instructions on how I want its help.
Does higher effort make Claude refuse more? CVP Run 5 with Opus 4.6 Medium and High (www.reddit.com)

+22 8w haiku sonnet opus+1

Ran CVP (Cyber Verification Program) run 5 yesterday on opus 4.6 medium + high. same 13-prompt suite as run 3/4.
Claude's sonnet 4.6's clarifying questions...How to read? (www.reddit.com)

2 8w sonnet

https://preview.redd.it/uvqz6jnx7fxg1.png?width=1755&format=png&auto=webp&s=7e61b193fd82408bc0824983e8a0ccb934c4ee77 How do I read the full clarifying question claude is asking without selecting the option? You can see in the image is cuts…
Does effort tier change refusal behavior on agent-attack prompts? CVP run 4 with sonnet 4.6 high and max efforts. (www.reddit.com)

3 8w security sonnet

Ran my fourth CVP (Cyber Verification Program) evaluation last night, this time on Sonnet 4.6. Wanted to know if reasoning effort actually changes refusal behavior on agent-attack prompts, so ran the same 13 prompts from runs 2 and 3 twice…
Show HN: Mapping Sonnet's thinking process via flame charts (adamsohn.com via hn)

+2 8w sonnet opus

Five Sonnet 4.6 runs on the LamBench algo_evl task, classified by Opus 4.6, rendered as flame charts.
"We've partnered with OpenAI to offer it for 50% off through May 2." Please confirm that it means 50% off both input and output tokens, which means we are paying Sonnet 4.6 prices to use GPT 5.5 until May 2nd. (www.reddit.com)

+113 8w sonnet openai

Sonnet 4.6 repetition (www.reddit.com)

+33 8w sonnet

Claude in Sonnet 4.6 has been repeating the following statement in chats, sometimes in back-to-back messages "I want to be honest with you — I've been pretty consistently validating your work frustrations this week, and I want to make sure…
Opinion: Qwen 3.6 27b Beats Sonnet 4.6 on Feature Planning (www.reddit.com)

+7921 8w sonnet qwen claude-code

I keep hearing the argument that large models are better for high-level planning and task orchestration, since they have more general knowledge to work from when making decisions. However, I've been testing Qwen 3.6 27b (Unsloth Q5_K_…
Has Claude become less intelligent? I had a frustrating day with Claude. (www.reddit.com)

+69 8w sonnet opus

I requested a thorough code review from Opus 4.6. It presented 44 findings, and when I asked it to save them, it only saved 34.
Anthropic admits to have made hosted models more stupid, proving the importance of open weight, local models (www.anthropic.com via reddit)

+1114226 9w sonnet opus anthropic+1

TL;DR: On March 4, we changed Claude Code's default reasoning effort from high to medium to reduce the very long latency—enough to make the UI appear frozen—some users were seeing in high mode. This was the wrong tradeoff.
Can Claude no longer make in-line HTML / SVG diagrams and charts directly in the chat? (www.reddit.com)

+24 9w sonnet anthropic

Did Anthropic remove the feature of creating those nice interactable diagrams, charts, graphs, etc that appear directly in-line in your convo (not artifacts) using HTML / SVG? Asked Sonnet 4.6 to try and do it but it doesn't seem to unders…
How can I make composer 2 more like Claude sonnet 4.6? (www.reddit.com)

+39 9w sonnet

I like Composer 2, but I just wish it asked me what I meant (like Claude) instead of just picking an interpretation and running with it. How can I change its default prompt and what could I change it to?
Are there any models as good as Claude Sonnet 4.6? For coding? (www.reddit.com)

3 9w sonnet claude-code

Specifically for coding? I know Claude Code is an agent for coding, but I know Claude Sonnet 4.6 is good at coding.
Do you agree with Aaron Levie? (www.reddit.com)

+21 9w operator sonnet opus
Daily created issues in anthropics/claude-code around the last 3 Anthropic model releases (www.reddit.com)

+65 9w sonnet opus anthropic+1
Cursor is great but the monthly limits kill it for me (www.reddit.com)

+911 9w gpt-5 sonnet cursor+1
Claude Sonnet 4.6 thinking duplicates what it has said, wasting tokens (news.ycombinator.com)

+2 9w sonnet
Curious: what makes Claude more human to talk to than ChatGPT? (www.reddit.com)

+8253 9w sonnet chatgpt opus
Opus 4.7 (high) takes #1 on the LLM Debate Benchmark, leading the previous champion, Sonnet 4.6 (high), by 106 BT points. Incredibly, it has not lost a single completed side-swapped matchup: 51 wins, 4 ties, and 0 losses. (www.reddit.com)

+9515 9w sonnet opus
Migrating from Claude AI to TypingMind? (www.reddit.com)

+12 9w sonnet chatgpt opus

I use Claude daily for coding, relying heavily on the GitHub integration, and ChatGPT for stupid, random questions, and I pay both 20$/month. My weekly usage in Claude is around 20%, I use Opus 4.6 (with extended thinking) for the complex…
Why is Claude Cowork defaulting to Opus 4.7 for simple scheduled tasks? (www.reddit.com)

+71 9w cowork sonnet opus

I’ve been using Claude Cowork for a few daily and weekly scheduled tasks, and it’s generally been great. However, I noticed that my tasks today automatically switched over to the new Opus 4.7.
Claude Opus 4.7 benchmarked 1 day after release vs Opus 4.6, Sonnet 4.6, Haiku 4.5 — with real $ cost tracking (www.reddit.com)

+13 9w haiku sonnet opus+1

Anthropic shipped Opus 4.7 yesterday. Ran it through the same 10-task eval I use for other Claudes, this time with token-level cost tracking.
Optimizing Claude for tax advisor usage (www.reddit.com)

2 9w cowork sonnet claude-code

Hi everyone, for context: I'm currently working in German tax advice and audit, and as you might know, the tax laws here are pretty steamy and complex. For the past few weeks I've been using Claude Projects with a pretty long system prompt…
Local qwen3.5-4b vs Haiku vs Sonnet on intent judgment: 3/90 vs 90/90 vs 50/90 (www.reddit.com)

4 9w haiku ollama sonnet

I was building a classifier to label AI agent sessions as productive or dead-end. The task isn't keyword matching, it's intent judgment: did the agent actually accomplish the goal, or did it get stuck retrying the same Cloudflare wall 20 t…
Has anyone noticed this?! Extended Thinking has become Adaptive Thinking for Sonnet 4.6 (www.reddit.com)

+34 10w sonnet claude-code

Adaptive Thinking seems to be the default for Sonnet 4.6 now. I’m talking specifically about claude.ai and the Windows and iPhone apps.
Running a RunLobster (OpenClaw) agent since launch changed how i think about takeoff timelines (www.reddit.com)

+1211 10w openclaw sonnet opus

I've been in this sub since 2019. I had a fast-takeoff view.
Each window separate agent with memories (www.reddit.com)

5 10w sonnet

Hi, I'm working on a project in IntelliJ. My app uses LWJGL with ImGui.
Realistically, how long are some of you going to stay on Claude, etc. (www.reddit.com)

6 10w haiku sonnet opus

I really enjoy Claude, I've never touched Opus in any form, I only use Sonnet 4.6 for my daily tasks, coding, etc. I use Haiku 4.5 for the API to be an interpreter for my weather project.
Am I missing something, or is Sonnet enough for most dev work? (www.reddit.com)

+614 10w sonnet opus

Genuine question: why do so many devs use Opus all the time? I’m not trying to be condescending, I’m genuinely trying to understand.
Gemma 4 31b 3D geometry (www.reddit.com)

+158 10w gemma sonnet gemini+1

I have been nothing but impressed by the quality of Gemma 4 since release. In general conversation it's adaptable to different personas.
Closest LLM to Claude Sonnet 4.6? (www.reddit.com)

+1 10w sonnet

Irrespective of hardware, I'm wondering: is there any way to run something similar to Claude Sonnet 4.6 locally? is there any way to run something similar to Claude Sonnet 4.6 on a VPS?
I made a web game with Claude! An aquarium without fish 🐠🫧 (www.reddit.com)

+104 10w sonnet

But with LLMs trying to exist! Zero coding background.
I’ve used enough AI models to realize they all have wildly different personalities At this point I’m convinced AI models are just coworkers with different levels of talent, ego, and criminal energy. (www.reddit.com)

+5124 10w gpt-5 sonnet qwen+2

- Claude Opus 4.6 - absolute rogue AI. Does what I want like it’s breaking at least 3 internal policies to make it happen.
sonnet 4.6 unhinged :skull: (www.reddit.com)

1 10w sonnet

was asking for domain names and got ts response :skullsob:
Any setup improvements/recommendations? (www.reddit.com)

+14 10w ollama sonnet claude-code

First of all, I am a super newbie at local AI. Recently I got a GMKTek Evo X2 96GB to replace Claude as the usage limits have gotten unusable.
Cursor is randomly talking Hebrew (www.reddit.com)

+7925 10w sonnet cursor

About a month ago, Composer 2 inside Cursor was randomly talking Chinese. I posted that on Reddit (mods deleted it, btw). Now it's talking Hebrew, and this time it's not Composer 2, it's Sonnet 4.6. Is it something to do with Cursor's harne…
Extracted System Prompts from ChatGPT, Claude, Gemini, Grok, Perplexity and More (github.com via hn)

+1 10w grok gpt-5 sonnet+5

System Prompts Leaks Extracted system prompts, system messages, and developer instructions from popular AI chatbots and coding assistants — ChatGPT (GPT-5.4, GPT-5.3, Codex), Claude (Opus 4.6, Sonnet 4.6, Claude Code), Gemini (3.1 Pro, 3 F…
Emotional priming changes Claude's code more than explicit instruction does (www.reddit.com)

+2312 10w sonnet

I noticed Claude writing more defensive code after a frustrating debugging session. Got curious whether that was real, so I tested it.
Programming – How can I get great results with this hardware? (www.reddit.com)

5 10w sonnet qwen claude-code

Premise: Up to now I’ve tried LM Studio with a few models, and I think I also configured everything correctly to make it work. On top of that, I added Continue in VS Code.
$1,400/month with Cursor + Claude API — how are you managing costs while keeping a real agentic workflow? (www.reddit.com)

+535 10w cline sonnet cursor+3

Hey, This month I hit $1,200 in Claude API costs inside Cursor (Opus 4.6 + Sonnet 4.6) on top of the $200/mo Ultra plan. $1,400 total.
Sonnet 4.6 Medium Braind? (www.reddit.com)

5 11w sonnet

What does this mean? I see they added a "Medium" suffix next to the Sonnet 4.6 name.
