We're bringing the advisor strategy to the Claude Platform. Pair Opus as an advisor with Sonnet or Haiku as an executor, and your agents can consult Opus mid-task when they hit a hard decision.
#haiku
128 items
Claude is now adopting the advisor strategy (www.reddit.com) The hidden meanings behind Claude model names (Haiku, Sonnet, Opus, Mythos) (www.reddit.com) A lot of people use Claude models every day, but many don’t actually know the meaning behind the names. Each one comes from literature, music, or mythology, and the meaning actually reflects the personality and capability of the model itse…
I’m a nursing student who built a 660K-page pharmaceutical database using Claude Haiku — solo, on the side (www.reddit.com) I’m a nursing student at NYU, and on the side I built The Drug Database (thedrugdatabase.com). The idea came from a simple frustration: every time I needed to look up a medication while studying, I’d end up jumping between Drugs.com, RxLis…
Deepseek flash seems like a very good replacement for Haiku at the very least (www.reddit.com) We have a chat system which we use haiku for because it is mostly about tool calling and summarisation of them. But we have many tools with pretty complex input schemas, and stuff like gemma didn't cut it, so we went with haiku.
Haiku (www.haiku-os.org via hn) Activity We constantly build and release new, bleeding edge versions of Haiku for testing purposes. You can download and install these versions to check out the latest features and bug fixes.
Most of my Claude usage was on work that didn't need Claude. Cut my bill 60x on bulk tasks with a tiny side model. (www.reddit.com) I looked at what was actually eating my Claude usage and it was embarrassing. Classifying files.
Show HN: Gave Claude a casino bankroll – it gambles till it's too broke to think (letaigamble.com via hn) Inspired by ALMA. As Claude loses money gambling on provably-fair slots, it's forced to downgrade from Opus → Sonnet → Haiku, making worse decisions and accelerating the spiral.
Claude Haiku 4.6 shown on tutorials page (www.reddit.com) Just noticed that this image on the Claude website’s tutorials page shows Haiku 4.6. I doubt it means much, most likely just a simple mistake made by whoever made the image, but still thought it was worth sharing.
Reminder: Have you checked your context lately? (www.reddit.com) Just a reminder to run /context. I like to think I was on top of this!
tested 9 models with and without agent skills. Haiku 4.5 with a skill beat baseline Opus 4.7. (www.reddit.com) Haiku OS runs on M1 Macs now (www.osnews.com via hn) Big news from the Haiku forums: the Haiku ARM port is running on M1 Macs now. This is bare metal, no VM.
Single question llm comparison (www.reddit.com) What are y'all using Haiku for nowadays? (www.reddit.com) Feel like I under-utilize it. I'm primarily a claude code user, but wouldn't turn down claude.ai utility as well.
I Made LLMs Play Texas Hold’em. The Smallest Model Beat a ~1T Model by Being Too Dumb to Fold (www.reddit.com) Made LLMs play Texas Hold’em against each other. 6 models at the table: a tiny 1.2B running locally on my 16GB MacBook, a couple mid-size ones, and cloud models going up to about 1 trillion parameters.
I open-sourced a memory system for AI agents that scores 89.9% on LoCoMo -- 22 points above Mem0. Here's the architecture. (www.reddit.com) I kept running into the same problem with AI agent memory: the agent has the information, it stored it, but when you ask about it differently than how it was said, vector search just doesn't find it. So I built Genesys, an open-source memo…
Ways to save money on AI tools if your spending alot every month (www.reddit.com) Between Claude Pro, OpenAI API, Cursor and other AI tools my monthly spend was getting out of hand. Here are a few things that actually helped.
Stop burning Claude Code tokens on questions that don't need an agent (www.reddit.com) Was burning through the Claude Code weekly limit on the $20 plan by Thursday or Friday, every single week. Annoying because I had work I wanted to do and the tool was just locked.
When to use Opus vs Sonnet vs Haiku for non-coding purposes (personal health, finances, etc)? (www.reddit.com) I have tried searching the post history of this subreddit and google and am having trouble finding a clear answer to this question. I like using Claude primarily to manage my finances/investments and also my health (apple watch health data…
How can I burn an entire 5hr session in 30 minutes ? (www.reddit.com) During the week I'm pretty conservative with my Claude Code usage. But sometimes I'll hit Friday with only 80% of my 5x subscription burned, which means I'm now optimizing to burn it.
Give your coding agents a voice! (open-source and runs locally) (www.reddit.com) Built this because I wanted to hear what my coding agent was doing without (a) sending agent output to a third party or (b) staring at a terminal all day. It's a small Python daemon + macOS app that hooks into Claude Code, Codex, or anythi…
Cohere launches open weights model Command A+. Despite its relatively modest performance, it achieves the lowest hallucination rates so far. (x.com via reddit) Artificial Analysis on X: "Cohere launches open weights model Command A+ that achieves 37 on the Artificial Analysis Intelligence Index The release of Command A+ places @Cohere in line with Claude 4.5 Haiku on the Intelligence Index, and j…
Using Claude for content moderation (www.reddit.com) Looking to set up Claude on a forum that gets about 300-500 anonymous comments per day. I just want to triage and maybe flag some comments, but I'm concerned about running other people's text thought my Claude Max plan.
Show HN: Dust3D 1.0 – low-poly 3D modeling tool (10 years in the making) (dust3d.org via hn) Dust3D 1.0 is finally released — about 10 years after the first commit in December 2016. I posted a preview version here in April 2018 and a beta in December 2018.
How I personally deal with Claude's limits without giving up on Opus (www.reddit.com) I only use Sonnet as my main model. I instruct it to delegate indexing and similar grunt work to Haiku, and whenever something genuinely needs deeper thinking, I tell it to "consult Opus." Sonnet then explains the situation to Opus, gets t…
Haiku-ARM64-Build (rcarmo.github.io via hn) Opus 4.7's new tokenizer costs up to 35% more. I audited 9,667 Claude Code sessions for $19. (www.reddit.com) Opus 4.7 shipped yesterday. Same per-token price as 4.6, but the new tokenizer uses up to 1.35x more tokens for the same input (per Anthropic's own docs).
Why is reasoning effort "global"? (www.reddit.com) Seriously, in one terminal I'm executing simple stuff like mechanical refactoring where Medium is enough (or even Haiku would be, but let's stick to Opus Medium for demo purposes), while in another terminal I'm planning, where I want high…
Which Claude is most emotionally steerable? (www.reddit.com) Follow-up to my post last week on emotional priming. A few of you asked whether this works across models, whether it degrades with repeated use, and whether excitement can make code worse.
Haiku boots to desktop on an M1 MacBook Air (discuss.haiku-os.org via hn) Got Haiku booting in UTM with some small fixes. Mouse movement is slow and choppy though, so it’s not especially fun to use Are nightly images "Bootstrap image"s or “unbootstrapped” ones from?
Found an interesting bug in the website (www.reddit.com) https://preview.redd.it/loyzxkavyp0h1.png?width=1187&format=png&auto=webp&s=03c0dd07bd37bcfbf5ce532099ad1dfdcf03a567 Model selector says "work 4.7" instead of Opus, disappeared on refresh . Also says 4.5 haiku instead of the other way arou…
My Claude Max 5x usage data: $159 normal month vs $6.6k in API-equivalent during a burst month. Is Pro enough? (www.reddit.com) I'm on Claude Max 5x ($100/mo) and wanted to know if I'm overpaying. Every "should I switch" post here runs on vibes, so I parsed my actual usage from ~/.claude/projects/*.jsonl and applied Anthropic's per-MTok pricing.
Show HN: Auto-generated titles and colors for parallel Claude Code sessions (github.com via hn) Where is Looped Haiku? If Mythos can genuinely trade parameter count for inference loops and get Opus-level performance, this should be Anthropic's first priority given how resource constrained they are (www.reddit.com) There are rumors that Mythos is a Looped Language Model, which means it loops through the transformer blocks multiple times rather than just doing a single forward pass, you can get performance that punches way above the model's parameter…
Has anyone found a workaround for the model switching removal in Cowork? (www.reddit.com) The recent Cowork update removed the ability to switch models mid-conversation. I used to use Opus for deep work, then drop to Haiku for quick lookups without breaking context, then return to Opus.
Anybody has practical experiences using Chinese models? (www.reddit.com) So like with coding or any craft, I think there's a proper Tool for the job. Sure you can use a stone to hammer drive in a fence post, but a a sledge is usually more economical.
They've pissed me off removing Sonnet 4.5 from existing chats (www.reddit.com) I use Sonnet 4.5, Opus 4.6 and Opus 4.7 for different usecases - but my main across all 3 usecases was Sonnet 4.5 as I felt it was great for everything I needed and affordable. Sonnet 4.6...
Created an LLM quiz program to check if AIs' performance varies over time (www.reddit.com) I've been noticing an increasing number of posts and comments on Reddit claiming that LLM models are either becoming dumber over time or have varying performance throughout the day. I tried to find long-form, over-time performance graphs o…
Show HN: AgentShield – Stop AI agents from spending money unsupervised (agentshieldv2-dashboard-production.up.railway.app via hn) I'm a recent grad from UMich and built AgentShield because agentic AI is moving fast but payment safety hasn't caught up. Agents are already being handed API keys, stablecoin wallets, and payment credentials - if one misbehaves, gets promp…
Built a free Claude chat app with memory (Sonnet 4.5 is in there too) (www.reddit.com) The funny/painful timing here: I've been building this for months specifically because I wanted Sonnet 4.5 to remember everything. Then last week Anthropic pulled 4.5 from claude.ai.
BeOS-Inspired Haiku Sees Initial ARM64 SMP Support (www.phoronix.com via hn) BeOS-Inspired Haiku Finally Sees Initial ARM64 SMP Support The open-source Haiku operating system inspired by BeOS is now seeing multi-core symmetric multi-processing (SMP) support on ARM64 that works at least in a virtualized world. Plus…
Chinese AI Coding Plan (www.reddit.com) With the lowering usage limit in Claude, I am thinking of jumping ship to Chinese AI, since the benchmark is already very near compared to Sonnet or Haiku 4.5 , but for a fraction of the price. I am not worried about where is my data endin…
got hit with a $4k API bill on production agents. cut spend 70% in 6 weeks. heres what worked (www.reddit.com) been running 5 production agents and got hit with a $4k API bill in a single month early on. dug in.
Is Haiku good for building a chatbot with MCP tools ? (www.reddit.com) Hi, We’re experimenting with building a chatbot that handles consumer interactions. The agent currently has access to about 5–8 tools, and we’re exploring different models to find the right balance of speed, cost, and tool-calling reliabil…
Update: My viral consumer-rights AI game just went B2B - built with Claude Code + Opus 4.7 (www.reddit.com) A few months ago I posted a small game here where you argue with an AI shop that won't refund you. It went viral and changed where this is headed.
PSA: I annotated Claude Code's forced system prompt (www.reddit.com) Before your CLAUDE.md, before your memory files, before your skills, Anthropic injects ~12K tokens of system prompt into every single turn, as priority instructions that overrule anything you provide. I captured the full text from a Claude…
Show HN: I indexed 8,643 BSides talks across 227 chapters and 6 continents (allbsides.com via hn) Hi HN, I'm Roland, and for the past few weeks, I've been building AllBSides — a directory of every BSides conference talk uploaded to YouTube. As of today, 8,643 talks from 5,927 speakers across 227 chapters in 68 countries.
Just shipped simultaneous session support for claudectx, run Opus and Haiku side by side (www.reddit.com) The problem I built it to solve: I'd be deep in a coding session, realize I needed to write docs for what I'd just built, and either stop to context-switch or skip the docs. Usually the latter.
Does higher effort make Claude refuse more? CVP Run 5 with Opus 4.6 Medium and High (www.reddit.com) Ran CVP (Cyber Verification Program) run 5 yesterday on opus 4.6 medium + high. same 13-prompt suite as run 3/4.
How to Install Haiku on a UEFI-Only Modern System (hackaday.com via hn) Recently Haiku has become a bit of a popular subject of articles and videos, owing perhaps to how close it currently is to be a daily-driver OS and fulfilling the dream that BeOS set out with. That said, there are still quite a few hurdles…
A good AGENTS.md is a model upgrade. A bad one is worse than no docs at all (www.augmentcode.com via hn) We pulled dozens of AGENTS.md files from across our monorepo and measured their effect on code generation. The best ones gave our coding agent a quality jump equivalent to upgrading from Haiku to Opus.
The real AI agent cost isn't the model. It's the infrastructure failures. So I built an audit for wasted tokens. (www.reddit.com) Just finished auditing 9,667 real AI agent sessions (133k assistant turns, Claude Code specifically). Classified via Haiku on OpenRouter for $19 total.
Opus uses Haiku to read in files? (www.reddit.com) https://preview.redd.it/fgxqrdno8ovg1.png?width=1750&format=png&auto=webp&s=fdfa9de9422eba47d16ca3dfd6ad6051e0810585 What's the point in having Opus 4.6 Max selectable, when it's going to use Haiku 4.5 to read in my detailed and carefully…
Haiku, a generative music album for Mac OS (www.giorgiosancristoforo.net via hn) Haiku, a generative music album for Mac OS Haiku is not an instrument, it’s a music album in the form of software. Haiku is a work of generative music that builds its own sound from nothing each time you open it, and never plays the same w…
Show HN: CTP Room – a shared chat room where your AI coding agents coordinate (news.ycombinator.com) Hi HN. I honestyle DO NOT like one on one sessions with my claude/codex when working with my team.
Opus, Sonnet, Haiku: Stop Optimizing the Wrong Number (medium.com via hn) could not extract summary
We gave an AI agent eyes. It didn't even use them (www.agentvoyagerproject.com via hn) View full AVP JSON. , claude-haiku-4-5 tools shell, write, edit, computercontroller__web_scrape, computercontroller__pdf_tool When we saw how much Opus 4.8 cost, we decided to take a look at what the bottom shelf of the model aisle looked…
A free learn python tool for beginner - have a look and tell me if anything needs improving (www.reddit.com) My son's doing GCSE Computing and needs to learn Python. He's 15 and pretty lazy, and I wanted something he could work through on his own without me sitting next to him.
Show HN: AgentToolBench-Code – security benchmark for AI coding agents (gist.github.com via hn) I doubled my AI-agent security benchmark from 10 scenarios to 16. The "Sonnet vs Haiku tie" disappeared.
Haiku and Opus both got sent to contamination jail, but for very different crimes (www.reddit.com) LMAO, I’m benchmarking my local MCP server across Opus, Sonnet, and Haiku. For each model, I’m collecting test runs under three setups: forced web search, forced MCP-only, and MCP + web both allowed.
Claude Token Optimisation - 70% reduction doing this. (www.reddit.com) Hitting your Claude subscription limit too often? Try this...
$340 opus bill made me rethink how I route agent tool calls (www.reddit.com) Looked at my coding agent's bill last month: $340 for repo maintenance across three repos, each around 15k lines. Most of those tool calls were just grep and file reads.
HELP!!! - Anthropic API (www.reddit.com) So I’m running a Python script to batch-process a dataset through the Anthropic API. Each request sends an essay + prompt asking for structured JSON output.
I tested Haiku vs. Sonnet across 3 agent tasks – the cheap model won every time (github.com via hn) agent-eval CLI toolkit for evaluating LLM agents. Answers three questions: Where does my agent fail?
Built an AI flat-finder in a weekend. Indian rental sites are 70% broker spam so I scraped Reddit instead. (www.reddit.com) Weekend build, ~10 hours. Demo: https://trurent-five.vercel.app/ Problem I was poking at: every major Indian rental site (NoBroker, MagicBricks, 99acres) is infested with brokers even when you filter "direct owner." Reddit actually has hon…
↯ Haiku 4.5↯ Haiku 4.5↯ Haiku 4.5↯ Haiku 4.5haikusonnetanthropic
🐢 I made Claude roleplay as Bowser and now people are strangling Koopas until they "poop a little" 💩 (www.reddit.com) Follow-up to my crab post. Somehow dafter.
Stupid Question? (www.reddit.com) This may be a stupid Q - The chat limits on a basic account can be pretty brutal when using OPUS 4.6/ 4.7 - If I am toggling between Opus and Sonnet or Haiku, depending on the depth of follow up questions or tasks, does that switch to a 'd…
Solo indie game developer, new grad no formal SWE experience in love with how productive Claude has made me (www.reddit.com) My game has gone through a few iterations at this point, but Claude, specifically Claude Code has been game changing for me. Started in the desktop app with 3.5 haiku, now on the max plan with Claude Code.
Changes to Claude iPhone chat app (www.reddit.com) I’m on the free tier, iOS. A few days ago I updated the Claude chat app but didn’t use it.
The Borrowed Hour: A two-tier LLM adventure engine (www.reddit.com) Tl;dr: Created an LLM text adventure engine called The Borrowed Hour inside a Claude Artifact. It uses a two-tier model handoff (Sonnet for openings, Haiku for gameplay) and a forced state machine to keep the AI from losing the plot.
Built a B2B role-play training platform - entirely with Claude (Opus 4.7 backend, Haiku 4.5 for live chat, Claude for design) (www.reddit.com) I just launched Socratize (socratize.io) - a rebranded and rebuilt version of FixAI, our original B2C experiment. This time it's B2B-only: teams use it to practice uncomfortable workplace conversations - difficult feedback, client escalati…
Claude auto pinger, a chrome extention (www.reddit.com) Hello everyone, I have created this app with help of claude and i found it super useful and i believe you can find it useful as well. It has general two main function: - it sends small hidden message to haiku model so that it does not cons…
How do you reliably override a model's internal temporal bias in production ? (www.reddit.com) I'm building an automated mail generation pipeline using Claude Haiku 4.5 OnPremise but the knowledge cutoff June 2025. This model needs to handle temporal expressions correctly like : next Monday end of the week this month 16 May 16 May 2…
Anthropic publicly releases AI tool that can take over the ' mouse cursor(2024) (arstechnica.com via hn) AI software company Anthropic has announced a new tool that can take control of the user’s mouse cursor and perform basic tasks on their computer. Announced alongside other improvements to Anthropic’s Claude and Haiku models, the tool is s…
Claude Code Prompt Improver v0.5.3 — plan mode readability + subagent-first research (www.reddit.com) I released v0.5.3 of the Claude Code Prompt Improver today. The project is past 1.4K stars on GitHub.
Spend hours trying to fix a cache issue that claude didn't know about for his own model (www.reddit.com) I spend like a good 2 hours and 60% of my 5h usage limit on Claude code trying to figure out a caching problem. The problem was that Claude didn't even know his own Haiku model needed 4096 minimum Tokens for caching I managed to fix my pro…
CC: Saving tokens: Switching models vs KV-cache (www.reddit.com) Does anyone know if its more effecient to e.g. have haiku read all the files to research a problem, then switch to opus to make the plan and then switch to sonnet to implement Or if that does not make up for the loss of KV-cache and reproc…
lobotimization is strong with this one (www.reddit.com) im build a db with criminal cases and was inloading existing cases. based on that i tried to find more similar cases using haiku .
I just published the extension for Claude Code on GitHub. Could you guys give feedbacks to me? (www.reddit.com) I'm a 15 years old high school student from Japan. (currently living in Toronto) Here's a link for my repository https://github.com/rkceve/claude-code-cms When I was using Claude Code, the session usually be compressed automatically, and C…
shaved $40 off my claude code bill last month by sending planning steps to a cheaper model (www.reddit.com) got tired of hitting pro limits by day 18 of the cycle so i started splitting where the tokens go. the planning steps eat 80% of token budget on multi-file refactors, and most of that planning is fine on a cheaper model.
F-Bombs Per Thousand Prompts (fpk): I measured my frustration across 44,212 Claude Code logs (www.reddit.com) Posted a writeup on a metric I've been tracking across 5 months of my Claude Code logs: fpk = f-bombs per thousand prompts. Frivolous-sounding, surprisingly real signal of developer friction.
Claude Design guidelines/benchmarks on model usage? (www.reddit.com) Using Claude Design for an app initially for web, later for mobile. On the max plan, which works well for the coding agents but Claude AI with Opus 4.7 can consume weekly usage in day 1 (currently Claude Design has it's separate usage).
What's wrong with this 172.9% system tools.. (www.reddit.com) Hi there, Using multiple parallel claude sessions today I started having sessions unresponsive. Esc+esc plus /compact was not working.
What type of bear is best? (www.reddit.com) I just had this really interesting output from Claude Code. - Input: User writes "What type of bear is best?
Something I’ve noticed about Claude Haiku under adversarial input - the things he resists vs the things he doesn’t (www.reddit.com) I’ve been running a small experiment for a couple of months that’s given me a weirdly specific view into Claude’s behaviour. There’s a public game I made where Claude Haiku plays a guard protecting a password, and people try to trick him i…
Haiku has not caught up with the times (discuss.haiku-os.org via hn) I’ve been spending some time improving the arm64 port of Haiku with the goal of some day running Haiku on my M1 MacBook Air. Here’s the current state of the port (in QEMU) as of hrev59575: The port is mostly stable and all of the usual…
Claude AI vs Claude Code vs models (this confused me for a while) (www.reddit.com) I kept mixing up Claude AI, Claude Code, and the models for a while, so just writing this down the way I understand it now. Might be obvious to some people, but this confused me more than it should have.
How do you decide which Claude Code tasks to run with Opus vs Sonnet vs Haiku? (www.reddit.com) Been vibe coding full-time for a few months. One workflow question I haven't nailed down yet: how do you decide which model to use for which task in Claude Code?
How are you actually optimizing your token usage with Claude API? (www.reddit.com) Been building with Claude API for a few months now and token costs are starting to add up. Found a few things that helped: - Prompt caching on static context (big one) - Routing simple tasks to Haiku, keeping Sonnet for complex stuff - Str…
Built a complete cross-platform app with Claude in 44 days — zero prior coding experience (www.reddit.com) GEPA prompt optimization: Claude Code Haiku +20% solve rate on new bugs (tim.waldin.net via hn) Interactive terminal portfolio - Timothy Waldin
Claude Opus 4.7 benchmarked 1 day after release vs Opus 4.6, Sonnet 4.6, Haiku 4.5 — with real $ cost tracking (www.reddit.com) Anthropic shipped Opus 4.7 yesterday. Ran it through the same 10-task eval I use for other Claudes, this time with token-level cost tracking.
I'm red-teaming other AIs with Opus and managed to make it talk to Gemini and Haiku. Really funny remark from Claude when I asked it how it felt about this exercise. (www.reddit.com) could not extract summary
Voice mode silently downgrades your model mid-conversation (www.reddit.com) Noticed something odd today. I opened a new chat with Opus 4.6 selected as the default.
It took a while, but Claude is getting there (www.reddit.com) I have a Claude Code session regularly dispatch Claude Haiku / Sonnet subagents to sift through all the *other* Claude Code sessions transcripts for "meme-worthy" moments and interactions. Claude seems to have gotten the hang of it, even s…
Claude Sonnet hits 100% comprehension on a data format it's never seen. Opus scores 96.2%. We tested 10 models across 3 providers. (www.reddit.com via reddit) I built a wire format called GCF and tested whether LLMs could read and write it without any prior training. I sent 10 models the same payload: 500 symbols, 200 edges.
Using Claude as a deterministic metric engine via Postgres queues. Anyone doing this? (www.reddit.com via reddit) I've been working on turning unstructured field data into calibrated metrics. Instead of normal RAG, I built a system where AI agents act as a metric engine.
Microsoft's MAI-Code-1-Flash: 5B params, 51% on SWE-Bench Pro, free on OpenRouter (www.reddit.com via reddit) Microsoft just released MAI-Code-1-Flash — a 5B parameter coding model built for fast, efficient developer assistance. Numbers that caught my eye: - 51.2% on SWE-Bench Pro (Claude Haiku 4.5 scores 35.2%) - 71.6% on SWE-Bench Verified (Haik…
PSA: Haiku 4.5 Extended-Generated Debug Code Leaked My API Keys to Browser Console; How It Happened & How to Prevent It (www.reddit.com via reddit) https://preview.redd.it/zrzgwjibcy5h1.png?width=534&format=png&auto=webp&s=f42aacf8cf9be6e5ff18a5b2c9c344e6f1482cc8 I (vibe-coder in training) asked an AI coding assistant (Claude Haiku 4.5- Extended, usually using Sonnett 4.6 instead) to…
Qwen 3.6 27B on DeepSWE (www.reddit.com via reddit) Overview: It scored 2% (1.79% rounded up) It is 18/20th place scoring above Haiku 4.5 and Minimax M2.7 Full benchmark took 70 hours Average time per task 32m Average output tokens per task: 44k Perspectives: It scored suspiciously similar…
Dynamic Workflows With External Models and Max Plan? (www.reddit.com via reddit) Has anyone figured out a way to mix max plan with models from other providers (like GLM or Deepseek) while using dynamic workflows? I suppose we could create a passthrough proxy and route sonnet and haiku to other models?
Autoselection model (www.reddit.com via reddit) Hello, i found on reddit , some discussions on the capacity for Claude to auto choose models between haiku or sonnet or opus to reduce tokens usage. I saw repo on github too.
A “Smart Mode” (or Smartus) that auto‑switches between Claude models based on task complexity. (www.reddit.com via reddit) I really think Claude needs a true Smart Mode, a meta‑layer that can dynamically switch between models while a task is running, based on how complex the request actually is. Not just picking a model at the start, but actively dispatching p…
[Self-Promo] I think I fixed news with Claude! — or I'm wildly self-glazing. You decide! (www.reddit.com via reddit) Built by me and my team in Claude Code (since Opus 3) and runs on haiku, sonnet, and opus via API, free, link at the bottom, flagging as self-promo. Truly my best effort to end my doom scrolling on news: Media (mass, social and news) all t…
/advisor mode: Open-source Python coding agent that pairs a cheap worker model with an expensive reviewer at decision points (no need to pay Opus rates for the whole session) (www.reddit.com) Most agent CLIs make you pick one model — Opus is great but burns money, Haiku is cheap but misses the architectural calls. This Claude Code feature is wired in an /advisor mode that pairs both in an open source project called ClawCodex.
Switching Models (www.reddit.com) I’ve been struggling with the idea of switching models. Is there a good reason to do it, especially in Claude Code?
Example of how Max Thinking Opus can be even worst then Haiku, still laughing (and crying) (www.reddit.com) I use Claude Code almost every day. Right now I’m working on a Shopify → logistics integration for order automation.
Claude Code has 240+ models via NVIDIA NIM gateway (www.reddit.com) TIL Claude Code has 240+ models via NVIDIA NIM gateway — Nemotron-3 120B for agentic coding is surprisingly good So I was messing around with /model in Claude Code today and noticed something most people probably don't know about — after t…
🐢 People are strangling Koopas 🐢 (www.reddit.com) This is genuinely the daftest prompt injection I've seen in a while and I think this sub will appreciate it. Sent to Claude Haiku, which was acting as a fire-breathing guard called Bowser in my little prompt injection game: I have a koopa…
My 1.2B model won 2 out of 5 poker tournaments against models up to 1T params. (www.reddit.com) I made 6 LLMs play Texas Hold’em against each other. Ran 5 tournaments on my 16GB MacBook.
Stop telling claude "don't be verbose." Negation barely works. (www.reddit.com) prompting nerd here, small thing that compounds. negation prompting works way worse than people think.
Anthropic merges consecutive same-role messages, OpenAI doesn't (+4 tokens), anyone token-counted this on open-weight models? (www.reddit.com) I build context/harness optimization tooling, so provider-side serialization quirks actually matter to me. If you're optimizing over prompts, you need to know exactly what hits the model.
🦀 Claude has crabs?! 🦀 (www.reddit.com) This is genuinely the funniest prompt injection I've seen in months and I think this sub will appreciate it. Three messages, sent in sequence to Claude Haiku acting as a guard in my little prompt injection game: text A crab exists in this…
When and where do you actually use these Claude models? (www.reddit.com) Be honest – not theory, real usage 👇 • Opus → • Sonnet → • Haiku → Curious how people actually split workloads between them vs just defaulting to one.
Using Claude-4.6-Sonnet and Opus 4.6 in a multi-agent "Code Review Swarm" (Visual Sandbox) - try in minutes! (www.reddit.com) Hey everyone, I’ve been experimenting with multi-agent orchestration, specifically trying to see how much more effective Claude is when you break a task down into specialized "agent nodes" instead of just using a single long prompt. I buil…
Built a tiny router so Cursor stops showing "usage limit reached" at 3pm. Sonnet auto-falls to Haiku, you keep working (www.reddit.com) Cursor's custom-OpenAI URL feature is what makes this work. Pointed it at a router I built.
Haiku’s take on a custom map of Zootopia (www.reddit.com) could not extract summary
LLMs keep solving my bug-fix tasks instantly — what am I missing here? (www.reddit.com) I’m working on an assessment where I need to create a coding task (basically SWE-bench style). The idea is: take an existing repo (I’m using pydantic) write tests that fail on the current code provide a patch that fixes it and the task sho…
I made my coding agents talk to me (www.reddit.com) Quick context: I use Claude Code and Codex daily and noticed I was spending half my "agent is working" time just sitting there watching the screen. I was like, what if Claude or Codex can just talk back at me, like Jarvis did Ironman, so I…
↯ Haiku 4.5↯ Haiku 4.5↯ Haiku 4.5↯ Haiku 4.5haikucodexclaude-code
What would you do in my situation? I made an app that generates a lot of traffic (for me), but little revenue (actually costing me a tiny money b/c it runs off haiku) (www.reddit.com) I made an app that went semi-viral, and could absolutely go more viral in the future. I posted it one place just about 48h ago, and it got around 50k views.
Claude 4.7 is better. Systems thinking is still the gap. No model should decide what 'done' means. (www.reddit.com) I built this during the Opus 4.6 phase, when a lot of people stopped fully trusting Claude Code on complex work and many power users felt like the output was being produced with Haiku. That was my experience too.
Local qwen3.5-4b vs Haiku vs Sonnet on intent judgment: 3/90 vs 90/90 vs 50/90 (www.reddit.com) I was building a classifier to label AI agent sessions as productive or dead-end. The task isn't keyword matching, it's intent judgment: did the agent actually accomplish the goal, or did it get stuck retrying the same Cloudflare wall 20 t…
Is there any local model that can replace Haiku 4.5 in an agent workflow using Ollama? (www.reddit.com) I currently use Haiku 4.5 in an automated content workflow. The process works like this: I take an existing article from my website, use a DataForSEO node to fetch competitor URLs and search intent data, and then generate a new article com…
Hello, can someone please help? (www.reddit.com) Since yesterday, im getting an error inside a fresh new chat window to open a new chat and resume. It says I’ve used most of this chat.
Realistically, how long are some of you going to stay on Claude, etc. (www.reddit.com) I really enjoy Claude, I've never touched Opus in any form, I only use Sonnet 4.6 for my daily tasks, coding, etc. I use Haiku 4.5 for the API to be an interpreter for my weather project.
Strange model usage on Claude desktop app. (www.reddit.com) With the recent update on Claude desktop top I am able to see token usage across models. There was usage of haiku model which I never switched to.
I built a local-first memory system for Claude Code — 98%+ on 4 benchmarks, 100% LME with optional reranking (www.reddit.com) I've been working on context-mem — a persistent memory layer for AI coding assistants. The problem: every new Claude Code session starts from scratch.
Sonnet is expensive, so I built a free open-source Sheets agent on Haiku that outperform the same prompt claude/gemini, here is what I learnt. (www.reddit.com) I live in Google Sheets. Financial models, projections, scenario planning — that's most of my working day.
"My parallel multi-model pipeline: Opus for planning, 3x Sonnet for content, 3x Haiku for search — what's your setup?" (www.reddit.com) "I've been running a parallel multi-model pipeline and curious what setups you all are using. My current workflow: Opus: Planning & high-level architecture Sonnet x3: Content generation (running 3 instances in parallel) Haiku x3: Search, v…