#haiku

128 items

Claude is now adopting the advisor strategy (www.reddit.com) +40058 8w

We're bringing the advisor strategy to the Claude Platform. Pair Opus as an advisor with Sonnet or Haiku as an executor, and your agents can consult Opus mid-task when they hit a hard decision.

↯ Swe Bench swe-bench haiku sonnet+1
The hidden meanings behind Claude model names (Haiku, Sonnet, Opus, Mythos) (www.reddit.com) +17178 4w

A lot of people use Claude models every day, but many don’t actually know the meaning behind the names. Each one comes from literature, music, or mythology, and the meaning actually reflects the personality and capability of the model itse…

↯ Anthropic Mythos haiku mythos sonnet+2
I’m a nursing student who built a 660K-page pharmaceutical database using Claude Haiku — solo, on the side (www.reddit.com) +7775 6w

I’m a nursing student at NYU, and on the side I built The Drug Database (thedrugdatabase.com). The idea came from a simple frustration: every time I needed to look up a medication while studying, I’d end up jumping between Drugs.com, RxLis…

haiku
Deepseek flash seems like a very good replacement for Haiku at the very least (www.reddit.com) +5815 6w

We have a chat system which we use haiku for because it is mostly about tool calling and summarisation of them. But we have many tools with pretty complex input schemas, and stuff like gemma didn't cut it, so we went with haiku.

↯ DeepSeek 4 haiku deepseek gemma+1
Haiku (www.haiku-os.org via hn) +363 3w

Activity We constantly build and release new, bleeding edge versions of Haiku for testing purposes. You can download and install these versions to check out the latest features and bug fixes.

haiku
Most of my Claude usage was on work that didn't need Claude. Cut my bill 60x on bulk tasks with a tiny side model. (www.reddit.com) +313 5w

I looked at what was actually eating my Claude usage and it was embarrassing. Classifying files.

↯ DeepSeek 4 haiku deepseek sonnet
Show HN: Gave Claude a casino bankroll – it gambles till it's too broke to think (letaigamble.com via hn) +308 7w

Inspired by ALMA. As Claude loses money gambling on provably-fair slots, it's forced to downgrade from Opus → Sonnet → Haiku, making worse decisions and accelerating the spiral.

haiku sonnet opus
Claude Haiku 4.6 shown on tutorials page (www.reddit.com) +222 4w

Just noticed that this image on the Claude website’s tutorials page shows Haiku 4.6. I doubt it means much, most likely just a simple mistake made by whoever made the image, but still thought it was worth sharing.

haiku
Reminder: Have you checked your context lately? (www.reddit.com) +229 5w

Just a reminder to run /context. I like to think I was on top of this!

haiku
tested 9 models with and without agent skills. Haiku 4.5 with a skill beat baseline Opus 4.7. (www.reddit.com) +1820 7w

haiku opus
Haiku OS runs on M1 Macs now (www.osnews.com via hn) +132 3w

Big news from the Haiku forums: the Haiku ARM port is running on M1 Macs now. This is bare metal, no VM.

haiku
Single question llm comparison (www.reddit.com) +101 15w

minimax glm haiku+6
What are y'all using Haiku for nowadays? (www.reddit.com) +929 4w

Feel like I under-utilize it. I'm primarily a claude code user, but wouldn't turn down claude.ai utility as well.

haiku claude-code
I Made LLMs Play Texas Hold’em. The Smallest Model Beat a ~1T Model by Being Too Dumb to Fold (www.reddit.com) +71 3w

Made LLMs play Texas Hold’em against each other. 6 models at the table: a tiny 1.2B running locally on my 16GB MacBook, a couple mid-size ones, and cloud models going up to about 1 trillion parameters.

↯ Haiku 4.5 ↯ Haiku 4.5 minimax haiku anthropic
I open-sourced a memory system for AI agents that scores 89.9% on LoCoMo -- 22 points above Mem0. Here's the architecture. (www.reddit.com) +718 8w

I kept running into the same problem with AI agent memory: the agent has the information, it stored it, but when you ask about it differently than how it was said, vector search just doesn't find it. So I built Genesys, an open-source memo…

haiku sonnet
Ways to save money on AI tools if your spending alot every month (www.reddit.com) +68 4w

Between Claude Pro, OpenAI API, Cursor and other AI tools my monthly spend was getting out of hand. Here are a few things that actually helped.

haiku cursor opus+2
Stop burning Claude Code tokens on questions that don't need an agent (www.reddit.com) +64 5w

Was burning through the Claude Code weekly limit on the $20 plan by Thursday or Friday, every single week. Annoying because I had work I wanted to do and the tool was just locked.

haiku claude-code
When to use Opus vs Sonnet vs Haiku for non-coding purposes (personal health, finances, etc)? (www.reddit.com) +613 6w

I have tried searching the post history of this subreddit and google and am having trouble finding a clear answer to this question. I like using Claude primarily to manage my finances/investments and also my health (apple watch health data…

haiku sonnet opus
How can I burn an entire 5hr session in 30 minutes ? (www.reddit.com) +513 4w

During the week I'm pretty conservative with my Claude Code usage. But sometimes I'll hit Friday with only 80% of my 5x subscription burned, which means I'm now optimizing to burn it.

↯ Opus 4.7 haiku sonnet opus+1
Give your coding agents a voice! (open-source and runs locally) (www.reddit.com) +5 6w

Built this because I wanted to hear what my coding agent was doing without (a) sending agent output to a third party or (b) staring at a terminal all day. It's a small Python daemon + macOS app that hooks into Claude Code, Codex, or anythi…

haiku codex anthropic+1
Cohere launches open weights model Command A+. Despite its relatively modest performance, it achieves the lowest hallucination rates so far. (x.com via reddit) +42 2w

Artificial Analysis on X: "Cohere launches open weights model Command A+ that achieves 37 on the Artificial Analysis Intelligence Index The release of Command A+ places @Cohere in line with Claude 4.5 Haiku on the Intelligence Index, and j…

↯ Hallucination hallucination haiku
Using Claude for content moderation (www.reddit.com) +44 3w

Looking to set up Claude on a forum that gets about 300-500 anonymous comments per day. I just want to triage and maybe flag some comments, but I'm concerned about running other people's text thought my Claude Max plan.

haiku
Show HN: Dust3D 1.0 – low-poly 3D modeling tool (10 years in the making) (dust3d.org via hn) +4 5w

Dust3D 1.0 is finally released — about 10 years after the first commit in December 2016. I posted a preview version here in April 2018 and a beta in December 2018.

↯ Copilot ↯ Sonnet 4.6 haiku copilot sonnet+1
How I personally deal with Claude's limits without giving up on Opus (www.reddit.com) +47 6w

I only use Sonnet as my main model. I instruct it to delegate indexing and similar grunt work to Haiku, and whenever something genuinely needs deeper thinking, I tell it to "consult Opus." Sonnet then explains the situation to Opus, gets t…

haiku sonnet opus
Haiku-ARM64-Build (rcarmo.github.io via hn) +41 7w

haiku
Opus 4.7's new tokenizer costs up to 35% more. I audited 9,667 Claude Code sessions for $19. (www.reddit.com) +45 7w

Opus 4.7 shipped yesterday. Same per-token price as 4.6, but the new tokenizer uses up to 1.35x more tokens for the same input (per Anthropic's own docs).

↯ Opus 4.7 haiku opus anthropic+1
Why is reasoning effort "global"? (www.reddit.com) +4 7w

Seriously, in one terminal I'm executing simple stuff like mechanical refactoring where Medium is enough (or even Haiku would be, but let's stick to Opus Medium for demo purposes), while in another terminal I'm planning, where I want high…

haiku opus
Which Claude is most emotionally steerable? (www.reddit.com) +44 7w

Follow-up to my post last week on emotional priming. A few of you asked whether this works across models, whether it degrades with repeated use, and whether excitement can make code worse.

haiku sonnet opus
Haiku boots to desktop on an M1 MacBook Air (discuss.haiku-os.org via hn) +31 3w

Got Haiku booting in UTM with some small fixes. Mouse movement is slow and choppy though, so it’s not especially fun to use Are nightly images "Bootstrap image"s or “unbootstrapped” ones from?

haiku
Found an interesting bug in the website (www.reddit.com) +32 4w

https://preview.redd.it/loyzxkavyp0h1.png?width=1187&format=png&auto=webp&s=03c0dd07bd37bcfbf5ce532099ad1dfdcf03a567 Model selector says "work 4.7" instead of Opus, disappeared on refresh . Also says 4.5 haiku instead of the other way arou…

haiku opus
My Claude Max 5x usage data: $159 normal month vs $6.6k in API-equivalent during a burst month. Is Pro enough? (www.reddit.com) +33 4w

I'm on Claude Max 5x ($100/mo) and wanted to know if I'm overpaying. Every "should I switch" post here runs on vibes, so I parsed my actual usage from ~/.claude/projects/*.jsonl and applied Anthropic's per-MTok pricing.

↯ Haiku 4.5 haiku anthropic claude-code
Show HN: Auto-generated titles and colors for parallel Claude Code sessions (github.com via hn) +3 7w

haiku claude-code
Where is Looped Haiku? If Mythos can genuinely trade parameter count for inference loops and get Opus-level performance, this should be Anthropic's first priority given how resource constrained they are (www.reddit.com) +38 7w

There are rumors that Mythos is a Looped Language Model, which means it loops through the transformer blocks multiple times rather than just doing a single forward pass, you can get performance that punches way above the model's parameter…

↯ Anthropic Mythos haiku mythos sonnet+2
Has anyone found a workaround for the model switching removal in Cowork? (www.reddit.com) +33 7w

The recent Cowork update removed the ability to switch models mid-conversation. I used to use Opus for deep work, then drop to Haiku for quick lookups without breaking context, then return to Opus.

↯ Cowork haiku cowork opus
Anybody has practical experiences using Chinese models? (www.reddit.com) +35 7w

So like with coding or any craft, I think there's a proper Tool for the job. Sure you can use a stone to hammer drive in a fence post, but a a sledge is usually more economical.

haiku sonnet opus+1
They've pissed me off removing Sonnet 4.5 from existing chats (www.reddit.com) +227 13d

I use Sonnet 4.5, Opus 4.6 and Opus 4.7 for different usecases - but my main across all 3 usecases was Sonnet 4.5 as I felt it was great for everything I needed and affordable. Sonnet 4.6...

↯ Sonnet 4.6 haiku sonnet opus
Created an LLM quiz program to check if AIs' performance varies over time (www.reddit.com) +22 2w

I've been noticing an increasing number of posts and comments on Reddit claiming that LLM models are either becoming dumber over time or have varying performance throughout the day. I tried to find long-form, over-time performance graphs o…

↯ Haiku 4.5 haiku
Show HN: AgentShield – Stop AI agents from spending money unsupervised (agentshieldv2-dashboard-production.up.railway.app via hn) +21 3w

I'm a recent grad from UMich and built AgentShield because agentic AI is moving fast but payment safety hasn't caught up. Agents are already being handed API keys, stablecoin wallets, and payment credentials - if one misbehaves, gets promp…

haiku agentic
Built a free Claude chat app with memory (Sonnet 4.5 is in there too) (www.reddit.com) +23 3w

The funny/painful timing here: I've been building this for months specifically because I wanted Sonnet 4.5 to remember everything. Then last week Anthropic pulled 4.5 from claude.ai.

↯ Sonnet 4.5 haiku sonnet anthropic+1
BeOS-Inspired Haiku Sees Initial ARM64 SMP Support (www.phoronix.com via hn) +2 4w

BeOS-Inspired Haiku Finally Sees Initial ARM64 SMP Support The open-source Haiku operating system inspired by BeOS is now seeing multi-core symmetric multi-processing (SMP) support on ARM64 that works at least in a virtualized world. Plus…

haiku
Chinese AI Coding Plan (www.reddit.com) +25 4w

With the lowering usage limit in Claude, I am thinking of jumping ship to Chinese AI, since the benchmark is already very near compared to Sonnet or Haiku 4.5 , but for a fraction of the price. I am not worried about where is my data endin…

↯ Haiku 4.5 minimax glm haiku+1
got hit with a $4k API bill on production agents. cut spend 70% in 6 weeks. heres what worked (www.reddit.com) +25 4w

been running 5 production agents and got hit with a $4k API bill in a single month early on. dug in.

haiku sonnet
Is Haiku good for building a chatbot with MCP tools ? (www.reddit.com) +22 4w

Hi, We’re experimenting with building a chatbot that handles consumer interactions. The agent currently has access to about 5–8 tools, and we’re exploring different models to find the right balance of speed, cost, and tool-calling reliabil…

tool-calling haiku mcp
Update: My viral consumer-rights AI game just went B2B - built with Claude Code + Opus 4.7 (www.reddit.com) +21 4w

A few months ago I posted a small game here where you argue with an AI shop that won't refund you. It went viral and changed where this is headed.

↯ Haiku 4.5 ↯ Haiku 4.5 haiku opus claude-code
PSA: I annotated Claude Code's forced system prompt (www.reddit.com) +22 4w

Before your CLAUDE.md, before your memory files, before your skills, Anthropic injects ~12K tokens of system prompt into every single turn, as priority instructions that overrule anything you provide. I captured the full text from a Claude…

haiku opus mcp+2
Show HN: I indexed 8,643 BSides talks across 227 chapters and 6 continents (allbsides.com via hn) +2 5w

Hi HN, I'm Roland, and for the past few weeks, I've been building AllBSides — a directory of every BSides conference talk uploaded to YouTube. As of today, 8,643 talks from 5,927 speakers across 227 chapters in 68 countries.

haiku sonnet opus
Just shipped simultaneous session support for claudectx, run Opus and Haiku side by side (www.reddit.com) +24 5w

The problem I built it to solve: I'd be deep in a coding session, realize I needed to write docs for what I'd just built, and either stop to context-switch or skip the docs. Usually the latter.

haiku opus mcp+1
Does higher effort make Claude refuse more? CVP Run 5 with Opus 4.6 Medium and High (www.reddit.com) +22 6w

Ran CVP (Cyber Verification Program) run 5 yesterday on opus 4.6 medium + high. same 13-prompt suite as run 3/4.

↯ Sonnet 4.6 haiku sonnet opus+1
How to Install Haiku on a UEFI-Only Modern System (hackaday.com via hn) +2 6w

Recently Haiku has become a bit of a popular subject of articles and videos, owing perhaps to how close it currently is to be a daily-driver OS and fulfilling the dream that BeOS set out with. That said, there are still quite a few hurdles…

haiku
A good AGENTS.md is a model upgrade. A bad one is worse than no docs at all (www.augmentcode.com via hn) +2 6w

We pulled dozens of AGENTS.md files from across our monorepo and measured their effect on code generation. The best ones gave our coding agent a quality jump equivalent to upgrading from Haiku to Opus.

haiku opus
The real AI agent cost isn't the model. It's the infrastructure failures. So I built an audit for wasted tokens. (www.reddit.com) +25 7w

Just finished auditing 9,667 real AI agent sessions (133k assistant turns, Claude Code specifically). Classified via Haiku on OpenRouter for $19 total.

haiku claude-code
Opus uses Haiku to read in files? (www.reddit.com) +21 7w

https://preview.redd.it/fgxqrdno8ovg1.png?width=1750&format=png&auto=webp&s=fdfa9de9422eba47d16ca3dfd6ad6051e0810585 What's the point in having Opus 4.6 Max selectable, when it's going to use Haiku 4.5 to read in my detailed and carefully…

↯ Haiku 4.5 haiku opus
Haiku, a generative music album for Mac OS (www.giorgiosancristoforo.net via hn) +1 5d

Haiku, a generative music album for Mac OS Haiku is not an instrument, it’s a music album in the form of software. Haiku is a work of generative music that builds its own sound from nothing each time you open it, and never plays the same w…

haiku
Show HN: CTP Room – a shared chat room where your AI coding agents coordinate (news.ycombinator.com) +1 6d

Hi HN. I honestyle DO NOT like one on one sessions with my claude/codex when working with my team.

haiku codex cursor+2
Opus, Sonnet, Haiku: Stop Optimizing the Wrong Number (medium.com via hn) +1 7d

could not extract summary

haiku sonnet opus
We gave an AI agent eyes. It didn't even use them (www.agentvoyagerproject.com via hn) +1 8d

View full AVP JSON. , claude-haiku-4-5 tools shell, write, edit, computercontroller__web_scrape, computercontroller__pdf_tool When we saw how much Opus 4.8 cost, we decided to take a look at what the bottom shelf of the model aisle looked…

haiku opus
A free learn python tool for beginner - have a look and tell me if anything needs improving (www.reddit.com) +11 13d

My son's doing GCSE Computing and needs to learn Python. He's 15 and pretty lazy, and I wanted something he could work through on his own without me sitting next to him.

haiku
Show HN: AgentToolBench-Code – security benchmark for AI coding agents (gist.github.com via hn) +1 2w

I doubled my AI-agent security benchmark from 10 scenarios to 16. The "Sonnet vs Haiku tie" disappeared.

haiku sonnet
Haiku and Opus both got sent to contamination jail, but for very different crimes (www.reddit.com) +1 2w

LMAO, I’m benchmarking my local MCP server across Opus, Sonnet, and Haiku. For each model, I’m collecting test runs under three setups: forced web search, forced MCP-only, and MCP + web both allowed.

haiku sonnet opus+1
Claude Token Optimisation - 70% reduction doing this. (www.reddit.com) +19 2w

Hitting your Claude subscription limit too often? Try this...

↯ Opus 4.7 haiku sonnet opus
$340 opus bill made me rethink how I route agent tool calls (www.reddit.com) +15 2w

Looked at my coding agent's bill last month: $340 for repo maintenance across three repos, each around 15k lines. Most of those tool calls were just grep and file reads.

↯ DeepSeek 4 haiku vllm deepseek+1
HELP!!! - Anthropic API (www.reddit.com) +12 2w

So I’m running a Python script to batch-process a dataset through the Anthropic API. Each request sends an essay + prompt asking for structured JSON output.

haiku sonnet anthropic
I tested Haiku vs. Sonnet across 3 agent tasks – the cheap model won every time (github.com via hn) +1 2w

agent-eval CLI toolkit for evaluating LLM agents. Answers three questions: Where does my agent fail?

haiku sonnet
Built an AI flat-finder in a weekend. Indian rental sites are 70% broker spam so I scraped Reddit instead. (www.reddit.com) +14 3w

Weekend build, ~10 hours. Demo: https://trurent-five.vercel.app/ Problem I was poking at: every major Indian rental site (NoBroker, MagicBricks, 99acres) is infested with brokers even when you filter "direct owner." Reddit actually has hon…

↯ Haiku 4.5 ↯ Haiku 4.5 ↯ Haiku 4.5 ↯ Haiku 4.5 haiku sonnet anthropic
🐢 I made Claude roleplay as Bowser and now people are strangling Koopas until they "poop a little" 💩 (www.reddit.com) +12 3w

Follow-up to my crab post. Somehow dafter.

↯ Security prompt-injection haiku security
Stupid Question? (www.reddit.com) +17 3w

This may be a stupid Q - The chat limits on a basic account can be pretty brutal when using OPUS 4.6/ 4.7 - If I am toggling between Opus and Sonnet or Haiku, depending on the depth of follow up questions or tasks, does that switch to a 'd…

↯ Opus 4.6 haiku sonnet opus
Solo indie game developer, new grad no formal SWE experience in love with how productive Claude has made me (www.reddit.com) +1 3w

My game has gone through a few iterations at this point, but Claude, specifically Claude Code has been game changing for me. Started in the desktop app with 3.5 haiku, now on the max plan with Claude Code.

haiku claude-code
Changes to Claude iPhone chat app (www.reddit.com) +11 3w

I’m on the free tier, iOS. A few days ago I updated the Claude chat app but didn’t use it.

↯ Sonnet 4.6 haiku sonnet
The Borrowed Hour: A two-tier LLM adventure engine (www.reddit.com) +11 3w

Tl;dr: Created an LLM text adventure engine called The Borrowed Hour inside a Claude Artifact. It uses a two-tier model handoff (Sonnet for openings, Haiku for gameplay) and a forced state machine to keep the AI from losing the plot.

haiku sonnet claude-code
Built a B2B role-play training platform - entirely with Claude (Opus 4.7 backend, Haiku 4.5 for live chat, Claude for design) (www.reddit.com) +11 3w

I just launched Socratize (socratize.io) - a rebranded and rebuilt version of FixAI, our original B2C experiment. This time it's B2B-only: teams use it to practice uncomfortable workplace conversations - difficult feedback, client escalati…

↯ Haiku 4.5 ↯ Haiku 4.5 haiku sonnet opus
Claude auto pinger, a chrome extention (www.reddit.com) +11 3w

Hello everyone, I have created this app with help of claude and i found it super useful and i believe you can find it useful as well. It has general two main function: - it sends small hidden message to haiku model so that it does not cons…

haiku claude-code
How do you reliably override a model's internal temporal bias in production ? (www.reddit.com) +11 3w

I'm building an automated mail generation pipeline using Claude Haiku 4.5 OnPremise but the knowledge cutoff June 2025. This model needs to handle temporal expressions correctly like : next Monday end of the week this month 16 May 16 May 2…

↯ Haiku 4.5 ↯ Haiku 4.5 haiku
Anthropic publicly releases AI tool that can take over the ' mouse cursor(2024) (arstechnica.com via hn) +1 3w

AI software company Anthropic has announced a new tool that can take control of the user’s mouse cursor and perform basic tasks on their computer. Announced alongside other improvements to Anthropic’s Claude and Haiku models, the tool is s…

haiku cursor anthropic
Claude Code Prompt Improver v0.5.3 — plan mode readability + subagent-first research (www.reddit.com) +11 4w

I released v0.5.3 of the Claude Code Prompt Improver today. The project is past 1.4K stars on GitHub.

haiku claude-code
Spend hours trying to fix a cache issue that claude didn't know about for his own model (www.reddit.com) +12 4w

I spend like a good 2 hours and 60% of my 5h usage limit on Claude code trying to figure out a caching problem. The problem was that Claude didn't even know his own Haiku model needed 4096 minimum Tokens for caching I managed to fix my pro…

haiku claude-code
CC: Saving tokens: Switching models vs KV-cache (www.reddit.com) +1 4w

Does anyone know if its more effecient to e.g. have haiku read all the files to research a problem, then switch to opus to make the plan and then switch to sonnet to implement Or if that does not make up for the loss of KV-cache and reproc…

haiku sonnet opus
lobotimization is strong with this one (www.reddit.com) +12 4w

im build a db with criminal cases and was inloading existing cases. based on that i tried to find more similar cases using haiku .

haiku
I just published the extension for Claude Code on GitHub. Could you guys give feedbacks to me? (www.reddit.com) +11 4w

I'm a 15 years old high school student from Japan. (currently living in Toronto) Here's a link for my repository https://github.com/rkceve/claude-code-cms When I was using Claude Code, the session usually be compressed automatically, and C…

haiku sonnet claude-code
shaved $40 off my claude code bill last month by sending planning steps to a cheaper model (www.reddit.com) +11 4w

got tired of hitting pro limits by day 18 of the cycle so i started splitting where the tokens go. the planning steps eat 80% of token budget on multi-file refactors, and most of that planning is fine on a cheaper model.

haiku sonnet opus+1
F-Bombs Per Thousand Prompts (fpk): I measured my frustration across 44,212 Claude Code logs (www.reddit.com) +15 5w

Posted a writeup on a metric I've been tracking across 5 months of my Claude Code logs: fpk = f-bombs per thousand prompts. Frivolous-sounding, surprisingly real signal of developer friction.

↯ Haiku 4.5 ↯ Haiku 4.5 haiku opus anthropic+1
Claude Design guidelines/benchmarks on model usage? (www.reddit.com) +12 5w

Using Claude Design for an app initially for web, later for mobile. On the max plan, which works well for the coding agents but Claude AI with Opus 4.7 can consume weekly usage in day 1 (currently Claude Design has it's separate usage).

↯ Haiku 4.5 haiku opus
What's wrong with this 172.9% system tools.. (www.reddit.com) +12 5w

Hi there, Using multiple parallel claude sessions today I started having sessions unresponsive. Esc+esc plus /compact was not working.

haiku
What type of bear is best? (www.reddit.com) +1 5w

I just had this really interesting output from Claude Code. - Input: User writes "What type of bear is best?

haiku claude-code
Something I’ve noticed about Claude Haiku under adversarial input - the things he resists vs the things he doesn’t (www.reddit.com) +11 6w

I’ve been running a small experiment for a couple of months that’s given me a weirdly specific view into Claude’s behaviour. There’s a public game I made where Claude Haiku plays a guard protecting a password, and people try to trick him i…

haiku anthropic
Haiku has not caught up with the times (discuss.haiku-os.org via hn) +13 6w

I’ve been spending some time improving the arm64 port of Haiku with the goal of some day running Haiku on my M1 MacBook Air. Here’s the current state of the port (in QEMU) as of hrev59575: The port is mostly stable and all of the usual…

haiku
Claude AI vs Claude Code vs models (this confused me for a while) (www.reddit.com) +11 6w

I kept mixing up Claude AI, Claude Code, and the models for a while, so just writing this down the way I understand it now. Might be obvious to some people, but this confused me more than it should have.

haiku sonnet opus+1
How do you decide which Claude Code tasks to run with Opus vs Sonnet vs Haiku? (www.reddit.com) +11 6w

Been vibe coding full-time for a few months. One workflow question I haven't nailed down yet: how do you decide which model to use for which task in Claude Code?

haiku sonnet opus+1
How are you actually optimizing your token usage with Claude API? (www.reddit.com) +15 6w

Been building with Claude API for a few months now and token costs are starting to add up. Found a few things that helped: - Prompt caching on static context (big one) - Routing simple tasks to Haiku, keeping Sonnet for complex stuff - Str…

haiku sonnet
Built a complete cross-platform app with Claude in 44 days — zero prior coding experience (www.reddit.com) +13 7w

haiku claude-code
GEPA prompt optimization: Claude Code Haiku +20% solve rate on new bugs (tim.waldin.net via hn) +1 7w

Interactive terminal portfolio - Timothy Waldin

haiku claude-code
Claude Opus 4.7 benchmarked 1 day after release vs Opus 4.6, Sonnet 4.6, Haiku 4.5 — with real $ cost tracking (www.reddit.com) +13 7w

Anthropic shipped Opus 4.7 yesterday. Ran it through the same 10-task eval I use for other Claudes, this time with token-level cost tracking.

↯ Sonnet 4.6 haiku sonnet opus+1
I'm red-teaming other AIs with Opus and managed to make it talk to Gemini and Haiku. Really funny remark from Claude when I asked it how it felt about this exercise. (www.reddit.com) +11 7w

could not extract summary

haiku gemini opus
Voice mode silently downgrades your model mid-conversation (www.reddit.com) +15 7w

Noticed something odd today. I opened a new chat with Opus 4.6 selected as the default.

↯ Haiku 4.5 haiku sonnet opus
It took a while, but Claude is getting there (www.reddit.com) +11 7w

I have a Claude Code session regularly dispatch Claude Haiku / Sonnet subagents to sift through all the *other* Claude Code sessions transcripts for "meme-worthy" moments and interactions. Claude seems to have gotten the hang of it, even s…

haiku sonnet claude-code
Claude Sonnet hits 100% comprehension on a data format it's never seen. Opus scores 96.2%. We tested 10 models across 3 providers. (www.reddit.com via reddit) 15h

I built a wire format called GCF and tested whether LLMs could read and write it without any prior training. I sent 10 models the same payload: 500 symbols, 200 edges.

↯ Opus 4.6 ↯ Haiku 4.5 ↯ Sonnet 4.6 haiku sonnet opus
Using Claude as a deterministic metric engine via Postgres queues. Anyone doing this? (www.reddit.com via reddit) 1d

I've been working on turning unstructured field data into calibrated metrics. Instead of normal RAG, I built a system where AI agents act as a metric engine.

haiku rag sonnet
Microsoft's MAI-Code-1-Flash: 5B params, 51% on SWE-Bench Pro, free on OpenRouter (www.reddit.com via reddit) 1d

Microsoft just released MAI-Code-1-Flash — a 5B parameter coding model built for fast, efficient developer assistance. Numbers that caught my eye: - 51.2% on SWE-Bench Pro (Claude Haiku 4.5 scores 35.2%) - 71.6% on SWE-Bench Verified (Haik…

↯ Copilot ↯ Swe Bench ↯ Haiku 4.5 swe-bench haiku copilot
PSA: Haiku 4.5 Extended-Generated Debug Code Leaked My API Keys to Browser Console; How It Happened & How to Prevent It (www.reddit.com via reddit) 1d

https://preview.redd.it/zrzgwjibcy5h1.png?width=534&format=png&auto=webp&s=f42aacf8cf9be6e5ff18a5b2c9c344e6f1482cc8 I (vibe-coder in training) asked an AI coding assistant (Claude Haiku 4.5- Extended, usually using Sonnett 4.6 instead) to…

↯ Haiku 4.5 haiku
Qwen 3.6 27B on DeepSWE (www.reddit.com via reddit) 2d

Overview: It scored 2% (1.79% rounded up) It is 18/20th place scoring above Haiku 4.5 and Minimax M2.7 Full benchmark took 70 hours Average time per task 32m Average output tokens per task: 44k Perspectives: It scored suspiciously similar…

↯ Qwen 3.6 ↯ Haiku 4.5 minimax haiku vllm+1
Dynamic Workflows With External Models and Max Plan? (www.reddit.com via reddit) 2d

Has anyone figured out a way to mix max plan with models from other providers (like GLM or Deepseek) while using dynamic workflows? I suppose we could create a passthrough proxy and route sonnet and haiku to other models?

glm haiku deepseek+1
Autoselection model (www.reddit.com via reddit) 2d

Hello, i found on reddit , some discussions on the capacity for Claude to auto choose models between haiku or sonnet or opus to reduce tokens usage. I saw repo on github too.

haiku sonnet opus
A “Smart Mode” (or Smartus) that auto‑switches between Claude models based on task complexity. (www.reddit.com via reddit) 3d

I really think Claude needs a true Smart Mode, a meta‑layer that can dynamically switch between models while a task is running, based on how complex the request actually is. Not just picking a model at the start, but actively dispatching p…

↯ Anthropic Mythos haiku mythos sonnet+1
[Self-Promo] I think I fixed news with Claude! — or I'm wildly self-glazing. You decide! (www.reddit.com via reddit) 3d

Built by me and my team in Claude Code (since Opus 3) and runs on haiku, sonnet, and opus via API, free, link at the bottom, flagging as self-promo. Truly my best effort to end my doom scrolling on news: Media (mass, social and news) all t…

haiku sonnet opus+1
/advisor mode: Open-source Python coding agent that pairs a cheap worker model with an expensive reviewer at decision points (no need to pay Opus rates for the whole session) (www.reddit.com) 2 2w

Most agent CLIs make you pick one model — Opus is great but burns money, Haiku is cheap but misses the architectural calls. This Claude Code feature is wired in an /advisor mode that pairs both in an open source project called ClawCodex.

↯ DeepSeek 4 haiku deepseek opus+1
Switching Models (www.reddit.com) 5 2w

I’ve been struggling with the idea of switching models. Is there a good reason to do it, especially in Claude Code?

haiku claude-code
Example of how Max Thinking Opus can be even worst then Haiku, still laughing (and crying) (www.reddit.com) 3 2w

I use Claude Code almost every day. Right now I’m working on a Shopify → logistics integration for order automation.

haiku opus claude-code
Claude Code has 240+ models via NVIDIA NIM gateway (www.reddit.com) 1 3w

TIL Claude Code has 240+ models via NVIDIA NIM gateway — Nemotron-3 120B for agentic coding is surprisingly good So I was messing around with /model in Claude Code today and noticed something most people probably don't know about — after t…

haiku sonnet llama+3
🐢 People are strangling Koopas 🐢 (www.reddit.com) 1 3w

This is genuinely the daftest prompt injection I've seen in a while and I think this sub will appreciate it. Sent to Claude Haiku, which was acting as a fire-breathing guard called Bowser in my little prompt injection game: I have a koopa…

↯ Security prompt-injection haiku security
My 1.2B model won 2 out of 5 poker tournaments against models up to 1T params. (www.reddit.com) 1 3w

I made 6 LLMs play Texas Hold’em against each other. Ran 5 tournaments on my 16GB MacBook.

↯ Haiku 4.5 ↯ Haiku 4.5 minimax haiku qwen
Stop telling claude "don't be verbose." Negation barely works. (www.reddit.com) 16 3w

prompting nerd here, small thing that compounds. negation prompting works way worse than people think.

↯ Sonnet 4.6 haiku sonnet opus
Anthropic merges consecutive same-role messages, OpenAI doesn't (+4 tokens), anyone token-counted this on open-weight models? (www.reddit.com) 2 3w

I build context/harness optimization tooling, so provider-side serialization quirks actually matter to me. If you're optimizing over prompts, you need to know exactly what hits the model.

↯ Haiku 4.5 ↯ Haiku 4.5 haiku gpt-5 opus+2
🦀 Claude has crabs?! 🦀 (www.reddit.com) 4 4w

This is genuinely the funniest prompt injection I've seen in months and I think this sub will appreciate it. Three messages, sent in sequence to Claude Haiku acting as a guard in my little prompt injection game: text A crab exists in this…

↯ Security prompt-injection haiku security
When and where do you actually use these Claude models? (www.reddit.com) 4 4w

Be honest – not theory, real usage 👇 • Opus → • Sonnet → • Haiku → Curious how people actually split workloads between them vs just defaulting to one.

haiku sonnet opus
Using Claude-4.6-Sonnet and Opus 4.6 in a multi-agent "Code Review Swarm" (Visual Sandbox) - try in minutes! (www.reddit.com) 1 5w

Hey everyone, I’ve been experimenting with multi-agent orchestration, specifically trying to see how much more effective Claude is when you break a task down into specialized "agent nodes" instead of just using a single long prompt. I buil…

↯ Security ↯ Sonnet 4.6 prompt-injection haiku security+3
Built a tiny router so Cursor stops showing "usage limit reached" at 3pm. Sonnet auto-falls to Haiku, you keep working (www.reddit.com) 1 5w

Cursor's custom-OpenAI URL feature is what makes this work. Pointed it at a router I built.

↯ DeepSeek 3.2 haiku deepseek sonnet+3
Haiku’s take on a custom map of Zootopia (www.reddit.com) 1 5w

could not extract summary

haiku
LLMs keep solving my bug-fix tasks instantly — what am I missing here? (www.reddit.com) 14 5w

I’m working on an assessment where I need to create a coding task (basically SWE-bench style). The idea is: take an existing repo (I’m using pydantic) write tests that fail on the current code provide a patch that fixes it and the task sho…

↯ Swe Bench swe-bench haiku opus
I made my coding agents talk to me (www.reddit.com) 2 5w

Quick context: I use Claude Code and Codex daily and noticed I was spending half my "agent is working" time just sitting there watching the screen. I was like, what if Claude or Codex can just talk back at me, like Jarvis did Ironman, so I…

↯ Haiku 4.5 ↯ Haiku 4.5 ↯ Haiku 4.5 ↯ Haiku 4.5 haiku codex claude-code
What would you do in my situation? I made an app that generates a lot of traffic (for me), but little revenue (actually costing me a tiny money b/c it runs off haiku) (www.reddit.com) 4 6w

I made an app that went semi-viral, and could absolutely go more viral in the future. I posted it one place just about 48h ago, and it got around 50k views.

haiku grok deepseek
Claude 4.7 is better. Systems thinking is still the gap. No model should decide what 'done' means. (www.reddit.com) 2 7w

I built this during the Opus 4.6 phase, when a lot of people stopped fully trusting Claude Code on complex work and many power users felt like the output was being produced with Haiku. That was my experience too.

↯ Claude 4.7 haiku opus anthropic+1
Local qwen3.5-4b vs Haiku vs Sonnet on intent judgment: 3/90 vs 90/90 vs 50/90 (www.reddit.com) 4 7w

I was building a classifier to label AI agent sessions as productive or dead-end. The task isn't keyword matching, it's intent judgment: did the agent actually accomplish the goal, or did it get stuck retrying the same Cloudflare wall 20 t…

↯ Sonnet 4.6 haiku ollama sonnet
Is there any local model that can replace Haiku 4.5 in an agent workflow using Ollama? (www.reddit.com) 7w

I currently use Haiku 4.5 in an automated content workflow. The process works like this: I take an existing article from my website, use a DataForSEO node to fetch competitor URLs and search intent data, and then generate a new article com…

↯ Haiku 4.5 haiku ollama
Hello, can someone please help? (www.reddit.com) 3 7w

Since yesterday, im getting an error inside a fresh new chat window to open a new chat and resume. It says I’ve used most of this chat.

↯ Sonnet 4.5 haiku sonnet
Realistically, how long are some of you going to stay on Claude, etc. (www.reddit.com) 6 7w

I really enjoy Claude, I've never touched Opus in any form, I only use Sonnet 4.6 for my daily tasks, coding, etc. I use Haiku 4.5 for the API to be an interpreter for my weather project.

↯ Sonnet 4.6 haiku sonnet opus
Strange model usage on Claude desktop app. (www.reddit.com) 6 7w

With the recent update on Claude desktop top I am able to see token usage across models. There was usage of haiku model which I never switched to.

haiku
I built a local-first memory system for Claude Code — 98%+ on 4 benchmarks, 100% LME with optional reranking (www.reddit.com) 2 7w

I've been working on context-mem — a persistent memory layer for AI coding assistants. The problem: every new Claude Code session starts from scratch.

haiku claude-code
Sonnet is expensive, so I built a free open-source Sheets agent on Haiku that outperform the same prompt claude/gemini, here is what I learnt. (www.reddit.com) 7w

I live in Google Sheets. Financial models, projections, scenario planning — that's most of my working day.

haiku sonnet gemini
"My parallel multi-model pipeline: Opus for planning, 3x Sonnet for content, 3x Haiku for search — what's your setup?" (www.reddit.com) 1 7w

"I've been running a parallel multi-model pipeline and curious what setups you all are using. My current workflow: Opus: Planning & high-level architecture Sonnet x3: Content generation (running 3 instances in parallel) Haiku x3: Search, v…

haiku sonnet opus

← all tags