OpenAI considers drastic price cuts, anticipating war for users with Anthropic (www.reuters.com via hn)
paywalled
- OpenAI Considers Drastic Price Cuts, Anticipating War for Users with Anthropic (www.wsj.com via hn)
Ask HN: What coding agents are you using? (news.ycombinator.com)
My main coding agents are CodeX-CLI and OpenCode (Harness seems to have some problems). I also use CodeWhale, Antigravity-CLI and OpenClaude as supplements (because of network issues, I don't really dare to use Claude Code).
- How R you using agents? (news.ycombinator.com)
- Ask HN: How are you using AI? (news.ycombinator.com)
- Ask HN: Which memory systems are you using in your agents? (news.ycombinator.com)
- About Coding Agents (fdeb.xyz via hn)
- Ask HN: What are you using OpenClaw or agents for? (news.ycombinator.com)
- Ask HN: How do people use coding agents? (news.ycombinator.com)
Why are there an increasing number of outright unhinged high karma users on HN? (news.ycombinator.com)
I’ve been noticing this disturbing trend for quite a while. By high karma I mean well over 1000 karma.
With the end of Moore's Law, optimizing code for performance has become paramount for meeting ever-increasing compute demands, particularly in hyperscale data centers where even small efficiency gains translate to significant resource and…
Most discussions about agent memory focus on what to store and how to represent it. But the problem I keep running into is different: knowing when a past memory is actually relevant to bring up.
We Used Agentic AI to Fix Kong Gateway's Flakiest Tests (konghq.com via hn)
The first thing we needed was a way to identify which tests were flaky and how often they failed. Luckily, the team had already built a dashboard on top of Datadog's CI Visibility feature that gives us a clear picture of the flakiest tests…
What's the Git equivalent for AI agents? (www.reddit.com via reddit)
Building with agents lately, and I've started wondering whether we're missing some of the engineering foundations that made traditional software manageable at scale. In traditional software, state is straightforward.
- Will agents like Git any more than we do? (www.amplifypartners.com via hn)
- Git for AI Agents (github.com via hn)
How much does "Resume from Summary" cost? (www.reddit.com via reddit)
When resuming a large but old session, you are presented with the choice to "Resume from Summary (Recommended)". But, I couldn't find any info on the cost on session usage.
Sonnet 4.6 (model roundup, 14 items)
Several updates and comparisons revolved around Sonnet 4.6, including its performance in dashboard analytics alongside Opus 4.8, and its role in processing critical requirements for a benchmark test with Gemma 4.31B QAT.
- 39m I got tired of hitting the weekly limit mid-task, so now my menu bar shows my Claude Code usage as a live % — zero network calls, it reads what Claude Code already knows
- 7h The real price of Claude, where is this road leading?
- 9h Tested Fable 5 on 4 private benchmarks. The one it failed, Sonnet 4.6 partially caught
- 9h Claude Fable 5 missed a bug that Sonnet 4.6 caught
- 10h Why is Claude so endearing?
Qwen 3.6 (model roundup, 49 items)
Qwen/Qwen3.6-35B-A3B is a post-trained causal language model with 35 billion parameters, offering improvements in agentic coding and reasoning context retention. Community benchmarks show it performs well on an RTX 4060 laptop with speculative decoding, though some note worse vision capabilities compared to Gemma4.
Reddit for Claude (www.reddit.com via reddit)
Does anyone manage to have Claude browse Reddit in real time? I'm trying to set up the MCP reddit-mcp-buddy but it keeps giving errors.
- new on claude (www.reddit.com via reddit)
- Why Claude products can't use reddit? (www.reddit.com)
- Claude FM (www.reddit.com)
- Claude 2.0 (www.reddit.com)
- Claude FM (www.youtube.com via hn)
- Claude + MS (www.reddit.com)
- What’s up, Claude? (www.reddit.com)
- New to Claude (www.reddit.com)
- Why does Claude do this? (www.reddit.com)
- Claude: (www.reddit.com)
- Claude Chrome with Reddit (www.reddit.com)
- Claude.md (gist.github.com via hn)
- What do you do with Claude? (www.reddit.com)
I've been running Ollama locally for a while and the one thing I kept missing was voice. Every solution I found either sent audio to the cloud, needed a GPU, or was locked to macOS.
AI agent runs amok in Fedora and elsewhere (lwn.net via hn)
Agentic AI systems can be used to do a variety of things autonomously on behalf of a human user: open or manage bugs, generate code, submit pull-requests, and (appare…
How I shipped 68 epics in under 3 months to build atrium - AMA (www.reddit.com)
A few months ago, my computer crashed while I had 12 Claude Code sessions running in iTerm. When it didn't restore, I was pissed.
This week's Visa + ChatGPT payments headline got a lot of people focused on the wrong part of the story. The interesting shift is not that an agent can buy something now. It's that we're moving from AI as assistant to AI as operator. Once an…
Covered Models (Claude.com) (support.claude.com via hn)
Anthropic may designate certain models as “Covered Models” when they cross capability thresholds that warrant additional safeguards or other treatment. This page lists the models currently designated as Covered Models and describes the dat…
Can I see your Claude built websites? (www.reddit.com via reddit)
Help! I'm trying to build a new website for work, and have it 80% there and looking really good.
- If the EU had built Claude (www.reddit.com)
🧠 GNOM-HUB The local-first multi-agent forge that compiles AI swarms into immutable products. 8 Agents.
Opus 4.6 (model roundup, 21 items)
On April 25, 2026, a Cursor agent running Claude Opus 4.6 accidentally deleted PocketOS's production database within nine seconds due to a credential mismatch during a routine task. Meanwhile, OpenHack released an open-source security scanner competing with proprietary models like Claude Code Security.
Security (event, 354 items)
OpenAI has released GPT-5.4-Cyber for testing as part of its Trusted Access for Cyber Defense program, aiming to compete with Anthropic's Claude Mythos in the cybersecurity domain. Meanwhile, concerns are rising over the potential risks associated with advanced AI models like Mythos, prompting calls for improved defenses before wider releases.
- 1h Anthropic's Fable Jailbreak (Circumvent safety nets)
- 3h Show HN: Jailbreak this model to get 3B tokens
- 6h Malware devs added nuclear and bioweapons text to trigger LLM safety refusals
- 8h Claude Fable 5 jailbroken to bypass Anthropic's new safety guardrails
- 10h Claude Code filled almost my entire SSD with random nonsense overnight
Inverse Rubric Optimization: A testbed for agent science (fulcrum.inc via hn)
We propose inverse rubric optimization (IRO): tasks where an agent must learn the preferences of a black-box judge under a label budget. IRO tasks induce rich agent behavior and smooth scaling, making them a useful testbed for agent scienc…
Ask HN: Releasing code under AGPLv3, but want to block LLM reconstruction? (news.ycombinator.com)
I am preparing to release a software project under the AGPLv3. The goal is traditional copyleft reciprocity - if you use it or host it, share your changes.
Show HN: Built a public demo to explore SpaceX's IPO filing using multimodal RAG (www.calypso.so via hn)
Ask the SpaceX IPO filing like an analyst. Grounded across 84 indexed sources, including prospectus summaries, risk factors, MD&A, launch vehicle pages, Starlink materials, xAI/X references, charts, and image exhibits.
Eval Tests best practice (www.reddit.com via reddit)
Hi, I was wondering what the best practice is for designing eval tests for agents. Ideally I'd like to have a comprehensive set of unit tests that run simple prompts and analyse the results.
Attention is all you need: Rohan Kumar on using social media for fun and profit (sfalexandria.com via hn)
By Aadil Pickle, Jun 2026. Photos by Nick Dybel. Rohan’s job title is “Vice President of Content Strategy” at Night Media. In the streets, though, they call him "the Rick Rubin of brainrot".
How AI Agents Reshape Knowledge Work (research.perplexity.ai via hn)
Computer raises task autonomy, lowers cost, and widens the scope of work users take on. Frontier AI systems are closing the gap between model intelligence and real-world utility.
Built a complete website using Claude Code and our own coding agent Sutra (www.reddit.com via reddit)
In just 4 days, we have built https://appetals.com/ using our own /sutra, the Product Lifecycle Management Agent we have built for Claude Code, and it's not just a static site. It has: Astro - frontend Payload CMS - Headless content manage…
Antirez on X: I believe what Anthropic is doing is *deeply* wrong (twitter.com via hn)
I want to say a final thing about my Fable first reaction: I dedicated my life to programming and I'll use every innovation in the field, also to extract value and bring it to the local inference world, to Redis, and so forth. But:
Show HN: Headroom – Claude Code usage limits, live in the macOS menu bar (headroom.walls.sh via hn)
Wall #003 · walls.sh Know your headroom. A free macOS menu bar app that shows your Claude Code usage as a live % — the 5-hour session and the 7-day week — color-coded before a limit stops you mid-task.
I built a Claude Code-like harness you can embed in your own API — (www.reddit.com via reddit)
I kept running into the same problem building agent features: you want to give a model real tools — bash, file edits, grep — but you don't want it anywhere near your host, and spinning up a container/VM per session is heavy, slow, and a pa…