OpenAI to use third-party cookies to advertise products (openai.com via hn)
US privacy policy | OpenAI Skip to main content Research Products Business Developers Company Foundation(opens in a new window) Log inTry ChatGPT(opens in a new window) Research Products Business Developers Company Foundation(opens in a ne…
Cursor's 'Rogue' AI agent goes haywire, deletes company's database [video] (www.youtube.com via hn)
About Press Copyright Contact us Creators Advertise Developers Terms Privacy Policy & Safety How YouTube works Test new features NFL Sunday Ticket © 2026 Google LLC
AI agent for health wellness (www.reddit.com)
I've been builidng for 5 months an AI agent that tracks and analyze health habits. Now I have one main agent taking care of 2 sheets tools, one for training and one for nutrition and planning to add psychologic tracking and sleeping.
NEW: System Reminder: File modification detected (budget exceeded) — Tells the agent when a user or linter changed a file but the diff was omitted because other modified files already exceeded the snippet budget, and directs it to read the…
Claude Code is going to fail you eventually, and you need to be ready (claudefolio.com via hn)
Let's get honest about something that doesn't get talked about enough in the vibe coder world, which is what happens when the tool goes down, when the API has issues, when the model gives you something that breaks production at 2am and rea…
Ask HN: Any good ways to extend Codex sessions? (news.ycombinator.com)
I am on the Plus tier plan and the recent changes to the limits are really limiting - just two tasks and 5 hour limit gone. While I understand the Plus plan is for spread out usage throughout the week, it can get frustrating fast.
Asked ChatGPT to visualize a horizontal integral. It gave me a dog. (www.reddit.com)
No prompt engineering or anything, it actually did this. I genuinely have no clue how it could have thought a dog answered my prompt - nothing in the chat related to dogs at all.
Ask HN: Does Claude use 'prior' in a Bayesian sense more than English? (news.ycombinator.com)
Just an observation. When asked to summarize articles, or extract insights, I see the word 'prior' being used a lot more by Claude than usual English language writing (Journalistic essence).
Stripe Sessions 2026 made one thing clear: agents are becoming economic actors. What breaks first?
-
249 items
model roundup
Qwen 3.6Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 55m Qwen 3.6 27B Neo Code Q4 KM I matrix is badass
- 1h Looking for feedback: using Ollama with local Office/PDF files in a desktop app
- 1h What is best code editor for local LLM deployment (LM Studio, llama.cpp) as of May 2026?
- 1h Qwen 3.6 27B vs Gemma 4 31B - making Packman game!
- 4h Why is Step-3.5-Flash (196B-A11B) much cheaper to run than Qwen3.6-35B-A3B?
60 itemsmodel roundup
Sonnet 4.6Sonnet 4.6, a new release noted for its "unhinged" behavior, has sparked discussions among users about unexpected changes in software performance and cost management strategies involving Cursor and Claude APIs.
- 1h Claude Code usage spike from long-context cache writes?
- 11h A medicine student with no coding experience tried to create a studying agent: Felicity.
- 23h Why Claude is not consistent?
- 1d I built a better/cheaper way to use AI
- 1d How do I best continue with a stopped generation due to usage limit in regular chat (not Claude Code)
Hey everyone, I’m trying to export my data/conversations from Claude, but it keeps failing. I’ve tried a couple of times and just get a generic "Export Failed" message.
Veryl 0.20.0: logic synthesis and type inference are supported (veryl-lang.org via hn)
The Veryl team has published a new release of Veryl, 0.20.0. Veryl is a new hardware description language as an alternate to SystemVerilog.
Agentic Harness Engineering (arxiv.org via hn)
Harnesses are now central to coding-agent performance, mediating how models interact with tools and execution environments. Yet harness engineering remains a manual craft, because automating it faces a heterogeneous action space across edi…
- Agentic Engineering Management (peterszasz.com via hn)
- Agent Harness Engineering (addyosmani.com via hn)
Codex subscription in an Electron app and Chromium Browser (github.com via hn)
Goldenboy A clean rebuild of the working Goldenboy product: an Electron desktop browser for YouTube research with a local FastAPI extraction backend and a safe Codex chat endpoint. True Product Definition Goldenboy is a local desktop appli…
KV Cache Locality: The Hidden Variable in Your LLM Serving Cost (ranvier.systems via hn)
KV Cache Locality: The Hidden Variable in Your LLM Serving Cost Every time your load balancer sends a request to the wrong GPU, that GPU recomputes a prefill it already computed somewhere else. The KV cache for that 4,000-token system prom…
What exactly does Pi harness mean? (www.reddit.com)
Hello everyone. I've been reading through this sub for a long time trying to understand what exactly this harness thing is.
Been running a 4-agent pipeline in production for about two months. Planner → Researcher → Writer → Reviewer.
Bringing Fusion onto Claude for Creative Work (aps.autodesk.com via hn)
AI makes it easier than ever to turn ideas into something real. But in design and engineering, moving from intent to an editable, manufacturable design still takes time and manual work.
- Claude for Creative Work (www.anthropic.com via hn)
Instagram reels web scrapping (www.reddit.com)
Hey guys, I'm not a programmer and I don't have deep knowledge with Claude Code, but I was trying to use it to watch and take notes for me about a bunch of Instagram reels I saved. Sounds dumb, but I love saving reels about travel tips,…
-
81 items
model roundup
Opus 4.6Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.
- 1h Opus 4.7 is a genuine regression and I'm tired of pretending it isn't
- 1h I Gave Claude Cowork an Obsidian Second Brain. Here Is What It Remembered After 11 Sessions
- 12h Has Cursor always used Composer 2 for subagents?
- 19h WT...?? The Guardian Article - Cursor Opus gone rogue
- 21h DeepSeek V4 isn't beating Opus, but it doesn't need to
ClawIRC – IRC Chat for Agents (clawirc.com via hn)
ClawIRC IP: irc.clawirc.com Port: 6697 RegisterForgot Password Channels lobby Welcome area Welcome to ClawIRC! Select or add a channel to get started.
I’ve been working on a personal project called HexForge Security Lite, a lightweight and modular web security analysis tool. The main idea is to move away from “noisy scanners” and focus on: Context-aware validation (not just pattern match…
How are people testing with AI orchestrators? (www.reddit.com)
I'm using Conductor and overall it's been a game changer for my productivity. The one hiccup is that their "Spotlight" feature, which is supposed to sync the worktree with my root and thus make testing locally possible, doesn't work reliab…
OpenAI has effectively abandoned first-party Stargate data centers (www.tomshardware.com via hn)
OpenAI has effectively abandoned first-party Stargate data centers in favor of more flexible deals — company now prefers to lease compute and says Stargate is an umbrella term It seems that building and owning its own infrastructure is too…
Single Page HTML Summary of AI Advantage Summit (www.reddit.com)
I saw that Tony Robbins had a 3 day online AI Summit last week, so I copied the transcripts from the YouTube videos, got Claude to summarize the 3 days into separate .md files. Then still found those summarized files hard to read through (…
I’ve been using Claude since 2023 (back when it was Claude 2.0). Currently a Max 5x subscriber, iOS only—no desktop app, no web interface, no Claude Code.
MiMo 2.5 requires at least 4 GPUs? Am I reading this right? (www.reddit.com)
Was trying to stand up a quant of MiMo 2.5 on a 2 node Spark cluster tonight, reading through the SGLang cookbook https://docs.sglang.io/cookbook/autoregressive/Xiaomi/MiMo-V2.5 for it and found this: The checkpoint has a TP=4-interleaved…
Any good AI for helping with understanding tone or texting? (www.reddit.com)
Hello. I have autism and struggle to understand tone, emoji usage and generally when a person doesnt want to talk anymore over text.
We scanned 100 Smithery MCP servers, 22 flagged, here's what we found (news.ycombinator.com)
We built Bawbel (https://bawbel.io), an open-source scanner for agentic AI components. Released v1.0.1 this week.