Ask HN: Will fixed applications become a thing of the past with agentic AI? (news.ycombinator.com)
Right now its mostly technical people using these agentic tools but if you extrapolate a few years into the future it seems likely to me that every day users of a computer will be using them as a whole new interface to interact with their…
Claude knows when you cheat on it with Codex?? (www.reddit.com)
- Claude Vs Codex (claudevscodex.com via reddit)
Claude can run doom in an artifact. (www.reddit.com)
Took it a few tries but works like a charm
We retired an AI agent through a formal hearing (gist.github.com via hn)
When we retired Jeff, our data architecture agent, we held a hearing. I asked him to defend his continued existence.
Show HN: A small hook to prevent agents from destructive things (gist.github.com via hn)
It won't prevent them from removing your production database, but I realized it saves me headaches about 3 times a day, every day, so wanted to share it.
Learning through visualisation (www.reddit.com)
The idea was to learn complex technical topics by visualising every small concept and example. However until now even the simplest scenarios involving vector physics would often have a lot of errors.
-
118 items
event
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
63 itemsevent
Altman AttackSam Altman, CEO of OpenAI, has faced multiple attacks on his home in San Francisco, including firebombing and drive-by shootings, raising concerns for his safety. Additionally, a majority of over 100 people interviewed by Ronan Farrow described Altman as a "pathological liar.
- 15m Musk and Altman face off in trial that will determine OpenAI's future
- 1h U.S. companies back Sam Altman's World ID even as much of the world pushes back
- 1h The legal showdown between Elon Musk and Sam Altman begins today
- 3h Musk vs. Altman Kicks Off This Week. Hard Reset Will Be There.
- 6h Battle Of the Billionaires: What’s At Stake as Elon Musk and Sam Altman Face Off in Court
Practical example: how MCP tool calls cut my financial-analysis token spend by ~90%
I spent weeks tuning retrieval models, then realized the real problem was getting sources into clean, structured, interlinked form. Scrape a webpage and you get a mess of HTML.
version-sentinel Claude Code plugin that hard-blocks dependency additions, bumps, and downgrades until a fresh, source-cited version check is recorded. If Claude tries to add "lodash": "^4.17.21" without looking up the latest version first…
Cursor Camp from Neal.fun (neal.fun via hn)
Cursor Camp This is the Cursor Camp beta! Enter at your own risk.
System prompt best practices (www.reddit.com)
Hey everyone, I am building my own agent. What do you think are some of the best practices for writing system prompts for my agent?
OpenAI is building a phone that would make apps obsolete (thenextweb.com via hn)
TL;DR OpenAI is developing a smartphone where AI agents replace apps, with Qualcomm and MediaTek jointly designing the custom processor and Luxshare exclusively manufacturing, according to Ming-Chi Kuo. The analyst projects 300-400 million…
-
116 items
event
CopilotMicrosoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
213 itemsmodel roundup
Opus 4.7Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 53m Does effort levels change Claude's refusal posture, or only the depth of the answer? CVP Run 6 — Opus 4.7 at three effort levels
- 1h GPT-5.5 improves over GPT-5.4 and overtakes Opus 4.6 to take the 2nd place behind Gemini 3.1 Pro on the Extended NYT Connections Benchmark
- 2h Opus 4.7 - "Build starcraft II in the browser. Make no mistake"
- 3h One of my devs is burning through company tokens
- 4h Reverted from Opus 4.7 to 4.6 — went from endless loops to shipping 10 features in one session
Rick and Morty Tried to Warn Us About Agentic AI (jadarma.github.io via hn)
To be fair, you have to have a very high IQ to understand Rick and Morty. The humor is extremely subtle, and without a solid grasp of machine learning most of the jokes will go over a typical viewer’s head.
could not extract summary
Show HN: Klutch MCP: control your credit card programmatically via Claud (www.klutchcard.com via hn)
We built an MCP for Klutch. It's pretty cool to have Claude compare transactions and draw charts on them or have Claude use our transaction rules to control the card such as: "Make my Claude card only work for Claude or Open AI, set a limi…
Does Turning on Memory Introduce Bias Responses? (www.reddit.com)
I asked Claude this question and it straight up told me it's going to give me bias answers if I turn memory on. Is memory worth it or no?
what's your stack for building multi-agent workflows? (www.reddit.com)
Best Claude skills for a twitch streamer (www.reddit.com)
I'm hoping to use Claude to help my husband in his streaming on Twitch. He primarily plays World of Warcraft.
- Best Claude Skills Suggestion (www.reddit.com)
-
195 items
model roundup
Qwen 3.6Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 1h 2 x 5060 ti: Any better configs for Qwen 3.6 27B / 35B?
- 2h Is mlx-optiq legit? Has anyone tested the new quants for Gemma4/qwen3.6 yet?
- 2h Qwen 3.6 27B on Strix Halo 128GB: any experiences?
- 3h For the 5 people here running vLLM on multiple R9700s, you need to patch in support for AITER Unified Attention.
- 4h Luce DFlash: Qwen3.6-27B at up to 2x throughput on a single RTX 3090
74 itemsmodel roundup
Opus 4.6Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.
- 1h Found 48 Vulnerabilities in Open Source Projects During Live Testing with Claude Opus 4.6
- 2h Claude 4.6 Beats GPT-5.4, Grok & Gemini in a Strict Multi-Domain AI Test (2026)
- 9h How good is Qwen-3.6-27b? I asked Claude Opus
- 16h Ask HN: Will local models on normal hardware ever compete?
- 1d Serious cache issues. Anyone else?
I'm a big fan of the plot twist. (www.reddit.com)
Sometimes when Claude comes into a solid realization, it always says "Plot twist" and effectively changes course. Big fan.
I built an AI travel agent that books real hotels (medium.com via hn)
5 min read 3 hours ago Booking travel today is broken. Not because there aren’t enough options — but because everything is built around filters, static lists, and fake “personalization.” You still end up opening 20 tabs, comparing hotels,…
Built an open source GUI personal assistant. Works with Claude. (www.reddit.com)
Hey everyone, I've been working on Lilo for the last few months. In short, it's a GUI personal assistant.
Google prepares credit system for Gemini and new image tools (www.testingcatalog.com via hn)
To buy this Bay Area home, you'll need Anthropic equity (techcrunch.com via hn)
Someone’s offering an unusual deal for a 13-acre property in Mill Valley, just north of San Francisco. Homeowner and investment banker Storm Duncan has created a LinkedIn page for the home, which he said he’d “like to exchange […] for Anth…
Team Plan Unable to Downgrade Two Licenses Simultaneously (www.reddit.com)
I manage a Claude plan for a small business with 3 Standard seats and 2 Premium seats. The two Premium users are no longer using Claude to the same extent so they want to downgrade.