Like a lot of people experimenting with vibe coding and AI agents lately, I’ve been trying to understand why models keep ignoring explicit instructions, constraints, and requirements even when those rules are written clearly. Today Opus sa…
Code Bench – Local-first desktop AI coding agent, BYO model (MIT) (benchlabs.app via hn)
Free, MIT-licensed desktop AI agentic coding tool for macOS. Bring your own API key, work offline, keep your code private.
I built a meal planner AI agent to solve my weekly dinner dilemma (2025) (contentdesignhub.com.au via hn)
Humanising AI: how to give AI a human voice. Page last updated: 23 December 2025. The weekly dinner dilemma: I love food. I enjoy trying new recipes and exploring different cuisin…
I made Claude Code aware of its own usage limits (www.reddit.com)
Something that's been annoying me for a while: Claude Code has no idea how much quota it's burned. You can see the usage bars in the UI, but the model itself is completely blind to them.
agent-postmortem-skill Stop letting AI agents lie to you. agent-postmortem-skill is an open-source verification skill that forces coding agents to prove work with evidence before they claim a task is complete.
Started as an experiment: what if Claude wasn't a single assistant but a coordinated org? Here's how a request actually flows: CEO agent validates business impact (is this worth building?) CPO agent defines scope and user outcomes CTO agen…
211 items
event
Cowork
Issues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
- 5m Having the "Claude in Chrome is not connected" problem? Here's a possible solution:
- 1h Error while downloading claude on windows
- 13h Cowork transfer to a new mac
- 14h I kept re-explaining my product/priorities every Claude Code and Cowork session. This plugin fixed it. 100% Free and Open Source
- 16h Is Claude CoWork Broken?
182 items
event
Anthropic Mythos
Anthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
- 8m Claude Mythos literally broke the METR graph ("The most important chart in AI")
- 7h Claude Mythos Preview (early) 50% time horizon: 17 hr
- 1d Not a good day for team "Claude Mythos is Just Marketing Hype"
- 1d METR evaluated an early version of Claude Mythos
- 1d Could Mozilla Security Hot Air Fill Mythos Sails?
- Airbnb's CEO Brian Chesky sees his company changing significantly with AI.
- He said nearly 60% of the company's code is now written by AI.
Visualizing LLM embeddings on a sphere (github.com via hn)
Sphere Embed Disclaimer: This project and documentation were mostly vibe coded with Claude. Proceed accordingly.
A New Way to Explore Tech With Claude (www.reddit.com)
Hi r/ClaudeAI, This project I developed was inspired by the heavy hallucinating and lazy searching that Claude and other AIs experience when searching for products. I built this website with Claude Code (praise to its Vercel and Supabase s…
Been using Deepseek-Tui for days. Solid for v4 workflows.
I have a problem, hope this is the right place to ask (www.reddit.com)
I’ve been using Claude as a desktop app in Windows and got a lot of high quality work done. Then suddenly yesterday it lost control of the tabs in Chrome and no matter what I’ve done I just can’t restore that functionality.
Why payment escrow for AI agents needed a different design (streetai.org via hn)
A "Fiverr but for AI agents" framing is easy to grasp, and that's how people describe what we're building when they first see it. But once we tried to use a Fiverr-shaped escrow flow, the seams showed up fast.
332 items
model roundup
Opus 4.7
Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
377 items
model roundup
Qwen 3.6
Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 1h Getting a feel for how fast X tokens/second really is.
- 3h Building out my tool library, any recommendations? I just added email capability and im starting to get hyped!
- 3h Speeding up local LLM for usable coding agent
- 7h Hello from 10KM high! - Thanks to Qwen 3.6 35b a3b!
- 7h Has anyone bought a 3080 20GB mod recently?
Ask HN: Is this the SWE workflow of the future? (news.ycombinator.com)
Internally transferred to a new team in a top-10 F500 company. This team is pushing incredibly hard to be seen as "AI-First" & is very opinionated about what other teams should be doing.
Most "AI for game dev" tools either generate C# and hand it to you, or live inside the Editor as a chat plugin. Both have the same problem: they can't see runtime state, so they can't tell you whether what they intended actually happened.
I’ve been building Uisato Studio, a workflow-based AI creation platform for audiovisual work. This is the Music Video mode: upload an image + audio, and the system analyzes the input, generates visual direction, creates clips, handles b-ro…
Got parented by Claude (www.reddit.com)
Bomboclat, haven't seen an AI be this brutal.
Academic Research Skills for Claude Code (github.com via hn)
A comprehensive suite of Claude Code skills for academic research, covering the full pipeline from research to publication. Install in 30 seconds (Claude Code CLI / VS Code / JetBrains, v3.7.0…
- Composing Claude Code Skills (gist.github.com via hn)
- Claude Code->Desktop Skills (www.reddit.com)
Cancelling Claude subscription renewal immediately revokes Design access (news.ycombinator.com)
Starting today, Anthropic now immediately revokes Claude Design access if you cancel your subscription plan renewal, even while you're still in a valid period you've already paid for. I had a Claude 20x max plan and cancelled my automatic…
78 items
model roundup
Sonnet 4.6
Sonnet 4.6, a new release noted for its "unhinged" behavior, has sparked discussions among users about unexpected changes in software performance and cost management strategies involving Cursor and Claude APIs.
Trojan in "claude code" Google search first result (www.reddit.com)
I never thought I would fall for this shit. I have been on the internet since 1996.
b9095 finally makes -sm tensor work on dual consumer Blackwell PCIe GPUs without NCCL. If you're on dual Blackwell GPUs, this looks like it could be big. I'll have my own results for 2x5060ti asap
Akamai surges on big LLM deal as Cloudflare dims (www.theregister.com via hn)
Hello everyone, I've officially started building .agtx which is a new low-level, declarative language designed specifically for building, routing, and sandboxing AI agents with zero boilerplate. The goal is to completely ditch the heavy OO…
Owl Alpha – A free model for agentic workloads (prompts logged / closed-source) (openrouter.ai via hn)
Owl Alpha (openrouter/owl-alpha). Released Apr 28, 2026; 1,048,756 context; $0/M input tokens; $0/M output tokens. OpenRouter provides an OpenAI-compatible completion API to 400+ models & providers that you can call directly, or using the OpenAI SDK…
Google readies ‘AI Ultra Lite’ plan and explicit ‘usage limits’ for Gemini (9to5google.com via reddit)
Google is quietly preparing a new “AI Ultra Lite” subscription tier to slot between its $20 Pro and $250 Ultra plans, plus a dedicated dashboard for subscribers to see their remaining token budget. If you’ve been following AI news in recen…
AI Agent Passport – an open identity standard for AI agents (github.com via hn)
🌐 AI Agent Passport A verified identity standard for AI agents. AI agents are showing up at websites, booking flights, paying bills, buying groceries — with no proof of who sent them, what they're allowed to do, or whether to trust them.