I've been on Max for two months and I finally sat down and tracked where my tokens actually go. breakdown of a typical day: - ~40% file reads, git status, project context scanning: stuff that doesn't need opus at all - ~25% test generation…
Show HN: Kanban-CLI – a web UI for local Markdown todo lists (github.com via hn)
As we all are, I've been experimenting with ways to reduce external saas spend, and continually bring traditionally external pieces of context (prs, docs, trello boards) into the one mono repo. I have toyed with a markdown todo list and se…
AgentShield – spending firewall for AI agents (github.com via hn)
AgentShield A spending firewall for autonomous AI agents. Before an agent executes a payment, it submits a spend intent to AgentShield.
Agentic workflow that can find and acquire customers for $0.10 😆 (www.reddit.com)
Im curious if anyone is building a sales tools with AI. Im building one from scratch because cold outreach was killing me.
Stripped an AI agent down to a bash loop – No Framework (github.com via hn)
Seed An autonomous AI agent that builds other autonomous AI agents Running 24/7 on a $25 Raspberry Pi Zero 2W. No API keys.
Hey everyone, thinking about upgrading to Claude Max pretty soon and before I pull the trigger I wanted to ask if anyone has good full guides or tutorials on actually getting the most out of it. Not just "here's what the plan includes" typ…
-
291 items
model roundup
Qwen 3.6Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
133 itemsmodel roundup
Qwen 3.5Qwen3.5-9B is a post-trained model with 9 billion parameters that integrates multimodal learning and efficient hybrid architecture for enhanced performance. Community highlights include speculative decoding on Apple Silicon boosting Qwen3.5-9B's throughput by 4.1x, and the model outperforming others in coding tasks while addressing overthinking issues through tool usage.
- 30m vLLM Just Merged TurboQuant Fix for Qwen 3.5+
- 7h A simple "hack" to speed up prompt processing for Qwen 3.5/3.6 in LM Studio
- 8h APEX MoE quants update: 25+ new models since the Qwen 3.5 post + new I-Nano tier
- 20h Mistral Medium 3.5 128B and Qwen 3.5 122B A10B on 4x RTX 3080 20GB
- 1d Local LLM Benchmark about Backend Generation by Function Calling (GLM vs Qwen vs DeepSeek)
I built a tool that lets you publish your Claude Design artifacts to a real website directly from chat. I built this because chats in claude.ai already have everything they need to make a full stack web app: code execution, file creation,…
Spent the last several months using Claude Code well beyond the editor: as the reasoning engine inside a multi-layer system that handles tickets, cross-repo implementation, code review, MRs, and a persistent knowledge layer between session…
Show HN: Agent Historic Philosophical Persona Routing and Prompts (github.com via hn)
I've been building this for a while. The core of it is I've assigned different tasks in a software engineering job to different philosophers.
could not extract summary
An Introduction to LangChain's Deep Agents (medium.com via hn)
An Introduction to LangChain’s Deep Agents | by Ng Pei Jiun | Apr, 2026 | Medium Sitemap Open in app Sign up Sign in Get app Write Search Sign up Sign in An Introduction to LangChain’s Deep Agents Ng Pei Jiun Follow 3 min read · Apr 19, 20…
I've been looking into why Claude Code can suddenly burn through token limits with massive cache reads, and I have a theory I'd love feedback on. It seems Claude Code has an automatic file watcher that tracks "recently modified" files and…
-
115 items
event
Altman AttackSam Altman, CEO of OpenAI, has faced multiple attacks on his home in San Francisco, including firebombing and drive-by shootings, raising concerns for his safety. Additionally, a majority of over 100 people interviewed by Ronan Farrow described Altman as a "pathological liar.
167 itemsevent
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
Show HN: I built a native macOS audio player and it changed my life (github.com via hn)
I've going through a backlog of projects and checking them off with Claude Code. What started out as a quick afternoon of coding with claude has relight my love of making.
An unbiased benchmark for how well agents can read your docs (docsalot.dev via hn)
Agents already use docs as product instructions Coding agents, AI search products, and automated support flows read docs long before a human asks for help. If the docs are hard to fetch or parse, the product feels harder to use.
I've been building Spar (sparwithai.com), an app/website where you take a position and Claude argues against you across 5 rounds that escalate in intensity. Sounds simple.
I’m curious where things currently stand on this. With the rapid progress in LLMs and autonomous AI agents, are they actually capable of reliably solving reCAPTCHA (v2, v3, image-based, etc.) in real-world scenarios?
Running a Company with Agents (cofounder.co via hn)
Run engineering, sales, marketing, design, finance, and ops. Agentic departments — Cofounder is designed like a real company, with departments, managers, and shared context.
- Is anyone actually running a company with 30+ AI agents, or is this just hype? (www.reddit.com)
- Long-Running Agents (addyo.substack.com via hn)
Egg meet face. (www.reddit.com)
https://preview.redd.it/drtw1mjwf7zg1.png?width=997&format=png&auto=webp&s=90b45173c1caba12a10bd4ff4a0a717563be9512 https://preview.redd.it/kk1ayljwf7zg1.png?width=997&format=png&auto=webp&s=f0b210cef867d817891635138f9a531b7e2e2fcc https:/…
-
272 items
model roundup
Opus 4.7Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 1h Claude admitted to not trying. Am I going about this project incorrectly?
- 1h Claude halluncinating human responses
- 2h Built an AI that responds in Star Wars crawl style. May the 4th be with you.
- 5h Yeah, problems, costs. But had to admit: Opus 4.7 can do his f*ng work.
- 5h Using Claude-4.6-Sonnet and Opus 4.6 in a multi-agent "Code Review Swarm" (Visual Sandbox) - try in minutes!
176 itemsevent
CopilotMicrosoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
- 2h Show HN: Dust3D 1.0 – low-poly 3D modeling tool (10 years in the making)
- 2h Microsoft fixes VS Code after app gives Copilot credit for human's work
- 4h Ask HN: Are employers getting the returns from AI?
- 6h Local model for Cursor to build an Android App
- 12h Prism MCP - A tool to bridge claude code with vs code language servers
Agent Skills (addyosmani.com via hn)
AI coding agents take the shortest path to done, which usually means skipping the specs, tests, and reviews that make software reliable at scale. Agent Skill...
- Agent Skills for Non-Devs (agent-skills.market via hn)
- what is an agent? (www.reddit.com)
- Agent Skills on the Latent Space (old.reddit.com via hn)
+1 more
- Agent Skills for Learning (www.reddit.com)
FastDMS: 6.4X KV-cache compression running faster than vLLM BF16/FP8 (www.reddit.com)
Last year researchers affiliated with NVIDIA, University of Warsaw, and University of Edinburgh published Dynamic Memory Sparsification (DMS), a KV-cache sparsification technique using learned per-head token eviction, reporting up to 8x KV…
SprintiQ – open-source sprint planning for Claude Code (github.com via hn)
SprintiQ The product brain for Claude Code. SprintiQ Turbo is the planning layer that sits above Claude Code.
HeadVis: An Interactive Tool for Investigating Attention Heads (transformer-circuits.pub via hn)
We introduce HeadVis, an interactive tool for investigating attention heads in large language models. Visualizing how individual computational units activate across the full data distribution has been useful in previous work – for example,…
Shelley: a coding agent for exe.dev Shelley is a mobile-friendly, web-based, multi-conversation, multi-modal, multi-model, single-user coding agent built for but not exclusive to exe.dev. It does not come with authorization or sandboxing:…
The em dashes ( — ) | The unsaid AI SLOP Tax (www.reddit.com)
I used to use em dashes earlier in my chats or titles for articles or Social Media Posts But since last year the ChatGPT & CLAUDE content started flooding online.... and aggressive usage of em ( — ) dashes by AI in response Kinda scared to…