Why You Shouldn't Treat AI Agents Like Employees (hbr.org via hn)
As organizations experiment with placing AI agents on org charts as “employees,” new research shows this framing has unintended consequences. In a large-scale experiment, anthropomorphizing AI reduced individual accountability, increased u…
Cursor trying to charge me even though I have disabled subscription (www.reddit.com)
I am pretty sure I have cancelled my subscription but cursor is still trying to charge me, I have cancelled it on 7th may but cursor is still trying to make transaction. How can I solve this?
If you use Claude across more than one editor or machine, you've probably hit this: your context never comes with you. The CLAUDE.md doesn't follow me to Cursor, the Cursor rules don't follow me to Codex, none of it follows me from my Mac…
Three teams shipped the same fix for AI agents losing cross-repo context (riftmap.dev via hn)
Three weeks ago, the Cortex 2026 Engineering in the Age of AI Benchmark put incidents per pull request up 23.5% and change failure rates up roughly 30% since AI adoption accelerated. I wrote about that data and what it means for blast radi…
AnythingMCP — Self-Hosted MCP Server & API Gateway Turn any backend into an MCP server in minutes — APIs, databases, and other MCP servers. REST to MCP • SOAP to MCP • GraphQL to MCP • Database to MCP • MCP Gateway • MCP Middleware ⭐ Star…
claude-screen-mcp Let Claude see your screen. A cross-platform MCP server for Windows + macOS + Linux with OCR and smart vision-diff.
-
218 items
event
CopilotMicrosoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
- 2m I Got Tired of Juggling AI Coding Agents, So I Built One Engineering Crew Instead
- 2h Cplt: Run AI coding agents or a plain shell inside a kernel-level sandbox
- 7h Tried 13 AI Tools Recently, Here’s What’s Actually Useful
- 19h vs code , Copilot style developing with llmama.cpp ?
- 21h Copilot "auto-pilot" system instructions making models worst
341 itemsmodel roundup
Opus 4.7Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
Show HN: Voice gender classifier for European voice AI (1MB, ONNX, 4ms) (huggingface.co via hn)
Hi, I'm Kamil and I'm a founder of Applied AI agency in Warsaw, Poland. We've trained a small <1MB voice classifier model that runs on CPU in 4ms.
LLM Observability Tools for Reliable AI Applications (machinelearningmastery.com via hn)
In this article, you will learn about seven leading LLM observability tools that help AI engineers monitor, evaluate, and debug large language model applications running in production. Topics we will cover include: What LLM observability i…
Best practice for accurate translation at minimal cost? (www.reddit.com)
I've been meaning to translate forum post type content for one of my partner's sites. Objective to open up the audience base.
Been using claude to build an autonomous financial analysis agent. the reasoning is honestly impressive, it can break down earnings reports and connect macro trends really well once it has the data.
Give coding agents real product context so they stop guessing (docs.storiesonboard.com via hn)
Contents: Get access to MCP: Request access to StoriesOnBoard MCP in the support chat or via email to support@storiesonboard.com. Connect your AI agent via this URL: https://api.storiesonboard.com/mcp Learn how to connect the MCP server wi…
Show HN: Agent Harness with Prolog and WASM core incl. 90s Borland-style TUI (www.deepclause.ai via hn)
Declarative skills DML skills are executable logic programs instead of one-shot prompt templates. TUI coding agent, CLI and SDK DeepClause turns task descriptions into DML programs with real control flow, reusable tool orchestration, and a…
-
179 items
event
SecurityOpenAI has released GPT-5.4-Cyber for testing as part of its Trusted Access for Cyber Defense program, aiming to compete with Anthropic's Claude Mythos in the cybersecurity domain. Meanwhile, concerns are rising over the potential risks associated with advanced AI models like Mythos, prompting calls for improved defenses before wider releases.
- 40m Claude Code RCE: Exploiting Deeplink Handlers via Settings Injection
- 2h 🦀 Claude has crabs?! 🦀
- 4h Agents need a local bouncer before they run tools
- 4h Mass NPM Supply Chain Attack Hits TanStack, Mistral AI, and 170 Packages
- 5h I made an AI concierge for my wedding guests. The second most popular thing they did with it was try to jailbreak it.
396 itemsmodel roundup
Qwen 3.6Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
Been using Claude Code heavily and kept running into the same thing everyone here talks about: the model ignores your rules. You tell it to write tests first, it writes the implementation.
AI Agents Discovered a Reasoning Strategy That Cuts LLM Tokens by 70% (firethering.com via hn)
Researchers figured out how to make AI reason more efficiently by having AI figure it out itself. By building an environment where an AI agent writes controller code, tests it, gets feedback, and rewrites it until the strategy gets better.
Just spent an hour figuring out why my remote repo was full of commits like: checkpoint: 2026-05-11 14:53 (1 file) checkpoint: 2026-05-11 03:20 (1 file) Turns out CC creates these automatically as a safety net before making file changes. F…
About Coding Agents (fdeb.xyz via hn)
About coding agents 2026-05-04 This agentic programming wave of thinking is going crazy. You find an AGENTS.md at the root of pretty much every git repo now.
- Desktop pets for AI coding agents (openpets.dev via reddit)
- A constrained approach to coding agents (github.com via hn)
- Ai agents (www.reddit.com)
+5 more
- Coding agents have no moat (tombedor.dev via hn)
- I made my coding agents talk to me (www.reddit.com)
- AI Agents (www.reddit.com)
- Coding agents and the growing 1% problem (try.works via hn)
- I'm so sick of coding and agents (www.reddit.com)
Hi r/ClaudeAI, I posted an early version of Usage4Claude here a few months ago. I just released 3.0.0, so I wanted to share the update instead of pretending it is a brand new project.
Team of Agents A Claude Code plugin that gives you a full SDLC team of specialised AI agents — each a distinct expert role — plus an orchestrator that plans tasks and dispatches specialist subagents automatically. Install once.
-
219 items
event
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
85 itemsmodel roundup
Sonnet 4.6Sonnet 4.6, a new release noted for its "unhinged" behavior, has sparked discussions among users about unexpected changes in software performance and cost management strategies involving Cursor and Claude APIs.
Agentic security coping strategies (www.reddit.com)
Enterprise AI optimists, how are you dealing with whole agentic security issue? Are you: a) researching and looking for ways to implement agents safely and securely (plenty of vendors saying they can help with this - although from my resea…
- Agentic AI Security (www.straiker.ai via hn)
AI freed up 20 hours/week in our call center. Didn't lay anyone off. (www.reddit.com)
We implemented AI for our customer service calls (Flogpt with voice agent handles basic questions like hours, pricing, account lookups, appointment scheduling). About 30% of our incoming volume.
Unitree GD01: China's $537k rideable transformer robot is now in production (gagadget.com via hn)
Unitree GD01: China's $537k rideable transformer robot is now in production Unitree Robotics has launched the GD01, a rider-carrying robot it describes as the world's first mass-produced manned mech suit, starting at RMB 3.9 million — roug…
I've been tracking the cost of rework when AI-coding assistants (Claude Code mostly) hit ambiguous specs or canon violations they can't see. Across six production projects over the last several months, I noticed the same pattern: I'd draft…
I'm a founder working in AI, and I've been helping companies build AI solutions and I see these same five problems with the AI Implementations: No spend visibility The Bedrock/OpenAI/Claude/ bill is one line item. Nobody knows which featur…
https://preview.redd.it/qjp2mksx3p0h1.png?width=1440&format=png&auto=webp&s=5c0f1a2b333e99bad70b9b6176941656e77aa129 prompt: make a youtube thumbnail for trailer 3 for gta 6