1. Hi all, RipStop is a Node package implementing a set of rules that consumers can use to protect their repos from the wilder actions of LLM agents. A consumer needs only a few lines of code to configure the rules they wish to apply.

  2. The pain point is real: LLM agents and the npm ecosystem. The timing looks perfect.

  3. Morning Everyone! Big one today (104 changes!): Claude Code just went async.

  4. I think the most interesting AI use cases right now aren’t the flashy demos; they’re the weird internal AI employees people quietly build for their businesses. For example, I saw a Reddit post from an ecommerce operator who built what was bas…

  5. model roundup

    Gemma 4
    166 items

    Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.

    event

    Security
    177 items

    OpenAI has released GPT-5.4-Cyber for testing as part of its Trusted Access for Cyber Defense program, aiming to compete with Anthropic's Claude Mythos in the cybersecurity domain. Meanwhile, concerns are rising over the potential risks associated with advanced AI models like Mythos, prompting calls for improved defenses before wider releases.

  6. Might be a noob question. Suppose I get Claude Design (CD) to produce a mockup.

  7. Hey all, we're a three-person AI consultancy that just passed initial review for the Claude Partner Network. This was quite unexpected, but we're excited about it.

  8. MOST POPULAR EVENTS - Securing the Untrusted Agentic Development Layer Join us to learn how to architect a development environment where your builders and their agents can move fast and securely. - Toxic Flows: When Your AI Agent Skill Bec…

  9. I lost hundreds of dollars following Claude’s health insurance recommendations. I explained my situation completely, but Claude never asked what healthcare services I actually use before confidently recommending I buy insurance.

  10. model roundup

    Sonnet 4.5
    13 items

    On May 4, 2026, multiple automated status updates reported elevated errors for Claude Opus 4.5 and Sonnet 4.5 around the same time, with Anthropic introducing a feature called E-STEER that applies emotion intervention to these models.

    55 items

    Claude Opus 4.6, Anthropic's flagship model, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, highlighting a significant regression in handling certain tasks. Meanwhile, biologists are revisiting cases of mushroom-induced hallucinations in China, suggesting ongoing research into natural causes of similar phenomena.

  11. Jotform Claude App Build Jotform forms directly in Claude using simple prompts. This integration connects your forms to Claude, allowing you to generate, edit, and manage them conversationally.

  12. I am currently looking to get into automation for the German Mittelstand. I am now talking to an SME that got an offer from a consulting firm for document processing automations, and I am trying to figure out if the pricing is normal or inflate…

  13. It reached version 1.6 and now covers more than 80% of the standard. alisp ships with ASDF and is capable of loading many real-world systems. Let me know if your favorite system succeeds!

  14. Agent FM Ambient radio for AI coding agents on macOS. Agent FM turns every Claude Code and Codex session into a live radio station.

  15. model roundup

    Qwen 3.6
    392 items

    Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.

    model roundup

    Opus 4.6
    90 items

    Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.

  16. Hi there! I was playing around with Ollama and LM Studio, testing local models, and had the idea of letting Claude evaluate a few models on their actual capabilities rather than doing it myself.

  17. building a zine-making app (90s/y2k aesthetic, hot pink, chunky outlines, all that). the templates are real designed layouts (y2k chat bubbles, riot grrrl flyer collages, myspace-style pages).

  18. A few months ago I was a traditional magazine editor with zero coding background. This year I somehow ended up building and launching my first iOS app using Claude and Claude Code.

  19. been running agents in production for a while now and the failure handling question keeps coming up. in testing agents fail cleanly.

  20. model roundup

    GPT 5.5
    133 items

    On [Date], a significant leak of the OpenAI Codex model, referred to as GPT-5.5, was captured on video before it was patched. The incident involved models named Arcanine and Glacier-alpha.

    model roundup

    Qwen 3
    3 items

    Qwen3-0.6B is a large language model in the Qwen3 series, which spans dense and mixture-of-experts architectures. The series is noted for reasoning, instruction-following, and multilingual support, with seamless switching between thinking and non-thinking modes. Community feedback suggests it's favored for default chat and coding tasks over other models like Llama 3, though specific benchmarks are not provided.

  22. Hello everyone, I'm working on an APK project and I've run into a few problems with notifications, etc. If anyone can help me fix the code and get the APK working, I'd appreciate it. I'm currently using Android Studio.

  23. Hi HN — my previous post got flagged for some reason so re-posting to spread the word as well as get some actionable feedback. When I was a kid and was playing soccer in my home town, my Dad had an idea - what if there was a correlation be…

  24. Been using Claude Team with a client and the experience is not good - is it just me? The #1 reason is the inability to work in accept-all mode as you can with an individual account.

  26. A seemingly insignificant question at the moment, but it might become important in the future: If an AI agent recommends a product, a tool, or a service, and its developer can earn revenue through click-through rates, registration numbers,…