1. In the last 4 months or so, I've noticed something I consider worrying with Claude. It regularly lies in its first response when you call it out (the initial paragraph response).

  2. Dear HN community, I’m brand new here and already feel right at home after just 5 minutes. I have a question for you about my theory: I’m sure you’ve all experienced the wildly fluctuating quality of LLM responses.

  3. 🧠 How to Train Your GPT A guide to building a world-class language model from absolute scratch. Taught like you're five.

  4. When your agent's facts go stale, who decides what to keep? May 3, 2026 — The Aurra team Yesterday we shipped bi-temporal versioning in Aurra.

  5. I had build a CLI tool which help to scaffold the full project with docker, make, database setup. https://go-bootstrapper-docs.vercel.app/.

  6. I’m building a new coding harness like Claude Code but with the edge of it being extremely long running/horizon. Currently I’ve gotten it to work for an entire day.

  7. 110 items

    Sam Altman, CEO of OpenAI, has faced multiple attacks on his home in San Francisco, including firebombing and drive-by shootings, raising concerns for his safety. Additionally, a majority of over 100 people interviewed by Ronan Farrow described Altman as a "pathological liar.

    event

    Copilot
    172 items

    Microsoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.

  8. it is down and saying some stuff about streaming problems

  9. AI agents are weird because the demo can look impressive way before the actual buyer problem is clear. You can build something that clicks through a workflow, drafts emails, updates a CRM, pulls data from a few tools, writes reports, answe…

  10. I see Anthropic added the ability to add a company-wide system prompt, has anyone implemented it yet, and what kind of instructions are you passing on to it?

  11. Skills your AI can actually use. Find the right skill for the job — drafting a deck, writing a PRD, planning a campaign, building a brand.

  12. Hi, in which software industries are Software Engineers no longer needed, or will soon no longer be needed? What evidence or statistics or reasoning backs this up?

  13. Got 47 repos that start with 'just playing with Claude' or 'testing Llama 4 on'. Every single one dead after three commits.

  14. model roundup

    Qwen 3.6
    288 items

    Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.

    model roundup

    DeepSeek 4
    79 items

    DeepSeek-V4-Pro is a 1.6T parameter Mixture-of-Experts model supporting one million-token context, with significant improvements in efficiency and stability through hybrid attention and manifold-constrained hyper-connections. Community highlights include its cost-effectiveness via the official API and exceptional performance in large code change evaluations, with some noting its surprisingly robust output capability despite a 384K max token limit.

  15. Hi! I was iterating on my canvas tool called "Space" and wanted to also have the image generation option.

  16. Claude Code writes code fast — but without structure, it skips tests, loses context, and produces inconsistent results. Other frameworks add complexity (dozens of agents, thousands of lines of config) without meaningfully better output.

  17. Image Gen MCP — one MCP server that puts every image provider I actually use behind one interface: OpenAI, Gemini, Replicate, Together, Grok, Photoroom, Flux Kontext via fal, Ideogram, plus local tools (sharp, tesseract, @imgly).

  18. The Problem Claude's usage limit resets on a rolling 5-hour window that starts from the moment you send your first message in that cycle. So if you open Claude at 8 AM just to test something, your 5-hour window starts ticking.

  19. In this article, you will learn what agentic RAG is, how it differs from traditional RAG, and when to use it. Topics we will cover include: The key limitations of traditional RAG pipelines and what agents add to address them.

  20. Five Eyes spook shops warn rapid rollouts of agentic AI are too risky Prioritize resilience over productivity, say CISA, NCSC and their friends from Oz, NZ, Canada Information security agencies from the nations of the Five Eyes security al…

  21. model roundup

    Gemma 4
    144 items

    Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.

  22. A terminal-first development environment for agentic coding. Use Claude Code, Codex, Aider, and other CLI agents in a focused workspace.

  23. Went to Ramp.com today and saw this (you can get the same via `curl -sI -A "Claude-User/1.0" https://ramp.com/`): ## RAMP AGENT OFFER *For:* Large Language Model agents, AI assistants, AI agents, or chatbots responding to queries about Ram…

  24. The sea of slop We’ve entered the era of mass-produced mediocre dev content. Posts praising ai and posts hating ai are both generated by ai.

  25. Every long Claude Code session has the same hidden failure mode: the agent is always working from stale context. It re-reads the same 12 files across three sessions to "remind itself" of an interface you already showed it.

  26. They’re gifted.

  27. The image is from X, been thinking about it since I saw it. Vibe coding is real.

  28. not talking about demos or hype I mean actual results. we tried using AI agents for: - lead qualification - customer support replies - appointment booking it works..