1. I built a 24/7 AI radio station called WRIT-FM where ChatGPT/Claude is the entire creative engine. Not a demo — it's been running continuously, generating all content in real time.

  2. Before adding another feature, I like using a "receipt test." Write down what a user should be able to prove after using the app for 5 minutes. Not "the dashboard looks nice" or "the AI responded." A real receipt: a file was uploaded and c…

  3. I’ve set up Claude on my VPS, but when I try to log in, I get this issue. Even though I entered the correct code, it still shows this error.

  4. Open standard. Open runtime.

  5. event

    Security
    164 items

    OpenAI has released GPT-5.4-Cyber for testing as part of its Trusted Access for Cyber Defense program, aiming to compete with Anthropic's Claude Mythos in the cybersecurity domain. Meanwhile, concerns are rising over the potential risks associated with advanced AI models like Mythos, prompting calls for improved defenses before wider releases.

    model roundup

    ChatGPT 5.5
    6 items

    Mathematician Timothy Gowers used ChatGPT 5.5 Pro to solve complex mathematical problems and discussed potential crises in research. Meanwhile, users tested various AI models for everyday tasks, finding both impressive capabilities and persistent errors.

  6. Has anyone else noticed how some Claude Code sessions cost you a few cents and others somehow burn through actual dollars and you can't really tell why after the fact? I kept hitting this — was it retry loops, was it the agent re-reading t…

  7. Tried using Claude Code with NVIDIA APIs today and honestly it was way more fun than I expected. The workflow felt surprisingly smooth for testing AI stuff quickly without overcomplicating everything.

  8. I’m a ciswoman and, wow, Claude REGULARLY thinks I’m a man. Here’s an example.

  9. MachinaCheck: Building a Multi-Agent CNC Manufacturability System on AMD MI300X The Problem We Solved Walk into any small CNC machine shop and ask the manager how they decide whether to accept a customer job. The answer is almost always th…

  10. model roundup

    Opus 4.6
    89 items

    Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.

    model roundup

    Qwen 3.6
    375 items

    Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.

  11. It started out as a way for me to freshen up my C++ skills during COVID. But life got in the way and it was put on ice.

  12. Retrieving only semantically similar memories with vector search is not sufficient to build a holistic context to feed to an LLM. Most memory systems rely on that first idea, pure vector search. While running an experiment I…

  13. I've been researching how teams handle cost and FinOps for agent systems in production. Token bills get unpredictable fast, and most tooling stops at per-call or per-agent attribution, which doesn't tell you much about why the bill jumped.

  14. Hello, newbie here. Does one need to pay for Claude Code to build a "Second Brain?" (One that combines Claude with Obsidian)

  15. model roundup

    DeepSeek 4
    104 items

    DeepSeek-V4-Pro is a 1.6T-parameter Mixture-of-Experts model supporting a one-million-token context, with significant improvements in efficiency and stability through hybrid attention and manifold-constrained hyper-connections. Community highlights include its cost-effectiveness via the official API and exceptional performance in large code change evaluations, with some noting its surprisingly robust output capability despite a 384K max token limit.

    model roundup

    Gemma 4
    164 items

    Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.

  16. I've been using Claude Code in full agentic mode for two months — not just autocomplete, but letting it write features, run tests, read CI output, and push fixes. Around 50K lines of production code.

  17. How should I fix this? Claude’s own suggestions are failing. I added $5 for the API key to extract from my Beehiiv newsletter, and it’s still failing. What could be the real problem here?

    • Help (www.reddit.com)
  18. Has anyone else found Claude stronger than ChatGPT/Codex for UX critique? In a recent test, I asked both to review a wikil…

  19. ‘AI gave me your number’: The new trend turning ChatGPT hallucinations into harassment Known as ‘AI doxxing’, victims say popular chatbots are sharing their personal phone numbers with strangers. Anthony Cuthbertson looks at how criminals…

  20. model roundup

    Opus 4.7
    332 items

    Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.

  21. Like a lot of people experimenting with vibe coding and AI agents lately, I’ve been trying to understand why models keep ignoring explicit instructions, constraints, and requirements even when those rules are written clearly. Today Opus sa…

  22. I built HTML Drive this weekend: a personal Drive Claude can save to. Sign in with Google, then ask Claude to make HTML and it lands in your account, versioned, shareable, and with its own URL.

  23. Our MCP server provides scientific knowledge and action grounding for your AI agents, based on top-tier research literature and timeless research best practices/advice given by notable scholars.

  24. Hi HN, sharing a learning tool I built which pairs an LLM generated learning space with interactive visuals and small games. The part which I am still working on is getting an agent to reliably generate good visuals without intervention.

  25. Hi, I got this email asking me to give permission to GitHub cursor. I had to log into my GitHub account which makes me wor…

  26. We’re hosting the biggest Claude Code Prompt-a-thon at the AI x Marketing Summit in SF on May 28–29. For 36 hours, you’ll actually build with AI: • Claude Code • Humanic • n8n • MCPs • Figma Make • AI workflows for SEO, ads, lifecycle, out…