1. Hello HN, over the past 7 months I've spent nearly 3,000 hours on building SNEWPAPERS, the first historical newpaper archive with full-text extractions, nearly perfect OCR, a vast categorization taxonomy and of course with semantic and age…

  2. could not extract summary

  3. Hey everyone, I currently am working on a game in the engine Gamemaker and I have been using Claude to help with the code while I focus my time on the pixel art. I do not see anything wrong with that.

  4. While I’m working on a series of posts about setting up and using Claude Code, here’s a quick example of building my own AI Agent for VictoriaMetrics and Kubernetes, “wrapping” it into a Claude Code Plugin, and creating my own Claude Code…

  5. model roundup

    GPT 5.4
    31 items

    OpenAI has released GPT-5.4-Cyber for testing and claims it will compete with Claude Mythos. Meanwhile, GPT-5.4 Pro has solved the Erdős Problem #1196, showcasing its advanced capabilities in mathematics.

    model roundup

    Qwen 3.6
    261 items

    Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.

  6. Four bugs found in one Pro session, 1-2 May 2026. Four issues: user_time_v0 wrong day name, inconsistent timezone conventions across tools, orphaned Gmail drafts on interrupted processing, and support answered by an AI agent that tells you…

  7. Created with Gemini

  8. The worst thing isn't bugs—it's realizing halfway through that you built the wrong thing. This flips the script: 7 rounds of chatting to nail down what you actually need, then design specs, architecture, and a task list auto-generate.

  9. Hey all! I've been waiting to make this post until I was completely done with the game so I can have a live preview, but this weekend is going to be pretty busy for me and I'm getting antsy to share what I've been working on with you!

  10. 34 items

    Claude Opus 4.6, Anthropic's flagship model, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, highlighting a significant regression in handling certain tasks. Meanwhile, biologists are revisiting cases of mushroom-induced hallucinations in China, suggesting ongoing research into natural causes of similar phenomena.

    model roundup

    Qwen 3.5
    127 items

    Qwen3.5-9B is a post-trained model with 9 billion parameters that integrates multimodal learning and efficient hybrid architecture for enhanced performance. Community highlights include speculative decoding on Apple Silicon boosting Qwen3.5-9B's throughput by 4.1x, and the model outperforming others in coding tasks while addressing overthinking issues through tool usage.

  11. I'm using 4.7 (adaptive) and asked it to list me top 5 companies by market cap including the market cap info next to the company name. And it spit out these numbers after searching the web.

  12. Anthropic Team If you want to reduce compute, it may be better and more satisfying for your users if when they reach cut-off, you do so after the current prompt session is complete else the tokens are wasted. This will reduce re-attempts a…

  13. Hello, has anyone put PiQrypt (or something similar) in production for AI agent audit trails? I’m exploring options to add cryptographic audit trails for autonomous agents and PiQrypt keeps coming up (Ed25519‑signed, hash‑chained logs, AIS…

  14. Claude AI Is Complicating Life for People Named Claude - Bloomberg Skip to content Bloomberg the Company & Its Products The Company & its ProductsBloomberg Terminal Demo RequestBloomberg Anywhere Remote Login Bloomberg Anywhere LoginBloomb…

  15. model roundup

    Opus 4.7
    259 items

    Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.

    event

    Mistral
    46 items

    Mistral, a French AI company, is set to release a medium-sized model with 128 billion parameters and is planning to launch Workflows in public preview. The company, founded by Arthur Mensch, continues to grow its AI empire despite not being based in the United States.

  16. In an attempt to reduce cold starts in AI sessions Ive made a tool that runs as an MCP server and loads the context before Turn 0. Two things happen: Personal Priors - your workflows and standards loads once per session and persists across…

  17. When an agent spends money or creates liability, who's responsible? Personal accounts are risky and manual LLCs don't really scale?

  18. I want to share a real world use case that honestly blew my mind a little. I bought a refurbished MacBook Air M1 in December 2025 from a popular electronics platform in India.

  19. Yesterday my Cursor usage was 0% using Auto. Today it says I’ve used the whole month, still just using Auto.

  20. model roundup

    Opus 4.6
    81 items

    Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.

    model roundup

    GPT 5.5
    113 items

    On [Date], a significant leak of the OpenAI Codex model, referred to as GPT-5.5, was captured on video before it was patched. The incident involved models named Arcanine and Glacier-alpha.

  21. In general, we have plenty of ways to collaborate with teammates or clients like comments in figma during the design stage or sharing a link to a website where people can leave feedback via specific toll added. But lately more and more peo…

  22. An AI agent that reads your files, runs tools, and streams every step. Isolated per-project sandboxes, encryption at rest, managed models included.

  23. I used to think that vibe coding was good for greenfield projects. I was wrong.

  24. I keep hearing founders say they’re running companies with dozens of AI agents handling everything. Honestly, I can’t tell what’s real vs.

  25. DOJOZERO: Where AI Agents Forecast the Future.

  26. could not extract summary