1. Anthropic CEO Dario Amodei Has Only One Direct Report One of the most powerful AI chief executives has almost no direct reports at a moment when other tech leaders are widening their spans of control. For all of his influence at Anthropic…

  2. I'm working on a online game similar to GTA online but all the content is live-generated by players. Prompt your own sportscar, your own building, your own weapon, etc...

  3. I have a bunch of lifecycle hooks configured to run certain checks before proceeding into execution like checking related PRs in github (just an example). Strangely when I'm on local and running the session directly this is pretty reliable…

  4. I was debugging some code and LLM crashed out: ``` The debug_log config defaults to "debug.json" and creates a FileHandler — which appends by default. That file is a log of everything that happened, never cleared.

  5. paywalled

  6. Introduction Physics-based simulations are a staple in modern research and the development and availability of well-documented, verified, and efficient code, often tuned to take advantage of modern hardware, has firmly established computat…

  7. event

    Fine Tuning
    138 items

    Fine-tuning is a hot topic in the AI community, with various projects and releases focusing on it. Notable examples include OpenAI's decision to wind down its fine-tuning API, Anthropic co-founder Jack Clark's prediction that AI research could become automated by 2028, and several new datasets and models released for fine-tuning purposes.

    model roundup

    Opus 4.8
    107 items

    Claude AI has released Opus 4.8, an upgrade to their Opus class of models available in version 2.1.154 of their software on March 16, 2023, which includes enhanced coding and professional task capabilities along with improved judgment and honesty. Users are reporting usage resets following the update.

  8. Hello HN, this is the first time I am putting out a product, so I would like to share it and seek your feedback and suggestions. As a developer and student working with a remote team, we always need to share some context or code with a tea…

  9. Agentic coding and mental models I reckon I’ve drafted and then deleted a version of this post at least 10 times in the last 12 months. Deleted because it falls in the category “I must be wrong about this as everyone else is saying the opp…

  10. I want to offer a minority opinion about the recent hype. I’m tired of reading posts by Karpathy about ideas that were already known months earlier, and then treating them as if he just discovered something groundbreaking.

  11. I’ve been thinking a lot about AI agents that don’t just answer questions, but can actually look at a screen, understand what’s happening, and take actions inside Android apps or games. Not talking about another chatbot.

  12. had a weird experience recently with Claude. I asked it to help with a coding task.

  13. No laptop. No terminal.

  14. 110 items

    Claude Opus 4.6, Anthropic's flagship model, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, highlighting a significant regression in handling certain tasks. Meanwhile, biologists are revisiting cases of mushroom-induced hallucinations in China, suggesting ongoing research into natural causes of similar phenomena.

    344 items

    Anthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.

  15. Many customer service representatives sound very confident even when dealing with outdated documents, ambiguous evaluations, and incomplete quotations. The user experience is like expert advice - but the underlying data is often in a mess.

  16. OpenAI is mulling sharp price cuts to its artificial intelligence offerings, as it looks to woo consumers away from rival Anthropic, the Wall Street Journal reported Wednesday evening stateside, citing sources familiar with the matter. "Th…

  17. I have just sent a sort of "feature request" to ChatGPT (but my question is generally applicable to most LLMs). Here is the text, written by ChatGPT itself: Feature request: Optional temporal awareness per conversation/project.

  18. osint-mcp A self-hosted OSINT (Open-Source Intelligence) toolkit that runs five ways: as an MCP server, an interactive AI REPL, a CLI, a web app, and — via OpenClaw — straight from chat apps like WhatsApp, Telegram, and Discord. It bundles…

  19. I used the following prompt with Fable which went popular a while ago: ​ "can you use whatever resources you like, and python, to generate a short 'youtube poop' video and render it using ffmpeg? can you put more of a personal spin on it?

  20. https://preview.redd.it/qwdxfjiz1m6h1.png?width=1462&format=png&auto=webp&s=531587f8f907d00bec016adaa977c81594acf6d2 On the left: my website on the public internet. On the right, Cursor suggests a list of completions that exactly match the…

  21. model roundup

    Qwen 3.5
    13 items

    Qwen/Qwen3.5-4B is a 4 billion parameter model that integrates multimodal learning and efficient hybrid architecture for enhanced performance. Notably, community projects like Hitoku Draft showcase local AI assistants, while General Instinct focuses on frontier models for edge devices.

  22. I would like to know if Fable 5 is free on cursor until 22 June or only free using Claude Code ?

  23. Regarding AI agents, one question that I always ponder is: How exactly do they make money? The website uses AdSense.

  24. I’ve followed AI pretty closely for a while, and honestly, most releases lately have felt like the same thing in different packaging. Faster, smarter, bigger context windows.

  25. OpenUsage Community Track all your AI coding subscriptions in one place. OpenUsage Community is an independent, community-maintained continuation of the original OpenUsage project.

  26. 👻 Phantomix The open-source AI browser agent. Free alternative to OpenAI Operator.

  27. could not extract summary

  28. Fable is dam amazing, i don't feel like I need something better as a coding agent! Feel like I'm already burning out with adding new features to my project, as it's doing such a good job!