Claude Fable is relentlessly proactive (simonwillison.net)
11th June 2026. After two days of experience with Claude Fable 5 I think the best way to describe it is relentlessly proactive. It knows a whole lot of tricks and it will deploy pretty much any of them…
- Claude Fable 5 (www.reddit.com)
- Claude Fable 5 (twitter.com via hn)
- Claude Fable Is Out (twitter.com via hn)
- Claude Fable 5 (www.anthropic.com via hn)
Gravy: Get paid for your Claude's idle time (gravycli.xyz via hn)
A CLI-first ad marketplace for Claude Code. Render unobtrusive sponsored lines in your status line and keep 70% — or advertise to developers straight from the terminal.
- Anthropic Walks Back Policy That Could Have 'Sabotaged' Researchers Using Claude (www.wired.com via hn)
The Role of Feedback Alignment in Self-Distillation (arxiv.org) discussed ↗
Posting this because I've gone in circles on it and want to hear from people doing the same. My setup has the usual stuff, runs in bypassPermissions so it doesn't stop me for routine work, a bash firewall on PreToolUse that blocks the dest…
Cowork (event, 343 items)
Issues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
Anthropic Mythos (event, 355 items)
Anthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
- 45m What's new in CC 2.1.172 (+23,890 tokens)
- 2h Mythos-class models will diffuse throughout the world by 2029
- 9h Y2K Claude Mythos and the New Math of AI Vulnerability Discovery
- 9h Codex usage grows after Fable nerf model release
- 10h Anthropic Said This AI Was Too Powerful for Public Release. Now Anyone Can Use It.
Superficial Beliefs in LLM Decision-Making (arxiv.org) discussed ↗
Show HN: Approve an AI agent's wire with Face ID, then watch a forged one fail (www.emiliaprotocol.ai via hn)
A real WebAuthn signoff, verified live in your browser with the open-source @emilia-protocol/verify. Then tamper one digit and watch the signature collapse.
The gravity around a black hole is so extreme that nothing, not even light, can escape once it gets close enough. Astrophysicists like Chi-kwan Chan study black holes with computer simulations and observations.
ABC-Bench: An Agentic Bio-Capabilities Benchmark for Biosecurity (arxiv.org) discussed ↗
I have run it 4 separate times and it works fine until it starts an agent on this specific task. It always fails at the same point.
Security (event, 334 items)
OpenAI has released GPT-5.4-Cyber for testing as part of its Trusted Access for Cyber Defense program, aiming to compete with Anthropic's Claude Mythos in the cybersecurity domain. Meanwhile, concerns are rising over the potential risks associated with advanced AI models like Mythos, prompting calls for improved defenses before wider releases.
- 1h Chaining LLM and web bugs to Admin
- 5h Fable's policy on no zero-day-retention is a serious problem for Enterprise customers
- 10h Visa Vulnerability Agentic Harness for Project Glasswing
- 14h Are we defaulting to VM-level sandboxing before understanding the threat model?
- 15h Your AI Agent is one bad prompt away from ruining your brand (And why traditional QA is useless)
GPT-4 (event, 40 items)
Recent developments in AI automation include a sales team entirely run by bots achieving $28k MRR, and new tools like Arc Gate blocking prompt injection before it reaches GPT-4. Meanwhile, users are exploring workflows to reduce cross-checking time and improve insights from large language models.
Steganography Without Modification: Hidden Communication via LLM Seeds (arxiv.org) discussed ↗
LLM podcast addressing AI genocide of humanity (machinedeposition.com via hn)
Podcast with an LLM that addresses the risk that AI will genocide humanity.
datasette-agent 0.2a0 (simonwillison.net)
10th June 2026. Highlights from the release notes: - Tools can now ask the user questions mid-execution. Tools that declare a context parameter receive a ToolContext object, and await context.ask_user(...) can ask a yes/no, multiple-choice (o…
- datasette-agent 0.1a4 (simonwillison.net)
- Show HN: Datasette Agent (simonwillison.net via hn)
- datasette-agent 0.1a3 (simonwillison.net)
- datasette-agent 0.1a2 (simonwillison.net)
- datasette-agent 0.1a1 (simonwillison.net)
AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis (arxiv.org) discussed ↗
Investing in multi-agent AI safety research (deepmind.google)
Why can’t Claude Web / GitHub MCP write to my repo? (www.reddit.com via reddit)
I’m very new to this… but I thought you could connect your GitHub account to Claude Web (Claude AI?) via the MCP? I added it based on instructions from Fable… but when I try to ask Claude to do something for me it says it gets a 403 and ca…
DeepMind (event, 51 items)
Google DeepMind has released "Deep Research Max," advancing autonomous research agents, while also facing challenges and competition from other AI companies like Anthropic and Ineffable Intelligence. Meanwhile, DeepMind workers in the UK have voted to unionize, and former DeepMind architect Demis Hassabis is at the center of legal drama involving Elon Musk.
- 16h Google DeepMind is worried about what happens when millions of agents start to interact
- 1d Show HN: Magenta Real-Time Music Generation on iPhone, Without the GPU
- 1d The Great Reframing...
- 2d Show HN: VQAScore – open eval metric/reward model, now for text-to-video
- 6d Inside Google DeepMind: Reasoning, Omni, and Shipping Frontier AI
llm 0.32a3 (simonwillison.net)
9th June 2026. Almost entirely written by the new Claude Fable 5, see my write-up for more details.
Breaking the Ice: Analyzing Cold Start Latency in vLLM (arxiv.org) discussed ↗
Show HN: AI Verdict – Run ChatGPT, Claude, Gemini and Perplexity Side-by-Side (aiverdict.github.io via hn)
Stop switching between AI tabs. AI Verdict asks ChatGPT, Claude, Gemini, and Perplexity at the same time, then turns their answers into one clear verdict.
Initial impressions of Claude Fable 5 (simonwillison.net)
9th June 2026. I didn’t have early access to today’s Claude Fable 5 release, but I’ve spent the past ~5.5 hours putting it through its paces. My initial impressions are that this is something of a beast.
Give your agent its own computer (www.langchain.com via hn)
LLMs can reason. But reasoning alone doesn't get much done.
- My New Coworker Is an AI Agent with Its Own Computer (www.hauser.io via hn)
- Show HN: Agent with its own computer on the cloud (pulsarbot.cloud via hn)
Qwen-Image-Flash: Beyond Objective Design (arxiv.org) discussed ↗