Misspelt dangerous domain cladue.ai (www.reddit.com via reddit)
This morning, a few minutes ago, I wanted to go to the claude.ai website because Claude Desktop wouldn't connect for some reason. I didn't notice it, but uBlock Origin popped up blocking it.
Claude Fable is relentlessly proactive (simonwillison.net)
11th June 2026 After two days of experience with Claude Fable 5 I think the best way to describe it is relentlessly proactive. It knows a whole lot of tricks and it will deploy pretty much any of them…
The Role of Feedback Alignment in Self-Distillation (arxiv.org) discussed ↗
Auto mode for pi.dev. An LLM reviews your coding agent's commands (github.com via hn)
pi-auto-reviewer: Automatically review bash commands that your pi agent wants to execute - akin to Codex "Auto-review" and Claude Code "auto mode". How it works: Every bash command the agent wants to run goes through three tiers: | Tier | Ac…
datasette-agent 0.2a0 (simonwillison.net)
10th June 2026 Highlights from the release notes: - Tools can now ask the user questions mid-execution. Tools that declare a context parameter receive a ToolContext object, and await context.ask_user(...) can ask a yes/no, multiple-choice (o…
- datasette-agent 0.1a4 (simonwillison.net)
- Show HN: Datasette Agent (simonwillison.net via hn)
- datasette-agent 0.1a3 (simonwillison.net)
- datasette-agent 0.1a2 (simonwillison.net)
- datasette-agent 0.1a1 (simonwillison.net)
ABC-Bench: An Agentic Bio-Capabilities Benchmark for Biosecurity (arxiv.org) discussed ↗
194 items
model roundup
GPT 5.5
On [Date], a significant leak of the OpenAI Codex model, referred to as GPT-5.5, was captured on video before it was patched. The incident involved models named Arcanine and Glacier-alpha.
- 13m OMG Fable one-shotting everything
- 4h Fable 5 added to the Artificial Analysis Coding Agent Index... barely 1 point ahead of GPT-5.5 ???
- 11h Ask HN: Why not compare Fable 5 with GPT "Pro"? Why compare with GPT xhigh?
- 14h GPT Memory Audit - Copy/Paste
- 15h Differences Between Claude Opus 4.8 and Claude Fable 5 on MineBench
358 items
event
Anthropic Mythos
Anthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
- 1h Canceled my sub over the silent-sabotage guardrail, renewed when they walked it back
- 4h The Paradox of the "Dangerous" Product
- 6h What's new in CC 2.1.172 (+23,890 tokens)
- 8h Mythos-class models will diffuse throughout the world by 2029
- 15h Y2K Claude Mythos and the New Math of AI Vulnerability Discovery
Investing in multi-agent AI safety research (deepmind.google)
Superficial Beliefs in LLM Decision-Making (arxiv.org) discussed ↗
Steganography Without Modification: Hidden Communication via LLM Seeds (arxiv.org) discussed ↗
As the post states, I am having Claude build my mileage reimbursement sheet weekly. However, it is unable to use my Google Maps API key to get the data needed to determine how many driving miles are between two points.
TripoSplat: Generate 3D models from a single image. I asked a coding agent to build a beautiful website showcasing the monuments of Paris as 3D Gaussian splats. I never opened an image generator.
AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis (arxiv.org) discussed ↗
Hi everyone, I'm curious about the real-world adoption of Claude Desktop in corporate environments. I've recently been experimenting with Claude Desktop, MCP (Model Context Protocol) servers, Power BI semantic models, and automation workfl…
77 items
model roundup
Gemma 4
Gemma 4 is a family of open-source multimodal models from Google DeepMind, including sizes up to 31B parameters and featuring Dense and Mixture-of-Experts architectures. Notable community highlights include the release of Gemma 4 12B as an encoder-free unified model for laptops, its availability via llama-server on an RTX 5070 Ti GPU, and detailed visual guides showcasing its capabilities.
51 items
event
DeepMind
Google DeepMind has released "Deep Research Max," advancing autonomous research agents, while also facing challenges and competition from other AI companies like Anthropic and Ineffable Intelligence. Meanwhile, DeepMind workers in the UK have voted to unionize, and former DeepMind architect Demis Hassabis is at the center of legal drama involving Elon Musk.
- 22h Google DeepMind is worried about what happens when millions of agents start to interact
- 1d Show HN: Magenta Real-Time Music Generation on iPhone, Without the GPU
- 2d The Great Reframing...
- 2d Show HN: VQAScore – open eval metric/reward model, now for text-to-video
- 7d Inside Google DeepMind: Reasoning, Omni, and Shipping Frontier AI
llm 0.32a3 (simonwillison.net)
9th June 2026 Almost entirely written by the new Claude Fable 5, see my write-up for more details.
Posting this because I've gone in circles on it and want to hear from people doing the same. My setup has the usual stuff, runs in bypassPermissions so it doesn't stop me for routine work, a bash firewall on PreToolUse that blocks the dest…
Breaking the Ice: Analyzing Cold Start Latency in vLLM (arxiv.org) discussed ↗
The gravity around a black hole is so extreme that nothing, not even light, can escape once it gets close enough. Astrophysicists like Chi-kwan Chan study black holes with computer simulations and observations.
i maintain a small cli called brandmd that turns websites into DESIGN.md files for coding agents. last week it completely misread cognition.ai's blog design: mood: "Dark and moody".
Initial impressions of Claude Fable 5 (simonwillison.net)
9th June 2026 I didn’t have early access to today’s Claude Fable 5 release, but I’ve spent the past ~5.5 hours putting it through its paces. My initial impressions are that this is something of a beast.
Show HN: Resolve Discourse Forum Issues Faster with AI Agents (news.ycombinator.com)
https://seaticket.ai/discourse-forum-issues-resolving/
Qwen-Image-Flash: Beyond Objective Design (arxiv.org) discussed ↗
We built a travel MCP that lets Claude search and book 2.2M+ hotels (www.reddit.com via reddit)
Most AI agents can research & write. Very few can transact.
- Anthropic Walks Back Policy That Could Have 'Sabotaged' Researchers Using Claude (www.wired.com via hn)