3code, the Economical Coding Agent (3code.capocasa.dev via hn)
It's so lean you can use it for free! When your subscription runs out, or you don't have one, keep vibecoding with 3code.
Claude Fable is relentlessly proactive (simonwillison.net)
Claude Fable is relentlessly proactive 11th June 2026 After two days of experience with Claude Fable 5 I think the best way to describe it is relentlessly proactive. It knows a whole lot of tricks and it will deploy pretty much any of them…
- Anthropic Walks Back Policy That Could Have 'Sabotaged' Researchers Using Claude (www.wired.com via hn)
The Role of Feedback Alignment in Self-Distillation (arxiv.org) discussed ↗
agribrain Give your AI assistant a licensed agronomist's brain. [DEMO GIF GOES HERE — 15s: real field coordinates → "What's the spray situation and water balance for my olives this week?" → real answer with numbers] Every LLM can write a p…
The gravity around a black hole is so extreme that nothing, not even light, can escape once it gets close enough. Astrophysicists like Chi-kwan Chan study black holes with computer simulations and observations.
ABC-Bench: An Agentic Bio-Capabilities Benchmark for Biosecurity (arxiv.org) discussed ↗
Claude ran out mid-debug and I wanted to throw my phone so I did this (www.reddit.com via reddit)
You're 2 hours into a problem. Claude actually understands your codebase, knows the file structure, remembers what you already tried.
-
58 items
event
WindsurfWindsurf 2.0 has been released with improved local and cloud agent integration and bug fixes. The update follows a series of announcements about AI tools and MCP servers, including gondola.ai's hotel search server and Stork for indexing over 14,000 AI tools.
- 10m Production-Grade Claude/AI Skills for Ruby on Rails
- 2d We built a free CLI to keep CLAUDE.md, slash commands, MCP servers, and skills in sync across machines
- 3d Project Brain– Persistent Second Brain for Claude Code and Windsurf, Cursor etc.
- 4d Midas: 100% local agent memory — no LLM at ingest, $0, nothing leaves the box (MCP + Python SDK)
- 6d VS Code extension that lets you switch AI agent harnesses/skills/prompts in one click (works with Claude Code, Github Copilot, Cursor, and Windsurf)
358 itemsevent
Anthropic MythosAnthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
- 3h Canceled my sub over the silent-sabotage guardrail, renewed when they walked it back
- 6h The Paradox of the "Dangerous" Product
- 8h What's new in CC 2.1.172 (+23,890 tokens)
- 10h Mythos-class models will diffuse throughout the world by 2029
- 17h Y2K Claude Mythos and the New Math of AI Vulnerability Discovery
Investing in multi-agent AI safety research (deepmind.google)
Superficial Beliefs in LLM Decision-Making (arxiv.org) discussed ↗
I've just realized why Claude is called Claude (www.reddit.com via reddit)
They are going to release an image generation model and call it Claude Monet. And they will release a music generation model and call it Claude Debussy.
- Claude just called me out and I deserved it (www.reddit.com)
datasette-agent 0.2a0 (simonwillison.net)
10th June 2026 Highlights from the release notes: - Tools can now ask the user questions mid-execution. Tools that declare a context parameter receive aToolContext object, andawait context.ask_user(...) can ask a yes/no, multiple-choice (o…
- datasette-agent 0.1a4 (simonwillison.net)
- Show HN: Datasette Agent (simonwillison.net via hn)
- datasette-agent 0.1a3 (simonwillison.net)
+2 more
- datasette-agent 0.1a2 (simonwillison.net)
- datasette-agent 0.1a1 (simonwillison.net)
AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis (arxiv.org) discussed ↗
could not extract summary
Steganography Without Modification: Hidden Communication via LLM Seeds (arxiv.org) discussed ↗
Will PowerPoint survive Claude? HTML help me tell a better story (www.reddit.com via reddit)
My reasoning is that a slide deck forces your story into fixed rectangles. An HTML deck reveals one idea per screen as you scroll, builds a chart in front of the audience, and lets you toggle scenarios live.
-
51 items
event
DeepmindGoogle DeepMind has released "Deep Research Max," advancing autonomous research agents, while also facing challenges and competition from other AI companies like Anthropic and Ineffable Intelligence. Meanwhile, DeepMind workers in the UK have voted to unionize, and former DeepMind architect Demis Hassabis is at the center of legal drama involving Elon Musk.
- 1d Google DeepMind is worried about what happens when millions of agents start to interact
- 1d Show HN: Magenta Real-Time Music Generation on iPhone, Without the GPU
- 2d The Great Reframing...
- 2d Show HN: VQAScore – open eval metric/reward model, now for text-to-video
- 7d Inside Google DeepMind: Reasoning, Omni, and Shipping Frontier AI
Breaking the Ice: Analyzing Cold Start Latency in vLLM (arxiv.org) discussed ↗
i maintain a small cli called brandmd that turns websites into DESIGN.md files for coding agents. last week it completely misread cognition.ai's blog design: mood: "Dark and moody".
llm 0.32a3 (simonwillison.net)
9th June 2026 Almost entirely written by the new Claude Fable 5, see my write-up for more details. Recent articles - Initial impressions of Claude Fable 5 - 9th June 2026 - Running Python code in a sandbox with MicroPython and WASM - 6th J…
Similarities between human psychopathology and errors in LLMs (www.nature.com via hn)
Abstract Two striking phenomena of the human mind encountered in mental healthcare are hallucinations and confabulations; perceiving things that are not there, or filling memory gaps with invented stories. Interestingly, contemporary artif…
Initial impressions of Claude Fable 5 (simonwillison.net)
Initial impressions of Claude Fable 5 9th June 2026 I didn’t have early access to today’s Claude Fable 5 release, but I’ve spent the past ~5.5 hours putting it through its paces. My initial impressions are that this is something of a beast.
Show HN: LiveHere – AI Videos, Self-Hosted Nvidia Cosmos on H200 GPUs (twitter.com via hn)
there is still time to submit - https://ship.builders/ build something cool, and i would love to catchup on the yacht - if you are in sf and coming.. ---- Yacht Hackathon by Composio, Nebius, Tavily - Product Demo Pitch: 30 seconds.
Qwen-Image-Flash: Beyond Objective Design (arxiv.org) discussed ↗
TripoSplat Generate 3D models from a single image I asked a coding agent to build a beautiful website showcasing the monuments of Paris as 3D Gaussian splats. I never opened an image generator.
CursorBar A lightweight macOS menu bar app that shows your Cursor plan usage and how much you have left this billing cycle. No browser tab, no manual cookie paste — CursorBar reads your session from the local Cursor IDE database and fetche…