Cursor trails of people currently browsing the web (wewere.online via hn)
turning the internet into a living, shared space the cursor trails are from real people browsing install the extension to try it out! It's controversial to have hope for the internet these days.
How to use Fable without annihilating your tokens (www.reddit.com via reddit)
So I've been using Fable since it got out on Pro Mode and I think I found an issue that I think is the main reason people overuse tokens and reach limits fast. Everytime you prompt it, it rereads your whole conversation since the start.
The gravity around a black hole is so extreme that nothing, not even light, can escape once it gets close enough. Astrophysicists like Chi-kwan Chan study black holes with computer simulations and observations.
Superficial Beliefs in LLM Decision-Making (arxiv.org) discussed ↗
ORP – Turn AI agent failures into regression tests and tested lessons (github.com via hn)
Open Reflection Protocol (ORP) Turn agent failures into regression tests, reusable lessons, and measurable improvements. Tracing tells you what your agent did.
- Anthropic Walks Back Policy That Could Have 'Sabotaged' Researchers Using Claude (www.wired.com via hn)
-
333 items
event
SecurityOpenAI has released GPT-5.4-Cyber for testing as part of its Trusted Access for Cyber Defense program, aiming to compete with Anthropic's Claude Mythos in the cybersecurity domain. Meanwhile, concerns are rising over the potential risks associated with advanced AI models like Mythos, prompting calls for improved defenses before wider releases.
- 1h Fable's policy on no zero-day-retention is a serious problem for Enterprise customers
- 5h Y2K Claude Mythos and the New Math of AI Vulnerability Discovery
- 6h Visa Vulnerability Agentic Harness for Project Glasswing
- 6h Claude Fable 5: mid-tier results on coding tasks
- 10h Are we defaulting to VM-level sandboxing before understanding the threat model?
79 itemsmodel roundup
Gemma 4Gemma 4 is a family of open-source multimodal models from Google DeepMind, including sizes up to 31B parameters and featuring Dense and Mixture-of-Experts architectures. Notable community highlights include the release of Gemma 4 12B as an encoder-free unified model for laptops, its availability via llama-server on a RTX 5070 Ti GPU, and detailed visual guides showcasing its capabilities.
The Role of Feedback Alignment in Self-Distillation (arxiv.org) discussed ↗
datasette-agent 0.2a0 (simonwillison.net)
10th June 2026 Highlights from the release notes: - Tools can now ask the user questions mid-execution. Tools that declare a context parameter receive aToolContext object, andawait context.ask_user(...) can ask a yes/no, multiple-choice (o…
- datasette-agent 0.1a4 (simonwillison.net)
- Show HN: Datasette Agent (simonwillison.net via hn)
- datasette-agent 0.1a3 (simonwillison.net)
+2 more
- datasette-agent 0.1a2 (simonwillison.net)
- datasette-agent 0.1a1 (simonwillison.net)
I used Claude Fable to predict who will win the World Cup 🏆 (www.reddit.comhttps)
It's not accurate or scientific and doesn't exactly follow the rules of football but Fable did great work implementing my design and then helping to tweak the code. Actually I used Sonnet for the tweaking that since Fable had eaten nearly…
ABC-Bench: An Agentic Bio-Capabilities Benchmark for Biosecurity (arxiv.org) discussed ↗
How I avoided having to reexplain my company to Claude every single session (www.reddit.com via reddit)
I work in a small startup (MAAT), The team and I use AI (mostly claude) every day for work, and the biggest problem that me and the team was facing was repetition. The persistent context problem Every session started the same way.
Steganography Without Modification: Hidden Communication via LLM Seeds (arxiv.org) discussed ↗
llm 0.32a3 (simonwillison.net)
9th June 2026 Almost entirely written by the new Claude Fable 5, see my write-up for more details. Recent articles - Initial impressions of Claude Fable 5 - 9th June 2026 - Running Python code in a sandbox with MicroPython and WASM - 6th J…
DiffusionGemma: Discrete diffusion in a large language model (idlemachines.co.uk via hn)
Curated sets of ML problems around the papers, methods, and ideas getting attention right now.
-
353 items
event
Anthropic MythosAnthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
- 5h Codex usage grows after Fable nerf model release
- 6h Anthropic Said This AI Was Too Powerful for Public Release. Now Anyone Can Use It.
- 7h what is openai planning with the next release
- 11h Fable/Mythos safeguards are overly strict
- 16h Claude Fable 5 is the best AI model right now — and it's not even a debate
51 itemsevent
DeepmindGoogle DeepMind has released "Deep Research Max," advancing autonomous research agents, while also facing challenges and competition from other AI companies like Anthropic and Ineffable Intelligence. Meanwhile, DeepMind workers in the UK have voted to unionize, and former DeepMind architect Demis Hassabis is at the center of legal drama involving Elon Musk.
- 12h Google DeepMind is worried about what happens when millions of agents start to interact
- 1d Show HN: Magenta Real-Time Music Generation on iPhone, Without the GPU
- 1d The Great Reframing...
- 2d Show HN: VQAScore – open eval metric/reward model, now for text-to-video
- 6d Inside Google DeepMind: Reasoning, Omni, and Shipping Frontier AI
AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis (arxiv.org) discussed ↗
Kickbacks: An ad marketplace for coding agent spinners (twitter.com via hn)
Get paid to wait The Claude Code spinner might be the most watched line on Earth. So I turned it into an ad marketplace.
Breaking the Ice: Analyzing Cold Start Latency in vLLM (arxiv.org) discussed ↗
Show HN: Diffcat – a TUI for delightful Git diffs (github.com via hn)
Built this github style git diff TUI, hope others find it useful. Very focused in scope compared to other existing tools.
Investing in multi-agent AI safety research (deepmind.google)
So I ran Doom inside Claude.ai (twitter.com via hn)
So I just ran Doom on Claude Fable 5. https://t.co/Ja5ZMcPTTb
- Show HN: Doom Inside Claude Code (github.com via hn)
Qwen-Image-Flash: Beyond Objective Design (arxiv.org) discussed ↗
Initial impressions of Claude Fable 5 (simonwillison.net)
Initial impressions of Claude Fable 5 9th June 2026 I didn’t have early access to today’s Claude Fable 5 release, but I’ve spent the past ~5.5 hours putting it through its paces. My initial impressions are that this is something of a beast.
MCP Apps vs. Generative UI (www.openui.com via hn)
Vercel Security Checkpoint | sfo1::1781218905-o5QPt4YtRbLHaAHET9BnnDi1qMKSI1PI