The gravity around a black hole is so extreme that nothing, not even light, can escape once it gets close enough. Astrophysicists like Chi-kwan Chan study black holes with computer simulations and observations.
MCP tools groups. How Datadog proved the pattern (www.speakeasy.com via hn)
AI & MCP MCP tool filtering: how Datadog proved the pattern and we shipped the solution Nolan Di Mare Sullivan June 11, 2026 - 6 min read Tool Filtering Documentation Learn how to configure tag-based tool filtering for your Speakeasy-hoste…
- Anthropic Walks Back Policy That Could Have 'Sabotaged' Researchers Using Claude (www.wired.com via hn)
ABC-Bench: An Agentic Bio-Capabilities Benchmark for Biosecurity (arxiv.org) discussed ↗
-
120 items
model roundup
Opus 4.8Claude AI has released Opus 4.8, an upgrade to their Opus class of models available in version 2.1.154 of their software on March 16, 2023, which includes enhanced coding and professional task capabilities along with improved judgment and honesty. Users are reporting usage resets following the update.
395 itemsevent
CopilotMicrosoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
- 32m How is Claude better at Word/Excel than Microsoft/Copilot?
- 3h Best practices for a newbie transitioning from GitHub Copilot to Claude?
- 3h Most Underrated AI Apps & Tools in 2026? Here's what deserves more attention.
- 6h Gen AI website traffic share update: OpenAI will go under 50% this year
- 8h M365 toolkit custom agent cost consumption
Kickbacks: An ad marketplace for coding agent spinners (twitter.com via hn)
Get paid to wait The Claude Code spinner might be the most watched line on Earth. So I turned it into an ad marketplace.
Superficial Beliefs in LLM Decision-Making (arxiv.org) discussed ↗
Investing in multi-agent AI safety research (deepmind.google)
Advanced Vedic Astrology Prompt for research purpose (System + Modifier prompt) (www.reddit.com via reddit)
After my last post 'Ai astrologer vs Real astrologer', many have reached out to learn more about prompts. Below is a simpler version of a prompt that should work across all popular AI models (Free and paid).
The Role of Feedback Alignment in Self-Distillation (arxiv.org) discussed ↗
He Hacked Teslas for Elon Musk. Now He's Launching a $100M AI Cyber Agent (www.forbes.com via hn)
Yoni Ramon has been one of Elon Musk’s favorite cybersecurity guys for over a decade. He led the in-house hacking team at Tesla for six years, breaking into vehicles, robots and solar products to find their weaknesses and fix them.
-
341 items
event
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
352 itemsevent
Anthropic MythosAnthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis (arxiv.org) discussed ↗
datasette-agent 0.2a0 (simonwillison.net)
10th June 2026 Highlights from the release notes: - Tools can now ask the user questions mid-execution. Tools that declare a context parameter receive aToolContext object, andawait context.ask_user(...) can ask a yes/no, multiple-choice (o…
- datasette-agent 0.1a4 (simonwillison.net)
- Show HN: Datasette Agent (simonwillison.net via hn)
- datasette-agent 0.1a3 (simonwillison.net)
+2 more
- datasette-agent 0.1a2 (simonwillison.net)
- datasette-agent 0.1a1 (simonwillison.net)
Steganography Without Modification: Hidden Communication via LLM Seeds (arxiv.org) discussed ↗
Would anyone try making this for me? (www.reddit.com via reddit)
So I often test new AI models with a certain prompt, but I can for some reason not get access to Claude Fable 5. Would anyone by any chance be interested in trying it for me?
llm 0.32a3 (simonwillison.net)
9th June 2026 Almost entirely written by the new Claude Fable 5, see my write-up for more details. Recent articles - Initial impressions of Claude Fable 5 - 9th June 2026 - Running Python code in a sandbox with MicroPython and WASM - 6th J…
Breaking the Ice: Analyzing Cold Start Latency in vLLM (arxiv.org) discussed ↗
-
51 items
event
DeepmindGoogle DeepMind has released "Deep Research Max," advancing autonomous research agents, while also facing challenges and competition from other AI companies like Anthropic and Ineffable Intelligence. Meanwhile, DeepMind workers in the UK have voted to unionize, and former DeepMind architect Demis Hassabis is at the center of legal drama involving Elon Musk.
- 10h Google DeepMind is worried about what happens when millions of agents start to interact
- 22h Show HN: Magenta Real-Time Music Generation on iPhone, Without the GPU
- 1d The Great Reframing...
- 2d Show HN: VQAScore – open eval metric/reward model, now for text-to-video
- 6d Inside Google DeepMind: Reasoning, Omni, and Shipping Frontier AI
TripoSplat Generate 3D models from a single image I asked a coding agent to build a beautiful website showcasing the monuments of Paris as 3D Gaussian splats. I never opened an image generator.
Shall we play a game? – LLMs use tactical nukes in 95% of simulations (www.kennethpayne.uk via hn)
Shall we play a game? My AI nuclear simulation is out now, and it's a WOPR.
Building a Personal RAG Chatbot in a Few Days (e-mahmoudi.me via hn)
Building a Personal RAG Chatbot in a Few Days: Learning by Engineering How I built a small personal RAG chatbot using FastAPI, PostgreSQL, and Docker as a practical engineering exercise. Building a Personal RAG Chatbot in a Few Days: Learn…
Initial impressions of Claude Fable 5 (simonwillison.net)
Initial impressions of Claude Fable 5 9th June 2026 I didn’t have early access to today’s Claude Fable 5 release, but I’ve spent the past ~5.5 hours putting it through its paces. My initial impressions are that this is something of a beast.
Qwen-Image-Flash: Beyond Objective Design (arxiv.org) discussed ↗
The prompt I used was the following: https://pastebin.com/tSR0hgTg It spun up 2 workflows to do it's magic. 30mins and 5 Million tokens later, it's verdict was: `## TL;DR` 1.