Copy Has a Blind Spot. This Claude Skill Finds It (aiforcontentmarketing.ai via hn)
Ego Check is a free Claude skill that scores your copy, finds the ego moments killing conversions, and rewrites them. Setup takes 5 minutes.
datasette-agent 0.2a0 (simonwillison.net)
10th June 2026. Highlights from the release notes: - Tools can now ask the user questions mid-execution. Tools that declare a context parameter receive a ToolContext object, and await context.ask_user(...) can ask a yes/no, multiple-choice (o…
- datasette-agent 0.1a4 (simonwillison.net)
- Show HN: Datasette Agent (simonwillison.net via hn)
- datasette-agent 0.1a3 (simonwillison.net)
- datasette-agent 0.1a2 (simonwillison.net)
- datasette-agent 0.1a1 (simonwillison.net)
Welcome to the OpenAI, Anthropic, and Google price wars (sherwood.news via hn)
It’s the clearest signal yet that AI models are becoming commoditized. In a matter of days, the narrative surrounding the artificial intelligence boom has violently shifted from perfo…
ABC-Bench: An Agentic Bio-Capabilities Benchmark for Biosecurity (arxiv.org) discussed ↗
The gravity around a black hole is so extreme that nothing, not even light, can escape once it gets close enough. Astrophysicists like Chi-kwan Chan study black holes with computer simulations and observations.
Split the work to the proper models (www.reddit.com via reddit)
In my effort to get the most out of Fable, I found a great workflow that makes my limits last. Maybe you all know it, but if you don't, now you do. First, use Fable only for planning or extremely hard problems.
392 items
event
Copilot
Microsoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
- 10m Gen AI website traffic share update: OpenAI will go under 50% this year
- 2h M365 toolkit custom agent cost consumption
- 4h Cursor burned my premium requests on background agents without me noticing, building a tracker, looking for feedback
- 10h 12 months ago nobody understood why we were building Agentic SDLC. Now it feels like everyone is heading in the same direction.
- 19h Ring-0 AI Interview Copilot
76 items
model roundup
Gemma 4
Gemma 4 is a family of open-source multimodal models from Google DeepMind, including sizes up to 31B parameters and featuring Dense and Mixture-of-Experts architectures. Notable community highlights include the release of Gemma 4 12B as an encoder-free unified model for laptops, its availability via llama-server on an RTX 5070 Ti GPU, and detailed visual guides showcasing its capabilities.
- 2h Reasoning, but without actually *drafting* replies?
- 9h Is Qwen 3.6 27B IQ4XS better than Gemma 4 31B QAT as a Hermes agent?
- 11h nvidia/diffusiongemma-26B-A4B-it-NVFP4 · Hugging Face
- 12h Monitor your screen using local LLMs with only one sentence! Free, Open Source and Local.
- 15h LLMs and tabletop games
The Role of Feedback Alignment in Self-Distillation (arxiv.org) discussed ↗
Investing in multi-agent AI safety research (deepmind.google)
DiffusionGemma under real workloads feels very different from benchmark demos (www.reddit.com)
okay after testing DiffusionGemma a bit more internally we genuinely can’t tell if this is the start of something big or if everyone’s just getting distracted by crazy TPS numbers again lol but one thing that stood out REALLY fast for us w…
AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis (arxiv.org) discussed ↗
Odu: A CI runner for agents and humans (kolu.dev via hn)
Local CI built on @kolu/surface and oRPC-over-ssh: it provisions real build hosts with nothing installed, holds the pipeline as live typed state you attach to from a terminal, and lets your coding ag…
Steganography Without Modification: Hidden Communication via LLM Seeds (arxiv.org) discussed ↗
Account switching between work and personal (www.reddit.com via reddit)
Is there a fast way on Windows to change account in Claude Desktop and in Claude Code in Terminal between my accounts? I have one from my team subscription and one from my personal account.
347 items
event
Anthropic Mythos
Anthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
51 items
event
DeepMind
Google DeepMind has released "Deep Research Max," advancing autonomous research agents, while also facing challenges and competition from other AI companies like Anthropic and Ineffable Intelligence. Meanwhile, DeepMind workers in the UK have voted to unionize, and former DeepMind architect Demis Hassabis is at the center of legal drama involving Elon Musk.
- 3h Google DeepMind is worried about what happens when millions of agents start to interact
- 16h Show HN: Magenta Real-Time Music Generation on iPhone, Without the GPU
- 1d The Great Reframing...
- 1d Show HN: VQAScore – open eval metric/reward model, now for text-to-video
- 6d Inside Google DeepMind: Reasoning, Omni, and Shipping Frontier AI
Breaking the Ice: Analyzing Cold Start Latency in vLLM (arxiv.org) discussed ↗
llm 0.32a3 (simonwillison.net)
9th June 2026. Almost entirely written by the new Claude Fable 5, see my write-up for more details.
What's the Best Cloud Agent you've actually used in production? (www.reddit.com via reddit)
TripoSplat Generate 3D models from a single image I asked a coding agent to build a beautiful website showcasing the monuments of Paris as 3D Gaussian splats. I never opened an image generator.
Qwen-Image-Flash: Beyond Objective Design (arxiv.org) discussed ↗
Initial impressions of Claude Fable 5 (simonwillison.net)
9th June 2026. I didn’t have early access to today’s Claude Fable 5 release, but I’ve spent the past ~5.5 hours putting it through its paces. My initial impressions are that this is something of a beast.
The hard part of self-maintaining context for analytics agents (blog.getcassis.com via hn)
Context is now table stakes for analytics agents. The new problem is keeping it true as the business keeps moving.
Show HN: Lumen–free Real-time LLM token and cost monitor (github.com via hn)
- Anthropic Walks Back Policy That Could Have 'Sabotaged' Researchers Using Claude (www.wired.com via hn)
Key Takeaways - OpenAI filed confidentially for an IPO at an $852 billion valuation, just days after rival Anthropic launched its own IPO process - The ChatGPT maker is reportedly weighing token price cuts as competition for enterprise AI…