1. My last observation re: Anthropic’s secret sabotage safety policy is that it undermines actually good safety policy. How?

  2. AI is giving organizations a new capacity to act. Work that once waited for scarce time or expertise can increasingly move forward with AI.

  3. When large language models (LLMs) fail to generalize or make haphazard errors in reasoning, it is often taken as evidence that LLMs are not truly reasoning, but rather performing a kind of pattern matching. The implication is that people's…

  4. https://github.com/DanMcInerney/architect-loop Fable absolutely rules, but the load-bearing work of coding agents is in the design and the review, not the actual coding. So this is two skills: /architect uses Fable as the orchestrator and…

  5. Claude Fable is relentlessly proactive 11th June 2026 After two days of experience with Claude Fable 5 I think the best way to describe it is relentlessly proactive. It knows a whole lot of tricks and it will deploy pretty much any of them…

  6. model roundup

    GPT 5.5
    197 items

    On [Date], a significant leak of the OpenAI Codex model, referred to as GPT-5.5, was captured on video before it was patched. The incident involved models named Arcanine and Glacier-alpha.

    model roundup

    Opus 4.8
    132 items

    Claude AI has released Opus 4.8, an upgrade to their Opus class of models available in version 2.1.154 of their software on March 16, 2023, which includes enhanced coding and professional task capabilities along with improved judgment and honesty. Users are reporting usage resets following the update.

  7. Last year, a 24-year-old Canadian woman was in a mental health crisis and turned to ChatGPT for help. Hours later, that woman, Alice Carrier, took her own life.

  8. Rubric Agent behavior testing for LLM apps. Test what your agent did — tools called, arguments, trace, latency — not just what it said.

  9. 10th June 2026 Highlights from the release notes: - Tools can now ask the user questions mid-execution. Tools that declare a context parameter receive a ToolContext object, and await context.ask_user(...) can ask a yes/no, multiple-choice (o…

  10. 360 items

    Anthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.

    model roundup

    Gemma 4
    77 items

    Gemma 4 is a family of open-source multimodal models from Google DeepMind, including sizes up to 31B parameters and featuring Dense and Mixture-of-Experts architectures. Notable community highlights include the release of Gemma 4 12B as an encoder-free unified model for laptops, its availability via llama-server on an RTX 5070 Ti GPU, and detailed visual guides showcasing its capabilities.

  11. partnerships lead at a series A SaaS. ~1 cold partner outreach per week, ~3 follow-ups per week.

  12. 9th June 2026 Almost entirely written by the new Claude Fable 5, see my write-up for more details. Recent articles - Initial impressions of Claude Fable 5 - 9th June 2026 - Running Python code in a sandbox with MicroPython and WASM - 6th J…

  13. part-time B2B consultant, 8 active clients. context is my entire job.

  14. event

    DeepMind
    51 items

    Google DeepMind has released "Deep Research Max," advancing autonomous research agents, while also facing challenges and competition from other AI companies like Anthropic and Ineffable Intelligence. Meanwhile, DeepMind workers in the UK have voted to unionize, and DeepMind co-founder Demis Hassabis is at the center of legal drama involving Elon Musk.

  15. PULSE8.ai Cortex Agent-native knowledge OS built on Markdown PULSE8.ai Cortex is an agent-native knowledge OS built on Markdown. It gives AI agents and humans a shared vault backed by a typed knowledge graph, full-text search, and a MarkIt…

  16. Agent quality lives below the model: in the loop, the context engine, the tool surface, the safety stack, and the evaluator. A survey of harness engineering as of mid-2026, introducing GROOM, an open-source self-maintaining knowledge harne…

  17. TripoSplat Generate 3D models from a single image I asked a coding agent to build a beautiful website showcasing the monuments of Paris as 3D Gaussian splats. I never opened an image generator.

  18. With the release of Fable and the rug pull coming on the 22nd, I got to thinking: how can I maximise its usage?