1. Notes on DeepSeek: We visited the company HQ last Tuesday. It was founded in 2023 by Liang Wenfeng and operated out of his hedge fund, High-Flyer, until somewhat recently.

  2. The top scanners disagreed 64% of the time and agreed no better than a coin flip. A data study.

  3. Outcomes over output More code is not more value. We measure what ships to users, not what ships to the merge queue.

  4. Given the increased use of AI, my experience is that teammates are moving so fast churning out so many changes that it is nigh impossible to review it all. I can't even keep up with the code being generated by my own use of LLMs at times.

  5. Faster inference won't save you Cutting agent latency with a distributed event log We went into Graphcoder assuming agent latency was mainly an inference problem. That lasted until we watched real sessions run.

  6. The dominant use of AI in 2026 is a coding agent—even though almost none of the people using AI think of themselves as programmers, and almost none of them ever see a line of code. This shift is invisible to users, but it is breaking the i…

  7. So what is a good tool to get detailed metrics locally for claude-code ? Preferably something with a UI.

  8. event

    Copilot
    387 items

    Microsoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.

    event

    Cowork
    347 items

    Issues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.

  9. After too many debugging sessions where I had no idea what my agent remembered or why it made a decision — I got frustrated and built something. notmemory is an open-source Python SDK that gives AI agents auditable, reversible memory.

  10. could not extract summary

  11. Hi everyone, as a college student, I’ve had a free Gemini Pro account (the €20 version) since October 2025, so I’ve always used Gemini, NotebookLM, and the entire Google suite for my studies. I’m a master’s student in data science, and in…

  12. our workplace LLM mass delusion I can't help but wonder whether we will look back on this AI hype in the workplace with confusion and embarrassment. If we indeed progress into a future where the bubble will burst, models will further close…

  13. Hey guys, To those of you who are using Claude code on terminal, does switching to Fable 5 really exhaust your 5 hour limit quickly? I've been using it on the terminal since yesterday and I'm getting the same usage as Opus!

  14. / San Francisco / I had Claude Code pry its own deep-research workflow out of its binary, then pointed that workflow at a question about itself. The verdict: it searches wide and never doubles back.

  15. 6 min read Just now The agent’s diff was clean. Three files, all tests green, nothing obviously wrong in the PR.

  16. model roundup

    Sonnet 4.6
    9 items

    Several updates and comparisons revolved around Sonnet 4.6, including its performance in dashboard analytics alongside Opus 4.8, and its role in processing critical requirements for a benchmark test with Gemma 4.31B QAT.

    model roundup

    GPT 5.5
    183 items

    On [Date], a significant leak of the OpenAI Codex model, referred to as GPT-5.5, was captured on video before it was patched. The incident involved models named Arcanine and Glacier-alpha.

  17. looking for how to use claude for data engineering. any suggestions?

  18. This is my harness. There are many like it, but this one is mine.

  19. Yesterday I took Fabre for a ride into the writing project i had worked with both sonnet and opus for the past few months. Although there is a bit of actual writing here and there, it is mostly world building at this stage: dozens of diffe…

  20. We're building an AI healthcare receptionist platform that handles inbound patient calls, patient verification, appointment booking/rescheduling/cancellation, clinic FAQs, human call transfers, SMS notifications, call recordings, transcrip…

  21. Claude Fable 5 played a full chess game on lichess using only screenshots and mouse clicks — no chess API, no DOM access for moves. It checkmated Stockfish in 18 moves.

  22. I am not asking for you to fix my account. In https://support.claude.com/en/articles/8241253-safeguards-warnings-and-appeals ther'es an appeal form button which just links to https://claude.ai/restricted , I have tried contacting support a…

  23. Why LLMs (still) lack taste Frontier LLMs are really smart, and they’re becoming particularly good at software development. It feels like every week there’s a new model release that achieves SOTA scores on a handful of benchmarks.

  24. Embed untrusted Lua in your Elixir app: AI agent tools, user formulas, per-tenant plugins. Pure BEAM, sandboxed by default, zero NIFs.

  25. could not extract summary