1. Pylon You must construct additional pylons. Self-hosted daemon that turns events into sandboxed AI agent runs.

  2. Had 327 production traces from a restaurant-reservation agent I wanted to retrain. The plan was to fine-tune a smaller self-hostable model so I could ditch the frontier-API bill.

  3. Sparse-attention decoders rely on exact Top-K selection to choose the most important key-value entries for each query token. In long-context LLM serving, this Top-K stage runs once per decode query and becomes a meaningful latency bottlene…

  4. Artificial Intelligence: Foundations of Computational Agents, 3rd Edition This book is published by Cambridge University Press and the complete text is available here with permission of CUP. Please consider buying the book.

  5. I run Tredict, an endurance sports training platform I've been building since 2020. OpenAI opened the ChatGPT App Directory to third-party submissions in December, and the official Tredict app is now live.

  6. Apart from coding agents, are there any AI agents that are actually useful? Is it all still just hype?

  7. Qualcomm stock surged after a report from a tech analyst in Asia shed light on a potential partnership with OpenAI for a 2028 smartphone release.

  8. When LLMs Get Personal As AI answers become more personalized, do stable patterns still exist? In my last two posts, I approached the heavily polarizing topic of AI search (yes, remarkably, it’s still polarized) from two related angles.

  9. Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond Abstract World models are categorized into three capability levels and four law regimes to better understand and develop predictive environment models for AI agents across…

  10. model roundup

    GPT 5.5
    78 items

    On [Date], a significant leak of the OpenAI Codex model, referred to as GPT-5.5, was captured on video before it was patched. The incident involved models named Arcanine and Glacier-alpha.

    model roundup

    Qwen 3.6
    190 items

    Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.

  11. I genuinely want to know if anyone is able to one-shot a perfect (for you) website with ClaudeCode. In most cases I've to go back n forth 5-6 times in order to get what I want, even though I already specified same things in the original pr…

  12. Sverklo, on the record. A local-first MCP code-intelligence server, two reproducible benchmarks, and an 8-page preprint that publishes both.

  13. while people argue about ai ethics on the surface there’s a whole underground building agents that never sleep different timelines forsure which timeline are you on?

  14. hstack is a stack of tools and agent specialists built to help those who use LLMs to fight disease, including an LLM-powered "personal disease wiki" inspired by Karpathy

  15. The Self Healing Platform and the Agent Store I think DevOps as a separate identity, and a lot of agile ceremony around it, are already a bit obsolete. Engineers are doing development, operations, and lightweight management at the same tim…

  16. Google quietly dropped the Agent-to-Agent protocol in April 2025 with a blog post and a GitHub repo. Eleven months later, it has 22,700 GitHub stars, backing from over 150 organizations including AWS, Microsoft, and Salesforce, and a perma…

  17. agenv Environment manager for AI coding agents — like nvm or pyenv, but for agent accounts, config, and saved runtime args. agenv installs codex, claude, and gemini into isolated profiles and lets you pick which profile runs by default, gl…

  18. I used to run a flash games website (SWF files) years ago. I've made a few games of my own.

  19. event

    Copilot
    109 items

    Microsoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.

  20. Homescreen - Try all endpoints for free I wanted share a recent project I wanted to build a project around free-to-use data, that when brought together, enriched and made easy to use, would be valuable to people. I used Claude Code to buil…

  21. fenster Run Chrome's local Gemini Nano through a Go bridge. Chrome ships a built-in LLM (Gemini Nano, about 3B parameters, GPU-accelerated).

  22. If Claude Feels Worse, Fix Your Harness Take What Works and Leave the Rest A note before I start. In my last post I called a lot of people embarrassing for complaining that Claude is getting worse.

  23. Keywords: RL, reasoning, LLM, tool use, prompting TL;DR: We introduce the Conductor, a new kind of language model trained with reinforcement learning to automatically discover powerful coordination strategies among LLMs Abstract: Powerful…

  24. Six months ago I started a side project because Claude Code kept forgetting things I'd already explained. My architecture, the weird reason that one function exists, what broke last deploy.

  25. CNBC, CNN, and other major media sources have just reported that Meta’s acquisition of the AI startup Manus was blocked! Interestingly, I shared a survey on AI Agent platforms for knowledge workers.