1. Why I Built Another coding agent harness?: https://dev.to/patriceckhart/zot-why-i-built-another-coding-... Github Repo: https://github.com/patriceckhart/zot

  2. Built ProxVanta over a few weekends after running into the same problem over and over: useful AI context ends up scattered everywhere. Some in GitHub, some in Slack, some in docs, some in people’s heads, and some via posts from people tell…

  3. I love crowdstrike, its amazing. However, its Linux agent isn't the best.

  4. Pytest-style behavioral regression testing for AI agents. AgentCheck AgentCheck is pytest for AI agents.

  5. I recently contributed an experimental HFQ4-G256 MMQ prefill path to hipfire, an RDNA-focused LLM inference engine. Disclaimer: I authored the PR, so this is partly a contribution note, but I am mainly looking for independent validation fr…

  6. Long story short, in class I'm always searching the web for new websites and games and even when I do find one it's always full of lag and ads. So, I decided to vibe code my own website.

  7. model roundup

    Gemma 4
    124 items

    Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.

    event

    Cowork
    120 items

    Issues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.

  8. I kept losing my reading flow every time I hit an unfamiliar word. The usual fix: open a new tab, search, scroll past ads, come back.

  9. Been trying to incorporate AI agents into my day-to-day for a few months now and I keep hitting the same wall. Most demos look great but when I try to plug agents into a real workflow, the friction adds up fast.

  10. OpenAI may be preparing to take its biggest leap yet, moving beyond software into hardware with a smartphone designed around AI agents instead of traditional apps. According to a new note from well-known analyst Ming-Chi Kuo, the company i…

  11. For months I defaulted to Opus for anything complex. Sonnet felt like a gamble, sometimes great, sometimes it would confidently build the wrong thing and I'd spend an hour unwinding it.

  12. What Claude Shannon Knew In 1950 That We’re Pretending Is New AI didn’t arrive yesterday; it just changed its outfit Every era gets its favorite tech panic. Ours, apparently, is watching a chatbot say something polished, half-right, and fa…

  13. Hey Everyone, Over the last few months, I noticed a massive gap in how we learn about Agentic AI. There are a million theoretical blog posts and dense whitepapers on RAG, tool calling, and swarms, but almost nowhere to just sit down, run a…

  14. model roundup

    Opus 4.6
    76 items

    Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.

    model roundup

    Qwen 3.6
    203 items

    Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.

  15. https://preview.redd.it/j6423wfihvxg1.png?width=482&format=png&auto=webp&s=cbc433e8a96502fd370f020c551b39d06b893aa6

  16. I've been using Claude basically since it launched, and use Claude Code extensively (Swift, C++, Shaders, TS, AWS, etc)... Maybe this is just tech twitter / LinkedIn garbage, but how on earth are people using so many tokens...

  17. I wanted to see who was going to the code w/ Claude extended. I live in the Midwest and wanted to hear if others were flying in to see this.

  18. could not extract summary

  19. 1386.ai.rocm This is a fork of 1386.ai ported to ROCm, targeting specifically the AMD Strix Halo APU but compatible with any ROCm-supported hardware. I found this repo through a Reddit post where the author (@eb1386) nonchalantly announced…

  20. A lot of agent frameworks quietly assume this loop is safe: model answers model critiques itself model revises output improves The uncomfortable part is that unconditional self-correction often degrades correct answers more than it repairs…

  21. model roundup

    Sonnet 4.6
    51 items

    Sonnet 4.6, a new release noted for its "unhinged" behavior, has sparked discussions among users about unexpected changes in software performance and cost management strategies involving Cursor and Claude APIs.

  22. Markdown is clearly a wave now. It is good enough for AI who can read content structure without wasting tokens.

  23. Bit of context. Over the last couple of years I've shipped automation projects for around 30 professional services founders.

  24. Hello all, I would like to automate the Git PR review process as much as possible in my company. I found several possible approaches online, but I am still missing a clear best-practice recommendation.

  25. I can't be the only one having trouble using the Claude on chrome mcp right? It worked well for like a week and then suddenly Claude can't use chrome anymore.

  26. Hey everyone, I’ve been diving deep into the Claude Code CLI and I’m hitting a bit of a wall with session management vs. agent identity.

  27. I'm a big user of Codex and Claude Code in the terminal. However after a big brainstorming and planning session I was finding myself with lots of comments and questions about difference places in the plan file.

  28. My inbox was filling up with spam and I kept putting off going through it for too long. So I vibe coded a small workflow that handles most of it for me.