1. I built a small open-source verifier for graph-backed AI memory and GraphRAG-style systems. The basic problem: graph databases can validate schema, but they usually do not know whether two accepted facts contradict each other.
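
A minimal sketch of the kind of check the post describes, not the author's verifier: it assumes facts are stored as (subject, predicate, object) triples and that some predicates are "functional", meaning a subject may have only one object for them. The predicate names and the `find_contradictions` helper are hypothetical, chosen only for illustration.

```python
from collections import defaultdict

# Hypothetical rule set: predicates that may hold at most one object per subject.
FUNCTIONAL_PREDICATES = {"born_in", "capital_of"}

def find_contradictions(triples, functional=FUNCTIONAL_PREDICATES):
    """Return (subject, predicate, objects) for every functional predicate
    that has more than one distinct object for the same subject."""
    seen = defaultdict(set)
    for subj, pred, obj in triples:
        if pred in functional:
            seen[(subj, pred)].add(obj)
    return [
        (subj, pred, sorted(objs))
        for (subj, pred), objs in sorted(seen.items())
        if len(objs) > 1
    ]

# Every triple below is schema-valid, but the two "born_in" facts conflict.
facts = [
    ("alice", "born_in", "Paris"),
    ("alice", "born_in", "Lyon"),
    ("alice", "likes", "tea"),
    ("alice", "likes", "coffee"),
]
conflicts = find_contradictions(facts)
print(conflicts)  # [('alice', 'born_in', ['Lyon', 'Paris'])]
```

The two `likes` facts pass because `likes` is not declared functional; only the rule set, not the schema, tells the two cases apart.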

  2. https://claude.ai/share/12659fcf-c1c8-4bbb-bc45-b41b26cd8b69

  3. Physical Ant: video of a learned behaviour running directly on hardware.

  4. A critical flaw in Anthropic’s “Claude in Chrome” browser extension allows any Chrome extension, even one with zero permissions, to hijack Claude’s AI capabilities and perform sensitive actions on behalf of users. The issue, discovered by…

  5. model roundup

    Opus 4.6
    90 items

    Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.

    model roundup

    MiniMax 2.7
    3 items

    MiniMax-M2.7 is a large language model from MiniMaxAI capable of complex agent tasks and self-evolution, achieving a 66.6% medal rate on MLE Bench Lite, second only to Opus-4.6 and GPT-5.4. Community members have shared success in running the model with up to 100k context tokens and noted its potential for real-world applications like software engineering.

  6. I had been looking to try language models under 500M parameters, but I couldn't find an API for them anywhere, so I built this Cloudflare-hosted static website that hosts the weights, along with an inference runtime for these models that uses WebG…

  7. I'm making a new video game every day as a hobby project, but I'm vibe coding it and writing nearly zero lines of code myself (even though I could, I'm a senior SWE). Today it's an original math game, Primetime where you click the non-prim…

  8. I code on Claude's pay as you go API and started budgeting my token usage with my own CLI wrapper. Basically I set a budget for a task that I'm working on in my project: Task: "Fix mobile responsiveness" Budget: $3.00 and the budget update…
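
A rough sketch of per-task budget tracking along the lines the post describes. It is an assumption-laden illustration, not the author's CLI wrapper: the `TaskBudget` class and the per-million-token prices passed to `record` are hypothetical placeholders, not actual API pricing.

```python
from dataclasses import dataclass

@dataclass
class TaskBudget:
    """Tracks spend against a dollar budget for one task (hypothetical helper)."""
    name: str
    limit_usd: float
    spent_usd: float = 0.0

    def record(self, input_tokens, output_tokens, in_price_per_mtok, out_price_per_mtok):
        """Add the cost of one API call; prices are dollars per million tokens."""
        cost = (input_tokens * in_price_per_mtok
                + output_tokens * out_price_per_mtok) / 1_000_000
        self.spent_usd += cost
        return cost

    @property
    def remaining_usd(self):
        return self.limit_usd - self.spent_usd

    @property
    def exhausted(self):
        return self.spent_usd >= self.limit_usd

budget = TaskBudget("Fix mobile responsiveness", limit_usd=3.00)
# Placeholder prices, chosen only to make the arithmetic easy to follow.
cost = budget.record(100_000, 20_000, in_price_per_mtok=3.0, out_price_per_mtok=15.0)
print(f"{budget.name}: spent ${budget.spent_usd:.2f}, remaining ${budget.remaining_usd:.2f}")
```

In practice the token counts would come from the usage figures returned with each API response, and the wrapper would stop or warn once `exhausted` is true.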

  9. Claude runs out of its limit when I ask it to do a lengthy task in a single prompt. It also runs out of its limit if I split the same task across several prompts.

  10. model roundup

    Sonnet 4.5
    6 items

    On May 4, 2026, multiple automated status updates reported elevated errors for Claude Opus 4.5 and Sonnet 4.5 around the same time, with Anthropic introducing a feature called E-STEER that applies emotion intervention to these models.

    model roundup

    Gemma 4
    162 items

    Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.

  11. I’m trying to understand a recurring problem in the AI agent space. A lot of people are interested in agents.

  12. A Unified Virtual Filesystem Workspace: a simulated environment where AI agents reach all their data through one filesystem and bash. Read the docs. npm install @struktoai/mirage-node # Node, servers, CLIs npm install @struktoai/mirage-browser #…

  13. Documentation Index Fetch the complete documentation index at: https://code.claude.com/docs/llms.txt Use this file to discover all available pages before exploring further. Overview Claude Code features native sandboxing to provide a more…

  14. The Gemini Protocol in 2026: growing, but still not setting the Internet aflame Note This article is about Gemini, the HTTP-like Internet protocol for document browsing, and not the large language model or the cryptocurrency of the same na…

  15. 181 items

    Anthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.

    model roundup

    Qwen 3.5
    140 items

    Qwen3.5-9B is a post-trained model with 9 billion parameters that integrates multimodal learning and efficient hybrid architecture for enhanced performance. Community highlights include speculative decoding on Apple Silicon boosting Qwen3.5-9B's throughput by 4.1x, and the model outperforming others in coding tasks while addressing overthinking issues through tool usage.

  16. I’m looking for something that can: - chat with me (Telegram/WhatsApp/etc) - manage tasks/workflows - maybe access tools like Gmail, Calendar, Docs, GitHub - possibly code/automate things But without the huge security risks of setups like…

  17. I've been having issues lately with Claude completely ignoring certain instructions in CLAUDE.md. I did some digging and found something interesting with the claude cli harness and I'm curious if anyone else has come across this.

  18. Is Claude changing your daily habits with the dumb "5 hours from start" stuff? Today I found myself waking up, asking Haiku something basic (what day it was) so the clock starts.

  19. Hey bros, I have around 80% of my tokens left for the week. If anyone needs them, or can suggest what I could do with them, that would be helpful.

  20. event

    Deepmind
    24 items

    Google DeepMind has released "Deep Research Max," advancing autonomous research agents, while also facing challenges and competition from other AI companies like Anthropic and Ineffable Intelligence. Meanwhile, DeepMind workers in the UK have voted to unionize, and DeepMind co-founder Demis Hassabis is at the center of legal drama involving Elon Musk.

  21. If you use AI agents or know people who do, AgentVet might be worth checking out. It is a community-driven site where users rate and review AI agents, the idea is to help people cut through the noise and find the right tool for their actua…

  22. mlx-code A lightweight coding agent for Mac, built on Apple's MLX framework. Fast local inference, built-in prompt caching, robust tool-calling.

  23. Hey all, just wanted to point you to an open-source, TTY-based IDE with a single-line installer called LOOM. I built this editor (well, Claude and I did) specifically for coding alongside an agent in cloud-VM-based environments.

  24. OpenAI Trial Highlights Criticism of Musk, Altman Management Styles - Bloomberg

  25. Wind patterns, Koppen classification, anthropology, map design, and effects of geography on people, it can do it all with proper research on wind and climate patterns.

  26. Hi guys, I've been using Claude design and finally got the beautiful design I wanted after 5 hours of going back and forth. Then I started a new chat within the same project and it completely overwrote the previous desig…

  27. I have been building Dunetrace, an open-source real-time monitoring tool for your production agents. The latest update adds cross-agent pattern analysis.