1. Created separate private API keys for each service within LiteLLM and started logging the usage via Prometheus to view in Grafana. Surprised how quickly the Frigate GenAI summary tokens add up!

  2. I run a roofing and solar company in the US. Most of my leads come in over text - at a certain point manually tracking and replying to all of it became too much, plus I wanted to start running outbound campaigns to land more jobs.

  3. You want to build a Teams agent. Maybe it answers customer questions from a knowledge base.

  4. Alphadidactic: An iteration research agent: searches academic research, applies it to time series data, and probes it to find novel discoveries. Claude Code instructions, not hand-written strategies, build, verify, and optimize quantitative e…

  5. model roundup

    Gemma 4
    133 items

    Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.

    model roundup

    Sonnet 4.6
    57 items

    Sonnet 4.6, a new release noted for its "unhinged" behavior, has sparked discussions among users about unexpected changes in software performance and cost management strategies involving Cursor and Claude APIs.

  6. Creating a Dashboard with Claude Design: It's not the most glamorous design task, but it turns out to be a pretty good one for the current level of capabilities. Claude Design dropped last week, in case you’re not already aware (though that…

  7. Over the past week I’ve watched three things happen: - Someone discovered an open-source LLM Wiki desktop app that actually turns your notes into a linked knowledge base instead of just filing them. - People started combining the LLM Wiki…

  8. lunel is a free and open source app that lets you code from your phone with real dev tools and ai agents like Codex, Claude Code, and OpenCode. You get: - ai agents - code editor with file explorer - built-in browser with devtools - real te…

  9. Lovable ($400m ARR, 200k projects built per day) opened our first US hub in Boston, and we're looking for a highly skilled GTM Engineer to be the founding technical member of our enterprise GTM function there. You'll build scalable agents,…

  10. model roundup

    GPT 5.5
    99 items

    A significant leak of the OpenAI Codex model, referred to as GPT-5.5, was captured on video before it was patched. The incident involved models named Arcanine and Glacier-alpha.

    model roundup

    GLM 5
    4 items

    GLM-5 is a large language model with 744B parameters, an increase from GLM-4.5's 355B parameters, and it integrates DeepSeek Sparse Attention to enhance efficiency. Notably, community members are exploring its use for fine-tuning smaller models and discussing its relevance in the context of influential AI companies.

  12. Hey HN, We wanted to share a new tool we’ve been working on. Even when documentation is well-structured, sometimes it’s hard to find what you need.

  13. howdy y'all, i've been deep in jj for a while and been experimenting with jj workspaces for parallel workflows. it's more intuitive than git worktrees but it still has a couple of gotchas that have been a hindrance to my ideal workflow.

  14. I benchmarked caveman against two words: Caveman, a popular Claude Code compression plugin, vs. "be brief." 24 prompts, six categories, five arms.

  15. model roundup

    GLM 5.1
    7 items

    GLM-5.1 is a next-generation model with enhanced coding capabilities, achieving state-of-the-art performance on SWE-Bench Pro and leading GLM-5 by a wide margin in repo generation and real-world terminal tasks. Community reports highlight its impressive speed, with 40 tps and over 2000 pp/s on stable setups, though some users are experimenting with hardware optimizations for better performance.

    model roundup

    Qwen 3.6
    227 items

    Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.

  16. Claude Code plugin so the LLM never sees your API keys https://github.com/hsperus/claude-vault

  17. I have Claude Pro. 20 min ago I logged in and all my chats had disappeared. My projects are still there, but only the names; all the chats and files are gone. I contacted the Fin AI agent, which didn't help and just told me to check if I'm in the correct ac…

  18. Agent tools seemingly know how to work with browsers better than with iPhone simulators, so I built this tool to capture the simulator XPC stream and render it in a webpage. This means Claude Code/Codex desktop apps can use their existing…

  19. AI Job Searcher: A personal AI agent for job search. One engine, many profiles.

  20. 139 items

    Anthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.

  21. I use Claude.md the way most of us probably do, as a general reference to projects, standards, guidelines, etc. The problem is, as projects grow in complexity & size, it starts to get unwieldy.

  22. Datapoint MCP: Get real human opinions from inside any MCP client. Run surveys, A/B preference comparisons, ratings, and rankings on text, images, audio, and video, without leaving your editor.

  23. could not extract summary

  24. This chapter presents the Neem Project, a research project that integrates intelligent agents and virtual participants into a distributed meeting environment. The agents incorporate knowledge about different aspects of a “good”…

  25. I've shipped ~62 browser-based free tools in about 30 days. Not vibe-coded landing pages or one-offs — structured, SEO-ready, deployed tools with real FAQs, proper meta tags, and working core functionality that capture real traffic.

  26. Asked Claude to answer an old riddle and got this bizarre output.

  27. Over the past several weeks, I've been working on HyperResearch, a Claude Code skill harness that converts CC into the most intelligent deep research framework out there. HyperResearch surpasses OpenAI, Google, and NVIDIA's offerings in th…