1. Are they A/B testing again?

  2. Anthropic treats safety as a model behavior problem. The last month shows it's also a product reliability, pricing, and communication problem.

  3. 49 Agents IDE: The first 2D agentic IDE. Open source.

  4. karpathy keeps a personal "llm wiki" — a markdown vault he and his llm both edit. it's basically his personal context, written down so the llm can use it.

  5. I started coding way before AI or coding agents existed. I worked at an observability company on the ingestion and query engine in Rust.

  6. Agents can't choose between structure and flexibility: Why maximizing in either direction is a failure mode. I think it’s safe to say that when the LLM hype cycle started a few years ago, no one expected one of the great debates of our time…

  7. model roundup

    Qwen 3.6
    196 items

    Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.

    65 items

    Sam Altman, CEO of OpenAI, has faced multiple attacks on his home in San Francisco, including firebombing and drive-by shootings, raising concerns for his safety. Additionally, a majority of over 100 people interviewed by Ronan Farrow described Altman as a "pathological liar."

  8. The custom AI agent space has exploded but the tools serve very different audiences. I’ve built agents on five different platforms this year across client projects.

  9. Is there any proof that scrolling ruins our attention span? Are cellphones and algorithmic media actually harming our attention span?

  10. Curious what people here are working on in terms of agents, automations, workflows, multi-agent setups, and open claw experience. I’ve been focused on building and testing different use cases and trying to see what actually works vs just th…

  11. The Full-Cycle Agentic Experience: What we're missing, and why it matters more than the models themselves. Think about the last time you bought something in a store.

  12. Just upgraded Cursor. Got a message that I needed to uninstall Microsoft remote SSH.

  13. Basically, I have a platform where I log in and need to confirm a token, received by email or phone, right after entering the login and password. So far, Claude and GPT have only automated the process up to the point where the site…

  14. model roundup

    GPT 5.5
    86 items

    On [Date], a significant leak of the OpenAI Codex model, referred to as GPT-5.5, was captured on video before it was patched. The incident involved models named Arcanine and Glacier-alpha.

    model roundup

    GPT 5.4
    33 items

    OpenAI has released GPT-5.4-Cyber for testing and claims it will compete with Claude Mythos. Meanwhile, GPT-5.4 Pro has solved the Erdős Problem #1196, showcasing its advanced capabilities in mathematics.

  15. I’ve been using Claude Design a bit. It’s cool, but I feel like there are still some features missing.

  16. Has anyone experimented with MCP for financial analysis? I've been toying with xfinlink (their free tier gives enough to get some good research done) and eodhd cz i have a premium on that and they both have been doing pretty good.

  17. Maybe someone here can answer before Cursor does, like why is there no auto/opus etc to choose from in Cursor mobile? Is it worth using this?

  18. Not sure if it's just me but I've been using Cursor pretty heavily for the past few months and something feels off. The code works, shipping is faster, but I feel like I understand my own project less than I did before.

  19. Source: Claude Code Model Configuration

  20. I wrote a 20-line script that runs at the root of your Claude file. When you’re chatting with Claude in your IDE, it automatically triggers a background n8n automation workflow to create tasks in ClickUp.

  21. model roundup

    Sonnet 4.5
    6 items

    Anthropic has kept Claude Sonnet 4.5 available after its retirement due to user demand. Meanwhile, open-source models like DeepSeek V4 are catching up in capabilities but remain several months behind closed lab versions.

  22. ChatGPT Images 2.0 Still Can’t Draw the Seven-legged Spider I Want. Whenever a new image generator comes out, I run the same test: “Please generate a spider silhouette missing its left front leg. Use an art deco style.” And every time, the…

  24. Over the past few months, we kept running into a very specific problem: if you want to track competitor prices and act on them in real time, the current workflows are broken. Prices change constantly across websites, but there’s no reliabl…

  25. After using OpenRouter for more than a year, I decided to try the Claude Max 5x plan, mostly to try Claude Design. Got my subscription on Friday afternoon, used it for 3hrs that day, 3hrs the next day and today after my first request got an erro…

  26. Executive summary: NothingHumanSearch has been crawling the web for agent-readiness signals since launch, and the numbers tell a consistent story about the gap between claiming MCP support and shipping it. As of 2026-04-27 the index holds 7…

  27. Most businesses are currently "GPT-washing"—throwing a chatbot at a problem and wondering why their workflow is still broken. In 2026, the real wins aren't in the chat; they’re in the architecture.

  28. I put together a hands-on tutorial that takes you from problem framing to fine-tuning, step by step. I decided to build a wildfire prevention system that uses satellite images and a Small Vision-Language Model (LFM2.5-VL-450M) to extract r…