1. I built a browser-only studio for designing and orchestrating MCP agent systems for development and experimental purposes. The whole stack — tool authoring, multi-agent orchestration, RAG, code execution — runs from a single static HTML fi…

  2. Just tried something interesting — automated the process of filing multiple RTI applications using Claude Code + Playwright CLI. What normally takes a lot of repetitive manual effort (filling forms, payments, confirmations, etc.) was handl…

  3. Hello everyone, I'm a business owner (2 physical shops) and I'd like to create different "agents" that will help me with different parts of my life. For example: a "Financial Advisor" who will get a feed of all my accounting documents, bank ex…

  4. model roundup

    GPT 5.4
    32 items

    OpenAI has released GPT-5.4-Cyber for testing and claims it will compete with Claude Mythos. Meanwhile, GPT-5.4 Pro has solved the Erdős Problem #1196, showcasing its advanced capabilities in mathematics.

    model roundup

    GPT 5.5
    49 items

    On [Date], a significant leak of the OpenAI Codex model, referred to as GPT-5.5, was captured on video before it was patched. The incident involved models named Arcanine and Glacier-alpha.

  5. The superintelligence context layer between your agents and your codebase. Engram's live code knowledge graph delivers full project context in 167 ms, cuts token usage by ~60%, and replaces 20+ file-read round-trips with a single structure…

  6. Unlike LoRA and its variants, which inject trainable parameters directly into the weights of the Transformer and so require tight coupling with the backbone, ShadowPEFT instead enhances the frozen large base model by adding a lightweight, cent…

  7. Most agent failures in production aren't actually model failures. The model didn't hallucinate randomly or ignore instructions for no reason.

  8. event

    Security
    80 items

    OpenAI has released GPT-5.4-Cyber for testing as part of its Trusted Access for Cyber Defense program, aiming to compete with Anthropic's Claude Mythos in the cybersecurity domain. Meanwhile, concerns are rising over the potential risks associated with advanced AI models like Mythos, prompting calls for improved defenses before wider releases.

    model roundup

    Opus 4.7
    192 items

    Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.

  9. Recently Haiku has become a popular subject of articles and videos, owing perhaps to how close it currently is to being a daily-driver OS and fulfilling the dream that BeOS set out with. That said, there are still quite a few hurdles…

  10. I use Claude for writing and I've set up skills and system prompts to get consistent results. It works for a while but then it defaults back to its own habits.

  11. Adding the phone number normally without the trailing 0 https://preview.redd.it/b80azi3us9xg1.png?width=431&format=png&auto=webp&s=45a1cca00b928fb82270c341fb315117740219f8 Here I am adding the phone number properly as the country code alre…

  12. model roundup

    DeepSeek 4
    34 items

    model roundup

    Qwen 3.6
    153 items

    Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.

  13. I have been using a simple rule for deciding whether a task should be code, an agent, or human review: Stable rules -> code, formulas, scripts, or deterministic automation. Messy but bounded context -> agent workflow.

  14. Generated in one shot using GPT image 2!

  15. I'm working on a fairly large project with Claude Code, and one thing I'm not sure about is whether I need to have it scan/read through all the source files at the beginning of every new session before starting work. It feels inefficient t…

  16. 122 items

    Anthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.

    model roundup

    Qwen 3.5
    114 items

    Qwen3.5-9B is a post-trained model with 9 billion parameters that integrates multimodal learning and an efficient hybrid architecture for enhanced performance. Community highlights include speculative decoding on Apple Silicon boosting Qwen3.5-9B's throughput by 4.1x, and the model outperforming others in coding tasks while addressing overthinking issues through tool usage.

  17. Personal project I'm open-sourcing: LENA (Logical Execution & Navigation Assistant), a Claude Code plugin that solves a real friction point I've been hitting. The problem: Ask an AI for something simple ("fix this bug"), and it spawns in s…

  18. "Garry Tan, CEO of Y Combinator, noted that by the Winter 2025 cohort, approximately 25% of the participating startups reported that 95% or more of their entire application codebase was generated by artificial intelligence". Curious to kno…

  19. WHY AI ALIGNMENT IS ALREADY FAILING Architectures of Thought April 2026 Three recent empirical findings -- peer-preservation behavior in frontier models, accurate world modeling, and capability outside containment -- combine with one struc…

  20. 50 items

    Sam Altman, CEO of OpenAI, has faced multiple attacks on his home in San Francisco, including firebombing and drive-by shootings, raising concerns for his safety. Additionally, a majority of over 100 people interviewed by Ronan Farrow described Altman as a "pathological liar."

    event

    Copilot
    92 items

    Microsoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.

  22. GTE-Small in Go A pure Go implementation of the GTE-small text embedding model. Produces 384-dimensional, L2-normalized embeddings suitable for similarity search and clustering, ported from @antirez's C implementation.

  23. I've spent the day working with Claude, going back and forth a ton, and it gave me some solid stuff to work off of for a couple of projects. This wasn't just a short exchange; it was hours of work throughout the day.
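
The GTE-small entry above mentions 384-dimensional, L2-normalized embeddings suited to similarity search. As a rough illustration of why the normalization matters for that use, the sketch below checks that for unit-length vectors, cosine similarity reduces to a plain dot product. It is written in Python rather than Go for brevity, the vectors are random stand-ins rather than real model output, and the helper names (`l2_normalize`, `dot`, `cosine`) are hypothetical and not taken from the project.

```python
import math
import random

DIM = 384  # embedding size stated in the entry


def l2_normalize(vec):
    """Scale a vector to unit Euclidean length."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def dot(a, b):
    """Plain dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def cosine(a, b):
    """Cosine similarity computed the long way, with explicit norms."""
    return dot(a, b) / (math.sqrt(dot(a, a)) * math.sqrt(dot(b, b)))


# Random stand-ins for two embeddings (assumption: not real GTE-small output).
rng = random.Random(0)
a = l2_normalize([rng.gauss(0, 1) for _ in range(DIM)])
b = l2_normalize([rng.gauss(0, 1) for _ in range(DIM)])

# For unit vectors, the cheap dot product and the full cosine agree
# up to floating-point error.
print("dot:   ", dot(a, b))
print("cosine:", cosine(a, b))
```

In a similarity search this means ranking candidates by dot product alone is enough once every stored vector is already normalized.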