1. I originally was just messing with pi-autoresearch. Gave it a sample task to build the most portable coding agent.

  2. In a federal courtroom in California on Thursday, Elon Musk testified that his own AI startup, xAI, has used OpenAI’s models to improve its own. Elon Musk confirms xAI used OpenAI’s models to train Grok He said it was “partly” true that th…

  3. Any underrated or overlooked models? FYI MiniMax-M2.7 switched their license(from MIT to Non-Commercial) so it's not in graph.

  4. I first sketched out the core of my language in C back in 2021. After finally paying off my debts, I started working on it again as a toy project, partly to study and partly to see how far I can get by working together with LLMs.

  5. AWX Shredder sits between your agents and OpenAI. Set daily spend limits per agent — we block, throttle, and alert before costs spiral.

  6. I asked Claude Code "What will it take to re-design the QNX microkernel and proc to get rid of the Big Kernel Lock?" It said "Roughly, 3 months of intensive work of a top-developer [human]". I said: "Let's get started then".

  7. Core (Adopt) Rising (Trial/Assess) Experimental (Hold/Wait) Visualizing the ecosystem of AI agents and orchestration tools. Weekly updates!

  8. I've built a system where models like Llama 3, Qwen, and Gemma play Pokémon Showdown battles autonomously. Instead of simple prompt-response, they analyze the full battle state every turn (type matchups, HP, weather, field conditions, reve…

  9. model roundup

    Opus 4.7
    243 items

    Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.

    model roundup

    Qwen 3.6
    243 items

    Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.

  10. Most agent setups follow the same pattern: one big prompt + a few tools. It works, but once you try to scale it, you get hallucinations, debugging becomes tricky making it hard to tell which part of the system actually failed.

  11. Was burning through the Claude Code weekly limit on the $20 plan by Thursday or Friday, every single week. Annoying because I had work I wanted to do and the tool was just locked.

  12. Thinking with Visual Primitives News 2026.04.30: We have released the technical report detailing our approach. In the near future, we plan to make the in-house benchmarks and a subset of our cold-start data publicly available.

  13. https://preview.redd.it/2abn96dwudyg1.png?width=1642&format=png&auto=webp&s=ab5facbd9f4223184834711346dca2bc64db20d3

  14. ;;; Continue reading More for You More for You

  15. ​ Been building a job search automation pipeline this past week and I keep going back and forth on this question. Here's what the pipeline looks like: 1) A Python + Playwright script scrapes company career pages, extracts relevant job list…

  16. Joshua Curry and Vishal Patil have seen a lot of customer service chatbots. The chat windows that pop up on your screen while visiting sites from online retailers to cell phone companies, asking what you need help with, have proliferated i…

  17. model roundup

    Gemma 4
    138 items

    Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.

    89 items

    Sam Altman, CEO of OpenAI, has faced multiple attacks on his home in San Francisco, including firebombing and drive-by shootings, raising concerns for his safety. Additionally, a majority of over 100 people interviewed by Ronan Farrow described Altman as a "pathological liar.

  18. Neural Surrogates Neural Surrogates ├── What I'm Working On: Neural Surrogates for Physics, Geometry, and Real-Time Simulation 2026-04-22 ├── Project 01: GeoPINN Demo: Solving PDEs on a Sphere 2026-04-09 ├── Project 02: WavePINN-NIF Comple…

  19. I've been using Claude to build CanvasGPT for the past 2-3 years. It's a spatial workspace where you can brainstorm, research, and ship working products.

  20. 500 Apologies, but something went wrong on our end. Refresh the page, check Medium's site status, or find something interesting to read.

  21. No amount of screenshots or answers could get ChatGPT to admit that the episode I just watched is real. Not sure if that says more about Euphoria Season 3 or ChatGPT.

  22. Auto Agent Protocol (AAP) The A2A v1.0 Automotive Retail Profile. AAP is the open A2A profile that lets AI agents discover dealerships, browse inventory, and submit consented leads through typed automotive messages riding on top of A2A v1.…

  23. Set up a small R&D project which pit different LLMs against each other in a game of Capture the Flag. Each LLM has 30 seconds to prepare any defenses and 5 minutes to capture other flags while defending their own.