Sharing this because I know a lot of people here are doing what I did. My old workflow was a long process.
Automation browser (www.reddit.com)
Hi! In my daily work, I have to check more than 12 different systems.
Most people don’t need agents. They need cleaner workflows. (www.reddit.com)
Something I keep noticing after building a bunch of these systems: people jump to agents way too early. They see a messy process and think, "OK, let’s add an agent to handle it," but the process itself was never clearly defined in the first plac…
Retrospective on Black and White and its connection to Google DeepMind (www.eurogamer.net via hn)
Black & White at 25: how Lionhead's harebrained, stoner-powered game design became the harbinger of modern AI Peter Molyneux, Google DeepMind's Richard Evans and more on a "groundbreaking" game's creation and legacy. Your disembodied hand…
sectorllm The world's smallest llama2 inference engine A complete Llama2 inference engine that fits in 1356 bytes of x86 real mode assembly. It boots directly from disk, loads a quantized model, and generates text before any operating syst…
Amazon rolls out Claude Code and Codex internally (www.businessinsider.com via hn)
- Amazon formally adopts Claude Code and Codex company-wide, expanding access to AI tools beyond Kiro. - Amazon is a close partner with Anthropic and OpenAI, having invested billions in both AI labs.
82 items
model roundup
Opus 4.6
Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.
- 9m DeepSeek V4 Pro matches GPT-5.2 on FoodTruck Bench, our agentic benchmark — 10 weeks later, ~17× cheaper
- 8h Show HN: Dust3D 1.0 – low-poly 3D modeling tool (10 years in the making)
- 11h Using Claude-4.6-Sonnet and Opus 4.6 in a multi-agent "Code Review Swarm" (Visual Sandbox) - try in minutes!
- 20h Free Trial: Gemini 3.1 Pro & Opus 4.6 API Access via My Wrapper
- 22h Open source models are going to be the future on Cursor, OpenCode etc.
132 items
model roundup
Qwen 3.5
Qwen3.5-9B is a post-trained model with 9 billion parameters that integrates multimodal learning and efficient hybrid architecture for enhanced performance. Community highlights include speculative decoding on Apple Silicon boosting Qwen3.5-9B's throughput by 4.1x, and the model outperforming others in coding tasks while addressing overthinking issues through tool usage.
- 1h As MTP prepares to land in llama.cpp, Models that support MTP
- 6h vLLM Just Merged TurboQuant Fix for Qwen 3.5+
- 13h A simple "hack" to speed up prompt processing for Qwen 3.5/3.6 in LM Studio
- 14h APEX MoE quants update: 25+ new models since the Qwen 3.5 post + new I-Nano tier
- 1d Mistral Medium 3.5 128B and Qwen 3.5 122B A10B on 4x RTX 3080 20GB
AI Agent Tools for Customer Support (Honest notes) (www.reddit.com)
We’ve been testing a few AI agent tools for support use cases (not just chatbots, but ones that can actually take actions). Here’s a quick roundup: OpenAI Agents: Super flexible, but needs heavy setup SparrowDesk (Zoona AI agents: More str…
Claude code agentic framework (www.reddit.com)
Hi guys, is there a low-code, UI-based agent builder offered by Claude for building agents?
- Claude code (www.reddit.com)
Got tired of clunky extensions for pdf from ChatGPT Export (getchatcache.com via hn)
Fast one-click PDF export Hover the toolbar, select PDF, and click Export. ChatCache turns the current ChatGPT conversation into a clean PDF fast.
random idea on agents (www.reddit.com)
can I just build harnesses for different use cases like crms, video editing, company analysis for vcs, etc, and sell them as custom solutions? i have some examples.
Anthropic quietly nerfed Claude Code's 1-hour cache (www.xda-developers.com via hn)
Claude Code has become the default agentic coding tool for a lot of developers, and for good reason. It understands a codebase, calls tools, edits files, and can plan multi-step tasks with very little handholding.
Train Your Own LLM from Scratch (github.com via hn)
Train Your Own LLM From Scratch A hands-on workshop where you write every piece of a GPT training pipeline yourself, understanding what each component does and why. Andrej Karpathy's nanoGPT was my first real exposure to LLMs and transform…
- Train a LLM from Scratch (github.com via hn)
293 items
model roundup
Qwen 3.6
Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 1h Qwen3.6 27B FP8 runs with 200k tokens of BF16 KV cache at 80 TPS on a single RTX 5000 PRO 48GB
- 2h qwen 3.6 27B looping problem
- 5h Mtplx – 2.24x faster TPS – The native MTP inference engine for Apple Silicon
- 6h How do you estimate total memory usage?
- 10h Best Llama Config for Turboquant_Plus? (Stats below)
13 items
event
Function Calling
Recent evaluations show that smaller models like Gemma 4 E2B outperform larger siblings in multi-turn tasks. Meanwhile, function calling capabilities are being enhanced across various AI platforms, including Qwen and Claude, with new search engines and defense mechanisms also emerging to support these advancements.
- 1h Qwen Meetup Presentation: Function Calling Harness, from 6.75% to 100%
- 2d Qwen Meetup Draft Review Required (Function Calling Harness 2 - CoT Compliance from 9.91% to 100%)
- 5d Multi-agent in production: real win or just hype?
- 5d Learn, run and test Agentic AI on your browser for free! (Built with Claude Opus 4.7 in 2 days)
- 6d Show HN: I built a search engine for llms.txt sites
US GUARD Act: Age Verification for AI Chatbots (www.congress.gov via reddit)
There's been a growing number of AI regulation proposals I've been seeing in the US, and this bill in particular came to my attention today after seeing this article. The bill (which has just been "unanimously advanced to the Senate floor"…
Gemini thought it was ChatGPT. (www.reddit.com)
It's in Spanish, sorry about that, but it felt so wrong... I asked for pricing and it told me OpenAI pricing!
- Gemini Vs Chatgpt (www.reddit.com)
OpenAI can't build working RSS feeds (openai.com via hn)
https://openai.com/index/openai-pwc-finance-collaboration Mon, 04 May 2026 21:00:00 GMT https://openai.com/index/delivering-low-latency-voice-ai-at-scale Mon, 04 May 2026 00:00:00 GMT https://openai.com/index/advanced-account-security Thu,…
Claude Design Bricked with Unconditional Drop Overload error (www.reddit.com)
https://preview.redd.it/nomyn2y6t8zg1.png?width=741&format=png&auto=webp&s=30b07d9ec175f7e647be3cbdd68e8b635ddaf0bd I lost my design work because of this error 5 minutes ago. It happened instantly but I don't know why or how.
- Unconditional drop overload - Claude Design (www.reddit.com)
OpenAI Raises $4B for 'The Deployment Company' to Help Businesses Leverage AI (officechai.com via hn)
The world’s top AI model startups aren’t just focused on building the best models — they’re also looking to take a part of the gains from deploying these models at companies around the world. OpenAI has closed more than $4 billion for a ne…
Peanut - Text to Image Model (Open Weights coming soon) (www.reddit.com)
A new anonymous model debuts at #8 in the Artificial Analysis Text to Image Arena! Peanut’s weights are expected to be released soon, which would make it the leading Text to Image Open Weights Model.
119 items
event
Altman Attack
Sam Altman, CEO of OpenAI, has faced multiple attacks on his home in San Francisco, including firebombing and drive-by shootings, raising concerns for his safety. Additionally, a majority of over 100 people interviewed by Ronan Farrow described Altman as a "pathological liar."
I'm the paper author. Disclosure up front per sub policy.
unconditional drop overload (www.reddit.com)
Does anyone know why this error is happening? I was working on my first designs in Claude Design—the page was ready, but when I asked for some modifications, it broke.
When Claude tells you to "stop spiraling and go to bed" (www.reddit.com)
From fabian on 𝕏: https://x.com/fabianstelzer/status/2051260931758272863
Ask HN: Why would we care about "extended time horizons" and LLMs? (news.ycombinator.com)
Is it more impressive to take longer to answer 2 + 2? It’s not.
An LLM agent that runs on any Linux box (getclaw.site via hn)
A single shell script that gives you a full LLM agent — streaming chat, shell tool calls, rolling memory, and mentor mode against OpenAI or Anthropic. No Node.
Got this absolute gem of a response from Claude (www.reddit.com)
Since when did Claude have a JD 😭 PS: I did add an instruction for it to act as a software development advisor.
How do you use common skills in an organization (www.reddit.com)
Hi all, we use Claude Enterprise and want to have a common skills repo. How do you all use common skills within a repo?