Vision agents vs. structured APIs on the same internal tool task (news.ycombinator.com)
Vision agents (browser-use, computer-use) are the default for letting AI agents operate web apps without APIs. Writing an MCP or REST API per app is the alternative, but every app needs its own.
Why Claude needs many tries to get something done right (correct)? (www.reddit.com)
Every time I ask Claude to check a document or plan or strategize or update one document based on the other document, it gives just an ok-ok result first time (not 100% correct). Then I ask it to recheck because of issues and missing info,…
Long-Running Agents (addyo.substack.com via hn)
Long-running Agents What changes when your agent runs for days instead of turns? A long-running AI agent can keep making progress over hours, days, or weeks.
Anaconda acquires Outerbounds to rein in the buggy code AI agents keep shipping (thenewstack.io via hn)
Anaconda acquires Outerbounds to rein in the buggy code AI agents keep shipping Anaconda, which provides an AI-native dev platform, is acquiring Outerbounds, the company behind Metaflow, an open source AI/ML orchestration framework that or…
Has anyone experimented with the new agent mode? (www.reddit.com)
I see there are chief of staff or sales agents I can use, is it safe to link my email to chat gpt or should I create a seperate one for them to use? Has anyone used them and had much luck or help using the agents?
OpenAI: WebAssembly and Rust Are Reshaping Data Visualization in BI (blog.gopenai.com via hn)
24 min read Dec 30, 2025 Press enter or click to view image in full size Image by Author with Recraft.ai A journey from vendor lock-in to privacy-first, client-side analytics — and why the future of BI tools doesn’t need a backend. The Dat…
-
90 items
event
Altman AttackSam Altman, CEO of OpenAI, has faced multiple attacks on his home in San Francisco, including firebombing and drive-by shootings, raising concerns for his safety. Additionally, a majority of over 100 people interviewed by Ronan Farrow described Altman as a "pathological liar.
- 11m Elon Musk Admits xAi is Distilling OpenAI Models
- 5h Families of Canadian mass shooting victims sue OpenAI, CEO Altman in US court
- 9h The Download: storing nuclear waste and orchestrating agents
- 13h OpenAI, Sam Altman Hit with Slate of Lawsuits over Mass Shooting Canadian School
- 14h OpenAI sued by families of school shooting victims in Canada's Tumbler Ridge
143 itemsevent
Anthropic MythosAnthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
- 49m Cutting Through the Mythos: What AI Vulnerability Discovery Means for OT
- 1h Claude Mythos supports Image outputs - Anthropic's first image gen model
- 7h White House Opposes Anthropic's Plan to Expand Access to Mythos Model
- 19h what is claude mythos doing in my azure model catalog 😭
- 21h Trump officials draft plan to bring Anthropic back amid Pentagon fight
After a while you stop seeing “projects” and start seeing patterns Different founders different ideas different stacks Same failures every time And almost never because the model wasn’t good enough The first is integration The AI works in…
TRiP — TRansformer in Progress A few-files, all-in-one C engine for Transformer AI models: inference, training, tokenizer creation, chat, and vision. Built from scratch over 18 months (from March 2024 to August 2025) during my lunch breaks…
Claude just rickrolled me (www.reddit.com)
I just asked Claude to generate a basic about page with a placeholder YouTube embed… and it straight up used Rick Astley Is this a normal thing or did I just get played??
- Claude.md (gist.github.com via hn)
- What do you do with Claude? (www.reddit.com)
Guardians: Static verification for AI agent workflows (github.com via hn)
Guardians Static verification for AI agent workflows. An implementation of the ideas in Erik Meijer's "Guardians of the Agents" (CACM, January 2026).
Just wondering (www.reddit.com)
I recently started a new position in a new working place, and while Ai usage is not brand new to me, I need some clarifications. The organization I am working for is at the very beginning of transitioning towards a heavy Ai usage in all co…
Show HN: Larkin – Authorization middleware for x402 agent payments (larkin.sh via hn)
x402 authorization · v1 Your API takes x402 payments. You have no idea who’s paying.
-
9 items
model roundup
GLM 5.1GLM-5.1 is a next-generation model with enhanced coding capabilities, achieving state-of-the-art performance on SWE-Bench Pro and leading GLM-5 by a wide margin in repo generation and real-world terminal tasks. Community reports highlight its impressive speed, with 40 tps and over 2000 pp/s on stable setups, though some users are experimenting with hardware optimizations for better performance.
131 itemsevent
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
- 1h 26 years ago I took a website management company public on NASDAQ (200+ staff, 60 engineers). Over just a few weekends I rebuilt a better product using Claude Code.
- 4h Brightdata plugin
- 7h New to the pro plan. Recommend me the best tools and resources.
- 15h I got tired of switching between 10 different claude codes and claude cowork
- 1d Where do skills live? I'm so confused
Show HN: Phase Router – capacity-aware routing for MoE (github.com via hn)
A deterministic, capacity-aware routing kernel that reduces dropped work in load-balanced systems. Trades microseconds of routing for milliseconds of saved compute.
Alright so hear me out. Every single time you start a new AI agent project you end up writing the same configuration scaffolding from scratch.
Accurate infographics with ChatGPT Images 2 (surguy.net via hn)
- ChatGPT Images 2.0 (openai.com via hn)
- ChatGPT Images 2.0 (chatgpt.com via hn)
- ChatGPT Images 2.0 (twitter.com via hn)
+1 more
- ChatGPT Images 2.0 2K (www.reddit.com)
Is local AI the actual endgame? (M5 Mac Studio vs. Dual 3090s) (www.reddit.com)
Mozilla's opposition to Chrome's Prompt API (which only supports Google Gemini Nano) (news.ycombinator.com via reddit)
could not extract summary
Dear Claude (www.reddit.com)
-
38 items
event
MistralMistral, a French AI company, is set to release a medium-sized model with 128 billion parameters and is planning to launch Workflows in public preview. The company, founded by Arthur Mensch, continues to grow its AI empire despite not being based in the United States.
237 itemsmodel roundup
Qwen 3.6Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 1h Are Qwen 3.6 27B and 35B making other ~30B models obsolete?
- 1h 12GB-Club: 4070S qwen3.6 27b + 35b a3b, and Gemma 4 26b a4b + 31b speeds
- 2h Qwen3.6 27B seems struggling at 90k on 128k ctx windows
- 4h Actual comparison between locally ran Qwen-3.6-27B and proprietary models
- 5h Qwen-27B as a Local Agent — It Actually Works Now
could not extract summary
Agentic User Research Tool (github.com via hn)
Research AI AI-powered user research, end to end. Frame a problem, pick your personas, attach your artefacts — then watch eight archetypes interview themselves and synthesise a report.
How to stop your agents from making the same mistakes (twitter.com via hn)
LangChain has raised $160 million. Three years of development.
how to do?? (www.reddit.com)
bruh I've around 150$ in my aws bedrock account and I've configured all the anthropic models like i just wanted to use those credits in the claude code and build some projects for my end sem.. The first time I tried it was just the bedrock…
A Dungeon Master as a long-horizon agent (h-tu.ch via hn)
Like others, I’ve tried to play solo RPGs and adventure games directly with ChatGPT / Gemini / Claude via their chat UI. While LLM chat applications can convincingly create a world setting, narrate a scenario and interact over a modest num…