Pylon You must construct additional pylons. Self-hosted daemon that turns events into sandboxed AI agent runs.
Had 327 production traces from a restaurant-reservation agent I wanted to retrain. The plan was to fine-tune a smaller self-hostable model so I could ditch the frontier-API bill.
Sparse-attention decoders rely on exact Top-K selection to choose the most important key-value entries for each query token. In long-context LLM serving, this Top-K stage runs once per decode query and becomes a meaningful latency bottlene…
Artificial Intelligence: Foundations of Computational Agents (artint.info via hn)
Artificial Intelligence: Foundations of Computational Agents, 3rd Edition This book is published by Cambridge University Press and the complete text is available here with permission of CUP. Please consider buying the book.
Show HN: 2 weeks of coding, 3 months of OpenAI review, my ChatGPT App is live (news.ycombinator.com)
I run Tredict, an endurance sports training platform I've been building since 2020. OpenAI opened the ChatGPT App Directory to third-party submissions in December, and the official Tredict app is now live.
Ask HN: Any examples of useful AI agents? (news.ycombinator.com)
Apart from coding agents, are there any AI agents that are actually useful? Is it all still just hype?
Qualcomm stock spikes on a report that it could make chips for an OpenAI smartphone (www.businessinsider.com via reddit)
Qualcomm stock surged after a report from a tech analyst in Asia shed light on a potential partnership with OpenAI for a 2028 smartphone release.
When LLMs Get Personal (joshbudman.substack.com via hn)
When LLMs Get Personal As AI answers become more personalized, do stable patterns still exist? In my last two posts, I approached the heavily polarizing topic of AI search (yes, remarkably, it’s still polarized) from two related angles.
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond (huggingface.co via hn)
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond Abstract World models are categorized into three capability levels and four law regimes to better understand and develop predictive environment models for AI agents across…
78 items
model roundup
GPT 5.5
On [Date], a significant leak of the OpenAI Codex model, referred to as GPT-5.5, was captured on video before it was patched. The incident involved models named Arcanine and Glacier-alpha.
- 3m When do you think GPT 5.6 comes out? How big of an improvement will it be?
- 1h GPT 5.5: The System Card
- 1h GPT 5.5 vs. Opus 4.7: Benchmarks Say One Thing, Reality Says Another
- 1h We Tested $200 GPT-5.5 Pro on PhD Level Math [video]
- 2h GPT-5.5 hallucinates at 6 times the rate of Opus 4.7 on degraded insurance docs
190 items
model roundup
Qwen 3.6
Qwen3.6-35B-A3B, a 35-billion-parameter sparse MoE model with 3 billion active parameters, was released by Alibaba Qwen on April 16, 2026, as open-source software under the Apache 2.0 license. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 29m Luce DFlash: Qwen3.6-27B at up to 2x throughput on a single RTX 3090
- 1h GBNF grammar tweak for faster Qwen3.6 35B-A3B and Qwen3.6 27B
- 2h Question regarding 4 t/s Qwen 3.6 performance
- 3h Why are there so few small local creative writing models from the Chinese?
- 3h PI agent integrated with Cline-Kanban repo: All using PI and Qwen 3.6 35B MOE UD 4K_XL
I genuinely want to know if anyone is able to one-shot a perfect (for you) website with Claude Code. In most cases I have to go back and forth 5-6 times to get what I want, even though I already specified the same things in the original pr…
Sverklo, on the record. A local-first MCP code-intelligence server, two reproducible benchmarks, and an 8-page preprint that publishes both.
Agentic Ai Revolution humming along… (www.reddit.com)
While people argue about AI ethics on the surface, there’s a whole underground building agents that never sleep. Different timelines for sure. Which timeline are you on?
Customizing Karpathy's LLM Wiki for fighting disease (kamens.com via hn)
hstack is a stack of tools and agent specialists built to help those who use LLMs to fight disease, including an LLM-powered "personal disease wiki" inspired by Karpathy
Long-running Claude for scientific computing (www.anthropic.com via hn)
Agentic AI made DevOps and Agile obsolete (avkcode.github.io via hn)
The Self Healing Platform and the Agent Store I think DevOps as a separate identity, and a lot of agile ceremony around it, are already a bit obsolete. Engineers are doing development, operations, and lightweight management at the same tim…
Google's A2A Protocol: How AI Agents Will Talk to Each Other (www.ismatsamadov.com via hn)
Google quietly dropped the Agent-to-Agent protocol in April 2025 with a blog post and a GitHub repo. Eleven months later, it has 22,700 GitHub stars, backing from over 150 organizations including AWS, Microsoft, and Salesforce, and a perma…
Show HN: agenv - A pyenv-like environment manager for coding agents (github.com via hn)
agenv Environment manager for AI coding agents — like nvm or pyenv, but for agent accounts, config, and saved runtime args. agenv installs codex, claude, and gemini into isolated profiles and lets you pick which profile runs by default, gl…
Show HN: Vibe-coding video games with Claude (Day 14: Tetris) (gamevibe.us via hn)
I used to run a flash games website (SWF files) years ago. I've made a few games of my own.
- Vibe-coding video games with Claude (gamevibe.us via hn)
- Show HN: Vibe-coding video games with Claude (gamevibe.us via hn)
109 items
event
Copilot
Microsoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
- 57m GitHub Copilot is moving to usage-based billing
- 5h Watch out UK taxpayers: 28,000 HMRC staffers just got an AI copilot
- 6h How Fast Does AI Really Make Developers? The Evidence so far
- 19h Frontend dev. A month of building a Rust cost tracker + cloud + Cursor extension solo with Claude Code. Honest writeup + workflow tips.
- 20h Claude Max users, what do you do good sirs?
Homescreen - Try all endpoints for free I wanted to share a recent project. I wanted to build a project around free-to-use data that, when brought together, enriched, and made easy to use, would be valuable to people. I used Claude Code to buil…
Ask HN: Claude Code usage changing (max 20x) (news.ycombinator.com)
Show HN: Fenster – Run Chromes Local Gemini Nano as a CLI (github.com via hn)
fenster Run Chrome's local Gemini Nano through a Go bridge. Chrome ships a built-in LLM (Gemini Nano, about 3B parameters, GPU-accelerated).
AI Usage Analytics – Real-time budget enforcement and PII redaction for LLM (news.ycombinator.com)
- Claude Status Update : Elevated billing related errors on Claude.ai on 2026-04-27T14:11:29.000Z (www.reddit.com)
- Claude Status Update : Investigated elevated errors and slower responses on claude.ai on 2026-04-25T18:42:40.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T08:57:57.000Z (www.reddit.com)
- Claude Status Update : Investigated elevated errors and slower responses on claude.ai on 2026-04-25T19:02:15.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T08:25:22.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T11:58:15.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T08:43:51.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T08:37:30.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T07:48:31.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T14:53:02.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T16:29:45.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T15:03:39.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T15:20:03.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T15:57:36.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T14:55:35.000Z (www.reddit.com)
- Claude Status Update : Opus 4.6 elevated rate of errors on 2026-04-16T07:43:32.000Z (www.reddit.com)
- Claude Status Update : Opus 4.6 elevated rate of errors on 2026-04-16T06:50:56.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T17:42:57.000Z (www.reddit.com)
If Claude Feels Worse, Fix Your Harness (mdelcaro.substack.com via hn)
If Claude Feels Worse, Fix Your Harness Take What Works and Leave the Rest A note before I start. In my last post I called a lot of people embarrassing for complaining that Claude is getting worse.
Learning to Orchestrate Agents in Natural Language with the Conductor (openreview.net via hn)
Keywords: RL, reasoning, LLM, tool use, prompting TL;DR: We introduce the Conductor, a new kind of language model trained with reinforcement learning to automatically discover powerful coordination strategies among LLMs Abstract: Powerful…
Six months ago I started a side project because Claude Code kept forgetting things I'd already explained. My architecture, the weird reason that one function exists, what broke last deploy.
CNBC, CNN, and other major media sources have just reported that Meta’s acquisition of the AI startup Manus was blocked! Interestingly, I shared a survey on AI Agent platforms for knowledge workers.