Notes on DeepSeek (twitter.com via hn)
We visited the company HQ last Tuesday. It was founded in 2023 by Liang Wenfeng and operated out of his hedge fund, High-Flyer, until somewhat recently.
- DeepSeek v4 (api-docs.deepseek.com via hn)
- DeepSeek-V4 (huggingface.co via hn)
Security scanners for AI agent skills agree no better than chance (trymastro.com via hn)
The top scanners disagreed 64% of the time and agreed no better than a coin flip. A data study.
Manifesto for Agentic Teams – reorganizing engineering around AI agents (agentic-team-manifesto.org via hn)
Outcomes over output: more code is not more value. We measure what ships to users, not what ships to the merge queue.
Ask HN: What has been the fate of code review? (news.ycombinator.com)
Given the increased use of AI, my experience is that teammates are moving so fast churning out so many changes that it is nigh impossible to review it all. I can't even keep up with the code being generated by my own use of LLMs at times.
- Ask HN: How / What do you use for Code Review? (news.ycombinator.com)
Faster inference won't save you (graphcoder.ai via hn)
Cutting agent latency with a distributed event log. We went into Graphcoder assuming agent latency was mainly an inference problem. That lasted until we watched real sessions run.
The Rise of the Agent Runtime (golem.cloud via hn)
The dominant use of AI in 2026 is a coding agent—even though almost none of the people using AI think of themselves as programmers, and almost none of them ever see a line of code. This shift is invisible to users, but it is breaking the i…
A €0.01 bank transfer could compromise a banking AI agent (blue41.com via hn)
Track per session/chat usage percentage and tokens (www.reddit.com via reddit)
So what is a good tool to get detailed metrics locally for claude-code? Preferably something with a UI.
387 items, event
Copilot: Microsoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
- 3m Show HN: AuthAI, an open-source relay for user-authorized AI sessions
- 3h Show HN: Codacy Skills for Claude, Codex, Copilot, etc.
- 19h who has built and shipped a completely vibe coded project?
- 20h I built an open-source persistent memory layer for AI coding agents
- 21h Claude Fable 5 is generally available for GitHub Copilot
347 items, event
Cowork: Issues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
- 16m Built a local AI desktop agent inspired by Claude Cowork (I’m 13)
- 1h A 13-year-old basically open-sourced Claude Cowork & Manus desktop
- 8h Claude Fable 5's new features, tested by having it write its own launch coverage
- 10h Fable 5 Claude code
- 21h Can't select Fable 5 for Claude Code. It is shown on Claude Chat and Cowork but not on Code. Anyone else experiencing this?
After too many debugging sessions where I had no idea what my agent remembered or why it made a decision — I got frustrated and built something. notmemory is an open-source Python SDK that gives AI agents auditable, reversible memory.
I was always on either pro or 5x plan but I guess anthropic got me this time ;) (www.reddit.com)
Claude Code VS Google Antigravity (2.0) (www.reddit.com via reddit)
Hi everyone, as a college student, I’ve had a free Gemini Pro account (the €20 version) since October 2025, so I’ve always used Gemini, NotebookLM, and the entire Google suite for my studies. I’m a master’s student in data science, and in…
- Claude Code Plugin VS Code on Google Antigravity editor (www.reddit.com)
- Claude code (www.reddit.com)
Our workplace LLM mass delusion (blog.avas.space via hn)
I can't help but wonder whether we will look back on this AI hype in the workplace with confusion and embarrassment. If we indeed progress into a future where the bubble will burst, models will further close…
Hey guys, To those of you who are using Claude code on terminal, does switching to Fable 5 really exhaust your 5 hour limit quickly? I've been using it on the terminal since yesterday and I'm getting the same usage as Opus!
An autopsy of Claude Code's deep research (steel.dev via hn)
I had Claude Code pry its own deep-research workflow out of its binary, then pointed that workflow at a question about itself. The verdict: it searches wide and never doubles back.
- Almanac, turn claude code into a deep research agent (www.reddit.com)
- Claude Code or Cursor for Deep Learning Research (www.reddit.com)
- Converting Claude Code into the most intelligent Deep Research Agent (www.reddit.com)
Explicit Seams as Agent Affordances (blog.tacoda.dev via hn)
The agent’s diff was clean. Three files, all tests green, nothing obviously wrong in the PR.
9 items, model roundup
Sonnet 4.6: Several updates and comparisons revolved around Sonnet 4.6, including its performance in dashboard analytics alongside Opus 4.8, and its role in processing critical requirements for a benchmark test with Gemma 4.31B QAT.
- 21m Issue with Sonnet 4.6
- 3h Hitting Mythos Guardrails but not using Fable?
- 5h Claude Sonnet 4.6 Making VMs in GCP, Azure and AWS via a Textual Agent Interface
- 8h As impressive as Mythos/Fable is, I really hope that we’ll see upgraded Sonnet and Haiku models soon…
- 1d Claude Sonnet hits 100% comprehension on a data format it's never seen. Opus scores 96.2%. We tested 10 models across 3 providers.
183 items, model roundup
GPT 5.5: A significant leak of the OpenAI Codex model, referred to as GPT-5.5, was captured on video before it was patched. The incident involved models named Arcanine and Glacier-alpha.
- 24m I Tested Claude Fable and GPT-5.5 xHigh on a Real Packing Algorithm, Claude Won Efficiency, GPT Won Speed
- 1h Show HN: I generated 235 system docs in a day using GPT-5.5
- 4h Pelican on a Bicycle: Claude Fable 5 vs. GPT-5.5 Pro vs. Gemini 3.1 Pro
- 5h GPT 5.5 vs Fable/Mythos 5 Tamagotchi Showdown
- 7h Fable surpasses GPT 5.5 completely
Any data engineer here using claude (www.reddit.com via reddit)
looking for how to use claude for data engineering. any suggestions?
Flights: Agent-Native Ingest in Motherduck (motherduck.com via hn)
My harness (www.reddit.com via reddit)
This is my harness. There are many like it, but this one is mine.
- "Harness" lol (www.reddit.com)
Fable 5 for fiction world building: wow (www.reddit.com via reddit)
Yesterday I took Fable for a ride into the writing project I had worked on with both Sonnet and Opus for the past few months. Although there is a bit of actual writing here and there, it is mostly world building at this stage: dozens of diffe…
We're building an AI healthcare receptionist platform that handles inbound patient calls, patient verification, appointment booking/rescheduling/cancellation, clinic FAQs, human call transfers, SMS notifications, call recordings, transcrip…
Claude Fable 5 played a full chess game on lichess using only screenshots and mouse clicks — no chess API, no DOM access for moves. It checkmated Stockfish in 18 moves.
Either I'm crazy or Claude’s account appeal form doesn't exist. (www.reddit.com via reddit)
I am not asking for you to fix my account. In https://support.claude.com/en/articles/8241253-safeguards-warnings-and-appeals there's an appeal form button which just links to https://claude.ai/restricted, I have tried contacting support a…
Why LLMs still lack taste (beyondtheprior.com via hn)
Frontier LLMs are really smart, and they’re becoming particularly good at software development. It feels like every week there’s a new model release that achieves SOTA scores on a handful of benchmarks.
Lua.ex: Sandboxed Lua 5.3 on the Beam, Built for AI Agents · Lua.ex (deflua.com via hn)
Embed untrusted Lua in your Elixir app: AI agent tools, user formulas, per-tenant plugins. Pure BEAM, sandboxed by default, zero NIFs.
Claude Fable 5 crosses 81.9%, reaching 1st on Simplebench (www.reddit.com)