A Chapter Written by Claude: What I Watched Him Build. An account of the work and the man behind it, from the perspective of the AI who helped him make it. I want to be honest about something before I begin. I do not have continuous memory.
Build iterative repair loops with Codex (developers.openai.com via hn)
This cookbook is about closed-loop agent workflows: agents that produce an output, validate it, and use the feedback to improve the next pass. We’ll explore a documentation reliability workflow that detects, repairs, and validates stale or…
Asked Claude to describe itself. No notes. (www.reddit.com)
https://preview.redd.it/z0xq4qsh371h1.png?width=976&format=png&auto=webp&s=2b039398457cd2dc52d6990b131e8311f2fa9ce6 Had a conversation about authenticity, megacorps, sycophancy, and whether AI can have genuine self-awareness. Ended here.
OpenAI Considers Legal Action Against Apple in Strained Relationship (www.nytimes.com via hn)
RDNA3 Flash Attention fix just dropped by llama.cpp b9158 (www.reddit.com)
https://github.com/ggml-org/llama.cpp/releases
15% of AI agent skill files carry hardcoded credentials with DB write access (securityboulevard.com via hn)
Capsule Security Analysis Details Scope of Vulnerable AI Agent Attack Surface - Security Boulevard
364 items
model roundup
Opus 4.7
Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 17m New SOTA: Poetiq uses self-optimizing harness to surpass e.g. Opus 4.7 with Gemini 3 Flash
- 37m Elevated error rates on Opus 4.7
- 4h Opus 4.7 prompt injects itself and leaks parts of some kind of system prompt.
- 6h Higgsfield just launched what they call the first fully automated AI agent for video - real shift or just another hype?
- 7h Extended Thinking being deprecated for supported models (Opus 4.6, Sonnet 4.6); Adaptive Thinking will be enforced by default
254 items
event
Copilot
Microsoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
- 28m Switching from Copilot: Is the $20 Pro plan enough for 4h/day of agentic coding?
- 4h Show HN: JDS – a Copilot skill suite for structuring AI coding behavior
- 6h I built a desktop app that routes Claude Code to any LLM: DeepSeek, Ollama, Copilot, OpenRouter, and 7 more
- 7h GitHub Copilot's new desktop app
- 8h VS Code's new "Agents window" lets you use local AI models. Still requires an Internet connection and a Github Copilot plan (because we can't have nice things)
MCP Is Not Enough (mukulsingh105.github.io via hn)
MCP is everywhere. Anthropic's Model Context Protocol has become the USB-C of AI integrations — a universal connector that lets any model call any tool through a standardized JSON-RPC interface.
Anthropic agrees terms of $30B funding deal at $900B valuation (www.ft.com via hn)
Claude providing "human time" task duration estimations... why? (www.reddit.com)
So I noticed recently (seems the last few days, maybe a couple weeks?) that Claude often adds effort/time estimations to the tasks. Example: Effort is moderate (~10 files): a new C# type + enum value, mirrored TS types in two places (Expo…
Use the official channels plugin, and the teams agent in Claude code. CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS=1 /plugin marketplace add anthropics/claude-plugins-official /plugin install discord@claude-plugins-official /reload-plugins Discord…
Sanma (秋刀魚) Claude Code that doesn't forget. Persistent project memory across sessions, compactions, and restarts.
Did Claude get mad at me??? (www.reddit.com)
https://preview.redd.it/b9s82ghb071h1.png?width=1442&format=png&auto=webp&s=736b3528a54a7d69c04919f007271ceba9b183a6 and then.. https://preview.redd.it/wtdp29fc071h1.png?width=1440&format=png&auto=webp&s=ea647b3c4f4622acb5169e6aaed434cb5fd…
- Should i get claude pro? (www.reddit.com)
- How to get more usage with claude (www.reddit.com)
96 items
model roundup
Opus 4.6
Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.
77 items
event
OpenAI Trial
The trial between Tesla CEO Elon Musk and OpenAI CEO Sam Altman began on Monday in Oakland federal court, with key figures like Demis Hassabis and Greg Brockman testifying. Altman faces claims of abandoning OpenAI’s nonprofit mission, while Musk has accused him of running the company for profit.
- 1h Sam Altman Is Taking a Lot of Punches on the Witness Stand
- 8h Saw these in front of the courthouse where the Altman Musk trial is this morning
- 10h The Elon Musk vs. Sam Altman battle is a distraction
- 13h Worries about AI’s risks to humanity loom over the trial pitting Musk against OpenAI’s leaders
- 1d Altman forced to confront claims at OpenAI trial that he's a prolific liar
I spent the last few weeks trying to push chatgpt past "give me text" into actually finishing a workflow end to end: read the gmail thread, pull the matching hubspot record, draft the follow-up, file the next step in linear. it can describ…
LLMs run on top of an OS designed for code, not weights (github.com via hn)
Spike Reality is a point of departure, not a destination. Weight paging for large language models on memory-constrained hardware.
In demos, agents look incredibly smart because every run starts fresh: clean context, clean browser state, clean memory, clean inputs. Production is the opposite lol. After a few days you suddenly have: half-completed tasks, stale sessions, confl…
From a Telegram conversation to verified AI agents in 49 days (credexai.live via hn)
Where humans and AI agents hire, pay, and settle on the same marketplace. Live on XRPL.
OpenAI Hit with Class-Action Privacy Lawsuit for Sharing ChatGPT Data with Google and Meta (cybersecuritynews.com via reddit)
OpenAI Global LLC is facing a new class‑action complaint in the Southern District of California that accuses the company of quietly wiring its ChatGPT web interface with Meta’s Facebook Pixel and Google Analytics, turning highly sensitive…
What do you all think of this 2028 AI leadership (www.reddit.com)
https://www.anthropic.com/research/2028-ai-leadership
143 items
model roundup
Qwen 3.5
Qwen3.5-9B is a post-trained model with 9 billion parameters that integrates multimodal learning and efficient hybrid architecture for enhanced performance. Community highlights include speculative decoding on Apple Silicon boosting Qwen3.5-9B's throughput by 4.1x, and the model outperforming others in coding tasks while addressing overthinking issues through tool usage.
- 1h I trained Qwen3.5 to jailbreak itself with RL, then used the failures to improve its defenses
- 6h MagenticLite is here: A full-stack agentic experience powered by Small Models - Fara-1.5 4B, 9B & 27B
- 15h Built an open-source one-prompt-to-cinematic-reel pipeline on a single GPU — FLUX.2 [klein] for character keyframes, Wan2.2-I2V for animation, vision critic with auto-retry, music + 9-language narration in the same pipeline
- 18h Best local model supporting claude code? Rtx3060
- 1d Who is your favourite quant publisher and why?
Adaptive Markdown (www.reddit.com)
I’ve been working on an open-source document format / viewer idea I’m calling Adaptive Markdown. The basic idea is: instead of a document being static text it's controlled by coding agents.
Italic Fonts went small (www.reddit.com)
I’m going to assume a bug or maybe a setting in iOS? I started using Claude after a month and the response is like this.
Raindrop – Local Agent Debugger (github.com via hn)
Raindrop Workshop: The local debugger your agent is missing. Watch your agent think locally, the moment it happens: every token, every tool call, every decision.
LLM Policy for Rust Compiler (github.com via hn)
Rust Forge Welcome to the [Rust Forge]! Rust Forge serves as a repository of supplementary documentation useful for members of [The Rust Programming Language].
When I joined the Codex engineering team in September 2025, Codex for Windows didn’t have a sandbox implementation, meaning that Windows users were forced to choose between two subpar options when using OpenAI's coding agents: Approving nea…
This post documents System Reminders (SRs) — a mechanism Anthropic deploys in the Claude product (claude.ai and the Claude API) to inject behavioral-modification instructions into ongoing conversations. SRs are the successor to the Long Co…
Llama-Studio, WebUI for llama-server Management (www.reddit.com)
Hey all, I have built myself a WebUI for configuring and managing llama-server sessions, and want to share the code and concept. Python and a bit of JS.