Saw In Time when it came out in 2011 and thought it was a wild sci-fi premise. glowing green numbers on your arm counting down to your death, what a concept.
Open-source, self-updating wiki for your codebase (www.reddit.com)
I got tired of re-explaining the same codebase context to coding agents. Stuff like: “we tried moving auth into middleware, but backed it out because it broke OAuth callbacks,” or “that weird retry logic exists because Stripe webhooks arri…
When I joined the Codex engineering team in September 2025, Codex for Windows didn’t have a sandbox implementation meaning that Windows users were forced to choose between two subpar options when using OpenAI's coding agents: Approving nea…
Show HN: Containarium – self-hosted sandbox for AI agents, MCP-native (github.com via hn)
Containarium The open-source, self-hostable, agent-native sandbox. Bring your own agent — Cursor, Claude Code, OpenCode, your own MCP client.
Tell HN: Starting June 15, claude -p usage will change (news.ycombinator.com)
"Starting June 15, 2026, Agent SDK and claude -p usage on subscription plans will draw from a new monthly Agent SDK credit, separate from your interactive usage limits." Details: https://support.claude.com/en/articles/15036540-use-the-clau…
Free Claude Dashboard - enjoy (www.reddit.com)
How's it going. As a lot of you have been messaging me about Claude, I figured I would share a version of my dashboard & give it to you guys for free.
I’ve been hitting the same wall for months: I’d build up a CLAUDE.md over weeks of work — project conventions, gotchas, business rules, the “we tried that, don’t do it again” lessons — and eventually the rules file itself starts eating my…
This is not a promotion. We are looking for suggestions.
-
20 items
event
Function CallingRecent evaluations show that smaller models like Gemma 4 E2B outperform larger siblings in multi-turn tasks. Meanwhile, function calling capabilities are being enhanced across various AI platforms, including Qwen and Claude, with new search engines and defense mechanisms also emerging to support these advancements.
- 13m Looking for fast vision-capable local models that handle tool calls well (open-source app, want to add local support)
- 1d Needle: We Distilled Gemini Tool Calling Into a 26M Model
- 2d Your harness is failing your agent but there's no benchmark to prove it
- 3d ReAct or CodeAct, that is the question
- 5d Qwen3.6:27b vs qwen3-coder:30b vs deepseek-coder:33b on code gen, tool calling, and agent tasks
354 itemsmodel roundup
Opus 4.7Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 14m Is this math right? Agent SDK on Opus 4.7 vs the new monthly credit
- 5h Is Cowork a token burner ?
- 6h I tested GPT-5.5, Claude Opus 4.7, and Gemini 3.1 Pro on financial-control
- 8h Claude Code vs Codex: 36 files vs 28, $2.50 vs $2.04, and one infinite loop. My full breakdown.
- 9h Wait I thought I was the human here
Who Trusts Sam Altman? (techcrunch.com via hn)
In May 2023, OpenAI CEO Sam Altman was sworn in and testified before Congress about the regulation of artificial intelligence. Senator John Kennedy of Louisiana heard his ideas about licensing advanced models and asked if Altman might be q…
- Lesson from sam altman (www.reddit.com)
- Sam altman (www.reddit.com)
I Got Bored and Ended Up Automating the Whole Process (www.reddit.com)
In my edtech bootcamp, I manually called mentors for a full-stack role. Same 4-6 questions every time, then manual back-and-forth for scheduling.
Ok so my background is paid media, mostly lead gen. For years I'd watch the same thing happen with every client.
For the past several months, I’ve been working with Claude as my primary collaborator on a project called SMARRT, which is a diagnostic framework that audits AI prompts before generation to flag what’s strong, weak, missing, or not applica…
Harvey's Legal Agent Benchmark (www.harvey.ai via hn)
Open-Sourcing :Harvey:’s Long Horizon Legal Agent Benchmark An open-source benchmark built to evaluate and improve agent capabilities for supporting legal work. We’re introducing Harvey’s Legal Agent Benchmark (LAB), an open-source benchma…
AI helped improve my fave pic. (www.reddit.com)
TLDR; I used AI I make my boy's head less blurry. So I'd like to preface this by saying I've edited the photo (about 2 hours of work) after I asked ChatGPT to help me make my snake's head clearer, because this is my absolute favourite pict…
ChatGPT vs Claude (www.reddit.com)
Before you say anything, I know Claude is better (and I prefer Claude). Today, a friend got an offer from Amex for a discount on ChatGPT through his Amex Biz card.
- Why does Claude do this? (www.reddit.com)
- Claude FM (www.reddit.com)
- What’s up, Claude? (www.reddit.com)
+9 more
- Claude + MS (www.reddit.com)
- Chatgpt vs. Claude (www.reddit.com)
- Claude: (www.reddit.com)
- Claude max vs ChatGPT pro (www.reddit.com)
- ChatGPT 5.5 🔥🔥🔥 (www.reddit.com)
- Claude 4.7 vs. ChatGPT 5.5 (www.tomsguide.com via hn)
- Claude.md (gist.github.com via hn)
- DOOM runs in ChatGPT and Claude (chrisnager.com via hn)
- What do you do with Claude? (www.reddit.com)
Open Challenge: Plain LLM Client Workflow vs Claude Code (www.reddit.com)
I keep seeing people dismiss plain LLM-client workflows as “AI slop.” Since this is r/ClaudeAI, let’s make the challenge specific. Bring Claude Code.
-
424 items
model roundup
Qwen 3.6Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
232 itemsevent
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
- 41m Claude Agent SDK billing changes June 15. What it means for marketing teams and what I am doing
- 8h BAA - HIPAA enablement
- 8h Imagine you push lorem ipsum content to prod
- 15h Easiest method for social media post automations?
- 17h Ideas to automate Teams Meeting transcripts to Cowork-meeting intelligence ideas?
Claude -p headless mode cannot use Max limits, will fall under API plan (news.ycombinator.com)
Just received the email, effective date is June 15th. Figured it would happen eventually, but damn are we reliant on it for a lot of workflows internally in our company now.
What I've seen from those who have dared to deploy agents with spending/financial capabilities, there seems to be three distinct comfort levels in practice. Most, as expected (still early days), are at the query and recommend stage, agents…
Shipped v0.2.0 today. MIT, public repo.
Has anyone else run into major issues with MiMo-V2.5 (the 310B total / 15B active MoE model from Xiaomi)? I tried the UD-Q4_K_XL quant from Unsloth.
I've been building multi-agent systems for a while — running a 40-agent team on a real product at work. The pattern I kept seeing fail was the same one most public setups use: one agent reviews code, decides if it's good, and routes the ou…
Google Apps script with Claude code and clasp (www.reddit.com)
Has anyone successfully created any Google apps script using Claude code? Google recommends using "clasp" that turns the cloud GS files into local JS files.
Anthropic carves all non-interactive use out of monthly subscriptions (venturebeat.com via hn)
Anthropic reinstates OpenClaw and third-party agent usage on Claude subscriptions — with a catch | VentureBeat Orchestration Infrastructure Data Security More Newsletters Featured Anthropic reinstates OpenClaw and third-party agent usage o…
A few days ago u/thanpolas posted "the trick was the spec, not the prompts" after shipping a full app solo with Claude. That exactly matched what was eating my time.
What defines a power user in your opinion VS the official labelling? (www.reddit.com)
I'm trying to figure out what actually makes a power user. I saw a news headline and the author stated using audio to audio made him a power user.
Seems to me like they can basically do everything software related now so surely a good enough sequence of input tokens would be enough. I guess in a way it's guaranteed since the frontier labs are doing all their work through agentic flow…