What if new proofs are included in LLM trained so LLM rediscover it? (news.ycombinator.com)
If I were to sell the power of LLMs as powerful research agents, and if I had enough money, I could think about introducing little "gems" into the training set of LLM so that they are able to discover new theorems and proofs. There is a lo…
What Claude says vs What Claude thinks (www.reddit.com)
Anthropic research: https://www.anthropic.com/research/natural-language-autoencoders
- Claude Says No (wadetregaskis.com via hn)
Free Gpt.im (freegpt.im via hn)
Generate AI images with GPT Image 2 for free within a fair daily cap. No ChatGPT account, no login, no credit card, no watermark on outputs.
- What's going on with GPT-5.3 for free users? (www.reddit.com)
- GPT-4 (openai.com)
Which industries will be disrupted the most by autonomous AI agents? (www.reddit.com)
Curious to hear everyone’s thoughts on this. As autonomous AI agents get better at handling complex tasks with minimal human input, which industries do you think will see the biggest disruption first and why?
Has anyone set a local LLM up as a language learning tool? (www.reddit.com)
I've been learning German recently, and it occurred to me that I could point some of my AI horsepower at having a German speaking LLM to practice with. I'm not too concerned with the speech to text side of things or getting it to talk back…
Inputs on improving development workflow (www.reddit.com)
Looking for ideas on how I can optimize my workflow further. I currently have created a moderately complex vibe coded app.
-
205 items
event
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
91 itemsmodel roundup
Opus 4.6Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.
I've replaced my Claude subscription with a sleep control app (twitter.com via hn)
Don’t miss what’s happening People on X are the first to know. Log in Sign up Post Conversation patoroco @patoroco Stop wasting money on Claude or ChatGPT subscriptions for coding.
Job at Cursor (www.reddit.com)
Has anyone here worked at Cursor or knows someone who does? A recruiter reached out recently, and I’m trying to get a genuine feel for what the company is like beyond the usual recruiter pitch.
Running a quantized 72B VLM on M4 Pro for GUI tasks — some numbers (www.reddit.com)
Running a quantized 72B VLM on M4 Pro for GUI tasks — some numbers Been messing around with running a vision-language model locally on my Mac to do GUI automation stuff — basically the model looks at a screenshot of my desktop and decides…
Is “prompt debt” becoming a real problem in AI apps? (www.reddit.com)
Lately I’ve been noticing how quickly prompts grow in real AI apps. Teams keep adding: more examples formatting instructions fallback behavior style constraints edge-case handling …but almost nothing gets removed over time.
I keep seeing the same trajectory in AI startup conversations: AI search → coding agents → OpenClaw → agent IM → ? Most people fill in that question mark with some version of "agent collaboration platform." AI-native Slack.
ABA Games (1D Pac-Man, etc) Agentic Gamedev Skills (github.com via hn)
Agentic Gamedev Skills English | 日本語 This repository collects agent skills extracted from game-development work and related agentic-workflow research. Each skill lives under .agents/skills/, uses SKILL.md as its entry point, and may includ…
-
323 items
model roundup
Opus 4.7Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
50 itemsevent
HallucinationClaude Opus 4.6, Anthropic's flagship model, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, highlighting a significant regression in handling certain tasks. Meanwhile, biologists are revisiting cases of mushroom-induced hallucinations in China, suggesting ongoing research into natural causes of similar phenomena.
- 2h Can model Hallucination also be a demand signal?
- 1d GPT-5.5 Instant might be OpenAI’s most important update yet and almost nobody is talking about why
- 1d The weirdest thing about AI agents is how human failure patterns start showing up
- 1d Giga Launches Realtime Hallucination Correction
- 1d Φ³−φ⁻³=4 (exact): The transformer's ff/d ratio is algebraic, not empirical
This is an automatic post triggered within 2 minutes of an official Claude system status update. Incident: Elevated errors on Claude Opus 4.1 Check on progress and whether or not the incident has been resolved yet here : https://status.cla…
- Claude Status Update : Elevated errors on Claude Opus 4.1 on 2026-05-09T07:57:05.000Z (www.reddit.com)
- Claude Status Update : Elevated Errors on File Operations on 2026-05-08T14:12:26.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-05-08T17:13:50.000Z (www.reddit.com)
+67 more
- Claude Status Update : Elevated Errors on File Operations on 2026-05-08T15:17:02.000Z (www.reddit.com)
- Claude Status Update : Elevated Errors on File Operations on 2026-05-08T15:03:50.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-05-08T17:01:08.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Sonnet 4.6 on 2026-05-08T15:03:50.000Z (www.reddit.com)
- Claude Status Update : Elevated errors across Claude Models on 2026-05-08T09:49:14.000Z (www.reddit.com)
- Claude Status Update : Elevated errors across Claude Models on 2026-05-08T11:32:47.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-05-08T17:25:40.000Z (www.reddit.com)
- Claude Status Update : Elevated errors across Claude Models on 2026-05-08T10:26:46.000Z (www.reddit.com)
- Claude Status Update : Elevated errors across Claude Models on 2026-05-08T11:40:41.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Sonnet 4.6 on 2026-05-08T15:11:04.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-05-07T12:10:17.000Z (www.reddit.com)
- Claude Status Update : Elevated errors across multiple models on 2026-05-06T15:29:02.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-05-07T12:20:20.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-05-07T12:45:50.000Z (www.reddit.com)
- Claude Status Update : Elevated errors across multiple models on 2026-05-06T15:54:50.000Z (www.reddit.com)
- Claude Status Update : Elevated errors across multiple models on 2026-05-06T16:51:13.000Z (www.reddit.com)
- Claude Status Update : Elevated errors across multiple models on 2026-05-06T16:32:36.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.5 and Sonnet 4.5 on 2026-05-04T13:59:17.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.5 and Sonnet 4.5 on 2026-05-04T14:45:58.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-05-04T14:07:57.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.5 on 2026-05-04T09:49:29.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-05-04T14:33:46.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.5 and Sonnet 4.5 on 2026-05-04T14:27:54.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.5 and Sonnet 4.5 on 2026-05-04T14:08:19.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.5 on 2026-05-04T09:23:04.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.5 on 2026-05-04T09:14:14.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.5 on 2026-05-04T08:19:01.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.5 on 2026-05-04T08:12:45.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Sonnet 4.5 on 2026-05-04T08:09:58.000Z (www.reddit.com)
- Claude Status Update : Claude.ai unavailable and elevated errors on the API on 2026-04-28T17:51:36.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Haiku 4.5 on 2026-04-30T13:10:09.000Z (www.reddit.com)
- Claude Status Update : Claude.ai unavailable and elevated errors on the API on 2026-04-28T18:33:55.000Z (www.reddit.com)
- Claude Status Update : Claude.ai unavailable and elevated errors on the API on 2026-04-28T19:15:52.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Haiku 4.5 on 2026-04-30T14:01:41.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Haiku 4.5 on 2026-04-28T12:38:38.000Z (www.reddit.com)
- Claude Status Update : Elevated billing related errors on Claude.ai on 2026-04-27T15:18:47.000Z (www.reddit.com)
- Claude Status Update : Investigated elevated errors and slower responses on claude.ai on 2026-04-25T18:42:40.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Haiku 4.5 on 2026-04-28T12:45:07.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-29T00:00:29.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Haiku 4.5 on 2026-04-28T12:24:00.000Z (www.reddit.com)
- Claude Status Update : Elevated error rates on Claude Opus 4.7 on 2026-04-25T01:35:55.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-28T23:33:07.000Z (www.reddit.com)
- Claude Status Update : Claude.ai unavailable and elevated errors on the API on 2026-04-28T18:59:47.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Haiku 4.5 on 2026-04-29T14:14:34.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Haiku 4.5 on 2026-04-29T14:01:16.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Haiku 4.5 on 2026-04-29T13:47:45.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Sonnet 4.5 on 2026-04-28T13:50:06.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Sonnet 4.5 on 2026-04-28T13:29:56.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T08:57:57.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T14:53:02.000Z (www.reddit.com)
- Claude Status Update : Elevated billing related errors on Claude.ai on 2026-04-27T14:11:29.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T08:25:22.000Z (www.reddit.com)
- Claude Status Update : Elevated error rates on Claude Opus 4.7 on 2026-04-25T02:34:30.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T16:29:45.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T15:03:39.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T15:20:03.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T15:57:36.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T14:55:35.000Z (www.reddit.com)
- Claude Status Update : Investigated elevated errors and slower responses on claude.ai on 2026-04-25T19:02:15.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T11:58:15.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T08:43:51.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T08:37:30.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T07:48:31.000Z (www.reddit.com)
- Claude Status Update : Elevated error rates on Claude Opus 4.7 on 2026-04-25T02:15:52.000Z (www.reddit.com)
- Claude Status Update : Opus 4.6 elevated rate of errors on 2026-04-16T07:43:32.000Z (www.reddit.com)
- Claude Status Update : Opus 4.6 elevated rate of errors on 2026-04-16T06:50:56.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T17:42:57.000Z (www.reddit.com)
Sharing my evals-driven vibe koding setup (www.reddit.com)
(Disclaimer: Originally posted on r/AIEval thought this is relevant) Been iterating on a setup where my coding agent (cursor in my case) runs evals in a loop, reads the failing metrics, and patches things automatically. Wanted to share the…
Claude's signup workflow is terrible (news.ycombinator.com)
I used claude via the web console for a while, then thought I'd want to go for higher limits, and API usage. I click and end up here https://claude.ai/upgrade.
How to work with Claude as a novice? Hitting limits (www.reddit.com)
Hello! I've recently started working with AI more and more due to my company exceeding requirements and timings for development.
We turned Cursor.ai into an OpenClaw-style multi-agent control panel (www.reddit.com)
I’ve been experimenting with Cursor agents for more than just one-off coding tasks, and I kept running into the same problem: once you have multiple agents running across different workflows, the terminal starts to feel messy fast. So we b…
Anthropic weighs fundraising for near $1T valuation, FT reports (www.reuters.com via hn)
paywalled
-
5 items
model roundup
Sonnet 4.5On May 4, 2026, multiple automated status updates reported elevated errors for Claude Opus 4.5 and Sonnet 4.5 around the same time, with Anthropic introducing a feature called E-STEER that applies emotion intervention to these models.
Qwen doesn't work for free (www.reddit.com)
could not extract summary
Vibe-coders, I’m done with Electron-based sidebars. I’m done with "Apply" buttons.
How I made my Claude setup more consistent (www.reddit.com)
I’ve been trying different Claude setups for a while, and honestly, most of them don’t hold up once you start using them in real work. At first, everything looks fine.
- Why Claude is not consistent? (www.reddit.com)
- How I made my Claude setup more consistent (www.reddit.com)
I want to build the AI agent that can replace me 100% (www.reddit.com)
I’m actually serious about this lol Not AGI or sci-fi stuff, I mean realistically with current models like Claude I use Claude Max pretty heavily already and honestly it feels way closer than most people think. A huge part of my work is ba…
This question will probably make more sense when I explain my current situation: lately I’ve been doing some small projects here and there to some small business in my town and they have been working fine, but that is about to change. I ma…
How to setup caveman on the web app of Claude ? (www.reddit.com)
Did anyone use the caveman prompt (or skill) in the web app version of claude, if yes how did you achieve that and also could you tell me did it really help with saving tokens or not ?
Hi Reddit, we are a team of database researchers (including a PhD from MIT DB Group) and we just open-sourced an embedded vector database for agent/LLM applications. An embedded vector database supporting both text and vectors.