I built a 24/7 AI radio station called WRIT-FM where ChatGPT/Claude is the entire creative engine. Not a demo — it's been running continuously, generating all content in real time.
Before adding another feature, I like using a "receipt test." Write down what a user should be able to prove after using the app for 5 minutes. Not "the dashboard looks nice" or "the AI responded." A real receipt: a file was uploaded and c…
Claude Code Oauth Error (www.reddit.com)
I’ve set up Claude on my VPS, but when I try to log in, I get this issue. Even though I entered the correct code, it still shows this error.
- Claude OAuth (developer.puter.com via hn)
- Claude code (www.reddit.com)
- Claude Code OAuth down for >12 hours (news.ycombinator.com)
Stoa – open substrate for agent-readable SaaS (spec, runtime, registry) (stoa.tryvext.com via hn)
Open standard. Open runtime.
-
164 items
event
SecurityOpenAI has released GPT-5.4-Cyber for testing as part of its Trusted Access for Cyber Defense program, aiming to compete with Anthropic's Claude Mythos in the cybersecurity domain. Meanwhile, concerns are rising over the potential risks associated with advanced AI models like Mythos, prompting calls for improved defenses before wider releases.
- 13m Mobile Claude Code, May 2026 — current best picks by threat model. What am I missing?
- 11h Cursor CVE-2026-26268: Hidden Git hooks RCE via agents autonomous Git operations
- 1d Getting LLMs Drunk to Find Remote Linux Kernel OOB Writes (and More)
- 1d Claude Code and sex appeal
- 2d How are you handling prompt injection across multi-step agent workflows?
6 itemsmodel roundup
ChatGPT 5.5Mathematician Timothy Gowers used ChatGPT 5.5 Pro to solve complex mathematical problems and discussed potential crises in research. Meanwhile, users tested various AI models for everyday tasks, finding both impressive capabilities and persistent errors.
- 30m My workflow: GPT for architecture and Claude Code for execution
- 13h Afraid of Using the Wrong LLM. ChatGPT 5.5 waterdown, Gemma Struggles
- 1d Fields Medal winning mathematician Timothy Gowers used GPT5.5 Pro to solve open problems, believes mathematical research will face a ‘crisis’ very soon with current rate of progress
- 1d Testing AI modeling skills
- 2d A recent experience with ChatGPT 5.5 Pro
I built a thing to monitor Claude Code in real time (www.reddit.com)
Has anyone else noticed how some Claude Code sessions cost you a few cents and others somehow burn through actual dollars and you can't really tell why after the fact? I kept hitting this — was it retry loops, was it the agent re-reading t…
Experimented with Claude Code + NVIDIA APIs today (www.reddit.com)
Tried using Claude Code with NVIDIA APIs today and honestly it was way more fun than I expected. The workflow felt surprisingly smooth for testing AI stuff quickly without overcomplicating everything.
Claude misgenders me (www.reddit.com)
I’m a ciswoman and, wow, Claude REGULARLY thinks I’m a man. Here’s an example.
- What’s up, Claude? (www.reddit.com)
- Claude + MS (www.reddit.com)
- Claude: (www.reddit.com)
+2 more
- Claude.md (gist.github.com via hn)
- What do you do with Claude? (www.reddit.com)
MachinaCheck: Building a Multi-Agent CNC Manufacturability System on AMD MI300X The Problem We Solved Walk into any small CNC machine shop and ask the manager how they decide whether to accept a customer job. The answer is almost always th…
-
89 items
model roundup
Opus 4.6Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.
375 itemsmodel roundup
Qwen 3.6Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 36m Running Qwen3.6 35b a3b on 8gb vram and 32gb ram ~190k context
- 3h Getting a feel for how fast X tokens/second really is.
- 5h Building out my tool library, any recommendations? I just added email capability and im starting to get hyped!
- 5h Speeding up local LLM for usable coding agent
- 9h Hello from 10KM high! - Thanks to Qwen 3.6 35b a3b!
It started out as a way for me to freshen up my C++ skills during COVID. But life got in the way and it was put on ice.
Benchmarking AI persistent memory server against connected memory. (www.reddit.com)
Retrieval of only semantically similar memories using vector search is not sufficient to build an holistic context to feed to an llm. Most of the memory system works on the first concept of pure vector search, While running an experiment I…
How are you actually saving cost on your agent systems? (www.reddit.com)
I've been researching how teams handle cost and FinOps for agent systems in production. Token bills get unpredictable fast, and most tooling stops at per-call or per-agent attribution, which doesn't tell you much about why the bill jumped.
Claude Code and Making a Second Brain (www.reddit.com)
Hello, newbie here. Does one need to pay for Claude Code to build a "Second Brain?" (One that combines Claude with Obsidian)
- NanoBrain – A Markdown+Git "second brain" for Claude Code (nanobrain.app via hn)
- NanoBrain – A Markdown+Git "second brain" for Claude Code (github.com via hn)
-
104 items
model roundup
DeepSeek 4DeepSeek-V4-Pro is a 1.6T parameter Mixture-of-Experts model supporting one million-token context, with significant improvements in efficiency and stability through hybrid attention and manifold-constrained hyper-connections. Community highlights include its cost-effectiveness via the official API and exceptional performance in large code change evaluations, with some noting its surprisingly robust output capability despite a 384K max token limit.
- 38m DeepSeek-V4-Flash W4A16+FP8 with MTP self-speculation: 85 tok/s @ 524k on 2× RTX PRO 6000 Max-Q
- 6h DS4
- 7h I have DeepSeek V4 Pro at home
- 16h Opus 4.7 and DeepSeek V4-Pro select Buddhism as preferred religion
- 19h I used Claude to build an AI assistant that helps run live TTRPG sessions and am looking for a few playtest GMs
164 itemsmodel roundup
Gemma 4Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.
Practical lessons from 50K lines of production code with Claude Code (jappiesoftware.com via reddit)
I've been using Claude Code in full agentic mode for two months — not just autocomplete, but letting it write features, run tests, read CI output, and push fixes. Around 50K lines of production code.
Help here (www.reddit.com)
How should I fix this, Claude’s own suggestions are failing, I added $5 for the API key to extract from my Beehiiv newsletter still it’s failing, what could be the real problem here?
- Help (www.reddit.com)
Claude still feels much better than ChatGPT/Codex at UX design (www.reddit.com)
https://preview.redd.it/km7o9670lc0h1.png?width=1542&format=png&auto=webp&s=3fea5e97f3e518222eefd7cfd0cc871fcd58a933 Has anyone else found Claude stronger than ChatGPT/Codex for UX critique? In a recent test, I asked both to review a wikil…
'AI gave me your number': AI doxxing turns ChatGPT hallucinations to harassment (www.the-independent.com via hn)
‘AI gave me your number’: The new trend turning ChatGPT hallucinations into harassment Known as ‘AI doxxing’, victims say popular chatbots are sharing their personal phone numbers with strangers. Anthony Cuthbertson looks at how criminals…
-
332 items
model roundup
Opus 4.7Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
Like a lot of people experimenting with vibe coding and AI agents lately, I’ve been trying to understand why models keep ignoring explicit instructions, constraints, and requirements even when those rules are written clearly. Today Opus sa…
Built HTML Drive - Google Drive for Claude generated HTML files (www.reddit.com)
I built HTML Drive this weekend: a personal Drive Claude can save to. Sign in with Google, then ask Claude to make HTML and it lands in your account, versioned, shareable, and with its own URL.
Show HN: Lune - Software for agents that do serious science (luneresearch.com via hn)
Our MCP server provides scientific knowledge and action grounding for your AI agents, based on top-tier research literature and timeless research best practices/advice given by notable scholars.
Show HN: Visual Learning with LLMs (forray.io via hn)
Hi HN, sharing a learning tool I built which pairs an LLM generated learning space with interactive visuals and small games. The part which I am still working on is getting an agent to reliably generate good visuals without intervention.
Is This email To Give Higher Permission To Cursor a Scam? (www.reddit.com)
https://preview.redd.it/6ncrsdq4oc0h1.jpg?width=399&format=pjpg&auto=webp&s=c743b59bb9fd017dd26f6bd7294aa7770ebfa99c Hi, I got this email asking me to give permission to GitHub cursor. I had to log into my GitHub account which makes me wor…
We’re hosting the biggest Claude Code Prompt-a-thon at the AI x Marketing Summit in SF on May 28–29. For 36 hours, you’ll actually build with AI: • Claude Code • Humanic • n8n • MCPs • Figma Make • AI workflows for SEO, ads, lifecycle, out…
Weekend project: behaviour trees for LLM agents (www.reddit.com)