Claude is weirdly good at helping untangle messy thoughts (www.reddit.com)
One thing I’ve noticed after using Claude for some time now is that it is especially good when my notes or ideas are still not fully ready. A lot of AI tools are decent at generating polished output, but Claude feels good at taking messy p…
Why does this happen? (www.reddit.com)
I wonder if anyone can explain why this happens. I tell Claude not to use em-dashes, it replaces them with "--".
Anthropic NLAs translate LLM activations to human-readable text for safety (presciente.com via hn)
Anthropic's NLAs Decode LLM 'Thoughts,' Improving Claude Safety. Anthropic's Natural Language Autoencoders (NLAs) translate internal LLM activations into human-readable text, enhancing model interpret…
Publish an app (www.reddit.com)
If I made an app using Claude (it's still in HTML format), what do I need to publish it? Like backend payments and all sorts of stuff to get it up and running?
Exploration Hacking: Can LLMs Learn to Resist RL Training? (www.alignmentforum.org via hn)
Quick question for anyone here (and for anyone from Anthropic who reads this sub). I'm a Max 20x subscriber on claude.ai web, based in Canada, English-language account.
364 items
model roundup
Qwen 3.6
Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 7m Should we use a non-thinking model for code after using a thinking one for plan? (Agentic coding)
- 31m More Qwen3.6-27B MTP success but on dual Mi50s
- 3h 80 tok/sec and 128K context on 12GB VRAM with Qwen3.6 35B A3B and llama.cpp MTP
- 4h Pi and Qwen3.6 27B make setting up Archlinux really easy.
- 4h Show HN: Transformer Math Explorer
100 items
model roundup
DeepSeek 4
DeepSeek-V4-Pro is a 1.6T parameter Mixture-of-Experts model supporting one million-token context, with significant improvements in efficiency and stability through hybrid attention and manifold-constrained hyper-connections. Community highlights include its cost-effectiveness via the official API and exceptional performance in large code change evaluations, with some noting its surprisingly robust output capability despite a 384K max token limit.
Patchwork: AST-Native Editing for LLMs (github.com via hn)
patchwork: AST-native sed. Find, replace, delete, and insert code by structure, not regex. Installation: cargo install patchwork-cli. Usage: rename a method across files without false positives: patchwork replace -i -p 'getOldData($a)' -r 'get…
- Show HN: Patchwork, AST-native sed without LLMs (github.com via hn)
Any way to have full session history across multiple PCs? (www.reddit.com)
I use Claude Code app on Windows, but I'm constantly switching between desktop PC and laptop, and would love to be able to see all conversations and pick up where we left off in the other machine. I've tried syncing the .claude folder with…
Pilot agents fail quietly because pilots rarely test authority (www.reddit.com)
A demo usually asks one question: can the model follow the happy path? Production asks a meaner question: does the system know what not to touch when context is messy?
Who's running local LLMs for agent workflows? What's your setup? (www.reddit.com)
Curious how many people here are running language models locally as part of their agent stack. What model are you using and what are your system specs?
Can you still imagine yourself working without AI? (www.reddit.com)
It’s kind of crazy how fast AI became part of our daily workflow. A lot of projects and ideas that used to stay stuck in people’s heads can now actually be built, especially with tools like Claude handling a lot of the heavy lifting.
The product: Scavio AI — an online search API for AI agents allowing them to access the web data in a structured format. In addition to Google, YouTube, Amazon, Walmart, Reddit, I just shipped TikTok support with 11 endpoints: - User profi…
205 items
event
Cowork
Issues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
204 items
event
Copilot
Microsoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
Show HN: AaaS – Agent as a Service (github.com via hn)
AaaS: Agent as a Service Turn what you know into a running business. No code required.
Article: What are LLMs and Generative AI good at (jackpritz.com via hn)
What are LLMs and Generative AI Good At? Large language models and Generative AI have monopolized the conversation in tech for a few years.
After a month of building and iterating, our firm's AI pipeline is live across three practice areas. Sharing everything here because I wish this post had existed when we started.
Claude is actively forcing my brother to take a quiz (www.reddit.com)
My brother is doing revision for a geography exam at his educational institution and he wanted something explained without finishing a quiz he previously asked for, and Claude literally told him he has to take the quiz… I just love these s…
Best approach for parsing client-side rendered docs (www.reddit.com)
I often need to read Salesforce Help documentation to get quick summaries or implementation tips, but the site renders content client-side, so Claude can’t properly access the page content. I’ve tried a few MCPs with web crawlers, but they…
Codex CLI Cheat Sheet (www.agenticcodingweekly.com via hn)
Printable single-page A4 JPEG reference for developers. Concise reference for the Codex CLI: terminal UI, codex exec, local config, MCP, skills, subagents, hooks, rules, and automation. 🚀 Start Here Open the interactive terminal UI in the…
181 items
event
Anthropic Mythos
Anthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
- 2h Not a good day for team "Claude Mythos is Just Marketing Hype"
- 14h METR evaluated an early version of Claude Mythos
- 20h Could Mozilla Security Hot Air Fill Mythos Sails?
- 23h Mythos set off a cybersecurity 'hysteria.' Experts say threat was already here
- 23h Mythos Fallout, U.S. Government Weighs AI Model Regulation
There's a finding circulating this week that deserves more attention than it's getting. The claim, backed by multiple builders comparing setups: the same model can produce a 30 to 50 percentage point performance difference depending on whi…
Autotrader: An autonomous paper trading agent, two weeks in (www.akashtandon.in via hn)
A paper-trading experiment on Indian equities, where Claude itself runs the loop on a free GCP VM and edits its own strategy between polls. Plus the parts that actually broke.
Most AI agent evals completely ignore execution efficiency (www.reddit.com)
We were evaluating some AI agents internally and noticed something weird: A lot of them scored perfectly on “task completion” while being wildly inefficient underneath. Example: same tool called multiple times with identical args unnecessa…
Flagging this because it seems more relevant to actual coding loops than to general AI-news posting: Ring-2.6-1T is now out, and there’s a free developer access window through May 15. The launch angle is pretty clearly “reasoning model for…
What Claude says vs What Claude thinks (www.reddit.com)
Anthropic research: https://www.anthropic.com/research/natural-language-autoencoders
- Claude Says No (wadetregaskis.com via hn)
Skilljar courses. (www.reddit.com)
Anyone completed the Anthropic Academy Skilljar courses? If you've taken any of them, what's your field and which course would you recommend for it?
Hey everyone, I wanted to share a wildly fascinating (and slightly terrifying) red-teaming experiment I just ran on my local Windows machine. I've been playing around with autonomous agents and wanted to see what happens when you give an L…