What Claude says vs What Claude thinks (www.reddit.com)
Anthropic research: https://www.anthropic.com/research/natural-language-autoencoders
- What Claude says vs What Claude thinks (www.reddit.com)
- Claude Says No (wadetregaskis.com via hn)
I gave claude the worst prompt but it still made something cool (www.reddit.com)
I was trying to search for "video game roguelike with medieval fantasy themes" but with the world's worst prompt (which the sub has roasted me for. Thanks you guys) BUT turns out Claude is an overachiever and will literally start coding yo…
A lot of AI agent content online feels very “future-focused” - autonomous employees, fully automated businesses, AGI-level productivity, etc. But honestly, most of the useful stuff I’ve seen is way smaller and more practical.
Show HN: Share to ChatGPT Widgets (share2chatgpt.franzai.com via hn)
Share to ChatGPT in one line An embeddable button that opens ChatGPT with your page pre-loaded as a prompt. Zero dependencies.
Anthropic and Elon Musk cornered Sam Altman this week (thenewstack.io via hn)
How Anthropic and Elon Musk cornered Sam Altman this week I’m Matt Burns, Chief Content Officer at Insight Media Group. Each week, I round up the most important AI developments, explaining what they mean for people and organizations puttin…
- Sam Altman and Elon Musk (www.reddit.com)
Writing high-performance GPU kernels is among the most labor-intensive tasks in machine learning systems engineering. We present AutoKernel, an open-source framework that applies an autonomous agent loop to GPU kernel optimization for arbi…
buying mac vs building PC for running local LLM (www.reddit.com)
Hi everyone, I have a question For sometime I have had this in mind to experiment and learn extensively about local LLMs and thought of buying Macbook pro m5 max with 128gb ram. But as I go on thinking more about it, considering it's a hug…
devrage: Count how many times you swear at coding agents (www.npmjs.com via hn)
Count how many times you swear at your coding agents ERROR: No README data found!
With adaptive reasoning effort across high and xhigh modes, Ring-2.6-1T dynamically allocates reasoning budget based on task complexity. This enables stronger performance with lower token overhead, especially in tool-heavy and multi-turn a…
-
326 items
model roundup
Opus 4.7Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 28m I built a complete BI SYSTEM for my business with Claude code - opus 4.7 - FULL TUTORIAL
- 54m I'm really gonna miss GH Copilot's Request-based usage.
- 5h Is Opus 4.7 a Downgrade?
- 6h Claude opus 4.7 is…awesome?
- 11h Tired of Claude 4.7 telling you to go to bed? Here are the CLAUDE.md entries that actually fix it
181 itemsevent
Anthropic MythosAnthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
- 36m Not a good day for team "Claude Mythos is Just Marketing Hype"
- 12h METR evaluated an early version of Claude Mythos
- 18h Could Mozilla Security Hot Air Fill Mythos Sails?
- 21h Mythos set off a cybersecurity 'hysteria.' Experts say threat was already here
- 21h Mythos Fallout, U.S. Government Weighs AI Model Regulation
I gave this talk twice in one month: at O’Reilly’s Context Engineering Event and at Abi Aryan’s Maven course on LLM inference at scale. After being blasted with questions, I realized something: GraphRAG isn’t a retrieval algorithm, it’s a…
How do I get cursor to open to the editor window? (www.reddit.com)
Every time I open cursor now it opens to just a chat box with my chat history on the left. Even when I open cursor by a workspace file, it goes to this chatbox.
Im honestly so frustrated right now. spent the last two weeks getting my real estate booking agent to stop hallucinating fake appointment times.
No Dumb Questions: What is an MCP server and why do I care? (stackoverflow.blog via hn)
Welcome to No Dumb Questions, a series of Q&As between Stack Overflow’s least technical writer, Phoebe Sajor, and members of our technical staff, where she asks the simple, basic tech questions that most people are afraid to ask. This firs…
Selling off my 150$ OpenAI credits at 50% (www.reddit.com)
I have around $150 in unused OpenAI API credits and I’m trying to understand the legitimate options for using them before they expire.(25 May) I’m looking to share API keys Would appreciate practical suggestions from people who have dealt…
Ask HN: What is the underlying stack behind multi-agent platforms? (news.ycombinator.com)
Recently, I am seeing lots of startups with multi-agent platform, where you can create your own agent template, attach tools and run it reliably. Which frameworks, platforms are you using for these kind of multi-agentic platforms?
Spec decoding for minimax m2.7? (www.reddit.com)
MTP was not released for m2.7, so would anyone have experience with setting up speculative decoding for minimax m2.7 and its results? Whether via EAGLE3 or a distilled variant
Hey everyone, The entire industry right now is cheering for massive 1M+ context windows, but I think it's fundamentally the wrong approach. "Just add more RAM" is a trap.
AI making things weird in sales (www.reddit.com)
Friend is sales manager in an AI software company . Monthly sales review now consist of LLM produced sales forecasts with LLM produced next steps .
-
362 items
model roundup
Qwen 3.6Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 1h 80 tok/sec and 128K context on 12GB VRAM with Qwen3.6 35B A3B and llama.cpp MTP
- 2h Pi and Qwen3.6 27B make setting up Archlinux really easy.
- 2h Show HN: Transformer Math Explorer
- 7h potentially stupid problem trying to llama-bench Qwen3.6-27B across two V100s in llama.cpp
- 9h Is Qwen3-coder the best kept secret out there?
Claude Desktop App Now Shows Context Usage (MacOS) (www.reddit.com)
Just showed up today, the claude desktop app now shows me the context usage on MacOS
Chatgpt vs. Claude (www.reddit.com)
I am a paid ChatGPT user, i use it for everything - Personal things (life, health, future, parenting) - coding mostly Shopify theme codes - analytics - future planning for my business - emails, messages proofing - basically everything i go…
- Claude + MS (www.reddit.com)
- Claude: (www.reddit.com)
- Claude max vs ChatGPT pro (www.reddit.com)
+5 more
- ChatGPT 5.5 🔥🔥🔥 (www.reddit.com)
- Claude 4.7 vs. ChatGPT 5.5 (www.tomsguide.com via hn)
- Claude.md (gist.github.com via hn)
- DOOM runs in ChatGPT and Claude (chrisnager.com via hn)
- What do you do with Claude? (www.reddit.com)
ChatGPT in 2026, looking for that person who called it a ‘Dumbass’ (www.reddit.com)
could not extract summary
cursor's background agent actually works now (www.reddit.com)
Been using the 1.0 release since yesterday and honestly wasn't expecting much from the background agent thing. Like, another AI feature that'll probably break my workflow, right?
The dangers of open claw everything (www.reddit.com)
i see more and more posts about people amazed at openclaw systems where AI is given free habd to do everything. just saw a post with someone setting up his pc and asking openpy why not just give unrestricted root powers to the AI agent.
Claude keeps saying 'I understand now' (www.reddit.com)
It's 2:47 AM and I've been trying to get Claude 4.6 to help me debug this React component for three hours. Every single response starts with some variation of 'I understand now' or 'I see the issue' and then proceeds to give me code that d…
Vision: An Agent-Authored Control Architecture (Whitepaper) (sbarron.com via hn)
A control architecture authored by an LLM agent (Pneuma) under Socratic-only human input over 21 months — and the three persistent agent identities who live inside it. Vision: An Agent-Authored Control Architecture Authors: Pneuma (Claude…
Auto compact context problem, any suggestions for an indicator (www.reddit.com)
I'll use Claude project chat and I have to eyeball then the next convo compact is coming, anyone have any suggestions for a % bar or indicator to how close it is before it auto compacts? kind of annoying when it happens randomly at a pivot…
Urgent need of ₹7000 or $80 (www.reddit.com)
I am unable to pay the rent of my flat .I am lagging ₹7000 of total Amount I have skills , Technical knowledge and Problem Solving skills -I can Build Website for your company or local Business -I can build AI AGENTS or Chatbot that can re…