I told Claude I needed to cut down my cholesterol and that I was pre-diabetic based on my last annual check-up. I also mentioned that past diets have failed me because they were torture.
AMD Hipfire - a new inference engine optimized for AMD GPU's (www.reddit.com)
Came across hipfire the other day. It's a brand new inference engine focused on all AMD GPUs (not just the latest).
GDP.pdf: A Benchmark for Parsing PDFs (surgehq.ai via hn)
GDP.pdf: Can $100B AI Models Master the Documents that Run the World? Introducing our new expert multimodal reasoning benchmark.
oh right lol (www.reddit.com)
https://preview.redd.it/6atqzf9tdnxg1.png?width=1569&format=png&auto=webp&s=f24a78b077d2ef0d87d9e07f0e2be34fd1cebdbb claude telling me to copy and run commands who running commands manually in big 2026?
EvanFlow – A TDD driven feedback loop for Claude Code (github.com via hn)
EvanFlow: A TDD-driven iterative feedback loop for software development with Claude Code. 16 cohesive skills + 2 custom subagents walk an idea from brainstorm through implementation, with checkpoints throughout where you stay in control.
What is your night claw protocol ? (www.reddit.com)
When I first started with openclaw I realized right away it wasn't going to run overnight. It was like a special chat bot with cli access and could run extended session tasks.
Agentic Workforce Framework: A reference architecture for operating autonomous AI agents as accountable digital workers inside enterprise environments. This framework defines how agents are assigned work, bounded by role, governed by approv…
OpenClaw or can I solo build this (www.reddit.com)
Is there an AI workflow out there that I can teach to make online appointments for me and book them? I want to get to the point where I can just text or call or dictate to an AI agent to book me with my specific Barber's name and lmk the t…
52 items
model roundup
DeepSeek 4
DeepSeek-V4-Pro is a 1.6T-parameter Mixture-of-Experts model supporting a one-million-token context, with significant improvements in efficiency and stability through hybrid attention and manifold-constrained hyper-connections. Community highlights include its cost-effectiveness via the official API and exceptional performance in large code change evaluations, with some noting its surprisingly robust output capability despite a 384K max token limit.
- 45m DeepSeek V4 is about to be open-sourced—effectively revealing all the secrets behind the magic. How will other players in the field respond?
- 1h Language Anchoring: A Systematic Method for LLM Multilingual Adaptation
- 3h No GGUFs for DeepSeek V4-Flash as yet?
- 3h Deepseek v4 flash weird sizes?
- 6h DeepSeek's new models are so efficient they'll run on a toaster by which we mean
44 items
model roundup
Sonnet 4.6
Sonnet 4.6, a new release noted for its "unhinged" behavior, has sparked discussions among users about unexpected changes in software performance and cost management strategies involving Cursor and Claude APIs.
- 1h Does higher effort make Claude refuse more? CVP Run 5 with Opus 4.6 Medium and High
- 20h Opus 4.6 vs Sonnet 4.6
- 1d Claude's sonnet 4.6's clarifying questions...How to read?
- 1d Does effort tier change refusal behavior on agent-attack prompts? CVP run 4 with sonnet 4.6 high and max efforts.
- 1d Show HN: Mapping Sonnet's thinking process via flame charts
I made a Adhd helper (www.reddit.com)
Like the title says I made a ADHD helper off my Samsung Galaxy s24 to help my nephew to focus and get through his studies and daily chores with help from Claude which did all the code and then I spent extensive hours going back and forth t…
Quick context: when you have multiple AI agents talking to each other and something goes wrong, your debugging tools usually show "everything fine" even when the agents are stuck in a loop costing you money. Been building observability for…
spent the past two months iterating on how i load context into claude for a product i'm building and figured it might be useful to share because i don't see this discussed much in here. the problem i kept hitting was the classic one, same…
Lately, I’ve been noticing something weird with Cursor Pro. During peak market hours, the reasoning depth for my Python scripts feels...
Anthropic's Claude remote uses GLM-4.7 (www.reddit.com)
I just noticed this after a bug wasn't getting fixed. If you start a Claude code remote environment the default model (hidden on mobile) is glm 4.7 I assumed anthropic only used their own models for everything so it was interesting to me t…
According to what I've been reading (and also according to all models I've asked about this), the consensus seems to be that Min P is the better/more modern approach to sampling and that it should be preferred over Top P/Top K, which shoul…
Can Claude connect to Microsoft personal OneDrive? (www.reddit.com)
I see that via the Microsoft connectors for 365 you can connect to OneDrive for work or school but I have a personal account signing in with a Yahoo account. Is it possible to connect to Claude and if so how?
Howdy all! Novice here, just getting my feet wet playing with a somewhat loose idea I had after reading this paper: https://arxiv.org/html/2604.01687v2 http://markeddownDOTdev (reddit keeps deleting bc of domain)- Its a site for testing in…
202 items
model roundup
Opus 4.7
Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 3h Claude Code started to use with me very specific words it was not using before
- 9h what are your strats for being efficient with opus 4.7 max?
- 12h Tell HN: Claude Code is unable to respond to this request
- 19h When Opus 4.7 does think, it *really* thinks
- 1d Am I the only one getting provider error when trying to use opus 4.7? It keeps erroring then charging me tokens for reading the files and stopping halfway through this shit fucking sucks I might just switch to claude code at this point
106 items
event
Cowork
Issues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
- 2h What 'affordable' machine do you use for Claude Cowork?
- 6h I changed ai youtube for “screenshot an X post → give it to my claude” and my output went up
- 9h How do you learn and keep up
- 11h How do I access Claude "Computer Use"?
- 12h Cowork tab gone from Claude desktop — WSL distro never registers after reinstall. Anyone fix this?
Coding agents ignore their own budgets (twitter.com via hn)
Coding agents are burning trillions of tokens daily, with little awareness of what that spend costs the business footing the bill. The growth of AI token spend, which has , is .
disappearing chats/info? (www.reddit.com)
i just had hours of conversations with claude brainstorming ideas. i took a shower, came back, and half my chats were gone.
I noticed something odd while using Claude. In a single session, it showed 54.1M cache reads, which seems extremely high.
ChatGPT solves Erdos Problem 1176 in 80 minutes (chatgpt.com via hn)
Dash: A self-learning data agent built with systems engineering principles. It grounds answers in 6 layers of context and improves with every query.
Aether – A GCP-Native Framework to Terminate LLM Agent Drift (github.com via hn)
AETHER-core: The open-source core compiler for the AETHER Agent Reliability Framework. Replaces fuzzy prompts with strict Weighted Intent Token (WIT) vectors to prevent Context Rot.
Cursor won't stop using Argentinian Spanish (www.reddit.com)
Hello, for the past couple of months, Cursor has consistently been writing, answering, and generating UI text using Argentinian Spanish vocabulary. This dialect is incorrect for users targeting other Spanish-speaking demographics.
With how often we hear about supply-chain attacks on npm I am hesitant to install any apps that use it, let alone something like an agent harness that will run constantly unsupervised.
Iphone picture gpt vs nano (www.reddit.com)
I was trying to get that “iPhone casual feel” out of Gemini Nano Banana 2, and honestly ever since GPT Image 2 dropped, I can’t really take Nano seriously anymore. Some obvious issues I kept getting: Completely messed up my face Made the j…