I've been building a thing called Fathom. It's a partly-Claude-based agent that's been running since January, changing my mind about how it should work as it helps me build itself.
Claude AI vs Claude Code vs models (this confused me for a while) (www.reddit.com)
I kept mixing up Claude AI, Claude Code, and the models for a while, so just writing this down the way I understand it now. Might be obvious to some people, but this confused me more than it should have.
- Switching Models in Claude Code? (www.reddit.com)
What are people using Browser Based Agents for? (www.reddit.com)
Curious to see different verticals where people are deploying browser based agents in production. Is it just for realtime search and data extraction or also some end to end workflow automations?
Claude certifications (www.reddit.com)
I'm not a software engineer but a program manager, looking for suggestions on any Claude related certification that I can pursue to improve resume and genuinely learn something new.
- Claude.md (gist.github.com via hn)
- What do you do with Claude? (www.reddit.com)
Show HN: Another experiment with an Erdos problem and LLMs (news.ycombinator.com)
Background: I am a coder, not a mathematician, but I was quite entertained by this story: https://news.ycombinator.com/item?id=47903126 I wondered how far I could get by just choosing a random open problem and throwing it at LLMs. Disclosu…
Publishing an app? (www.reddit.com)
Non technical person here looking to publish an app Claude helped me create. I’ve finalized the design.
Claude Architect Foundations exam with 0/0 score - anyone else? (www.reddit.com)
Hi all i took the exam on last Saturday when I checked the result page today it shows 0/0, I am confused tho Anyone else facing this?
Show HN: MemOperator-4B (huggingface.co via hn)
Memory Operator is a specialized language model developed for MemOS, designed to handle memory-related operations. Its core capabilities include memory extraction, integration, and update.
Cursor Deleted Railway Production Volume and Backups (twitter.com via hn)
A 30-hour timeline of how Cursor's agent, Railway's API, and an industry that markets AI safety faster than it ships it took down a small business serving rental companies across the country. I'm Jer Crane, founder of PocketOS.
204 items
model roundup
Opus 4.7
Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 21m Deferring Planned Items
- 3h Does higher effort make Claude refuse more? CVP Run 5 with Opus 4.6 Medium and High
- 5h Claude Code started to use with me very specific words it was not using before
- 11h what are your strats for being efficient with opus 4.7 max?
- 14h Tell HN: Claude Code is unable to respond to this request
121 items
model roundup
Gemma 4
Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.
- 41m Ask HN: Will local models on normal hardware ever compete?
- 9h Best sota 12b-32b creative writing model?
- 12h A weekend with LoRA on Gemma 4 E2B: instrumenting what fine-tuning changes
- 14h Gemma 4 Folks
- 15h Speculative decoding with Gemma-4-31B + Gemma-4-E2B enables 120 - 200 tok/s output speed for specific tasks
The official uninstall instructions provided by Anthropic result in a left over app on Mac OS named "Claude Code URL Handler" This is sloppy and not what I asked for. I'm right to push back.
Making a Landing Page Work for Both Humans and AI Agents (docsalot.dev via hn)
I spent this week redesigning my landing page. The surprising part was not the typography, the footer, or the mobile nav.
Good people of Reddit, can you help me? I’m looking for a GitHub repo, tool, or software that can download the full text from an entire website (with multiple pages) into one single text file.
I've been tracking a few places where people are actually letting agents handle funds and run work without constant human supervision (babysitting) Quick disclaimer: all of these within clearly defined parameters and budgets in a controlle…
Memory import drops a bunch of memory? (www.reddit.com)
Used the official memory import a few weeks ago. Got my preferences and broad strokes but lost all the specifics from actual conversations.
I told Claude I needed to cut down my cholesterol and that I was pre-diabetic based on my last annual check-up. I also mentioned that past diets have failed me because they were torture.
LLM-as-judge is the wrong default. Here's what works (www.reddit.com)
Most internal agent teams I work with start with the same eval setup. Write expected answers, have an LLM grade whether the agent's response matches.
AMD Hipfire - a new inference engine optimized for AMD GPUs (www.reddit.com)
Came across hipfire the other day. It's a brand new inference engine focused on all AMD GPUs (not just the latest).
Can I use Claude code with own LLM/non-claude APIs? (www.reddit.com)
Anybody using claude code with local LLMs/non-claude APIs - does it work and work well? I really dont like opencode.
182 items
model roundup
Qwen 3.6
Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 48m Brief Ngram-Mod Test Results - R9700/Qwen3.6 27B
- 1h Qwen3.6-27B-3bit-mlx · Hugging Face: 3 & 5 mixed quant for RAM poor Mac users.
- 5h Does anyone have a usable vLLM setup with Qwen3.6 27B + pipeline parallelism + MTP?
- 9h What is the best coding agent (CLI) like Claude Code for Local Development
- 11h Qwen 3.6 27B in Claude Code says it will do something then stops and prompts for user reply (not failing a tool call)
Hey guys, I want to connect some AI (like OpenClaw or any other model/agent) to act as a personal assistant, but I’m completely stuck on what functions it could actually perform. For example, if I work in real estate, or a friend of mine w…
EvanFlow – A TDD driven feedback loop for Claude Code (github.com via hn)
EvanFlow A TDD-driven iterative feedback loop for software development with Claude Code. 16 cohesive skills + 2 custom subagents walk an idea from brainstorm through implementation, with checkpoints throughout where you stay in control.
Use other model than composer-2 from CLI (www.reddit.com)
I can choose other model from Cursor IDE but when I select other model than composer-2, there is an error: Cannot use this model models are disabled for this user. In Cursor IDE, there is no setting as I just toggle a button.
- Which model should I use? (www.reddit.com)
Agentic Workforce Framework A reference architecture for operating autonomous AI agents as accountable digital workers inside enterprise environments. This framework defines how agents are assigned work, bounded by role, governed by approv…
spent the past two months iterating on how i load context into claude for a product i'm building and figured it might be useful to share because i don't see this discussed much in here. the problem i kept hitting was the classic one, same…
GDP.pdf: A Benchmark for Parsing PDFs (surgehq.ai via hn)
GDP.pdf: Can $100B AI Models Master the Documents that Run the World? Introducing our new expert multimodal reasoning benchmark.
Quick context: when you have multiple AI agents talking to each other and something goes wrong, your debugging tools usually show "everything fine" even when the agents are stuck in a loop costing you money. Been building observability for…
oh right lol (www.reddit.com)
https://preview.redd.it/6atqzf9tdnxg1.png?width=1569&format=png&auto=webp&s=f24a78b077d2ef0d87d9e07f0e2be34fd1cebdbb claude telling me to copy and run commands who running commands manually in big 2026?
What is your night claw protocol ? (www.reddit.com)
When I first started with openclaw I realized right away it wasn't going to run overnight. It was like a special chat bot with cli access and could run extended session tasks.