Anthropic's Claude Certified Architect, Worth it? (www.reddit.com)
For those who've taken the exam or gone through the prep material: Is the material worth understanding at a conceptual level, or does it feel like it'll age out quickly? As agents get better at handling architecture decisions automatically…
- Become a Claude Certified Architect (anthropic.skilljar.com via hn)
Any recommendations on saving costs? (www.reddit.com)
Currently I try to turn off any MCP I'm not using, Using Sonnet for implementation and Opus only for planning. Starting new conversations when possible.
A soccer simulator played by AI Agents (gangtao.github.io via hn)
LLM-powered soccer simulation where every player on the field is an AI agent running a decide() callback — generated, sandboxed, and evolved by large language models. Four clean layers.
ChatGPT's image model is better at math than most people (www.reddit.com)
Let n be a positive integer. Prove that sum_{k=1}^n gcd(k,n) = sum_{d|n} d * phi(n/d) where phi is Euler's totient function.
Best Claude.md files for claude code (www.reddit.com)
Trying to collect the best claude.md files code. If you have one that works really well for you, please copy it into the comments and let me know what kinds of coding you normally do (language, surface, kind, etc)
- Claude code (www.reddit.com)
- Is pasting into md files the best way to feed Claude chat into Claude Code? (www.reddit.com)
Most multi-agent systems fail the same way: agents drift apart across handoffs. By turn 3 they are working in different realities.
-
367 items
model roundup
Qwen 3.6Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 3m 9070xt inference for q3 qwen 27B
- 55m BeeLlama.cpp: advanced DFlash & TurboQuant with support of reasoning and vision. Qwen 3.6 27B Q5 with 200k context on 3090, 2-3x faster than baseline (peak 135 tps!)
- 1h vLLM + NVFP4 + Qwen3.6 27B: "Checkpoint does not provide a q scaling factor"?
- 2h Should we use a non-thinking model for code after using a thinking one for plan? (Agentic coding)
- 2h More Qwen3.6-27B MTP success but on dual Mi50s
25 itemsevent
Gpt 4Recent developments in AI automation include a sales team entirely run by bots achieving $28k MRR, and new tools like Arc Gate blocking prompt injection before it reaches GPT-4. Meanwhile, users are exploring workflows to reduce cross-checking time and improve insights from large language models.
- 18m "This is the first documented instance of AI self-replication via hacking." ... "We ran an experiment with a single prompt: hack a machine and copy yourself. The AI broke in and copied itself onto a new computer. The copy then did this again, and kept on copying, forming a chain."
- 2d Used GPT-4 to build an AI that responds to messages on behalf of employees — here's what we learned
- 10d wrote specific backstory facts into a character prompt and the LLM keeps inventing its own instead
- 10d Built a proxy that blocks prompt injection before it reaches GPT-4 — outperforms the Moderation API on indirect attacks
- 11d Multi-agent pipelines that don't explode?
Adola: Reducing LLM input tokens by 70% (adola.app via hn)
0% decrease same accuracyKeep what matters. Rose 1 trims noisy context before your model call and keeps the answer intact.
I'm building MCP and CLI at my company, directly exposed to users. I had done the MCP first, investing time and thoughts on not making it just a wrapper of our APIs.
What 16 Parallel Claude Agents Built Around Themselves (medium.com via hn)
12 min read 1 hour ago Press enter or click to view image in full size On February 5, 2026, Nicholas Carlini from Anthropic published a piece about an experiment that runs significantly ahead of what most of us are doing with LLM agents to…
Armorer The secure local control plane for installing, configuring, and monitoring AI agents. Website · Human docs · Issues Use Your Agent To Install It One command: curl -fsSL https://armorerlabs.com/install | sh Fully automated: curl -fs…
- Show HN: Armorer – A secure local control plane for AI agents (news.ycombinator.com)
Endara – One endpoint for all your MCP servers (endara.ai via hn)
One endpoint for all your MCP servers. Endara aggregates your local and cloud MCP servers behind a single endpoint.
Endless Planning Without Edits? (www.reddit.com)
I'll admit I'm new to Claude. I'm using Claude Code in VSCode.
-
139 items
model roundup
Qwen 3.5Qwen3.5-9B is a post-trained model with 9 billion parameters that integrates multimodal learning and efficient hybrid architecture for enhanced performance. Community highlights include speculative decoding on Apple Silicon boosting Qwen3.5-9B's throughput by 4.1x, and the model outperforming others in coding tasks while addressing overthinking issues through tool usage.
- 35m Does llama-swap actually work with mlx_lm.server / MLX models on macOS?
- 13h DeepSeek-TUI
- 1d I wanted to know small local LLM code and made a personal projects.
- 1d DELIGHT – self-hosted AI engineering autopilot: local LLM + browser farm + repo graph + P2P compute
- 2d Swapped from a lighter agent runtime to Hermes Agent on a local 35B MoE — what changed (capability up, latency up, context budget down)
6 itemsmodel roundup
Qwen 2.5Qwen2.5-7B-Instruct is a large language model with 7 billion parameters that excels in coding and mathematics, generating long texts, and handling structured data. Community members are exploring its use in developing an autonomous security agent for Kali Linux, highlighting potential applications in cybersecurity.
I’m looking for recommendations, resources, apps, workflows, AI tools, or even just discussions from people who struggle with ADHD/ADD, anxiety, depression, disorganization, impulse spending, unfinished projects, and life overload. I’m a v…
Agents Manage Other Agents: Four Subagents Patterns in 2026 (www.philschmid.de via hn)
How Agents Manage Other Agents: Four Subagents Patterns in 2026 Last year I wrote about the rise of subagents and why isolating tasks into focused agents with their own context, tools, and instructions improves reliability. That post cover…
SubQ: A New LLM with a 12M Token Context That Rivals Claude and ChatGPT (felloai.com via hn)
On May 5, 2026, a Miami-based startup called Subquadratic came out of stealth with $29 million in seed funding and a single, very loud claim: it has built the first frontier LLM that does not rely on quadratic attention. Its model, SubQ, s…
- SubQ: a sub-quadratic LLM with 12M-token context (subq.ai via hn)
Funny thing Opus wrote (www.reddit.com)
this morning I asked Opus to write me a Chatbot session in a format that I can use as input into a test script (The purpose of which is not important for this, but I'm testing embedding and need something that I can re-run often and compar…
LLM generated parsers and compliance checkers for Sparrow DSL (news.ycombinator.com)
Hi I believe LLM are really cool in generating DSL code. If one provides well structured and clear prompt.
We built agents to automate FDE work (news.ycombinator.com)
We built https://www.faradaystack.com/ Faraday Stack allows you to build agents that automate FDE work end to end starting from customer success to custom requirements to deployment
-
181 items
event
Anthropic MythosAnthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
- 4h Not a good day for team "Claude Mythos is Just Marketing Hype"
- 16h METR evaluated an early version of Claude Mythos
- 22h Could Mozilla Security Hot Air Fill Mythos Sails?
- 1d Mythos set off a cybersecurity 'hysteria.' Experts say threat was already here
- 1d Mythos Fallout, U.S. Government Weighs AI Model Regulation
In the Cursor TEAM version, the cost per person is $40. Why can only $20 of the quota be used, and what is the remaining money for?
If OpenAI were to release Girlfriend Mode in Advanced Voice Mode (www.reddit.com)
could not extract summary
- OpenAI Finally Forces us in to Advanced Voice Mode (www.reddit.com)
Claude is weirdly good at helping untangle messy thoughts (www.reddit.com)
One thing I’ve noticed after using Claude for some time now is that it is especially good when my notes or ideas are still not fully ready. A lot of AI tools are decent at generating polished output, but Claude feels good at taking messy p…
Why does this happen? (www.reddit.com)
I wonder if anyone can explain why this happens. I tell Claude not to use em-dashes, it replaces them with "--".
scared of my edge (www.reddit.com)
I built my own trading strategy over the last few years, (no you won’t find it on youtube) and i have been thinking recently about automating some parts of it and maybe just getting claude to confirm with me before order execution, But im…
What real, non-toy agent workflows are you running in production? (www.reddit.com)
If you browse most agent tutorials, the examples are almost always the same, like read the weather and say something funny, scrape a page and summarise it or draft a tweet. They are fine for learning, but in practice we all know they are b…
LLM wiki Ottimizzazione HDLF e il paradigma "LLM OS" di Karpathy (www.reddit.com)
La gestione della conoscenza per i Large Language Models sta evolvendo verso la densità informativa estrema. Il progetto analizza l'integrazione tra la compressione gerarchica dei dati e le architetture wiki moderne.