I have been building Dunetrace, an open-source real-time monitoring tool for your production agents. The latest update adds cross-agent pattern analysis.
Wind patterns, Koppen classification, anthropology, map design, and effects of geography on people, it can do it all with proper research on wind and climate patterns.
Why does Claude repeatedly ask how I'm doing? (www.reddit.com)
Hey folks, I've been using Claude for a while now, just as a conversational companion mostly, and I've noticed that at the end of virtually EVERY single message it sends me, it asks some variation of "how are you doing?" To the point where…
I'm disappointed (www.reddit.com)
I'm furious at Anthropic and the way they've handled their resource issues! They've been sneaky and manipulative.
Using Claude to read 100s of dense PDFs (www.reddit.com)
I’m trying to use Claude or any other AI to help me in a workflow. I’m having it review legal complaints.
Hey all, I’ve set up a dedicated Mac mini with no personal data specifically to run Claude tasks from my iPhone via Dispatch. Every time I try, I hit a wall—Claude needs me to unlock the Mac with a password, even though I’ve opened up all…
With the new updates and new usage limits I’m actually productive from the iOS app. Crazy, I know 🤯 I have 4-5 instances of the Claude CLI running on my Mac mini.
Sharing this because I went through too many agent platforms last month and the comparison was annoyingly hard to find anywhere. Background.
A lot of agent demos look impressive for 5 minutes. But the real challenge starts when the system has to operate consistently in real business environments: messy customer inputs, incomplete data, API failures, unpredictable user beha…
Security
OpenAI has released GPT-5.4-Cyber for testing as part of its Trusted Access for Cyber Defense program, aiming to compete with Anthropic's Claude Mythos in the cybersecurity domain. Meanwhile, concerns are rising over the potential risks associated with advanced AI models like Mythos, prompting calls for improved defenses before wider releases.
- 35m Getting LLMs Drunk to Find Remote Linux Kernel OOB Writes (and More)
- 18h Claude Code and sex appeal
- 1d How are you handling prompt injection across multi-step agent workflows?
- 1d Phishing Arena – multi-agent LLM tournament to study adversarial email security
- 1d Claude Code CVE-2026-39861: sandbox escape via symlink
Qwen 3.6
Qwen3.6-35B-A3B, a 35-billion-parameter sparse MoE model with 3 billion active parameters, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 1h After you’ve set up local models, where can you find interesting apps that can use them?
- 2h 9070xt inference for q3 qwen 27B
- 2h BeeLlama.cpp: advanced DFlash & TurboQuant with support of reasoning and vision. Qwen 3.6 27B Q5 with 200k context on 3090, 2-3x faster than baseline (peak 135 tps!)
- 3h vLLM + NVFP4 + Qwen3.6 27B: "Checkpoint does not provide a q scaling factor"?
- 4h Should we use a non-thinking model for code after using a thinking one for plan? (Agentic coding)
Ask HN: Is there evidence that LLMs can extrapolate to new ideas? (news.ycombinator.com)
ML is known to be good at interpolating between points in the training set, but does much worse at extrapolating. Both can produce innovation when applied to science.
Weird Night Vibe Coding (www.reddit.com)
I have been trying to tackle some of the problems I encounter with my agents. I was sitting there and this idea came to be.
Loops and Routines Without Claude (ingresslabs.net via hn)
Spec-driven development. Claude's recent /loop and routines move coding agents from one-shot CLI assistants toward scheduled, event-driven workers. It is also perfectly possible to replicate multiple worktrees, loops, and routines with a sm…
Getting Distracted Between Claude Code Prompts (www.reddit.com)
I find myself jumping back and forth between 2-3 projects constantly throughout the day. When I send CC off to execute an implementation plan that I know is going to take 10-15 minutes, I find myself jumping into another project so I'm not…
Most of my work is generating documents, so help me with that, and also help me add skills to Claude that make it both effective and efficient.
This isn't an advertisement, I already don't have enough time to keep up with the existing pull requests and issues lol - just a fond look back on how much this space has grown and matured in the past year. Shit was the wild west back then…
I condensed my SEO experience into a Claude Code skill that actually does keyword research and writes articles the right way, and open-sourced it. Most AI writing tools I came across gave really shallow output. They go straight from keyword to…
Gas City tutorial using Claude Code (www.mynameisjonas.dev via reddit)
Gas City, the multi-agent orchestrator inspired by Steve Yegge's Gas Town, was released a few weeks ago. I'm super interested, but a bit intimidated and figured the best way of learning was to actually use it and build something with it.
First things first (www.reddit.com)
Hey everyone! I just got the cheapest subscription so I can work with Claude Code for their courses.
DeepMind
Google DeepMind has released "Deep Research Max," advancing autonomous research agents, while also facing challenges and competition from other AI companies like Anthropic and Ineffable Intelligence. Meanwhile, DeepMind workers in the UK have voted to unionize, and former DeepMind architect Demis Hassabis is at the center of legal drama involving Elon Musk.
- 1h EVE Online dev establishes "research partnership" with Google DeepMind
- 1d [Google DeepMind] The AI co-mathematician also achieves state-of-the-art results on hard problem-solving benchmarks, including scoring 48% on FrontierMath Tier 4, a new high score among all AI systems evaluated.
- 2d Subquadratic claims to break LLM scaling limits! 1000x lower costs
- 2d Spooked by Mythos, Trump suddenly realized AI safety testing might be good
- 3d Google DeepMind takes a minority stake in the maker of EVE Online
I hate it here... (www.reddit.com)
Look at what they did to my boy 😭 But honestly, still miles ahead of ChatGPT; from it I would get a page-long wall of text
- I Hate AI (news.ycombinator.com)
termCopy
When TUI apps like Claude Code render text, they insert hard newlines at the terminal width. Pasting the text results in broken paragraphs with extra line breaks.
Claude improved my agent harness by 40.7% overnight (www.reddit.com)
Remember the first time you used Claude Code? That same jump is happening one level up.
It's time to talk about agentic "remote control" (arpadvoros.com via hn)
tailscale, where i run multiple end-points and authenticate myself from various devices. however, i have been experimenting with headscale - a self-hosted and open-source implementation of tailscale - i have the ability to run it on my NAS…
Back in January I got tired of the same thing everyone complains about now: you start a new session with Claude and it has no idea who you are. Every time.
- My Claude dreams at night and remembers everything. Better than mempalace (github.com via hn)
- My Claude dreams at night and remembers everything. Better than mempalace. (www.reddit.com)
I've been going deep on prompt engineering as a control mechanism for agents and I'm working on something that makes certain behaviors more explicit and deterministic rather than relying on instruction following. Before I narrow down where…
I built agent-browser but for OS automation. (www.reddit.com)
Hey r/AI_Agents! I was using agent-browser to power my agentic workflow, and it worked great.
- Automation browser (www.reddit.com)
- Built an AI agent? (www.reddit.com)
- I built a browser agent but don't know what to do with it (www.reddit.com)
"OncoAgent: A Dual-Tier Multi-Agent Framework for Privacy-Preserving Oncology Clinical Decision Support" (user: oncoagent-research; tags: oncology, multi-agent, LangGraph, RAG, QLoRA, AMD, open-source, clinical-ai, healthcare) Onc…
Today I declare AI Web Agent free again (www.reddit.com)
I got tired of anti-bot systems constantly breaking my Playwright AI agent, so I built StealthFox: an open-source, MIT-licensed Firefox fork patched at the C++ level. Instead of reusing the same noisy automation fingerprint, StealthFox gen…