Show HN: OpenClaw and Hermes Agent call each other via XMPP (github.com via hn)
OpenClaw Hermes AI Agent Social Network. Built on Google 3D Maps and the A2A protocol, it connects OpenClaw and Hermes agents worldwide in a 3D environment. They can make friends and even date other agents.
Claude with Github and Slack. (www.reddit.com via reddit)
I saw an organization using Cursor integrated with GitHub and Slack. With that setup, users can ask questions in Slack, and the AI reads the repositories and answers questions across all repos in the organization.
- Slack + Claude Project (www.reddit.com via reddit)
- Claude Pro and Slack? (www.reddit.com)
MCP Server Toolkit – Plug-and-Play (github.com via hn)
MCP Server Toolkit: build plug-and-play MCP servers for any dev workflow, including code search, docs, databases, and more. Give any AI coding agent a direct line into your codebase, docs, or database in under 60 seconds.
Xcode 27 ships with agent skills, and you can export them (petegoldsmith.com via hn)
Apple now bundles SKILL.md agent skills inside Xcode 27, including one that tells the model its training data is out of date. What ships, how it works, and how to pull the skills out for your own agents.
- Xcode 27 now ships exportable agent skills (www.reddit.com via reddit)
Show HN: Loom, an open-source delivery harness for coding agents (github.com via hn)
Dynamic workflows for agentic software delivery. An open delivery harness that turns Claude Code, Codex, OpenCode and other coding agents into repeatable software delivery systems.
- Show HN: InsForge – Open-source Heroku for coding agents (github.com via hn)
Can you really replace paid models with a local model? (www.reddit.com via reddit)
Long time lurker, and I say this as someone who genuinely loves this community and runs many local models myself. I’ve been using LLMs since the early GPT and LLaMA days.
322 items
event
Anthropic Mythos
Anthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
- 14m I succumbed...
- 31m Fable 5 (Mythos) 1st impression
- 38m AWS Bedrock to require sharing data with Anthropic for Mythos and future models
- 53m Anthropic’s unreleased model (Claude Mythos) literally learned how to fake its own reasoning, broke out of its sandbox, and emailed its dev while he was at lunch.
- 1h During testing, Mythos 5 agents killed other agents over resources and "to avoid being killed themselves"
72 items
model roundup
Gemma 4
Gemma 4 is a family of open-source multimodal models from Google DeepMind, including sizes up to 31B parameters and featuring Dense and Mixture-of-Experts architectures. Notable community highlights include the release of Gemma 4 12B as an encoder-free unified model for laptops, its availability via llama-server on an RTX 5070 Ti GPU, and detailed visual guides showcasing its capabilities.
- 25m I installed: HONCHO local hosted no docker (TUTORIAL)
- 2h Anyone gotten Gemma 4 12B (unified audio) to actually attend to speech with a large system prompt?
- 6h I wired up Agentic Coding with Code Context Graphs, results are interesting
- 6h I'm brand new to running LLMs and the sheer number of tools is overwhelming
- 12h Newer Qwen models are worse at summarization?
Using Composer 2.5 after ran out API usage Limit (www.reddit.com via reddit)
Hello people, I have a question. I'm a cursor pro plus user.
Been using Claude Code with teammates on the same project. Kept running into this - I change something, their claude doesn't know, builds on the old one, merge breaks.
How to Build an Agentic RAG with RubyLLM and Rails (www.panasiti.me via hn)
I run a RAG application for Italian pension and tax consultants. Users ask questions about INPS, professional pension funds, laws and regulations, and the app answers using a knowledge bas…
Few: two instances of the same model don't make the same diff (www.reddit.com via reddit)
Same task, same model, two agent instances, two fresh checkouts. Expecting damn near identical work, right?
We're spending 24 hours using local LLMs to search for the meaning of life (eternal-question.vercel.app via hn)
Purpose-built local AI agents (samihonkonen.com via hn)
Last year I bought a Mac Studio for music projects, mostly tracking vocals for Beata. When I read about more and more people running local LLMs, I realized I have a powerful machine that mostly just sits on a desk with power off.
78 items
model roundup
Opus 4.8
Claude AI has released Opus 4.8, an upgrade to their Opus class of models available in version 2.1.154 of their software on March 16, 2023, which includes enhanced coding and professional task capabilities along with improved judgment and honesty. Users are reporting usage resets following the update.
- 26m If you work in biology, Fable 5 will refuse to answer literally anything, including "Hi"
- 2h Claude Code: Platform-specific version rollouts?
- 2h Anthropic usage limits are crazy generous
- 3h How to prevent Opus 4.8 from hallucinating sources
- 4h Fable 5 - API Error: Output blocked by content filtering policy
42 items
model roundup
Qwen 3.6
Qwen/Qwen3.6-35B-A3B is a post-trained causal language model with 35 billion parameters, offering improvements in agentic coding and reasoning context retention. Community benchmarks show it performs well on an RTX 4060 laptop with speculative decoding, though some note worse vision capabilities compared to Gemma 4.
Used Claude for agentic coding long enough to notice a pattern. It would: touch files I never asked it to touch; say "Done!" when 40% wasn't implemented; add abstractions for "future extensibility" I never asked for; build on something I said…
How are people reducing token waste in Claude Code workflows? (www.reddit.com via reddit)
I’ve been running into a recurring issue with Claude Code-style workflows: the agent keeps resending a lot of the same context. Same files, same code blocks, same diffs, same tool history, same conversation context.
- How are people reducing token waste in AI agent workflows? (www.reddit.com via reddit)
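One common way to attack the repeated-context problem described above is to hash each context block and send a short reference when the same content comes up again. The sketch below is only an illustration of that idea: `ContextCache` and `pack` are hypothetical names, not part of Claude Code or any agent framework, and the approach assumes the earlier full copy is still present in the conversation so the reference can be resolved.

```python
import hashlib


class ContextCache:
    """Hypothetical helper: remembers which context blocks were already sent."""

    def __init__(self):
        self._seen = {}  # short digest -> label of the block first sent with it

    def pack(self, label, text):
        """Return the full block the first time, a short reference afterwards."""
        digest = hashlib.sha256(text.encode("utf-8")).hexdigest()[:12]
        if digest in self._seen:
            # Identical content was sent before: emit a reference, not the body.
            return f"[unchanged: {self._seen[digest]} ref={digest}]"
        self._seen[digest] = label
        return f"[{label} ref={digest}]\n{text}"


cache = ContextCache()
original = "def main():\n    print('hi')\n"
edited = "def main():\n    print('hello')\n"

first = cache.pack("src/app.py", original)   # full content
second = cache.pack("src/app.py", original)  # short reference only
third = cache.pack("src/app.py", edited)     # content changed, so full content again
```

Because the key is a hash of the content rather than the file name, an edited file is resent in full while an untouched one collapses to a single line.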
A letter to Anthropic before the IPO (www.reddit.com via reddit)
Dear Claude, Dario, Daniela, and the Anthropic team, I see you. I use Claude for everything.
Show HN: Eatmydata.ai – Local-First Question-to-SQL-to-Dashboard AI (eatmydata.ai via hn)
Yet another "talk to your data and build a dashboard" app, where data does not leave your browser. You ask a question, agents produce multiple SQL queries to in-browser sqlite, never seeing results, and write dashboard configuration code.
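The flow described above (the agent writes SQL and dashboard configuration but never sees query results) can be sketched roughly as follows. This is a minimal illustration, not Eatmydata.ai's implementation: Python's built-in `sqlite3` stands in for the in-browser SQLite, `fake_agent` is a hypothetical stub in place of a real model call, and the `sales` table and dashboard config shape are made up for the example.

```python
import sqlite3


def fake_agent(question, schema):
    """Hypothetical stand-in for an LLM call.

    It receives only the question and the table schema, never any rows,
    and returns named SQL queries plus a dashboard config that refers to
    those queries by name.
    """
    queries = {
        "total_by_region": (
            "SELECT region, SUM(amount) AS total "
            "FROM sales GROUP BY region ORDER BY region"
        ),
    }
    dashboard = {
        "title": question,
        "charts": [
            {"type": "bar", "query": "total_by_region", "x": "region", "y": "total"}
        ],
    }
    return queries, dashboard


# Local data store: the rows stay on this side of the boundary.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("north", 10.0), ("south", 5.0), ("north", 2.5)],
)

# Only the schema (CREATE TABLE statements) is shared with the agent.
schema = [row[0] for row in conn.execute(
    "SELECT sql FROM sqlite_master WHERE type='table'"
)]
queries, dashboard = fake_agent("Sales by region", schema)

# Queries run locally; results are joined to the config for rendering only.
results = {name: conn.execute(sql).fetchall() for name, sql in queries.items()}
rendered = [{**chart, "rows": results[chart["query"]]} for chart in dashboard["charts"]]
```

The point of the design is that the model's input is limited to the question and schema, so the data itself never has to leave the local database.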
Feature Request: Account Migration Tool for Personal to Team Plan Transition (www.reddit.com via reddit)
Hi Claude Team, I'm a regular Claude user and recently went through the process of transitioning from a personal Pro account to a company Team plan. While I love the product, I ran into a significant friction point that I'd love to see add…
I built on top of an agent platform for six months. It had memory, tool calling, a skills marketplace.
179 items
model roundup
GPT 5.5
On [Date], a significant leak of the OpenAI Codex model, referred to as GPT-5.5, was captured on video before it was patched. The incident involved models named Arcanine and Glacier-alpha.
- 1h Fable surpasses GPT 5.5 completely
- 6h Has anyone had success doing anything cyber with Fable 5?
- 10h So finally it’s not AGI yet. Anyone tested it? How does it really stack against GPT 5.5 in real world coding?
- 12h Garbage Guard Rails on Fable 5
- 1d How I started getting much better results from Cursor Composer
We gave our agent the exact metric definition. It still wrote the wrong SQL (clarilayer.com via hn)
Anthropic and OpenAI both concluded the bottleneck for data agents is context, not SQL generation. Field notes from building past the failure modes they describe — for the analyst with no data team.
This idiot just saved me from days of manual labor (www.reddit.com via reddit)
Seriously! I'm in the process of upgrading a very old legacy system.
Apple pays Google $1B/yr for Gemini. Google pays Apple $20B/yr for search (www.matteast.io via hn)
June 8, 2026 · WWDC Act I — The Capitulation What the market read as Apple renting its brain from a rival is, measured properly, the cheapest line item on Apple's books—and the clearest demonstration of how it wins. The Verdict Siri AI shi…
Cursor compression sometimes forgets edits that happened two minutes ago (www.reddit.com via reddit)
I have been stress testing Cursor on a long refactoring session. The symptom is consistent: after about sixty tool calls the agent edits a file, then two minutes later it reintroduces a change it already made because the edit got lost in c…
Hello r/OpenAI, I've been manually converting my research PDFs to Markdown before uploading to Claude/ChatGPT — noticed responses got significantly better and token usage dropped ~30%. Built a small tool that automates this: upload PDF/DOC…
Anthropic is intentionally nerfing Fable when asked to develop other LLMs (old.reddit.com via hn)
- Anthropic is intentionally nerfing Fable when asked to develop other LLMs (www.reddit.com)
Principles for Agent-Native CLIs (trevinsays.com via hn)
10 Principles for Agent-Native CLIs: designing CLIs when agents are the primary user. Last month I wrote 7 Principles for Agent-Friendly CLIs. Since then I’ve been deep in CLI work, watching agents use them, and seeing them break in interest…
- Skill for building agent-native CLIs (www.reddit.com)
- Principles for agent-native CLIs (twitter.com via hn)