Mini PC for local LLMs in 2026 (terminalbytes.com via hn)
I bookmarked a GMKtec EVO-X2 listing in October last year. 128GB Ryzen AI MAX+ 395, listed at $2,099.
What do I need to know as I embark on my multi-agent empire? (www.reddit.com)
If you were going back to your first day with claude an autonomous business agents, what would you do differently? What add ons / plugins would you use?
Ask HN:Do people configure Claude Code to use other models (openrouter.ai via hn)
Claude Code is Anthropic's agentic coding tool that reads your entire codebase, plans and executes changes across files, runs tests, and iterates on failures, all from natural language prompts. Claude Code uses OpenRouter to access hundred…
Max subscriber but still locked out of Claude Design — anyone else? (www.reddit.com)
I'm on the Max plan and tried opening claude.ai/design, but I'm getting the lock screen saying "Claude Design is available to users on subscription plans" with Max actually listed as one of the eligible plans. So the page knows I should ha…
-
5 items
model roundup
Grok 4.3Grok 4.3, launched by xAI on [specific date if provided], improves the Artificial Analysis Intelligence Index to 53 with enhanced agentic performance, reducing input and output prices by approximately 40% and 60%, respectively, though it has a slightly higher hallucination rate compared to Grok 4.20.
- 13m Grok 4.3 is way cheaper and better than before
- 8h Grok 4.3: strong in finance and long-context, with some tradeoffs
- 16h Grok 4.3 underperforms Grok 4.20 0309 on the Extended NYT Connections Benchmark, dropping from 93.4 to 67.5, though it achieves this result at a lower cost than the earlier Grok 4.20 run
- 1d Grok 4.3
- 1d Grok 4.3 achieves higher overall intelligence over 4.20 with less of a cost, at the price of slightly higher hallucination rate.
162 itemsevent
CopilotMicrosoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
OWASP Agent Security Regression Harness (github.com via hn)
OWASP Agent Security Regression Harness The OWASP Agent Security Regression Harness is an open source, vendor-neutral test harness for running executable security regression scenarios against agentic applications and MCP-integrated systems…
I built a free open source RPG-inspired character-sheet app with CC (www.reddit.com)
I started building a structured way to store context between chats before Claude had its auto updating built-in project memory. Even now that's a thing I personally prefer asking Claude to update a structured JSON file.
I’m testing an API-native mailbox service for agents and automations. The idea: create inboxes by API, receive messages/attachments as webhooks, and avoid giving agents access to real human mailboxes.
Bland.ai frustration (www.reddit.com)
Has anyone else had just about the worst experience possible trying to set up a phone agent for their business? I run a swimming pool shop out of which we run a service and construction business for swimming pools, and I have been working…
-
146 items
event
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
101 itemsevent
Altman AttackSam Altman, CEO of OpenAI, has faced multiple attacks on his home in San Francisco, including firebombing and drive-by shootings, raising concerns for his safety. Additionally, a majority of over 100 people interviewed by Ronan Farrow described Altman as a "pathological liar.
- 41m Breaking: someone has out-vague-posted Sam Altman. It wasn’t known to be possible until
- 1h Sam Altman has changed his stance on the claims that AI will replace humans.
- 9h I can't believe Altman said this
- 12h Sam Altman says OpenAI doesn't want to replace you with AI
- 17h Sam Altman falls out of love with universal basic income
Roll your own local AI coding agents to save money (www.theregister.com via hn)
Usage-based pricing killing your vibe - here's how to roll your own local AI coding agents Take those token limits and shove them by vibe coding with a local LLM With model devs pushing more aggressive rate limits, raising prices, or even…
Advice for Beginners (www.reddit.com)
This is aimed at the people like me who for the life of me couldn't figure out how to actually get a useful or even working agent built. Just caught in a loop of unfinished slop and ai bots unable to make me a millions dollars overnight.
Show HN: Vdiff – CLI to help you review AI-generated code (news.ycombinator.com)
Hey, you probably already saw that reviewing AI-generated code is a nightmare and quickly becomes a bottleneck. Everyone is using AI agents to write code fast, but the hard part is reviewing a bazillion lines.
the agent company I joined is imploding (www.reddit.com)
Three months ago I left my cushy enterprise software job to join what I thought was the future of AI agents. Today I'm updating my resume while my coworker Jake stress-eats Cheetos at 2:47 AM because our Series B just evaporated.
-
48 items
event
MistralMistral, a French AI company, is set to release a medium-sized model with 128 billion parameters and is planning to launch Workflows in public preview. The company, founded by Arthur Mensch, continues to grow its AI empire despite not being based in the United States.
130 itemsmodel roundup
Qwen 3.5Qwen3.5-9B is a post-trained model with 9 billion parameters that integrates multimodal learning and efficient hybrid architecture for enhanced performance. Community highlights include speculative decoding on Apple Silicon boosting Qwen3.5-9B's throughput by 4.1x, and the model outperforming others in coding tasks while addressing overthinking issues through tool usage.
- 41m [Help] Running big dense models faster
- 51m found a new project memory MCP with hybrid recall (BM25 + vectors + RRF) on FFT Qwen3.5-4B
- 3h We are finally there: Qwen3.6-27B + agentic search; 95.7% SimpleQA on a single 3090, fully local
- 7h Show HN: Hollow is an open-sourced self-modifying agentic system
- 15h I Cut Claude API Costs by 50% Using This Self Modifying Agentic System
Internet of Agents (news.ycombinator.com)
What do you think will be necessary to build a protocol for internet of agents ?
- The Internet Needs a New Layer for AI Agents (www.reddit.com)
- AI Agents (www.reddit.com)
- I’ve built the "The Internet For AI Agents" (www.reddit.com)
Interoperability for LLM Applications (news.ycombinator.com)
Is there any suitable way to make a user switch applications such as Chatgpt and Claude but maintain their context , without copy paste shenanigans ?
In recent months, discussions have increased around one central topic: OpenAI is building an advertising and marketing infrastructure around ChatGPT. This does not mean that OpenAI is selling chat conversations.
Favorite Connectors, Skills etc. (www.reddit.com)
New to Claude and am looking for some guidance on what you guys are using. Since I’m just learning looking for “simple-r” options, but if you have references or tutorials you’d recommend to get further in the weeds that would be great!
-
31 items
model roundup
GPT 5.4OpenAI has released GPT-5.4-Cyber for testing and claims it will compete with Claude Mythos. Meanwhile, GPT-5.4 Pro has solved the Erdős Problem #1196, showcasing its advanced capabilities in mathematics.
- 58m GPT 5.5 tops private citation benchmark on Kaggle (AbstractToTitle task)
- 6h UPDATE: The method from the proof generated by GPT-5.4 Pro for Erdos Problem #1196 was successfully applied to other problems including another 60 year old Erdos conjecture.
- 20h gpt-5.5 API is randomly and inconsistently resizing image inputs
- 22h GPT-5.5 vs. GPT-5.4 vs. Opus 4.7 on 56 real coding tasks from 2 open source repo
- 1d AI Security Institute: GPT-5.5 "may be the strongest model we have tested" for cyber exploits, including Mythos
Does AMD's "infinity cache" even matter for dense model inference? (www.reddit.com)
AMD has nailed the SEO/AEO for this query in Google: 7900 xtx memory bandwidth I get back this response: The AMD Radeon RX 7900 XTX features 24GB of GDDR6 memory with a maximum bandwidth of 960 GB/s. It uses a 384-bit memory interface with…
Show HN: AgInTiFlow, a local web and CLI agent workspace using DeepSeek (www.npmjs.com via hn)
AgInTiFlow is a web-first coding agent and CLI with DeepSeek routing, sandboxed tools, model providers, canvas artifacts, and optional wrappers. English · العربية · Español · Français · 日本語 · 한국어 · Tiếng Việt · 中文 (简体) · 中文(繁體) · Deutsch ·…
Hi all, my Mac mini just got delivered. Want to play around with Claude code.
Streamline your customer support process. Prompt included. (www.reddit.com)
Hello! Are you overwhelmed with customer support tickets and unsure how to extract valuable insights from them?
Beyond Memorization: Do Larger Models Know More, or Just Better? (www.reddit.com)
Just read 2 papers: 1. Incompressible Knowledge Probes 2.
Codex Pets (developers.openai.com via hn)
- What is Codex? (openai.com)
Long-time Claude user, finally built something for the long-session problem and want this sub's read on whether it's actually useful or solving something I made up. The pattern that pushed me to build: 60+ messages into a Claude session, t…