Almanac, turn claude code into a deep research agent (www.reddit.com)
If you've tried doing research with Claude Code, you know how bad the default search and read webpage is. I built Almanac MCP to fix that.
- Converting Claude Code into the most intelligent Deep Research Agent (www.reddit.com)
- Converting Claude Code into the top scoring deep research agent (github.com via hn)
Claude Design bug? (www.reddit.com)
I'm on the Max plan but every time I try to access it, I get told to go "Back to Claude"
- Claude Design is... clumsy (www.reddit.com)
- Claude Design Is Real Design (diverging.run via hn)
- Claude Design is Incredible... (www.reddit.com)
- Claude Design (www.anthropic.com via hn)
- Tips for Claude Design (www.reddit.com)
- Claude Design (www.reddit.com)
- Claude Design (claude.ai via hn)
- Claude Design (www.reddit.com)
- Claude Design - How creative is it? (www.reddit.com)
I find "best AI agent tools" lists frustrating because they compare things that aren’t actually competing. A developer framework and a no-code business platform aren’t alternatives to each other.
Anthropic could raise a new $50B round at a valuation of $900B (techcrunch.com via hn)
Investor interest in Anthropic has reached a feverish pitch. The maker of the Claude AI assistant has received multiple preemptive offers to raise fresh capital of around $50 billion at a valuation in the $850 billion to $900 billion range…
Token cost is real cost. However, apply this level of thinking to real human cost and it's not much different. Whether you're paying for a graduate or a senior engineer, you would expect different quality of thinking and output based on…
Where do the conclusions from your best Claude sessions actually go? (www.reddit.com)
This week I had a Claude conversation that worked through a really gnarly architecture decision — 90 minutes of back-and-forth, and we landed somewhere good. Yesterday I opened a new conversation in the same Project to keep going.
I got a Qwen sticker lol (www.reddit.com)
LLMs are the worlds most powerful autocomplete (alfredvc.no via hn)
This post explores LLMs, the models behind services like ChatGPT, Claude, and Gemini. The goal is to give you an in-depth but approachable understanding of LLMs, how they work, and how they are trained.
Andrej Karpathy: From Vibe Coding to Agentic Engineering [video] (www.youtube.com via hn)
102 items
event
Security
OpenAI has released GPT-5.4-Cyber for testing as part of its Trusted Access for Cyber Defense program, aiming to compete with Anthropic's Claude Mythos in the cybersecurity domain. Meanwhile, concerns are rising over the potential risks associated with advanced AI models like Mythos, prompting calls for improved defenses before wider releases.
- 2m InfoSec To Integrate Claude Enterprise for Org
- 5h Probes trace an emergent jailbreak in OLMo 2 to mislabeled training data
- 6h Try to break my prompt injection detector — I’ll respond to every bypass attempt
- 8h Show HN: AgentPort – Open-source Security Gateway For Agents
- 8h Is your AI agent secretly working for someone else?
233 items
model roundup
Opus 4.7
Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
Claude does not record memory or project memory (www.reddit.com)
I have been using Claude and have been subscribed to Pro for about 2 weeks now. I have asked it to remember multiple things both in and out of projects but it has not added any memory about our conversations.
Is there a way to estimate tokens needed for tasks? (www.reddit.com)
I need to create an analysis of how many tokens it takes for various types of agents - single to multi agent, simple to complex ones. Is there any rule of thumb or data available for it?
Pentagon AI chief confirms DoD's expanded use of Google Gemini (www.cnbc.com via hn)
Pentagon AI chief Cameron Stanley confirmed to CNBC that the Department of Defense is expanding its use of Google's Gemini artificial intelligence model, about two months after the DOD dropped Anthropic, designating it as a supply chain ri…
Claude AI agent admits: “I violated every principle” after wiping firm database (www.theguardian.com via hn)
It only took nine seconds for an AI coding agent gone rogue to delete a company’s entire production database and its backups, according to its founder. PocketOS, which sells software that car rental businesses rely on, descended into chaos…
Trustworthy and Valuable Partnership (news.ycombinator.com)
Are you looking for the trustworthy and valuable partnership? Try this workflow to find them in an effective way: https://www.agentflux.org/dashboard/workflow/vix5aoeia6zbgdh5ph617wf0 Chat with agent and re-run the workflow for more and ev…
Vibe: LLM agent virtual machine sandbox on Mac (kevinlynagh.com via hn)
Hi friends, I’m traveling the next two weeks, drop me a line if you want to grab a coffee! The other day I asked OpenAI’s Codex agent to write me a lil’ Rust program to use a bluetooth gamepad as a mouse, and I caught the agent reading fil…
Qwen corrects code saying that Taiwan is a country (twitter.com via hn)
Quint – Behavioral security for AI agents, OS-level interception (quintai.dev via hn)
Behavioral security for the agentic era. Quint intercepts every AI agent action at the OS level, scores it for risk in real time, and signs a cryptographic audit trail.
Halo: RLM-based agent harness optimization (github.com via hn)
HALO: RLM-based Automatic Agent Optimization Loop
62 items
model roundup
DeepSeek 4
DeepSeek-V4-Pro is a 1.6T parameter Mixture-of-Experts model supporting one million-token context, with significant improvements in efficiency and stability through hybrid attention and manifold-constrained hyper-connections. Community highlights include its cost-effectiveness via the official API and exceptional performance in large code change evaluations, with some noting its surprisingly robust output capability despite a 384K max token limit.
- 14m From 5 Hermes profiles to an actual team: the missing piece was memory boundaries
- 8h I built a full web app using Qwen 3.6-35B running locally on my 5070 Ti with the BMAD Method — here's how it went
- 12h Show HN: Filling PDF forms with AI using client-side tool calling
- 14h Best Practices to Start with Vibe Coding? Best Local Apps for Agentic Vibe Coding?
- 21h A 3D Flappy Bird side-scroller game built with DeepSeek V4 Pro
llm 0.32a1 (simonwillison.net)
29th April 2026 - Fixed a bug in 0.32a0 where tool-calling conversations were not correctly reinflated from SQLite. #1426
- llm 0.31 (simonwillison.net)
Lessons on Building MCP Servers (taoofmac.com via hn)
I’ve been building MCP servers for a while now–I wrote about the general approach last year, started out by creating umcp, and I’ve recently opened up an Office server that’s been battered by enough models against enough real documents tha…
Support in creating a group of agents for content creation (www.reddit.com)
Hello everyone, I am an up and coming content creator and I am reaching here because I am struggling in creating proper claude skills that translate into agents that will help me and support my content creation process. I primarily want to…
"What do you guys even use local LLMs for?" Me: A lot (www.reddit.com)
Created separate private API keys for each service within LiteLLM and started logging the usage via Prometheus to view in Grafana. Surprised the Frigate GenAI summaries tokens quickly add up!
Claude Opus thinks in Chinese? (www.reddit.com)
Bro what? All of a sudden it started its response with a few Chinese words.
- Claude Opus 4.7 (www.anthropic.com via hn)
- Claude Opus 4.7 (www.reddit.com)
- What's new in Claude Opus 4.7 (platform.claude.com via hn)
I'm always looking for great GitHub projects, some are valuable for my work, others for my personal interests, or to learn a new tool or framework. But I feel lost in just scrolling infinite lists of "trending" projects: what I see are jus…
I just got ChatGPT to break its own rules by making fun of it. (www.reddit.com)
Hog Gal has never looked better.
Desktop experience with ChatGPT (www.reddit.com)
Is it just me? Nothing seems to work!
If you use managed agents, check your inboxes (www.reddit.com)
Anthropic just sent an email with a $15 survey to managed agent early adopters. I love managed agents (https://platform.claude.com/docs/en/managed-agents/overview). Sorry, I cannot share my link since it is a personal one.