Show HN: Memweave CLI – search your AI agent's memory from the shell (github.com via hn)
memweave Agent memory you can read, search, and git diff. memweave is a zero-infrastructure, async-first Python library that gives AI agents persistent, searchable memory — stored as plain Markdown files and indexed by SQLite.
The output doesn't matter: Thoughts on Aristotle's Craftsmen in the age of LLMs (lambdacreate.com via hn)
LLM assisted, or even just whole hog generated content, is all the rage these days. Whether it's an image, a video, a thought piece, a program; it frankly doesn't matter.
From $200 to $30: Five Layers of LLM Cost Optimization (blog.dwornikowski.com via hn)
The Problem One of the services I’ve been building for an ecommerce app is a product categorizer: given a product name, assign it a 3-level category path from a large taxonomy. The app is in Polish, so both the product names and the catego…
NeoAgent Your agent. Your server.
-
53 items
model roundup
GPT 5.5
On [Date], a significant leak of the OpenAI Codex model, referred to as GPT-5.5, was captured on video before it was patched. The incident involved models named Arcanine and Glacier-alpha.
- 4m Is GPT-5.5 actually a big step forward, or just a better efficiency story?
- 1h GPT 5.5 flags accounts for "potential high-risk cybersecurity"
- 2h GPT 5.5 Xhigh VoxelBench test. Minecraft builders got automated.
- 3h First impressions using GPT 5.5 for video game scripting
- 5h Testing GPT-5.5 in early access: what we are seeing so far
160 items
model roundup
Qwen 3.6
Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 58m What do you consider to be the minimum performance (t/s) for local Agent workflows?
- 2h Replace RTX 2060 12G with second RTX 5060 Ti 16G for Qwen 3.6 27B?
- 2h Qwen3.6-27B at ~80 tps with 218k context window on 1x RTX 5090 served by vllm 0.19
- 3h Throughput and TTFT comparisons of Qwen 3.6 27B, Qwen 3.6 35B A3B and Gemma 4 models on H100
- 3h Qwen 3.6 27b - can I run on 1x 3090?
suspiciously precise floats, or, how I got Claude's real limits If you're reading this, you're probably aware that Claude's subscription plans are a much better deal than the API. But how much better exactly, and what are the actual limits?
Anyone actually using Dispatch for something useful? (www.reddit.com)
I (sort of) get what Dispatch is supposed to do. Make Claude a task from your phone, come back to find it done on your desktop.
I’ve been messing around with AI agents for a while now but most of them feel kind of the same. They can browse things, summarize stuff, maybe generate reports but after that you’re still stuck thinking “ok cool… now what”.
I'm so fed up by the crazy long answers ChatGPT gives every time. (www.reddit.com)
I am tired of prompting "answer in concise manner" or "keep economy of words". For day-to-day queries, it's just giving stupid long answers every time with all these fancy emoji stuff and cringy punchlines.
-
193 items
model roundup
Opus 4.7
Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 59m Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T11:58:15.000Z
- 2h Claude Opus 4.7 has turned into an overzealous query cop, devs complain
- 6h How Anthropic can save Opus 4.7 with one change.
- 6h Claude Opus 4.7 didn't believe me that the model UV was damaged until I came up with a delta filmstrip idea for it to screenshot
- 20h Anyone noticed Anthropic didn't added the model Opus 4.7 and Mythos Preview to there Transparency Hub?
92 items
event
Cowork
Issues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
I’m a nursing student at NYU, and on the side I built The Drug Database (thedrugdatabase.com). The idea came from a simple frustration: every time I needed to look up a medication while studying, I’d end up jumping between Drugs.com, RxLis…
Most agent deployments I have worked on fail in predictable ways, such as bad data quality, missing business logic, and operator trust issues.
Hit your context limit mid-conversation? Annoying.
Prompt Fuzzing for Fuzz Driver Generation PromptFuzz is an automated tool that generates high-quality fuzz drivers for libraries via a fuzz loop constructed on mutating LLMs' prompts. The fuzz loop of PromptFuzz aims to guide the mutation…
-
5 items
model roundup
Gemini 3.1
- 1h Kimi K2.6 - the mighty turtle that wins the race
- 16h Deepseek V4 Pro is 15x cost to run Artificial Analysis bench from V3.2, higher than Gemini 3.1 Pro
- 1d Top open weight models like ds v4 pro max are still like 6-7 months if not more behind closed lab models
- 1d Back to the real world.....anyone having problems using Gemini API after the update on the model descriptiont/selection? Gemini 3.1 Pro and Gemini 3 Flash are not working, only Gemini 2.5 flash. Is there an update on the pipeline to fix it?
93 items
event
Copilot
Microsoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
- 2h Another Microsoft Copilot AD injected into 4M GitHub commits
- 9h The "AI will replace engineers" discourse has the abstraction level wrong
- 18h Ask HN: How are you evaluating AI apps and CLI?
- 19h Tried Cursor (after GH Copilot disaster) even took Pro+ for safety, and in 10 minutes I'm at 10%, I genuinely feel scammed or feel like it majorly glitched
- 1d Built a side-by-side AI tool comparator for coding, image, writing & search : also accepting tool submissions
Honestly, “easy long-term memory” isn’t about storage — it’s about reliable retrieval over time. From what actually works: Mem0 → easiest plug-and-play (good for MVPs) LangChain (LangMem) → solid if you’re already using it Letta (MemGPT) →…
Shipping the OpenClaw Stack in Public (agentbot.raveculture.xyz via hn)
Agentbot Journal Shipping the OpenClaw stack in public. Product updates, release notes, launch logs, and operator field reports from Agentbot.
Claude Code Routines: 5 Production Workflows That Ship Real Work (www.arcade.dev via hn)
TL;DR - Claude Code Routines enable unattended, cloud-run workflows via scheduled, API, and GitHub event triggers. Enterprise use breaks with demo-grade setups.
Months ago a meme was born, by now legendary among those following the sinking of OpenAI, according to which the members of the 5.x family are unable to solve the most elementary and silly logic question, represented by the pr…
-
36 items
model roundup
DeepSeek 4
- 2h DeepSeek V4 Update
- 4h To run deepseek v4 flash how much max vram we need? 175 gb or 320gb?
- 6h Show HN: A CLI to use any model in your coding agent
- 12h Deepseek V4 flash (high) rivals Gemini 3 flash at 1/5th the cost
- 12h DeepSeek V4 is out. 1.6 trillion parameters. MIT license. $1.74 per million tokens. The gap between US and Chinese AI strategy has never been more visible.
A few weeks ago I caught myself doing the same chore for the third time: opening 8 tabs (G2, Capterra, Reddit, GitHub Issues…), copy-pasting “what do you dislike?” into Notion, then trying to figure out which gaps my product already covers…
Amália- Open Source Large Language Model (LLM) for European Portuguese (portugal.gov.pt via hn)
Large Language Model for the Portuguese language. Following the Prime Minister's announcement of the launch of Portugal's first Portuguese-language Large Language Model (LLM Português), this past…
SAP API Policy v.4.2026
Hey r/AI_Agents, We built Caliber, an open source tool that handles the part of AI agent development that everyone dreads: environment configuration and setup. The problem we kept running into was that agent configs were always scattered.
n8n vs Latenode for multi-agent orchestration (www.reddit.com)
Recent benchmark results showing frontier models hitting new highs on OSWorld are the clearest signal yet that the "agent as coworker" framing isn't just hype anymore, and now I'm rethinking which orchestration layer actually holds up when…
Hi guys, I am working in a MNC they recently purchased Claude AI and asking us to build a skill in it which helps to make the work easy. I am totally lost with ideas.
Hey r/ClaudeAI 👋 I built Nuzlocke Tracker (nuzlocketracker.xyz), a free tool for Pokémon players running Nuzlocke challenges. It's live, 100% free, no signup required, no paywall — just open the link and use it.