Ok cursor play despacito (www.reddit.com)
How to build an agent that is both neuro-symbolic and probabilistic (www.reddit.com)
Most agent architectures treat memory like a rigid database, but that leads to the "stochastic drift" everyone complains about. My partner is a neuroscientist and we've spent the last year modeling an agent’s memory on biological systems r…
How I personally deal with Claude's limits without giving up on Opus (www.reddit.com)
I only use Sonnet as my main model. I instruct it to delegate indexing and similar grunt work to Haiku, and whenever something genuinely needs deeper thinking, I tell it to "consult Opus." Sonnet then explains the situation to Opus, gets t…
OpenClaw Alternatives? (www.reddit.com)
I just set up OpenClaw in a Docker container, currently with almost no tool access. I've heard of security issues around OpenClaw, but I don't know what else to use.
AI Visibility Monitor
A small toolkit for tracking whether your website appears in AI search results (ChatGPT, Claude, Perplexity, Gemini) and Google search, and for diagnosing the technical layer underneath that determines whether AI engi…
167 items
model roundup
Qwen 3.6
Qwen3.6-35B-A3B, a sparse MoE model with 35 billion total parameters and 3 billion active parameters, was released by Alibaba Qwen on April 16, 2026, as open-source software under the Apache 2.0 license. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
97 items
event
Cowork
Some users reported errors and disruptions in Claude Cowork on April 16, 2026. Google has also developed its own desktop agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
Claude was more useful as an inbox filter than a reply writer (www.reddit.com)
Claude got more useful for me when I stopped asking it to write the reply. The actual problem was after a post went up.
Should we plan with Codex, then code with Claude? Or should we plan with Claude, then code with Codex?
I'm trying as hard as I can to get a local setup somewhere in the ballpark of proprietary LLMs for code generation. My computer is running an Intel(R) Core(TM) Ultra 7 265K (3.90 GHz) with 128 GB of DDR5 RAM and an Nvidia GeForce RTX 5090 t…
The Professors Are Using ChatGPT, and Some Students Aren't Happy About It (www.nytimes.com via hn)
Show HN: Routiium – self-hosted LLM gateway with a tool-result guard (github.com via hn)
Routiium is a self-hosted, OpenAI-compatible LLM gateway I built. It does the table-stakes things you'd expect — managed keys, routing, rate limits, analytics — but the part I want to flag for HN is what it does on the agent side.
Agent-World: Scaling RW Environment Synthesis for General Agent Intelligence (agent-tars-world.github.io via hn)
Gaoling School of Artificial Intelligence, Renmin University of China; ByteDance Seed (work done during an internship at ByteDance Seed). What is Agent-World? A self-evolving training arena that unifies scalable…
40 items
model roundup
Sonnet 4.6
Sonnet 4.6, a new release noted for its "unhinged" behavior, has sparked discussions among users about unexpected changes in software performance and cost management strategies involving Cursor and Claude APIs.
- 1h Show HN: Mapping Sonnet's thinking process via flame charts
- 1h An experiment with Claude Sonnet 4.6
- 3h "We've partnered with OpenAI to offer it for 50% off through May 2." Please confirm that it means 50% off both input and output tokens, which means we are paying Sonnet 4.6 prices to use GPT 5.5 until May 2nd.
- 22h Sonnet 4.6 repetition
- 1d Has Claude become less intelligent? I had a frustrating day with Claude.
81 items
event
Security
OpenAI has released GPT-5.4-Cyber for testing as part of its Trusted Access for Cyber Defense program, aiming to compete with Anthropic's Claude Mythos in the cybersecurity domain. Meanwhile, concerns are rising over the potential risks associated with advanced AI models like Mythos, prompting calls for improved defenses before wider releases.
- 1h Self-Hosted AI Red Team Tools
- 14h LLM CTF challenges. Can you crack all 13?
- 1d Most AI agent "skills" on GitHub are unvetted garbage. I built a marketplace to fix that.
- 1d env variables and claude best practices
- 1d Security Audit of Mem0 (AI Memory Layer): 23 High-Severity Vulnerabilities found (SQLi, Prompt Injection, and more)
LLM-Rosetta — A Python library for converting between different LLM provider API formats using a hub-and-spoke architecture with a central IR (Intermediate Representation). Full documentation is available at: - English: https://llm-rosetta…
i think humans are better than ai automations (www.reddit.com)
I've seen a lot of people talk about automating their work using AI agents. I tried a couple of them this week, and all of them seem to have failed when it comes to real-life applications: either they're way too complex to set up or they just…
what is an agent? (www.reddit.com)
I know what an LLM is; what makes one agentic? Is it the fact that the results it produces go back in as a prompt?
- AI Agent Has Amnesia (www.coderabbit.ai via hn)
- Ecommerce AI Agent (www.reddit.com)
- Agent orchestration (www.technologyreview.com)
- AI agent for email (www.reddit.com)
- AI Agent for LinkedIn (www.reddit.com)
- Your agent is lying to you… (www.reddit.com)
This is an automatic post triggered within 2 minutes of an official Claude system status update. Incident: Investigated elevated errors and slower responses on claude.ai Check on progress and whether or not the incident has been resolved y…
- Claude Status Update : Investigated elevated errors and slower responses on claude.ai on 2026-04-25T19:02:15.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T08:57:57.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T11:58:15.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T08:25:22.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T08:43:51.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T08:37:30.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T07:48:31.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T14:53:02.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T16:29:45.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T15:03:39.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T15:20:03.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T15:57:36.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T14:55:35.000Z (www.reddit.com)
- Claude Status Update : Opus 4.6 elevated rate of errors on 2026-04-16T07:43:32.000Z (www.reddit.com)
- Claude Status Update : Opus 4.6 elevated rate of errors on 2026-04-16T06:50:56.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T17:42:57.000Z (www.reddit.com)
For all of the Claude-oriented tutorials and resources out on the web, I prefer the insights from experienced developers and software engineers here about how to think through building a project, instead of just prompt bashing until a brit…
Enterprise systems often avoid "monolithic" AI to prevent context rot and hallucinations. The standard fix is task-decoupling: splitting logic between specialized agents or deterministic code.
59 items
model roundup
GPT 5.5
On [Date], a significant leak of the OpenAI Codex model, referred to as GPT-5.5, was captured on video before it was patched. The incident involved models named Arcanine and Glacier-alpha.
I've been keeping a tally for two weeks, and Claude has recreated existing utilities 11 times across three repos. Clean code every time.
LLM for reliable tool calling/searching (www.reddit.com)
Hey, I'm making a project that uses an LLM as a "search engine." I need the LLM to use tool calling to request which category of products to search, with this pipeline: Category (LLM gets all main categories) LLM picks sub catego…
How I turned a 249 files PR into a piece of cake to review :) (www.reddit.com)
Created this quick code-review Claude plugin for myself and wanted to share it with the community :) I guess GitHub, Graphite, and others could use features like these: clustering topics in the PR, TL;DR of file changes and descriptions…
Hey everyone! I built Claude Squad, a self-hosted tool that lets multiple people share one coding session through Claude Code.
VSCode Claude extension in Dev Container auth issue (www.reddit.com)
Anyone not able to get their Claude extension authorised to their account in a VSCode Dev Container recently? I have exactly the same setup on another machine that works perfectly.
Youtube descriptions writer. How to make him perfect? (www.reddit.com)
Hey everyone, I’m currently building a custom AI Agent designed specifically for B2B YouTube optimization (Titles and Descriptions). The goal isn't just "good enough" copy—I need it to sound like a high-level strategic partner, not a gener…
I've been running a multi-agent system in production for a few months — a co-CTO agent + specialist agents (PM, dev, ops) that handle real engineering work end-to-end: design specs, code review, PR implementation, deploys, monitoring. The…