LLMs are breaking 20 year old system design (zknill.io via hn)
The ‘cloud-native’ architecture of the last decade is built on a 20-year-old assumption: that state lives in the database, and compute is stateless. If you want to scale, you scale the database vertically (get a larger machine) [1][1] or d…
Show HN: Monghoul – Desktop MongoDB GUI with schema-aware autocomplete and MCP (monghoul.com via hn)
Last year I decided to start a fun side project - a love child of VS Code and NoSQLBooster. I wanted a GUI that looks modern and snappy, minimal, not like 2003 MS Excel with dozens of buttons and dropdowns everywhere.
what model are you using for your personal AI agent? (www.reddit.com)
Hey everyone, I’m building a small AI agent for personal use and I’m trying to figure out which model actually feels best in day to day usage. I’ve been testing ChatGPT, Claude, Gemini and a few open-source ones, but I keep changing my min…
Disclosure first: I'm the author. MIT, runs locally, zero telemetry.
-
349 items
model roundup
Opus 4.7Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 4m How to get Opus to be less pro-active?
- 2h issue with opus 4.7
- 2h 10+ days of silence from Anthropic support — Max plan ($200/mo) and locked out of Claude Design
- 3h Claude can now follow ~500 instructions, up from ~150 a year ago
- 9h Is Opus 4.7's attention degradation a training direction problem? Some observations from heavy use
235 itemsevent
CopilotMicrosoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
- 9m Show HN: AgentKanban for VS Code – A task board with agent harness integration
- 3h When a client wants to deploy an LLM internally but their data governance is a mess, do you take the engagement and fix the data first, or walk away?
- 3h How do you write a bug ticket differently now that you know an AI agent might pick it up before a human does?
- 4h Microsoft patched 137 bugs, but the Azure AI Foundry one is what caught my eye
- 4h AI coding agents genuinely changed how fast small products get built
Text: I built Elecz because LLMs regularly hallucinate electricity prices, cheapest charging hours, and energy contract data. Elecz is a read-only MCP server and REST API for real-time electricity data across 40 countries and 100+ bidding…
What is your AI agent constantly doing wrong? (news.ycombinator.com)
My colleague and I have found AI is consistently falling short on the following things. I'd be interested to know whether this is a prompting issue or a model issue (we use claude 4.7): - comments being too verbose, and making reference to…
- What am I doing wrong? (www.reddit.com)
- What am I doing wrong? (www.reddit.com)
- What am I doing wrong? (www.reddit.com)
OpenAI faces lawsuit claiming chatbot gave advice that led to fatal overdose (www.reuters.com via hn)
paywalled
i'm a bit confused which service is best for what, how to think about token usage for the different usecases. is there benefits to running multiple, is there a good setup like paperclip + hermes for coding vs gstack + openclaw for recurrin…
-
137 items
model roundup
GPT 5.5On [Date], a significant leak of the OpenAI Codex model, referred to as GPT-5.5, was captured on video before it was patched. The incident involved models named Arcanine and Glacier-alpha.
- 9m What is more efficient to do?
- 14h ChatGPT Thinking Loop: No response is received from GPT-5.5 Thinking (Standard)
- 17h Agentic harness for theoretical physics research
- 20h OpenAI gives European companies access to its latest model GPT-5.5-Cyber
- 1d GPT-5.5 was used to flag fatal errors in FrontierMath problems
177 itemsmodel roundup
Gemma 4Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.
- 20m Model for reverse engineering
- 2h Does THINKING MODE significantly improve translation?
- 3h Q: Does DFlash (and PFlash) work with Heretic models?
- 3h How many of you tried BeeLlama.cpp? How's it? Agentic coding possible with 8GB VRAM?
- 5h Qwen3.6:27b single-shot fixed a CSS UI bug that had Gemma4:26B doom looping uselessly for 15 minutes
Sam Altman Testifies That Elon Musk Wanted Control of OpenAI (www.nytimes.com via hn)
Pinned Here’s what happened in the trial on Tuesday. Image Sam Altman, OpenAI’s chief executive, spent the day on the witness stand in a federal court in Oakland, Calif., on Tuesday.Credit...Jason Henry for The New York Times Before Elon M…
- Sam Altman testimony: Musk wanted 'total control' of OpenAI to pass to his children (www.businessinsider.com via reddit)
- What Elon Musk's Clash with Sam Altman of OpenAI Is About (www.nytimes.com via hn)
- Sam Altman and Elon Musk (www.reddit.com)
Small community team - shared account? (www.reddit.com)
I run a small online community team of 1 ft and 2 pt employees. How possible is it to share a Claude Pro licence?
A fully autonomous browser runtime for any AI agents (github.com via reddit)
Built (with Claude) an open source, fully autonomous browser runtime for agents. One critical issue I faced (I guess most of us do) is the inability to have a robust web search feature and this will help you direct towards that goal I hope…
is live streaming the next big distribution layer for AI and media? or just hype curious what everyone here thinks
-
415 items
model roundup
Qwen 3.6Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
228 itemsevent
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
- 1h Ideas to automate Teams Meeting transcripts to Cowork-meeting intelligence ideas?
- 7h Claude Desktop + Cowork on Windows without admin rights, possible?
- 8h Claude Code and cowork usage statistics
- 8h How do you back up your Claude cowork files ?
- 13h Anthropic says newest lawyer tools are 'like giving an engineer a legal degree'
Local Transformer Language Model Running on GameBoy Color (github.com via hn)
gbc-transformer TinyStories-260K running locally on a stock Game Boy Color. This is a proof-of-concept GBDK-2020 ROM that runs a quantized transformer language model on the Game Boy Color CPU.
Anthropic rolled out Claude For Legal (May 12), adding practice-area plugins for commercial, employment, privacy, product, corporate, and AI governance law. The release also includes MCP connectors to tools lawyers already use: DocuSign, I…
Cross devices agent memory and context management? (www.reddit.com)
Hey, developers. Imagine you have 2 macs, one at your job, one at your home.
Paid for Cursor Pro but no access (www.reddit.com)
I've paid for Cursor Pro and it did not give me pro access I even paid a second time, it took my money but didn't give me pro access. Support is not responsive.
-
194 items
event
Anthropic MythosAnthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
I’ve been experimenting with a local control-plane for coding agents, and I’d love serious critique from people building real agent workflows. The problem I kept running into: - agents forget the original project intent after long sessions…
working in a typescript monorepo with 200+ files and claude keeps hitting context limits when i need it to understand module relationships. tried chunking, separate chats for different parts, even wrote my own context manager.
One unexpected thing about AI agents: They’re forcing companies to realize how much of daily work was never actually structured in the first place. A lot of “processes” turn out to be: random Slack messages undocumented approvals tribal kn…
In a trial pitting him against Elon Musk, nobody has more to lose than Altman (www.latimes.com via hn)
In a trial pitting him against Elon Musk, nobody has more to lose than OpenAI CEO Sam Altman - Click here to listen to this article - Share via In a trial featuring a clash between Elon Musk and OpenAI CEO Sam Altman, neither of the tech t…
Show HN: HYPD – AI co-pilot for marketers running Google Ads (www.hypd.ai via hn)
We've been building HYPD for the last 1 year together with a small team in Berlin. It's an AI co-pilot (chatbot) for PPC freelancers and agencies.
Can the new Agents overview spawn sessions in worktrees? (www.reddit.com)
The new FleetView / Agents dashboard (the "describe a task for a new session" input) makes it easy to fan out parallel sessions, but every new session inherits the parent's cwd — same git checkout, no isolation. I work in a monorepo and us…
AI agents that need real social context (www.socialcrawl.dev via hn)
Scrape social media data from 27 platforms with one API The unified social media scraper API. One schema, one request — clean, structured data from TikTok, Instagram, YouTube, and every other platform in our 27-platform lineup.