9 min read Mar 31, 2024 -- After my latest post about how to build your own RAG and run it locally. Today, we’re taking it a step further by not only implementing the conversational abilities of large language models but also adding listen…
Claude Code plugin for designing modular systems (github.com via hn)
Modularity Skills TL;DR: A Claude Code plugin for designing and analyzing modular software systems using the Balanced Coupling model. There's no shortage of AI tools that provide code-level feedback: best practices, edge cases, potential b…
Creating specs for existing code to help with massive code change ? (www.reddit.com)
Hi all, Very much a noob when it comes to claude. We've a rather large and complex codebase (with a LOT of spaghetti) and we're working on a fairly complex change that affects nearly the entire codebase.
My api usage suddenly shoot to 100% from 66% or 67% (www.reddit.com)
https://preview.redd.it/81ho6nenkoxg1.png?width=1050&format=png&auto=webp&s=2d29bc4b066555b586a98e17ad618bb502f4b58e - on saturday my api used was at 66% or 67% - today (monday) i opened my cursor and updated it to, Version: 3.2.11 VSCode…
-
53 items
model roundup
DeepSeek 4DeepSeek-V4-Pro is a 1.6T parameter Mixture-of-Experts model supporting one million-token context, with significant improvements in efficiency and stability through hybrid attention and manifold-constrained hyper-connections. Community highlights include its cost-effectiveness via the official API and exceptional performance in large code change evaluations, with some noting its surprisingly robust output capability despite a 384K max token limit.
- 1h How will you scale these models
- 4h DeepSeek V4 is about to be open-sourced—effectively revealing all the secrets behind the magic. How will other players in the field respond?
- 5h Language Anchoring: A Systematic Method for LLM Multilingual Adaptation
- 7h No GGUFs for DeepSeek V4-Flash as yet?
- 7h Deepseek v4 flash weird sizes?
64 itemsmodel roundup
GPT 5.5On [Date], a significant leak of the OpenAI Codex model, referred to as GPT-5.5, was captured on video before it was patched. The incident involved models named Arcanine and Glacier-alpha.
Study the 50 leaked LLM Interview Questions with my lil learning App (boguslavskyy.com via hn)
Gamified learning for AI/ML interviews. 5-step method: understand, quiz, vocabulary, write, speak.
The cost math behind routing Claude Code through Ollama (~90% cut) (github.com via hn)
Use Ollama to Enhance Claude — Two-Engine Setup Pair Claude Desktop on Anthropic with Claude Code routed through Ollama in your terminal. Strategy stays on Pro.
Claude project running for hours - meaning?? (www.reddit.com)
I have often read people posting about their projects saying after hours of running or after 2 days of running, claude came with a solution. In my personal experience, i haven’t ever stumbled upon a situation where claude even took an hour…
Everything that went wrong with Claude (clawd.rip via hn)
Music Publishers Drag Claude Into Court Universal, Concord, and ABKCO sued Anthropic, alleging Claude was trained on copyrighted lyrics and could reproduce lyrics from hundreds of songs. The 'constitutional AI' company got its first big co…
-
121 items
model roundup
Gemma 4Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.
44 itemsmodel roundup
Sonnet 4.6Sonnet 4.6, a new release noted for its "unhinged" behavior, has sparked discussions among users about unexpected changes in software performance and cost management strategies involving Cursor and Claude APIs.
- 41m Does Claude have access to things pasted in the text box but not sent?
- 5h Does higher effort make Claude refuse more? CVP Run 5 with Opus 4.6 Medium and High
- 1d Opus 4.6 vs Sonnet 4.6
- 1d Claude's sonnet 4.6's clarifying questions...How to read?
- 1d Does effort tier change refusal behavior on agent-attack prompts? CVP run 4 with sonnet 4.6 high and max efforts.
I want AI to guide me in driving (www.reddit.com)
I often struggle to drive in narrow roads. It happened like 3 times this month.
So ive setup claude code on my ubuntu homelab machine & typically SSH into my homelab & run claude code thru a windows terminal from my main PC. I linked claude to the app on my iphone.
OpenAI boss 'deeply sorry' for not telling police of mass shooting suspect's account The leader of OpenAI has apologised for the company not going to police with information on a ChatGPT account that belonged to the person accused of a mas…
I've been building a thing called Fathom. It's a partly-Claude-based agent that's been running since January, changing my mind about how it should work as it helps me build itself.
-
109 items
event
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
59 itemsevent
Altman AttackSam Altman, CEO of OpenAI, has faced multiple attacks on his home in San Francisco, including firebombing and drive-by shootings, raising concerns for his safety. Additionally, a majority of over 100 people interviewed by Ronan Farrow described Altman as a "pathological liar.
- 1h OpenAI Leadership Overruled Staff Warnings to Report School Shooter to Police
- 7h OpenAI tries to explain its AGI philosophy as Sam Altman admits the company deserves scrutiny
- 15h Elon Musk's legal battle with OpenAI and Sam Altman will head to trial
- 20h Musk and Altman's bitter feud over OpenAI to be laid bare in court
- 21h OpenAI CEO Apologizes for Not Warning Authorities About Mass Shooting Suspect
Txtfold – summarize large files for LLMs (github.com via hn)
txtfold Identifies repeated patterns and surfaces outliers in large log files and structured data. Converts thousands of lines into a human- or LLM-readable summary.
Claude Code can now watch videos... [+4 AMAZING Use cases] (www.reddit.com)
Quick context: Claude can see images but can't stream video. That kept blocking me on a bunch of workflows, so I built a skill that fakes it.
I told Claude I needed to cut down my cholesterol and that I was pre-diabetic based on my last annual check-up. I also mentioned that past diets have failed me because they were torture.
What are people using Browser Based Agents for ? (www.reddit.com)
Curious to see different verticals where people are deploying browser based agents in production. Is it just for realtime search and data extraction or also some end to end workflow automations?
-
14 items
model roundup
Claude 4.7Users of Claude 4.7 are reporting issues where the AI frequently checks for malware in their files, even during normal tasks. This behavior has been observed across multiple projects, including one using Next.js.
- 1h Claude 4.7 vs. ChatGPT 5.5
- 16h Claude 4.7 named a journalist from 125 words of unpublished writing
- 2d Tell HN: Claude 4.7 is ignoring stop hooks
- 5d Claude 4.7 blocks cyber prompts: before the fact vs. after the fact
- 5d non-benchmaxxed fun AI question with Terminator reference - I think Claude won
204 itemsmodel roundup
Opus 4.7Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 2h Deferring Planned Items
- 7h Claude Code started to use with me very specific words it was not using before
- 13h what are your strats for being efficient with opus 4.7 max?
- 16h Tell HN: Claude Code is unable to respond to this request
- 16h Claude Opus 4.6 vs. Opus 4.7 Effort Levels and Prompt Steering Benchmarks
Claude AI vs Claude Code vs models (this confused me for a while) (www.reddit.com)
I kept mixing up Claude AI, Claude Code, and the models for a while, so just writing this down the way I understand it now. Might be obvious to some people, but this confused me more than it should have.
- Switching Models in Claude Code? (www.reddit.com)
Cursor Deleted Railway Production Volume and Backups (twitter.com via hn)
A 30-hour timeline of how Cursor's agent, Railway's API, and an industry that markets AI safety faster than it ships it took down a small business serving rental companies across the country. I'm Jer Crane, founder of PocketOS.
Show HN: MemOperator-4B (huggingface.co via hn)
Memory Operator is a specialized language model developed for MemOS, designed to handle memory-related operations. Its core capabilities include memory extraction, integration, and update.
The official uninstall instructions provided by Anthropic result in a left over app on Mac OS named "Claude Code URL Handler" This is sloppy and not what I asked for. I'm right to push back.
Making a Landing Page Work for Both Humans and AI Agents (docsalot.dev via hn)
I spent this week redesigning my landing page. The surprising part was not the typography, the footer, or the mobile nav.
Show HN: Another experiment with an Erdos problem and LLMs (news.ycombinator.com)
Background: I am a coder, not a mathematician, but I was quite entertained by this story: https://news.ycombinator.com/item?id=47903126 I wondered how far I could get by just choosing a random open problem and throwing it at LLMs. Disclosu…