Chat GPT wrote your code, what else is missing? (blog.viewfromtheweb.com via hn)
The world has a lot more code now, since large language models (LLMs) can easily generate it. As the CEO of Anthropic Dario Amodei has predicted, AI is probably writing 90% of all the code in the world.
I built Impact Graph MCP using Claude Code. It’s an MCP server that does AST-based impact analysis for TypeScript codebases, so Claude can tell you things like “if I rewrite loginUser, what else breaks?” What it does: You give it a functio…
Finding WhatsApp Group JIDs for Agent Routing (Post-Update Fix) (www.reddit.com)
If you are building agents that interface with WhatsApp groups, you probably noticed that recent UI updates have hidden the '@g.us' JIDs from the DOM, making them hard to find for your config files. I ran into this while setting up per-gro…
So, with this project I want to see if a length constrained (like 64 tokens only) quality summarization can be done by tiny LLMs using GRPO! https://preview.redd.it/6f3tou9xhixg1.png?width=2816&format=png&auto=webp&s=c0b11ea7c387c1e84e1ad2…
Been vibe coding full-time for a few months. One workflow question I haven't nailed down yet: how do you decide which model to use for which task in Claude Code?
Show HN: Axle – a11y/WCAG CI that proposes real source-code fixes via Claude (axle-iota.vercel.app via hn)
Ship accessible code, automatically. Every PR scanned for a11y / WCAG violations, real source-code fixes proposed via Claude, lawyer-ready artifacts included.
Designing for Agents (twitter.com via hn)
If you spend time in the same corner of X as I do, scrolling past the "How I built a second brain with Obsidian" and "Anthropic just KILLED [insert industry] FOREVER" posts, you've probably also seen the take that UI is dead. And unless a…
- Designing agents to purchase products? (www.reddit.com)
You guys are rich (www.reddit.com)
A survey by Epoch AI and Ipsos found that 80 percent of US adults who use Claude live in households earning more than $100,000 a year.
-
56 items
event
Altman AttackSam Altman, CEO of OpenAI, has faced multiple attacks on his home in San Francisco, including firebombing and drive-by shootings, raising concerns for his safety. Additionally, a majority of over 100 people interviewed by Ronan Farrow described Altman as a "pathological liar.
- 2m Musk and Altman's bitter feud over OpenAI to be laid bare in court
- 1h OpenAI CEO Apologizes for Not Warning Authorities About Mass Shooting Suspect
- 4h Sam Altman May Control Our Future—Can He Be Trusted?
- 17h How to Attend the Altman vs. Musk Trial
- 21h OpenAI CEO Sam Altman apologizes for not flagging mass shooter to police
45 itemsmodel roundup
DeepSeek 4DeepSeek-V4-Pro is a 1.6T parameter Mixture-of-Experts model supporting one million-token context, with significant improvements in efficiency and stability through hybrid attention and manifold-constrained hyper-connections. Community highlights include its cost-effectiveness via the official API and exceptional performance in large code change evaluations, with some noting its surprisingly robust output capability despite a 384K max token limit.
- 35m is Deepseek v4 unvailable in Cursor? I cannot see it.
- 40m llama.cpp DeepSeek v4 Flash experimental inference
- 4h The exact KV cache usage of DeepSeek V4
- 6h DeepSeek's new model is 75% off right now, here's how to take advantage
- 11h DeepSeek-V4 on Day 0: From Fast Inference to Verified RL with SGLang and Miles
Free llm APIs from Nvidia (www.reddit.com)
So build[.]nvidia[.]com[/]models give access to free APIs for llms ranging from SLMs to frontier models. I tried building with it and let's say the APIs are so slow to respond.
Hello everyone, I usually vibe-code in a fairly simple setup: I work inside an agent interface, review the changes, and then manually test everything from both a design and functionality perspective. For context, I’m building mobile apps.
Best Voice AI stack for India (not calling bots, just voice agents) (www.reddit.com)
Hey folks, I’m building a product in India where users interact with an AI agent using voice (like talking to an assistant to get tasks done). I’m specifically looking for the best voice AI stack for Indian use cases especially for things…
I'm set on the 128GB M5 Max, and deciding between storage options (2TB or 4TB)? Question: What have been your actual LLM workflow centric storage requirements?
Is Claude mocking us while taking a dig at Gemini? (www.reddit.com)
could not extract summary
Could creativy in LLM emerge by reframing language? (news.ycombinator.com)
Deleted.
Using PaddleOCR-VL-1.5 with llama-server for book OCR (www.reddit.com)
I've been running PaddleOCR-VL-1.5 via llama.cpp's server for OCR on book pages. It handles complex layouts, tables, and mixed text/figure pages surprisingly well.
If you've built anything on top of the OpenAI API and tried to enforce business rules via system prompts, you know the frustration: the model sometimes just ignores them. We built Caliber to tackle this.
-
175 items
model roundup
Qwen 3.6Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 1h [Qwen3.6 35b a3b] Used the top config for my setup 8gb vram and 32gb ram, and found that somehow the Q4_K_XL model from Unsloth runs just slightly faster and used less tokens for output compared to Q4_K_M despite more memory usage
- 2h Benchmark: Windows 11 vs Lubuntu 26.04 on Llama.cpp (RTX 5080 + i9-14900KF). I didn't expect the gap to be this big.
- 4h qwen3.6 27b poor experience
- 4h Vs code extension
- 8h Qwen3.6-27B-FP8 - JS file is too long and causing JSON truncation
4 itemsmodel roundup
Sonnet 4.5Anthropic has kept Claude Sonnet 4.5 available after its retirement due to user demand, while open-source models like DeepSeek V4 are catching up in capabilities, which remain several months behind closed lab versions.
Get access to powerful AI tools and cloud storage with Google AI Pro (Gemini) at a highly affordable price. This plan includes advanced features powered by Google Gemini, giving you smarter assistance for writing, coding, research, and con…
Hey everyone, I’ve been building a multi-agent system in my spare time, and I just open-sourced the repository. I was getting tired of the standard text-in/text-out chat paradigm and wanted to build a genuinely situated AI—one that actuall…
Thinking Outside the Box: New Attack Surfaces in Sandboxed AI Agents (www.lasso.security via hn)
The rapid adoption of always-on autonomous agents projects like OpenClaw has triggered a parallel arms race in the security industry. As these agents gain the ability to write code, access personal files, and operate indefinitely, the imme…
Can LLMs Scale to AGI? (news.ycombinator.com)
is there any good argument in favor or against, I have read a lot but there is always the same argument like "they do not think", "they are next work predictors", "they are not biological". Which makes sense to some extent but does not exp…
Show HN: OpenClaw but Efficient and with an SDK (www.npmjs.com via hn)
OpenClaw but efficient and with an SDK fastyclaw A small, fast local AI agent server with HTTP/SSE transport, a full tool-call loop, and a proper TypeScript client SDK. Run it once, talk to it from anywhere — your terminal, a script, a Wh…
What to do? Cursor stuck during "Grepping" (www.reddit.com)
I just encountered this recently. For some reason, Cursor consistently is stuck on "Grepping" Even if I stop it and resume by prompting "continue" - it'll continue to Grep and it's just stuck.
Maybe some of you have already figured out the perfect way to do this or found the best apps for linking them.
Is it good to use big files for project memory? (www.reddit.com)
Hi guys, I’m a gpt user slowly approaching to Claude and wondering few things. Using projects for long creative tasks (stories, book writing, and so on), I use some big pdf as memory for the project.
I just released Pocket LLM v1.5.0🚀 New in this release: - 🎙️ Voice input - 🖼️ Image input with OCR, Gemma vision, and FastVLM support - 📷 Camera capture with retake, crop, and photo review - 🗂️ Previous chats side panel - 💾 Downloaded mode…
OpenAI Privacy Parser (github.com via hn)
OpenAI Privacy Parser OpenAI shipped Privacy Filter — a model that hides PII in text. Defenders use it so data doesn't leak.
- OpenAI Privacy Filter (openai.com via hn)
- OpenAI Privacy Filter (huggingface.co via hn)