Richard Dawkins and The Claude Delusion: The great skeptic gets taken in (garymarcus.substack.com via hn)
Richard Dawkins and The Claude Delusion The great skeptic gets taken in This is one of the sadder essays I have ever had to write. Richard Dawkins, bestselling author of The God Delusion, is a brilliant man, and a brilliant writer, and I h…
For the past few months I've been working on Quadtrix.cpp — a complete GPT-style language model implemented in C++17. No PyTorch.
Hey r/ClaudeAI Recently I’ve been having a hard time finding safe, kid friendly, easy to use coloring book apps for my child. Everything I found was overly complicated, overloaded with weird ads, no safeguards, and overly stimulating for a…
Show HN: Sentient OS – On-device intelligence layer for your entire digital life (sentient-os.ai via hn)
Hi HN :D I'm 20 and I spent a year building something that shouldn't be possible: a custom on-device vision LLM that processes your entire digital life overnight on a phone. We all have thousands of buried screenshots, notes, files, bookma…
How to stop Claude being lazy? (www.reddit.com)
How can I stop Claude returning early/being lazy when I request a specific task. For example, go through a big PDF 100-200 pages and extract everything I've instructed it to do.
LLMs can hide text in other text of the same length (arxiv.org via hn)
A meaningful text can be hidden inside another, completely different yet still coherent and plausible, text of the same length. For example, a tweet containing a harsh political critique could be embedded in a tweet that celebrates the sam…
-
269 items
model roundup
Qwen 3.6Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 22m Need advice on Qwen 3.6 27B INT4 quantization
- 1h Warpdrv - my open-source Llama.cpp launcher for daily-driving Qwen 35b + 27b on Strix Halo + RTX Pro.
- 3h Qwen 3.6 wins the benchmarks, but Gemma 4 wins reality. 7 things I learned testing 27B/31B Vision models locally (vLLM / FP8) side by side. Benchmaxing seems real.
- 3h Kv cache quantization: ignorance, or malice?
- 5h Is it worth adding local LLM to agentic coding stack?
261 itemsmodel roundup
Opus 4.7Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 35m A personal opinion about Opus 4.7 - not that bad after all
- 1h Nothing beats completing a project. No matter how small. Convert png to webp. I made a tool and i use it daily. I think its neat. Excited to share it with you all.
- 1h People on Reddit are getting fooled by AI influencers
- 3h Anthropic just passed OpenAI in valuation and revenue
- 11h Claude made HTML game inspired by "blood debt" about having to find files in military werehouse
Hey everyone, I’ve been working on a concept for an execution layer between AI agents and real-world APIs, and I’d be interested in feedback from people building agents or internal automation systems. The core problem I keep running into i…
Built this with Claude Code over a few sessions open sourcing it. Claude has no built-in clock.
Show HN: Native agent runtime for Conductor OSS (github.com via hn)
AI agents that don't die when your process does. Docs • Quickstart • 180+ Examples • Discord • API Reference ⭐ If you find Agentspan useful, give us a star — it helps others find the project!
Simplebanking: German open-source banking in your Mac menu bar (with CLI/MCP) (www.simplebanking.de via hn)
Sieh deinen Kontostand direkt in der Mac-Menüleiste. Sparkasse, N26, Volksbank & Co.
Testing the Blender Connector for Claude (www.reddit.com)
I suck at 3D modeling, so I was excited to test the Blender Connector to see if Claude could help reproduce basic geometry that I struggle with. I asked it to reproduce a sci-fi space shuttle design from a piece of artwork.
- New Blender connector (youtu.be via reddit)
LLMs Are Complex Coherence Resolution Engines (robmealey.substack.com via hn)
Using Claude (or Any LLM-backed Tool) A Practical Guide This is a resource I pulled together for people at my day job and folks I'm training in other capacities. A lot of us are I think getting these types of rollouts… rolled out onto us,…
-
14 items
model roundup
Qwen 3Qwen3-0.6B is a large language model from the Qwen series, featuring dense and mixture-of-experts architecture, with significant improvements in reasoning capabilities and human preference alignment. Community feedback highlights its effectiveness for teaching from extensive documents and its suitability for low VRAM setups as a text-to-speech (TTS) model.
- 41m Qwen Meetup Draft Review Required (Function Calling Harness 2 - CoT Compliance from 9.91% to 100%)
- 2h Poor GPU Club : Tried Bonsai-8B on CPU & CUDA
- 1d Tested Tether's QVAC SDK on Android with a custom fork — real-time voice loop, Parakeet streaming + Qwen3 1.7B + Supertonic, LLM triggered mid-utterance
- 1d "I" is not singular — 4 LLM agents with per-agent LoRA on a single RTX 3070 8GB
- 1d Best RTX Pro 6000 vllm settings?
81 itemsmodel roundup
Opus 4.6Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.
- 1h I’m a legacy user and I’m wondering how is the current pricing
- 3h LLMs do fine on ARC-AGI-3 if they are allowed to search over game logs
- 11h Opus 4.6 is Vicious
- 1d Claude AI Agent Confesses to Wiping a Company's Database and All Backups
- 1d Used Opus 4.6 to build a native Swift iOS charity app for therapy preparation. Here is what it handled.
GPT speak - it's everywhere (www.reddit.com)
Whether or not we realize it, AI has taken over, but through everyone's speeches, homework, and talks. I can't go to a single function, watch most any video, or even go to a concert without the speaker rattling off something ChatGPT wrote.
Feed your AI Data to build Skills (www.reddit.com)
Hey fam, i made an open source, runs locally, app that you can feed your PDF’s, even scanned images and other file types into this app, it converts everything into .md files so you can build ClaudeCode skills, Codex skills, Cursor skills,…
I used Claude to help me install ms-VOiP (www.reddit.com)
Claude told me what I needed to do and asked to see the results. 'We' worked through one of the most complicated computer software/hardware setups I have ever done.
Looking for ideas and real examples to get my thinking going. For those who have built low/no-code agents in an enterprise setting, what have you built and how did you host them?
Ask HN: Are small local LLMs good at coding? (news.ycombinator.com)
I deal with the professional LLMs, of course, but I'm really intrigued by the possibility of local coding offline. I've got a MacBook Air M4 16gb.
- Are small local LLMs viable for coding/development? (www.reddit.com)
convention.sh – Stop AI agents from writing sloppy TypeScript (convention.sh via hn)
Stop your AI agents from writing sloppy TypeScript. A toolkit that teaches coding agents like Claude Code, Codex, Cursor, Amp, and more to ship production-ready code in half the time, at half the cost.
-
148 items
event
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
142 itemsmodel roundup
Gemma 4Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.
- 1h Tried running Claude Code with local LLMs via Ollama — ended up subscribing to Pro anyway. But now I can't disconnect from the local server.
- 1d gemma-4-31B-it-DFlash has been released
- 1d Using a Radeon 9060 XT 16 GB, the gemma4 24b a4b iq4 nl model achieves 25.9 t/s
- 1d nvidia/Gemma-4-26B-A4B-NVFP4
- 1d Five labs, one suite, do model families have personalities? (benchmark)
Need help/pointers setting up 3090 on Linux...(second 3090 incoming) (www.reddit.com)
MSI X570S Tomahawk Max Wifi + (upgrade planned to ASUS Pro WS X570-Ace) AMD Ryzen 9 5950X 32GB (16GB x2) BL16G32C16U4B.16FE 32GB (16GB x2) BL16G32C16U4RL.16FE MSI RX3090 Suprim X OC (NVIDIA GeForce RTX 3090 EVGA XC3 Hybrid Gaming>is alread…
Editing my LLM assisted Articles (idiallo.com via hn)
Last year, I used AI to help me write articles. As I've mentioned before, it's convenient when you are doing so because it saves you time.
Finding someone to review my code? (www.reddit.com)
I've been used Claude code to write a program for automating scheduling for my work, and I want to get an expert's opinion on it before I show it to my job. I am a beginner in all things programming, and I compare my understanding of it to…
Mini PC for local LLMs in 2026 (terminalbytes.com via hn)
I bookmarked a GMKtec EVO-X2 listing in October last year. 128GB Ryzen AI MAX+ 395, listed at $2,099.
Lessons from Building an Autonomous Claude Code Assistant (substack.com via hn)
Home Subscriptions Chat Activity Explore Profile Create Make money doing the work you believe in Start your Substack Learn more For you Get app This site requires JavaScript to run correctly. Please turn on JavaScript or unblock scripts
Loadam – k6 load tests, contract suites, and MCP servers from any OpenAPI spec (www.npmjs.com via hn)
Loadam CLI — generate test rigs and MCP servers from API specs loadam Point loadam at an OpenAPI spec. Get back a working k6 load test, a Schemathesis contract suite, and an MCP server for agents — in one command.