What is the best all-round local model? (www.reddit.com)
Not for agentic coding but for help in conversational style write-ups like markdown documentation (not code-related). Constraints are 64GB unified memory, obviously local.
- Local AI is the best (www.reddit.com)
Open Design: Use Your Coding Agent as a Design Engine (github.com via hn)
Open Design: the open-source alternative to Claude Design. Local-first, web-deployable, BYOK at every layer: 11 coding-agent CLIs auto-detected on your PATH (Claude Code, Codex, Cursor Agent, Gemini CLI, OpenCode, Qwen, GitHub Copilo…
My AI bot made scammers quit (www.reddit.com)
Got a romance scammer last Tuesday asking for grocery money. Set my Claude agent loose on them instead of blocking.
Claude has a friend? (www.reddit.com)
who is its friend? I’m afraid to ask.
- Claude.md (gist.github.com via hn)
- What do you do with Claude? (www.reddit.com)
Hi! I’m currently retraining in data science and my current laptop is an 8 GB MacBook Air, so naturally I’m looking to upgrade.
Was hitting my weekly Pro limit by Wednesday every single week. Tried compact, Sonnet for simple tasks, tighter prompts — nothing worked.
Difference between Claude the app and Claude Code (www.reddit.com)
I'm by no means a coder, but I use Claude to edit my presentations and write emails and manuscripts. I saw that using Claude Code is more efficient than the app. Is this true, and do I need to switch to claude app ???
- Claude Code App? (www.reddit.com)
What are some good use cases for Gemma Embedding 2? (www.reddit.com)
Does anyone know of any use cases of Gemma Embedding 2? Or is it solely for search?
Dispatch up and running (www.reddit.com)
With all the talk of OpenClaw and Hermes, I first wanted to test how good the dispatch beta is from Claude. Got it up and running on my Mac mini so it’s always on and a few initial observations.
Cowork (event, 145 items)
Issues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
- 6h Reset after 4h - how handle it when task on in cowork?
- 8h Question: Use case of Cowork going to a webpage run a property analysis then download the pdf.
- 11h When to use Claude Cowork vs Claude Code
- 16h The most practical guide on how to secure claude Cowork
- 19h Claude Cowork use case: Automating repetitive browser work
Qwen 3.6 (model roundup, 263 items)
Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 1h What's your tps on 3090 + Qwen 3.6 27B in real tasks?
- 1h We are finally there: Qwen3.6-27B + agentic search; 95.7% SimpleQA on a single 3090, fully local
- 3h Qwen 3.6 27b MTP vLLM
- 4h Qwen3.6-27B at 72 tok/s on RTX 3090 on Windows using native vLLM (no WSL, no Docker), portable launcher and installer
- 4h Create Plan.md with Claude Code Opus, Execute Plan.md locally in Open Code using Qwen 3.6 27B Q8
I made an Idea Workflow skill set for Hermes Agent (www.reddit.com)
I made a new open-source Hermes Agent skill set: It’s called **Hermes Agent Idea Workflow**. The goal is to handle the pre-build phase: turning rough ideas into structured design docs, implementation specs, and agent-ready build handoffs b…
False, unauthorized charges (www.reddit.com)
My Visa debit card is billing me for charges from Cursor. Why?
- Unauthorized charges (www.reddit.com)
https://aijobanswers.com/
AI is already in “soft control” (www.reddit.com)
Theory: control doesn’t require consciousness. It just requires better decision-making at scale.
Show HN: A navigable map and recommender for 17M music entities (toposonico.com via hn)
Hello HN, This is toposonico, a music recommender and navigable map. At core it's a skipgram word2vec model trained over ~6M playlists.
It’s a Weird Time to Be Named Claude (www.bloomberg.com via reddit)
The once-rare name is now shared with Anthropic’s fast-growing AI assistant — leaving the humans called Claude to adjust.
- It's a Weird Time to Be Named Claude (www.bloomberg.com via hn)
I kinda got bored of claude code's sudo commands failing. I know it's by design and honestly workarounds are all worse than the problem.
Most agentic-commerce demos I see online are a single agent plus RAG over a product catalog. That shape works for a 200-SKU demo.
My product works a little too well (www.reddit.com)
Or, rather, I'm giving the value away on the free tier. I built my product fully with Claude in March and launched 5 weeks ago.
DeepSeek 4 (model roundup, 74 items)
DeepSeek-V4-Pro is a 1.6T parameter Mixture-of-Experts model supporting one million-token context, with significant improvements in efficiency and stability through hybrid attention and manifold-constrained hyper-connections. Community highlights include its cost-effectiveness via the official API and exceptional performance in large code change evaluations, with some noting its surprisingly robust output capability despite a 384K max token limit.
- 1h 127³ — Superintelligence, public. DeepSeek V4 Pro
- 14h How can I locally run Deepseekv4 1.6T? I can use a VPS.
- 18h DeepSeek v4, and the end of the OpenAI/Microsoft AGI clause
- 1d DeepSeek V4 Flash as a cheap worker in your LLM stack: $0.0003/call via MCP, swappable endpoint
- 1d I hate this group but not literally
You ask Cursor to use a library. It invents functions that don’t exist.
Claude's got friends now (www.reddit.com)
Claude just casually pulled the "i know someone who does this" it really does act like us
I built dryscope, a Python CLI that can install as a Claude Code skill. What it does: scans a repo for duplicate-code candidates, finds repeated documentation sections, detects documentation intent overlap, produces a shortlist of files/secti…
Effort level change invalidates cache? (www.reddit.com)
Was running some experiments with the output config: effort level setting in the Claude Messages API with prompt caching and discovered something strange. When you change effort level in a multi turn conversation, the new request can only…
Ask HN: Try npx -y sharedmemory/MCP-server (news.ycombinator.com)
Building Shared Intelligence for teams and AI Agents
I have been using the Pro subscription for a long time now and honestly even with its ups and downs it has been worth the price. By worth the price I mean that at this current point in my career, the $200 per month is offset by whatever…
WebLLM is a high-performance in-browser LLM inference engine (github.com via hn)
WebLLM is a high-performance in-browser LLM inference engine that brings language model inference directly onto web browsers with…
The Download: a new Christian phone network, and debugging LLMs (www.technologyreview.com via hn)
Plus: Elon Musk has admitted that xAI trained Grok on OpenAI models. This is today's edition of The Download, our weekday newsletter that provides a daily dose of what's going…
Hello, this model/quant is my daily driver, and I wanted some reference benchmarks for comparing my setup with one that is 3x more expensive and 4x more power-hungry. Results first, methodology after, link at the end with all results. Model:…