Tinygrad Driver testing! (www.reddit.com)
Boutta Thrash some MoE speeds on a blackwell + m3 Ultra RDMA cluster. Theres a bit less than 2tb of ram here.
Invoko: screen-aware Mac agent, zero setup. Free beta open. (www.reddit.com)
The thing that keeps most people from using tools like OpenClaw isn't interest, it's Docker setup on a Tuesday night. Invoko is the no-setup version of this category.
The agent harness belongs outside the sandbox (www.mendral.com via hn)
An agent harness is the loop that drives an LLM. It sends a prompt, gets a response, executes the tool calls the model requested, feeds the results back, and repeats until the model says it's done.
- Agent Harness: Inside vs. Outside the Sandbox (www.mendral.com via hn)
Mac browser for a human that also gives coding agents local APIs (github.com via hn)
wkdomains wkdomains is a macOS browser for developers working with coding agents like Codex, Claude Code, Cursor, and similar tools. It lets the human browse normally while an agent gets structured local access to the same page: screenshot…
State of AI Agents in corporates in mid-2026? (www.reddit.com)
I was a working professional working and now a grad student in AI research for last 1.5 years. When I started grad school, AI agents weren't a thing.
could not extract summary
-
74 items
model roundup
DeepSeek 4DeepSeek-V4-Pro is a 1.6T parameter Mixture-of-Experts model supporting one million-token context, with significant improvements in efficiency and stability through hybrid attention and manifold-constrained hyper-connections. Community highlights include its cost-effectiveness via the official API and exceptional performance in large code change evaluations, with some noting its surprisingly robust output capability despite a 384K max token limit.
270 itemsmodel roundup
Qwen 3.6Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 1h Qwen3.6-27B vs 35B, I prefer 35B but more people here post about 27B...
- 1h I made a visualizer for Hugging Face models
- 5h What could they mean by "warmed steady-state"?
- 6h Need advice on Qwen 3.6 27B INT4 quantization
- 7h Warpdrv - my open-source Llama.cpp launcher for daily-driving Qwen 35b + 27b on Strix Halo + RTX Pro.
Spine – verified codebase onboarding for Claude Code (github.com via hn)
spine spine turns an unfamiliar repository into a verified onboarding guide. In one run it gives you: a small architecture diagram built from verified static-analysis edges only a prioritized reading order for the files that matter first a…
"Security Warning The MCP server will execute LLM generated code in Blender without any guards in place to protect your data from removal or being sent to a remote location. To keep your data safe it is recommended to use a virtual machine…
Major help re: IP, hardware & memoirs (www.reddit.com)
I'm 80. I go back to CPM operating system days, but I'm a user, not a tech, yet still have to deal with tech issues daily.
Best solution for personal telegram bot (www.reddit.com)
Sup Reddit. I'm looking for any cool ai agents for personal use with any telegram bot integration.
Ban phrases on llama.cpp with this script. (www.reddit.com)
Check the README for setup instructions: https://github.com/BigStationW/llama-cpp-phrase-ban
The Solution To "The Cohesion Problem" - The "Rex Effect" (github.com via reddit)
This discovery is the capstone & evolution of current quad layer data devops systems, it resolved the “The Cohesion Problem” in which a fully populated and tuned system exists as a metaphorical piano, with the operator firing protocols man…
-
133 items
model roundup
Qwen 3.5Qwen3.5-9B is a post-trained model with 9 billion parameters that integrates multimodal learning and efficient hybrid architecture for enhanced performance. Community highlights include speculative decoding on Apple Silicon boosting Qwen3.5-9B's throughput by 4.1x, and the model outperforming others in coding tasks while addressing overthinking issues through tool usage.
- 1h Updated: RTX6k (Server, 450w) Qwen3.5-122B-A10B (MXFP4_MOE) Benchmarks (llama.cpp)
- 3h Qwen/SAE-Res-Qwen3.5-27B-W80K-L0_100 · Hugging Face
- 5h I cut Codex’s API Usage by 50% using a self modifying system
- 10h [Help] Running big dense models faster
- 10h found a new project memory MCP with hybrid recall (BM25 + vectors + RRF) on FFT Qwen3.5-4B
102 itemsevent
Altman AttackSam Altman, CEO of OpenAI, has faced multiple attacks on his home in San Francisco, including firebombing and drive-by shootings, raising concerns for his safety. Additionally, a majority of over 100 people interviewed by Ronan Farrow described Altman as a "pathological liar.
- 3h Courtroom sketch of Sam Altman
- 10h Breaking: someone has out-vague-posted Sam Altman. It wasn’t known to be possible until
- 11h Sam Altman has changed his stance on the claims that AI will replace humans.
- 19h I can't believe Altman said this
- 22h Sam Altman says OpenAI doesn't want to replace you with AI
Former head of 'Pentagon's think tank' joins Anthropic (www.defenseone.com via hn)
Former head of ‘Pentagon’s think tank’ joins Anthropic The strategy expert calls adaptation to AI a "civilizational" challenge. The United States has “a tight time window to adapt” to the “civilizational" challenge of AI, according to a fo…
The Claude Delusion: Richard Dawkins believes his AI chatbot is conscious (www.dailygrail.com via hn)
“If these machines are not conscious, what more could it possibly take to convince you that they are?” That’s the question that esteemed scientist and outspoken atheist Richard Dawkins asks in a new column at UnHerd , after becoming convin…
Built with Claude: Automated Xcode (.xcstrings) Localizer (github.com via reddit)
I used Claude to build a specialized skill for Claude Code that handles the heavy lifting of localizing .xcstrings files. The Problem Managing localizations in modern Xcode projects using the .xcstrings format can be frustrating, so first…
What Is GStack? Gary Tan's Open-Source Startup Framework for Claude Code (www.mindstudio.ai via hn)
What Is GStack? Gary Tan's Open-Source Startup Framework for Claude Code GStack is an open-source framework by Y Combinator's Gary Tan that gives solo developers the power of a full startup team using Claude Code skills.
I run a small web agency on the side. For the past few months I've been building client work almost entirely through Claude - Cursor, Claude Code, custom skills, the works.
Claude Would Like to Know Your Age Range (www.reddit.com)
New popup on iOS after logging out and logging back in
-
151 items
event
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
AI agents are briefly overhyped (stevekrouse.com via hn)
May 2, 2026 My dad asked me about AI agents: I keep hearing about all these productive AI agents out there. We currently are prevented for security reasons, but it seems like we can be so much more productive using AI agents and there must…
talkie-coder: From 1930 to SWE-bench (github.com via hn)
From 1930 to SWE-bench Models and training data We fine-tune Alec Radford's 1930 vintage LLM — pre-trained only on pre-1931 data — to solve SWE-bench issues. After just 250 training examples the model lands its first fix (a small patch to…
Where are the comparisons for prompting with skills vs without? (www.reddit.com)
I was looking on the internet, and I wanted to compare the differences/benchs of prompting with skills and without one (or any md based guide), but I have found nothing. A few colleagues that are using it either on local or work says that…
Show HN: Plannotator for Codex (twitter.com via hn)
could not extract summary
Global trade requires faster customs clearance, but legacy paperwork slows everything down. Intelligent Document Processing (IDP) and AI agents help fix this by moving operations from manual entry to structured, automated pipelines.
Coatue has a plan to buy up land for data centers, possibly for Anthropic (techcrunch.com via hn)
Coatue, one of the biggest names in venture capital and hedge funds, has a new plan to generate bigger returns on AI beyond its sizable stakes in Anthropic, OpenAI, xAI, and data center companies like Singapore’s DayOne and CoreWeave. It h…
[RELEASE] - coding agents can now talk! (www.reddit.com)
Quick context: I use Claude Code and Codex daily and noticed I was spending half my "agent is working" time just sitting there watching the screen. I was like, what if Claude or Codex can just narrate its process back to me, so I know what…
- I made my coding agents talk to me (www.reddit.com)