I currently use the Official Claude Code plugin in VS Code and have Claude Code installed natively on Windows 11 + Powershell. I went with the below Pwsh command as shown here: irm https://claude.ai/install.ps1 | iex I am leaning towards s…
What the Benchmark Cannot See (exoskeleton.ghost.io via hn)
What the Benchmark Cannot See AI progress became publicly visible through benchmarks: scores, leaderboards, deltas, and claims that could travel. But every machine for seeing intelligence also teaches us what not to see.
ArmyClaw = Make your Claude Code subscription 100x more productive. (www.reddit.com)
ArmyClaw: 24/7 Agents on Your Existing Claude Code Subscription Want 24/7 OpenClaw-style agents but on your existing Claude Code subscription? Meet ArmyClaw.
I’ve been obsessed with autonomous agents lately, but I got tired of them hitting walls because they didn't have the right "tools" or because their context window turned to mush after an hour. The main idea is to move away from "AI as a ch…
-
154 items
event
Anthropic MythosAnthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
- 9m After Mythos, Nobody Is Safe from Cybersecurity Threats
- 2h Contrary to popular belief, Mythos is very real, just that you don’t have it where you are
- 1d Anthropic Won't Let You Use Their Best Model. Prediction Markets Are Trying Anyway.
- 1d Who else thinks AI is reaching a plateau
- 1d GPT-5.5 matches heavily hyped Mythos Preview in new cybersecurity tests
152 itemsevent
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
Connect to ADO (www.reddit.com)
I’m not a developer. I am a product owner.
Two keyboard behaviors in Claude.ai web that Anthropic needs to make optional: **1. “/” always forces the command menu open** The moment you type after a space “/” anywhere in the input box, the slash command popup takes over.
https://preview.redd.it/2mytyh2gstyg1.png?width=1556&format=png&auto=webp&s=10d8cb3d48996e7b4e24377adf80fe2674fd3c04 It should be using my slides in the claude project as a source. Maybe it was trained on some source K-Means that was in ch…
Harness engineering: Preparing TypeScript codebases for coding agents (www.analogue.computer via reddit)
Our product development team fully embraces Claude Code for vibe coding. Here's how we set up our codebases for the best results.
-
111 items
model roundup
GPT 5.5On [Date], a significant leak of the OpenAI Codex model, referred to as GPT-5.5, was captured on video before it was patched. The incident involved models named Arcanine and Glacier-alpha.
- 42m what is the command to call the countdown or waiting function?
- 54m GPT-5.5 & GPT-5.5 Pro are now available in Manifest Router.
- 1h GPT 5.5 just leaked its chain of thought to me in codex, and it looks like an idea from 5 months ago in this sub.
- 11h Anthropic just passed OpenAI in valuation and revenue
- 12h GPT 5.5 tops private citation benchmark on Kaggle (AbstractToTitle task)
165 itemsevent
CopilotMicrosoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
- 1h Cursor Review 2026: The AI Code Editor That Replaced VS Code
- 10h Community-built registry for AI agent config files (system prompts, CLAUDE.md, GPT instructions) just hit 888 stars
- 12h Claude Code, Copilot and Codex got hacked. Attackers went for the credentials
- 12h Migrating from VS Code (GHCP) to Cursor
- 13h Best suited model for solo Dev
Tested 10 image generation models on M1 Max 64GB for photorealism, text rendering, and cultural accuracy (Japanese/Asian content). Key findings: Qwen-Image Lightning (8-step distillation) beats the full model in quality while being 9x fast…
Treat Agent Output Like Compiler Output (skiplabs.io via hn)
Treat Agent Output Like Compiler Output Philip Su's recent post argues that code reviews are not just impractical in the age of coding agents, they're headed toward being irresponsible. He's right on trend.
Tinygrad Driver testing! (www.reddit.com)
Boutta Thrash some MoE speeds on a blackwell + m3 Ultra RDMA cluster. Theres a bit less than 2tb of ram here.
One Question About AI Most People Avoid Answering… (www.reddit.com)
Everyone’s talking about Agentic AI… but very few are actually using it right. So here’s a real question: If you had to give ONE outcome (not a task) to an AI agent — something it fully owns end-to-end — what would you trust it with today?
-
74 items
model roundup
DeepSeek 4DeepSeek-V4-Pro is a 1.6T parameter Mixture-of-Experts model supporting one million-token context, with significant improvements in efficiency and stability through hybrid attention and manifold-constrained hyper-connections. Community highlights include its cost-effectiveness via the official API and exceptional performance in large code change evaluations, with some noting its surprisingly robust output capability despite a 384K max token limit.
270 itemsmodel roundup
Qwen 3.6Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 3h Qwen3.6-27B vs 35B, I prefer 35B but more people here post about 27B...
- 3h I made a visualizer for Hugging Face models
- 7h What could they mean by "warmed steady-state"?
- 8h Need advice on Qwen 3.6 27B INT4 quantization
- 9h Warpdrv - my open-source Llama.cpp launcher for daily-driving Qwen 35b + 27b on Strix Halo + RTX Pro.
The agent harness belongs outside the sandbox (www.mendral.com via hn)
An agent harness is the loop that drives an LLM. It sends a prompt, gets a response, executes the tool calls the model requested, feeds the results back, and repeats until the model says it's done.
- Agent Harness: Inside vs. Outside the Sandbox (www.mendral.com via hn)
Mac browser for a human that also gives coding agents local APIs (github.com via hn)
wkdomains wkdomains is a macOS browser for developers working with coding agents like Codex, Claude Code, Cursor, and similar tools. It lets the human browse normally while an agent gets structured local access to the same page: screenshot…
could not extract summary
Ban phrases on llama.cpp with this script. (www.reddit.com)
Check the README for setup instructions: https://github.com/BigStationW/llama-cpp-phrase-ban
-
103 items
event
Altman AttackSam Altman, CEO of OpenAI, has faced multiple attacks on his home in San Francisco, including firebombing and drive-by shootings, raising concerns for his safety. Additionally, a majority of over 100 people interviewed by Ronan Farrow described Altman as a "pathological liar.
Invoko: screen-aware Mac agent, zero setup. Free beta open. (www.reddit.com)
The thing that keeps most people from using tools like OpenClaw isn't interest, it's Docker setup on a Tuesday night. Invoko is the no-setup version of this category.
Spine – verified codebase onboarding for Claude Code (github.com via hn)
spine spine turns an unfamiliar repository into a verified onboarding guide. In one run it gives you: a small architecture diagram built from verified static-analysis edges only a prioritized reading order for the files that matter first a…
Major help re: IP, hardware & memoirs (www.reddit.com)
I'm 80. I go back to CPM operating system days, but I'm a user, not a tech, yet still have to deal with tech issues daily.
State of AI Agents in corporates in mid-2026? (www.reddit.com)
I was a working professional working and now a grad student in AI research for last 1.5 years. When I started grad school, AI agents weren't a thing.
"Security Warning The MCP server will execute LLM generated code in Blender without any guards in place to protect your data from removal or being sent to a remote location. To keep your data safe it is recommended to use a virtual machine…
Best solution for personal telegram bot (www.reddit.com)
Sup Reddit. I'm looking for any cool ai agents for personal use with any telegram bot integration.
What Is GStack? Gary Tan's Open-Source Startup Framework for Claude Code (www.mindstudio.ai via hn)
What Is GStack? Gary Tan's Open-Source Startup Framework for Claude Code GStack is an open-source framework by Y Combinator's Gary Tan that gives solo developers the power of a full startup team using Claude Code skills.