We tested 7,039 sites for MCP support; 5.8% passed a live handshake (8bitconcepts.com via hn)
Executive summary NothingHumanSearch has been crawling the web for agent-readiness signals since launch, and the numbers tell a consistent story about the gap between claiming MCP support and shipping it. As of 2026-04-27 the index holds 7…
Not sure if it's just me but I've been using Cursor pretty heavily for the past few months and something feels off. The code works, shipping is faster, but I feel like I understand my own project less than I did before.
BluePunch: construction punch lists and takeoffs (bldrlife.github.io via reddit)
This project is completely free for the love of the game. I am a construction professional and generally speaking, the software we have available is either way too expensive, littered with useless AI, or endless monthly subscriptions.
Source: Claude Code Model Configuration
-
109 items
model roundup
Qwen 3.5Qwen3.5-9B is a post-trained model with 9 billion parameters that integrates multimodal learning and efficient hybrid architecture for enhanced performance. Community highlights include speculative decoding on Apple Silicon boosting Qwen3.5-9B's throughput by 4.1x, and the model outperforming others in coding tasks while addressing overthinking issues through tool usage.
- 2m Guys this is so fun!
- 12m Best value in the 20$ range coding agents? I want the best quality and high-usage-limit I can get at that price.
- 59m I want to create and maintain a set of benchmarks for local LLMs. Would anyone pay/donate for this?
- 4h The 4B class of 2026 (benchmark)
- 10h Show HN: Local RAG Pipeline with Weaviate and Ollama
133 itemsevent
Anthropic MythosAnthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
- 8m Claude Mythos Scaffold v0.1 — pattern-based skill set inspired by Mythos Preview behaviors
- 8h Mythos Preview: What Every CISO Should Do Now
- 9h Pen-Testing Company XBOW on GPT-5.5: Mythos-like Cyber-Sec
- 12h Leaked results of Mythos' audit of the Rust stdlib
- 1d Anthropic's Argument for Mythos SWE-bench improvement contains a fatal error
Usage Limit Transparency Issue (www.reddit.com)
After using OpenRouter for more than a year i decided to try Claude Max 5x plan mostly to try Claude Design. Got my subscription on Friday afternoon, used it for 3hrs that day, 3hrs the next day and today after my first request got an erro…
ChatGPT Images 2.0 Still Can't Draw the Seven-Legged Spider I Want (will-keleher.com via hn)
ChatGPT Images 2.0 Still Can’t Draw the Seven-legged Spider I Want Whenever a new image generator comes out, I run the same test: “Please generate a spider silhouette missing its left front leg. Use an art deco style.” And every time, the…
Built a B2B real estate AI search agent in 2 days with Claude Code (www.reddit.com)
Spent the weekend building a B2B AI property concierge with Claude Code. Chat box on a real estate site — buyer types "4-bed house in SF with a nice view under $35M" and the agent returns ranked listings with a one-line "why this fits." Th…
I built a GPT that turns messy notes into clean, usable outputs (www.reddit.com)
Most AI outputs are too long and not directly usable. I’ve been working on a custom GPT called Sharpify that turns messy input into structured, usable outputs (like briefs, plans, prompts).
-
120 items
event
CopilotMicrosoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
75 itemsmodel roundup
Opus 4.6Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.
- 31m How I get 100% accurate answers, and replaced Google with Claude
- 3h GPT-5.5 improves over GPT-5.4 and overtakes Opus 4.6 to take the 2nd place behind Gemini 3.1 Pro on the Extended NYT Connections Benchmark
- 3h Found 48 Vulnerabilities in Open Source Projects During Live Testing with Claude Opus 4.6
- 4h Claude 4.6 Beats GPT-5.4, Grok & Gemini in a Strict Multi-Domain AI Test (2026)
- 11h How good is Qwen-3.6-27b? I asked Claude Opus
Open Museum – an MCP server for license-verified search across museums (github.com via hn)
open-museum-mcp One search across five open-access museum collections, with strict per-museum rights verification and ready-to-use citations. The Met, Cleveland Museum of Art, the Art Institute of Chicago, Wikimedia Commons, and Europeana…
I build my LLM a Brain (news.ycombinator.com)
A glimpse about my app context engineering Take a look : https://x.com/TabetKevin/status/2048884876603203850 Have a nice one, feel free to comment, i want to so better
The corpus is Lilian Weng's "LLM Powered Autonomous Agents" — the blog post that the LangChain RAG tutorial uses as its canonical demo. The retriever is the LangChain default (cosine similarity over all-MiniLM-L6-v2 embeddings, top-5).
Show HN: Memory Guardian – open-source memory governance for AI agents (github.com via hn)
Memory Guardian Memory governance for AI agents. Most agent memory layers are passive storage.
-
216 items
model roundup
Opus 4.7Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 1h Tool/connector schemas leaking into user message stream. Anyone else seeing this?
- 1h DeepSeek-V4 arrives with near SotA intelligence at 1/6th the cost
- 2h Does effort levels change Claude's refusal posture, or only the depth of the answer? CVP Run 6 — Opus 4.7 at three effort levels
- 4h Opus 4.7 - "Build starcraft II in the browser. Make no mistake"
- 5h One of my devs is burning through company tokens
33 itemsmodel roundup
GPT 5.4OpenAI has released GPT-5.4-Cyber for testing and claims it will compete with Claude Mythos. Meanwhile, GPT-5.4 Pro has solved the Erdős Problem #1196, showcasing its advanced capabilities in mathematics.
End-2-end tutorial on fine-tuning, the whole journey (docs.liquid.ai via reddit)
I put together a hands-on tutorial that takes you from problem framing to fine-tuning, step by step. I decided to build a wildfire prevention system that uses satellite images and a Small Vision-Language Model (LFM2.5-VL-450M) to extract r…
Currently studying for closed book exams using chat plus and Claude, what tips / prompts will be game changers
Claude knows when you cheat on it with Codex?? (www.reddit.com)
- Claude Vs Codex (claudevscodex.com via reddit)
Is Chatgpt down ? (www.reddit.com)
For about 2 hours cannot login in to the Mac app. It just keeps loading the Auth page, while the web app works fine.
- ChatGPT 5.5 🔥🔥🔥 (www.reddit.com)
- Chatgpt Down?? (www.reddit.com)
- Chatgpt down guys I'm cooked (www.reddit.com)
- Urgent Chatgpt down help (www.reddit.com)
-
6 items
model roundup
Sonnet 4.5Anthropic has kept Claude Sonnet 4.5 available after its retirement due to user demand, while open-source models like DeepSeek V4 are catching up in capabilities, which remain several months behind closed lab versions.
- 58m Anthropic just quietly locked Opus behind a paywall-within-a-paywall for Pro users in Claude Code
- 1h I hate thinking models, any way to use the default ones?
- 1d Has anyone ever hit an ASL-3 error? Claude thinks im making a bioweapon lol
- 2d For the Preservation of Claude Sonnet 4.5: An Open Letter to Anthropic
64 itemsevent
Altman AttackSam Altman, CEO of OpenAI, has faced multiple attacks on his home in San Francisco, including firebombing and drive-by shootings, raising concerns for his safety. Additionally, a majority of over 100 people interviewed by Ronan Farrow described Altman as a "pathological liar.
- 1h I think over the next 4 month, we are going to see much more progress in AI than we have seen in the past years
- 2h Musk and Altman face off in trial that will determine OpenAI's future
- 3h U.S. companies back Sam Altman's World ID even as much of the world pushes back
- 3h The legal showdown between Elon Musk and Sam Altman begins today
- 5h Musk vs. Altman Kicks Off This Week. Hard Reset Will Be There.
Started exploring the Ai automations and Ai agents feels brainfogged (www.reddit.com)
Hey i'm nikhil and i was into Webdesign and SEO and Recently i have been exploring the ai automations and ai agents building but it feels pretty complicated for me or i can say im so brain-fogged when looking to start - Can anyone help me…
David Silver of DeepMind raises $1B to build AI that learns without human data (techcrunch.com via hn)
Ineffable Intelligence, a British AI lab founded a mere few months ago by former DeepMind researcher David Silver, has raised $1.1 billion in funding at a valuation of $5.1 billion to join the race for novel AI models that could outperform…
Ask HN: Will fixed applications become a thing of the past with agentic AI? (news.ycombinator.com)
Right now its mostly technical people using these agentic tools but if you extrapolate a few years into the future it seems likely to me that every day users of a computer will be using them as a whole new interface to interact with their…
Local vs Cloud LLMs… are we pretending it’s one or the other? (www.reddit.com)
IMO: You’re not running a real 70B workload on a laptop. You’re not handling spiky multi user demand locally.
Been experimenting with coding agents locally and kept hitting the same issue: they’re smart enough to code, but they repeatedly waste effort rediscovering repo paths, startup commands, preferences, folder structure, etc. So I built Substr…
Cloudflare wrapped Agents Week last week and the enterprise MCP stuff caught my eye, want to see what people think. They shipped a few things.