Stt.ai MCP Server (pypi.org via hn)
Model Context Protocol server for STT.ai — transcribe URLs, list/search transcripts, summarize, analyze, generate, and chat with transcripts from Claude Desktop / Cursor / any MCP client. STT.ai MCP Server Model Context Protocol server for…
- FilamentPHP MCP Server (github.com via hn)
Hey guys I need your Help! To Create Websites using Claude (www.reddit.com)
How you guys prompt to create unique design for websites. I tried multiple prompts with different variations read so many articles about how to prompt to create websites.
First time Post - consistent Issue (www.reddit.com)
Hi everyone, I have been consistently trying to code something (I have zero knowledge) that is important for my work as a musician, a SATB voice leading tool/checker similar to this: https://partwriter.com/ I am unable to get any working r…
-
101 items
event
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
120 itemsmodel roundup
Gemma 4Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.
- 9m Best settings for gemma-4 on a 3090?
- 12h Three lessons from fine-tuning a 5B code assistant — bad outputs from 5% → 0%
- 13h Were Qwen3.6 models scrubbed from openrouter?
- 19h Throughput and TTFT comparisons of Qwen 3.6 27B, Qwen 3.6 35B A3B and Gemma 4 models on H100
- 23h Qwen3.6-35B-A3B-UD-IQ4_XS C++ to Rust Code Port Test: It Worked (Mostly)!
I asked my local LLM to add 23 numbers and got seven wrong answers (viggy28.dev via hn)
ARTICLES I Asked My Local LLM to Add 23 Numbers. I Got Seven Different Wrong Answers.
Show HN: Track official AI company news and blogs in your Chrome side panel (chromewebstore.google.com via hn)
I've already picked up so much great news and tips from here — thanks to the HNers for sharing. That said, I still find myself manually checking various official newsrooms and engineering blogs.
Ask HN: Is "agentic" coding working for everyone except me? (news.ycombinator.com)
I'm a solo developer, working on my own for my startup. I use AI/LLMs extensively in my work to explore new ideas, but the vast majority of my code is manually written.
-
24 items
model roundup
Qwen 2.5Qwen2.5-7B-Instruct is a 7 billion parameter instruction-tuned language model that significantly enhances coding and mathematical capabilities, supports up to 128K tokens in context, and understands structured data. Community discussions highlight its suitability for code autocomplete tasks and debate the hardware requirements needed for deployment compared to other models like Gemma 26B MoE.
- 15m Using logit steering / KV Cache Dynamic Assembly to guide outputs from Small Language Models using ONNX Runtime
- 1d Show HN: Doxa – Open-source emergent simulator for geopolitical scenarios
- 2d Best coding/reasoning model for low vram
- 3d Best model that can run on raspberry pi 5 with 8GB of RAM
- 3d I ran a Hormuz Crisis emergent SIM: AIs started lying to hide a stalemate
42 itemsmodel roundup
DeepSeek 4DeepSeek-V4-Pro is a 1.6T parameter Mixture-of-Experts model supporting one million-token context, with significant improvements in efficiency and stability through hybrid attention and manifold-constrained hyper-connections. Community highlights include its cost-effectiveness via the official API and exceptional performance in large code change evaluations, with some noting its surprisingly robust output capability despite a 384K max token limit.
- 26m DeepSeek's new model is 75% off right now, here's how to take advantage
- 5h DeepSeek-V4 on Day 0: From Fast Inference to Verified RL with SGLang and Miles
- 11h DeepSeek V4 with Strix: a quick test
- 15h DeepSeek V4 API price reduced, limited-time discount of 75%.
- 15h Decreased Intelligence Density in DeepSeek V4 Pro
Self-hosting LLM Provider on Open Router (www.reddit.com)
Is anyone here a provider on openrouter? curious about using my setup to make some $$ to offset costs of a new build Thoughts?
Claude for Personal USE (www.reddit.com)
Anybody out here using Claude for daily personal usage- like weekly grocery, personal training or finances ? Would love to hear !!
- I can not use Claude Cowork (www.reddit.com)
- HOW TO USE CLAUDE CODE (www.reddit.com)
Traces are trees. Multi-agent failures are graphs. (www.reddit.com)
Quick context: when you have multiple AI agents talking to each other and something goes wrong, your debugging tools usually show "everything fine" even when the agents are stuck in a loop costing you money. Here's why: Been building obser…
-
62 items
model roundup
GPT 5.5On [Date], a significant leak of the OpenAI Codex model, referred to as GPT-5.5, was captured on video before it was patched. The incident involved models named Arcanine and Glacier-alpha.
115 itemsmodel roundup
Qwen 3.5Qwen3.5-9B is a post-trained model with 9 billion parameters that integrates multimodal learning and efficient hybrid architecture for enhanced performance. Community highlights include speculative decoding on Apple Silicon boosting Qwen3.5-9B's throughput by 4.1x, and the model outperforming others in coding tasks while addressing overthinking issues through tool usage.
Got a server with 8x A6000's how do I setup? (www.reddit.com)
Hey guys got some resources that just became available at org. What's the quickest way to get setup on a multigpu setup?
Tell HN: Medvi (telehealth) hardcodes 999 patient emails in public JavaScript (news.ycombinator.com)
Medvi is a telehealth pharmacy that has received significant media attention recently. While browsing their site with DevTools open, I noticed that their public JavaScript bundle contains a hardcoded list of 999 patient email addresses — a…
Need help in testing voice agents during development and production (www.reddit.com)
Hi folks, I am currently building an AI interviewer voice agent for one of my clients. I have been testing it manually, and each call takes 10–15 minutes, which is very tedious and manual.
-
129 items
event
Anthropic MythosAnthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
- 1h Discord Sleuths Gained Unauthorized Access to Anthropic's Mythos
- 5h OpenMythos with Qwen2.5-1.5b weights (No recurrence atm) - looking to turn it into full OpenMythos
- 10h What would you use Claude Mythos for if you had access today?
- 10h Discord group says it accessed Claude Mythos by guessing location
- 11h Claude Mythos: The first AI-native cyberweapon?
84 itemsevent
SecurityOpenAI has released GPT-5.4-Cyber for testing as part of its Trusted Access for Cyber Defense program, aiming to compete with Anthropic's Claude Mythos in the cybersecurity domain. Meanwhile, concerns are rising over the potential risks associated with advanced AI models like Mythos, prompting calls for improved defenses before wider releases.
- 2h Claude in excel is the best thing AI has brought to my life
- 5h Does effort tier change refusal behavior on agent-attack prompts? CVP run 4 with sonnet 4.6 high and max efforts.
- 9h Self-Hosted AI Red Team Tools
- 22h LLM CTF challenges. Can you crack all 13?
- 1d Most AI agent "skills" on GitHub are unvetted garbage. I built a marketplace to fix that.
Just shipped peeroxide, a complete, production-ready Rust port of the Hyperswarm P2P networking stack. It’s fully wire-compatible with the existing Node.js implementation, so Rust peers can join the live public HyperDHT and seamlessly disc…
**Claude.ai MCP connectors seem to be silently degrading — Google Drive broken, Gmail now only reads metadata. Anyone else?** I use Claude as a personal finance assistant.
Multi-Agent AI Systems Are Eating Single Agents (aistackinsights.ai via hn)
Single-agent architectures hit a wall the moment your task needs planning, research, and execution in parallel. Multi-agent systems solve this — but most tutorials skip the hard parts.
-
95 items
event
CopilotMicrosoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
- 2h Hardening claude-code-action after the April 2026 Comment and Control CVE - actual YAML changes
- 13h Fortune 100 AI Use
- 13h CC-OpenAI-Codex Plugin, but for all CLI agents
- 18h Another Microsoft Copilot AD injected into 4M GitHub commits
- 1d The "AI will replace engineers" discourse has the abstraction level wrong
Impact of mixing architecture (www.reddit.com)
For context As planned after my previous post, I now have a decent amount of VRAM to work with: 2x RTX 3090 maybe 2 more coming soon, if needed 1x RTX 4060 8x RX 6600 XT 1x RX 6700 XT 1x RX 9060 XT (12 to 20 3060 more coming soon + 2 3090…
LLMs Corrupt Your Documents When You Delegate (arxiv.org via hn)
Large Language Models (LLMs) are poised to disrupt knowledge work, with the emergence of delegated work as a new interaction paradigm (e.g., vibe coding). Delegation requires trust - the expectation that the LLM will faithfully execute the…
Integrations (www.reddit.com)
I'm working on integrating my claude into my current architecture. I have a chatbot however I want it to be able to reason with claude.
Plugins Confuse me (www.reddit.com)
Hey everyone, plugins confuse me a lot. If anyone uses them outside of just their manually put together MCP servers, I would love to learn the technical difference between them.
Agents Aren't Coworkers, Embed Them in Your Software (www.feldera.com via hn)
Agentic management software is all the hype today: What started with Moltbot and OpenClaw now has a lot of competition: ZeroClaw, Hermes, AutoGPT etc. These systems work well and allow you to train and build generic agent loops that are ge…
Open-Source Inference is growing 10% week over week this year (news.ycombinator.com)
So we're a small inference provider, launched publicly two weeks ago and have seen a crazy demand of growth. I reached out to a lot of other inference providers such as fireworks, togetherAI, simpliAI etc and started asking them their grow…