Qwen Introduced FlashQLA (www.reddit.com)
Introducing FlashQLA: high-performance linear attention kernels built on TileLang. 2–3× forward speedup.
Letting AI play my game – building an agentic test harness to help play-testing (blog.jeffschomay.com via hn)
Are "Vintage LLMs" the start of a new humanistic field? (resobscura.substack.com via hn)
Thoughts on Historical Language Models and Talkie-1930. Imagine talking to the collective consciousness of an era.
Megent – Firewall for AI Agents (megent.dev via hn)
Control every agent tool call from one policy layer. Define what your agents can do.
Built a three-panel workspace for doing research with Claude Code (www.reddit.com)
Hey everyone. I've been using Claude Code a lot for my physics research, and it always felt slightly wrong — like I was forcing a coding tool to do work it wasn't really shaped for.
Claude for Word (claude.com via hn)
Get down to the details with Claude for Word. Claude works inside your Word document instead of a separate window. Select text, describe the update, and review it as a tracked change.
- Claude.md (gist.github.com via hn)
- What do you do with Claude? (www.reddit.com)
- Opus 4.7 showing in Claude for Word (www.reddit.com)
Quality of life upgrade for my Brothers and Sisters in Claude (www.reddit.com)
Hey everyone! Just wanted to share a little utility I built for claude.ai in the browser.
Show HN: I built a 2nd-order PyTorch optimizer for LLMs that runs on 16GB GPUs (news.ycombinator.com)
Hi HN, I'm Danilo. I've been struggling with the limitations of AdamW when fine-tuning LLMs locally.
3 items
model roundup
GLM 5
GLM-5 is a large language model with 744B parameters, an increase from GLM-4.5's 355B parameters, and it integrates DeepSeek Sparse Attention to enhance efficiency. Notably, community members are exploring its use for fine-tuning smaller models and discussing its relevance in the context of influential AI companies.
221 items
model roundup
Qwen 3.6
Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 33m llama.cpp benchmark native vs. non native NVFP4 on Blackwell - summary
- 2h How do you objectively tell if your custom agent tools are actually better?
- 4h What it feels like to have to have Qwen 3.6 or Gemma 4 running locally
- 4h Qwen3.6 27B on dual RTX 5060 Ti 16GB with vLLM: ~60 tok/s, 204k context working
- 6h Ran my own benchmark Qwen 3.6 35B vs Gemma 4 26B.... theres a clear winner here
Saw a lot of hype around Blender MCP this week so I decided to actually test it with two real workflows instead of just reading about it. Test 1: Build a scene from scratch. Typed one sentence describing a cyberpunk room.
https://github.com/kaancat/tracking-auditor-skill The basic idea is simple: use Codex or Claude Code as a tracking auditor that looks at the whole conversion path, not just whether a GTM tag exists. For PPC accounts, that matters a lot beca…
What’s an AI agent you’ve actually relied on? (www.reddit.com)
Not the flashy demos or hype, just something that genuinely helps in real work. Like something that saves you time, takes care of repetitive tasks, or makes your day a bit easier. If you’ve used one, curious to hear: What do you use it for? Wher…
The Missing Piece: A Self-Custody Wallet for AI Agents (pckt.blog via hn)
Everyone talks about the web3 + AI symbiosis. Nobody is actually shipping a self-custody, agent-native wallet.
LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning (machinelearning.apple.com via hn)
Authors: Haoqiang Kang†, Yizhe Zhang, Nikki Lijing Kuang†, Nicklas Majamaki†, Navdeep Jaitly, Yi-An Ma†, Lianhui Qin†
Show HN: I scanned 16 AI agent repos – 76% of tool calls had no guards (github.com via hn)
diplomat-agent: You deployed a Python AI agent. Do you know every function it can call that writes to a database, sends an email, charges a card, or deletes data, and which ones have zero checks?
My three Claude subagents actually work (www.reddit.com)
Took me like 6 weeks to figure this out. Everyone's making these massive subagent libraries with 47 different specialists and wondering why their code still sucks.
Building AI Agents in Python with Pydantic AI (machinelearningmastery.com via hn)
paywalled
139 items
event
Copilot
Microsoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
- 38m Show HN: Filling PDF forms with AI using client-side tool calling
- 1h leaked my anthropic key into a public repo, lost $15,423.
- 3h The final nail in the coffin for entry level creative freelancers just dropped
- 7h Is an agentic Spark copilot worth it? opinions?
- 7h Rust Bucket: Agent-first Rust project bootstrapper
224 items
model roundup
Opus 4.7
Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 43m Improve claude code on Opus 4.7
- 2h The Beautiful Lie - Teaser
- 9h Crystal Sapphire Pokemon: Claude Code (Opus 4-7) vs. Codex (GPT 5.5)
- 12h Opus 4.7 is just 4.6 with a stick up its butt. Give me my tokens back!
- 12h Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-29T00:00:29.000Z
Just shipped: live-canvas skill in liteagents (www.reddit.com)
I built a skill with Claude to solve a problem: UI iteration with AI is mostly context-switching. Reload, screenshot, describe, paste, repeat.
Versioned, portable LLM prompts as a spec – not a framework (promptpack.org via hn)
RFC 0010: Workflow Composition Extension. Status: Draft. Author(s): Charlie Holland (chaholl). Created: 2026-04-28. Updated: 2026-04-28. Related Issues: N/A. Summary: Extend PromptPack with composition as a new workflow-state orchestrat…
Show HN: Analytics-skills – turn your agent into a senior analyst (github.com via hn)
analytics-skills: Analytics skills for Claude, Cursor, and other AI agents. Read web analytics like a senior analyst: diagnose traffic changes, judge channel quality, read funnels, declare typed events, and read A/B tests without the usual…
Ask HN: Is any one experiencing partial outage with Gemini API? (news.ycombinator.com)
india region!
Monet – Open-source shared memory for AI agent teams (github.com via hn)
Monet: Turn your team's AI operational intelligence into a reusable asset. Senior developers get better AI results, not because of better prompts, but because of accumulated ope…
The read/write framing made sense for about two weeks. Then I started hitting cases it couldn't handle and realized the problem: read/write is a data model.
W2A – Open Protocol for Agent Perception (github.com via hn)
Agents can't act on what they can't perceive. What is World2Agent?
I Used Claude Code + Remotion to generate my app's launch animation (www.reddit.com)
Agentic NixOS: Building a Safe Control Layer (nedkarlovich.com via hn)
A six-part series on building Agentix, a cautious agent-control layer for NixOS. From philosophy to MVP to roadmap.
How Much LLMs is too much LLMs? (www.sammystraus.com via hn)
Sammy's Blog Table of Contents - 1. How much LLMs is too much LLMs?