Ask HN: The death of software development as a job? (news.ycombinator.com)
A lot of programmers I read here and elsewhere say LLM isn't going to change much, some say LLM is just going to make them more productive, and some even say not using LLM makes you some sort of relic. What is not debated is that LLM has c…
Show HN: Design Taste for AI Agents (aidesigntaste.com via hn)
Free design.md systems from the world
Don't Become an Agent Wrapper (www.anantjain.xyz via hn)
Don't become a wrapper Remember when we made fun of startups that just wrapped a prompt and an API call to OpenAI in a good-looking UI? We agreed those "ChatGPT wrappers" were cooked.
Cryptographic hashing as a transformer attention head (github.com via hn)
unbounded-context-attention An attention architecture for transformer language models with three properties: it handles arbitrary context size with no architectural cap, every input token is guaranteed to have nonzero influence on every ou…
180 items
event
Cowork
Issues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
- 17m Anthropic’s new finance AI agents feel like a bigger move than just “better chat”
- 2h Claude cowork combined with code
- 3h Anyone using mac minis? If so, what’s the craziest use case
- 8h New agents for financial services | Claude Cowork + Claude Managed Agents
- 9h Project knowledge file indexing reliability seems to be getting worse? (should I just use cowork instead?)
185 items
event
Copilot
Microsoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
- 35m Sick of Copilot? You Can Uninstall Microsoft's AI, but It's Tricky
- 2h I got tired of AI agents destroying my codebase and eating tokens, so I built a self-bootstrapping Markdown protocol to fix their memory.
- 2h Xbox CEO ends Copilot AI development and overhauls leadership
- 4h Xbox winding down Copilot on mobile and will stop dev of Copilot on console
- 5h Quirre – An AI marketing copilot for non-marketers
Seeking an AI place for Star Wars RPGs, non-gooner but also no filter. And similar ability to create documents like Claude can do.
Coordination is where multi-agent runs burn tokens. Every handoff, every "what was I working on", every "did someone alrea…
CollectorVision Part 8: The Sol Ring Benchmark – Testing Hardest Card Recognition (blog.hanclin.to via hn)
CollectorVision Part 8: The Sol Ring Benchmark — Testing Card Recognition on the Hardest Case Sol Ring is ranked #1 on EDHREC — the most-played card in Commander, by a wide margin. It also happens to be one of the hardest cards for an imag…
Codex's precision and attention to detail is *crazy* when set up correctly (news.ycombinator.com)
Lately I've been working on a Tower Defense game with Codex, in part to learn how game development works and in part to see how far I can get using just Codex, no manual coding at all. I've got my AGENTS.md & my CODESTYLE.md & six other AL…
83 items
model roundup
Opus 4.6
Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.
- 56m 12M Context Window and some sprinkle of lies?
- 18h DeepSeek V4 Pro matches GPT-5.2 on FoodTruck Bench, our agentic benchmark — 10 weeks later, ~17× cheaper
- 1d Using Claude-4.6-Sonnet and Opus 4.6 in a multi-agent "Code Review Swarm" (Visual Sandbox) - try in minutes!
- 1d Free Trial: Gemini 3.1 Pro & Opus 4.6 API Access via My Wrapper
- 1d Open source models are going to be the future on Cursor, OpenCode etc.
283 items
model roundup
Opus 4.7
Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 1h Used Claude Opus 4.7 to do a 5-hour solo incident response on real healthcare malware (where it worked, where I had to override)
- 2h Claude Code @ Opus 4.7 vs OpenCode @ qwen3.6:27b. Both shipped a playable cozy roguelite.
- 5h F-Bombs Per Thousand Prompts (fpk): I measured my frustration across 44,212 Claude Code logs
- 5h Opus 4.7 has a new favorite word
- 6h Agent review burnt most of my API credits. Rookie mistake
Better Design 29 open-source design systems for shadcn/ui. Drop any of them into your app with one command.
I’m wondering if there’s any way to merge several Claude conversations that are all about the same topic into one master conversation. For example, I might have discussed a topic with Claude last February, then started another chat about t…
Common and Obscure Models and Ways to Find Them [ Human Written ] (www.reddit.com)
I've been on a binge finding uses for local AI on my machine outside of general LLM usage as I'm not sure what other sub discovery of these things should go on. Here's a collection of my findings.
7 items
model roundup
Haiku 4.5
On April 30, 2026, users of Claude Haiku 4.5 experienced elevated errors, prompting system updates. Additionally, the coding agents "Claude Code" and "Codex" were enhanced to provide voice feedback to users, reducing idle time during tasks.
87 items
model roundup
DeepSeek 4
DeepSeek-V4-Pro is a 1.6T parameter Mixture-of-Experts model supporting one million-token context, with significant improvements in efficiency and stability through hybrid attention and manifold-constrained hyper-connections. Community highlights include its cost-effectiveness via the official API and exceptional performance in large code change evaluations, with some noting its surprisingly robust output capability despite a 384K max token limit.
- 1h Does Deepseek V4/Flash work with Llama CPP and Vulkan on and branches yet?
- 1h DeepSeek V4 Pro: The First Chinese Model at the Frontier
- 4h DeepSeek V4 being 17x cheaper got me to actually measure what I send to cloud vs what I could run locally. the results are stupid.
- 8h tested four newest open source Kimi K2.6 is the fastest, GLM 5.1 the fanciest, DeepSeek V4 is the most comprehensive, and Xiaomi MiMo is the slowest
- 13h DeepSeek V4's indexer OOMs at 65K context. We got it to 1M in 6G
how i got my github inbox handled by claude code while i sleep (www.reddit.com)
i delegated my github inbox to agents. it's an open source daemon that lives in the menu bar : agents triage notifications in the background and only surface the ones that need a human call.
Spyware? (www.reddit.com)
Has anyone else had this happen? I verified the files, it’s signed by anthropic.
is it possible to build harnesses as good as codex/claude code (www.reddit.com)
The codex harness, in my experience, is extremely intelligent. It picks the right tools to call, corrects itself when it makes a mistake, and can run for extremely long periods of time.
DALLE-3 genuinely made my childhood dream app possible. (www.reddit.com)
I’ve been experimenting with different text to image APIs over the past few months and honestly DALLE-3 is so damn clean and quite reliable for my use cases. I ended up building a generative coloring book app where users can create custo…
309 items
model roundup
Qwen 3.6
Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 2h Gemma4:31b-coding-mtp-bf16 - slow on Macbook M5 128gb
- 2h MTP on strix halo with llama.cpp (PR #22673)
- 3h Smaller gguf getting way less tokens per second?? So confused!
- 5h My setup for running Qwen3.6-35B-A3B-UD-Q4_K_M on single RX7900XT (20GB VRAM)
- 6h Dense Model Shoot-Off: Gemma 4 31B vs Qwen3.6/5 27B... Result is Slower is Faster.
17 items
event
DeepMind
Google DeepMind has released "Deep Research Max," advancing autonomous research agents, while also facing challenges and competition from other AI companies like Anthropic and Ineffable Intelligence. Meanwhile, DeepMind workers in the UK have voted to unionize, and Google's AI architect Demis Hassabis is at the center of legal drama involving Elon Musk.
- 2h Google DeepMind Workers Vote to Unionize over Military AI Deals
- 8h CAISI [Center for AI Standards and Innovation] Signs Agreements Regarding Frontier AI National Security Testing With Google DeepMind, Microsoft and xAI
- 10h Google’s AI architect, Demis Hassabis, lived rent-free in Elon Musk’s head
- 18h Retrospective on Black and White and its connection to Google DeepMind
- 1d Ex-DeepMind David Silver Raises $1.1B for AI Startup Ineffable
Claude Code on iOS uses? (www.reddit.com)
I should preface this by saying that I'm one of those people who, when I find something that works, I just stick with it. It may not be the right way, but it works for me.
- Claude Code Uses GLM 4.7 (old.reddit.com via hn)
- Claude code (www.reddit.com)
My main project application is about 50K lines (plus UI/front end) and since I started using CC, the amount of work I do in a day is equivalent to 3–5 days of work from before, including much more rich features with functionalities that we…
Hey everyone — I’m trying to streamline part of my Etsy workflow and could use some direction. I run a digital wall art shop and already create everything manually (art, mockups, descriptions, titles, etc.).
Claude, I order a whiskey sour, I get spaghetti (claude.ai via hn)
This is a copy of a chat between Claude and Nils. Content may include unverified or unsafe content that do not represent the views of Anthropic.
i'm looking for good resources, please don't let me die ;( (www.reddit.com)
Hello! A few days ago I made a post about a conflictive project i got (and I still don't finish but lets not focus on that for now).
Been noticing my Claude Code feeling sluggish over the past couple of months. Decided to just ask Claude to audit itself.