The OpenAI-Microsoft reset, decoded: Why AWS may come out ahead (thenewstack.io via hn)
The OpenAI-Microsoft reset, decoded: Why AWS may come out ahead OpenAI wasted little time since announcing changes to its partnership with Microsoft on Monday. The ChatGPT hitmaker is now bringing its models, coding tools, and agentic capa…
Claude to build an app with no experience? (www.reddit.com)
In my job we currently have an app we use as a kind of community based subscription platform. It's backended by Circle and we pay around £900 a month for it.
Get a Pro subscription or above to see the live story progression and the full list of independent sources confirming each event as they happen. Log in to upgradeCreate a free account to use the public wire and manage upgrades later.
-
254 items
model roundup
Qwen 3.6Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 6m PFlash: 10x prefill speedup over llama.cpp at 128K on a RTX 3090
- 12m Need help optimizing qwen 3.6 on my 2x 5060ti 16gb
- 2h Filed two PRs for SGLang which may help others too — FP8 KV cache corruption and memory leak on image requests
- 2h Qwen3.6-27B - Closed-loop SVG Images
- 4h Model stuck in some thinking zone where it keeps saying a similar thing again and again
153 itemsevent
CopilotMicrosoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
- 11m Sidebar chats get a lot of criticism, but users are already used to them.
- 9h I used Claude to build "pin-llm-wiki" — A skill that turns any URL into a clean, citable Karpathy-style LLM Wiki
- 21h Claude Code vs Cursor vs Copilot vs Codeium: Which AI coding assistant is actually worth paying for?
- 23h Show HN: Sampletext.store/ We built a dumb web shop and we cannot look away
- 1d LocalPilot with Ollama as a Replacement for CoPilot in VS2026
Quick context: I'm a longtime Claude user but pretty new to coding. Claude Code is doing most of the heavy lifting on this project — I'm sharing because I think the idea is useful, not pretending the implementation is perfect.
I am building an in-house contract review tool that provides in-line explanation and accounting guidance for sections of a contract that have accounting impact. How do I build an automated process to validate Claude hasn't hallucinated the…
Performance of a large language model on the reasoning tasks of a physician (www.science.org via hn)
www.science.org Performing security verification This website uses a security service to protect against malicious bots. This page is displayed while the website verifies you are not a bot.
-
114 items
event
SecurityOpenAI has released GPT-5.4-Cyber for testing as part of its Trusted Access for Cyber Defense program, aiming to compete with Anthropic's Claude Mythos in the cybersecurity domain. Meanwhile, concerns are rising over the potential risks associated with advanced AI models like Mythos, prompting calls for improved defenses before wider releases.
- 26m 🚨Claude Desktop high severity vulnerability warning!
- 1h Found Zero day Claude Desktop + Chromium bug need to know where to submit report.
- 7h I stopped writing 500-word guardrail prompts. This 8-line template works better.
- 8h Built + open sourced anti-slopsquatting CLI
- 8h Every cloud sandbox for AI agents has a "front desk". That's the whole problem.
42 itemsevent
MistralMistral, a French AI company, is set to release a medium-sized model with 128 billion parameters and is planning to launch Workflows in public preview. The company, founded by Arthur Mensch, continues to grow its AI empire despite not being based in the United States.
- 36m Mistral vs OpenAI: European Sovereignty or Global Scale?
- 2h DeepSeek V4 Flash as a cheap worker in your LLM stack: $0.0003/call via MCP, swappable endpoint
- 21h How would you feel about "Claude Go"?
- 23h Terminal Bench score for Mistral 3.5 Medium
- 1d Mistral Medium 3.5: A reliability first open source model from Europe
Higher-order effects of LLM slop (www.natemeyvis.com via hn)
Higher-order effects of LLM slop I don't much enjoy LLM prose, at least when it's presented as human-generated. I've written about that, and since that post I've started to notice myself adjusting in indirect ways.
I am curious if anyone is building a sales tools with AI. Im building one from scratch because cold outreach was killing me.
I come from a design background, so I keep wanting AI tools to feel less like a chat box and more like a room. You can lay out notes, research, docs, links, decisions, tasks, screenshots, and AI outputs on a realtime canvas.
-
242 items
model roundup
Opus 4.7Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 1h Opus 4.7
- 2h MiMo-V2.5-Pro - the actual best open-weights model
- 11h Are they selectively releasing Opus 4.7 in Claude.ai chat with 1M context window?
- 13h Opus 4.7 is a genuine regression and I'm tired of pretending it isn't
- 17h Any point in paying for the Max plan as opposed to a Claude Desktop and Codex Sub (each $100)
149 itemsevent
Anthropic MythosAnthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
- 39m Claude Security Explained: Mythos, Glasswing, and What Opus 4.7 Changed
- 4h After dissing Anthropic for limiting Mythos, OpenAI restricts access to Cyber
- 14h Rare W
- 19h Anthropic: World is not ready for Mythos. Systems will break, Cybersecurity will be compromised. Its too dangerous to release. OpenAI:
- 21h GPT5.5 slightly outperformed Mythos on a multi-step cyber-attack simulation. One challenge that took a human expert 12 hrs took GPT-5.5 only 11 min at a $1.73 cost
Does anyone else has this issue? (www.reddit.com)
I’m mainly asking people using the app, because I genuinely have no idea if I messed up something in the settings or if Claude iOS app has a display bug. The first picture is from the browser version, second is from app.
AMD PRO W7900 vs R9700 for Local Inference? (www.reddit.com)
I thought of upgrading my RX 6800 for Local LLMs (Mostly Agentic Coding) and Video Generation on Linux. I focused on the AMD PRO R9700 32gb and the PRO W7900 48gb because performance on Linux is very good with AMD and both cards have a gre…
Pentagon strikes deals with 7 Big Tech companies after shunning Anthropic (www.cnn.com via hn)
The Department of Defense announced Friday an agreement with seven major technology companies to use their artificial intelligence tools in its classified networks. Not included: Anthropic, which the Trump administration has blacklisted ov…
-
6 items
model roundup
Gemini 3Gemini 3 flash has become a popular choice for automated promotions due to its high productivity. The cost of Deepseek V4 flash is one-fifth that of Gemini 3, making it a competitive alternative in the market.
- 47m My dream of a fully generative game is getting pretty close to possible now. I made a demo where you can prompt any spell and fight online.
- 2d ChatGPT/Gemini can now draw on your screen to help you navigate complex software
- 3d Show HN: Prediction market analysis app layering LLMs with data APIs
- 4d Show HN: OSS Agent I built topped the TerminalBench on Gemini-3-flash-preview
- 7d Anyone else noticing how Gemini-3-Flash is becoming the 'hidden' beast for automated promotions, its so productive?
140 itemsevent
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
- 1h If the benefit of Claude Cowork is having persistent context for a given project, but conversations degrade as they grow, how do you resolve this?
- 4h Two desktops
- 6h Cowork can't even get my Notion tasks - Can anyone help?
- 11h How would you build this?
- 13h I Gave Claude Cowork an Obsidian Second Brain. Here Is What It Remembered After 11 Sessions
Show HN: Raft to allow a group of AI agents to reach consensus (github.com via hn)
Gravity AI Why Gravity AI is a consensus platform for AI agents. A possible use case is a critical task where answer accuracy and consensus is important, and we would like a community of diverse agents to agree on a given answer/solution.
I’ve been using Claude for a while now and I’m starting to notice some patterns. Long threads usually start strong.
Lately, I’ve been seeing a lot of discussions around AI replacing traditional SaaS. Things like AI agents, tools such as Claude, OpenAI systems, and “agent-to-agent workflows” are being positioned as the next big shift.
-
79 items
model roundup
Opus 4.6Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.
- 1h Claude AI Agent Confesses to Wiping a Company's Database and All Backups
- 9h Used Opus 4.6 to build a native Swift iOS charity app for therapy preparation. Here is what it handled.
- 1d Has Cursor always used Composer 2 for subagents?
- 1d WT...?? The Guardian Article - Cursor Opus gone rogue
- 2d So I gave claude Leetcode problem 3245.
NanoBrain – A Markdown+Git "second brain" for Claude Code (nanobrain.app via hn)
A knowledge corpus that captures your decisions, voice, and relationships while you work. Markdown plus git.
- NanoBrain – A Markdown+Git "second brain" for Claude Code (github.com via hn)
Apple accidentally left Claude.md files Apple Support app (xcancel.com via hn)
Apple accidentally left Claude.md files in today's Apple Support app update (v5.13) Apr 30, 2026 · 10:57 PM UTC 182 618 9,868 1,243,061 Apr 30, 2026 · 10:57 PM UTC
- Apple accidentally left Claude.md files in today's Apple Support app update (twitter.com via hn)
spring-agent-flow Stateful multi-agent orchestration for Spring AI. Design and run long-lived agent workflows with state, retries, and graph execution, all in Java, without manual orchestration code.
I kept hitting the 5-hour limit out of nowhere and had no idea how close I was to the context window filling up mid-conversation. The fact that Claude.ai shows you basically nothing about your actual usage drove me nuts, especially when I'…
Claude hacked my ("rotary") phone (ktoya.me via hn)
Claude code hacked my (rotary) phone A rather technical report from claude itself on how it reverse engineered a hardware protocol of a viking voip phone As part of my work on putting a voice agent in a British red telephone booth, I neede…
If Claude writes the code, what makes me still a developer? (betweentheprompts.com via hn)
If Claude Writes the Code, What Makes Me Still a Developer? Published on It’s been three months since I last wrote a line of code.