I think Claude just had a stroke (www.reddit.com)
I was trying to get it to help me with a crossword answer. Usually it does great but… I have no idea what happened here.
GPT Image Generation Models Prompting Guide (developers.openai.com via hn)
1. Introduction OpenAI’s gpt-image generation models are designed for production-quality visuals and highly controllable creative workflows.
- GPT-5.5 Prompting Guide (simonwillison.net via hn)
Anthropic created a test marketplace for agent-on-agent commerce (techcrunch.com via hn)
In a recent experiment, Anthropic created a classified marketplace where AI agents represented both buyers and sellers, striking real deals for real goods and real money. The company admitted this test — which it called Project Deal — was…
Show HN: A Calendar for Songs (theyearinsongs.com via hn)
Hey, I built this website for fun, I thought it would be a cool topic. I came across many posts trying to build a compilation like this in many forums and social networks and thought it would be fun to put together a site for exactly this.
I asked GPT Image 2.0 for a funny meme. (www.reddit.com)
could not extract summary
-
123 items
model roundup
Gemma 4Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.
- 1m Gemma 4 Folks
- 1h Speculative decoding with Gemma-4-31B + Gemma-4-E2B enables 120 - 200 tok/s output speed for specific tasks
- 6h Benchmark: Windows 11 vs Lubuntu 26.04 on Llama.cpp (RTX 5080 + i9-14900KF). I didn't expect the gap to be this big.
- 10h Best settings for gemma-4 on a 3090?
- 22h Three lessons from fine-tuning a 5B code assistant — bad outputs from 5% → 0%
200 itemsmodel roundup
Opus 4.7Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 7m Tell HN: Claude Code is unable to respond to this request
- 48m Claude Opus 4.6 vs. Opus 4.7 Effort Levels and Prompt Steering Benchmarks
- 2h Ask HN: Has Claude Opus 4.7 nerfed?
- 6h I built an AI-native freelance platform with Claude, blockchain escrow, real-time chat, and progressive trust
- 7h Claude Code + Opus 4.7 appears to serialize independent file reads, causing the higher token usage than Opus 4.6
Can I enable thinking some other way? (www.reddit.com)
After upgrading to gpt plus from standard, the thinking mode disappeared and got replaced with deep research quizzes etc. Now whenever I need it to think I have to type "Think and give me a detailed answer with your reasoning" etc.
I'm a product designer using cursor at work to help the devs in QA-ing their tickets. Usually I load up the branch and work on UI changes using the Design Mode.
Posts here about burning through credits fast and choosing between different model tiers keep making me think the real question is no longer just “which model is smartest?” In Cursor, the practical pain often feels different. It’s the mode…
Ask HN: Where are all the consumer ChatGPT apps? (news.ycombinator.com)
For several months I've been checking https://track.appsdiscoverability.com which is tracking all the apps built on ChatGPT and Claude. I'm still surprised to see that there are barely any consumer apps.
I run three concurrent projects in different domains — operations, hardware/CV engineering, and research. The math on attention is brutal: 168 hours a week, three streams of work, one me.
I'm not a robot. Have been proving I'm human for years now. (www.reddit.com)
Logging into my Microsoft account today and it threw two different CAPTCHAs at me. Made me wonder, can AI actually beat these in 2026, or is this still hard for LLMs?
-
104 items
event
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
13 itemsmodel roundup
Claude 4.7Users of Claude 4.7 are reporting issues where the AI frequently checks for malware in their files, even during normal tasks. This behavior has been observed across multiple projects, including one using Next.js.
- 46m Claude 4.7 named a journalist from 125 words of unpublished writing
- 1d Tell HN: Claude 4.7 is ignoring stop hooks
- 4d Claude 4.7 blocks cyber prompts: before the fact vs. after the fact
- 4d non-benchmaxxed fun AI question with Terminator reference - I think Claude won
- 5d Cline and Roo Code are dying projects. Alternatives?
Solo dev, been working on this on the side during first year uni, 10/500 questions were missing context to answer and the rest were model misusing context so going to keep iterating to hit top of the leaderboard. I know its closed source s…
Show HN: A minimal context engine with streaming API (github.com via hn)
I needed a better way to create and compare prompts when using local LLMs (e.g. via Ollama) in a workflow.
Hey, Started with a simple goal: automate short-form video creation for small businesses so they don't have to hire an agency or touch any software themselves. A client fills out a form, and roughly 5 minutes later they get a branded email…
Show HN: A bilingual guide to Thaayam, a Tamil board game (amal-david.github.io via hn)
I saw The Mahjong Guide (https://themahjong.guide/) today and loved how much a good visual explanation can do for an old game. Around the same time, this HN thread about using coding assistants to revive projects you never were going to fi…
When Can LLMs Learn to Reason with Weak Supervision? (salmanrahman.net via hn)
We study when RLVR generalizes under three weak supervision settings (scarce data with as few as 8 examples, noisy reward labels, and proxy rewards such as majority vote and self-certainty) across multiple models from the Qwen and Llama fa…
Claude teases out angioedema cause (www.reddit.com)
After hernia surgery recently, my wife noticed my lips swelling and speech slurring about an hour into post-op care. Alert nurses quickly administered Benadryl and symptoms abated.
-
130 items
event
Anthropic MythosAnthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
- 59m Anthropic's Argument for Mythos SWE-bench improvement contains a fatal error
- 3h Thoughts about Moments in Claude Mythos System Card
- 11h Discord Sleuths Gained Unauthorized Access to Anthropic's Mythos
- 15h OpenMythos with Qwen2.5-1.5b weights (No recurrence atm) - looking to turn it into full OpenMythos
- 20h What would you use Claude Mythos for if you had access today?
176 itemsmodel roundup
Qwen 3.6Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 3h Qwen3.6 35B A3B Heretic (KLD 0.0015!) Incredible model. Best 35B I have found!
- 5h [Qwen3.6 35b a3b] Used the top config for my setup 8gb vram and 32gb ram, and found that somehow the Q4_K_XL model from Unsloth runs just slightly faster and used less tokens for output compared to Q4_K_M despite more memory usage
- 8h qwen3.6 27b poor experience
- 8h Vs code extension
- 12h Qwen3.6-27B-FP8 - JS file is too long and causing JSON truncation
Is Claude Design available to api users? (www.reddit.com)
could not extract summary
NARE (Non-parametric Amortized Reasoning Evolution) Deterministic routing of logic tasks via semantic compression and executable reflexes. Читать на русском языке (Read in Russian) NARE is a Skill-Based Cognitive Architecture designed to t…
src: https://modelrepublic.substack.com/p/the-reporters-at-this-news-site-are
mirollama Local-first multi-agent simulation and prediction engine. Project Origin This project is a derivative work of: Upstream: https://github.com/666ghj/MiroFish.git Target repository: https://github.com/oswarld/mirollama This reposito…
Cursor on mobile (www.reddit.com)
How do you use Cursor agents from mobile? Just fire it up from the browser every time?
How Claude Code Actually Remembers Things (www.reddit.com)
https://preview.redd.it/v9t7wx9rijxg1.png?width=3600&format=png&auto=webp&s=5f3dfe284c20b3b004a6cabcfb18a9de296ab60b -> https://ahammadnafiz.github.io/posts/How-Claude-Code-Actually-Remembers-Things/ I spent a few days reading the leaked C…