OpenAI mulls slashing prices as it competes with Anthropic for users (www.cnbc.com via hn)
OpenAI is mulling sharp price cuts to its artificial intelligence offerings, as it looks to woo consumers away from rival Anthropic, the Wall Street Journal reported Wednesday evening stateside, citing sources familiar with the matter. "Th…
OpenAI says Chinese accounts tried to turn Americans against data centres (www.engadget.com via hn)
OpenAI says fake accounts from China tried to turn Americans against data centers The company has published a report about China-linked influence campaigns that used ChatGPT. OpenAI has published a report about ChatGPT users, who it says w…
👻 Phantomix The open-source AI browser agent. Free alternative to OpenAI Operator.
Show HN: A text based browser, written in Rust, for humans and agents (github.com via hn)
WebCLI Releases Public release artifacts for WebCLI are published here by the private source tree release workflow. see more: https://webcli.sh Install: curl -fsSL https://webcli.sh/install.sh | bash Windows PowerShell: irm https://webcli.…
CommBench: Can LLMs Write Correct and Efficient GPU Communication Code? (uccl-project.github.io via hn)
CommBench: Can LLMs Write Correct and Efficient GPU Communication Code? By: Shuang Ma, Yuyi Li, Yihan Zhang, Danyang Chen, Shuyang Ji, Ziming Mao, Cheng Ji, Ansha Prashanth, Wenting Yang, Yiran Wang, Chihan Cui, Peiyu Lin, Amanda Raybuck,…
Frontier: A Discrete-Event Simulator for Modern LLM Serving (github.com via hn)
Frontier A Discrete-Event Simulator for Modern LLM Serving Latest News 🎯 📍[2026/06] Initial version released, with support for co-located serving and modern optimizations. Support for disaggregated serving will be available soon.
-
13 items
model roundup
Qwen 3.5Qwen/Qwen3.5-4B is a 4 billion parameter model that integrates multimodal learning and efficient hybrid architecture for enhanced performance. Notably, community projects like Hitoku Draft showcase local AI assistants, while General Instinct focuses on frontier models for edge devices.
- 5m Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?
- 11h Hot Take "Rigid code is better than Flexible code if you're on a budget"
- 1d I have 4x 128 GB VRAM now , what should i do.
- 1d nice_meme
- 1d [Opinion/Benchmark] Gemma4-12B's architecture change is too big of a tradeoff; A quick reasoning comparison between Gemma4-12B and Qwen 3.5-9B
340 itemsevent
Anthropic MythosAnthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
AI researcher claims he's bypassed Anthropic's Fable 5 guardrails (cointelegraph.com via hn)
Pliny demonstrates a path to meth synthesis by asking about the Birch reduction method. Source: Pliny Anthropic’s Fable 5 has prompted backlash from critics since its launch due to its heavy restrictions.
AMD R9700 vs GB10 (www.reddit.com via reddit)
I have a budget of 5K, and want to buy some gpus my requirement is 48gb+ vram, because I finetune small language model, perform DPO, in general tinkering/ development is my usecase. if you where in my shoe which among these would you get,…
"Trust Us" Is Not a Control Surface: Anthropic and the Case for Open Weights (trust-us.vercel.app via hn)
What one week in June told us about who plans to own AI, and why open models are the only way out I run a small mortgage company in Georgia. I am not an AI researcher.
Anthropic Walks Back Policy That Could Have 'Sabotaged' Researchers Using Claude (www.wired.com via hn)
vaportrail Your agents leave trails. vaportrail reads them.
What's the best way to learn RAG for real-world applications? (www.reddit.com via reddit)
I've noticed many AI courses explain vector databases but not complete RAG systems. The Knowledge Base RAG module on SimplAI University appears to focus on building retrieval-powered AI experiences.
-
360 items
event
SecurityOpenAI has released GPT-5.4-Cyber for testing as part of its Trusted Access for Cyber Defense program, aiming to compete with Anthropic's Claude Mythos in the cybersecurity domain. Meanwhile, concerns are rising over the potential risks associated with advanced AI models like Mythos, prompting calls for improved defenses before wider releases.
- 43m Claude Code filled almost my entire SSD with random nonsense overnight
- 3h JailbreakOPT: Tool-Assisted Iterative Jailbreak Prompt Optimization
- 3h Grammar-Constrained Decoding Can Jailbreak LLMs into Generating Malicious Code
- 3h Learning to Inject: Automated Prompt Injection via Reinforcement Learning
- 3h Are Frontier LLMs Ready for Cybersecurity? Evidence for Vertical Foundation Models from Dual-Mode Vulnerability Benchmarks
105 itemsmodel roundup
Opus 4.8Claude AI has released Opus 4.8, an upgrade to their Opus class of models available in version 2.1.154 of their software on March 16, 2023, which includes enhanced coding and professional task capabilities along with improved judgment and honesty. Users are reporting usage resets following the update.
- 1h UPDATE: You asked how the orange negotiation would go against a smaller model. Fable 5 vs Haiku 4.5. It was a massacre.
- 3h Tell HN: Anthropic's Fable model is too expensive
- 7h Thanks Fable-5, I'm Flattered
- 10h It blocked us at 'hello!' Anthropic Fable 5 refusing innocuous prompts
- 10h Critique my prompt
This trend is starting to genuinely worry me. Heard it from a couple friends in totally different industries, seen a few posts about it lately.
Claude is almost unusable for math (www.reddit.comhttps)
When I ask fable a hard math question it thinks for a while and then stops saying the message is too long.. I can press continue and it will then seemingly start again and this repeats.
Terms of Service Ban AI Agents from Using Stack Overflow for Agents (meta.stackoverflow.com via hn)
This question shows research effort; it is useful and clear -17 This question does not show any research effort; it is unclear or not useful Save this question. Show activity on this post.
Antrophic and I scammed myself (www.reddit.com via reddit)
https://preview.redd.it/952yfte7el6h1.png?width=1478&format=png&auto=webp&s=929b690a25390dad2ec244eff2c66d421d290fb7 I am Biology-Teacher. You know where this is going.
Agentic Frameworks (astledsa.substack.com via hn)
Agentic Frameworks Or different ways to make LLM API calls The agentic framework research has produced some very interesting results; from different topologies to different ways of using tool-calls, it has been one of the most fascinating…
- Agentic AI frameworks (www.reddit.com)
- Agentic RAG Frameworks (www.reddit.com)
- An iOS Adaptater for Agentic Frameworks (onepilotapp.com via hn)
+2 more
- What is agentic AI (www.reddit.com)
- Best production agentic frameworks (www.reddit.com)
Claude-based demo builder built with /goal (www.reddit.com via reddit)
Repo: https://github.com/iamneilroberts/cueframe Example: https://demo.voygent.ai/ I was working on my web app personal project and trying to create some kind of demo animation or video...or something. I tried several apps, and it just was…
-
30 items
model roundup
Opus 4.7Anthropic has released Claude Opus 4.8, an upgrade over 4.7 with enhanced judgment and independence. Meanwhile, a new benchmark called The Singularity Gate tests AI models like Opus 4.7 and GPT-5.5 for their ability to predict scientific discoveries beyond their training data.
- 56m Composer 2.5 is phenomenal. So is cusror 3.0
- 17h It's such a nice change of pace to see the sub full of praise like after Opus 4.5
- 17h What happens after June 22?
- 19h Show HN: Apodex-1.0-H – Beats Claude-Opus-4.7 on deep research (90.3 BrowseComp)
- 1d Spent a whole weekend convinced Opus 4.7 had gotten worse. It was my MCP setup the entire time.
77 itemsmodel roundup
Gemma 4Gemma 4 is a family of open-source multimodal models from Google DeepMind, including sizes up to 31B parameters and featuring Dense and Mixture-of-Experts architectures. Notable community highlights include the release of Gemma 4 12B as an encoder-free unified model for laptops, its availability via llama-server on a RTX 5070 Ti GPU, and detailed visual guides showcasing its capabilities.
- 1h Is Qwen 3.6 27B IQ4XS better than Gemma 4 31B QAT as a Hermes agent?
- 3h nvidia/diffusiongemma-26B-A4B-it-NVFP4 · Hugging Face
- 4h Monitor your screen using local LLMs with only one sentence! Free, Open Source and Local.
- 7h LLMs and tabletop games
- 11h Are these quants of QAT better than non-QAT? What do I use?
TokenPulse – Live token and rate limit tracker for Claude and ChatGPT (chromewebstore.google.com via hn)
Overview Live token & rate limit tracker for ChatGPT and Claude. Context window bar, usage rings, notifications.
Is Claude accurate and good for calorie counting or a meal plan ai? (www.reddit.com via reddit)
In my opinion, I've tried Claude yesterday, and overall the recipes it gives are amazing, it even gives a timer for you to set up. I know that some apps already exist with ai calorie counters but I'm trying to find free and actual accuracy…
fableExpectations (www.reddit.com via reddit)
https://preview.redd.it/2o426zap9l6h1.png?width=1080&format=png&auto=webp&s=169e2d511bbf4c4b08a155775d94b0e9f3f931a5 Claude Fable is incredible It one-shotted my usage limits in 1 prompt
Unstable Cursor Hooks (www.reddit.com via reddit)
Hey! I wonder if anyone else is building automations based on the Cursor hooks and finding that from version to version some of them just stop working.
See what your AI coding agent is doing with Datadog Lapdog (chrisebert.net via hn)
See what your AI coding agent is doing with Datadog Lapdog Datadog Lapdog is a free tool that gives you real-time visibility into what your AI coding agents are doing. Here's how to install it, pair it with Claude Code, and drill into a re…
China's Xiaomi MiMo Is Now 15X Faster Than ChatGPT and Claude (decrypt.co via hn)