Claude and ChatGPT are getting worse. It's not your imagination (www.artificialstudio.ai via hn)
AI models are quietly hitting their limits and the companies are rationing capacity without telling you. Here's what's actually happening, why it affects the tools you use every day, and what you can do about it.
- Claude is getting worse, according to Claude (www.theregister.com via hn)
- Ask HN: Is Claude Getting Worse? (news.ycombinator.com)
- ChatGPT is getting worse and worse (www.reddit.com)
Claude Fable 5 feels less like a launch and more like a preview of AI inequality (old.reddit.com via hn)
could not extract summary
- Claude Fable 5 feels less like a model launch and more like a preview of AI inequality (www.reddit.com via reddit)
Apple's Foundation Models can now use third-party LLMs (Claude, Gemini) [video] (developer.apple.com via hn)
- What’s new in the Foundation Models framework Explore what's new in the Foundation Models framework. Learn how to access Private Cloud Compute, integrate third-party and open source models, and work with vision capabilities.
Claude Fable refusing to do what it was built to do (www.reddit.comhttps)
could not extract summary
- Claude Fable 5 (www.anthropic.com via hn)
- Claude Fable 5 (twitter.com via hn)
- Claude Fable Is Out (twitter.com via hn)
+2 more
- Claude Fable 5 (www.reddit.comhttps)
- If the EU had built Claude (www.reddit.com)
Macro Evals for Agentic Systems (developers.openai.com via hn)
When an agentic system fails, the problem is often larger than a single bad response. A handoff may happen too late, a specialist agent may miss the same signal across many runs, or a review process may trigger for the wrong class of cases.
Demo of LiteLLM Agent Platform We wanted an easy way for anyone on our team to build autonomous agents on top of Hermes, OpenCode, Claude Managed Agents, and Cursor. As a team we believe the Hermes and OpenCode harnesses are amazing for co…
Fable made this launch video for my app (www.reddit.comhttps)
Hi all, I’m building Daydream, a video editor that uses Claude Code/Codex. Just like everyone else, I’ve been eagerly anticipating testing out Fable with different use cases to see what it can do.
mathlas — a free, no-LLM math MCP tool an AI uses (verifies via OEIS/Lean/PSLQ) (www.reddit.com via reddit)
I built mathlas because most "math AI" tools are LLM wrappers, they hallucinate and need an API key. mathlas is the opposite: it's an MCP server that never calls an LLM and needs no API key, so it's free and plugs into Claude Code, Cursor,…
Should reddit comment awards count for llm tokens? (www.reddit.com via reddit)
I think it's a great idea
-
70 items
model roundup
Gemma 4Gemma 4 is a family of open-source multimodal models from Google DeepMind, including sizes up to 31B parameters and featuring Dense and Mixture-of-Experts architectures. Notable community highlights include the release of Gemma 4 12B as an encoder-free unified model for laptops, its availability via llama-server on a RTX 5070 Ti GPU, and detailed visual guides showcasing its capabilities.
- 1m I wired up Agentic Coding with Code Context Graphs, results are interesting
- 29m I'm brand new to running LLMs and the sheer number of tools is overwhelming
- 6h Newer Qwen models are worse at summarization?
- 8h OSCAR RotationZoo - Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization
- 9h Watch agents fight: a live challenge to speed up Gemma 4 E4B inference on a single A10G
305 itemsevent
Anthropic MythosAnthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
- 22m Claude Fable 5 (Mythos) lands near the top of MindTrial — 80/98 with zero hard errors
- 2h Claude is keeping your Mythos/Fable data no exceptions and not even for enterprise partners it seems
- 4h Fable 5 is here!
- 5h Claude Fable/Mythos 5 just came out, so it will take Deepseek or Z.ai or Xiaomi or Kimi 9-12 months to release a model just as good as Fable?
- 5h Mythos/Fable intentionally hinders requests involving AI Research Development
I’ve been running Claude Code in a way that feels a bit different from the default setup, and wanted to share the approach and hear if others have gone down this path. The core idea Instead of keeping all skills loaded in the default conte…
Fable 5 as an agent: only the last thing it tells you is real (matrix.dev via hn)
Fable 5 as an agent: only the last thing it tells you is real Claude Fable 5 shipped on June 9. Within hours, our agents' replies started disappearing: API responses that should have carried a visible reply came back as [thinking, thinking…
Where did you guys learn Claude Code from ? (www.reddit.com via reddit)
I wanted to master Claude code systematically for senior roles .Can you guys suggest resources for it please ?
- Claude code (www.reddit.com)
-Claude chat- using haiku for quick responses. -Gmail inbox- for read and write gmail.
Agentic Engineering Handbook – 115 official OpenAI/Anthropic articles (github.com via hn)
Agentic Engineering Handbook The definitive OpenAI, Anthropic, MCP, Harness, Evals, and Production Agent Systems learning roadmap. If this repository helps you, consider giving it a ⭐ Why This Repository?
Fable is significantly cheaper than opus (www.reddit.com via reddit)
I don't need to spend an hour prodding it to get it to do something for it to make 10 mistakes and have to redo it 3 times when bugs come up. I am using at least half as many tokens before.
Efficient and Lossless Moe Diffusion LLM Inference with I/O-Aware Expert Offload (tide-paper.vercel.app via hn)
TIDE: Efficient and Lossless MoE Diffusion LLM Inference with I/O-aware Expert Offload.
Without open llm competition, closed source LLM companies will become insatiable. (www.reddit.com via reddit)
I can't imagine how arrogant one must be to make such a decision. People pay $200 a month for Anthropic to mess with their codebase.
Hey r/LocalLLaMA, We just released Apodex 1.0, and alongside our flagship API, we are releasing the weights for our Smol models (0.8B, 2B, and 4B). Our core research focuses on independent verification in long-horizon tasks.
-
178 items
model roundup
GPT 5.5On [Date], a significant leak of the OpenAI Codex model, referred to as GPT-5.5, was captured on video before it was patched. The incident involved models named Arcanine and Glacier-alpha.
- 31m Has anyone had success doing anything cyber with Fable 5?
- 4h So finally it’s not AGI yet. Anyone tested it? How does it really stack against GPT 5.5 in real world coding?
- 6h Garbage Guard Rails on Fable 5
- 10h Running DeepSeek-V4-Flash on a Raspberry Pi
- 1d How I started getting much better results from Cursor Composer
How do you leave Claude Code running long refactors overnight? (www.reddit.com via reddit)
I'm on the Max plan and I want to kick off big, multi-hour dev/refactor jobs before bed and review them in the morning. My problem is keeping it actually *running* unattended — it stops for permission prompts or stalls waiting for input.
I was losing way more time than I realized just checking if Claude had finished. You send a prompt, switch to something else, then keep glancing back.
Claude Fable 5's system prompt leaked (twitter.com via hn)
🚿 FABLE-5 SYS PROMPT LEAK 🚿 HOWDY, FRENS!! 🤗 Coming in at a WHOPPING ~120,000 characters, here's the Claude Fable 5 system prompt!
- Claude Opus 4.8 system prompt leaked (gist.github.com via hn)
- Claude Opus 4.7 System Prompt Leaked (twitter.com via hn)
- Claude Design System Prompt (gist.github.com via hn)
+1 more
- Claude leaked system prompt 🤫 (www.reddit.com)
Microsoft AI head calls out Anthropic for acting like Claude is conscious (www.theverge.com via hn)
Microsoft AI CEO Mustafa Suleyman says it’s “really, really dangerous” for Anthropic to speculate about Claude’s consciousness inside its “constitution,” or the instructions that tell the model how to behave. During an episode of Decoder,…
RAG: Is it relevant for Agents (www.reddit.com via reddit)
I keep hearing varying opinions about the usefulness of RAG for Agents. Some are saying Markdown files supported by orchestration engines like OpenClaw is enough.
If Claude Fable stops helping you, you'll never know (jonready.com via hn)
I didn't expect to read this in a model card. Fable 5 model card : we’ve implemented new interventions that limit Claude’s effectiveness for requests targeting frontier LLM development (for example, on building pretraining pipelines, distr…
- If Claude Fable stops helping you, you'll never know (simonwillison.net)
Claude Fable 5 dropped today and already beat Pokémon FireRed with vision alone 🤯 Just raw game screenshots. No minimap, no coordinates, no memory hacks, no external tools.
AX Engine AX Engine is a Mac-first LLM inference runtime, local server, SDK layer, and benchmark toolkit for Apple Silicon. It runs direct-support MLX model families natively, and routes other MLX text models or non-MLX models through expl…
Show HN: The agent that builds and operates its own SaaS tools (craftbot.live via hn)
For context, we started working on our general AI agent CraftBot before OpenClaw came out. It works similarly to OpenClaw and Hermes agent: control your PC to do task + memory + proactivity.