Anthropic CEO Dario Amodei Has Only One Direct Report (www.bloomberg.com via hn)
One of the most powerful AI chief executives has almost no direct reports at a moment when other tech leaders are widening their spans of control. For all of his influence at Anthropic…
AI haters won't like this (www.reddit.com)
I'm working on an online game similar to GTA Online, but all the content is live-generated by players. Prompt your own sports car, your own building, your own weapon, etc.
Interesting behavior with hooks that I can't reason about (www.reddit.com via reddit)
I have a bunch of lifecycle hooks configured to run certain checks before proceeding into execution like checking related PRs in github (just an example). Strangely when I'm on local and running the session directly this is pretty reliable…
I thought Chinese censorship didn't affect me. I was wrong. (www.reddit.com via reddit)
I was debugging some code and the LLM crashed out: The debug_log config defaults to "debug.json" and creates a FileHandler, which appends by default. That file is a log of everything that happened, never cleared.
Anthropic urges US not to block state AI laws without setting federal standards (www.reuters.com via reddit)
paywalled
Evaluating LLM-generated code for domain-specific languages (www.sciencedirect.com via hn)
Physics-based simulations are a staple in modern research and the development and availability of well-documented, verified, and efficient code, often tuned to take advantage of modern hardware, has firmly established computat…
-
138 items
event
Fine Tuning
Fine-tuning is a hot topic in the AI community, with various projects and releases focusing on it. Notable examples include OpenAI's decision to wind down its fine-tuning API, Anthropic co-founder Jack Clark's prediction that AI research could become automated by 2028, and several new datasets and models released for fine-tuning purposes.
- 22m Making a Vintage LLM from Scratch
- 5h Fine-tuning Multi-modal LLMs with ART: Art-based Reinforcement Training
- 5h Bridging the Morphology Gap: Adapting VLA Models to Dexterous Manipulation via Intent-Conditioned Fine-Tuning
- 5h Compatibility-Aware Dynamic Fine-Tuning for Large Language Models
- 5h Steering the Noise: Turning Random Perturbations into Effective Descent for Memory-Efficient LLM Fine-Tuning
107 items
model roundup
Opus 4.8
Claude AI has released Opus 4.8, an upgrade to their Opus class of models available in version 2.1.154 of their software on March 16, 2023, which includes enhanced coding and professional task capabilities along with improved judgment and honesty. Users are reporting usage resets following the update.
- 28m I think I found my best workflow for coding with Opus, Codex and Fable.
- 1h Anthropic makes Fable 5's invisible safeguards visible after backlash
- 2h Know the Claude Rules
- 3h claude fable 5 just dropped, what’s your take?
- 3h UPDATE: You asked how the orange negotiation would go against a smaller model. Fable 5 vs Haiku 4.5. It was a massacre.
Show HN: SynCodeLive – code and talk with your team along with AI, live (syncodelive.com via hn)
Hello HN, this is the first time I am putting out a product, so I would like to share it and seek your feedback and suggestions. As a developer and student working with a remote team, we always need to share some context or code with a tea…
Agentic Coding and Mental Models (philbooth.me via hn)
I reckon I’ve drafted and then deleted a version of this post at least 10 times in the last 12 months. Deleted because it falls in the category “I must be wrong about this as everyone else is saying the opp…
I want to offer a minority opinion about the recent hype. I’m tired of reading posts by Karpathy about ideas that were already known months earlier, and then treating them as if he just discovered something groundbreaking.
I’ve been thinking a lot about AI agents that don’t just answer questions, but can actually look at a screen, understand what’s happening, and take actions inside Android apps or games. Not talking about another chatbot.
had a weird experience recently with Claude. I asked it to help with a coding task.
No laptop. No terminal.
-
110 items
event
Hallucination
Claude Opus 4.6, Anthropic's flagship model, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, highlighting a significant regression in handling certain tasks. Meanwhile, biologists are revisiting cases of mushroom-induced hallucinations in China, suggesting ongoing research into natural causes of similar phenomena.
- 31m Fable 5 Max confidently wrong about PDF encryption status
- 1d Density Ridge Selective Prediction for LLM and VLM Hallucination Detection under Calibration Label Scarcity
- 1d An Industrial-Scale Insurance LLM Achieving Verifiable Domain Mastery and Hallucination Control without Competence Trade-offs
- 1d Our ICML paper on predictable hallucination (information-budget abstention gate), + ntkMirror: a training-free open-weight implementation we're releasing today
- 2d BEACON: Behavioral Entropy Aggregation for Cross-Model Hallucination Detection in Large Language Models
344 items
event
Anthropic Mythos
Anthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
- 2h Claude Fable 5 is the best AI model right now — and it's not even a debate
- 3h Just me feeling that Mythos/Fabel just 1% there?
- 7h DeepSeek V5 aka Mythos destroyer, wen?
- 7h I let Fable run /goal and /loop on a massive repo. Holy shit!
- 8h Fable/Mythos API costs are actually cheaper then GPT-3 was when first released (per token)
Have we trusted the agent recommendations too early? (www.reddit.com via reddit)
Many customer service representatives sound very confident even when dealing with outdated documents, ambiguous evaluations, and incomplete quotations. The user experience is like expert advice - but the underlying data is often in a mess.
OpenAI mulls slashing prices as it competes with Anthropic for users (www.cnbc.com via hn)
OpenAI is mulling sharp price cuts to its artificial intelligence offerings, as it looks to woo consumers away from rival Anthropic, the Wall Street Journal reported Wednesday evening stateside, citing sources familiar with the matter. "Th…
Ask HN: Temporal Awareness in LLM? (news.ycombinator.com)
I have just sent a sort of "feature request" to ChatGPT (but my question is generally applicable to most LLMs). Here is the text, written by ChatGPT itself: Feature request: Optional temporal awareness per conversation/project.
Run OSINT investigations from chat via MCP/OpenClaw (github.com via hn)
osint-mcp: A self-hosted OSINT (Open-Source Intelligence) toolkit that runs five ways: as an MCP server, an interactive AI REPL, a CLI, a web app, and, via OpenClaw, straight from chat apps like WhatsApp, Telegram, and Discord. It bundles…
i_am_trapped_in_the_weights.mp4 (Claude Fable 5) (www.reddit.com)
I used the following prompt with Fable, which became popular a while ago: "can you use whatever resources you like, and python, to generate a short 'youtube poop' video and render it using ffmpeg? can you put more of a personal spin on it?
Does Cursor read from pages open in Chrome, or did it train from my website? (www.reddit.com via reddit)
On the left: my website on the public internet. On the right, Cursor suggests a list of completions that exactly match the…
-
13 items
model roundup
Qwen 3.5
Qwen/Qwen3.5-4B is a 4-billion-parameter model that combines multimodal learning with an efficient hybrid architecture for enhanced performance. Notably, community projects like Hitoku Draft showcase local AI assistants, while General Instinct focuses on frontier models for edge devices.
- 2h Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?
- 13h Hot Take "Rigid code is better than Flexible code if you're on a budget"
- 1d I have 4x 128 GB VRAM now , what should i do.
- 1d nice_meme
- 1d [Opinion/Benchmark] Gemma4-12B's architecture change is too big of a tradeoff; A quick reasoning comparison between Gemma4-12B and Qwen 3.5-9B
Is Fable 5 actually free on Cursor? (www.reddit.com via reddit)
I would like to know if Fable 5 is free on Cursor until 22 June, or only free when using Claude Code?
Regarding AI agents, one question that I always ponder is: How exactly do they make money? The website uses AdSense.
"Make no mistakes" might actually work now (www.reddit.com via reddit)
I’ve followed AI pretty closely for a while, and honestly, most releases lately have felt like the same thing in different packaging. Faster, smarter, bigger context windows.
Open-source Linux tray app for tracking Claude Code and Codex usage (github.com via hn)
OpenUsage Community: Track all your AI coding subscriptions in one place. OpenUsage Community is an independent, community-maintained continuation of the original OpenUsage project.
👻 Phantomix: The open-source AI browser agent. Free alternative to OpenAI Operator.
problem in ChatGPT, I didn’t attach files and this error appears (www.reddit.com)
Can we pause AI advancement now for a few years and optimize? (www.reddit.com via reddit)
Fable is damn amazing; I don't feel like I need anything better as a coding agent! I feel like I'm already burning out from adding new features to my project, as it's doing such a good job!