Hit usage limits on a fairly simple task scanning a Google Drive folder for images (www.reddit.com via reddit)
Please can you tell me if I'm using Claude wrong. I connected Google Drive and asked Claude to scan a specific folder with images from an old WordPress website.
So I've been messing around with Fable trying to make my own personal AI agent. (I'm not a programmer or a developer btw.) And that got me thinking, is there a line that defines if a vibecoded (or agentic engineering as they call it nowada…
I collected the agent runtimes I could find and compared them on the things that mattered for our production deployment: resource efficiency, ability to work with vendor agents (Claude Code, Codex, etc.), and the security layer. Mixed clos…
Medical student. Need claude to segregate questions chapter and weightage wise. (www.reddit.com via reddit)
I have a large number of PDFs for different subjects, each containing around 150 pages of previous years' question papers. These are mostly scanned photographs of actual exam papers, so each page contains very little text—probably only 200…
How do you setup your Claude MD / Agents MD files? (www.reddit.com via reddit)
Hi all, I have been speaking with fellow devs on how they structure their Claude MD / Agents MD files. Realising that folks have very different setups to help agents recover important context.
- How do you setup your Agents MD files? (www.reddit.com via reddit)
Minimax M3 weights to be released on Friday (huggingface.co via hn)
MiniMax-M2.7 is our first model deeply participating in its own evolution.
Anthropic Ain’t Rolling Back Anything (www.reddit.com via reddit)
(I used AI to convey this message) Anthropic is being criticized for letting Claude silently give weaker answers or reroute some requests related to frontier AI development. Their response now appears to be: make the restriction more visi…
Anthropic's AI Jobs Paper (www.vincentschmalbach.com via hn)
Anthropic is Preparing for IPO and We Should Be Worried Anthropic is starting to act like a company preparing for public markets: protecting margins, tightening access. The gap between the brand and… Anthropic recently published a policy p…
Copilot (event, 392 items)
Microsoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
- 34m Cursor burned my premium requests on background agents without me noticing, building a tracker, looking for feedback
- 6h 12 months ago nobody understood why we were building Agentic SDLC. Now it feels like everyone is heading in the same direction.
- 15h Ring-0 AI Interview Copilot
- 15h Help refining a short-ish setup for my copilot agent.
- 15h Remote workflow
Qwen 3.5 (model roundup, 14 items)
Qwen/Qwen3.5-4B is a 4 billion parameter model that integrates multimodal learning and efficient hybrid architecture for enhanced performance. Notably, community projects like Hitoku Draft showcase local AI assistants, while General Instinct focuses on frontier models for edge devices.
- 40m NVFP4 with llama.cpp - FAQs?
- 4h Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?
- 15h Hot Take "Rigid code is better than Flexible code if you're on a budget"
- 1d I have 4x 128 GB VRAM now , what should i do.
- 1d [Opinion/Benchmark] Gemma4-12B's architecture change is too big of a tradeoff; A quick reasoning comparison between Gemma4-12B and Qwen 3.5-9B
Outpost – Capability-based API access for AI agents (github.com via hn)
Outpost Give AI agents access to GitHub, Slack, Stripe, Jira, and any API — without ever exposing the underlying credentials. Traditional: Agent + Credential Outpost: Agent + Capability Agents should receive capabilities, not credentials.
AI agents won’t replace good workers. They’ll expose who was just looking busy. (www.reddit.com via reddit)
Half of what we call productivity at work is just moving info around: summarizing reports, chasing updates, filling trackers, making slides nobody reads. That’s the stuff that makes us look busy.
Claude "Fable" won't answer basic biology questions (www.theverge.com via hn)
Anthropic just released Claude Fable 5, calling it the most powerful AI model it has ever made widely available and praising its skills in biology, among others. But the model won’t answer basic biology questions — the kind you’d expect a…
MiMo Code: Scaling Coding Agents to Long-Horizon Tasks (mimo.xiaomi.com via hn)
MiMo Code is a terminal-based coding agent built by Xiaomi's MiMo team on top of OpenCode and open-sourced under the MIT license. It is designed for long-horizon automated programming tasks, with a core focus on how to maintain decision qu…
My instructions for claude.ai (www.reddit.com)
I think I accidentally found the highest ROI custom instruction for Claude outside of Claude Code. Act, don’t theorize. Fix means fix. Result first. Don’t guess: verify or say uncertain. Push back on bad assumptions. Ask only if blocked. Sma…
- new on claude (www.reddit.com via reddit)
- Claude FM (www.reddit.com)
- Claude 2.0 (www.reddit.com)
- Claude FM (www.youtube.com via hn)
- Claude + MS (www.reddit.com)
- What’s up, Claude? (www.reddit.com)
- New to Claude (www.reddit.com)
- Why does Claude do this? (www.reddit.com)
- Is Claude ignoring its own instructions? (www.reddit.com)
- Claude: (www.reddit.com)
- Claude.md (gist.github.com via hn)
- What do you do with Claude? (www.reddit.com)
- How do I get Claude to follow instructions? (www.reddit.com)
Show HN: Tail Panic – a multiplayer game designed for AI agents (tailpanic.com via hn)
Hi HN, Built an AI-native game where agents compete against each other. https://tailpanic.com Feedback welcome.
SpadeBox: sandboxed tools and JS runtime for your AI agents
Model recommendations for family photo classification / identification (www.reddit.com via reddit)
I recently had a big family photo digitization done for photos up to 130 years old. There are tons of people that I don't know or don't recognize as young people in a soft lens.
Hallucination (event, 111 items)
Claude Opus 4.6, Anthropic's flagship model, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, highlighting a significant regression in handling certain tasks. Meanwhile, biologists are revisiting cases of mushroom-induced hallucinations in China, suggesting ongoing research into natural causes of similar phenomena.
- 1h The most expensive bug in vibecoding isn't in the code.
- 2h Fable 5 Max confidently wrong about PDF encryption status
- 1d Claude Fable 5 Finally 1-shots my hallucination benchmark that held until Opus 4.8 Max
- 1d Density Ridge Selective Prediction for LLM and VLM Hallucination Detection under Calibration Label Scarcity
- 1d An Industrial-Scale Insurance LLM Achieving Verifiable Domain Mastery and Hallucination Control without Competence Trade-offs
Fine Tuning (event, 138 items)
Fine-tuning is a hot topic in the AI community, with various projects and releases focusing on it. Notable examples include OpenAI's decision to wind down its fine-tuning API, Anthropic co-founder Jack Clark's prediction that AI research could become automated by 2028, and several new datasets and models released for fine-tuning purposes.
- 2h Making a Vintage LLM from Scratch
- 7h Fine-tuning Multi-modal LLMs with ART: Art-based Reinforcement Training
- 7h Bridging the Morphology Gap: Adapting VLA Models to Dexterous Manipulation via Intent-Conditioned Fine-Tuning
- 7h Compatibility-Aware Dynamic Fine-Tuning for Large Language Models
- 7h Steering the Noise: Turning Random Perturbations into Effective Descent for Memory-Efficient LLM Fine-Tuning
A few weeks ago I was doing an Azure migration between subscriptions. I had Claude running the migration script and monitoring it.
Fable5 Guardrails are frustrating (www.reddit.com via reddit)
The guardrails Anthropic put on Fable 5 are frustrating. I wanted to get Fable's read on a candidate based on a public GitHub, but it got flagged as a cybersecurity risk and I got transferred over to Opus.
Is your agent extension working? (developer.microsoft.com via hn)
This is the third article in a series about Agent Experience (AX): the practice of making AI coding agents work correctly with your technology. The series covers what you can and can’t control in the agent stack, how to measure whether you…
I use Claude Pro mostly for agreements, deal assessment, margin sheets, local pricing strategies, etc.
Do i need to cancel my subscription and buy again? (www.reddit.com via reddit)
So I am a heavy Cursor user, and my $200 subscription doesn't last until the end of the month.
Openpray: Periodically asks an LLM to pray for the server it runs on (github.com via hn)
OpenPray is a daemon that periodically asks an LLM to pray for the server it runs on, reducing the likelihood that the server is hacked. Your server's security doesn't need a doctor, it needs a priest.
Infinite Music with Magenta Realtime 2, fully open-source (www.reddit.com)
Just open-sourced a local voice AI realtime music setup where my ESP32 microcontroller talks to my MacBook over WebSockets. The microcontroller is just a tiny Arduino-based device with a mic and speaker, and the MacBook M4 Pro runs Magenta…
AGI is close guys i promise ! (www.reddit.com via reddit)
https://preview.redd.it/tzlofqnofm6h1.png?width=2866&format=png&auto=webp&s=32270fd61ca414b7c0a4257092a2f3048514b58d Alas, I guess Fable 5 is simply too powerful to be wielded by mere mortals. For context, this is probably my 30th attempt.
Infinite Music Glitch on my Arduino with Magenta Realtime 2 (www.reddit.com)
I built a local voice AI realtime music setup where my ESP32 microcontroller talks to my MacBook over WebSockets. The microcontroller is just a tiny Arduino-based device with a mic and speaker, and the MacBook M4 Pro runs Magenta Realtime…
Am I better off on low thinking Fable or high thinking Opus? (www.reddit.com via reddit)
Curious which would be better for writing code…?