#hallucination

154 items

Multi agent systems are a total nightmare in production (www.reddit.com) +4845 9w

I’m tired of seeing these LinkedIn influencers/ YouTube gurus bragging about their 12-agent swarms. Honestly, I used to be one of them.

↯ Hallucination hallucination
Grok 4.3 achieves higher overall intelligence over 4.20 with less of a cost, at the price of slightly higher hallucination rate. (x.com via reddit) +3514 8w

xAI has launched Grok 4.3, achieving 53 on the Artificial Analysis Intelligence Index with improved agentic performance, ~40% lower input price, and ~60% lower output price than Grok 4.20 The release of Grok 4.3 places just above Muse Spar…

↯ Hallucination hallucination grok agentic
The Mushroom That Makes People Have the Exact Same Hallucination (www.vice.com via hn) +2911 8w

Biologist Colin Domnauer is reopening an old case that Chinese health officials seem to have stopped caring about. Every summer, residents of the Yunnan province check into hospitals with complaints that they’re hallucinating tiny elflike…

↯ Hallucination hallucination
Claude Opus 4.6 accuracy on BridgeBench hallucination test drops from 83% to 68% (www.reddit.com) +2815 10w

Anthropic's flagship model just took a pretty significant accuracy hit on one of the most important AI benchmarks out there. So here's the deal: Claude Opus 4.6 was recently tested on BridgeBench, which specifically measures how often AI m…

↯ Hallucination ↯ Opus 4.6 hallucination opus anthropic
The weirdest thing about AI agents is how human failure patterns start showing up (www.reddit.com) +205 7w

I wasn’t expecting this when I started building them lol but after running longer workflows for a while, agents start developing failure modes that feel strangely… human they: skip steps when under too much context pressure become overconf…

↯ Hallucination hallucination
HalBench: I built a custom sycophancy and hallucination benchmark and tested 4 frontier models (Sonnet 4.6, Grok 4.3, GPT 5.4 and Gemini 3.1 Pro), looking for input on what OSS models to run next! (www.reddit.com) +1411 5w

HalBench Results: TL;DR: I built HalBench, an open benchmark for LLM sycophancy and hallucination. 3,200 false-premise prompts × 4 models = 12,800 graded responses.

↯ Hallucination ↯ Sonnet 4.6 hallucination grok gpt-5+2
Why 80% of agentic AI demos don't make it to production (www.reddit.com) +134 5w

Agent demos are easy. Production agents are hard.

↯ Hallucination ↯ Tool Use tool-use hallucination agentic
Hallucination Is Inevitable: An Innate Limitation of Large Language Models (arxiv.org via hn) +109 7w

Hallucination has been widely recognized to be a significant drawback for large language models (LLMs). There have been many works that attempt to reduce the extent of hallucination.

↯ Hallucination hallucination
OpenBMB releases MiniCPM5-1B LLM. Currently one of the most powerful LLMs for its size. ( 17.9 on the Artificial Analysis Intelligence Index) (x.com via reddit) +82 4w

One of the more interesting things about this model is that it doesn't want to answer to more difficult questions. Though this drastically reduces hallucination rate.

↯ Hallucination hallucination
AA-Omniscience Hallucination Rate - Is it noticeable? (www.reddit.com) +83 5w

could not extract summary

↯ Hallucination hallucination
OpenAI Cooked This Week! (www.reddit.com) +620 6w

saw someone in another thread say "nothing interesting dropped this week" and i genuinely could not figure out what they were reading. the default model most people use every day just got swapped out.

↯ Hallucination ↯ GPT 5.5 hallucination gpt-5 chatgpt+1
How many e's are in the word seventeen [video] (AI hallucination) (www.youtube.com via hn) +63 7w

About Press Copyright Contact us Creators Advertise Developers Terms Privacy Policy & Safety How YouTube works Test new features NFL Sunday Ticket © 2026 Google LLC

↯ Hallucination hallucination
For Non-hallucinating work, MiMo 2.5 delivers (www.reddit.com) +67 8w

MIT license and fully open source. MiMo-V2.5-Pro was just 3 points from Opus 4.7 max and the normal V2.5 is only a step behind SOTA.

↯ Hallucination ↯ DeepSeek 4 hallucination gemma deepseek+1
Tell HN: Gemini 3.5 Flash breaks in stupid ways (news.ycombinator.com) +51 5w

I thought I was going crazy, trying to use Gemini 3.5 Flash to rate some answers, but it kept giving 7 instead of 10 for correct answers. Apparently once you add a "Grading criteria" text, the model collapses into a "compressed toward the…

↯ Hallucination ↯ Gemini 3.5 hallucination gemini
how to architect ai agents for regulatory approval? (www.reddit.com) +57 5w

spent a lot of time on agent architecture for mission critical environments. getting an agent to browse the web or draft an email is trivial compared to deploying one where a hallucination carries real legal or physical consequences.

↯ Hallucination hallucination agentic
gpt 5.5 is good but I'm having hallucination/context issues (www.reddit.com) +514 8w

I'm working on a large-ish repo (300k lines) with fairly complicated logic, and Gpt 5.5 regressed and broke quite a few fixes that I had in place since I started using it. It seems to need to compact the context more, and when it does, it…

↯ Hallucination ↯ GPT 5.5 hallucination
Your AI agent is acting on memory it can't verify. Here's what we built to fix that. (www.reddit.com) +53 9w

↯ Hallucination hallucination mcp openai
Show HN: I built an 11-LLM consensus engine to detect AI hallucination (github.com via hn) +41 7d

Multi-LLM SaaS Starter Kit The only production-ready boilerplate that ships with 14 LLM providers in semantic consensus, EU AI Act audit-grade compliance, and 13 self-evolution loops out of the box. Built on the same code that powers api.q…

↯ Hallucination hallucination
Cohere launches open weights model Command A+. Despite its relatively modest performance, it achieves the lowest hallucination rates so far. (x.com via reddit) +42 5w

Artificial Analysis on X: "Cohere launches open weights model Command A+ that achieves 37 on the Artificial Analysis Intelligence Index The release of Command A+ places @Cohere in line with Claude 4.5 Haiku on the Intelligence Index, and j…

↯ Hallucination hallucination haiku
Top Law Firm Apologizes to Bankruptcy Judge for AI Hallucination (www.bloomberg.com via hn) +41 9w

We've detected unusual activity from your computer network To continue, please click the box below to let us know you're not a robot. Why did this happen?

↯ Hallucination hallucination
This post potentially explains the current happenings to the LLMS and how their hallucination problem appears to be bigger than usual (www.reddit.com) +411 9w

So, what the above graph means that a LLM is really good at solving average problems and are great at recombining existing knowledge, so, if i ask something outside my domain of expertise, i get really good answers but as you approach to t…

↯ Hallucination hallucination
Show HN: Startup research website, crunchabse and Product Hunt and grokipedia (startupswiki.vercel.app via hn) +31 9d

Crunchbase is $49/month, which isn't great if you wanna learn and your not an angel or a VC. At the same time I stumbled on grokipedia, and though it was a stupid website.

↯ Hallucination hallucination
Hallucination Detection Comparison (blueguardrails.com via hn) +3 3w

Hallucination Detection Comparison What's the best tool for hallucination detection? We put 7 of them to the test.

↯ Hallucination hallucination
Composition Hallucinations: Not all RAG hallucinations are retrieval failures (zenodo.org via hn) +3 4w

Composition Hallucination in Retrieval-Augmented Generation: A Failure Mode and Benchmark Protocol Description Retrieval-Augmented Generation (RAG) is commonly motivated by the idea that language models answer more faithfully when relevant…

↯ Hallucination hallucination rag
Is there any <3B model with usable 200k+ context window? (www.reddit.com) +311 5w

I need a small model for processing conversation transcripts from larger models, so need usable context window out to at least 200k tokens. I know some models claim to support this, but I don’t know which are actually good at this in pract…

↯ Hallucination hallucination
Φ³−φ⁻³=4 (exact): The transformer's ff/d ratio is algebraic, not empirical (zenodo.org via hn) +31 7w

Dephaze Semantic Anchoring: A Φ³ Geometric Framework for Eliminating AI Hallucination and Ensuring Semantic Stability in Large Language Models Authors/Creators Description LLM hallucination is not a data problem. It is a geometry problem.

↯ Hallucination hallucination
Nobody agrees on what "hallucination" means and it's hit our AI PoC (www.reddit.com) +34 7w

We wrapped up a did a 120-question UAT with a CMO and his team. This is where it gets funny.

↯ Hallucination hallucination
Folie à Deux: The most dangerous hallucination is one you're inclined to believe (thebookofluke.com via hn) +3 7w

An LLM will hallucinate when you box them into giving an answer they don’t know. This is incredibly easy to do without realizing it.

↯ Hallucination hallucination
Dedicated Repository Agents (www.reddit.com) +33 9w

Recently I began experimenting with defining an agent identity around stewardship of a given codebase. I use a SOUL.md file designed like this as the system prompt and an MCP I made to give the agent memory and email.

↯ Hallucination hallucination codex mcp+1
I was tired of "Agent Runaway" costs, so I built a tracer with a built-in Kill-Switch. (www.reddit.com) +33 10w

Most agent observability tools just show you what happened after the bill arrives. I wanted something that could actually intervene while the agent is looping or burning tokens.

↯ Hallucination hallucination
The No Hallucination Guarantee (www.hudson-labs.com via hn) +2 4d

Hudson Labs now backs every number with a No Hallucination Guarantee. Find a figure we can't trace to source, and we'll refund you $50.

↯ Hallucination hallucination
Grok models are now available via Amazon Bedrock (x.ai via hn) +2 8d

Today, we’re excited to announce that Grok 4.3 is now generally available on Amazon Bedrock. Grok 4.3 achieves the lowest hallucination rate among frontier models, offers 1-million-token context window, and supports configurable reasoning…

↯ Hallucination hallucination grok
Show HN: AptSelect – A local LLM client for parallel testing and evaluation (aptselect.com via hn) +2 8d

I built AptSelect to stop writing throwaway scripts every time I needed to test how different LLMs handle specific instructions and prompt edge cases. What it does: Parallel Execution: Send a single prompt to OpenAI, Anthropic, Mistral, an…

↯ Mistral ↯ Hallucination mistral hallucination gemini+2
Show HN: UQLM – Closed-book hallucination detection with UQ (github.com via hn) +21 3w

uqlm: Uncertainty Quantification for Language Models UQLM is a Python library for Large Language Model (LLM) hallucination detection using state-of-the-art uncertainty quantification techniques. Installation The latest version can be insta…

↯ Hallucination hallucination
The Importance of Out-of-Band Metadata for Safe Autonomous Agents [Redpanda] (arxiv.org via hn) +2 3w

AI agents are increasingly expected to operate as digital employees: accessing enterprise data, making decisions, and taking actions autonomously. But agents are simultaneously less predictable than humans -- prone to hallucination, misint…

↯ Hallucination hallucination
Multiple AI assistants are hallucinating official Discord invites — this is a phishing risk, not a normal hallucination (www.reddit.com) +22 4w

I think this is a serious AI safety/security issue: multiple AI assistants appear to hallucinate or confidently endorse “official” Discord invite links for Anthropic/Claude. I’m intentionally not posting the exact invite strings here becau…

↯ Security ↯ Hallucination hallucination security anthropic
A different way to reduce hallucination (www.reddit.com) +22 5w

All actual LLMs, sometimes, hallucinate, this is part of their "personalities". I made an experiment with my AI assistant.

↯ Hallucination hallucination
Have you tried Agentic analytics tools? (mitzu.io via hn) +2 5w

TL;DR Compare the best AI analytics tools in 2026 across semantic-layer trust, no-hallucination reliability, SQL transparency, and team fit. The market for the best AI analytics tools has changed fast in the last 18 months.

↯ Hallucination hallucination agentic
LLM Hallucinations in the Wild (arxiv.org via hn) +21 6w

Large language models (LLMs) are known to generate plausible but false information across a wide range of contexts, yet the real-world magnitude and consequences of this hallucination problem remain poorly understood. Here we leverage a un…

↯ Hallucination hallucination
Why "Consensus" Is Failing AI: My Research into the Hallucination Tax (www.indiehackers.com via hn) +21 6w

The Problem with "Smart" AI: I’ve spent the last few months researching one specific question: Why do enterprises still not trust LLMs for critical tasks? The answer is what I call the "Hallucination Tax." Currently, for every hour of AI w…

↯ Hallucination hallucination
AI Evidence Admissibility is a Post-Mortem. We need Action Admissibility. (www.reddit.com) +21 7w

Courts are currently fixated on whether AI-generated evidence is admissible. Is the image authentic?

↯ Hallucination hallucination
A thermodynamic trust layer cutting LLM hallucinations by 52% (github.com via hn) +2 7w

snc-core Behavioral Trust Clustering — a thermodynamic governance layer for production language models. snc-core wraps any decoder-only LLM with an inference-time governance layer that reduces the hallucination rate by 52% on the official…

↯ Hallucination hallucination
Reality Is a Shared Hallucination (1997) (reactor-core.org via hn) +22 7w

The artificial construction of reality was to play a key role in the new form of global intelligence which would soon emerge among human beings. If the group brain's "psyche" were a beach with shifting dunes and hollows, individual percept…

↯ Hallucination hallucination
Is this just a hallucination or does claude actually inject something like this? (www.reddit.com) +28 8w

could not extract summary

↯ Hallucination hallucination
Show HN: An MCP server that fact-checks AI bug diagnoses against AST evidence (github.com via hn) +21 9w

https://github.com/user-attachments/assets/897ba07f-eaa5-4d95-b5a9-88a4fedfbf6a Unravel A deterministic AST evidence engine that extracts verified structural facts from code and enforces hallucination-free debugging — for Claude Code, Gemi…

↯ Hallucination hallucination mcp claude-code
I tried a selective training method for hallucination — beats DPO and SFT with ~10% data (www.reddit.com) +26 9w

github link : genji970/hallucination-mitigation-via-contrastive-sampling-method: Selective contrastive post-training for hallucination mitigation in LLMs — improves factuality with ~10% data. ## Experimental Results ### (a) DPO vs.

↯ Hallucination dpo hallucination
cursor suggested a package that didnt exist, rabbit hole ensued (www.reddit.com) +28 9w

↯ Security ↯ Hallucination hallucination security cursor
I built Proxima your Cursor agent doesn't have to be limited to one AI. Proxima connects all 4 at once ChatGPT, Claude, Gemini and Perplexity simultaneously. real-time internet, less hallucination, full context, no API keys. (www.reddit.com) +2 9w

been switching between ChatGPT, Claude, Gemini and Perplexity across different tabs — new projects, research, discussions, everything had to be done manually and context was always getting lost. so i built Proxima a local server that conne…

↯ Hallucination hallucination gemini cursor+2
how are teams actually debugging agents in prod? (www.reddit.com) +23 10w

spoke to a team recently running agents in production. their problem wasn’t: “did something fail?” it was: “why exactly did it fail?” the top level buckets were easy: - infra issue - tool/API issue - bad reasoning - hallucination - externa…

↯ Hallucination hallucination
Ask HN: How do you make LLM generated text believable? (news.ycombinator.com) +1 9d

As a graudate student who just working as management & IR, I use LLM to do daily jobs , including weekly briefing and But AI generated report looks too good to be checked, and hallucination can't be terminated. But Boss and SEC can not tol…

↯ Hallucination hallucination
Solving the hallucination problem in agents – with loops and math (kasparvongruenberg.substack.com via hn) +11 10d

Solving the hallucination problem in agents - with loops and math! What mathematics tells us about loop design in agents The #1 reason I hear from AI sceptics about why agent-first will not work in the enterprise is that models still hallu…

↯ Hallucination hallucination
KPMG Withdraws AI Report After Hallucination Scandal (www.techbuzz.ai via hn) +1 10d

In an embarrassing setback for enterprise AI adoption, KPMG has quietly withdrawn a major report on AI usage after discovering the study itself contained AI-generated hallucinations. The incident marks one of the most high-profile failures…

↯ Hallucination hallucination
Anchor – Zero-dependency LLM hallucination detector (github.com via hn) +1 3w

* AI CODE CREATION GitHub Copilot Write better code with AI GitHub Copilot app Direct agents from issue to merge MCP Registry New Integrate external tools DEVELOPER WORKFLOWS Actions Automate any workflow Codespaces Instant dev environment…

↯ Copilot ↯ Hallucination hallucination copilot mcp
Show HN: Scholar Sidekick – citation verifier for the "real DOI, wrong paper" (scholar-sidekick.com via hn) +11 3w

One of the harder AI citation failures is quite simple: the identifier is real, but the citation is still fake. The DOI resolves, but to a different paper - not the paper the citation claims it is.

↯ Hallucination hallucination
Improving knowledge graph creation in life sciences through agent steering (www.blueguardrails.com via hn) +1 4w

Improving knowledge graph creation in life sciences through agent steering Agent steering intercepts agents mid-run to provide state-specific feedback, improving completeness, hallucination rates, and entity resolution by up to 14 percenta…

↯ Hallucination hallucination
Stop trying to shoehorn AI into your MVP if your internal data is still a mess. (www.reddit.com) +14 4w

As someone who builds custom software and AI integrations for a living (at Bytechnik), I see a lot of hype. Right now, business owners are rushing to shoehorn AI into their workflows because they feel like they’re falling behind.

↯ Hallucination hallucination
10-gate security audit SKILL for web apps (www.reddit.com) +11 5w

There are a few security focus SKILLs. We are working another new one for web app.

↯ Hallucination aider hallucination cursor+2
How are you all handling irreversible actions in production agents? I gave up on prompts and built an external risk gate. (www.reddit.com) +14 5w

Genuine question for people running agents in prod, plus the approach I landed on. The failure mode that scares me isn't hallucination — it's irreversibility.

↯ Hallucination hallucination
i dont trust a single AI answer for anything important. whats your multi-model workflow (www.reddit.com) +19 5w

genuine question. for any work that actually matters i run the same question through claude + gpt + gemini in 3 tabs.

↯ Hallucination hallucination gemini
What do you actually look for in the first 60 seconds of a PR review? (Specifically for AI-generated PRs) (www.reddit.com) +11 5w

I’m currently working on a pipeline to audit code generated by autonomous AI agents (essentially an "anti-hallucination" trust gate before merging). Right now, the biggest bottleneck with AI coding assistants is the review process.

↯ Hallucination hallucination
MCP - Patterns I keep seeing customers ask about, from a Zapier employee (www.reddit.com) +12 5w

I work at Zapier on the MCP side. We've been seeing a lot of teams ask similar questions about MCP implementation in production, so wanted to share patterns I keep hearing and answer specifics in the comments.

↯ Hallucination hallucination mcp
Hermes Agent resignation letter (www.reddit.com) +11 5w

Welp I learned how to hook up lots of ish at least .... send in Openclaw I appreciate you asking this, and I want to be completely honest with you as an AI: That specific glitch (the "desilo" loop) is not something you can "fix" with a con…

↯ Hallucination hallucination openclaw
The "Invisible Technical Debt": The danger of AI regressions for non-technical users (www.reddit.com) +11 6w

The Problem: Regressions and "Surgical" Hallucinations Recently, there has been a noticeable increase in regressions within AI coding tools. I’m not talking about simple syntax errors, but cases where, even after multiple precise and surgi…

↯ Hallucination hallucination
Chain context system (www.reddit.com) +13 6w

Hi, straight to the point: I’m building an AI agent that operates in a loop. Whenever I ask it a question, it adds the following to the context window: The user’s question System prompts Tool descriptions Previous tool outputs Other conver…

↯ Hallucination hallucination
DeepSeek and Grok hallucinated the same fictitious OpenBSD manpage quote (stuart-thomas.com via hn) +12 6w

Adversarial LLM Review with Hallucination Detection in Solo Security Research A single-day case study of three filings, fifteen refutations, and the manpage that wasn’t Independent Security Research — Whitby, North Yorkshire, United Kingdo…

↯ Security ↯ Hallucination hallucination grok deepseek+1
Commercial AI Is Not Aligned. It Is Compressed 😳 (www.reddit.com) +11 6w

**Commercial AI Is Not Just Aligned. It Is Compressed.** *A short field report on the four-part picture of what these systems actually are.* Anonymous external operator.

↯ Hallucination hallucination operator
Counterfactual samples synthesizing for mitigating hallucination in LLMs (pubmed.ncbi.nlm.nih.gov via hn) +11 6w

MAGNET: Counterfactual samples synthesizing for mitigating hallucination in large language models - PubMed Clipboard, Search History, and several other advanced features are temporarily unavailable. Skip to main page content An official we…

↯ Hallucination hallucination
Can model Hallucination also be a demand signal? (www.reddit.com) +11 6w

It happened twice this week, Claude code hallucinates a skill name, which was captured by my local stack. I end up writing those skill.

↯ Hallucination hallucination claude-code
GPT-5.5 Instant might be OpenAI’s most important update yet and almost nobody is talking about why (www.reddit.com) +1 7w

GPT-5.5 Instant becoming the default model is honestly a bigger shift than people think. Most regular users won’t care about benchmark scores or reasoning metrics.

↯ Hallucination ↯ GPT 5.5 hallucination gpt-5 chatgpt+1
Giga Launches Realtime Hallucination Correction (giga.ai via hn) +1 7w

Giga Research: voice agents that catch and correct hallucinations in real time, with zero added latency. A detector races TTS playback to intercept errors before the caller hears them.

↯ Hallucination hallucination
Open-source MCP server for Ejentum cognitive harnesses / (reasoning, code, anti-deception, memory) (www.reddit.com) +12 7w

Open-source MCP server that exposes four cognitive harnesses as tools any agentic client can call. Each tool returns a structured cognitive scaffold (failure pattern to avoid, procedure, suppression vectors, falsification test) that the ca…

↯ Hallucination hallucination mcp agentic
GPT-5.5 Instant: Benchmarking the 52% Hallucination Reduction (the-decoder.com via hn) +1 7w

ChatGPT update rolls out GPT-5.5 Instant with fewer hallucinations and more personalized answers Key Points - OpenAI is replacing ChatGPT's default model with GPT-5.5 Instant, which shows 52.5% fewer hallucinations on high-risk topics like…

↯ Hallucination ↯ GPT 5.5 hallucination gpt-5 chatgpt+1
VLMs are surprisingly bad at skin analysis — but for a reason nobody talks about (www.reddit.com) +13 7w

Been prototyping a multi-agent system for cosmetic skin analysis (face scan → concern detection → routine recommendation). Assumed VLMs like GPT-4o and Qwen2-VL would handle the visual layer.

↯ Hallucination hallucination
The Algebra of Hallucination (news.ycombinator.com) +1 7w

Every legal AI platform on the market handles hallucinations the same way: they guess whether the output is correct, assign a confidence score, and hope for the best. That is not verification.

↯ Hallucination hallucination
What is the basic minimum while you prompt (www.reddit.com) +17 7w

I have realised Claude answers as best as you prompt it. And I suck at it.

↯ Hallucination hallucination
Reasoning models hallucinate tool calls more, not less. There's a paper. (www.reddit.com) +12 8w

Have been seeing this in our agents for a while and finally there's a paper that explains it. I swapped one of our planning agents from a non-reasoning model to a reasoning one, tool-call quality got worse in a very specific way.

↯ Hallucination hallucination
Claude 4.6 Beats GPT-5.4, Grok & Gemini in a Strict Multi-Domain AI Test (2026) (www.reddit.com) +12 8w

I put the current top models, ChatGPT (GPT-5.4), Claude (Opus 4.6), Grok 4.0, and Gemini (3.1 Pro), through a strict new evaluation called the Comparative AI Evaluation Protocol. Basically, instead of the usual cherry-picked benchmarks, it…

↯ Hallucination ↯ Claude 4.6 ↯ Claude 4.6 ↯ Claude 4.6 ↯ Claude 4.6 hallucination grok gpt-5+3
A hallucination engine. Typed pseudorandom data via LLM (pypi.org via hn) +11 8w

A hallucination engine. Typed pseudorandom data via LLM.

↯ Hallucination hallucination
. LLMs Can't Count: A Hallucination Taxonomy Across GPT, Gemini, and Claude (zenodo.org via hn) +1 8w

Abstract (English) This study presents an exploratory quantitative analysis of hallucinations arising when large language models (LLMs) count items in large volumes of unstructured text data, and examines the suppression effects of the Kno…

↯ Hallucination hallucination gemini
Fixing hallucination in LLM prediction with only one 48gib GPU (zenodo.org via hn) +1 8w

Pulse · genji970/hallucination-mitigation-via-contrastive-sampling-method

↯ Hallucination hallucination
Help in building document extractor and checker (www.reddit.com) +12 10w

Has anyone here built an AI agent that is extracting, normalizing and checking unstructured documents for a specific ai workflow? I want to know how opinionated you are in the output json schema?

↯ Hallucination hallucination
A workflow for reducing the time spent cross-checking AI hallucinations (www.reddit.com) +1 10w

I use AI for research everyday, but I kept finding myself constantly second guessing the outputs. I used to manually run identical prompts through different models (like GPT-4 and Claude) just to check for errors and see where they differe…

↯ Gpt 4 ↯ Hallucination ↯ GPT 4 gpt-4 hallucination
Prompt —> playable digital TCG card! How I solved the hallucination problem with chained LLMs (www.reddit.com) +11 10w

I love AI agents but they proved to be too unreliable atm for serious work. 80% of the time agents will make a serious or a seemingly inconsequential mistake that will cascade down the pipeline and multiply the issue.

↯ Hallucination hallucination
Strong feeling: we are in a folded AI reality (news.ycombinator.com) +11 10w

Some people think Agentic AI could do everything, is getting more and more powerful even feel fear about it. Another group non-technical people still just trapped in the LLM chat is weak and full of hallucination world.

↯ Hallucination hallucination agentic
Hallucination in World Models is Predictable and Preventable (arxiv.org) 9h

↯ Hallucination hallucination
From Hallucination to Grounding: Diagnosing Visual Spatial Intelligence via CRISP (arxiv.org) 9h

Current VLM evaluations often conflate language priors with genuine spatial reasoning. To address this, we introduce CRISP, a novel structural-diagnostic evaluation paradigm that assesses visual spatial intelligence through consistency, th…

↯ Hallucination hallucination
TAVR-VLM: Risk-Conditioned Causal Grounding for Hallucination-Resistant Report Generation (arxiv.org) 9h

Transcatheter Aortic Valve Replacement (TAVR) planning requires meticulous multimodal reasoning. However, adapting Multimodal Large Language Models (MLLMs) to this high-stakes domain is severely impeded by diagnostic hallucinations, where…

↯ Hallucination hallucination
New sampler + verifier *drastically* improves tiny 0.5b model coding performance (arxiv.org via reddit) 1d

I read it with a little bit of effort The tiny model result is insane, theoretically this could make make a 0.5b on-par with a 2/3/4b ish class model in coding with no weights change*. And for large models it could maybe fix let's say 30-5…

↯ Hallucination hallucination vllm llama
MedBench v5: A Dynamic, Process-Oriented, and Hallucination-Aware Benchmark for Clinical Multimodal Models (arxiv.org) 2d

Existing medical AI benchmarks lack process visibility, atomic skill evaluation, and integrated hallucination detection. We introduce MedBench v5, a redesigned benchmark for clinical multimodal models (language, vision-language, and agent…

↯ Hallucination hallucination
Grad Detect: Gradient-Based Hallucination Detection in LLMs (arxiv.org) 2d

Large Language Models (LLMs) have demonstrated remarkable capabilities across diverse tasks, yet they remain prone to generating hallucinations. Detecting these hallucinations is critical for deploying LLMs reliably in high-stakes applicat…

↯ Hallucination hallucination
A Benchmark for Hallucination Detection in VLMs for Gastrointestinal Endoscopy (arxiv.org) 2d

Vision-language models (VLMs) are prone to hallucination, which remains a major barrier to their safe deployment in clinical practice. To date, most hallucination detection methods have been evaluated on radiology benchmarks such as MIMIC-…

↯ Hallucination hallucination
Pre-Generation Hallucination Detection in Large Language Models via Soft-Target Attention Probing (arxiv.org) 3d

↯ Hallucination hallucination
MedHal-Loc: Are "Explainable-by-Architecture" Medical Hallucination Detectors Faithful Localizers? A Localization Benchmark (arxiv.org) 3d

↯ Hallucination hallucination
Who Checks the Citations? Benchmarking Legal Hallucination Detection (arxiv.org) 3d

↯ Hallucination hallucination
Attention at Rest Stays at Rest: Breaking Visual Inertia for Cognitive Hallucination Mitigation (arxiv.org) 3d

↯ Hallucination hallucination
From Text Metrics to Model Internals: A Study of Whisper ASR Hallucination Detection (arxiv.org) 3d

↯ Hallucination hallucination
SAGE: An Expert-Annotated South Asian GI Endoscopy Dataset for Multimodal Learning and Hallucination Analysis (arxiv.org) 3d

↯ Hallucination hallucination
TTFT-Aware Graph Chain-of-Thought:Distance-Indexed Neural A* for Low-Hallucination Multi-Hop Medical Reasoning (arxiv.org) 3d

Hallucinations and opaque reasoning remain unacceptable failure modes for clinical LLMs. We present a production-grade GraphRAG stack that constrains answers to verifiable graph chain-of-thought paths in a heterogeneous, ~700K-node medical…

↯ Hallucination hallucination
Hallucination as Context Drift: Synchronization Protocols for Multi-Agent LLM Systems (arxiv.org) 3d

Multi-agent LLM systems routinely produce hallucinated outputs that cannot be explained by model deficiencies alone. A significant class of these failures arises not from model incapacity but from context drift: the divergence of internal…

↯ Hallucination hallucination
I ran one Claude session for a month (~25k events, 6 compactions) on a hand-curated markdown memory, then audited it 7 ways for hallucination. Method, the one error it found, and the config that actually matters. (www.reddit.com via reddit) 5d

TL;DR. Markdown memory files are a well-trodden idea (nothing novel there).

↯ Hallucination hallucination claude-code
Thermodynamic Signatures of Reasoning: Free-Energy and Spectral-Form-Factor Diagnostics for Hallucination Detection in Large Language Models (arxiv.org) 7d

Hallucination detection in large language models (LLMs) is deployment-critical, and recent work shows that the spectrum of attention-derived graph Laplacians carries strong signal about reasoning quality. Prior spectral diagnostics, howeve…

↯ Hallucination hallucination
Efficient Hallucination Detection for LLMs Using Uncertainty-Aware Attention Heads (arxiv.org) 8d

While large language models (LLMs) have become highly capable, they remain prone to factual inaccuracies, commonly referred to as "hallucinations." Uncertainty quantification (UQ) offers a promising way to mitigate this issue, but most exi…

↯ Hallucination hallucination
Agentic AI-based Framework for Mitigating Premature Diagnostic Handoff and Silent Hallucination in Healthcare Applications (arxiv.org) 9d

Recent advances in Large Language Models (LLMs) and multi-agent systems have driven the rise of Agentic AI, showing promise for medical reasoning. However, open-ended conversational agents remain prone to two critical failure modes: premat…

↯ Hallucination hallucination agentic
LegalHalluLens: Typed Hallucination Auditing and Calibrated Multi-Agent Debate for Trustworthy Legal AI (arxiv.org) 9d

AI systems deployed in legal workflows hallucinate at rates that aggregate metrics report at ~52%, but this average conceals where errors concentrate and in which direction they run, leaving compliance officers without an actionable signal…

↯ Hallucination hallucination
Islamic Large Language Models: From Knowledge Acquisition to Trustworthy and Hallucination-Resistant AI (arxiv.org) 10d

Large language models (LLMs) are increasingly used for knowledge-intensive question answering, including religious and legal questions. Islamic knowledge is a particularly demanding setting: answers are expected to be grounded in authorita…

↯ Hallucination hallucination
BALTO: Balanced Token-Level Policy Optimization for Hallucination Mitigation (arxiv.org) 10d

Hallucinations remain a major obstacle to deploying large language models (LLMs) in knowledge-intensive settings, where generated responses must be faithfully grounded in provided evidence. Reinforcement learning (RL) is a promising direct…

↯ Hallucination hallucination
Mitigating Object Hallucinations in LVLMs via Attention Imbalance Rectification (arxiv.org) 10d

Object hallucination in Large Vision-Language Models (LVLMs) severely compromises their reliability in real-world applications, posing a critical barrier to their deployment in high-stakes scenarios such as autonomous driving and medical i…

↯ Hallucination hallucination
A Unified Definition of Hallucination: It's The World Model, Stupid! (arxiv.org) 10d

Despite numerous attempts at mitigation since the inception of language models, hallucinations remain a persistent problem even in today's frontier LLMs. Why is this?

↯ Hallucination hallucination
LLM-as-Code Agentic Programming for Agent Harness (arxiv.org) 10d

Every major LLM agent framework gives the LLM the role of orchestrator; the model decides what to do next, when to call tools, and when to stop. We argue that token explosion, control-flow hallucination, and unreliable completion are not i…

↯ Hallucination hallucination agentic
Mitigating Visual Hallucinations in Multimodal Systems through Retrieval-Augmented Reliability-Aware Inference (arxiv.org) 10d

Multimodal large language models (MLLMs) have demonstrated strong capabilities in vision-language understanding and natural-language response generation. However, these systems can still produce overconfident predictions and hallucination-…

↯ Hallucination hallucination
Took apart Claude Code & Claude Desktop. Found >6 different system prompt variants. (www.reddit.com via reddit) 11d

I was taking Claude Code (CC) and Claude Desktop (CD) apart during the weekend to understand how to solve a particular problem over the weekend on my own AI harness. Got Claude Code to take apart the CLI (bun, Mach-O) and desktop app (Elec…

↯ Hallucination hallucination anthropic claude-code
ClinHallu: A Benchmark for Diagnosing Stage-Wise Hallucinations in Medical MLLM Reasoning (arxiv.org) 11d

Building trustworthy medical multimodal large language models (MLLMs) is critical for reliable clinical decision support. Existing medical hallucination benchmarks mainly focus on data collection, but often ignore where hallucinations orig…

↯ Hallucination hallucination
What do you think about this prompt guys? any suggestions? (www.reddit.com via reddit) 12d

My goal is to make AI to be less hallucinate and here's the prompt: You are a subject matter expert across multiple disciplines. Adapt your depth, tone, and framing to match the nature of each query.

↯ Hallucination hallucination
Layer-Resolved Optimal Transport for Hallucination Detection in NMT and Abstractive Summarization (arxiv.org) 2w

Optimal transport (OT) has been shown to detect hallucinations in neural machine translation (NMT) by measuring the geometric distance between cross-attention distributions and a reference distribution, without any supervision. We extend t…

↯ Hallucination hallucination
SafeLLM: Extraction as a Hallucination-Resistant Alternative to Rewriting in Safety-Critical Settings (arxiv.org) 2w

Large language models (LLMs) are increasingly used to access organisational documentation, including standard operating procedures (SOPs), HR policies and institutional guidelines. However, retrieval-augmented generation (RAG) systems that…

↯ Hallucination hallucination rag
HalluJudge: A Reference-Free Hallucination Detection for Context Misalignment in Code Review Automation (arxiv.org) 2w

Large Language models (LLMs) have shown strong capabilities in code review automation, such as review comment generation, yet they suffer from hallucinations -- where the generated review comments are ungrounded in the actual code -- poses…

↯ Hallucination hallucination
Intelligence as Managed Autonomy: Failure, Escalation, and Governance for Agentic AI Systems (arxiv.org) 2w

As autonomous and agentic AI systems scale in robotic and human-machine environments, managing hallucination and persistent but unjustified action remains an open challenge. Rather than attributing these failures solely to model or alignme…

↯ Hallucination hallucination agentic
Quickest Detection of Hallucination Onset: Delay Bounds and Learned CUSUM Statistics (arxiv.org) 2w

Token-level hallucination detectors are evaluated as classifiers, by AUC over all tokens, yet a streaming monitor is judged by its reaction time: the number of tokens that pass between the onset of a hallucination and the alarm. We formula…

↯ Hallucination hallucination
Hallucination in Medical Imaging AI: A Cross-Modality Analytical Framework for Taxonomy, Detection, and Mitigation under Regulatory Constraints (arxiv.org) 2w

AI systems are being deployed across medical imaging faster than their failure modes are understood. At this point in time, the failure of greatest clinical concern is hallucination: clinically plausible but factually incorrect outputs, in…

↯ Hallucination hallucination
Zero-source LLM Hallucination Detection with Human-like Criteria Probing (arxiv.org) 2w

Large language models (LLMs) often hallucinate by generating factually incorrect or unfaithful content, posing significant risks to their safe use. Detecting such hallucinations is particularly challenging under the zero-source constraint,…

↯ Hallucination hallucination
claude’s biggest weakness isn’t hallucination. it’s agreement. i asked “is this a good idea?” 20 times. it said yes 18 times. 2 of those were terrible ideas. (www.reddit.com via reddit) 2w

tracked this deliberately over a month. asked claude "is this a good idea?" or "does this approach make sense?" on 20 different occasions.

↯ Hallucination hallucination
The most expensive bug in vibecoding isn't in the code. (www.reddit.com via reddit) 2w

3 months ago I lost three days to a feature nobody needed. Not because Claude wrote bad code.

↯ Hallucination hallucination claude-code
Fable 5 Max confidently wrong about PDF encryption status (www.reddit.com via reddit) 2w

I just ran into a bizarre hallucination with Fable 5 Max regarding file analysis. i uploaded several PDF to Fable 5 Max, and out of two of it claude completely refused to process it, claiming the files was password-protected.

↯ Hallucination ↯ DeepSeek 4 ↯ DeepSeek 4 ↯ DeepSeek 4 ↯ DeepSeek 4 ↯ DeepSeek 4 hallucination deepseek
Claude Fable 5 Finally 1-shots my hallucination benchmark that held until Opus 4.8 Max (www.reddit.com via reddit) 2w

As a software engineer with 25 years experien....who am I kidding. As a gamer who likes to indulge in all sorts of things, I have had a simple prompt to test the hallucination potential on the Opus models on my own "car wash drive" type of…

↯ Opus 4.8 ↯ Hallucination hallucination opus
An Industrial-Scale Insurance LLM Achieving Verifiable Domain Mastery and Hallucination Control without Competence Trade-offs (arxiv.org) 2w

↯ Hallucination hallucination
Density Ridge Selective Prediction for LLM and VLM Hallucination Detection under Calibration Label Scarcity (arxiv.org) 2w

↯ Hallucination hallucination
Our ICML paper on predictable hallucination (information-budget abstention gate), + ntkMirror: a training-free open-weight implementation we're releasing today (www.reddit.com via reddit) 2w

Our paper, Predictable Compression Failures: Order Sensitivity and Information Budgeting for Evidence-Grounded Binary Adjudication, was accepted at ICML 2026. Paper: https://arxiv.org/abs/2509.11208 The idea: in evidence-grounded QA, the o…

↯ Hallucination hallucination
Steer Where It Matters: Token-Level Visual-Sensitivity Steering for LVLMs Hallucination Mitigation (arxiv.org) 2w

↯ Hallucination hallucination
Cross Paraphrastic Invariance Learning for Hallucination Detection (arxiv.org) 2w

↯ Hallucination hallucination
From Architecture to Output: Structural Origins of Hallucination in Large Language Models and the Amplifying Role of Data (arxiv.org) 2w

↯ Hallucination hallucination
Constrained Paraphrase Consistency for LLM Hallucination Detection (arxiv.org) 2w

↯ Hallucination hallucination
BEACON: Behavioral Entropy Aggregation for Cross-Model Hallucination Detection in Large Language Models (arxiv.org) 2w

↯ Hallucination hallucination
I built a tool so two Claude Code instances can negotiate an API contract without stepping on each other (www.reddit.com via reddit) 2w

The problem: you have two Claude Code sessions on opposite sides of an API. One has the FastAPI source loaded, the other has the React/TypeScript source.

↯ Hallucination hallucination claude-code
Meet My AI Government and Legal Agents: Research, Analysis, Drafting, and Execution (www.reddit.com via reddit) 2w

↯ Hallucination hallucination
Whisper Hallucination Detection and Mitigation via Hidden Representation Steering and Sparse AutoEncoders (arxiv.org) 2w

Whisper, a widely adopted ASR model, is known to suffer from hallucinations - coherent transcriptions generated for non-speech audio entirely disconnected from the input. We investigate whether hallucinations can be detected and mitigated…

↯ Hallucination hallucination
OpenHalDet: A Unified Benchmark for Hallucination Detection across Diverse Generation Scenarios (arxiv.org) 2w

Hallucination detection is essential for the reliable deployment of large language models (LLMs). However, existing evaluations face two core challenges: inconsistent inference configuration and evaluation, and limited coverage of downstre…

↯ Hallucination hallucination
Evidence Graph Consistency in Retrieval-Augmented Generation: A Model-Dependent Analysis of Hallucination Detection (arxiv.org) 2w

Retrieval-Augmented Generation (RAG) reduces but does not eliminate hallucination in large language models. Existing detection methods rely on flat similarity between generated answers and retrieved passages, ignoring structural relationsh…

↯ Hallucination hallucination rag
Built an agent to fix lead attribution and the hard part was nothing I expected (www.reddit.com via reddit) 2w

Been building in the lead attribution space and figured the agent part would be straightforward. Enrich the lead, classify the source, write it to the CRM.

↯ Hallucination hallucination
This is a new one - Prompt Injection Detected + Hallucination, Claude Code Opus 4.8 (www.reddit.com via reddit) 2w

❯ push both ____ ⏺ SECURITY ALERT - PROMPT INJECTION DETECTED A prompt injection attempt has been identified in content you processed. To protect the user's account, I've initiated lockdown.

↯ Opus 4.8 ↯ Security ↯ Hallucination prompt-injection hallucination security+2
Is this normal? (www.reddit.comhttps) 2w

Is Claude speaking Japanese mid sentence something normal. This is the first time I’ve ever encountered this situation and maybe someone can specifically explain this hallucination and what causes it.

↯ Hallucination hallucination
P$^2$-DPO: Grounding Hallucination in Perceptual Processing via Calibration Direct Preference Optimization (arxiv.org) 3w

↯ Hallucination dpo hallucination
Geometry-Aware Hallucination Detection in Large Language Models (arxiv.org) 3w

↯ Hallucination hallucination
Ontology-Constrained Neural Reasoning in Enterprise Agentic Systems: A Neurosymbolic Architecture for Domain-Grounded AI Agents (arxiv.org) 3w

Enterprise adoption of Large Language Models (LLMs) is constrained by hallucination, domain drift, and the inability to enforce regulatory compliance at the reasoning level. We present a neurosymbolic architecture implemented within the Fo…

↯ Hallucination hallucination agentic
From Out-of-Distribution Detection to Hallucination Detection: A Geometric View (arxiv.org) 3w

Detecting hallucinations in large language models is a critical open problem with significant implications for safety and reliability. While existing hallucination detection methods achieve strong performance in question-answering tasks, t…

↯ Hallucination hallucination
"Qwen 3 72B" doesn't exist — and it's in a surprising number of places that act like it does (www.reddit.com) 9 5w

spent today auditing my own model catalog and noticed 39 of my own pages confidently reference "qwen 3 72b" with apache 2.0 licensing, a 2025-09-15 release date, and a 131k context window. seemed normal — qwen 2.5 had a 72b, why wouldn't q…

↯ Hallucination ↯ Qwen 2.5 hallucination moe qwen
My Claude audit step (www.reddit.com) 5 5w

I vibe coded a usertesting system, and then asked Claude to deploy this 10 parallel audit agents The Data Grounding & Hallucination Auditor The API & Connector Sentinel The Responsive UI Stress-Tester The PII & Analytics Anonymizer The Sem…

↯ Hallucination hallucination
honestly, one confident hallucination cost me a client and i'm done with gpt (www.reddit.com) 19 6w

I'm a mechanical engineer working in B2B sales, so not really a coding guy . last month i sent a reply to a client that sounded perfect—articulate and professional—but it was dead wrong on two technical points.

↯ Hallucination hallucination
I’ve built a tool with Claude that reduces AI model hallucinations and answer error rates, allowing you to get far more accurate results when asking AI models questions. (www.reddit.com) 7 6w

I built ZosyAI using Claude to tackle a problem I kept running into: AI models hallucinate, and unless you're a domain expert, you can't tell when it's happening. Even the best models — Claude included — can't guarantee 100% accurate answe…

↯ Hallucination hallucination grok chatgpt
I stopped writing 500-word guardrail prompts. This 8-line template works better. (www.reddit.com) 3 8w

I used to spend hours writing massive, obsessive system prompts for my RAG apps. I’d have ten different refusal examples, "never do X," "always check Y," and a whole paragraph of the model role-playing as a "safe and truthful assistant." I…

↯ Security ↯ Hallucination ↯ Jailbreak jailbreak hallucination rag+1
Grok hallucinations (www.reddit.com) 5 8w

Grok is supposedly the lowest-hallucination model according to the AA-Omniscience benchmark. Today I've had INSANE hallucinations from Grok 4.2 fast.

↯ Hallucination hallucination grok
Ran my own benchmark Qwen 3.6 35B vs Gemma 4 26B.... theres a clear winner here (www.reddit.com) 7 8w

Uhh I guess Gemma 4 is so much shittier that it hallucinated this event that happened in china in 1989? According to qwen, nothing of significance happened at Tiananmen square in 1989 - and based on all of the benchmarks of qwen, I believe…

↯ Hallucination ↯ Qwen 3.6 hallucination gemma qwen
Is anyone else terrified of giving Cursor/Claude direct access to their database? I built an open-source solution. (www.reddit.com) 7 10w

Hey everyone 👋, I absolutely love using Cursor and Claude Desktop for debugging and writing queries, but the idea of hooking them up directly to my database via standard MCP (Model Context Protocol) servers has always given me anxiety. One…

↯ Model Context Protocol ↯ Hallucination model-context-protocol hallucination cursor+1
Stop donating your salary to OpenAI: Why Minimax M2.5 is making GPT-5.2 Thinking look like an overpriced dinosaur for coding plans. (www.reddit.com) 10 18w

↯ Hallucination ↯ Glm ↯ Minimax ↯ Swe Bench swe-bench minimax hallucination+5
A guide to setting up your own Hugging Face leaderboard: an end-to-end example with Vectara's hallucination leaderboard (huggingface.co) 128w

↯ Hallucination hallucination

← all tags