Enhancing Multilingual LLM-based ASR with Mixture of Experts and Dynamic Downsampling (arxiv.org) 5h mixture-of-experts
SpenseGPT: Practical One-shot Pruning Enabling Sparse and Dense GEMMs for LLM Inference (arxiv.org) 5h
How Does Reasoning Flow? Tracing Attention-Induced Information Flow for Targeted RL in LLMs (arxiv.org) 5h
Training LLMs to Enforce Multi-Level Instruction Hierarchies via Gravity-Weighted Direct Preference Optimization (arxiv.org) 5h
Mitigating hallucinations in healthcare LLMs with granular fact-checking and domain-specific adaptation (arxiv.org) 5h
GhazalBench: Evaluating LLM Understanding and Canonical Surface-Form Access in Persian Ghazals (arxiv.org) 5h
An Industrial-Scale Insurance LLM Achieving Verifiable Domain Mastery and Hallucination Control without Competence Trade-offs (arxiv.org) 5h hallucination
Beyond Memorization: Distinguishing Between Pattern-Based and Epistemic Reasoning in LLMs Using Epistemic Puzzles (arxiv.org) 5h
Skill-RAG: Failure-State-Aware Retrieval Augmentation via Hidden-State Probing and Skill Routing (arxiv.org) 5h rag
HarDBench: A Benchmark for Draft-Based Co-Authoring Jailbreak Attacks for Safe Human-LLM Collaborative Writing (arxiv.org) 5h jailbreak security
Disjoint or Overlapping? Inference Windowing for Reconstruction-Based Time Series Anomaly Detection (arxiv.org) 5h
Calibrating Overconfidence Without Sacrificing Confidence: Probe-Conditioned Head Intervention for LLMs (arxiv.org) 5h
Hasse Diagrams for Attention: A Partial Order Framework for Designing Transformer Masks (arxiv.org) 5h
Spatiotemporal Graph Transformer for 3D Neighborhood Interaction and Quality Prediction in Metal Additive Manufacturing (arxiv.org) 5h
When Design Rules Break: Benchmark Composition Determines Whether Label Informativeness Predicts GNN Aggregator Choice (arxiv.org) 5h
A Comprehensive Inference-Time Augmentation Framework in Physiological Signals: Application to PPG-Based AF Detection (arxiv.org) 5h
GRAFT: Gain-Recalibrated Adapters for Transformer-Based Neural Population Activity Modeling (arxiv.org) 5h