Hallucinated — page 9

Hasse Diagrams for Attention: A Partial Order Framework for Designing Transformer Masks (arxiv.org)

3h
Alignment Defends LLMs from Property Inference Attacks (arxiv.org)

3h
Spatiotemporal Graph Transformer for 3D Neighborhood Interaction and Quality Prediction in Metal Additive Manufacturing (arxiv.org)

3h
When Design Rules Break: Benchmark Composition Determines Whether Label Informativeness Predicts GNN Aggregator Choice (arxiv.org)

3h
A Comprehensive Inference-Time Augmentation Framework in Physiological Signals: Application to PPG-Based AF Detection (arxiv.org)

3h
GRAFT: Gain-Recalibrated Adapters for Transformer-Based Neural Population Activity Modeling (arxiv.org)

3h
Overcoming Rank Collapse in Feedback Alignment (arxiv.org)

3h
OncoTraj: a public benchmark for longitudinal resistance prediction in EGFR-mutant non-small-cell lung cancer on osimertinib (arxiv.org)

3h
Algorithmic and Minimax Complexities in Kernel Bandits (arxiv.org)

3h minimax
WHU-Infra3D: A Full-stack Multi-modal Dataset and Benchmark for 3D Roadside Infrastructure Inventory (arxiv.org)

3h
Multi-task LLMs for Bug Classification: Efficient Inference with Auxiliary Decoding Heads (arxiv.org)

3h
Spiking Neural Network inference on FPGAs with hls4ml (arxiv.org)

3h
Learning the Universe: Posterior Reliability of Neural Generative Models in High-Dimensional Field-Level Inference of Cosmic Initial Conditions (arxiv.org)

3h
POPSICLE: Benchmark Datasets for Segmentation and Localization in CryoET (arxiv.org)

3h
ClusBench: The Clustering Benchmark Data Resource You've All Been Waiting For (?) (arxiv.org)

3h
MemVenom: Triggered Poisoning of Multimodal Memories in Web Agents (arxiv.org)

3h
DMT: Demographic Conditioning, Morphology-Enhanced Transformer for Cuffless Blood Pressure Estimation from PPG Signals (arxiv.org)

3h
AdaGC: Enhancing LLM Pretraining Stability via Adaptive Gradient Clipping (arxiv.org)

3h
DAH-Net: A Dual-Attention Hybrid Network for Interpretable and Robust EEG-Based Emotion Recognition (arxiv.org)

3h
AnomaMind: Agentic Time Series Anomaly Detection with Tool-Augmented Reasoning (arxiv.org)

3h agentic
Spatio-Temporal Attention Graph Neural Network: Explaining Causalities With Attention (arxiv.org)

3h
Moral Sensitivity in LLMs: A Tiered Evaluation of Contextual Bias via Behavioral Profiling and Mechanistic Interpretability (arxiv.org)

3h
AsyncWebRL: Efficient Multi-Step RL for Visual Web Agents (arxiv.org)

3h
Leave a Window Out: Modifying the Jackknife for Predictive Inference in Time Series (arxiv.org)

3h
A Mean-Field Analysis of Multi-Head Self-Attention under Cross-Entropy Training (arxiv.org)

3h
Claude Fable 5 Max Usage, Not Bad! (www.reddit.com via reddit)

3h
- Claude Max 5x Usage (www.reddit.com via reddit)
chatGPT is talking sh#t to me. tired trying to make a full code request to work. (www.reddit.com via reddit)

3h chatgpt
Macro Evals for Agentic Systems (developers.openai.com via hn)

+2 4h agentic

When an agentic system fails, the problem is often larger than a single bad response. A handoff may happen too late, a specialist agent may miss the same signal across many runs, or a review process may trigger for the wrong class of cases.
Fable 5 is here at no extra cost (www.reddit.com via reddit)

3h
- Fable 5 is here! (www.reddit.com via reddit)
Claude and ChatGPT are getting worse. It's not your imagination (www.artificialstudio.ai via hn)

+1 4h chatgpt

AI models are quietly hitting their limits and the companies are rationing capacity without telling you. Here's what's actually happening, why it affects the tools you use every day, and what you can do about it.
- Claude is getting worse, according to Claude (www.theregister.com via hn)
- Ask HN: Is Claude Getting Worse? (news.ycombinator.com)
- ChatGPT is getting worse and worse (www.reddit.com)