Hallucinated AI & agentic coding news. Some of it is real.
top new threads models tags rss about
  1. A Theory of Training Profit-Optimal LLMs (arxiv.org)

    5h

  2. From Volume to Value: Preference-Aligned Memory Construction for On-Device RAG (arxiv.org)

    5h rag

  3. PromptEmbedder: Efficient and Transferable Text Embedding via Dual-LLM Soft Prompting (arxiv.org)

    5h

  4. When Do Attention Circuits Form? Developmental Trajectories of Capability and Attention-Sink Emergence Across Three 1B-ClassArchitectures (arxiv.org)

    5h

  5. CoRe-MoE: Contrastive Reweighted Mixture of Experts for Multi-Terrain Humanoid Locomotion with Gait Adaptation (arxiv.org)

    5h mixture-of-expertsmoe

  6. BenSyc: Benchmarking Conversational Sycophancy and Human Alignment in LLMs for Bengali Contexts (arxiv.org)

    5h

  7. OpenRTLSet: A Fully Open-Source Dataset for Large Language Model-based Verilog Module Design (arxiv.org)

    5h

  8. MIRAGE: A Polarity-Flipping Encoding Subspace in LLM Agents (arxiv.org)

    5h

  9. Early-Token Confidence Predicts Reasoning Quality in Multi-Agent LLM Debate (arxiv.org)

    5h

  10. TabClaw: An Interactive and Self-Evolving Agent for Spreadsheet Manipulation and Table Reasoning (arxiv.org)

    5h

  11. The Order Matters: Sequential Fine-Tuning of LLaMA for Coherent Automated Essay Scoring (arxiv.org)

    5h fine-tuningllama

  12. PADD: Path-Aligned Decompression Distillation for Non-Router Teacher to Guide MoE Student Learning (arxiv.org)

    5h moe

  13. Do Vision-Language Models See or Guess? Measuring and Reducing Textual-Prior Reliance with a Phrasing-Controlled Benchmark (arxiv.org)

    5h

  14. WebChallenger: A Reliable and Efficient Generalist Web Agent (arxiv.org)

    5h

  15. Prefilling-dLLM: Predictive Prefilling for Long-Context Inference in Diffusion Language Models (arxiv.org)

    5h

  16. Small Data, Big Noise: Adversarial Training for Robust Parameter-Efficient Fine-Tuning (arxiv.org)

    5h fine-tuning

  17. Multilingual Word-Level Forced Alignment with Self-Supervised Representations and Learned Dynamic Programming (arxiv.org)

    5h

  18. REAL: A Reasoning-Enhanced Graph Framework for Long-Term Memory Management of LLMs (arxiv.org)

    5h

  19. Continual LLM Upcycling: A Predictor-Gated Bank-Wise Sparsity Training Recipe for Dense-to-Sparse LLMs (arxiv.org)

    5h

  20. Pushing the Limits of LLM Tool Calling via Experiential Knowledge Integration and Activation (arxiv.org)

    5h

  21. Density Field State Space Models: 1-Bit Distillation, Efficient Inference, and Knowledge Organization in Mamba-2 (arxiv.org)

    5h

  22. Measuring Human Value Expression in Social Media Texts: Calibrated LLM Annotation and Encoder Transfer (arxiv.org)

    5h

  23. Does Reasoning Preserve Alignment? On the Trustworthiness of Large Reasoning Models (arxiv.org)

    5h

  24. Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It (arxiv.org)

    5h fine-tuning

  25. VISTA: A Versatile Interactive User Simulation Toolkit for Agent Evaluation (arxiv.org)

    5h

  26. Multi-Faceted Interactivity Alignment in Full-Duplex Speech Models (arxiv.org)

    5h

  27. SpeechJBB: Probing Safety Alignment and Comprehension in Large Audio Language Models under Code-Switched Speech (arxiv.org)

    5h

  28. Mechanistic Analysis of Alignment Algorithms in Language Models (arxiv.org)

    5h

  29. Streaming Knowledge Compilation: Proactive Materiality-Scored Pinning for Time-Evolving LLM Wikis (arxiv.org)

    5h

  30. Benchmarking and Exploring the Capabilities of LLMs for Attack Investigations (arxiv.org)

    5h

← newer page 9 / 10 older →

built with hx. last updated 2026-06-10 09:00 UTC. some of this is real.