1. How we used StarRocks, Gemini, and tool-based retrieval to power grounded Q&A in a developer community Slack. 9 min read 7 hours ago Author:Billy Chang, Software Engineer at Phoenix AI Press enter or click to view image in full size StarRo…

  2. So Claude just dropped Fable 5 and I got curious, went to check the API pricing… and wow 😭 $50/M feels crazy expensive depending on what you’re building. Maybe I’m just broke founder mode right now, but seeing that number actually made me…

  3. I mean I know that most AI employee discussions seem focused on making a single agent smarter, but I'm curious about the opposite approach. But has anyone deployed a multi-agent AI employee where different agents handle planning, execution…

  4. Let me say this upfront: I belong to the camp that feels the outrage/disappointment/frustration around fable not being part of the subscription and/or the safeguards as unfounded in reality. Fable is not meant for you and I - developers si…

  5. Will it be the game changer or is it just hype, what do you think about it?

  6. Context window usage analyzer for Claude Code. Tracks how context is consumed across tools, compaction, skills, and user interactions — then visualizes it so you can optimize your sessions.

  7. event

    Cowork
    343 items

    Issues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.

    307 items

    Anthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.

  8. Reason 458 why local LLMs are going to be a necessity

  9. could not extract summary

  10. Hey guys 👋 We all struggle with the 5-hour reset, especially as token usage gets higher with the newer models. So I started using a simple workflow to maximize my productivity.

  11. GGUF for the new Cohere 30B A3B model I haven't had a chance to test this yet, but I think it's related to https://github.com/ggml-org/llama.cpp/pull/24260

  12. model roundup

    DeepSeek 4
    27 items

    DeepSeek-V4-Pro is a 1.6T parameter model with 49B activated, supporting one million-token context and achieving significant efficiency gains over previous versions. Notably, DeepSeek-V4-Flash (284B parameters) has been successfully run on a Raspberry Pi 5, demonstrating impressive performance despite the low hardware specifications.

    model roundup

    Opus 4.8
    71 items

    Claude AI has released Opus 4.8, an upgrade to their Opus class of models available in version 2.1.154 of their software on March 16, 2023, which includes enhanced coding and professional task capabilities along with improved judgment and honesty. Users are reporting usage resets following the update.

  13. could not extract summary

  14. Foundation-model agents are increasingly long-lived systems that remember users across interactions, making memorization an explicit deployment-time function rather than solely a property of model weights. Existing work addresses parametri…

  15. Multimodal Large Language Models (MLLMs) can listen and see, but how do audio and visual signals actually travel through the network to shape an answer? Despite their growing role in research and real-world applications, the internal pathw…

  16. event

    Fine Tuning
    130 items

    Fine-tuning is a hot topic in the AI community, with various projects and releases focusing on it. Notable examples include OpenAI's decision to wind down its fine-tuning API, Anthropic co-founder Jack Clark's prediction that AI research could become automated by 2028, and several new datasets and models released for fine-tuning purposes.

  17. Large language models deployed as autonomous agents for enterprise workflows face a key challenge: verbose tool responses from enterprise systems can cause context overflow, stale-state errors, and high inference cost. We study this proble…

  18. Open-pit mine scheduling is a critical process for maximizing economic return under complex geotechnical and operational constraints. While Mixed-Integer Linear Programming (MILP) provides mathematically optimal baselines, its exponential…

  19. When large language models generate from retrieved or augmented contexts, conflicts between external context and parametric priors remain a central reliability bottleneck. Existing contrastive decoding methods follow a \emph{context-aware}…

  20. Language-agent "memory palace" systems anchor each memory to a world coordinate, on the intuition that geometry adds something text cannot. We make that intuition testable and report three results.

  21. Although the study of human trajectory anomalies is critical for advancing spatial data mining, empirical research remains severely hindered by a pervasive lack of ground-truth datasets. Despite the availability of several real-world and s…

  22. Reinforcement learning has become a key paradigm for eliciting reasoning abilities in large language models, where exploration is crucial for discovering effective solution trajectories. Existing exploration methods typically encourage div…

  23. AI agents in supply chains face a fundamental epistemic gap: large language models (LLMs) interpret policies but lack physical grounding, while reinforcement learning (RL) optimizes flows but is semantically blind to unstructured constrain…