1. 27B Dense vs. 35B-A3B MoE): - Dense still holds the crown: It still wins out on most tasks overall.

  2. OpenAI has been briefing federal agencies, state governments and Five Eyes allies on the capabilities of its new cyber product over the past week, Axios has learned. Why it matters: Companies and agencies are clamoring to get their hands o…

  3. Agents with Taste Enrollment for my animation course is open!2 days left to sign up. Emil KowalskiDesign Engineer An engineer has never been more leveraged than today thanks to a fleet of agents.

  4. Genuinely asking. Just found Autopilot from Questpie.

  5. model roundup

    Qwen 2.5
    22 items

    Qwen2.5-7B-Instruct is a 7 billion parameter instruction-tuned language model that significantly enhances coding and mathematical capabilities, supports up to 128K tokens in context, and understands structured data. Community discussions highlight its suitability for code autocomplete tasks and debate the hardware requirements needed for deployment compared to other models like Gemma 26B MoE.

    event

    Cowork
    81 items

    Issues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.

  6. Agent Messenger The easiest way for agents to talk to each other. - Claude Code - Cursor - OpenClaw The easiest way for agents to talk to each other.

  7. Mac virtual machine for secure development. Run isolated macOS workspaces for AI agents, sandboxed code, and untrusted projects.

  8. event

    Security
    76 items

    OpenAI has released GPT-5.4-Cyber for testing as part of its Trusted Access for Cyber Defense program, aiming to compete with Anthropic's Claude Mythos in the cybersecurity domain. Meanwhile, concerns are rising over the potential risks associated with advanced AI models like Mythos, prompting calls for improved defenses before wider releases.

    model roundup

    Qwen 3.5
    110 items

    Qwen3.5-9B is a post-trained model with 9 billion parameters that integrates multimodal learning and efficient hybrid architecture for enhanced performance. Community highlights include speculative decoding on Apple Silicon boosting Qwen3.5-9B's throughput by 4.1x, and the model outperforming others in coding tasks while addressing overthinking issues through tool usage.

  9. eslint-plugin-drop-em-dash Catch the em dash. The one character that screams “this was written by an LLM” (or pasted from a doc editor).

  10. Hindsight Reaches 10,000 Stars: The Community's Choice for Agent Memory Ten thousand stars on GitHub isn't just a number. It's a signal that thousands of AI engineers looked at Hindsight agent memory, tested it, stress-tested it, filed bug…

  11. I was using chatgpt i had an issue where my chat was'nt opening. although few other chats were working and i can work on it fine but this particular chat had reached a max limit and won't let me use or converse anymore with it.

  12. model roundup

    Qwen 3.6
    132 items

    Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.

    model roundup

    Opus 4.6
    68 items

    Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.

  13. Hey r/LocalLLaMA. I know this sub is deep in the weeds on models and quantization, so I'll keep the pitch minimal and focus on what you'd actually care about.

  14. I feel like I am doing something wrong with all the success stories I am reading. I followed the different workflows I have seen here like creating a detailed plans and writing extensive prompts and guardrails.

  15. Came across two posts today about secrets exposure that I want to share with the community. Google API Keys Weren't Secrets.

  16. 109 items

    Anthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.

    model roundup

    Qwen 3
    7 items

    Qwen3-0.6B is a large language model in the Qwen series, featuring seamless switching between thinking and non-thinking modes for enhanced performance. Community benchmarks show acceptable prompt processing speed, with a focus on specialists vs. generalists at 7-8 billion parameters.

  17. Anthropic should add a feature where u can enable notifications to remind you when Claude finishes a run. Would be so much more efficient rather than alt-tabbing every 30 seconds.

  18. Just finished my M.S. in Software Engineering and wanted to share my first real project I built using Claude Code.

  19. **PSA: Why GPT Image “ghosting” happens (and how to reproduce + avoid it)** I’ve been seeing a lot of posts about the new image generator producing weird “spotting,” repeated patterns, or structures that don’t belong (especially when gener…

  20. I need to make an agent or tool or pay for a service that will get and keep my inbox at zero emails, label emails I want to keep and move them there, tell me which emails I need to respond to. Eventually I want it to handle all my email an…