model roundup

Qwen 2.5

3 items · started 2026-05-08 · closed 2026-05-11

Currently setting up a Mac mini to be an agent server and would love some feedback (www.reddit.com)

+16 6w

After doing a little bit of digging (well, perusing reddit and asking other models), I'm leaning toward the following:
- Default chat: qwen3:30b / qwen3:30b-instruct
- Default coding: qwen3-coder:30b
- Local reasoning: gpt-oss:20b
- Fast c…
Local Autonomous Security Agent-QWEN2.5-7B+MCP Agent Loop (huggingface.co via hn)

+2 6w mcp

Christopher Sheridan (automajicly) · AI & ML interests: LLMs, Open AI, Experimentation
How difficult is distilling? (www.reddit.com)

+24 6w deepseek qwen llama

I remember a year or so ago when DeepSeek R1 came out and it was pretty quickly distilled into Llama 3 8B and Qwen 2.5 (?) 7B. Why don’t we see more distilled models?