model roundup

Qwen 2.5

3 items · started 2026-05-08 · closed 2026-05-11

  1. After doing a little bit of digging (well, perusing reddit and asking other models), I'm leaning toward the following: - Default chat: qwen3:30b / qwen3:30b-instruct - Default coding: qwen3-coder:30b - Local reasoning: gpt-oss:20b - Fast c…

  2. Hugging Face Models Datasets Spaces Buckets new Docs Enterprise Pricing Log In Sign Up 1 1 Christopher Sheridan automajicly Follow 0 followers · 1 following AI & ML interests LLM'S , Open AI, Experimentation Recent Activity updated a model…

  3. I remember a year or so ago when DeepSeek R1 came out and it was pretty quickly distilled into Llama 3 8b and Qwen 2.5 (?) 7b. Why don’t we see more distilled models?

← all threads