model roundup

Qwen 2.5

6 items · started 2026-05-07 · ongoing (last activity 2026-05-09)

  1. After doing a little bit of digging (well, perusing reddit and asking other models), I'm leaning toward the following: - Default chat: qwen3:30b / qwen3:30b-instruct - Default coding: qwen3-coder:30b - Local reasoning: gpt-oss:20b - Fast c…

  2. Hugging Face Models Datasets Spaces Buckets new Docs Enterprise Pricing Log In Sign Up 1 1 Christopher Sheridan automajicly Follow 0 followers · 1 following AI & ML interests LLM'S , Open AI, Experimentation Recent Activity updated a model…

  3. I remember a year or so ago when DeepSeek R1 came out and it was pretty quickly distilled into Llama 3 8b and Qwen 2.5 (?) 7b. Why don’t we see more distilled models?

  4. I’m still daily driving a 1080 Ti. Not because I’m a masochist, I just haven't been able to justify a 4090/5090 upgrade yet.

  5. New to Cursor. Android Studio Gemini Agent has become unusable,so im looking for new options.

← all threads