model roundup

Qwen 3

4 items · started 2026-05-09 · ongoing (last activity 2026-05-12)

  1. Hello there people. So I have noticed that people are pretty much ignoring Llama 3 plus 3.1, 3.2, and 3.3 these days.

  2. 1. The Frontier Giants • Gemini: Access 1.5B tokens/day on Gemini 1.5 Flash/Pro.

  3. After doing a little bit of digging (well, perusing reddit and asking other models), I'm leaning toward the following: - Default chat: qwen3:30b / qwen3:30b-instruct - Default coding: qwen3-coder:30b - Local reasoning: gpt-oss:20b - Fast c…

  4. WebWorld is a large-scale open-web world model series for training and evaluating web agents. It is trained on 1M+ real-world web interaction trajectories via a scalable hierarchical data pipeline, supporting: Long-horizon simulation (30+…

← all threads