model roundup

DeepSeek 4

35 items · started 2026-04-22 · ongoing (last activity 2026-04-25)

How much VRAM do we need at most to run DeepSeek V4 Flash: 175 GB or 320 GB? (www.reddit.com)

+2 54m vllm deepseek

As far as I know, the weights are 160 GB + 9.6 GB needed for the max 1-million-token window + 5 GB of overhead = 175 GB of VRAM. But vLLM and other sources said "To use the full 1M context, you need 4x A100 80G", which is 320 GB of VRAM??
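The arithmetic in that post can be checked with a small sketch. Every figure below is the poster's own claim (160 GB weights, 9.6 GB for the 1M-token window, 5 GB overhead) or the quoted 4x A100 80G recommendation; none is independently verified, and the function and variable names are hypothetical. The GPU count is a naive capacity division that ignores activation memory and any parallelism constraints.

```python
import math

# Figures as quoted in the post (GB); treated as assumptions, not verified.
WEIGHTS_GB = 160.0
KV_CACHE_1M_GB = 9.6
OVERHEAD_GB = 5.0
A100_GB = 80
RECOMMENDED_GPUS = 4


def estimated_vram_gb(weights_gb: float, kv_cache_gb: float, overhead_gb: float) -> float:
    """Sum the three memory components the poster lists."""
    return weights_gb + kv_cache_gb + overhead_gb


estimate = estimated_vram_gb(WEIGHTS_GB, KV_CACHE_1M_GB, OVERHEAD_GB)
a100_total = RECOMMENDED_GPUS * A100_GB

# Fewest 80 GB cards whose combined capacity covers the estimate.
min_gpus = math.ceil(estimate / A100_GB)

print(f"poster's estimate:     {estimate:.1f} GB")
print(f"4x A100 80G capacity:  {a100_total} GB")
print(f"naive minimum GPUs:    {min_gpus}")
```

The poster's sum rounds to the 175 GB they state, while the quoted recommendation totals 320 GB. The sketch only puts the two numbers side by side; it does not explain the gap.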
Show HN: A CLI to use any model in your coding agent (getaivo.dev via hn)

+2 2h deepseek gemini codex+1

Hi everyone, I've been working on a CLI tool that makes it easy to run any model in Claude, Codex, Gemini, Pi, and OpenCode. It's also an API key manager that supports multiple providers or OpenAI/Claude/Gemini accounts.
Is Deepseek V4 really out? (www.reddit.com)

+312 2h deepseek qwen opus

Hello guys, each time a new local LLM is released, there are a ton of new posts: this is it, it's near Opus level...., the abliteration matrix final something at Q2 KXLDND is the best. But it's been a day since DeepSeek was released and I d…
Deepseek V4 flash (high) rivals Gemini 3 flash at 1/5th the cost (www.reddit.com)

+7920 8h deepseek gemini
DeepSeek V4 is out. 1.6 trillion parameters. MIT license. $1.74 per million tokens. The gap between US and Chinese AI strategy has never been more visible. (www.youtube.com via reddit)

10 8h deepseek
Xiaomi has released a MiMo V2.5 Pro model. It's apparently about as good as Deepseek V4 (but at different tasks) but is significantly cheaper. (x.com via reddit)

+217 9h deepseek
DeepSeek v4 - Subjective vibes (www.reddit.com)

+1315 10h deepseek qwen
Deepseek V4 Pro is 15x cost to run Artificial Analysis bench from V3.2, higher than Gemini 3.1 Pro (www.reddit.com)

+8535 12h deepseek gemini
🚨 The Chinese beast is BACK… DeepSeek just dropped V4 (www.reddit.com)

3 12h deepseek gemini
What is the next SOTA model you are excited about? (www.reddit.com)

+538 12h deepseek chatgpt
Deepseek V4 AGI confirmed (www.reddit.com)

+1157131 14h deepseek
Deepseek flash seems like a very good replacement for Haiku at the very least (www.reddit.com)

+5815 17h haiku deepseek sonnet+1
Tested Deepseek v4 flash with some large code change evals. It absolutely kills with tool use accuracy! (www.reddit.com)

+13721 18h tool-use deepseek
Ask HN: Why is cache for DeepSeek-v4 cheapest on Vercel AI Gateway? (news.ycombinator.com)

+21 19h deepseek
Top open weight models like ds v4 pro max are still like 6-7 months if not more behind closed lab models (www.reddit.com)

+2136 21h deepseek sonnet gpt-5+3
DeepSeek V4 Pro underwhelms on Arena (crowdsourced user preference benchmark, not a capability benchmark) (www.reddit.com)

+8080 21h deepseek
Takeaways & discussion about the DeepSeek V4 architecture (www.reddit.com)

+13175 22h deepseek
Budget to run Deepseek V4 locally at FP4 precision (www.reddit.com)

+1323 1d deepseek
DeepSeek-v4 has a comical 384K max output capability (www.reddit.com)

+34866 1d deepseek
Deepseek v4 people (www.reddit.com)

+1952272 1d deepseek
DeepSeek V4 - almost on the frontier, a fraction of the price (simonwillison.net)

1d deepseek
DeepSeek-V4 Drops: Open-Source Push Toward Cheaper, Long-Context AI. (www.reddit.com)

+1236 1d deepseek
No Multimodality yet in DeepSeek-V4. But I'll wait. (www.reddit.com)

+12529 1d deepseek
Buried lede: Deepseek v4 Flash is incredibly inexpensive from the official API for its weight category (www.reddit.com)

+28667 1d deepseek
DeepSeek V4 Benchmarks! (www.reddit.com)

+36155 1d deepseek
DeepSeek V4 is out. the best open-source on coding. here's the breakdown (news.ycombinator.com)

+31 1d deepseek sonnet gemini+2
DeepSeek-V4: Towards Highly Efficient Million-Token Context Intelligence (huggingface.co via hn)

+15818 1d deepseek
- DeepSeek-V4: Making 1M token context efficient (firethering.com)
- DeepSeek V4 in vLLM: Efficient Long-Context Attention (vllm-website-pdzeaspbm-inferact-inc.vercel.app)
- DeepSeek-V4: a million-token context that agents can actually use (huggingface.co)
DeepSeek-V4 Preview Version is launched (news.ycombinator.com)

+2 1d deepseek
DeepSeek V4 has released (www.reddit.com)

+918248 1d deepseek
DeepSeek-V4 Technical Report [pdf] (huggingface.co via hn)

+254 1d deepseek
Deepseek V4 Flash and Non-Flash Out on HuggingFace (www.reddit.com)

+776311 1d deepseek
Recent Open models from last 6 Months - Nov 2025 - Apr 2026 (www.reddit.com)

+11628 2d mistral glm deepseek+1
