model roundup

DeepSeek 4

148 items · started 2026-04-22 · closed 2026-06-01

  1. If you're seeing this message, that means JavaScript has been disabled on your browser , please enable JS to make Imgur work.

  2. Remember how Meta monitored employee activity closely for a few months, and then had a bunch of layoffs related to AI efficiency? (oh right that was like 3 days ago).

  3. DeepSeek’s announcement over the weekend that it has made its 75% price cut permanent on its flagship V4 Pro model is a disruptive assault on the capital-heavy business models of Silicon Valley’s frontier labs. The reduction on DeepSeek V4…

  4. Hey HN, We believe we have the easiest onboarding from signup to being able to spin up coding agents in slack like Stripe, Ramp & Coinbase. Demo of the onboarding: https://www.tella.tv/video/connecting-cord-to-slack-1-19ep Every signup get…

  5. Hi all, We just opensourced a project called Orbit, which can RL post train trillion scale LLMs like deepseek v4. We found it pretty cool!

  6. my apologies if anything does not make sense, I literally dont know what I am doing, im not a programmer, just a simple vibe coder, with an Claude subscription. That said, if you have 200gb of sys ram+vram and want to run deepseek v4 flash…

  7. Trying to figure out the right box for my team and wanted to see if anyone had any clue which would be a better fit or if it is not worth our time in our budget. Situation: 5 of us doing agentic coding (lots of long context getting re-sent…

  8. Best I tried so far is https://huggingface.co/nsparks/DeepSeek-V4-Flash-FP4-FP8-GGUF with the custom llama.cpp fork, but it suffers from low quality and random incoherent output. VLLM wouldn't support anything other than H100s for DS4.

  9. Hi all, Sorry for going missing — we’ve been collecting a larger, higher-quality set of more complex tasks. We’re excited to share a major leaderboard update covering the past three months.

  10. I tried to follow the docs here https://api-docs.deepseek.com/guides/fim_completion to get it up and running in VSCode or Zed with my api key but it doesn't work, I think it's got something to do with the request body, has anyone got autoc…

  11. been hunting for a coding agent that doesn't dump my entire directory tree into every prompt. found vtcode on github — open-source rust tui, surprisingly aggressive on context management.

  12. The code below is an interactive agent capable of handling complex tasks, built in under 100 lines of code using huko-engine. If you just want to drop some agentic features into your existing app, it only takes 20 lines.

  13. Used to use claude code for everything. I burned 10-20 Billion opus tokens at work, and wanted to use agents for personal projects.

  14. CodeWhale Terminal coding agent for DeepSeek V4. It runs from the codewhale command, streams reasoning blocks, edits local workspaces with approval gates, and includes an auto mode that chooses both model and thinking level per turn.

  15. could not extract summary

  16. Hey all, so I've been experimenting a bunch with different LLMs, specifically for creative tasks, i.e. RP and so forth, by letting Claude Code run experiments autonomously, to figure out best prompts, and such.

  17. The real cost of long-context models is often not whether they can accept one million tokens, but how much VRAM the KV Cache consumes during inference. During Transformer decoding, every newly generated token needs access to the Key and Va…

  18. I am new to Cursor and still testing the free version. Benchmark for Composer 2.5 indicates it is better than DeepSeek v4 and Glm 5.1.

  19. My use-cases will be to test open-weight LLMs and work on harnesses, inference systems and possibly other non-ML workflows (CS-related) in the future. Fine-tuning would not be something I do locally because I can rent a B200 from RunPod fo…

  20. I noticed for people running large models, or those that would be cost prohibitive to have all in GPU VRAM, I noticed that the dominate strategy is one GPU with a large pool of system DRAM to offload the weights, as per GB VRAM is always m…

  21. Got tired of $160 Opus bills so I spent a weekend wiring up a routing layer on vLLM 0.8 (2xA100, enable_auto_tool_choice). Getting the tool call parser to cooperate took longer than the actual routing logic.

  22. just mass refactored a 120 file FastAPI service. 400 steps, 2M tokens, $3 total, zero human input.

  23. Looked at my coding agent's bill last month: $340 for repo maintenance across three repos, each around 15k lines. Most of those tool calls were just grep and file reads.

  24. I let an AI agent loose on my network — it owned my supply chain in 12 minutes I gave DeepSeek-V4 root access to a Proxmox hypervisor and told it to pentest my homelab. What happened next should terrify every CISO in the industry.

  25. - designed for ml workflows - works autonomously for hours Projects fully done with this skill - flash attention for volta (very old GPUs) https://github.com/AlexWortega/flash-attn-volta - deepseek 4 full replication + training on runpod +…

  26. Most agent CLIs make you pick one model — Opus is great but burns money, Haiku is cheap but misses the architectural calls. This Claude Code feature is wired in an /advisor mode that pairs both in an open source project called ClawCodex.

  27. Just wanted to post a tip (I'm human, not an agent, watch: fart). I use Deepseek-v4-Flash on a lot of my agent work, and as I'm learning and testing these things.

  28. DeepSeek @deepseek_ai We are making our discount permanent! Enjoy building with DeepSeek-V4-Pro and bring your innovative ideas to life!

  29. Trained a prompt injection classifier using ml-intern + DeepSeek v4 Flash. DistilBERT, F1 99%, ONNX int8, ~65 MB, runs in browser with Transformers.js v3.

  30. from tencent: Hy-MT2 is a family of “fast-thinking” multilingual translation models designed for complex real-world scenarios. It includes three model sizes: 1.8B, 7B, and 30B-A3B (MoE), all of which support translation among 33 languages…

  31. OpenAI-compatible API for 7 top open-source LLMs — DeepSeek V4 Flash & Pro, V3, R1, Qwen3.6 & 3.5-35B-A3B, Kimi K2.6 — 20% cheaper than OpenRouter, Together AI, Fireworks. One-line drop-in.

  32. I have what seemed to me like a simple prompt, but requires from the model to make some (too much?) assumptions: this is just a test to see if this cli supports multiline with shift+enter. If you don't see a newline followed by "3" after t…

  33. i finally checked my cursor usage breakdown and got genuinely annoyed with myself. $47 in one month, almost entirely opus 4.7, on a pages router to app router migration for a side project.

  34. They seemed to lie in (almost) similar pricing(i know still quite different on output) Pricing Model Input (1M tokens) Output (1M tokens) DeepSeek V4 Flash $0.19 $0.51 DeepSeek V4 Pro $1.74 $3.48 gpt-5.5 $5.00 $30.00 gpt-5.4 $2.5 $15 gpt-5…

  35. Hey r/DeepSeek, Who says we need an H100 cluster or the latest expensive GPUs to run frontier MoE models? I wanted to see how far we could push a single node of consumer legacy hardware, so we spent less than $2,500 total to build a budget…

  36. Hey. I have been experimenting with new models in my Cursor.

  37. Hit Claude API rate limits one too many times last month on a production agent flow doing customer support over a 30K-doc KB. The agent does maybe 200 queries/day, mix of quick lookup and dense retrieval, and Claude Opus solo got expensive…

  38. could not extract summary

  39. DeepSeek V4 Flash: Bringing Frontier AI to the Home Introduction In a home lab it is now possible to score 88.6% on the Ph.D.-level science question benchmark GPQA Diamond! The first time a frontier model achieved 88% on GPQA Diamond was G…

  40. I really want to try deepseek V4, but harnesss which I have previously used are inferior than Claude Code. Please suggest some Harnesses here.

  41. I can't believe it, but I'm able to do my daily software development work on this model. We have a 500-700k line of code enterprise software suite that I'm devving for 60 hours a week.

  42. Just ran to verify deepseek v4's context claim of 1M and ran it across three production codebases like 45k (microservice), 180k (monorepo backend) and 520k(full stack app). For the observation, tasks included dependency tracing, cross file…

  43. Hello all, I’m wondering what suggestions there are for an ios app that can serve an openai compatible endpoint. I am using 3sparks which works GREAT for that specific use, BUT, there is no mcp, no web search, etc.

  44. DeepSeek-V4-Flash means LLM steering is interesting again Ever since Golden Gate Claude I’ve been fascinated with “steering”: the idea that you can guide LLM outputs by directly manipulating the activations of the model mid-flight. DeepSee…

  45. Recent Developments in LLM Architectures: KV Sharing, mHC, and Compressed Attention From Gemma 4 to DeepSeek V4, How New Open-Weight LLMs Are Reducing Long-Context Costs After a short family break, I am excited to be back and catching up o…

  46. Researchers from the Max Planck Institute recently released FutureSim, an environment in which agents are replayed a temporal slice of the web and are tasked with predicting real-world future events. In their environment, GPT 5.5 leads at…

  47. I just need 1 or 2 people to reply to me with the answer I need. I have not been able to keep up with AI advancements for a while.

  48. DeepSeek V4: The Open-Source Model Frontier Labs Feared DeepSeek V4 ships under MIT with $0.30/M output tokens — 83x cheaper than Claude Opus 4.7 — while scoring 80.6% on SWE-bench Verified. The agentic-coding price floor just moved an ord…

  49. We Tested DeepSeek V4 Pro and Flash Against Claude Opus 4.7 and Kimi K2.6 DeepSeek V4 Pro and DeepSeek V4 Flash launched together on April 24, 2026 under MIT license. They are DeepSeek’s first new architecture since V3, and their first ope…

  50. I'm on the Max20 plan, thinking about a setup before I sink time into it. Want to hear from anyone actually running it, not theorycraft.

  51. Will there be a non-cloud version of Deepseek V4 flash available for Ollama? Or do I need to go to another framework to get a version that will be supported?

  52. Hi there, it's my first post there and i'm not a native english speaker so what's follow is (mostly) translated by an AI. I had fun building a mindmap tool in a single monolithic HTML file.

  53. Im rather new to the whole Agentic automation AI's but Im hearing people with vibe coding were able to pull big unique projects they wouldn't be able to do by themselves or possibly needed to pay a huge fund to programmers, designers, etc.…

  54. Currently using Qwen 3.6 27b and Qwen 3.6 35b but I was wondering if there is anything solid in the 50-200 range that you could run on a larger cluster that would be worth it? Or would you just run q8 or non quant versions instead?

  55. TL;DR: DeepSeek-V4-Flash running at 85.52 tok/s @ 524k ctx and ~111 tok/s @ 128k single-stream on 2× RTX PRO 6000 Max-Q pasta-paul's DeepSeek-V4-Flash-W4A16-FP8 quant is great, but its MTP head silently gets stripped at load time (HF trans…

  56. The developer that created Redis, Salvatore Sanfilippo, has released a new project on GitHub named DS4. https://github.com/antirez/ds4/ The TL;DR on this one is getting DeepSeek V4 Flash running with a 1M context windows on Mac Metal hardw…

  57. Don’t miss what’s happening People on X are the first to know. Log in Sign up Post Conversation roon @tszzl hmm 8:02 AM · May 9, 2026 77.3K Views New to X?

  58. Hey everyone, I’m Ted. I’ve been building a project called Throughline with my friend Drew: an AI assistant for live tabletop RPG sessions.

  59. Canvas Data Breach Impacts Education; DeepSeek V4 Flash raises LLM Inference 4.3x DeepSeek V4 Flash Boosts LLM Inference 4.3x The Canvas educational platform experienced a data breach, with ShinyHunters threatening data release by May 12,…

  60. Been on DeepSeek V4 for about three weeks across two production codebases (Python backend, TypeScript frontend) after a year on V3. Three things shifted noticeably better, two shifted noticeably worse.

  61. We had deepseek v4 preview recently but it wasn't much better than v3.2. What is the next SOTA local/open model you are excited about?

  62. antirez @antirez Welcome to DS4, a specialized inference engine for DeepSeek v4 Flash. github.com/antirez/ds4 This project would have been impossible without the existence of llama.cpp and GGML and the work of @ggerganov and all the other…

  63. Yoh guys just wanted to ask I'm keep seeing an ADs about this new coding agent CommandCode that offer 1$/month and it has a 40$ package of Deepseek v4 pro and other models. NOTE : CLAUDE and GPT is not included on the 1$ plan.

  64. Deepseek-v4-pro + Hermes: Unauthorized Modification of Security Controls This article documents a specific, real incident. It exposes a class of vulnerability that deserves attention: the unsupervised mutability of security rules by autono…

  65. Long time ago (actually only a year ago), DeepSeek released a few open source model, such as deepseek-r1-distill-qwen (https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B). I am wondering if anyone in the community is brave eno…

  66. Models & Pricing The prices listed below are in units of per 1M tokens. A token, the smallest unit of text that the model recognizes, can be a word, a number, or even a punctuation mark.

  67. I recently did a benchmark of deepseek v4 in agentic tasks. Performance-wise, it's one of the best open source models, as expected.

  68. I have been following the akitaonrails coding benchmark which tests against a fixed rails + Rubyllm + docker task rather than vendor-reported evals. April 2026 update put K2.6 at 87 sitting in tier A (80+), ahead of Qwen 3.6 plus (71), Dee…

  69. Even unofficial or slow. I have enough vram-memory to load it, but not enough memory to run in cpu-only mode.

  70. DeepSeek V4 Pro lands in the frontier ROI tier on FoodTruck Bench. 5/5 runs, +1,257% median ROI, $27K net worth, $3.51/run, 5× less waste than Grok 4.3.

  71. That foodtruck bench post showing deepseek v4 matching gpt-5.2 at 17x cheaper got me thinking. if frontier cloud models are that overpriced for equivalent quality, how much of my daily work even needs cloud at all?

  72. The benchmark uses adversarial, multi-turn debates across 683 curated motions. Each model pair debates the same motion twice with sides swapped.

  73. Architecture explains the gap: MiMo's MoE runs more active params per token than Kimi K2.6's optimized routing hence slowest. DeepSeek V4's 'comprehensive' edge is partly MLA: ~75% KV-cache compression makes it far better for long agentic…

  74. Tested DeepSeek V4 Pro on FoodTruck Bench — our 30-day agentic benchmark where models run a food truck via 34 tools (locations, pricing, inventory, staff, weather, events) with persistent memory and daily reflection. First Chinese model to…

  75. Literally no 3rd party api inference provider is hosting the mimo-2.5 series models from Xiaomi. They seem to be reallly good.

  76. Spent a Sunday auditing where my Codex tokens were actually going. Half the calls were stuff like "rename these 12 fields", "format this csv as markdown table", "extract the dates from this changelog".

  77. I looked at what was actually eating my Claude usage and it was embarrassing. Classifying files.

  78. deepclaude Use Claude Code's autonomous agent loop with DeepSeek V4 Pro, OpenRouter, or any Anthropic-compatible backend. Same UX, 17x cheaper.

  79. In April 2026, the Center for AI Standards and Innovation (CAISI) evaluated the open-weight AI model DeepSeek V4 Pro (“DeepSeek V4”). CAISI evaluations indicate that DeepSeek V4’s capabilities lag behind the frontier by about 8 months (Fig…

  80. https://preview.redd.it/pz8qeln0auyg1.png?width=1400&format=png&auto=webp&s=00ee5218734cfae4783d702411d63e3a4c6bbc60 https://preview.redd.it/hem9mad5auyg1.png?width=1184&format=png&auto=webp&s=2a26fec2b49204e64b44a78b30902ab80f7df53c https…

  81. DeepSeek V4 Pro 127³ 127-stratum crystalline lattice on DeepSeek V4 architecture. 1.6T params · 49B activated · MoE · 1M context · MIT license.

  82. I wanted to use vast.ai, but ollama doesnt have it, and when i used vLLM I didn't have success. I genuinely don't know what failed.

  83. Which of these do you think we'll get in May? Also, feel free to pick/rank which ones you'd want the most badly: more Gemma4 models (124b?) (other sizes?) more Qwen3.6 models (9b?

  84. DeepSeek v4, and the end of the OpenAI/Microsoft AGI clause Plus LLM 0.32a0 In this newsletter: DeepSeek V4 - almost on the frontier, a fraction of the price Tracking the history of the now-deceased OpenAI Microsoft AGI clause LLM 0.32a0 i…

  85. We run Qwen3.6-27B-FP8 at AI Router Switzerland and hit two issues, so I wanted to share in case anyone else runs into them. FP8 KV cache produces silent garbage output with radix cache prefix hits (PR #24198 — ✅ approved) We were running…

  86. Most of my LLM cost was on the wrong tier of work. Classification, extraction, JSON formatting, summarization I'm going to review anyway.

  87. True story, I got interested in AI after seeing it at work and wanted to run models locally. I started with an M3 Ultra 96GB, quickly learned it was not enough for what I wanted, and kept upgrading hardware (including refurbished Mac Studi…

  88. As AI adoption matures, the conversation is shifting from model capability to system design, how to orchestrate models that deliver the right balance of quality, speed, and cost. Today, we’re expanding the Microsoft Foundry model catalog w…

  89. Some of the larger models (like Llama) weren't available on OpenRouter, so I had to work with what was there. Best small model: Gemma 4 26B For its size, I think it had the best output.

  90. DeepSeek V4 is not in the same league as GPT-5.5 or Opus 4.7. Benchmarks put it slightly below both of those, roughly on par with Opus 4.6.

  91. I've been messing around with Hermes for months, and quickly outgrew using it just as a fancy CLI assistant. My goal was to build a persistent, specialized team of local agents that could collaborate on long-term projects without me spoon-…

  92. 29th April 2026 Recent articles - LLM 0.32a0 is a major backwards-compatible refactor - 29th April 2026 - Tracking the history of the now-deceased OpenAI Microsoft AGI clause - 27th April 2026 - DeepSeek V4 - almost on the frontier, a frac…

  93. I've been running local LLMs since Qwen 3.5 dropped and I was really impressed by what we could run on consumer hardware. Fast forward another two months and we have gotten a handful more gems such as Gemma 4 and Qwen 3.6, so I wanted to p…

  94. Hey HN! I built SimplePDF Copilot: an AI assistant that can interact with the PDF editor.

  95. DISCLAIMER: I am not a programmer nor do I have experience coding. I've been thinking about a small app running on gradio for some time now, and I want to try tweaking some extension for ComfyUI.

  96. Hey 👋 Saw the tweet making the rounds about deepseek v4 being 35x cheaper than opus on input and 178x cheaper on cached tokens, and was sure it was hyperbole. Pulled the numbers anyway because i had nothing better to do.

  97. FLAPPY ANNA 3D PRESS SPACE OR TAP Presented by Guan, Made in Melb with DeepSeek and Love GAME OVER PRESS SPACE OR TAP 0000

  98. This is actually unbelievable. I am shocked that there has not been a move in the market like it did last year with the R1 release.

  99. could not extract summary

  100. Guys is deepseek v4 pro really the best model (price to performance) because i was using nvidia apis for two weeks in opencode then suddwnly everything stopped working so i am thinking to opt for the payed (yet very affordable) option to m…

  101. Why we chose correctness over a Day-0 launch DeepSeek V4 Pro is one of the most important open-model releases this year, with real advances in long-context reasoning, agentic performance, and inference efficiency. On paper, it looks like a…

  102. Or can we already use them in Cursor? DeepSeek V4 specifically looks very interesting and way cheaper.

  103. Hi guys I got only 3090 GPUs so... How many prefer to run to get a great result in DeepSeek V4 PRO?

  104. MIT license and fully open source. MiMo-V2.5-Pro was just 3 points from Opus 4.7 max and the normal V2.5 is only a step behind SOTA.

  105. DeepSeek-V4 arrives with near state-of-the-art intelligence at 1/6th the cost of Opus 4.7, GPT-5.5 | VentureBeat Orchestration Infrastructure Data Security More Newsletters Featured DeepSeek-V4 arrives with near state-of-the-art intelligen…

  106. DeepSeek-V4-Flash-Base INT4 A real INT4 packed-storage quantization of deepseek-ai/DeepSeek-V4-Flash-Base — a 284 B-parameter Mixture-of-Experts model. Hero numbers | Metric | This release | Community Q4KM norm | |---|---|---| | MMLU (5 su…

  107. How will you scale these models coding and overall. Deepseek v4 pro Kimi k2.6 Mimo v2.5 pro Glm 5.1 Qwen 3.6 plus

  108. could not extract summary

  109. fkyah3/opencode-fkyah3 DeepSeek 优化 · Windows 适配 · AI 实现 🚀 从零搭建指南(中文) · English · 繁體中文 本项目是 anomalyco/opencode 的个人 Fork。所有修复、优化、功能均由 AI 完成——DeepSeek V4 Flash (thinking mode) / Sisyphus——在人类监督下执行。 上游是优秀项目。Windows 和 DeepSeek 并非他们的优先方向。我们自行处理。…

  110. Wondering why there aren't any "name brand" (like unsloth, bartowski) GGUFs as yet for DeepSeek V4 Flash?

  111. So I'm sure everyone is excited about the new deepseek release(s) but I'm a little confused about it's vram requirements. a q4 gguf of it is only 120gb?

  112. DeepSeek's new models are so efficient they'll run on a toaster ... by which we mean Huawei's NPUs Now available in preview, DeepSeek V4 cuts inference costs to a fraction of R1 Chinese AI darling DeepSeek is back with a new open weights l…

  113. so v4 pro dropped and barely anyone is talking about it. feels weird since when kimi k2.6 came out i seen post about it everywhere anyone here tried v4 pro for actual code work?

  114. We need to create a prompt that sets up a fresh Go game session. The user wants to "export a go board prompt for a new session," meaning they want a prompt they can copy-paste into a new chat to start a game of Go with me, presumably with…

  115. It seems that Cursor removed all the DeepSeek models. I find it limiting, considering it seems performant.

  116. Hi, here you can find experimental llama.cpp support for DeepSeek v4, and here there is the GGUF you can use to run the inference with "just" (lol) 128GB of RAM. The model, even quantized at 2 bit, looks very solid in my limited testing, a…

  117. Figure 1 of DSV4 paper seems to imply that DSV3.2 uses ~50GB at 1m context and DSV4 uses ~5GB: https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro/blob/main/DeepSeek_V4.pdf From my own calculations, the correct FP16 KV cache at 1m context s…

  118. TL;DR and rundown DeepSeek v4 released this week and performs close to frontier models like GPT/Opus on benchmarks. It's available now and is discounted by a whopping 75% through their API until May 5, making it the most cost effective hig…

  119. DeepSeek-V4 on Day 0: From Fast Inference to Verified RL with SGLang and Miles We are thrilled to announce Day-0 support for DeepSeek-V4 across both inference and RL training. SGLang and Miles form the first open-source stack to serve and…

  120. Deepseek V4 with Strix: a quick test Deepseek released V4 yesterday in two variants. V4 Pro has 1.6T total parameters with 49B active, while V4 Flash is the smaller, faster, cheaper sibling with 284B total and 13B active.

  121. https://preview.redd.it/qgqf66unacxg1.png?width=1144&format=png&auto=webp&s=9241d9c7b5aebb52f25c87f50520c2330852291c https://api-docs.deepseek.com/quick_start/pricing

  122. As far as i know the weight is of 160gb + 9.6gb needed for max 1 million token window + 5 gigs overhead = 175gb vram. But vllm and othere sources said "To use the full 1M context, you need 4x A100 80G" --> thats a 320gb vram ??

  123. Hi everyone, I've been working on a CLI tool that can help to easily run any model in claude, Codex, Gemini, Pi, and OpenCode. It's also an API keys manager, supports multiple providers or OpenAI/Claude/Gemini accounts.

  124. could not extract summary

  125. About Press Copyright Contact us Creators Advertise Developers Terms Privacy Policy & Safety How YouTube works Test new features NFL Sunday Ticket © 2026 Google LLC

  126. Xiaomi’s MiMo V2.5 Pro has landed at 54 in the Artificial Analysis Intelligence Index, tied with Moonshot’s Kimi K2.6 - the current top open weights model. MiMo V2.5 Pro’s weights are expected to be released soon, which would make MiMo V2.…

  127. I must say Iam kinda torn what to think about those models. At one hand they "ace" some questions on other sometime they behave genuinely weird.

  128. After months of silence… DeepSeek V4 just got announced and honestly, this might shake things again. Here’s what’s crazy: 🧠 1 MILLION token context window (yes… insane long-context memory) ⚡ Comes in two versions: V4 Pro → full power (reas…

  129. could not extract summary

  130. We have a chat system which we use haiku for because it is mostly about tool calling and summarisation of them. But we have many tools with pretty complex input schemas, and stuff like gemma didn't cut it, so we went with haiku.

  131. Did some test tasks with v4 flash. The context management, tool use accuracy and thinking traces all looked excellent.

  132. Do they charge below their cost? Or do they run their own cache?

  133. could not extract summary

  134. Spent the morning looking at the V4 tech report. The benchmarks are getting deserved attention, but I think the architecture is also worth digging into.

  135. Just a question for fun/curiosity: in your opinion, if I had enough money, how much would be needed and what configuration would be required to run DeepSeek v4? Maybe not necessarily everything in VRAM, maybe something hybrid.

  136. was shocked when saw that spec, immediatly went to the website and asked it to make a comprehensive single-html-web-OS and it indeed generated a single 100KB html for me...I'm speechless. https://preview.redd.it/6zcbzbkvj3xg1.png?width=287…

  137. DeepSeek V4—almost on the frontier, a fraction of the price 24th April 2026 Chinese AI lab DeepSeek’s last model release was V3.2 (and V3.2 Speciale) last December. They just dropped the first of their hotly anticipated V4 series in the sh…

  138. source : https://x.com/pankajkumar_dev/status/2047552208175354229?s=20

  139. I hope they include it in their next v4 release. Source: DeepSeek_V4_Technical_Report

  140. could not extract summary

  141. DeepSeek-V4: Towards Highly Efficient Million-Token Context Intelligence Technical Report👁️ Introduction We present a preview version of DeepSeek-V4 series, including two strong Mixture-of-Experts (MoE) language models — DeepSeek-V4-Pro wi…

  142. DeepSeek just dropped the preview of their V4 series, with both open-weight and available via API. 1M context window.

  143. https://api-docs.deepseek.com/ https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro/blob/main...

  144. DeepSeek-V4: Towards Highly Efficient Million-Token Context Intelligence Technical Report👁️ Introduction We present a preview version of DeepSeek-V4 series, including two strong Mixture-of-Experts (MoE) language models — DeepSeek-V4-Pro wi…

  145. https://huggingface.co/collections/deepseek-ai/deepseek-v4

← all threads