model roundup

MiniMax 2.7

9 items · started 2026-04-13 · closed 2026-04-19

OpenCode + Self host Minimax-2.7 via SGLang? (www.reddit.com)

+23 10w minimax

Does anyone know how to set up OpenCode to work properly with self-hosted MiniMax-2.7? The messages contain <think> and </think> tags, and OpenCode fails to parse the answer correctly.
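One client-side workaround is to separate the reasoning block from the answer before the text reaches the tool. The following is a minimal sketch, assuming the reasoning is wrapped in literal <think> and </think> tags as the post describes. `split_reasoning` is a hypothetical helper written for illustration; it is not part of OpenCode or SGLang.

```python
import re

# Matches one complete <think>...</think> block; DOTALL lets "." span newlines.
THINK_BLOCK = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(message: str) -> tuple[str, str]:
    """Return (reasoning, answer) extracted from a raw model message.

    Hypothetical helper: assumes reasoning is delimited by literal
    <think> and </think> tags. Messages without tags pass through unchanged.
    """
    # Collect the text inside every think block, in order of appearance.
    reasoning = "\n".join(part.strip() for part in THINK_BLOCK.findall(message))
    # Remove the blocks (tags included) to leave only the visible answer.
    answer = THINK_BLOCK.sub("", message).strip()
    return reasoning, answer
```

Calling `split_reasoning(raw_text)` on each completed response returns the answer as the second element, which can then be passed on without the tags. A streaming response would need the tags tracked across chunks, which this sketch does not handle.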
Those of you running minimax 2.7 locally, how are you feeling about it? (www.reddit.com)

+1221 10w minimax vllm

I'm running the raw version straight from the MiniMax release on Hugging Face (https://huggingface.co/MiniMaxAI/MiniMax-M2.7) on 3 RTX Pro 6000s with vLLM, so no quantization.
But why Local LLM? How does this make economic sense vs API? (www.reddit.com)

26 10w minimax

Hey guys, come fight me: how do you justify local LLMs from a value perspective? It doesn't seem economical?
How does a self correcting loop for AI agents work? (www.reddit.com)

+12 10w minimax sonnet llama

Hey guys, I just checked out MiniMax 2.7, where they used AI to train itself over more than a hundred loops and improved its performance by 30%. How does that work? Can I also run a script that makes AI store its memory in a loop on a m…
Upgrade paths for my 256g ddr4 ram + 4x24g vram system (www.reddit.com)

+110 10w glm llama

So I was just about to give up playing with local models, until I realised I can actually run GLM 5.1 at not too horrible speeds, using this quant https://huggingface.co/ubergarm/GLM-5.1-GGUF/tree/main/IQ2_KL in ik llama. Getting around 6.…
Guys we have to change the pelican test (www.reddit.com)

+4864 10w minimax glm deepseek+3

So I have been seeing more of those pelican-on-a-bike SVG tests, and while they work, I feel like (and maybe you guys do too) they are getting kinda benchmaxxed, so we should switch things up soon. This is my idea: generate me a html svg of…
A 1-bit quant of MiniMax 2.7 that runs from a CD at 1500 tk/s would be nice. (www.reddit.com)

+10 10w minimax

Badda Boom.
Optimizing MiniMax 2.7 - Experts vs Layers for best VRAM/RAM utilization (www.reddit.com)

4 10w minimax

I'm curious whether there is a rule of thumb for how best to load MiniMax across different VRAM/RAM configurations. Is there a way to estimate how many experts versus layers to offload for individuals running either 16GB/24GB/32GB…
Mac Studio Performance Suggestion For minimax (www.reddit.com)

12 10w minimax ollama qwen

I need help. I want to self-contain my MiniMax 2.7 and Qwen 3.5 (122 billion parameter) models.
