model roundup

Gemma 3

4 items · started 2026-04-15 · closed 2026-04-25

Best second GPU for RTX 4070 Super? (www.reddit.com)

+64 10w

So i currently have an rtx 4070 super, and it can easily run models like gemma3 12b and even gpt-oss 20b (although it takes up to a minute to generate a response). I want to get a second gpu so i can run larger models around 20b-30b params.
Knlowledge Graph and hybrid DB (www.reddit.com)

+2 10w ollama gemma

Hello, everybody! I'm building and hybrid database with Qdrant and Neo4j for a few personal projects.
Turn an old Android phone into a Local AI Voice Assistant (www.reddit.com)

+111 10w gemma llama

I had a nice old cracked pixel 5a laying around that I wanted to get some use out of, so I turned it into a local AI Voice assistant. A server on a laptop running llama.cpp gemma-3-4b-q4.gguf served by flask connects to a script running on…
Upset about Nemotron Super (alleged) high precision post-training (www.reddit.com)

3 10w

https://arxiv.org/abs/2604.12374 Another nemotron-super paper was released, but from reading it still seems that NVFP4 post training process was not part of the program. They say they used a PTQ method for the final result.