Best second GPU for RTX 4070 Super? (www.reddit.com)
model roundup
Gemma 3
-
So I currently have an RTX 4070 Super, and it can easily run models like gemma3 12b and even gpt-oss 20b (although it takes up to a minute to generate a response). I want to get a second GPU so I can run larger models, around 20b-30b params.
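A rough back-of-the-envelope sketch of why 20b-30b models strain a single card: it estimates memory for the weights alone at a given quantization width. The helper name `weight_memory_gb` and the 4-bit figure are assumptions for illustration; the 12 GB value is the RTX 4070 Super's VRAM. Real usage is higher, since the KV cache, activations, and runtime overhead are ignored here.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for the model weights alone, in GB (1e9 bytes).

    params_billion * 1e9 weights * bits / 8 bytes, divided by 1e9.
    Ignores KV cache, activations, and runtime overhead.
    """
    return params_billion * bits_per_weight / 8


VRAM_GB = 12  # RTX 4070 Super

# Assumed 4-bit quantization; actual GGUF quants vary in bits per weight.
for params in (12, 20, 30):
    need = weight_memory_gb(params, 4)
    verdict = "fits" if need < VRAM_GB else "does not fit"
    print(f"{params}b at 4-bit: ~{need:.0f} GB of weights, {verdict} in {VRAM_GB} GB")
```

Under these assumptions a 20b model leaves very little headroom on 12 GB once context is added, which may explain slow responses if layers spill to system RAM.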
-
Knowledge Graph and hybrid DB (www.reddit.com)
Hello, everybody! I'm building a hybrid database with Qdrant and Neo4j for a few personal projects.
-
Turn an old Android phone into a Local AI Voice Assistant (www.reddit.com)
I had a nice old cracked Pixel 5a lying around that I wanted to get some use out of, so I turned it into a local AI voice assistant. A server on a laptop running llama.cpp gemma-3-4b-q4.gguf served by Flask connects to a script running on…
-
Upset about Nemotron Super (alleged) high precision post-training (www.reddit.com)
https://arxiv.org/abs/2604.12374 Another nemotron-super paper was released, but from reading it, it still seems that the NVFP4 post-training process was not part of the program. They say they used a PTQ method for the final result.
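For readers unfamiliar with the distinction, here is a toy sketch of what PTQ means: the weights are trained at higher precision and only rounded to the low-precision grid afterwards, with no further training to compensate. This is not the paper's method. It assumes the 4-bit E2M1 value grid and 16-element blocks commonly described for NVFP4, and it simplifies by keeping each block scale as an ordinary Python float rather than a low-precision scale. The names `quantize_block` and `ptq` are hypothetical.

```python
import math

# Magnitudes representable by a 4-bit E2M1 float (sign handled separately).
E2M1_GRID = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]
BLOCK = 16  # assumed block size: one shared scale per 16 values


def quantize_block(block):
    """Round one block to the E2M1 grid with a shared scale, then dequantize."""
    amax = max(abs(x) for x in block)
    if amax == 0:
        return [0.0] * len(block)
    scale = amax / E2M1_GRID[-1]  # map the largest magnitude onto 6.0
    out = []
    for x in block:
        # Round-to-nearest on the grid, in scaled units.
        mag = min(E2M1_GRID, key=lambda g: abs(abs(x) / scale - g))
        out.append(math.copysign(mag * scale, x))
    return out


def ptq(weights):
    """Post-training quantization: round already-trained weights block by block."""
    out = []
    for i in range(0, len(weights), BLOCK):
        out.extend(quantize_block(weights[i:i + BLOCK]))
    return out


weights = [i / 10 for i in range(-8, 8)]  # stand-in for trained weights
dequantized = ptq(weights)
error = max(abs(a - b) for a, b in zip(weights, dequantized))
print(f"max rounding error: {error:.4f}")
```

The rounding error printed at the end is what post-training in the quantized format would try to recover; with plain PTQ the model simply absorbs it.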