model

gemma-3-12b-it

huggingface.co/google/gemma-3-12b-it ↗

2515706 downloads·704 likes·image-text-to-text·transformers

discussions

Gemma 3 4 2026-04-28 – 2026-05-03
Gemma 3 4 2026-04-15 – 2026-04-25

recent items

Running llama.cpp on Snapdragon Hexagon NPU seems promising (www.reddit.com) +82 8w

https://github.com/ggml-org/llama.cpp/blob/master/docs/backend/snapdragon/README.md I have an Oneplus 12 with Snapdragon 8 Gen 3. I followed the above README to cross-compile llama.cpp on Ubuntu and then copy to the Termux directory on the…

↯ Gemma 3 gemma llama
Creation OS: local σ-gated LLM runtime — BitNet/Qwen/Gemma, abstention, conformal gate, MCP, no cloud (www.reddit.com) +1 8w

I’ve been building a local-first AI runtime that wraps local LLMs with a σ-gate — a measurement layer that decides ACCEPT, RETHINK, or ABSTAIN before an answer reaches you. The idea: local models should be able to say “I don’t know” instea…

↯ Gemma 3 gemma qwen mcp
I've created a LoRA for Gemma 3 270M making it probably the smallest thinking model? (www.reddit.com) +84 8w

https://huggingface.co/firstbober/gemma-3-270M-it-smol-thinker Here is an example of the output: ``` ==================== THINKING ==================== Here is the thinking process: This is a large community with a wide range of interests…

↯ Gemma 3 gemma
Good LLM to generate ascii art? (www.reddit.com) 3 8w

I tried with Qwen but it sucked, Gemma3/4 was better but not good enough. From Gemma: https://pastebin.com/raw/Qr5iMgYj Still looks like a bloody car accident though.

↯ Gemma 3 gemma qwen
Best second GPU for RTX 4070 Super? (www.reddit.com) +64 10w

So i currently have an rtx 4070 super, and it can easily run models like gemma3 12b and even gpt-oss 20b (although it takes up to a minute to generate a response). I want to get a second gpu so i can run larger models around 20b-30b params.

↯ Gemma 3
Knlowledge Graph and hybrid DB (www.reddit.com) +2 10w

Hello, everybody! I'm building and hybrid database with Qdrant and Neo4j for a few personal projects.

↯ Gemma 3 ollama gemma
Turn an old Android phone into a Local AI Voice Assistant (www.reddit.com) +111 10w

I had a nice old cracked pixel 5a laying around that I wanted to get some use out of, so I turned it into a local AI Voice assistant. A server on a laptop running llama.cpp gemma-3-4b-q4.gguf served by flask connects to a script running on…

↯ Gemma 3 gemma llama
Upset about Nemotron Super (alleged) high precision post-training (www.reddit.com) 3 10w

https://arxiv.org/abs/2604.12374 Another nemotron-super paper was released, but from reading it still seems that NVFP4 post training process was not part of the program. They say they used a PTQ method for the final result.

↯ Gemma 3

← all models