Running llama.cpp on Snapdragon Hexagon NPU seems promising (www.reddit.com)
-
https://github.com/ggml-org/llama.cpp/blob/master/docs/backend/snapdragon/README.md I have a OnePlus 12 with a Snapdragon 8 Gen 3. I followed the above README to cross-compile llama.cpp on Ubuntu and then copied it to the Termux directory on the…
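For context, the generic Android cross-compile flow for llama.cpp looks roughly like this. This is a sketch assuming the Android NDK lives at `$NDK`; the Hexagon/NPU-specific options come from the linked README, not from this snippet:

```shell
# Configure with the NDK toolchain for a 64-bit ARM Android target.
# (Hexagon backend flags from the Snapdragon README would be added here.)
cmake -B build-android \
  -DCMAKE_TOOLCHAIN_FILE="$NDK/build/cmake/android.toolchain.cmake" \
  -DANDROID_ABI=arm64-v8a \
  -DANDROID_PLATFORM=android-28

cmake --build build-android --config Release -j

# Then push the binaries to the phone and run them from Termux, e.g.:
# adb push build-android/bin/llama-cli /data/local/tmp/
```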
-
I’ve been building a local-first AI runtime that wraps local LLMs with a σ-gate — a measurement layer that decides ACCEPT, RETHINK, or ABSTAIN before an answer reaches you. The idea: local models should be able to say “I don’t know” instea…
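The three-way gate described above can be sketched as a threshold over some model confidence signal. Everything here (the `mean_logprob` input and both threshold values) is a hypothetical illustration, not the author's actual σ-gate implementation:

```python
from enum import Enum


class Decision(Enum):
    ACCEPT = "accept"
    RETHINK = "rethink"
    ABSTAIN = "abstain"


def sigma_gate(mean_logprob: float,
               accept_threshold: float = -0.5,
               abstain_threshold: float = -2.0) -> Decision:
    """Map a confidence signal (here: mean token log-probability of the
    draft answer) to a three-way decision. Thresholds are illustrative
    and would need tuning per model."""
    if mean_logprob >= accept_threshold:
        return Decision.ACCEPT    # confident enough to show the answer
    if mean_logprob <= abstain_threshold:
        return Decision.ABSTAIN   # better to say "I don't know"
    return Decision.RETHINK       # middle band: regenerate or re-prompt
```

The point of the middle band is that a borderline answer gets another sampling pass before the runtime gives up and abstains.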
-
https://huggingface.co/firstbober/gemma-3-270M-it-smol-thinker Here is an example of the output:

```
==================== THINKING ====================
Here is the thinking process: This is a large community with a wide range of interests…
```
-
Good LLM to generate ascii art? (www.reddit.com)
I tried with Qwen, but it sucked; Gemma3/4 was better but still not good enough. From Gemma: https://pastebin.com/raw/Qr5iMgYj Still looks like a bloody car accident, though.