claude is a token maxxing f*ckboi, what's next? (www.reddit.com via reddit)
model roundup
Gemma 4
-
is there anything with a 1M context window I can spend 100-200usd a day on that actually works? I don't have 5-10m to wait for claude to think about how to respond to a three word prompt.
-
I brought Claude-style artifacts to local models (www.reddit.comhttps)
One thing I miss when using local models is the artifact experience from Claude. With Claude, if you ask for a dashboard, chart, diagram, or landing page, you actually get the thing rendered in the chat.
-
[R] Gemma-4-12B-IT-Uncensored-Opus4.7-CoT (No Intel Loss) (www.reddit.com via reddit)
Hi everyone, I just released Gemma-4-12B-Uncensored-Opus4.7-CoT. To remove the safety filters without destroying the model's reasoning, I combined a precise ablation method with a CoT (Chain-of-Thought) data fine-tune to fully recover the…
-
Requirements: iPhone with A17 Pro or newer (8 GB RAM floor for the model), iOS 26+. TestFlight beta is open to anyone with a compatible device.
-
Show HN: Loqi, a "local-first" translation tool using Ollama/llama.cpp (github.com via hn)
I got tired of sending every text I translate to Google/DeepL. Even with all the opt-out options and privacy policies, it never felt right especially for some work documents, personal writing, or anything sensitive.
-
Looks like I found a minor glitch in claude cli (www.reddit.com via reddit)
https://preview.redd.it/0jai8prknl8h1.png?width=2040&format=png&auto=webp&s=61576e05a908614b672db1fc89cb46cd4e148cde Steps to reproduce Run claude cli with ollama provider (`ollama launch claude --model gemma4`) Run `/model` command in the…