When we started Roo Code in late 2024 by forking Cline and adding what's now widely known as dangerously-skip-permissions, agentic coding was rough and experimental. But Roo Code took off fast: 3 million installs, a passionate community, r…
TL;DR 100 B2B buyer questions, 10 runs each, 5 categories, 1,000 runs total. Zero errors.
I was a personal trainer for about 7 years. quit around 14 months ago to try building an app full time, which sounds way more dramatic than it was.
I’ve been building plugins for Claude Code, and the first version of the idea was very Claude-focused. That made sense at the start.
Switching model mid conversation www.reddit.com
I wanted to know if switching models in mid conversation has any drawbacks. For example if I start off and opus and then drop down to sonnet to save on my usage, what are the disadvantages?
-
129 items
model roundup
Gemma 4Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.
- just now I tested 9 local models on the same flight sim prompt, all Q8, different Q providers, MLX
- 19m Is there an alternative between vLLM and Ollama that handles token prefill? (Arc Pro B70)
- 22m Anyone know how to run the new gemma 4 edge gallery litertlm format in the browser? Trying to load Gemma 4 e4b.
- 26m Gemma 4 is much less popular on Hugging Face than Qwen 3.x.
- 38m Did Google hide the best version of Gemma 4 e4b in Android? The extracted model beats Unsloth and everything else I've tried.
92 itemsmodel roundup
Opus 4.6Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.
- 5m Gave a coding agent access to 2M+ research papers. Its Python tests caught 63% of bugs; with the papers, 87%. 9-task benchmark.
- 1h Differences Between Kimi K2.5 and Kimi K2.6 on MineBench
- 5h Daily created issues in anthropics/claude-code around the last 3 Anthropic model releases
- 1d Has anyone actually tested Opus 4.7 medium vs Opus 4.6 high?
- 1d anyone else feel like opus 4.6 is better than 4.7?
Howdy folks, thought this was too perfect to not share. I'd built a docker container for a TCG I like to play, just testing out if I could make an EDHrec for it.
I built Hydra because I kept losing my flow when Claude Code hit usage limits mid-task. I would copy context, open another tool, and then re-explain everything.
235M param LLM from scratch on a single RTX 5080 www.reddit.com
Grok 4.3 Beta grok.com
Tsukuyomi A reverse-proxy that sits between an LLM agent and its model API. The agent thinks it's talking to Anthropic or OpenAI directly, but it's actually talking to this thing first.
Ordering with the Starbucks ChatGPT app was a true coffee nightmare www.theverge.com
Venti iced coffee, light skim milk. That’s what I get at Starbucks.
-
214 items
model roundup
Opus 4.7Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
35 itemsmodel roundup
Sonnet 4.6Sonnet 4.6, a new release noted for its "unhinged" behavior, has sparked discussions among users about unexpected changes in software performance and cost management strategies involving Cursor and Claude APIs.
In January I had a 4,200-line HTML file pretending to be a personal finance app. One file.
If you don't like Claude Desktop or ChatGPT app you're not alone, here are some of the reasons why I don't like them and decided to built an alternative. Lack of control You can’t control the web-search (depth, breadth and number of source…
offloading to free AI www.reddit.com
Hey, I am not a programmer, I am an unemployed sysadmin. I have been making projects with Claude, and like everyone else, I am on a quest to reduce token usage.
Hello, I need to port an Electron application (around 20k lines of code) built with React into a mobile app using React Native for both Android and iOS. I'm planning to use Claude Code for this task.
I am trying to pay my membership and stripe keep declining a valid card. So here OpenAI probably need to manually process my payment and tell the stripe: "hey idiot this guy has been our members for a while, he wont suddenly decide to use…
Proper vibe coding with local LLM for average Joe www.reddit.com
The short answer is you don't. But let me explain a bit more.
-
81 items
event
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
- 2h Show HN: Runner – A Better Claude Cowork
- 3h Feels like AI agents are splitting into 3 very different directions…
- 5h Hardware set up advice
- 5h Built an Autonomous Content Engine on Claude Code - Sharing the Playbook
- 7h MCP connectors in Cowork: the tools work but Claude can't find them. Anyone else?
Built a skill that lets Claude drive local image tools via CLI — no API calls, no cost, works offline. Install: npx skills add ramon-webdevpro-nl/claude-skills@gimp-inkscape What it covers: - Resize, crop, batch, watermarks, WebP → ImageMa…
GMKTEC EVO-X2 Ryzen AI Max+ www.reddit.com
Szukam nowego peceta do llm i flux2 pro ,generowanie wideo, generowanie grafiki. Czy ten komp poradzi sobie ?
Hi there, I wonder if it possible to use our Enterprise subscription (Claude) to use in VS Code for auto-completion or inline suggestions? Claude code does not provide this feature, and while other extensions like Kilocode or Continue can…
Hey, I'm building an Agentic RAG pipeline and struggling with two decisions: Chunking strategy — fixed-size, semantic, or hierarchical? In an agentic setting where the agent can re-query iteratively, does it make more sense to use smaller…
Chat gpt image 2 weird weakness www.reddit.com
Maybe it is just me but when i do reference images and send into it for real life people, (not just the one in this photo but for other people too) it seems like it will get disorted (You can see the disorted lines around the body) around…