#aider
34 items
Same 9B Qwen weights: 19.1% in Aider vs 45.6% with a scaffold adapted to small local models (www.reddit.com)
Lessons from building a coding agent for 8k context windows: token budgeting, parallel executors, and per-file isolation (www.reddit.com) Most AI coding tools (Cursor, Aider, Claude Code) assume you have a 200k-token model. If you're running local LLMs through Ollama or LM Studio, or hitting free-tier cloud APIs like Groq or OpenRouter, you've got around 8k tokens to work wi…
Show HN: Which public repos are friendliest to an AI coding agent? (www.agentfriendlycode.com via hn) Public leaderboard ranking GitHub, GitLab, and Bitbucket repos by how agent-friendly they are for Claude Code, Cursor, Devin, GPT-5 Codex, Gemini CLI, Aider, OpenHands, and Pi — per model, with AGENTS.md / CLAUDE.md, CI, tests, and dev-env…
New to agents, mcp , etc how do I get to a point where i can lay back and let my agents do the work (www.reddit.com) Currently working on some projects. I have some agents and Chrome scraping tasks I'd like them to do.
Show HN: Zerminal – a terminal-first Zed fork for AI coding agents (zerminal.dev via hn) A terminal-first development environment for agentic coding. Use Claude Code, Codex, Aider, and other CLI agents in a focused workspace.
The Benchmark Gap: 1,472 runs show coding-agent context changes outcomes (github.com via hn) The Benchmark Gap 1,472 OpenCode runs reveal what vendor scores don't tell you about AI coding tools. Plus a 60-run supplemental Aider/Cline probe, a 48-run installed-CLI smoke, and a 10-row current-OpenCode context probe to test whether t…
Honey, I Shrunk the Coding Agent (itayinbarr.substack.com via hn) Honey, I Shrunk the Coding Agent Coding Agent Adaptation Lets a 9B LLM Outperform 10x Larger Models on Aider Polyglot Benchmark Small local language models often appear weak when they are placed inside coding agents designed around frontie…
Show HN: Built an open-source local firewall for AI coding agents (news.ycombinator.com) GitHub: https://github.com/ashp15205/guardian-runtime Docs: https://ashp15205.github.io/guardian-runtime/ Hi Guys, I built Guardian Runtime: a local FinOps and security proxy for AI coding agents (like Claude Code, Cursor, and Aider). You…
Show HN: Agentic Intent Benchmark (github.com via hn) intent-bench An open-source benchmark measuring whether providing structured intent to coding agents improves implementation effectiveness. What This Measures Existing agent benchmarks (SWE-bench, HumanEval, Aider Polyglot) test single-req…
Anyone evaluated the difference between Qwen Code for the local qwen models vs another harness? CC, OC, LC, Aider etc.. (www.reddit.com) For me, opencode doing fantastically but was wondering if qwen code would be more native and have better functionality, since idk which agentic harness they used to get their benchmark results
Sequencer: Visual multi-agent workflow pipelines. (www.reddit.com) I built Sequencer, an open-source visual prompt-to-agent chaining engine. When I build apps with AI tools, I break the project into bite-sized prompts, then copy-paste each one into Cline or Aider and wait.
Making AI coding sessions persistent across agents (github.com via hn) drift_ai Vendor-neutral handoff for AI coding tasks between Claude, GPT, Gemini, DeepSeek, local LLMs. Reads from Claude Code, Codex, Cursor, Aider.
Show HN: Smith – AI Agent Orchestrator (getsmith.dev via hn) multi-agent orchestration Run Claude Code, Codex, Gemini CLI, Aider, and OpenCode in parallel. Each in its own terminal pane with custom naming and live status subtitles.
How do I use gemma4 on 5090 gpu for coding? (www.reddit.com) I'm trying to replace openai codex which i used for development all the time, with gemma4 on 4090, small tasks it solves quite impressively, but i need to have some agent. So I tried to connect 31b to cline and to aider and it didn't reall…
We built Branchless, a desktop app for running parallel dev sessions with agents, terminals and editors, without switching branches (www.reddit.com) Hey everyone, We have been building Branchless, a desktop app for Mac, Windows and Linux. The basic idea is simple: we wanted a way to work on multiple tasks at the same time without constantly switching branches, stashing changes, opening…
10-gate security audit SKILL for web apps (www.reddit.com) There are a few security focus SKILLs. We are working another new one for web app.
Show HN: Ait – Claude, Codex, and Aider as a team, on your laptop (github.com via hn) ait Local control plane for multi-agent AI coding Run Claude Code · Codex CLI · Aider · Gemini CLI · Cursor as a team on the same task: context handoff, review gate, attempt ledger, all on your machine. 60-second walkthrou…
Two flat-fee agent endpoints, no token meter: OpenClaw chat ($7/mo, 128K ctx) + All You Can Code ($19/mo, 256K ctx). OpenAI v1. (www.reddit.com) For anyone running agents (coding or otherwise) who'd rather pay a flat fee than meter tokens. Two tiers, both flat fee, both unlimited: OpenClaw ($7/mo) - Nemotron-3-Nano-Omni-30B-A3B - 128K context - For general-purpose agents: research,…
Why does GitHub Copilot feel less accurate compared to Agentic/Autonomous AI tools ? (www.reddit.com) I'm looking for a solid solution to bridge this gap. How can we actually use these tools properly for complex development?
We built a free tool that generates a DESIGN.md from any live URL, keeps AI coding agents on-brand (www.reddit.com) The Google Labs DESIGN.md spec launched last month, it's a machine-readable markdown file your AI coding agent reads to understand your design system. This tool automates creating it.
APK code fix (www.reddit.com) Hello everyone, I'm working on an APK project and I've run into some problems with notifications, etc. I'm looking for someone who can help me fix the code and get the APK working. I'm currently using Android Studio.
SafeSandbox – infinite undo for AI coding agents (Cursor, Claude Code, Codex) (github.com via hn) SafeSandbox Infinite undo for AI coding agents. SafeSandbox is a local-first developer tool that automatically creates snapshots and checkpoints while AI coding agents (Cursor, Claude Code, Codex, Aider, etc.) modify your repository.
Coding Agent Harness Comparison 2026: Claude Code, Codex, Amp (techstackups.com via hn) Coding Agent Harness Comparison 2026: Claude Code, Codex, Amp, OpenCode, Gemini CLI, Pi, Command Code, Factory, and Aider In 2023, there was one serious terminal coding agent: Aider. By May 2026, there are at least nine, representing every…
Benchmarking Local LLM/Harness Combinations (neuralnoise.com via hn) I’ve been running a small benchmark, harness-bench , that pairs local LLMs (served via llama.cpp ’s llama-server ) with agent harnesses (Aider, Claude Code, OpenCode, Pi, Qwen CLI) on 16 software-engineering tasks across Python, PyTorch, J…
Show HN: An agent that remembers across sessions (no chat history) (github.com via hn) Hi HN — I built this in my off-hours over the last 3 months. Sharing now because I just filed the provisional patent yesterday (US 64/050,345) and the repo is freshly public.
Little-coder: A coding agent optimized to smaller LLMs (github.com via hn) little-coder A coding agent tuned for small local models, built on top of pi. The research story behind all this — why scaffold–model fit matters, how a 9.7 B Qwen beat frontier entries on Aider Polyglot, and what the load-bearing mechanis…
Has anyone managed to use gemma 4 e4b in Open Code/other agentic TUIs? (www.reddit.com) Hi everyone, as a power user I hit Claude Code's usage cap too often I wanted to set up my own local model, however I only have RTX 5070 with 12 GB of VRAM so the only realistic option was Gemma 4 with effective 4B params. When I tried to…
Does Cursor ever create more cleanup than expected because the task was too loosely scoped? (www.reddit.com via reddit)
The AI literally deleted everything on my computer and I was left staring at a frozen screen (www.reddit.com via reddit) This isn’t a horror story. It actually happened to me. I was using an AI agent to automate some tasks.
Grok Build CLI, agents-cli, and the CLI coding tool gold rush (www.reddit.com) xAI dropped Grok Build CLI. Google has agents-cli.
Deepseek tui alternatives, when do you jump from single model terminal agents (www.reddit.com) Been using Deepseek-Tui for days. solid for v4 workflows.
I wanted to know small local LLM code and made a personal projects. (www.reddit.com) I Started toying around LLM about sometimes ago. I think Qwen3.5 came out after like a month.
Here is what most people get wrong about saving tokens with AST tools (www.reddit.com) I spent the last day benchmarking codebase context tools against a real AI agent. Not synthetic token counts.
Agentic Guardrails: 4 markdown workflows to improve the output quality of AI coding agents (github.com via reddit) Agentic Guardrails Reusable workflow templates that keep AI coding agents from shipping sloppy code. These are markdown-based instructions that any AI coding agent can follow — Cursor, Claude Code, opencode, Aider, Gemini CLI, or anything…