Meet Qwen3.6-35B-A3B: Now Open-Source! 🚀🚀 A sparse MoE model, 35B total params, 3B active. Apache 2.0 license.
#agentic
88 items
Qwen3.6-35B-A3B released! www.reddit.com
So, this week claude wiped agentic AI startups with a new update. Also, as they have mythos now, they will ship things very fast without any trouble www.reddit.com
Qwen3.6-35B-A3B: Agentic Coding Power, Now Open to All qwen.ai Qwen Studio offers comprehensive functionality spanning chatbot, image and video understanding, image generation, document processing, web search integration, tool utilization, and artifacts.
‘Addictive’ agentic coding has developers losing sleep www.reddit.com
Read through Anthropic's 2026 agentic coding report, a few numbers that stuck with me www.reddit.com Anthropic put out an 18-page report on agentic coding trends. Skimmed it expecting the usual hype but a few things actually caught me off guard. The biggest one: devs use AI in ~60% of work but only fully delegate 0-20% of tasks.
Multi-Agentic Software Development Is a Distributed Systems Problem kirancodes.me
Tried claude code. Hate it. www.reddit.com
Comparing Qwen3.5 27B vs Gemma 4 31B for agentic stuff www.reddit.com
A disciplined Cursor 3.0 Agentic workflow for complex backend/system design tasks www.reddit.com
GPT vs Claude in a bomberman-style 1v1 game www.reddit.com
Claude is getting worse, according to Claude www.theregister.com Brief outage follows growing number of quality complaints. Once the AI darling of programmers everywhere, Anthropic's Claude has been stumbling mightily, both in terms of cost and perceived quali…
Show HN: A CLI that writes its own integration code docs.superglue.cloud
anyone else stuck at their desk during long agentic runs? www.reddit.com so I've been running some complex agentic refactors and these sessions go 6+ hours because the agent is grinding through a massive legacy codebase, and I can't really walk away. close the laptop and the process dies. re-initializing takes…
2x Asus Ascent GX10 - MiniMax M2.7 AWQ - cloud providers are dead to me www.reddit.com Hello, I've been on a quest to get something "close enough" of Opus 4.5 running locally, for agentic coding, as SWE with 15 years of experience. I tried with one spark (yeah I'm calling my Asus Ascent GX10 sparks - they're the same), with…
Ive automated my email/sms/phone www.reddit.com we got it good boys! how many of you are doing this??
AI governance isn't failing because we lack regulation i mean like it's failing at execution www.reddit.com There's a lot of movement around AI regulation right now (EU AI Act, US frameworks, etc.), but in practice many of these governance models don't survive contact with real, agentic systems. I've been digging into why compliance frameworks t…
What do you use for autocomplete in 2026? (VS Code) www.reddit.com
Show HN: Mercury – No-code orchestration for human and agent teams www.mercury.build Hey HN, I'm Naveen, one of three co-founders building Mercury (mercury.build). We spent the last year in deploying AI agents for teams in large enterprises.
$1,400/month with Cursor + Claude API — how are you managing costs while keeping a real agentic workflow? www.reddit.com
Systems Engineering: The Key to Building Agentic Software That Works www.ashpreetbedi.com
Tested 6 browser use agents for real-world tasks — here's an honest breakdown + looking for recommendations www.reddit.com
2026 Agentic Coding Trends Report [pdf] resources.anthropic.com URL: https://resources.anthropic.com/hubfs/2026%20Agentic%20Coding%20Trends%20Report.pdf Published: Wed, 21 Jan 2026 22:37:47 GMT. Pages: 18.
Beyond Prompts: A Tiered Trust Model for Autonomous Agents (Experiment Report) www.reddit.com We often talk about agent autonomy, but rarely about the "Harness Engineering" required to make that autonomy safe. I’ve been running a design experiment comparing agentic workflows on open platforms (OpenCode) vs.
Why model drift is the real failure mode for agentic systems www.reddit.com Across Twitter and Reddit, I keep seeing the same complaint: Claude feels worse. Not on a benchmark.
Anybody has practical experiences using Chinese models? www.reddit.com So like with coding or any craft, I think there's a proper Tool for the job. Sure you can use a stone to drive in a fence post, but a sledge is usually more economical.
Huge throughput gains when switching agent evals to shared environments with per-run isolation www.reddit.com Thanks all for the comments on my previous post about local-first agentic evaluation collapsing in long stateful agents runs, just sharing an update on where I’m at now in case it helps as I had another issue to overcome. Took on board the…
Zuver – Build your enterprise Agents with just 10MB RAM news.ycombinator.com
Agentic coding at enterprise scale demands spec-driven development venturebeat.com
Agentic AI pentesting with Strix: results from 18 LLM models theartificialq.github.io
Show HN: The opensource, reliable, scalable Agentic AI framework under 10MB zuver.cc
Show HN: OQP – A verification protocol for AI agents news.ycombinator.com As AI agents autonomously write and deploy code, there's no standard for verifying that what they shipped actually satisfies business requirements. OQP is an attempt to define that standard.
Mi – agentic harness in 30 lines of JavaScript github.com
I built a self-evolving agentic loop that ran 104 iterations autonomously to find questions that break every LLM — here's the architecture www.reddit.com Why I built this: I wanted to find the next "strawberry problem" — simple questions any kid can answer but every LLM gets wrong. Instead of manually testing questions, I built a system that does it autonomously.
A Black-Box Contract Engine for Agentic Software Development github.com Project Dojo A Black-Box Contract Engine for Agentic Software Development Dojo is a declarative testing engine built in Go. It acts as a transparent Man-in-the-Middle proxy between your Software Under Test (SUT) and its dependencies.
Ask HN: We dont need a programming language now? news.ycombinator.com I've seen agentic IDEs like Cursor or Antigravity, and the main trend seems to be development with just ideas, where although the changed lines are shown, the code is becoming less and less visible. If we are becoming language agnostic, shouldn't we op…
Solving the "Agentic Kill-Switch": Moving from Prompt Guardrails to a Python-native Safety SDK www.reddit.com The biggest hurdle for taking agents from "cool demo" to "production tool" is the lack of a reliable circuit breaker. We're currently relying on the LLM to "behave" via system prompts, but as we know, jailbreaks and hallucinations make tha…
Ask HN: Which LLM model and agentic CLI are you using for local development? news.ycombinator.com I’ve been testing a handful of models the past few weeks, but I still haven’t settled on one yet… I’m curious to see what models, their sizes, on what hardware, and which agentic tool people are using
Scaling from single-repo Claude projects to a multi agentic workflow www.reddit.com Hi everyone! Just a quick exchange on what I am using — and I'd love your take on it 🤖 So far I have mainly been doing one-off projects, setting up Claude in a single repo at a time.
Ask HN: What standards or protocols exist for AI Agent permissions news.ycombinator.com
The (Mostly) Agentic SDLC amoshaviv.com
Systems Engineering: Building Agentic Software That Works twitter.com
Tradclaw: an open source AI mom for agentic parenting twitter.com
Is qwen3 coder next still relevant with qwen3.5 release for agentic coding? www.reddit.com
What are the key features that make an AI system truly "agentic"? www.reddit.com
Show HN: A Bomberman-style 1v1 game where LLMs compete in real time github.com
Show HN: On-Device vs. Cloud LLMs for Agentic Tool Calling in a Real iOS App subralabs.com We built an AI concierge into a resort directory app for iOS. The feature needed to search a dataset of ~85 properties, apply filters, find nearby airports, and respond conversationally in Italian.
Agentic Search Leaderboard www.algolia.com We tested every major LLM on real shopping queries through Agent Studio, Algolia's platform for building search and discovery agents. Three dimensions of quality.
OpenClaw Self-Improvement Loop: adversarial agentic self-modification workflow github.com An adversarial framework for AI agent self-modification, built and battle-tested in production. Inspired by karpathy/autoresearch.
Agentic dashboard analysis www.reddit.com
Show HN: A better alternative to CLI and MCP for local tools github.com
Observing the shift toward open-weight models for agentic coding workflows www.reddit.com
Is my 'Retry Tax' math correct for DeepSeek V3/V4 agents? (Project Feedback) www.reddit.com
Agentic Reasoning in Practice: Making Sense of Structured and Unstructured Data www.databricks.com Enterprise data is rarely useful in a silo. Answering questions like, "Which of our products have had declining sales over the past three months, and what potentially related issues are brought up in customer reviews on various seller site…
Two small agentic patterns to wire apps directly to Claude Code www.reddit.com These two patterns turn Claude Code into a personal assistant. You interact normally with it and it listens in the background for events, handles them, and gets back to interacting with you.
Complex, parallel, long-running claude/agentic sessions - what is the point? where is the value? www.reddit.com Here is how I view AI Agents field (with focus on SWE/research) right now: - "chats online" gpt/gemini/claude --> general use - "vscode like extensions" cursor/antigravity/cline vs code extension/cc vs code extension etc. --> for coding, b…
Show HN: ZettelForge – Agentic memory for cyber threat intelligence github.com ZettelForge The only agentic memory system built for cyber threat intelligence. Give your AI agents persistent memory with entity extraction, knowledge graphs, and STIX ontology -- no cloud, no API keys, works offline.
Show HN: Agentfab – A Distributed Agentic Platform github.com Hi HN, I’m the creator of agentfab, a distributed agentic platform that features task decomposition, multi-agent orchestration, model heterogeneity with custom agentic fabrics, bounded review loops, and a bespoke self-curating memory syste…
Agentic Engineering Methodology – Structured AI-Assisted Dev (Karpathy, Osmani) github.com Agentic Engineering Methodology A structured, human-led methodology for planning and executing software projects with AI coding agents. Built from practitioner experience and refined with research from Andrej Karpathy, Addy Osmani, and the…
Agent Continuity: Disaster Recovery for the Agentic Era gavinpineapple.substack.com Agent Continuity: Disaster Recovery For The Agentic Era What happens when the proverbial 💩 hits the (GPU) fan and you lose all your agents? My Favorite Alien I would like you to meet Rocky 🪨🦞 Rocky (named after the adorable alien from Proj…
Local models capabilities www.reddit.com Claude CLI, Codex CLI and Gemini CLI, all have agentic capabilities that it is capable of editing files or folders in my local machine directly or the apps that I have integrated using MCPs when working on my request like coding task or re…
Stop letting your agents decide everything — extract deterministic steps wherever you can www.reddit.com Context: I have been building Litmus (a brutal market validation tool) and I've learnt that if your agentic pipeline needs to produce factual, reliable output, stop letting the AI decide everything. The insight: extract deterministic steps…
Show HN: Idea File for LLM Cycling Coach gist.github.com This is heavily inspired by Andrej Karpathy's LLM Wiki, and could be used to create many other types of "Agentic Apps" or however you want to call them. My specific implementation uses Claude Code, TrainingPeaks, Todoist and Apple health.
Model agnostic, agentic annotation tools for text highlighting old.reddit.com
Agentic AI | Confusion between reading the context of SKILL and reading the file www.reddit.com
Agentic AI Tools – A directory to find and compare AI agent tools agenticaitools.net
Show HN: I analyzed 591 agentic engineering jobs: LangChain dominates at 22% agentic-engineering-jobs.com
Macbook Vs Strix Halo www.reddit.com
Built an open-source knowledge graph that gives AI agents domain expertise in bioinformatics, hosted as an MCP server www.reddit.com
Scaling Managed Agents: Decoupling the brain from the hands www.anthropic.com Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Why Engineering Teams Need an Agentic Layer, Not Just AI Chat medium.com
Show HN: The Harness for Creative Agents www.flickspeed.ai Coding agents need shell. Creative agents need canvas.
Enterprises power agentic workflows in Cloudflare Agent Cloud with OpenAI openai.com
Created a linter for agentic code smells
Strong feeling: we are in a folded AI reality news.ycombinator.com
Why Your LLM Leaderboard Scores Don't Matter www.reddit.com Leaderboard scores often don’t translate to production performance — even with newer agentic / Arena-style evals. The main issue seems to be that benchmarks are standardized, while real systems depend heavily on prompts, data distribution,…
m5 pro 64gb worth it for local agents or wait? www.reddit.com I am currently on an m3 mbp with 24gb ram. For regular python and django work the machine is perfect and i have no need to upgrade for speed.
Cloud AI is getting expensive and I'm considering a Claude/Codex + local LLM hybrid for shipping web apps www.reddit.com I'm a designer who's been working on web apps and plugins for the past 5 months. Right now I'm building an After Effects plugin (close to shipping) and a music learning game experience.
computation is the missing bedrock of agentic memory www.reddit.com link to full article in comments TLDR: - LLMs are the wrong substrate for memory. Prediction can't do routine work, repeatable work consistently.
Running a full agentic coding loop locally on a 3090. Here's what actually works in 2026. www.reddit.com After months of testing, I finally have a local setup that doesn't make me want to go back to the API. Hardware: RTX 3090 (24GB VRAM) Models tested: Qwen2.5-Coder 32B Q4_K_M, DeepSeek-Coder-V3 Q4, Llama 3.3 70B Q3_K_M Inference: llama.cpp…
I have a Macbook AIR M5 Base and I want to run an Agentic Coding program, similar to Claude Code or Codex. Besides the model, how do I do it? I've already tried with Ollama, VS Code, Opencode, and haven't been able to. (I'm not a developer, sorry) www.reddit.com
Self employed, Small biz folks: Have you unlocked huge revenue gains with Claude specifically? www.reddit.com We've heard about the increase in productivity in engineering departments in large companies with Claude Code, but I'm curious about implementations in small businesses. I'm especially curious about folks who work for themselves (i.e. non-…
Excess of Agentic AI... does that make sense? www.reddit.com Does it make sense for AI companies to be limiting access to the AI models themselves, precisely because of Agentic AI? Let’s think about it, if there is already not enough computing power to sustain the gigantic, and increasingly excessiv…
How can I use agentic AI to automate my WFH dayjob? www.reddit.com
Here is what most people get wrong about saving tokens with AST tools www.reddit.com I spent the last day benchmarking codebase context tools against a real AI agent. Not synthetic token counts.
Agentic Guardrails: 4 markdown workflows to improve the output quality of AI coding agents github.com
reliable way just to have cursor agentic ability and IDE with external provider api without cursor pro?
The next frontier is AI QA www.reddit.com