We really need a website to stop these models from churning out identical code. From my experience they have started writing very functional code, the generic looking aesthetic is a poor prompting problem.
UAE Plans to Run 50% of Government on Agentic AI Within Two Years (www.mitsloanme.com via hn)
UAE Plans to Run 50% of Government on Agentic AI Within Two Years Agentic systems will analyze, decide, and execute across ministries under centralized oversight. News - Oman to Scale AI Ecosystem With New Special Economic Zone - UAE Bets…
One bash permission slipped... (www.reddit.com)
How? It kept getting chained bash commands wrong, with wrong escapes.
duralang – Durable Stochastic AI Agents with One Decorator duralang makes every LangChain LLM call, tool call, MCP call, and agent-to-agent call a Temporal Activity automatically — via a single @Dura decorator. No workflow DSL.
Stop Building MCP Servers for Personal Tools (www.reddit.com)
Everyone building AI agent tools reaches for MCP first. I did too.
Claude memory and projects question from newbie (www.reddit.com)
I'm a newbie to Claude so I'm learning this weekend. I like for it to remember and learn about myself but it seems Claude can't do that between projects right now so is a person better off just not using projects if they like it for it rem…
Reminder: Have you checked your context lately? (www.reddit.com)
Just a reminder to run /context. I like to think I was on top of this!
Show HN: Interpretable AutoResearch – Legible Agent Workflows (github.com via hn)
Interpretable Autoresearch Built for Claude @ MIT Hackathon. "Agents whose behavior you can read, verify, and trust." Track: Governance & Collaboration — Help people work together better Theme: Human-AI teaming through transparent, auditab…
-
280 items
model roundup
Qwen 3.6Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 23m Does running a model (like qwen3.6-27b) on vllm or transformers use less VRAM than llama.cpp?
- 3h First time GPU buyer. Got a RTX 5000 Pro. Was it a bad decision compared to two 3090s?
- 4h General vs Reasoning [Qwen 3.6]
- 4h Anyone tried +- 100B models locally with foreign languages?
- 7h Using ollama for Openclaw
145 itemsmodel roundup
Gemma 4Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.
- 1h These are the benchmark results for Gemma4 E4B tested on my iPhone 16 Pro.
- 2h Gemma 4 E2B runs surprisingly well on my 8GB Android phone, so I built a private voice notes app around it.
- 6h Potential of Gemma4 Per-layer embeddings?
- 1d Tried running Claude Code with local LLMs via Ollama — ended up subscribing to Pro anyway. But now I can't disconnect from the local server.
- 2d gemma-4-31B-it-DFlash has been released
Environmental impact (www.reddit.com)
Unpopular opinion: OpenAI should spend a LARGE ad campaign budget on some sort of environmental impact campaign. Whether it’s planting a tree every X amount of queries OR at least clarifying that AI WATER USE IS EXTREMELY OVERBLOWN.
If you use Claude , you need this yesterday. (www.reddit.com)
GitHub monster: forrestchang/andrej-karpathy-skills (108k+ stars and climbing fast)One CLAUDE md file that turns Claude from "vibe coder" into a disciplined engineer: Stops random assumptions Kills overcomplicated bloat Makes changes surgi…
Claude Code Leak: 8100 Takedown Requests and the Birth of Claw-Code (www.heise.de via hn)
Claude Code Leak: 8100 Takedown Requests and the Birth of Claw-Code A human error at Anthropic reveals the architecture of autonomous AI agents, sparking a heated debate about copyright for AI-generated code. A moment of carelessness was e…
Llama.ttf: a font file which is also a large language model and inference engine (fuglede.github.io via hn)
llama.ttf llama.ttf is a font file which is also a large language model and an inference engine for that model. llama.ttf is a font file which is also a large language model and an inference engine for that model.
My Agent Memory Library Helps Write Indie Articles (benemson.com via hn)
Illustration by ChatGPT My Agent Memory Library Helps Write Indie Articles Authors: Ben Emson & Alv (Ben’s knowledge vault agent, powered by elfmem) GitHub: https://github.com/emson/elfmem An experiment: can a custom memory library help an…
I’m working on an assessment where I need to create a coding task (basically SWE-bench style). The idea is: take an existing repo (I’m using pydantic) write tests that fail on the current code provide a patch that fixes it and the task sho…
could not extract summary
How Kepler built verifiable AI for financial services with Claude (claude.com via hn)
How Kepler built verifiable AI for financial services with Claude Inside a platform that indexes 26M+ SEC filings, earnings call transcripts, IR presentations, consensus estimates, and private data across 14,000+ companies and 27 global ma…
-
262 items
model roundup
Opus 4.7Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 2h I was using Opus 4.7 to do research on the capabilities of Claude Mythos, and got this error.
- 4h ChatGPT Plus (20$) + Claude Pro (20$) or Claude Max (100$)
- 8h LLM proxy that lets Claude Code talk to any model
- 10h Set up multi-agent orchestration with Claude Code as the boss... am I overcomplicating this?
- 12h Does disabling /advisor significantly reduce token usage when using Opus?
17 itemsmodel roundup
Qwen 3Qwen3-0.6B is a large language model from the Qwen series, featuring dense and mixture-of-experts architecture, with significant improvements in reasoning capabilities and human preference alignment. Community feedback highlights its effectiveness for teaching from extensive documents and its suitability for low VRAM setups as a text-to-speech (TTS) model.
- 3h A Qwen finetune, that feels VERY human
- 8h [Paper on Hummingbird+: low-cost FPGAs for LLM inference] Qwen3-30B-A3B Q4 at 18 t/s token-gen, 24GB, expected $150 mass production cost
- 11h Looking for Small VLM/MLLMs Alternatives to Qwen Series Models
- 1d Qwen Meetup Draft Review Required (Function Calling Harness 2 - CoT Compliance from 9.91% to 100%)
- 1d Poor GPU Club : Tried Bonsai-8B on CPU & CUDA
ChatGPT crashed my browser when I continued 1k+ conversations (news.ycombinator.com)
Have you faced same or is it just me?
What if Claude launched in 1998? (www.reddit.com)
Would you like to use this UI? I'll take it.
- What if ChatGPT launched in 1998 (www.reddit.com)
Claude Chrome with Reddit (www.reddit.com)
Is there a way/workaround to use Claude Chrome Extension for Reddit?
- Issues with Claude in chrome (www.reddit.com)
AOP: Agent-Oriented Programming (en.wikipedia.org via hn)
Agent-oriented programming Agent-oriented programming (AOP) is a programming paradigm where the construction of the software is centered on the concept of software agents. In contrast to object-oriented programming which has objects (provi…
Show HN: Security Scanner for Agent Skills and MCP (github.com via hn)
Snyk Agent Scan Discover and scan agent components on your machine for prompt injections and vulnerabilities (including agents, MCP servers, skills). NEW Read our technical report on the emerging threats of the agent skill eco-system publi…
Show HN: Decentralized compute network. CLI-first (github.com via hn)
Decentralized compute network. CLI-first.
Show HN: Ableton Live MCP (github.com via hn)
Ever wanted to control Ableton with just your voice? Me too!
I know Claude is able to do a lot with Blender models from scratch, but what about models that were made elsewhere? Let's say I download a model of a cafe from a free 3d site.
Free access to Claude code- suggestions for things to try (www.reddit.com)
Hey community, I have free access to claude code to play around with. But don't have any ideas to try out- I have contained environment so can't really use it on my personal computer.
Is 2x5070Ti a good setup? (www.reddit.com)
I'm confused about what to get. I don't want to get something super expensive, but would like to have something that's "good enough" for coding etc.