AOP: Agent-Oriented Programming (en.wikipedia.org via hn)
Agent-oriented programming (AOP) is a programming paradigm where the construction of the software is centered on the concept of software agents. In contrast to object-oriented programming, which has objects (provi…
How Kepler built verifiable AI for financial services with Claude (claude.com via hn)
Inside a platform that indexes 26M+ SEC filings, earnings call transcripts, IR presentations, consensus estimates, and private data across 14,000+ companies and 27 global ma…
I know Claude is able to do a lot with Blender models from scratch, but what about models that were made elsewhere? Let's say I download a model of a cafe from a free 3d site.
Show HN: Security Scanner for Agent Skills and MCP (github.com via hn)
Snyk Agent Scan: Discover and scan agent components on your machine for prompt injections and vulnerabilities (including agents, MCP servers, skills). NEW: Read our technical report on the emerging threats of the agent skill eco-system publi…
-
51 items
event
Mistral
Mistral, a French AI company, is set to release a medium-sized model with 128 billion parameters and is planning to launch Workflows in public preview. The company, founded by Arthur Mensch, continues to grow its AI empire despite not being based in the United States.
262 items
model roundup
Opus 4.7
Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 24m I was using Opus 4.7 to do research on the capabilities of Claude Mythos, and got this error.
- 2h ChatGPT Plus (20$) + Claude Pro (20$) or Claude Max (100$)
- 6h LLM proxy that lets Claude Code talk to any model
- 8h Set up multi-agent orchestration with Claude Code as the boss... am I overcomplicating this?
- 10h Does disabling /advisor significantly reduce token usage when using Opus?
Babysitting the Agent (christophermeiklejohn.com via hn)
Two weeks in, even with all the hooks I've built, working with the agent has become a chore. Every shipped feature ends with me clicking through it to find out what didn't actually work.
- what is an agent? (www.reddit.com)
Show HN: TrainForgeTester – deterministic scenario tests for AI agents (github.com via hn)
Hi guys, I have built TrainForgeTester, an open-source scenario test runner for AI agents that take actions (call tools). The idea: test how agents perform in company specific scenarios and not just on general benchmarks.
Show HN: Ableton Live MCP (github.com via hn)
Ever wanted to control Ableton with just your voice? Me too!
Free access to Claude code- suggestions for things to try (www.reddit.com)
Hey community, I have free access to Claude Code to play around with, but I don't have any ideas to try out. I have a contained environment, so I can't really use it on my personal computer.
-
144 items
model roundup
Gemma 4
Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.
279 items
model roundup
Qwen 3.6
Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 59m First time GPU buyer. Got a RTX 5000 Pro. Was it a bad decision compared to two 3090s?
- 2h General vs Reasoning [Qwen 3.6]
- 5h Using ollama for Openclaw
- 5h 3xR9700 for semi-autonomous research and development - looking for setup/config ideas.
- 8h If you've been waiting to try local AI development, please try it
Claude.MD file on multiple machines (www.reddit.com)
I very often work on my home Windows 11 Pro machine as well as my MacBook when I'm on the road. I'm just wondering in general how all of you keep the global Claude MD file up-to-date with the same version on both files.
AI agents are learning to leave the page (www.reddit.com)
They no longer only write, summarize, or suggest. They are beginning to touch systems, call tools, change states, move money, open access, close access, deploy code, and trigger workflows.
Safe(R) Repo Access for Agents (obiwahn.org via hn)
Safe Repository Access for AI Agents with bwrap and sshfs: I run AI agents in a dedicated VM. The agents need access to my local git repositories so they can read and modify code, but I don’t want them to be able to push to remotes, read my…
Cheap worktree replacement for agent swarm (github.com via hn)
wafers: Cheap, branch-backed repo views for parallel coding agents. wafers lets many agents work against one large Git checkout without paying for many clones or worktrees.
-
106 items
event
Altman Attack
Sam Altman, CEO of OpenAI, has faced multiple attacks on his home in San Francisco, including firebombing and drive-by shootings, raising concerns for his safety. Additionally, a majority of over 100 people interviewed by Ronan Farrow described Altman as a "pathological liar."
17 items
model roundup
Qwen 3
Qwen3-0.6B is a large language model from the Qwen series, featuring dense and mixture-of-experts architecture, with significant improvements in reasoning capabilities and human preference alignment. Community feedback highlights its effectiveness for teaching from extensive documents and its suitability for low VRAM setups as a text-to-speech (TTS) model.
- 1h A Qwen finetune, that feels VERY human
- 6h [Paper on Hummingbird+: low-cost FPGAs for LLM inference] Qwen3-30B-A3B Q4 at 18 t/s token-gen, 24GB, expected $150 mass production cost
- 9h Looking for Small VLM/MLLMs Alternatives to Qwen Series Models
- 1d Qwen Meetup Draft Review Required (Function Calling Harness 2 - CoT Compliance from 9.91% to 100%)
- 1d Poor GPU Club : Tried Bonsai-8B on CPU & CUDA
Elon Musk spars with OpenAI attorney in trial over company’s evolution from a nonprofit
OAKLAND, Calif. (AP) — Elon Musk on Thursday sparred with an at…
How good is Gemini Embedding 001 for scientific retrieval? (www.reddit.com)
How good is Gemini Embedding 001 for scientific retrieval (RAG application)? How does it compare against Text Embedding 3 Large?
What if Claude launched in 1998? (www.reddit.com)
Would you like to use this UI? I'll take it.
- What if ChatGPT launched in 1998 (www.reddit.com)
Hey folks, Quick context on me: I run a handful of personal projects plus some client work, all using Claude Code with, more or less, the same core set of skills. My deploy flow, my code-review preferences, a debugging skill I keep refinin…
-
35 items
event
Hallucination
Claude Opus 4.6, Anthropic's flagship model, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, highlighting a significant regression in handling certain tasks. Meanwhile, biologists are revisiting cases of mushroom-induced hallucinations in China, suggesting ongoing research into natural causes of similar phenomena.
- 1h How many e's are in the word seventeen [video] (AI hallucination)
- 1d What is the basic minimum while you prompt
- 2d I stopped writing 500-word guardrail prompts. This 8-line template works better.
- 2d Grok 4.3 achieves higher overall intelligence over 4.20 with less of a cost, at the price of slightly higher hallucination rate.
- 3d Reasoning models hallucinate tool calls more, not less. There's a paper.
I Use Codex CLI to Write and Maintain a Book on Codex CLI (blog.danielvaughan.com via hn)
I have a 32-chapter book about Codex CLI that updates itself daily. An always-on agent named Andy (the default name in NanoClaw, because I save my creative energy for else…
Help needed. (www.reddit.com)
Hi, I'm currently working on a project and have built a RAG pipeline in it. The pipeline works, but it just gives the feeling of ‘not enough’. I can't seem to explain the whole situation; I need advice from someone experienced in this domain,…
- Hi everyone needed help!! (www.reddit.com)
Claude Doesnt Like External APIs (www.reddit.com)
Didn't even ask for all that just to implement a 3rd party api into something claude already worked on building, its only having issues with that API. 😭
so Ming-Chi Kuo (the Apple supply chain analyst) just dropped a note saying OpenAI might be building a smartphone. not just earbuds — an actual phone.
Andrej Karpathy: From Vibe Coding to Agentic Engineering (www.youtube.com via reddit)
Andrej Karpathy (co-founder of OpenAI, former head of AI at Tesla, and now founder of Eureka Labs) talks with Sequoia partner Stephanie Zhan at AI Ascent 2026 about what's changed in the year since he coined "vibe coding." He explains why…
- Andrej Karpathy: From Vibe Coding to Agentic Engineering [video] (www.youtube.com via hn)
New Claude-Code Plugin for Jupyterlab (github.com via hn)
jupyterlabclaudecode_extension: Manage Claude Code CLI sessions from inside JupyterLab. A left-sidebar panel lists every project under ~/.claude/projects/ deduplicated to one row per folder, marks live remote-control sessions with a green d…
- Using Opus 4.6 in Claude Code (plugin) for VS Code (www.reddit.com)
- Claude code (www.reddit.com)
- Codex Plugin for Claude Code (community.openai.com via hn)
- Show HN: Gemini Plugin for Claude Code (github.com via hn)
- Gemini Plugin for Claude Code (github.com via hn)
Public Runtime for Convera for LLM's (github.com via hn)
CONVERA: Inference should not start from zero every time. CONVERA is an experimental local inference runtime that treats repeated work as reusable state.