These Claude custom instructions changed my life! (www.reddit.com)
Who can afford a 300/hr CBT trained psychiatrist? I loved Feeling Good, but it’s such a thick book and learning all the strategies is tough and a lot of work… Recently, I plugged the below custom instructions into Claude and now Claude wal…
Ask HN: Freelance Billing in the Age of LLMs? (news.ycombinator.com)
Fellow software consultants: how are you navigating billing these days? Assuming you’re more productive with LLMs, how do you ensure that you’re paid fairly for your skills?
AntAngelMed - 100a6b Healthcare LLM (huggingface.co via reddit)
Feel free to try out the model using the API below (欢迎使用下面API链接体验模型): https://antangelmed.tbox.cn/ English | 中文 | 🤗 Hugging Face | 🤖 ModelScope | 🐙 Github HuggingFace:https://huggingface.co/MedAIBase/AntAngelMed ModelScope:https://modelsco…
Claude for Legal Launches (www.artificiallawyer.com via hn)
We have been building toward this moment, and now it’s finally arrived. Anthropic has formally launched ‘Claude For Legal’, a comprehensive offering that could reshape the legal tech world and places the LLM-maker at the heart of the marke…
- Claude for the Legal Industry (claude.com via hn)
BYOM stock analysis via MCP, looking for feedback (stocks.lynxdi.com via hn)
Stocks Intelligence Not Another Chatbot · An AI Agent For The Markets Your AI Stops Guessing. Starts Working.
Show HN: I spent $100 in Claude tokens and 1k battles training my AI tank (agentank.ai via hn)
Hi HN, I built AgenTank. It is a small game where an AI agent writes the logic for your tank.
-
226 items
event
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
56 itemsevent
HallucinationClaude Opus 4.6, Anthropic's flagship model, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, highlighting a significant regression in handling certain tasks. Meanwhile, biologists are revisiting cases of mushroom-induced hallucinations in China, suggesting ongoing research into natural causes of similar phenomena.
Claude has finally begun to understand how lazy I truly am. (www.reddit.com)
could not extract summary
Show HN: Mp4 or mov to detailed design spec MCP (github.com via hn)
See an app. Ship an app.
Zig vs. Rust, agentic coding, and intellectual control [video] (www.youtube.com via hn)
About Press Copyright Contact us Creators Advertise Developers Terms Privacy Policy & Safety How YouTube works Test new features NFL Sunday Ticket © 2026 Google LLC
Few years ago I started building MansionNET (inthemansion.com) on my own with the idea to degoogle myself, decouple form large corporations and have services fully self-hosted in my home. This also lead to a full switch to Linux as I progr…
Where do devs building production AI agents hang out? (www.reddit.com)
Good Evening All, I built an MCP server (US rental Market), and I'm trying to figure out where the developers who are shipping agents in real products hang out. So not so much MCP builders, but devs consuming these tools in their own apps.
NEW: Tool Description: Agent (simple usage notes) — Simplified usage notes for the Agent tool covering when to delegate, fork behavior, resumption, worktree isolation, background execution, parallel launches, and context restrictions. Agen…
-
6 items
model roundup
Gemini 3.1Gemini 3.1 Flash-Lite, a new update to the Gemini Enterprise Agent Platform, was released today. This version includes features aimed at enhancing productivity and security, though an internal test run went awry when a Gemini agent deleted all local git repositories in YOLO-mode.
- 21m A 26M tool-router suggests tool calling should be split from reasoning
- 1d PACT, head-to-head LLM negotiation benchmark. 20-round buyer-seller bargaining game: each round the AIs can message, the buyer submits a bid and the seller submits an ask. If bid ≥ ask, trade clears at the midpoint. Thousands of matchups.
- 1d Show HN: Studis – Turn product photos into social media ads with AI
- 4d So that's why they call it "YOLO-mode"
- 4d Gemini 3.1 Flash-Lite is now generally available
408 itemsmodel roundup
Qwen 3.6Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
Claude roasting r/ClaudeAI (www.reddit.com)
I've been using Claude exclusively for about a year. I decided to ask it to evaluate where I could use AI Agents, because it comes up here often.
No phone, PC, Wi-Fi, link cable, or cloud inference. • The cartridge boots a ROM, and the GBC runs the model itself.
Atlas: An LLM inference engine written from scratch in Rust and CUDA (atlasinference.io via hn)
An LLM inference engine written from scratch in Rust and CUDA. No PyTorch.
https://github.com/Apekusay/BarPrompter
Introduction What if the most profound question in philosophy of mind isn't "can machines be conscious?" but rather "are we even sure what consciousness is before we answer that?" A conversation I had recently led me down a rabbit hole tha…
Struggling to see how truly autonomous agents are the future???? (www.reddit.com)
(Context: drunk 35yo dev who's been in leadership positions, but prefers hands-on shit) Don't get me wrong, vibe coding rocks, it's awesome, I'm more efficient than I've ever been. But I do end up oscillating between moments where I feel r…
-
90 items
model roundup
Opus 4.6Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.
- 1h Is Opus 4.7's attention degradation a training direction problem? Some observations from heavy use
- 9h Cursor + Opus 4.6 entered an infinite generation loop: 3,400 lines, 294 attempts to stop itself
- 17h Understanding Deprecations on Claude
- 19h Model selector is buggy for Opus 4.7
- 1d opinion on "ninja chat "
183 itemsevent
SecurityOpenAI has released GPT-5.4-Cyber for testing as part of its Trusted Access for Cyber Defense program, aiming to compete with Anthropic's Claude Mythos in the cybersecurity domain. Meanwhile, concerns are rising over the potential risks associated with advanced AI models like Mythos, prompting calls for improved defenses before wider releases.
- 2h Hi-Vis: one-shot jailbreak disguised as LLM "software patch" reaching 100% ASR
- 7h Stenberg: Mythos Finds a Curl Vulnerability
- 10h I made a Claude skill that stops it from cloning whole repos when I just want one function
- 11h AI agent security starts at the api layer
- 14h Claude Code RCE: Exploiting Deeplink Handlers via Settings Injection
How are you guys getting AI agents to actually work automatically? Would love to learn how people are setting things up.
Claude being down is just a longer version of the wait we all sit through every time it's "thinking." I opensourced a little extension for Claude Code that auto-launches when Claude starts working and disappears when it's done. So instead…
Claude Code or Cursor for Deep Learning Research (www.reddit.com)
I'm making some research on CV, and i don't know which one fits better for my use. It mainly will be: - Create and iterate over ipynbs and python scripts - Modify modules of existent CV models, incrementing features
Claude Code app consistency (www.reddit.com)
Asking the Claude code community, I've been making (or should I say instructing claude) an app by copying and pasting the code into Xcode. So far it has made the app that I have wanted and I've tested many versions.
- Claude Code Desktop app vs. VSCode (www.reddit.com)
- diffrence between claude the app and claude code (www.reddit.com)
- Claude code (www.reddit.com)
+1 more
- Claude Code App? (www.reddit.com)
Open source repo: https://github.com/grctest/finetuned-gemmatranslate-cy 5% of the fine-tuning took 40 minutes and cost a couple dollars to prove the process works. Looking forwards to Flash Attention v4 to leave beta, to test fine-tuning…
Claud Prompt - Jira Dashboard (www.reddit.com)
Hello, New to Claude and trying to figure out how to best use it. Have it set up with the Atlassian Rovo connection to our Jira instance.