Looking for an AI agent to help me book appointments etc (www.reddit.com)
Hi all, I'm looking for a personal assistant type agent that would be able to book appointments on my behalf, among other things. I am not looking for one specifically targeted towards businesses, as this is for my personal life :) Thanks!
Our team's done cybersecurity for 12 years. We started in web security, and when GenAI apps started shipping, we shifted into LLM security.
Most AI agent safety discussions still focus on the model. Was the prompt safe?
Pinecone Nexus: The Knowledge Engine for Agents (www.pinecone.io via hn)
The Primary User Is Changing Every technological paradigm shift produces a defining data infrastructure category. Relational databases for client-server.
I can't be the only person with a normal Claude. (www.reddit.com)
I keep seeing posts like these: Claude Got Access To A Clock and Immediately Lost Its Mind Claude stopped telling me to go to bed, but there are signs. I swear Anthropic has "go to sleep" in their system prompt ClaudeCode's final words aft…
It doesn't help with pronunciation, but I feel you really need an actual teacher to get the tones down properly anyway.
Been experimenting with different ways to use claude code, so my co-worker and I started using them in a chat room together so we could plan and move to prompting with agreed upon integration contracts faster. Still in the experimental pha…
Stop Treating Agent Sandboxes as Cattle (opencomputer.dev via hn)
A direct response to
-
161 items
event
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
65 itemsmodel roundup
Sonnet 4.6Sonnet 4.6, a new release noted for its "unhinged" behavior, has sparked discussions among users about unexpected changes in software performance and cost management strategies involving Cursor and Claude APIs.
- 2h Improve CC and plugin
- 15h Cheap Claude/Codex/Gemini Models - Pay just 25% of official rates
- 17h Local LLM Benchmark about Backend Generation by Function Calling (GLM vs Qwen vs DeepSeek)
- 2d Claude Sonnet 4.6 model hallucinates
- 2d 1M context beta retired yesterday on Sonnet 4.5 / 4. Here's the actual fix if you missed it.
Plenty of CLI coding agents will talk to a local LLM, but the catch is the ecosystem. Skills, slash commands, MCP servers, plugins, hooks: all the interesting tooling has been built specifically for Claude Code, and parity on every other a…
How to load skills for Claude Code Web routines? (www.reddit.com)
The official docs only has one line "The session can run shell commands, use skills committed to the cloned repository, and call any connectors you include" about how routines can use existing skills. Has anyone figured out how to set this…
- Claude Code Routines (code.claude.com via hn)
- Routines in Claude Code (claude.com via hn)
If you are stuck on a research level math problem, is there a model you might turn to to give you ideas? I have only ever tried chatgpt pro for this.
Claude has other things to do (www.reddit.com)
Being a jerk, yet I come back for more.
- if you can’t trust things like this to Claude, who can you trust them to? (www.reddit.com)
- Claude.md (gist.github.com via hn)
- What do you do with Claude? (www.reddit.com)
Sharing a project I've been building: a full end-to-end wildfire prevention pipeline that runs a Vision-Language Model directly on a satellite, using Sentinel-2 imagery. The interesting design constraint isn't model quality.
Cavemen skill questions (www.reddit.com)
Caveman looks amazing for reducing output tokens! Has anyone tried applying the Caveman skill to a headless, automated backend application?
i was just wondering why anthropic dosnt offer an image gen or video gen, and if thatll ever change? is it becuz they dont hav the money, or becuz they think image generators are unsafe/unethical/a legal minefield?
I am an Indian founder, and I have built an ERP/Founder's OS. The idea was simple, and had a few layers.
-
266 items
model roundup
Opus 4.7Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 2h Claude Opus 4.7 won’t just output prompts—keeps arguing instead
- 6h I think a lot of vibecoders are missing that software development needs some friction
- 7h Even Sama himself doesn’t believe GPT-5.5 matches Opus 4.7 design capabilities. AI race will humble you
- 8h Make your Claude Design credits last longer
- 10h Claude Design guidelines/benchmarks on model usage?
133 itemsmodel roundup
Qwen 3.5Qwen3.5-9B is a post-trained model with 9 billion parameters that integrates multimodal learning and efficient hybrid architecture for enhanced performance. Community highlights include speculative decoding on Apple Silicon boosting Qwen3.5-9B's throughput by 4.1x, and the model outperforming others in coding tasks while addressing overthinking issues through tool usage.
- 2h Mistral Medium 3.5 128B and Qwen 3.5 122B A10B on 4x RTX 3080 20GB
- 1d Updated: RTX6k (Server, 450w) Qwen3.5-122B-A10B (MXFP4_MOE) Benchmarks (llama.cpp)
- 1d Qwen/SAE-Res-Qwen3.5-27B-W80K-L0_100 · Hugging Face
- 1d I cut Codex’s API Usage by 50% using a self modifying system
- 1d Qwen 3.6 wins the benchmarks, but Gemma 4 wins reality. 7 things I learned testing 27B/31B Vision models locally (vLLM / FP8) side by side. Benchmaxing seems real.
Claude Code Visual: hooks, subagents, MCP, CLAUDE.md (www.reddit.com)
Been using Claude Code for a couple of months. Still keep forgetting the MCP hook syntax, so I finally just wrote everything down in one place.
“From tokens to dollars” is starting to feel real (www.reddit.com)
We’re entering a phase where: prompt → content → distribution → monetization is basically one pipeline. Just tried HappyHorse + Qwen, and it’s a glimpse of that future.
Use Cheaper Models with Claude (gist.github.com via hn)
Use Claude Code as your orchestrator while routing the actual LLM work to free or cheap models — perfect for bulk grunt work like content generation, scraping, data analysis, etc. You can use for different usecase like - Generating practic…
The Engineering Constraints of Distributed LLM Inference over the Open Internet (siliconandsoul.substack.com via hn)
A first‑principles reasoning exercise on running LLM inference over the open internet. This post starts from the core engineering constraints—Memory Wall, Sequential Dependency, Bandwidth Abyss, and KV‑cache state—and then walks through ho…
Flagged chat???? (www.reddit.com)
I asked Claude about the Hanta virus that appeared on the cruise ship and how it is transmitted and it started talking about the Andes virus, so I asked it why Andes Virus? Was it reported anywhere that the hanta virus on the ship was inde…
Free AI video generation tool (HappyHorse + Qwen) – early thoughts (www.reddit.com)
Free AI video generation tool (HappyHorse + Qwen) – early thoughts Just tested a combo of HappyHorse + Qwen and it’s surprisingly solid for AI-generated video content. What stood out: fast generation from simple prompts decent storytelling…
I reverse-engineered my tourist bike tour using Claude. (www.reddit.com)
I’m currently in Tokyo, and there are a lot of tourists…literally everywhere. If you’re like me and allergic to your own species (tourists… ugh!) 😅, I might have the solution for you.
Turn AI tokens into Free (Happyhorse &Qwen) (www.reddit.com)
Turn AI tokens into actual $$ (free to try) Been playing around with some AI video tools lately and found something interesting. You can test HappyHorse + Qwen for free here — it basically lets you turn simple prompts into short-form video…
NEED HELP WITH API (www.reddit.com)
I recently built a real-time analysis app using a locally hosted Python script that sends structured data to Claude via the Anthropic API every 15 seconds and displays the results on a local dashboard. The bottleneck I'm running into is AP…
- Need help with an issue (www.reddit.com)
A 1.7B model can actually turn out some code, so I'm running the training for a 9B model, then will re-run HumanEval (a full one this time). I've shown most of my homework in the article, but will be posting to github after I clean things…