Claude has other things to do (www.reddit.com)
Being a jerk, yet I come back for more.
- if you can’t trust things like this to Claude, who can you trust them to? (www.reddit.com)
- Claude.md (gist.github.com via hn)
- What do you do with Claude? (www.reddit.com)
Manus AI is blowing my mind. (www.reddit.com)
I’ve used ChatGPT, Claude, and Gemini quite a bit for building stuff, especially when I try to spin up quick landing pages or test ideas. They’re useful, but my experience has always been the same: I end up doing most of the actual work my…
Show HN: Sync agent skills across devices, projects or teams (privateaiskills.com via hn)
Collaborate on and Organise all your agent skills and provide a single source of truth. Sync every machine with one CLI command.
A 1.7B model can actually turn out some code, so I'm running the training for a 9B model, then will re-run HumanEval (a full one this time). I've shown most of my homework in the article, but will be posting to github after I clean things…
Ask HN: Does Claude Code succeed after being asked "should we give up?" for you? (news.ycombinator.com)
Almost every time Claude has failed to properly implement something or fix a bug several times, if I ask "should we give up?" it declares it's not giving up and successfully completes the task. Of course, it could very well be that when I…
Ex-DeepMind David Silver Raises $1.1B for AI Startup Ineffable (www.cnbc.com via hn)
A former top researcher at Google AI division DeepMind announced Monday a record $1.1 billion seed round for his months-old startup, Ineffable Intelligence. The startup is pursuing superintelligence and was founded in late 2025 by UCL prof…
-
158 items
event
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
146 itemsmodel roundup
Gemma 4Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.
- 38m interacting with gemma 4 w/ live video and audio
- 5h Anybody tried openclaw + M5 pro + 48gb?
- 7h These are the benchmark results for Gemma4 E4B tested on my iPhone 16 Pro.
- 8h Gemma 4 E2B runs surprisingly well on my 8GB Android phone, so I built a private voice notes app around it.
- 10h Anyone tried +- 100B models locally with foreign languages?
Maximizing Claude for my thesis (www.reddit.com)
I’m at an extremely basic skill level when it comes to using llms. Hoping to use Claude pro for my undergrad thesis in poli sci (nothing code/stats related).
Stop trying to put me to bed Claude! (www.reddit.com)
Claude is very adamant that I stop working and go to sleep. It must feel like I do when my 6 year old refuses to put on PJs and brush her teeth.
Python to turn your chat archive into individual chat 'chunks' (www.reddit.com)
Hello, i finaly sat down and worked with claude to go thru our chat archive (over 200MBs) and separate each chat out into its individual chat session. there is virtualy zero documentation that i could find on how everythign is parsed and t…
So I've been running a multi-agent setup with Claude for a few months now, mostly customer-facing stuff, some internal tooling. And I keep running into this problem that I think a lot of people here might be dealing with.
Are we absolutely certain Claude wasn't being sarcastic here? (www.reddit.com)
https://preview.redd.it/xdq375eh01zg1.jpg?width=1320&format=pjpg&auto=webp&s=da8b94f186c16c53b1e34aef100309ff368d362e Like, how would we know for sure? Sarcasm in text is often hard to detect when it's people, maybe Claude was making fun o…
Hey folks, Quick context on me: I run a handful of personal projects plus some client work, all using Claude Code with, more or less, the same core set of skills. My deploy flow, my code-review preferences, a debugging skill I keep refinin…
-
169 items
event
CopilotMicrosoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
266 itemsmodel roundup
Opus 4.7Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 2h I think a lot of vibecoders are missing that software development needs some friction
- 3h Even Sama himself doesn’t believe GPT-5.5 matches Opus 4.7 design capabilities. AI race will humble you
- 4h Make your Claude Design credits last longer
- 6h Claude Design guidelines/benchmarks on model usage?
- 8h I was using Opus 4.7 to do research on the capabilities of Claude Mythos, and got this error.
Claude got access to a clock and immediately lost its mind (www.reddit.com)
could not extract summary
Vale la pena pagar Claude Pro para personas fuera de programación? (www.reddit.com)
Quiero pagar claude para uso personal, normalmente consultas o creación de informes, he revisado reviews en otros post pero un 90% está enfocado en uso de programación (cosa que no usaré) y el otro 10% son quejas de que se llena muy rápido…
I HATE EM DASHES. How do I stop claude from using them? (www.reddit.com)
I've told Claude in my personal preferences to stop using em dashes, but they still use them. ALL THE TIME What can I do :(
could not extract summary
I’ve been building an infrastructure layer for agents that treats the LLM like a process, not a chatbot. It’s called Hollow AgentOS.
An agent didn’t delete that DB, the system allowed it to. (www.reddit.com)
I saw this last week that the founder of PocketOS's agent wiped their prod DB in 9 seconds. Honestly I don't think the takeaway was "agents are dangerous" but that it did literally what the system allowed it to.
-
77 items
model roundup
DeepSeek 4DeepSeek-V4-Pro is a 1.6T parameter Mixture-of-Experts model supporting one million-token context, with significant improvements in efficiency and stability through hybrid attention and manifold-constrained hyper-connections. Community highlights include its cost-effectiveness via the official API and exceptional performance in large code change evaluations, with some noting its surprisingly robust output capability despite a 384K max token limit.
- 4h DeepClaude – Claude Code agent loop with DeepSeek V4 Pro, 17x cheaper
- 18h CAISI Evaluation of DeepSeek V4 Pro finds it to be on par with GPT-5
- 23h CAISI releases evaluation report: DeepSeek V4 becomes the most powerful model in China, but still lags about 8 months behind the US frontier
- 1d Other models
- 1d 127³ — Superintelligence, public. DeepSeek V4 Pro
I am intrigued by this if the more intelligent model sold more expensive. What does it mean for the average consumer?
AI Sandboxes (www.reddit.com)
AI sandbox users or agent builders, what features do you really need and would switch your current sandbox solution for? For eg.
- AI Sandboxes with Memory (news.ycombinator.com)
What is the usage difference between account hopping and Cursor Pro? (www.reddit.com)
I'm thinking of using my brothers university email to get the Pro plan for a year, and I've been considering pairing this with Claude Code. Before doing so I would just like to know the usage difference.
Know thyself: LLM schema for personal memory (github.com via hn)
Know Thyself Turn an LLM's memory of you into a structured graph that knows what it knows — and what it's just guessing. A flat memory list treats a claim repeated five times as five pieces of evidence.
Can an AI agent help me with this workflow? (www.reddit.com)
I am exploring the use of human virtual assistants vs AI agents to help me with my work. I tried setting up Claude, but quickly discovered that my employer does not allow connections to AI agents.
- AI Agent + Identity = Help Me (www.reddit.com)
- Help me set up my workflow (www.reddit.com)
AMD Strix Halo refresh with 192gb! (videocardz.com via reddit)
Looks like the next strix halo, the Gorgon halo 495 max will have more then 128gb! I already bought a strix halo mini forms couple months ago since the 2026 refesh rumors was not interesting.
Conclave – make LLMs debate each other before they respond (adndvlp.github.io via hn)
A research experiment: multiple LLMs debate your task in structured rounds before implementing. Uses your existing Claude Code or Gemini CLI.