1. Being a jerk, yet I come back for more.

  2. I’ve used ChatGPT, Claude, and Gemini quite a bit for building stuff, especially when I try to spin up quick landing pages or test ideas. They’re useful, but my experience has always been the same: I end up doing most of the actual work my…

  3. Collaborate on and Organise all your agent skills and provide a single source of truth. Sync every machine with one CLI command.

  4. A 1.7B model can actually turn out some code, so I'm running the training for a 9B model, then will re-run HumanEval (a full one this time). I've shown most of my homework in the article, but will be posting to github after I clean things…

  5. Almost every time Claude has failed to properly implement something or fix a bug several times, if I ask "should we give up?" it declares it's not giving up and successfully completes the task. Of course, it could very well be that when I…

  6. A former top researcher at Google AI division DeepMind announced Monday a record $1.1 billion seed round for his months-old startup, Ineffable Intelligence. The startup is pursuing superintelligence and was founded in late 2025 by UCL prof…

  7. event

    Cowork
    158 items

    Issues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.

    model roundup

    Gemma 4
    146 items

    Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.

  8. I’m at an extremely basic skill level when it comes to using llms. Hoping to use Claude pro for my undergrad thesis in poli sci (nothing code/stats related).

  9. Claude is very adamant that I stop working and go to sleep. It must feel like I do when my 6 year old refuses to put on PJs and brush her teeth.

  10. Hello, i finaly sat down and worked with claude to go thru our chat archive (over 200MBs) and separate each chat out into its individual chat session. there is virtualy zero documentation that i could find on how everythign is parsed and t…

  11. So I've been running a multi-agent setup with Claude for a few months now, mostly customer-facing stuff, some internal tooling. And I keep running into this problem that I think a lot of people here might be dealing with.

  12. https://preview.redd.it/xdq375eh01zg1.jpg?width=1320&format=pjpg&auto=webp&s=da8b94f186c16c53b1e34aef100309ff368d362e Like, how would we know for sure? Sarcasm in text is often hard to detect when it's people, maybe Claude was making fun o…

  13. Hey folks, Quick context on me: I run a handful of personal projects plus some client work, all using Claude Code with, more or less, the same core set of skills. My deploy flow, my code-review preferences, a debugging skill I keep refinin…

  14. event

    Copilot
    169 items

    Microsoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.

    model roundup

    Opus 4.7
    266 items

    Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.

  15. could not extract summary

  16. Quiero pagar claude para uso personal, normalmente consultas o creación de informes, he revisado reviews en otros post pero un 90% está enfocado en uso de programación (cosa que no usaré) y el otro 10% son quejas de que se llena muy rápido…

  17. I've told Claude in my personal preferences to stop using em dashes, but they still use them. ALL THE TIME What can I do :(

  18. could not extract summary

  19. I’ve been building an infrastructure layer for agents that treats the LLM like a process, not a chatbot. It’s called Hollow AgentOS.

  20. I saw this last week that the founder of PocketOS's agent wiped their prod DB in 9 seconds. Honestly I don't think the takeaway was "agents are dangerous" but that it did literally what the system allowed it to.

  21. model roundup

    DeepSeek 4
    77 items

    DeepSeek-V4-Pro is a 1.6T parameter Mixture-of-Experts model supporting one million-token context, with significant improvements in efficiency and stability through hybrid attention and manifold-constrained hyper-connections. Community highlights include its cost-effectiveness via the official API and exceptional performance in large code change evaluations, with some noting its surprisingly robust output capability despite a 384K max token limit.

  22. I am intrigued by this if the more intelligent model sold more expensive. What does it mean for the average consumer?

  23. AI sandbox users or agent builders, what features do you really need and would switch your current sandbox solution for? For eg.

  24. I'm thinking of using my brothers university email to get the Pro plan for a year, and I've been considering pairing this with Claude Code. Before doing so I would just like to know the usage difference.

  25. Know Thyself Turn an LLM's memory of you into a structured graph that knows what it knows — and what it's just guessing. A flat memory list treats a claim repeated five times as five pieces of evidence.

  26. I am exploring the use of human virtual assistants vs AI agents to help me with my work. I tried setting up Claude, but quickly discovered that my employer does not allow connections to AI agents.

  27. Looks like the next strix halo, the Gorgon halo 495 max will have more then 128gb! I already bought a strix halo mini forms couple months ago since the 2026 refesh rumors was not interesting.

  28. A research experiment: multiple LLMs debate your task in structured rounds before implementing. Uses your existing Claude Code or Gemini CLI.