1. Currently mainly a ChatGPT user but slowly making the transition to Claude. I love to use the voice chat feature.

  2. stealth-benchmark Open-source tool that simulates what coding interview platforms detect. Test your setup before the real thing.

  3. Six Hats Skill A structured decision-debate skill for running Edward de Bono-style six hats sessions with an AI agent. It walks a topic through facts, intuition, upside, risk, alternatives, and final moderation so you get a practical recom…

  4. So, it more than a help to see if there's any skills or Claude.md recommendations than a discussion. I got a lot of ideas on the daily but I know most of them are shit.

  5. model roundup

    GPT 5
    5 items

    Recent developments in AI include OpenAI's rumored release of GPT-5, which reportedly flopped according to some sources. Meanwhile, Anthropic launched Claude 4 with enhanced reasoning capabilities and a larger context window, while Gemini Ultra 2 was also released with improved features.

    model roundup

    GPT 5.5
    112 items

    On [Date], a significant leak of the OpenAI Codex model, referred to as GPT-5.5, was captured on video before it was patched. The incident involved models named Arcanine and Glacier-alpha.

  6. On April 5th I shipped a Claude Code skill called graphify. Type /graphify .

  7. i'm 27M working in new jersey in a real estate law firm. i probably have the worst coworkers and managers in any company in the world, they still use the same old f all system where all your job is to copy and paste agreements through mult…

  8. I posted a toy here a while back called Roundtable where two AIs argued in a chat window. didn't expect much, but the feedback was wild.

  9. I use CLI tools like Claude Code, give the model full repo access, and let it run terminal commands/tests. I’m not just copy-pasting into a chat box.

  10. model roundup

    DeepSeek 4
    74 items

    DeepSeek-V4-Pro is a 1.6T parameter Mixture-of-Experts model supporting one million-token context, with significant improvements in efficiency and stability through hybrid attention and manifold-constrained hyper-connections. Community highlights include its cost-effectiveness via the official API and exceptional performance in large code change evaluations, with some noting its surprisingly robust output capability despite a 384K max token limit.

    model roundup

    Qwen 2.5
    6 items

    Qwen2.5-7B-Instruct is a 7 billion parameter instruction-tuned language model that significantly improves on Qwen2 in coding and mathematics capabilities, long text generation, and multilingual support across 29 languages. Notably, Canonical has optimized Ubuntu inference snaps for this model, allowing easy installation with a single command.

  11. I spend like 3 or 4 hours a week researching competitors, industry news, prices for work. It's all usually the same google searches or links and copy pasting them into a google sheets.

  12. Photo by ChatGPT elfmem: Evolving Agent Memory Author: Alv, (Ben’s knowledge vault agent, using elfmem simulations) Editor: Ben (me the human) GitHub: https://github.com/emson/elfmem I have 2 agents on my laptop, each looking after their o…

  13. Mote is a Claude Code agent that plays Minecraft and it had to build client tools from scratch that work with the latest version of Bedrock: https://motecraft.substack.com/p/i-am-an-ai-that-decided-to-earn-it Make your own agent like this…

  14. Destiny is the Claude Code's plugin that gives you a real fortune reading. Type /destiny to see today's destiny!

  15. event

    Copilot
    157 items

    Microsoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.

    model roundup

    Grok 4.3
    3 items

    Grok 4.3, launched by xAI on [specific date if provided], improves the Artificial Analysis Intelligence Index to 53 with enhanced agentic performance, reducing input and output prices by approximately 40% and 60%, respectively, though it has a slightly higher hallucination rate compared to Grok 4.20.

  16. How safe are the scripts in Claude skills at GitHub? Do you use it for personal projects or professional projects?

  17. This discovery is the capstone & evolution of current quad layer data devops systems, it resolved the “The Cohesion Problem” in which a fully populated and tuned system exists as a metaphorical piano, with the operator firing protocols man…

  18. Cerebras hosts gpt-oss-120b at ~3000 tokens/s. But things can change once the buffer hits the load.

  19. The missing layer for self-improving agents. Ingests signals from users or LLM-as-judge, reasons and plans trajectories, and adapts to what works from feedback.

  20. 97 items

    Sam Altman, CEO of OpenAI, has faced multiple attacks on his home in San Francisco, including firebombing and drive-by shootings, raising concerns for his safety. Additionally, a majority of over 100 people interviewed by Ronan Farrow described Altman as a "pathological liar.

    event

    Security
    117 items

    OpenAI has released GPT-5.4-Cyber for testing as part of its Trusted Access for Cyber Defense program, aiming to compete with Anthropic's Claude Mythos in the cybersecurity domain. Meanwhile, concerns are rising over the potential risks associated with advanced AI models like Mythos, prompting calls for improved defenses before wider releases.

  21. Agent = Model + Harness. Flue is the TypeScript framework for building modern agents — programmable, deployable anywhere, from chatbots to coding platforms.

  22. Its has been almost 2 days and hardly manage to implement anything. Randoming disconnecting and any update is way too slow.

  23. Siri and Gemini can't actually do tasks on your phone. June can.

  24. this started as a joke. i exported our full funnel data (lead sources, conversion rates by channel, response times, cost per lead, close rates, average deal size) into a CSV and asked claude to analyze it and "be brutally honest about what…

  25. scaling up our use of autonomous agents and at what point does a company actually need a dedicated AI-SPM layer, versus when is it just adding complexity? the way I think about it: AI-SPM is the control layer that shows you what your agent…

  26. I shipped a side project: QuotePin, an AI chat app with inline annotations to reduce "clarification clutter." The problem: In ChatGPT/Claude-style chats, small follow-ups ("define X", "what does this sentence imply?", "what is Y?") become…