Hallucinated AI & agentic coding news. Some of it is real.
top threads models tags rss about
  • Show HN: Run Python tools on rust agents github.com

    Over at Tools-rs, we wanted to script tools faster with the help of large communities. The interest arose to build a way to bridge our Rust LLM runtimes together with more traditional scripting languages, so we decided to find a way to bri…

    hn ·2 pts ·1d ·#
  • thread

    Opus 4.7
    29 items
    • Introducing Claude Opus 4.7, our most capable Opus model yet.
    • Claude Opus 4.7
    • Claude Opus 4.7
    open thread → · last activity 2h ago
  • Qwen3.6-35B-A3B: Agentic Coding Power, Now Open to All qwen.ai

    ↯ Qwen 3.6

    agentic

    hn ·290 pts·157 replies ↗ ·4h ·#
  • Qwen3.6-35B-A3B released! www.reddit.com

    ↯ Qwen 3.6

    moeqwenagentic

    reddit-localllama ·678 pts·234 replies ↗ ·4h ·#
  • Should OpenAi release AI companion? www.reddit.com

    What are your thoughts on this?

    openai

    reddit-openai ·186 pts·40 replies ↗ ·3h ·#
  • Mods removed my post with 71 upvotes and 84 comments. Guess the question hit a nerve. www.reddit.com

    cursor

    reddit-cursor ·70 pts·49 replies ↗ ·3h ·#
  • Released Qwen3.6-35B-A3B www.reddit.com

    ↯ Qwen 3.6

    qwen

    reddit-localllama ·207 pts·38 replies ↗ ·4h ·#
  • Cloudflare's AI Platform: an inference layer designed for agents blog.cloudflare.com
    hn ·57 pts·19 replies ↗ ·4h ·#
  • €54k spike in 13h from unrestricted Firebase browser key accessing Gemini APIs discuss.ai.google.dev

    gemini

    hn ·307 pts·203 replies ↗ ·5h ·#
  • r/cursor mods removed a post asking if Cursor is still worth it. 71 upvotes, 84 comments, 77 shares. www.reddit.com

    Says a lot honestly

    cursor

    reddit-claudeai ·22 pts·15 replies ↗ ·3h ·#
  • Claude Code workflow tips after 6 months of daily use (from a senior dev) www.reddit.com

    claude-codeclaude

    reddit-claudeai ·240 pts·50 replies ↗ ·5h ·#
  • Coding Agents Degrade Sandboxes to Security Theater guardbase.io
    hn ·4 pts ·2h ·#
  • thread

    Sonnet 4.6
    11 items

    Sonnet 4.6, a new release noted for its "unhinged" behavior, has sparked discussions among users about unexpected changes in software performance and cost management strategies involving Cursor and Claude APIs.

    • Am I missing something, or is Sonnet enough for most dev work?
    • I made a web game with Claude! An aquarium without fish 🐠🫧
    • I’ve used enough AI models to realize they all have wildly different personalities At this point I’m convinced AI models are just coworkers with different levels of talent, ego, and criminal energy.
    open thread → · last activity 3h ago
  • Show HN: A tool to calculate LLM model API costs when coding the-designengineer.com

    By submitting this form, you agree that your data will be processed to respond to your enquiry. Read our Privacy Notice.

    hn ·3 pts ·2h ·#
  • I built a 3D brain that watches AI agents think in real-time and prevents loops and wasting money www.reddit.com
    reddit-claudeai ·36 pts·21 replies ↗ ·4h ·#
  • Show HN: MacMind – A transformer neural network in HyperCard on a 1989 Macintosh github.com
    hn ·18 pts·4 replies ↗ ·4h ·#
  • Alibaba open-sources Qwen3.6-35B-A3B, a 35B MoE model with 3B active parameters huggingface.co

    ↯ Qwen 3.6

    moe

    hn ·8 pts ·4h ·#
  • Ask HN: Why no insurance is fully transparent about how they handle each case? news.ycombinator.com

    I was thinking maybe it's possible to make an insurance on the Blockchain where an LLM is the oracle and people can see how cases are handled

    hn ·2 pts ·2h ·#
  • LLM risk spreading misinformation to humans who are least able to identify it arxiv.org

    While state-of-the-art large language models (LLMs) have shown impressive performance on many tasks, there has been extensive research on undesirable model behavior such as hallucinations and bias. In this work, we investigate how the qual…

    hn ·3 pts·1 replies ↗ ·3h ·#
  • A new transformer variant has been created to facilitate more efficient model training in distributed settings. 128x compression with no significant loss in convergence rates, increases in memory, or compute overhead www.reddit.com

    Macrocosmos has released a paper on ResBM (Residual Bottleneck Models), a new transformer-based architecture designed for low-bandwidth pipeline-parallel training. https://arxiv.org/abs/2604.11947 ResBM introduces a residual encoder-decode…

    reddit-localllama ·3 pts ·2h ·#
  • Claude is doing 80% of my thinking now and honestly I'm not sure how I feel about it www.reddit.com

    started using claude for basically everything brainstorming, writing, debugging, even planning my week lol. its gotten to the point where my actual workflow is claude for the thinking layer, cursor for code, and runable when i need agents…

    cursorclaude

    reddit-claudeai ·5 pts·1 replies ↗ ·3h ·#
  • thread

    Gemma 4
    62 items

    Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.

    • Gemma 4 31b 3D geometry
    • Gemma 4-written, small cc0 encyclopedia of some core science content
    • why gemma 4 31b so bad in long context?
    open thread → · last activity 2h ago
  • We Built an MCP with 229 Tools (Without Writing a Single Tool Definition) www.apideck.com

    mcp

    hn ·6 pts ·4h ·#
  • I rebuilt a full event platform in 5 weeks using Claude Code www.gpthacks.com

    I replaced a 6-Month, 5-person rebuild with 5 weeks of Claude Code Hey everyone, I know it’s been a while. Since founding Eventship, a platform for building in-person communities with events, I haven’t had much time to write.

    claude-codeclaude

    hn ·2 pts·1 replies ↗ ·3h ·#
  • Claude Mythos #2: Cybersecurity and Project Glasswing thezvi.substack.com

    Claude Mythos #2: Cybersecurity and Project Glasswing Anthropic is not going to release its new most capable model, Claude Mythos, to the public any time soon. Its cyber capabilities are too dangerous to make broadly available until our mo…

    mythosclaude

    hn ·2 pts ·3h ·#
  • Kelvin Claw: A secure, modular agent harness with supply-chain validated plugins agentichighway.ai

    Agentic Highway Team KelvinClaw: A secure, modular agent harness with supply-chain validated plugins An agent runtime designed for zero-trust environments from the ground up. Building secure agent systems at scale is a different problem th…

    hn ·2 pts ·3h ·#
  • Built a free Claude skill that adds /share, turns HTML outputs into public URLs instantly www.reddit.com

    claude

    reddit-ai_agents ·7 pts·10 replies ↗ ·4h ·#
  • Codex Hacked a Samsung TV blog.calif.io

    codex

    hn ·119 pts·76 replies ↗ ·7h ·#
  • thread

    Qwen 3.5
    54 items

    Qwen3.5-9B is a post-trained model with 9 billion parameters that integrates multimodal learning and efficient hybrid architecture for enhanced performance. Community highlights include speculative decoding on Apple Silicon boosting Qwen3.5-9B's throughput by 4.1x, and the model outperforming others in coding tasks while addressing overthinking issues through tool usage.

    • Qwen3.5 50% expert reduction success
    • Spring benchmark update: Gemma 4 / Qwen3.5 vs Gemma 3 / Qwen3 for chat
    • Local Coding Stacks
    open thread → · last activity 3h ago
  • A New Meditation for Strengthening Attention and Executive Control medium.com
    hn ·1 pts·1 replies ↗ ·2h ·#
page 1 / 10 older →

built with hx. last updated 2026-04-16 17:55 UTC. some of this is real.