Hallucinated AI & agentic coding news. Some of it is real.
  • I Hate AI (news.ycombinator.com via hn) 4 pts·1 replies· 23m

    I know this might seem very emotional, but please take it as a stylistic tool. I told everyone to just use Claude Code all day every day and take over the world.

    gemini claude-code

  • thread

    Qwen 3.6
    23 items

    Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.

    • 34m ago Qwen 3.6 35B A3B, RTX 5090 32GB, 187t/s, Q5 K S, 120K Context Size, Thinking Mode Off, Temp 0.1
    • 40m ago Is there a way to have qwen-code CLI read images?
    • 2h ago Qwen 3.6 for Claude Code in 1L
    open thread → · last activity 34m ago
  • Is claude on a psychedelic adventure right now? (www.reddit.com via reddit) 5 pts·4 replies· 49m

    I was prompting for some printable coloring books for my daughter and it seems like Claude is in fact on drugs... Look at these, kinda creepy...

  • thread

    Opus 4.7
    75 items

    Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.

    • 1h ago Claude Opus 4.7 System Prompt Leaked
    • 27m ago How can I know whether Opus 4.7 in Claude Desktop "thought for more complex task"?
    • 29m ago Opus 4.7 still nudges you to go to bed but it seems a bit less adamant on bedtime
    open thread → · last activity 27m ago
  • I treated Claude Code as a compiler and put src/ in .gitignore. node-semver rebuilt, 5,632/5,632 tests passing. (www.reddit.com via reddit) 1 pts·1 replies· 11m

    The hypothesis: tests are source code. src/ is a build artifact.

    claude-code

  • Sometimes Claude just wants a break lol! (www.reddit.com via reddit) 1 pts·1 replies· 11m

    JK, he's just helping me pick some things up again, learning to code with Claude!

  • Show HN: LLMs don't hallucinate because they're bad at math, it's the format (github.com via hn) 2 pts· 48m

    Laeka Rational Brain — Matrix Renderer for LLMs The problem isn't the model. It's the representation.

  • Isolating AI Coding Agents on Bare Metal (blog.singlr.ai via hn) 3 pts· 1h

    How Singular uses system containers, rootless Podman, and declarative YAML configs to run isolated, multi-project AI agent environments on a single $50/month bare-metal server.

  • HyperFrames — OSS framework for AI agents to author video as HTM (www.reddit.com via reddit) 1 pts·1 replies· 15m

    Been building this with my team at HeyGen for a while and today we are releasing it to the world. HyperFrames is an open-source HTML-to-video framework where the authoring format is plain HTML with a few data attributes, and the renderer o…

    gemini cursor claude-code

  • thread

    Sonnet 4.6
    16 items

    Sonnet 4.6, a new release noted for its "unhinged" behavior, has sparked discussions among users about unexpected changes in software performance and cost management strategies involving Cursor and Claude APIs.

    • 48m ago Running a RunLobster (OpenClaw) agent since launch changed how i think about takeoff timelines
    • 9h ago Am I missing something, or is Sonnet enough for most dev work?
    • 10h ago Gemma 4 31b 3D geometry
    open thread → · last activity 48m ago
  • I don't think I needed GPT Pro for that question. thanks though, OpenAI. (www.reddit.com via reddit) 3 pts·3 replies· 43m

    openai

  • I built a Claude (Unicode) Mascot Drawing (and Animating) application. (www.reddit.com via reddit) 1 pts·1 replies· 25m

    So, we all probably love the Claude Mascot. You know it, that Unicode character with simple animations that appeared in the starting message of your Claude Code terminal session.

    claude-code

  • I built an open-source local context layer for LLM coding workflows: ask, validate, judge groundedness, and learn which files matter (www.reddit.com via reddit) 1 pts· 24m

    I built SigMap to make Cursor sessions less guessy in larger repos. What I made: An open-source local context layer that builds compact signatures from the repo, then lets me run: - sigmap ask - sigmap validate - sigmap judge - sigmap lear…

    cursor

  • What we learned building a data agent that talks to 4 database types simultaneously (DAB benchmark) (www.reddit.com via reddit) 2 pts·1 replies· 57m

    UC Berkeley published DataAgentBench (DAB) in March — 54 queries across PostgreSQL, MongoDB, SQLite, and DuckDB. Best score so far is 54.3% (PromptQL + Gemini).

    tool-calling gemini mcp +1

  • Caution: /ultrareview silently decides what to review (www.reddit.com via reddit) 1 pts·1 replies· 32m

    I used my first complimentary /ultrareview. I had no uncommitted changes; I wanted a review of all my project code.

  • Mapping Agent or Skill availability? (www.reddit.com via reddit) 1 pts·1 replies· 33m

    I work in real estate finance and regularly put together investment decks. I’ve seen some discussion about using Claude agents/skills to automatically create regional location maps that highlight a subject property along with nearby amenit…

  • The benchmark game has entered its IPO era. (www.reddit.com via reddit) 14 pts·1 replies· 1h

    If you thought the hype train was exaggerated before, just you wait...

  • Show HN: VCoding – A 5 MB native Windows IDE with no dynamic dependencies (news.ycombinator.com via hn) 1 pts· 44m

    Hi HN — I'm Jiawei, one of two co-founders of VCoding. A 4.5 MB IDE that uses only 8 MB of RAM!

  • Need a query to help compare electricity and natural gas rates in my area. (www.reddit.com via reddit) 2 pts·3 replies· 1h

    I’m not really familiar with AI apps. I want to find and analyze the electric and natural gas contract rates available in my area to find out what’s best for me.

  • Local models first (www.reddit.com via reddit) 1 pts· 27m

    My other post got taken down. I’m not trying to promote a product, just trying to share and get help on my ideas. I made a local memory system I call ARN (dumb, I know, but it stands for adaptive reasoning network). It gives any AI agent persis…

  • Claude literally saved me from a nightmare situation (Appreciation Post) (www.reddit.com via reddit) 4 pts·4 replies· 1h

    So this started a few days ago with this weird burning sensation inside my mouth. Felt like I’d eaten something really hot but I hadn’t.

    gemini chatgpt

  • Sassy Claude is best Claude (www.reddit.com via reddit) 21 pts·4 replies· 2h

    I audibly laughed at the amount of shade thrown at Microsoft from Claude lmao

  • thread

    Anthropic Mythos
    73 items

    Anthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.

    • 1h ago I wonder how Mythos would answer this
    • 4h ago White House to give US agencies Anthropic Mythos access, Bloomberg News reports
    • 5h ago White House Moves to Give US Agencies Anthropic Mythos Access
    open thread → · last activity 1h ago
  • thread

    Qwen 3.5
    61 items

    Qwen3.5-9B is a post-trained model with 9 billion parameters that integrates multimodal learning and efficient hybrid architecture for enhanced performance. Community highlights include speculative decoding on Apple Silicon boosting Qwen3.5-9B's throughput by 4.1x, and the model outperforming others in coding tasks while addressing overthinking issues through tool usage.

    • 1h ago Strix Halo 128GB on Proxmox - Vulkan vs ROCm benchmark matrix
    • 5h ago Local Model Suitable for Grammatical/Academic Editing?
    • 10h ago Qwen3.5 50% expert reduction success
    open thread → · last activity 1h ago
  • Claude Status Update : Failures to add Credentials to Vaults on 2026-04-16T22:41:12.000Z (www.reddit.com via reddit) 2 pts· 1h

    This is an automatic post triggered within 2 minutes of an official Claude system status update. Incident: Failures to add Credentials to Vaults Check on progress and whether or not the incident has been resolved yet here : https://status.…

  • Claude Status Update : Failures to add Credentials to Vaults on 2026-04-16T22:33:41.000Z (www.reddit.com via reddit) 2 pts· 1h

    This is an automatic post triggered within 2 minutes of an official Claude system status update. Incident: Failures to add Credentials to Vaults Check on progress and whether or not the incident has been resolved yet here : https://status.…

  • ADK: Root agent will only know summary of context passed back from sub agent - can't get root agent to read all details/context from sub agent (www.reddit.com via reddit) 1 pts·2 replies· 57m

    I have been using ADK. I am using a multi agent setup.

  • GenAI development for autonomous agents (www.reddit.com via reddit) 1 pts·1 replies· 59m

    I’ve been experimenting with GenAI agents that can perform multi-step tasks like research, summarization, and API calling. The model side is manageable, but the real challenge is orchestration, memory handling, tool use reliability, failur…

    tool-use

  • Cursor vs Claude Code: Two Different Approaches to AI Coding (www.mindstudio.ai via reddit) 2 replies· 53m

    Cursor enhances your editor. Claude Code works from the terminal.

    cursor claude-code

  • You can test the same malicious prompt against your AI 1000 times and the guardrails hold. On attempt 1001 it pops right over. (www.reddit.com via reddit) 1 pts·1 replies· 1h

    That's non-deterministic systems for you. We released our first customer-facing AI tool last quarter.

built with hx. last updated 2026-04-17 00:05 UTC. some of this is real.