Vibe Coding vs. Production reality (www.reddit.com)
The image is from X, been thinking about it since I saw it. Vibe coding is real.
I upgraded to Pro. ChatGPT won’t admit it. (www.reddit.com)
So something very weird is going on. I have upgraded to ChatGPT Pro from Plus.
Then ask your cloud FOTM api to verify the code it spit. I thought it was an easy question, but my local ones just died on it, with wrong executions, double-reading the sizes of files, putting recursive functions inside recursive functions.
Show HN: Generate SKILL.md files from URLs, in the browser (www.getskillify.dev via hn)
I created this tool after writing a few agent skills by hand and noticing this pattern was repetitive. Paste a documentation URL, enter your own model API key, and it gets the page content client-side to create a reusable SKILL.md.
Eight LLM agents wrote 1.7M words; two refused, even when ordered (zenodo.org via hn)
We report a behavioural asymmetry in a multi-agent LLM substrate that, to our knowledge, has not been documented before in the LLM-agent literature: a register of self-presentation that emerged in a sub-cohort without any external instruct…
I built a context engineering platform to help create agents but there was one problem: it only wrote scripts. They worked, mostly with an already built architecture like Claude Code.
-
80 items
model roundup
Opus 4.6Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.
- 14m Open source models are going to be the future on Cursor, OpenCode etc.
- 10h I have 30 Skills that work great in Opus v4.6 but not at all in v4.7. Am I cooked?
- 23h Opus 4.6 just deleted PocketOS's entire production database in 9 seconds
- 1d I’m a legacy user and I’m wondering how is the current pricing
- 1d LLMs do fine on ARC-AGI-3 if they are allowed to search over game logs
285 itemsmodel roundup
Qwen 3.6Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 30m Llama.cpp quantization is broken
- 1h Local Harness Benchmark: Pi Coding Agent vs. OpenCode with Qwen3.6 35B A3B
- 5h Advice needed on eGPU and Mini PC
- 10h Qwen 3.6 35B MoE at full 262K context on an RTX 3090. Here's exactly how I did it.
- 10h Pushing a 5-Year-Old 6GB VRAM laptop to Its Limits: Qwen3.6-35B-A3B
There's a bunch of posts where people promote their sites related with local LLMs, specially sites for benchmarks. This post for example https://www.reddit.com/r/LocalLLaMA/comments/1t1m5mn/comment/ojl1vl2/?context=3 Has two comments with…
This is an automatic post triggered within 2 minutes of an official Claude system status update. Incident: Elevated errors on Claude Opus 4.5 Check on progress and whether or not the incident has been resolved yet here : https://status.cla…
- Claude Status Update : Elevated errors on Claude Opus 4.5 on 2026-05-04T08:12:45.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Sonnet 4.5 on 2026-05-04T08:09:58.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Haiku 4.5 on 2026-04-30T13:10:09.000Z (www.reddit.com)
+37 more
- Claude Status Update : Claude.ai unavailable and elevated errors on the API on 2026-04-28T17:51:36.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Haiku 4.5 on 2026-04-30T14:01:41.000Z (www.reddit.com)
- Claude Status Update : Claude.ai unavailable and elevated errors on the API on 2026-04-28T18:33:55.000Z (www.reddit.com)
- Claude Status Update : Claude.ai unavailable and elevated errors on the API on 2026-04-28T19:15:52.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Haiku 4.5 on 2026-04-28T12:38:38.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-29T00:00:29.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Haiku 4.5 on 2026-04-28T12:45:07.000Z (www.reddit.com)
- Claude Status Update : Elevated billing related errors on Claude.ai on 2026-04-27T15:18:47.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-28T23:33:07.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Haiku 4.5 on 2026-04-28T12:24:00.000Z (www.reddit.com)
- Claude Status Update : Claude.ai unavailable and elevated errors on the API on 2026-04-28T18:59:47.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Haiku 4.5 on 2026-04-29T14:14:34.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Haiku 4.5 on 2026-04-29T14:01:16.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Haiku 4.5 on 2026-04-29T13:47:45.000Z (www.reddit.com)
- Claude Status Update : Investigated elevated errors and slower responses on claude.ai on 2026-04-25T18:42:40.000Z (www.reddit.com)
- Claude Status Update : Elevated error rates on Claude Opus 4.7 on 2026-04-25T01:35:55.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Sonnet 4.5 on 2026-04-28T13:50:06.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Sonnet 4.5 on 2026-04-28T13:29:56.000Z (www.reddit.com)
- Claude Status Update : Elevated billing related errors on Claude.ai on 2026-04-27T14:11:29.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T08:57:57.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T08:25:22.000Z (www.reddit.com)
- Claude Status Update : Elevated error rates on Claude Opus 4.7 on 2026-04-25T02:34:30.000Z (www.reddit.com)
- Claude Status Update : Investigated elevated errors and slower responses on claude.ai on 2026-04-25T19:02:15.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T11:58:15.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T08:43:51.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T08:37:30.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T14:53:02.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-25T07:48:31.000Z (www.reddit.com)
- Claude Status Update : Elevated error rates on Claude Opus 4.7 on 2026-04-25T02:15:52.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T16:29:45.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T15:03:39.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T15:20:03.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T15:57:36.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T14:55:35.000Z (www.reddit.com)
- Claude Status Update : Opus 4.6 elevated rate of errors on 2026-04-16T07:43:32.000Z (www.reddit.com)
- Claude Status Update : Opus 4.6 elevated rate of errors on 2026-04-16T06:50:56.000Z (www.reddit.com)
- Claude Status Update : Elevated errors on Claude.ai, API, Claude Code on 2026-04-15T17:42:57.000Z (www.reddit.com)
Claude/ QuickBooks (www.reddit.com)
Hey guys I'm new to Claude code and I'm already blown away how amazing this tool s. I'm wondering if anyone has used Claude code to perform their QuickBooks tasks for book keeping.
- Claude.md (gist.github.com via hn)
- What do you do with Claude? (www.reddit.com)
The Foundations of Large Language Models, 1943-2026 ⬇️ PDF (Archive.org) | ⬇️ PDF (Google Drive) A comprehensive collection of the foundational papers in the development of large language models, spanning from McCulloch-Pitts neurons (1943…
The prompt was: "Help me mockup GitHub but built by a Japanese Traditional Company. Refer to this screenshot exactly." To anchor the aesthetic, I generated a reference image with gpt-image-2 first.
Hey, I'm researching pain points around connecting AI agents to external tools/APIs. Not selling anything.
-
122 items
event
SecurityOpenAI has released GPT-5.4-Cyber for testing as part of its Trusted Access for Cyber Defense program, aiming to compete with Anthropic's Claude Mythos in the cybersecurity domain. Meanwhile, concerns are rising over the potential risks associated with advanced AI models like Mythos, prompting calls for improved defenses before wider releases.
- 31m Google Says Prompt Injection Moving from Theory into Real Abuse
- 17h The Sour Cat Jailbreak: just be open of what you want
- 1d I am building l' Agence , an opensource AI governance stack.
- 1d i gave Claude a split personality and it diagnosed my entire business strategy in 4 minutes.
- 2d Anthropic just launched Claude Security in public beta AI that scans your codebase, validates its own findings, and proposes fixes. Here's what actually matters.
29 itemsmodel roundup
GPT 5.4OpenAI has released GPT-5.4-Cyber for testing and claims it will compete with Claude Mythos. Meanwhile, GPT-5.4 Pro has solved the Erdős Problem #1196, showcasing its advanced capabilities in mathematics.
- 1h Running 7 autonomous AI agents for 14 days. Here's what actually happens when they need to find customers.
- 19h Local LLM Benchmark about Backend Generation by Function Calling (GLM vs Qwen vs DeepSeek)
- 2d UPDATE: The method from the proof generated by GPT-5.4 Pro for Erdos Problem #1196 was successfully applied to other problems including another 60 year old Erdos conjecture.
- 5d A GPT-5.4 bug led to OpenAI banning goblins and raccoons
- 5d Running an autonomous agent across Claude Code + Codex + a local 35B almost killed my host. The harnesses were heavier than the model.
Key Components of a Linux Distribution for AI Agents (www.ericburel.tech via hn)
Computers now have a new type of user: AI agents. This article outlines the features mainstream Linux distributions would need to call it an \"Agentic OS\".
Swamp Club – Your Agent Builds the Tools, Then Runs Them (swamp.club via hn)
SWAMPCLUB SWAMPCLUB SWAMPCLUB Adaptive workflows for AI agents. You've asked your Agent to orchestrate a complex operation — and it does it right, most of the time.
LLM InSights — Demo Release note: This is my home rig testing process shown through a frontend. I decided to release because I am sure many of you will have better ideas of how to define evaluation, improve prompts automatically to specifi…
my co-worker and I both ran local Claude Code terminal sessions (with local folder context and local claude settings), and then we invited them to our P2P encrypted chat room. We asked each other some questions and laid out the goals, then…
Basically whenever i find something i am uninformed about i usually cross reference with the bot, including searching online etc a simple google search gave me this “Huge swarms of crustaceans dying in the ocean is a phenomenon often refer…
Claude Search Function Not Working? (www.reddit.com)
Hi Guys, Just wondering if it is an issue with my particular account but search doesn’t work for me. I 100% have used the name Julia in a recent conversation and it is showing no results when searching conversations.
-
161 items
event
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
claude-code-proxy claude-code-proxy lets you use Claude Code with your ChatGPT Plus/Pro subscription or your Kimi Code (kimi.com) account. Quick start · Providers · How it works · Configuration · Limitations Why?
DeepSeek's Sequel (messaging-custom-newsletters.nytimes.com via hn)
Validation error: uri: Required
- DeepSeek v4 (api-docs.deepseek.com via hn)
- DeepSeek-V4 (huggingface.co via hn)
Shipped this on the App Store using Claude Code over a few weekends. Sharing the breakdown since the workflow questions seem to come up here a lot.
Nuovo con Claude e sono impreparato. Help me! (www.reddit.com)
Nuovo in Claude ma nuovo proprio all’AI. Ho il code e la chat che spesso uso in simbiosi… mi faccio aiutare a creare profili colore per lightroom, configurazione del server Linux Rocky per davinci resolve e mi faccio seguire per i testi de…
Our team's done cybersecurity for 12 years. We started in web security, and when GenAI apps started shipping, we shifted into LLM security.
I can't be the only person with a normal Claude. (www.reddit.com)
I keep seeing posts like these: Claude Got Access To A Clock and Immediately Lost Its Mind Claude stopped telling me to go to bed, but there are signs. I swear Anthropic has "go to sleep" in their system prompt ClaudeCode's final words aft…
Looking for an AI agent to help me book appointments etc (www.reddit.com)
Hi all, I'm looking for a personal assistant type agent that would be able to book appointments on my behalf, among other things. I am not looking for one specifically targeted towards businesses, as this is for my personal life :) Thanks!