Cracking Jane Street LLMs (github.com via hn)
Jane Street Dormant LLM Puzzle — Findings Disclaimer: This repo is a dump of research code, experiments, and notes accumulated over a month and a half of investigation. It is not clean or well-organized — scripts may have hardcoded paths,…
Slowing Down My Coding Agents to Get More Done (www.robw.fyi via hn)
Your backlog is sitting idle waiting to get worked on, and it could be showing up pre-baked, ready for verification. openclaw polls my Linear backlog every 30 minutes for tickets tagged openclaw .
Hi HN, I'm Jonathan. My co-founder, Thomas, and I started building Mistle in Feb.
Altman and Musk put AI trust on trial (www.axios.com via hn)
Sam Altman testifies in Elon Musk OpenAI Microsoft trial Manage your tracker preferences We use cookies and similar tracking technologies to remember preferences, analyze traffic, and deliver ads. Using some kinds of trackers (like cross-s…
- Musk vs. Altman: The CEO on Trial (precognewsnow.substack.com via hn)
- How to Attend the Altman vs. Musk Trial (news.ycombinator.com)
- How to Attend the Altman vs. Musk Trial (news.ycombinator.com)
Remote IDE – Code and Deploy from iPad with Claude Code (remote-ide.com via hn)
Edit files, track Git changes, run commands and use your AI coding agent. A real dev environment, built for iPad.
I've noticed all 3D AI generators create monlithic blobs that are impossible to edit. So, alongwith a friend, I built this project where you can generate 3D objects with separate, editable parts.
-
239 items
event
CopilotMicrosoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
- 4m GitHub Copilot: Preparing for your move to usage-based billing
- 41m I tracked every dollar I spent on AI coding tools for 60 days and math is uglier than I thought but probably not in the way you'd guess.
- 2h RCE in VSCode Copilot Chat
- 2h How to safe token expenses GitHub Copilot
- 4h Show HN: AgentKanban for VS Code – A task board with agent harness integration
4 itemsmodel roundup
Haiku 4.5Several users are considering switching from Anthropic's Claude to Chinese AI alternatives like Haiku 4.5 due to cost and usage limitations in Claude Max, with some citing Haiku as offering similar capabilities at a lower price.
llama.cpp docker images to run MTP models (www.reddit.com)
This is follow up from previous post: https://www.reddit.com/r/LocalLLaMA/comments/1t5ageq/ There have been many improvements to the MTP pull request and the llama.cpp main branch, such as image support and various bug fixes. I recently ma…
Data Analysis Agent (www.reddit.com)
Hi everyone, Hopefully this is the correct subreddit for this post... I’m a beginner trying to learn how to build AI agents with Claude, and I’m looking for helpful resources, tutorials, examples, or advice.
- Meta Analysis Agent? (www.reddit.com)
Most of the agentic coding content I read is written by and for people building web applications and consumer software. which makes sense because that is where most software is built and where most developers work.
Promptcellar for Claude Code Capture every prompt. Own the signal.
-
178 items
model roundup
Gemma 4Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.
419 itemsmodel roundup
Qwen 3.6Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 1h qwen3.6 just stops
- 2h Can I improve performance for qwen 3.6 27b?
- 2h Meet Mindflow, the free local mindmap with local AI dev by some quantitized models :P
- 5h Building the QWEN3.6 - Codex Bridge Furthe + Kindergarten Harness Reality Check
- 11h I've seen a lot of folks ask "can local LLMs actually do anything useful?"
Show HN: Headless Cloud Security – Headless SaaS has come to security (www.sysdig.com via hn)
The cloud security company I work for, Sysdig, launched “Headless Cloud Security” last week. The short version: as attacks get faster and more automated, security tooling is going to need to evolve beyond dashboards and humans clicking thr…
This is an automatic post triggered within 2 minutes of an official Claude system status update. Incident: Claude.ai is experiencing elevated error rates Check on progress and whether or not the incident has been resolved yet here : https:…
- Claude Status Update : Claude.ai is experiencing elevated error rates on 2026-05-13T12:21:57.000Z (www.reddit.com)
- Claude Status Update : Claude.ai is experiencing elevated error rates on 2026-05-13T12:59:41.000Z (www.reddit.com)
- Claude Status Update : Claude.ai is experiencing elevated error rates on 2026-05-13T11:48:29.000Z (www.reddit.com)
+9 more
- Claude.ai is experiencing elevated error rates (status.claude.com via hn)
- Claude Status Update : Claude.ai is experiencing elevated error rates on 2026-05-13T11:57:52.000Z (www.reddit.com)
- Claude Status Update : Claude.ai is experiencing elevated error rates on 2026-05-13T11:25:07.000Z (www.reddit.com)
- Claude Status Update : Elevated Error Rate for Vaults and Credentials on 2026-05-12T18:57:59.000Z (www.reddit.com)
- Claude Status Update : Elevated Error Rate for Vaults and Credentials on 2026-05-12T18:51:08.000Z (www.reddit.com)
- Claude Status Update : Elevated Error Rate for Vaults and Credentials on 2026-05-12T18:47:02.000Z (www.reddit.com)
- Claude Status Update : Elevated error rates on Claude Opus 4.7 on 2026-04-25T01:35:55.000Z (www.reddit.com)
- Claude Status Update : Elevated error rates on Claude Opus 4.7 on 2026-04-25T02:34:30.000Z (www.reddit.com)
- Claude Status Update : Elevated error rates on Claude Opus 4.7 on 2026-04-25T02:15:52.000Z (www.reddit.com)
California Mayor Resigns, Admitting to Being an Agent for China (twitter.com via hn)
could not extract summary
- California Mayor Resigns, Admitting to Being an Agent for China (time.com via hn)
Files to Analyze Bug - Tearing my hair out (www.reddit.com)
I've been periodically trying to fix this issue for weeks where the bottom status bar has a "29 files to analyze" error message with a spinning wheel that never progresses. I've tried to ask the agent in Cursor how to deal with, consulted…
Show HN: Vim file browser that runs in separate terminal (github.com via hn)
Nowadays I spend a lot of time in Claude Code and reviewing diffs and code in Vim. I didn't want to learn Vim's window management, so I created a Vim file browser that can run in its own tmux pane.
Any recommendations on Books/Guides/Courses for Claude? (www.reddit.com)
I'm just looking to expand my knowledge and my thinking on it. Is there something you have found that helps you learn the tech?
-
92 items
model roundup
Opus 4.6Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.
- 58m Ask HN: What is better Opus 4.6 High or Opus 4.7 Medium?
- 7h Questions are my main gripe these days
- 13h Is Opus 4.7's attention degradation a training direction problem? Some observations from heavy use
- 21h Cursor + Opus 4.6 entered an infinite generation loop: 3,400 lines, 294 attempts to stop itself
- 1d Understanding Deprecations on Claude
108 itemsmodel roundup
DeepSeek 4DeepSeek-V4-Pro is a 1.6T parameter Mixture-of-Experts model supporting one million-token context, with significant improvements in efficiency and stability through hybrid attention and manifold-constrained hyper-connections. Community highlights include its cost-effectiveness via the official API and exceptional performance in large code change evaluations, with some noting its surprisingly robust output capability despite a 384K max token limit.
https://preview.redd.it/r1leyp4g2x0h1.png?width=1078&format=png&auto=webp&s=adca2d7a39b77c859665d5281818b84010bb501f Repo: https://github.com/captkernel/Skills_Curator Install: npx skills add captkernel/Skills_Curator Huge catalog, no memo…
A fully autonomous browser runtime for any AI agent (www.reddit.com)
Built an open source, fully autonomous browser runtime for agents. One critical issue I faced (I guess most of us do) is the inability to have a robust web search feature and this will help you direct towards that goal I hope.
- A fully autonomous browser runtime for any AI agents (github.com via reddit)
Hey r/ClaudeAI — I use Claude Code a lot, and I noticed I was wasting a surprising amount of my usage limit on stuff that was basically just reading. Big files, long diffs, Jira/Linear tickets with comment history, docs pages, repo spelunk…
built Linkd in 30 mins w/ Claude Code (www.reddit.com)
Just built this game last night with claude. I find claude code performs best when you provide a scaffold of the project you are pursuing, followed up by a refrence site or app for the UI.
- Claude code is built for….hackers? (www.reddit.com)
Built something I kept wishing existed: ccs-diagnose. Here's the problem it solves.
Show HN: Rotunda - A browser built for agents with simulated typing (github.com via hn)
Hi HN! Pierce here.