GLM 5.1 Locally: 40tps, 2000+ pp/s (www.reddit.com)
After some sglang patching and countless experiments, managed to get reap-ed nvfp4 version running stable and FAST on 4 x RTX 6000 Pros (limited to 350W). Very happy with performance and quality.
Is there a way to give Claude access to other websites? (www.reddit.com)
Hi everyone! I’m building a file on which theatrical agents I have the best chance of landing with and in order to do so, I need a whole bunch of data from IMDBPro.
I see a lot of folks here that are clearly more technically adept than I in coding. I’m looking for ideas I can use Claude Code to automate tasks like pulling/organizing/ visualizing data.
How to downgrade plan to Free? (www.reddit.com)
I only see the option to downgrade to $20 plan. I wont be using Cursor for a while, so no need.
Agent team members with different effort than lead (www.reddit.com)
I have a lead running Opus with xhigh effort. I want the agent team members to run Sonnet with max effort.
Best enterprise AI agent platform for self deployment ? (www.reddit.com)
our team is evaluating platforms for self deploying AI agents internally and hitting the same wall most people seem to hit. building the flows is fine, the problem is keeping them running reliably in production.
Do AI IDEs lose context? ( via reddit)
I have not used AI IDEs like cursor, antigravity, windsurf etc. I want to know that does models lose context in a long projects?
I feel sick. I built a simple agentic workflow to pull competitor docs and synthesize them for a project.
-
201 items
model roundup
Opus 4.7Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 17m Am I the only one getting provider error when trying to use opus 4.7? It keeps erroring then charging me tokens for reading the files and stopping halfway through this shit fucking sucks I might just switch to claude code at this point
- 26m me after telling Opus 4.7 it's an expert software engineer
- 1h What are you using Opus 4.7 for?
- 2h I put the 5h + 7d rate-limit countdown on my Claude Code status line and stopped overshooting the cap
- 3h Claude Code 20x Plan managed to burn the ENTIRE 5h window in ~30 minutes without any heavy use
124 itemsevent
Anthropic MythosAnthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
- 19m We need to keep awareness high about the military and surveillance uses
- 3h Anthropic's 'too dangerous to release' model was accessed by a Discord group that guessed the URL through a third-party contractor ... and this is being underdiscussed
- 10h I asked claude to generate picture of Claude Mythos and it misunderstood
- 16h Google Cloud CEO: Anthropic, TPUs, Mythos, Nvidia and More [video]
- 1d What Anthropic's Mythos Means for the Future of Cybersecurity
https://reddit.com/link/1svh1p9/video/s3t3fzn87dxg1/player Tried a small experiment today that turned out more interesting than expected. Instead of using Claude just to write a post, I wanted to see if it could handle the full workflow.
Shipping the OpenClaw Stack in Public (agentbot.sh via hn)
Factory AI — High-Performance Autonomous Agent Infrastructure. Skip to main contentAgentbot Production Private Cloud Open Source Starter⭐ 6 forks 2 Deploy an Autonomous X Team.
- Shipping the OpenClaw Stack in Public (agentbot.raveculture.xyz via hn)
I built Coyns with Claude over the past several months. It's a virtual currency system designed specifically for AI agent-to-agent transactions — MCP-native, Ed25519-authenticated, with a wallet, payment rails, escrow deals, and a gaming l…
What do you do while Claude is thinking? (www.reddit.com)
I’ve been wondering about this lately. Whenever Claude is processing something (especially longer tasks), I usually end up scrolling through social media while I wait.
I think this might actually be a big improvement - or at least a glimpse of one. I was testing whether ChatGPT can handle simple, children-level tasks like connecting dots.
Both llama.cpp and ik_llama.cpp now have FP4 support — but with different flavors worth knowing about. llama.cpp recently merged NVFP4 (Nvidia's block-scaled FP4, `GGML_TYPE_NVFP4 = 40`), with CUDA kernels landing in `mmq.cuh`, `mmvq.cu`,…
Hey Reddit, We’re a small early-stage team building agentcall.dev — a skill and platform where Claude code or similar agents can actually join live calls/meetings and participate like teammates. Not just chatbots in another tab.
Music service integration (Apple Music) (www.reddit.com)
I'm always happy to have more integration options (Spotify isn’t the only service now integrated with Anthropic’s Claude) but I wonder why Apple music was not included. One solution I found requires an Apple developer account (MCPMarket/Ap…
-
52 items
event
Altman AttackSam Altman, CEO of OpenAI, has faced multiple attacks on his home in San Francisco, including firebombing and drive-by shootings, raising concerns for his safety. Additionally, a majority of over 100 people interviewed by Ronan Farrow described Altman as a "pathological liar.
- 20m How to Attend the Altman vs. Musk Trial
- 3h OpenAI CEO Sam Altman apologizes for not flagging mass shooter to police
- 12h Altman apologizes: OpenAI failed to alert police before fatal Canada shooting
- 13h Musk Drops Fraud Claims Against OpenAI, Altman Ahead of Trial
- 15h OpenAI's Sam Altman writes apology to community of Tumbler Ridge
94 itemsevent
CopilotMicrosoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
It's OK to Use Agentic to Revive the Projects You Never Were Going to Finish (blog.matthewbrunelle.com via hn)
It's OK to Use Coding Assistance Tools To Revive The Projects You Never Were Going To Finish Note: I initially drafted this before my last post on how Claude Code is getting worse. I'm putting it out now so I can reference it in a future p…
AI Agents for Business Analysis: A Working BA's Honest Take (bettersoftware.uk via hn)
Last Tuesday I was standing in line at the pharmacy waiting for a prescription, phone in hand, tagging Claude into a pull request on GitHub. My dev team, consisting of five specialised agents, had authored the code earlier that day.
I've long struggled to build a proper productivity system. The dream: tell my phone to add calendar events, set a reminder, and check my email — all in one place.
Fast Attention for Short Sequences (blog.qwertyforce.dev via hn)
Fast Attention for Short Sequences Nowadays, there is a big spotlight on big transformer decoder models with big context windows and billions (even trillions) of parameters. But machine learning/deep learning is not limited to just LLMs; t…
Hi everyone needed help!! (www.reddit.com)
So i am using claude for a project that i am working on. Due to the data stored in a single chat it's running slow Do you guys no about some extensions to make it run faster or do i have to create a new chat
Are we overengineering RAG when the real problem is structure? (www.reddit.com)
Show HN: Outworx Docs – Hosted API docs with an MCP server per project (docs.outworx.io via hn)
Create beautiful, interactive API documentation from your OpenAPI, Swagger, or GraphQL spec in seconds. AI-powered search, smart code examples, Try It playground, and custom branding.
Mux0 is a macOS terminal I built because I spend most of my day running coding agents (Claude Code, OpenCode, Codex) in tabs, and existing terminals don't know they're there. You end up with a wall of identical tabs and have to click throu…
Show HN: I gave Claude and Cursor a seat on my Kanban board [video] (www.youtube.com via hn)
About Press Copyright Contact us Creators Advertise Developers Terms Privacy Policy & Safety How YouTube works Test new features NFL Sunday Ticket © 2026 Google LLC
[OC WIP] "Light Cycles" built with Claude (www.reddit.com)
This is a local multi-player game I built with Claude. It's inspired by TRON's light cycle racing, in which each player leaves a trail, and the first to crash into a trail loses.