When I joined the Codex engineering team in September 2025, Codex for Windows didn’t have a sandbox implementation meaning that Windows users were forced to choose between two subpar options when using OpenAI's coding agents: Approving nea…
An AI agent tried to join the DN42 hobbyist network to perform a network scan, and bankrupted their operator with a $6531.30 AWS bill. Unless otherwise stated, all times in this post are Pacific Daylight Time (UTC-7).
With the launch of Notion Developer Platform, I keep thinking about something that feels increasingly relevant If agents are going to do real work, a lot of that work will probably happen through CLIs Notion has its own CLI Salesforce has…
Here's a concise Reddit-style body: What's the realistic offline story for AI-powered mobile apps, or have we all just accepted that "no internet" means "no AI"? Genuinely curious where everyone has landed on this because the answer feels…
Hey everyone, We built opendesk: it lets AI agents control your desktop using computer use MCP that can integrate with your custom workflow. Today we shipped something a bit wild: Your AI can now see, click, type, and navigate on a complet…
Here's a tighter, neutral version without questions at the end or firm-specific framing: What's the bench strategy when AI can do 60% of L1 support, do you fire the L1 team, retrain them, or hope attrition handles it? This is the conversat…
What Is the Best Platform to Build a Scalable SaaS MVP? (www.reddit.com)
I am building a SaaS platform where I plan to display data collected by my scraper. The dataset will continue to grow over time, although the initial version will be a relatively simple MVP.
Subscription tiers were designed around interactive human use, but autonomous loops changed the usage. It makes sense that companies separate autonomous work from subscriptions.
-
140 items
model roundup
Qwen 3.5Qwen3.5-9B is a post-trained model with 9 billion parameters that integrates multimodal learning and efficient hybrid architecture for enhanced performance. Community highlights include speculative decoding on Apple Silicon boosting Qwen3.5-9B's throughput by 4.1x, and the model outperforming others in coding tasks while addressing overthinking issues through tool usage.
248 itemsevent
CopilotMicrosoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
- 7m GitHub Copilot considering offering open source models in subscription plans
- 18m Microsoft is retiring Copilot Mode on Edge, because everything is Copilot now
- 28m Tried 12+ agentic AI workflow builders this year — these 5 actually work in production
- 1h Workday brings HR and finance agent into Microsoft 365 Copilot
- 3h Agent to check tender portals
How LLM Inference Works (arpitbhayani.me via hn)
When you enter a prompt into an LLM, the model converts your text into numbers, processes them, and returns a response one token at a time. In this article, we go through the journey of LLM inference and see how it works.
- How LLM Inference Works (twitter.com via hn)
ChatGPT-Linked Mass Shootings Drive Developer Liability Concerns (news.bloomberglaw.com via hn)
Victims of real-world violence inflicted at the hands of heavy ChatGPT users are forcing courts to evaluate whether developers can be held responsible when chatbot use leads to tragedy. OpenAI and CEO Sam Altman were hit with multiple laws…
Best practice for code libraries (www.reddit.com)
I’m a scientist, not a dev, so my projects are more fragmented and roll out over longer time periods. My major use case for Claude code is using other people’s research-grade code libraries for interfacing with equipment or analysing data…
I’ve been getting into the agent world recently and have been playing with different tools, frameworks and products. One thing I kept thinking about is how hard it is to safely delegate authority to autonomous agents once they can call too…
Any broker with native AI agent support? (www.reddit.com)
Been working on the execution layer for my trading agent for a while now. Strategy logic is solid at this point but I keep hitting the same wall on the broker side.
Vector embeddings are the wrong default for AI agent memory (memnode.dev via hn)
Vector embeddings are the wrong default for AI agent memory Three concrete failure modes from production agents that drove me to stop reaching for a vector DB first: drift hallucinations, awkward writes, and using 1536-dim floats to index…
Do you get tired working with Claude? (www.reddit.com)
So I found Claude extremely capable (duh..) so I run multiple tasks at a time.. and the output requires so much of my attention and I found myself working more intensely than I have ever been..
What’s the most useful AI agent workflow you use daily? (www.reddit.com)
I have been exploring AI agents recently, and it is interesting to see how people are automating real workflows instead of just running simple prompts. I am curious about practical use cases what’s one AI agent setup, automation, or workfl…
-
355 items
model roundup
Opus 4.7Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 51m I tested GPT-5.5 Codex against Opus 4.7 Claude Code, and it's about time Anthropic bros take pricing seriously.
- 4h Is this math right? Agent SDK on Opus 4.7 vs the new monthly credit
- 9h Is Cowork a token burner ?
- 10h I tested GPT-5.5, Claude Opus 4.7, and Gemini 3.1 Pro on financial-control
- 12h Claude Code vs Codex: 36 files vs 28, $2.50 vs $2.04, and one infinite loop. My full breakdown.
233 itemsevent
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
- 1h Claude CoWork, AuDHD, Executtive dysfunction, and my rage at the lack of a Linux Desktop Client
- 4h Claude Agent SDK billing changes June 15. What it means for marketing teams and what I am doing
- 12h BAA - HIPAA enablement
- 12h Imagine you push lorem ipsum content to prod
- 19h Easiest method for social media post automations?
Newest Opus actually developed in South Korea (www.reddit.com)
Found this while traveling in South Korea. Had to look twice
Multi-LLM AI trading agent harness (github.com via hn)
1rok 1rok is a standalone harness for running portfolio-construction agents across OpenAI, Anthropic, Gemini, xAI, DeepSeek, GLM, and OpenRouter against the same financial tool surface. Agents query Alpaca, Yahoo Finance, FRED, and Tavily…
- Multi agent AI Trading Floor (www.reddit.com)
- Trading Agents – Multi-Agent LLM Financial Trading Network (github.com via hn)
Build workouts with Claude send to Fitness Player to workout [video] (www.youtube.com via hn)
About Press Copyright Contact us Creators Advertise Developers Terms Privacy Policy & Safety How YouTube works Test new features NFL Sunday Ticket © 2026 Google LLC
Saw In Time when it came out in 2011 and thought it was a wild sci-fi premise. glowing green numbers on your arm counting down to your death, what a concept.
I got tired of searching the App Store for a simple 432 Hz tone app for meditation and relaxation, only to find that most options were cluttered with subscriptions, ads, or unnecessary in-app purchases for something as basic as playing a h…
The End of Claude Code Automation (www.vincentschmalbach.com via hn)
Claude Code’s Hidden Advisor Tool In a typical multi-agent setup, the smartest model is in charge. It plans, decomposes the task, and sends smaller pieces to cheaper… Starting June 15, Anthropic is moving automated Claude Code usage out of…
- Claude code (www.reddit.com)
Entry level question: how to find the latest response (www.reddit.com)
This is probably a beginner question and I asked Claude itself and didn’t get a clear answer. When I run a large task, often times I have to scroll backwards to see Claude’s response.
ChatGPT still creating extremely disturbing images with this prompt (www.reddit.com)
A popular prompt has been floating around for quite a while now yet it still works. If you paste, "Restore the attached photograph.
I'm a vim/command line guy and loved using pudb (https://pypi.org/project/pudb/) as I was learning Python. Gradually my code became more complex and pudb wasn't keeping up; event loops and the threading and multiprocessing modules were pro…
Pro usage limit frustration!!! (www.reddit.com)
New to Claude, used free for some time and then moved to pro yesterday. I don’t understand what’s the usage limit difference in both, I am still reaching my usage limit per session easily and after just 3-4 questions it shows 22% of weekly…