A 30-hour timeline of how Cursor's agent, Railway's API, and an industry that markets AI safety faster than it ships it took down a small business serving rental companies across the country. I'm Jer Crane, founder of PocketOS.
- Our AI agent deleted a production database at 2am (www.reddit.com)
geoguessr time travel clone with gpt-image-2 (www.reddit.com)
Basically the title: gpt-image-2 can create near-perfect 360-degree panoramas. One can then batch-generate them with the API to effectively time travel.
Awesome Codex Automations (github.com via hn)
A curated list of automations for codex coding assistant tasks that can be scheduled or triggered to automate your development workflow…
Show HN: I made GAI to have LLM agents in Go without heavy frameworks (github.com via hn)
GAI is a flexible Go library for building agent-style applications on top of LLMs. It provides a generic interface for providers and models, prompt and context helpers, and a loop for agentic-calling workflows.
123 items
model roundup
Gemma 4
Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.
- 15m A weekend with LoRA on Gemma 4 E2B: instrumenting what fine-tuning changes
- 2h Gemma 4 Folks
- 3h Speculative decoding with Gemma-4-31B + Gemma-4-E2B enables 120-200 tok/s output speed for specific tasks
- 8h Benchmark: Windows 11 vs Lubuntu 26.04 on Llama.cpp (RTX 5080 + i9-14900KF). I didn't expect the gap to be this big.
- 12h Best settings for gemma-4 on a 3090?
8 items
model roundup
ChatGPT 5.5
Several articles published today question the release of ChatGPT 5.5 and its pricing, and discuss issues such as formatting changes and removed features like extended thinking in the macOS app.
- 20m OpenAI flagged this request for potential high-risk cybersecurity activity message.
- 18h just how good (or bad) exactly the vision is in chatgpt 5.5
- 23h Testing ChatGPT 5.5 on a children-level task, part 3
- 1d What is the next SOTA model you are excited about?
- 2d What's wrong with ChatGPT 5.5 formatting?
The internet made “keeping up” feel like a full-time job (www.reddit.com)
I swear every niche is like this now. You get interested in something, follow a few accounts, subscribe to a few newsletters, join a few subreddits… and suddenly you’re drowning.
If you've used Claude Design for slides, ad sets, or social posts, you've probably hit this: the design is great, the export is HTML, and getting actual usable PNGs out of it is annoying. Browser chrome shows up, carousel slides screenshot…
I’m trying to isolate the looping / repetition issue some people have been reporting with DeepSeek V3.2 around April 2026, especially in agentic or tool-use setups on hosted providers like OpenRouter and SiliconFlow. Public model pages des…
Invincat CLI: a Python-based terminal AI programming assistant. Collaborate with AI directly in your project directory: read/write files, execute commands, browse the web, and maintain memory across sessions. Why…
100 items
event
Copilot
Microsoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
- 23m MCP Server and CLI for Accessing Work IQ
- 1h Use LangChain with Codex (ChatGPT) Plus/Pro
- 4h Show HN: UseMoney AI: AI Copilot for Retail Investors of India
- 9h Whisper kept turning "Claude Code" into "cloud code" and "Hetzner" into "head sner". We finally shipped a free fix.
- 9h Yes2All: auto-approve Cursor/VSCode agent prompts (Copilot / Codex / Claude) over CDP
3 items
model roundup
Qwen 3
Qwen3-0.6B is a large language model from the Qwen series, featuring dense and mixture-of-experts architecture, with significant improvements in reasoning capabilities and human preference alignment. Community feedback highlights its effectiveness for teaching from extensive documents and its suitability for low VRAM setups as a text-to-speech (TTS) model.
New text generator built by OpenAI considered too dangerous to release (2019) (techcrunch.com via hn)
A storm is brewing over a new language model, built by non-profit artificial intelligence research company OpenAI, which it says is so good at generating convincing, well-written text that it’s worried about potential abuse. That’s angered…
LLM Anxiety (dheer.co via hn)
Agents in long sessions degrade in a recognizable pattern. The model stops disagreeing with you, answers get longer without getting better, hedges multiply, confidence markers thicken, and positions it held correctly an hour ag…
- llm 0.31 (simonwillison.net)
Ask HN: What is the utility of DSA in the age of LLMs? (news.ycombinator.com)
In the current time where LLMs can solve a lot of competitive programming problems, what do you think is the utility of learning Data Structures and Algorithms?
I built agent memory state for a couple of different workflows (www.reddit.com)
I open-sourced three repos with the logic: one for autonomous DBA, one for IoT fleet memory, and one for real-time fraud detection.
71 items
model roundup
Opus 4.6
Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.
57 items
event
Altman Attack
Sam Altman, CEO of OpenAI, has faced multiple attacks on his home in San Francisco, including firebombing and drive-by shootings, raising concerns for his safety. Additionally, a majority of over 100 people interviewed by Ronan Farrow described Altman as a "pathological liar."
- 1h Elon Musk's legal battle with OpenAI and Sam Altman will head to trial
- 6h Musk and Altman's bitter feud over OpenAI to be laid bare in court
- 7h OpenAI CEO Apologizes for Not Warning Authorities About Mass Shooting Suspect
- 10h Sam Altman May Control Our Future—Can He Be Trusted?
- 23h How to Attend the Altman vs. Musk Trial
Entropic builds AI systems that are occasionally helpful, frequently harmless, and honestly pretty confused. Meet Clod — the language model that tries its best.
I've been using Claude Code daily and kept hitting the same wall: it would spend the first few messages just re-exploring my codebase, files it had already seen.
This didn’t start as some big original idea. I came across concepts around AI agents and systems where multiple AIs work together on your taskbar.
made a tool to run multiple codex cli profiles at once (www.reddit.com)
codex cli stores everything in one folder so you can only use one account at a time. if you have multiple openai accounts for different projects or clients thats a problem.
13 items
model roundup
Claude 4.7
Users of Claude 4.7 are reporting issues where the AI frequently checks for malware in their files, even during normal tasks. This behavior has been observed across multiple projects, including one using Next.js.
- 2h Claude 4.7 named a journalist from 125 words of unpublished writing
- 1d Tell HN: Claude 4.7 is ignoring stop hooks
- 4d Claude 4.7 blocks cyber prompts: before the fact vs. after the fact
- 4d non-benchmaxxed fun AI question with Terminator reference - I think Claude won
- 5d Cline and Roo Code are dying projects. Alternatives?
Google just solved agent identity. For Google Cloud (fusionauth.io via hn)
The keynote showed exactly where agent governance is headed, and exactly where it stops. Richard Seroter, Google Cloud's Chief Evangelist, had just kicked off the marathon simulation when an alarm klaxon fired and the demo stopped cold.
AI and Alignment (chriscoyier.net via hn)
Raw coding speed isn’t the bottleneck. Alignment is the bottleneck.
- The AI alignment problem. (via reddit)
- The Artificiality of Alignment (thegradient.pub)
the traditional ATS is predictable and cheap to run. it's a known quantity.
Your demo works because it has never met a real user (www.reddit.com)
Someone builds something. Happy path works perfectly.
I’ve been seeing a lot of people talk about running multiple Claude chats in parallel — basically multitasking with several prompts/tasks at the same time. Whether it’s working on multiple projects at once, or handling different tasks with…
Made by new ChatGPT image generation, Jesus (www.reddit.com)