How OpenAI Kills Oracle (www.wheresyoured.at via hn)
Soundtrack — Brass Against — Karma Police It was January 21, 2025. Per The Information, Larry Ellison, CEO of Oracle, had just flown to Washington DC from Florida, and had to borrow a coat “...so he wouldn’t freeze during an interview he…
I built a hiring platform that watches engineers work in a real CAD tool (news.ycombinator.com)
https://ai-eval-lab.janardan.xyz/ https://www.janardan.xyz/writing/deconstructing-ai-eval-lab-workings I got bored of UI work at my day job and wanted to build something. Ended up building a platform that streams KiCad (a PCB design tool)…
Show HN: AI memory with biological decay (52% recall) (github.com via hn)
Most RAG setups fail because they treat memory like a static filing cabinet. When every transient bug fix or abandoned rule is stored forever, the context window eventually chokes on noise, spiking token costs and degrading the agent's rea…
Show HN: AgentSwarms – free hands-on playground to learn agentic AI, no setup (agentswarms.fyi via hn)
Show HN: AgentSwarms – free hands-on playground to learn agentic AI, no setup required!
Show HN: Cyberpunk mission control for AI agents, one HTML file (github.com via hn)
Solar System Agents The AI agent dashboard your team actually wants to look at. A cyberpunk command center for monitoring AI agent fleets in real-time.
DIY home improvement got me thinking of an AR app that overlays paint roller tracks so you can see what part of your white-primed ceiling you've already painted white. That got me thinking: what other features of such an app would really r…
-
104 items
event
CopilotMicrosoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
105 itemsevent
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
- 6m I changed ai youtube for “screenshot an X post → give it to my claude” and my output went up
- 3h How do you learn and keep up
- 5h How do I access Claude "Computer Use"?
- 6h Cowork tab gone from Claude desktop — WSL distro never registers after reinstall. Anyone fix this?
- 6h Can I have Claude use Excel and PDFs in Windows 10 Home?
Pre trade check prompt using Claude + Market Pulse MCP - what do you think? (www.youtube.com via reddit)
Claude can now run complex technical analysis if you feed it the right data. Are junior stock analysts cooked as well?
mesa PR with 37-130% llama.cpp pp perf gain for vulkan on Linux on Intel Xe2 (gitlab.freedesktop.org via reddit)
Making sure you're not a bot! Loading...
locally uncensored is a desktop app that combines four things most people run separately: chat, a coding agent, image generation, and video generation. all local, all on your hardware, no docker, no cloud account needed.
ChatGPT-psychosis: How it can occur and how to avoid it. (www.reddit.com)
Hey everyone, If there are AI developers, prompt engineers, or system architects here, this is especially for you. You should really take this into account.
am i the only one who’s lowkey waiting for it to suddenly stop working or is everyone quietly doing this?
Andrej Karpathy: How I use LLMs [video] (www.youtube.com via hn)
About Press Copyright Contact us Creators Advertise Developers Terms Privacy Policy & Safety How YouTube works Test new features NFL Sunday Ticket © 2026 Google LLC
-
29 items
model roundup
GPT 5.4OpenAI has released GPT-5.4-Cyber for testing and claims it will compete with Claude Mythos. Meanwhile, GPT-5.4 Pro has solved the Erdős Problem #1196, showcasing its advanced capabilities in mathematics.
- 6m GPT-5.4 compared to GPT-5.5 on MineBench
- 1d Decreased Intelligence Density in DeepSeek V4 Pro
- 1d First impressions using GPT 5.5 for video game scripting
- 1d Testing GPT-5.5 in early access: what we are seeing so far
- 2d Top open weight models like ds v4 pro max are still like 6-7 months if not more behind closed lab models
48 itemsmodel roundup
DeepSeek 4DeepSeek-V4-Pro is a 1.6T parameter Mixture-of-Experts model supporting one million-token context, with significant improvements in efficiency and stability through hybrid attention and manifold-constrained hyper-connections. Community highlights include its cost-effectiveness via the official API and exceptional performance in large code change evaluations, with some noting its surprisingly robust output capability despite a 384K max token limit.
Automated systematic literature review with Claude Code (www.youtube.com via hn)
About Press Copyright Contact us Creators Advertise Developers Terms Privacy Policy & Safety How YouTube works Test new features NFL Sunday Ticket © 2026 Google LLC
CAD in Codex (twitter.com via hn)
could not extract summary
- What is Codex? (openai.com)
Whenever I asked Claude to make a code change it would reprint the entire file with like 400 lines of unchanged code just to swap out one component. Burning through your daily limit for no reason.
how does chatgpt plus for $3... is this an official version ? (www.reddit.com)
someone sold me chatgpt plus for $3 and i’ve been using it heavy for almost a month now… nothing has happened lol is this actually safe or am i living on borrowed time? anyone else in the same boat?
Best Claude Code workflow for Website Redesign (www.reddit.com)
Hello everyone, I've been requested by a client to redesign their company website because their old one shows its age now. I have some experience in web development but didn't work on anything like that in a while.
Claude deleting all my tests and blaming me 🤣🤣🤣 (www.reddit.com)
My virtual brother in Christ I did not tell you to delete every unit test 🤣 Edit: It migrated the tests from gjt history to the required format easily. This isn't intended as a complaint it's just funny that it interpreted my instructions…
-
178 items
model roundup
Qwen 3.6Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 1h What is the best coding agent (CLI) like Claude Code for Local Development
- 3h Qwen 3.6 27B in Claude Code says it will do something then stops and prompts for user reply (not failing a tool call)
- 9h Qwen3.6 35B A3B Heretic (KLD 0.0015!) Incredible model. Best 35B I have found!
- 11h [Qwen3.6 35b a3b] Used the top config for my setup 8gb vram and 32gb ram, and found that somehow the Q4_K_XL model from Unsloth runs just slightly faster and used less tokens for output compared to Q4_K_M despite more memory usage
- 12h Benchmark: Windows 11 vs Lubuntu 26.04 on Llama.cpp (RTX 5080 + i9-14900KF). I didn't expect the gap to be this big.
71 itemsmodel roundup
Opus 4.6Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.
RuneBench – Agent Benchmark on RuneScape Gameplay Tasks (maxbittker.github.io via hn)
runescape-bench evaluates AI coding agents on their ability to play RuneScape.
Hey r/ClaudeAI — I just open-sourced Tarn, a CLI-first API testing tool I built collaboratively with Claude Code over the last few months. It's free, MIT-licensed, and a single static binary you install with curl | sh or cargo install.
THE PROBLEM WITH "JUST USING CLAUDE" (www.reddit.com)
Here's a scenario that probably sounds familiar. You describe a feature, Claude writes code, the code compiles, the tests pass, and everything feels great.
- How are you using Claude in your business? (www.reddit.com)
ai model for 12 gb ram 3 gb vram gtx 1050 (www.reddit.com)
gemini chatgpt claude old models = worst thing ever. any good model for 12 gb ram 3 gb vram gtx 1050 linux mint 22.2?
I built an agent that breaks your AI agents before someone else does (fabraix.com via hn)
Find gaps in your AI systems before users (or attackers) do.
Do you see dev process post AI (coding agents) era will evolve? (www.reddit.com)
Do you see dev process post AI (coding agents) era will evolve? I mean for decades agile/sprint based methodology had pretty much become a global standard.
- Do you see dev process post AI (coding agents) era will evolve? (www.reddit.com)