Ff you thought the hypetrain was exaggerated before, just you wait...
The benchmark game has entered its IPO era. (www.reddit.com via reddit) Sassy Claude is best Claude (www.reddit.com via reddit) I audibly laughed at the amount of shade thrown at Microsoft from Claude lmao
- 66 items
thread
Opus 4.7 I’ve been coding for 20+ years - Claude Code helped me ship 3 geeky iOS apps faster than ever (www.reddit.com via reddit) I’ve been writing software for over 20 years and Claude Code has been a game changer for me. Before tools like Claude Code, building iOS apps in Expo often meant spending months digging through docs, stack overflow, GH issues and random ed…
How to get better at using claude code and coding agents in general? (www.reddit.com via reddit) How to get better at using claude code and coding agents in general? And I mean everything from writing better prompts for planning, debugging but also learning the addons like skills and knowing when and how to leverage that.
- 8 items
thread
MiniMax 2.7MiniMax-M2.7 is a large language model capable of complex agent tasks and real-world programming, achieving a 66.6% medal rate on MLE Bench Lite and matching GPT-5.3-Codex in professional software engineering benchmarks. Community discussions focus on optimizing the model's performance for different hardware and running it efficiently on low-power devices.
LLMs and Agents: How do they Work? (mattrogish.com via hn) For as much as AI has dramatically changed all of our lives, mine included, I was a bit uneasy that I didn’t really know how they are implemented and how they work. What is thinking and chain-of-thought reasoning?
Claude Status Update : Claude Cowork not starting for some users on 2026-04-16T21:29:01.000Z (www.reddit.com via reddit) This is an automatic post triggered within 2 minutes of an official Claude system status update. Incident: Claude Cowork not starting for some users Check on progress and whether or not the incident has been resolved yet here : https://sta…
- 21 items
thread
Qwen 3.6 - 31 items
thread
Opus 4.6Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.
What else can I learn? (system engineer trying to use AI) (www.reddit.com via reddit) I decided to embrace it and when the time comes, I want to be "the guy who knows this stuff". I am a old system admin/engineer, mostly linux and OCP related.
Ollama v0.21.0-Rc0 (github.com via hn) Ollama Start building with open models. Download macOS curl -fsSL https://ollama.com/install.sh | sh or download manually Windows irm https://ollama.com/install.ps1 | iex or download manually Linux curl -fsSL https://ollama.com/install.sh…
What’s ur Favourite Claude Thinking word? Mine is combobulating (www.reddit.com via reddit) You've been blocked by network security. To continue, log in to your Reddit account or use your developer token If you think you've been blocked by mistake, file a ticket below and we'll look into it.
Is there a way to access past models in Claude chat (not Claude code)? (www.reddit.com via reddit) Currently using Sonnet 4.5 for writing and find it quite good. Sonnet 4.6 just feels off.
Codex for almost everything (openai.com via hn) We’re releasing a major update to Codex, making it a more powerful partner for the more than 3 million developers who use it every week to accelerate work across the full software development lifecycle. Codex can now operate your computer…
Supergrok integration (www.reddit.com via reddit) Correct me if I'm wrong, but Supergrok 4.20 isn't available on Cursor, because.... I use Grok a lot, and would love to get Supergrok to work with Cursor, because Composer, Codex, GPT, Opus, Sonnet..
FrontierSWE: An ultra-long horizon coding benchmark (www.frontierswe.com via hn) PostgreSQL 18 on SQLite Build a PostgreSQL 18 server in Zig that uses SQLite for storage. We show the best score achieved by any model on the task.
I can not use Claude Cowork (www.reddit.com via reddit) Hey everyone! I just bought a Claude Pro subscription, and I've never really used AI before.
Filling DOCX forms: GPT-5.1 broke it, every Claude model handled it (varstatt.com via hn) Jurij Tokarski Filling Forms No Tool Can Template Every tender form is different, templating tools need placeholders you can't insert, markdown round-trips destroy the document, and only some models can do XML surgery on the original file.…
Show HN: Runtime security for AI agents(injection,tool abuse, data exfiltration) (news.ycombinator.com via hn) Hi HN I’ve been working on an open-source project to explore a problem I keep running into with LLM systems in production: We give models the ability to call tools, access data, and make decisions… but we don’t have a real runtime security…
What happens if I move my Claude CoWork project folder? How do I reconnect it after? (www.reddit.com via reddit) Hey guys, quick question. I’m thinking about moving my Claude CoWork project folder to a different location on my computer because I’m trying to clean up my directories a bit and get organized...
Claude Status Update : Claude Cowork not starting for some users on 2026-04-16T20:47:32.000Z (www.reddit.com via reddit) This is an automatic post triggered within 2 minutes of an official Claude system status update. Incident: Claude Cowork not starting for some users Check on progress and whether or not the incident has been resolved yet here : https://sta…
Cursor coding agents course (30 minutes) (www.youtube.com via reddit) Hey everyone! I've been working on this course for months for cursor.com/learn.
AI agents dont just help banks they can now BE your bank (www.reddit.com via reddit) Seeing alot of posts here about AI agents built for financial institutions but I think the bigger shift is AI agents doing the banking for you not for the bank. I run a small dev shop and saw a blog about opening a bank account with AI thr…
If you ask chatGPT to choose 1 word, it will always choose "momentum". (www.reddit.com via reddit) One could presume it subconsciously picks what is applicable to the momentum that AI has right now.
Google, please just open source Imagen (2022), Gemini 1.0 Nano and Gemini 1.0 Pro. You have nothing to lose at this point. (www.reddit.com via reddit) Ok, so imagen (the original one from 2022, not imagen 3/4) should be open source. The gemini 1.0 nano model and the gemini 1.0 pro models should be open source.
Catai – Virtual pixel art cats for macOS that chat with you via Ollama (github.com via hn) CATAI Virtual desktop pet cats for macOS — pixel art cats that live on your dock, chat with you via Ollama LLM, and debate ideas together to help you brainstorm and refine your thoughts. Features Dock companion — Cats walk along your dock…
Show HN: Honeymux, a TUI wrapper for tmux that simplifies agent-driven workflows (hmx.dev via hn) Hi HN, I'm a big fan of tmux, so I took a shot at adding what I felt was missing: - OS-native copy/paste and search from any pane - A bit more "UI" than just a status bar and border lines - Persistent remote logins w/o the mental load of t…
I built an MCP server that turns Claude into an emergency medicine assistant — what I learned building AI for high-stakes domains (www.reddit.com via reddit) If you work in healthcare or just want to see how Claude handles high-stakes clinical reasoning — I built an MCP server for this and wanted to share what made it harder than a typical AI project. EMSy is built on top of Claude and connects…
Trying to automate the job hunt with Claude - any advice? (www.reddit.com via reddit) Im a soon to be grad trying to job hunt after getting super burnt out in the Fall. Ive been trying to get Claude to help with the hunt to varying results.