Vibe coding can turn into a gambling loop (www.reddit.com)
I use AI coding tools a lot, so this is not an anti-AI post. If anything, the problem is that they are useful enough to change how I work.
I'm late (www.reddit.com)
I started learning n8n about a month ago with the explicit goal of working as a freelancer and providing automation and AI agents to companies. Then I started seeing conversations and posts about dispensing with n8n and its demise in the n…
Show HN: Valkyr LM Inference with Realtime Guarantees (github.com via hn)
Valkyr is a fresh take on LM Inference runtimes. It's quite different from llama.cpp, vLLM, or ZINC for example.
Show HN: Sourcery – Open Deep-Research, Grounded in Evidence (sourcery-deep-research.pagey.site via hn)
I took this on as a fun little learning exercise. In the end, however, it came out on par with commercial-level deep-research offerings.
Show HN: Kirikiri – A mobile IDE for Claude Code (iOS, open source) (news.ycombinator.com)
Claude Code runs in a terminal. Phones don't have good terminals.
You give Cursor a real task and watch it work… from memory. Ask for a landing page → generic off-brand Tailwind hero. Ask for Clerk auth → skips JWT verification. “I’ll write a CSV parser” → reinvents half of papaparse (badly). You just spent…
-
113 items
model roundup
GPT 5.5
On [Date], a significant leak of the OpenAI Codex model, referred to as GPT-5.5, was captured on video before it was patched. The incident involved models named Arcanine and Glacier-alpha.
16 items
model roundup
Qwen 3
Qwen3-0.6B is a large language model from the Qwen series, featuring dense and mixture-of-experts architecture, with significant improvements in reasoning capabilities and human preference alignment. Community feedback highlights its effectiveness for teaching from extensive documents and its suitability for low VRAM setups as a text-to-speech (TTS) model.
- 5m [Paper on Hummingbird+: low-cost FPGAs for LLM inference] Qwen3-30B-A3B Q4 at 18 t/s token-gen, 24GB, expected $150 mass production cost
- 3h Looking for Small VLM/MLLMs Alternatives to Qwen Series Models
- 18h Qwen Meetup Draft Review Required (Function Calling Harness 2 - CoT Compliance from 9.91% to 100%)
- 20h Poor GPU Club : Tried Bonsai-8B on CPU & CUDA
- 2d Tested Tether's QVAC SDK on Android with a custom fork — real-time voice loop, Parakeet streaming + Qwen3 1.7B + Supertonic, LLM triggered mid-utterance
MacBook m5 pro (www.reddit.com)
Hello all, I just got my hands on an m5 pro with 64 GB (unified) memory. I’m itching to try some good models for coding.
I've been giving my coding agent access to a folder of markdown files as its long-term memory. It works surprisingly well for open-ended questions — "why did we choose Postgres over DynamoDB?" or "what's the context behind the auth rewrite…
I'm a Linux sysadmin rather than a dev, and I have recently discovered how much Claude has levelled up. I can see many different ways it can augment not just code writing and debugging but also workflow optimisation and adm…
Ask HN: Can I trust GitHub not to use my code for LLM training? (news.ycombinator.com)
I have a growing concern regarding the safety of my source code. Some of my personal (Edit: still professional, I'm a one-man company) projects rely on algorithms and technologies I consider special.
A Claude Code mobile app studio for solo builders (www.reddit.com)
The Ultimate LLM Fine-Tuning Guide (www.reddit.com)
I had been looking for a "spot-on" fine-tuning guide for quite a while, but couldn't find one. So I thought: let me write it myself.
-
155 items
event
Cowork
Issues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
143 items
model roundup
Gemma 4
Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.
- 41m Which model should I try?
- 7h Multi agent AI Trading Floor
- 13h I made a visualizer for Hugging Face models
- 19h Tried running Claude Code with local LLMs via Ollama — ended up subscribing to Pro anyway. But now I can't disconnect from the local server.
- 21h Qwen 3.6 wins the benchmarks, but Gemma 4 wins reality. 7 things I learned testing 27B/31B Vision models locally (vLLM / FP8) side by side. Benchmaxing seems real.
Why AI Agents are either the best or worst thing we've ever built [video] (www.youtube.com via hn)
Hi everyone. We are a servicing and maintenance company dealing with compliance, health & safety, and fire protection. We use web-based scheduling and certification software and web-based accounts software for invoicing clients, etc. Is anyone…
World shipped AgentKit a couple weeks back, sharing what i picked up (www.reddit.com)
So I've been reading up on the World AgentKit launch from April 17 and figured I'd share what I pieced together. The basic idea is a verified human delegates their World ID to an agent, and the agent carries cryptographic proof that a real…
Claude's webinars (www.reddit.com)
Are there any ways I can get transcribed versions of Claude's webinars?
- Claude.md (gist.github.com via hn)
- What do you do with Claude? (www.reddit.com)
Richard Dawkins and the Claude Delusion (plus.flux.community via hn)
Senescence makes people believe silly things, so does bad science. Prominent evolutionary biologist Richard Dawkins became a worldwide laughingstock this week for an unintentionally embarrassing artic…
- Richard Dawkins and The Claude Delusion: The great skeptic gets taken in (garymarcus.substack.com via hn)
- The Claude Delusion: Richard Dawkins believes his AI chatbot is conscious (www.dailygrail.com via hn)
-
274 items
model roundup
Qwen 3.6
Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
29 items
model roundup
GPT 5.4
OpenAI has released GPT-5.4-Cyber for testing and claims it will compete with Claude Mythos. Meanwhile, GPT-5.4 Pro has solved the Erdős Problem #1196, showcasing its advanced capabilities in mathematics.
- 50m [AutoBe] Local LLM Benchmarks about Backend Generation, Monthly (GLM vs Qwen vs DeepSeek)
- 50m LLM proxy that lets Claude Code talk to any model
- 1d UPDATE: The method from the proof generated by GPT-5.4 Pro for Erdos Problem #1196 was successfully applied to other problems including another 60 year old Erdos conjecture.
- 4d A GPT-5.4 bug led to OpenAI banning goblins and raccoons
- 4d How is deep seek v4 not SoTA?
me beginner: How to use Kimi 2.6 in Cursor? (www.reddit.com)
I just paid for the official Kimi subscription. I don't want to use Kimi code, the console-looking thing; I want to use something like the Cursor agentic feature.
What are tarpit ideas in the AI era? (news.ycombinator.com)
For those unfamiliar with the term, tarpit ideas are ideas that always attract lots of founders but never really work. They usually sound amazing on paper.
Recommended Agent.md file for academic research (www.reddit.com)
Hi, I want to update my Agent.md file. Given token limits and the need for smarter usage, are there any recommendations or guidelines for updating it?
Show HN: How to build CLI for agents – Loxone CLI as an example (github.com via hn)
As an experiment, I built a CLI to configure Loxone home automation servers; their primary mechanism at the moment is a UX. Key learnings: * commands need to provide good feedback (e.g.
why has my Sonnet started to 'agree' with me more? (www.reddit.com)
This week I noticed much more "Great question!" etc. I liked the bluntness before and don't want it to sugarcoat answers.
Unauthorised [Claude Pro] Gift Purchases Made from My Account (old.reddit.com via hn)