Anything that is underneath the cursor gets fed into Google's surveillance AI (mastodon.social via hn)
mcc: "Looks like Chromebook is being…" - Mastodon Skip to main contentHotkey 1 Skip to main navigationHotkey 2 Recent searches No recent searches Search options Only available when logged in. mastodon.social is one of the many independent…
Show HN: Dexgram – Telegram to Codex Desktop Bridge for Windows (github.com via hn)
I looked around for a while looking for something that: A. Is a self-contained binary.
So, as most of us here are, I'm a llama.cpp loyalist. Easy to understand, great configuration, relatively stable, etc.
Open source rule based guardrails for coding agents (github.com via hn)
Prempti Experimental Preview — This project is under active development and released as an early preview. Interfaces and behavior may change between releases.
Computer Use in Codex [video] (www.youtube.com via hn)
About Press Copyright Contact us Creators Advertise Developers Terms Privacy Policy & Safety How YouTube works Test new features NFL Sunday Ticket © 2026 Google LLC
- Bringing Codex computer use to iOS (www.reddit.com)
Loading/running every LLM with 4M ctx in 3 clicks (old.reddit.com via hn)
could not extract summary
Mm-ctx – fast, multimodal context for agents (huggingface.co via hn)
Post 482 mm-ctx – fast, multimodal context for agents. LLM-based agents handle text incredibly well, but images, videos, or PDFs with visual content are hard to interpret.
This is something I've wanted to exist for a while so I just built it. PromptFlow Voice is a standalone desktop app.
-
170 items
model roundup
Gemma 4Gemma 4 is a family of open-source multimodal models from Google DeepMind, available in sizes up to 31 billion parameters and featuring dense and MoE architectures. Notable community highlights include the 31B model's success in production tests, with some users preferring 4-bit precision for local use, and others sharing settings for optimizing performance with smaller models.
- 24m Local audio/multimodal models that can be used for language pronunciation grading
- 56m Are harnesses like OpenClaw and Hermes really necessary?
- 3h RTX 5060Ti 16GB or RTX 3080 20GB?
- 8h Local LLM autocomplete + agentic coding on a single 16GB GPU + 64GB RAM
- 9h Gemma 4 MTP vs DFlash on 1x H100: dense vs MoE results
407 itemsmodel roundup
Qwen 3.6Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 26m High VRAM local coding model — still Qwen 3.6 27B?
- 2h Qwen 3.6 27b MTP - getting //// in response
- 2h Thoughts on "production" model setups
- 4h Qwen3.6-35B-A3B-Uncensored-Claude-Wasserstein-MTP-GGUF
- 4h Luce DFlash + PFlash on AMD Strix Halo: Qwen3.6-27B at 2.23x decode and 3.05x prefill vs llama.cpp HIP
I built a free PII scanner for LLM prompts. No signup needed (aisecuritygateway.ai via hn)
Try an example How It Works Paste Your Prompt Enter the text you’d normally send to ChatGPT, Claude, or any AI model. Instant DLP Scan Our engine runs 50+ detection patterns across 28+ entity categories in milliseconds.
Sam Altman says Elon Musk tried to ‘kill’ OpenAI, in tense courtroom showdown (www.sfchronicle.com via reddit)
OpenAI CEO Sam Altman pushed back against Elon Musk’s accusations that he “stole a charity” by abandoning the company’s nonprofit mission to benefit humanity in favor of profits. “We created the largest or one of the largest charities in t…
Warning: Speculation. Summary: Current hardware prices may be due to military demand we can't see.
The Emotional Cost of AI-Assisted Coding (news.ycombinator.com)
This post doesn’t have a blog or article; it is a blog itself (kind of). I’m just sharing how coding agents feel.
AI agents still suck, so I built my own (www.reddit.com)
Right now the app ships with a wrapper around Claude Code, with support for codex coming this week. The broader focus is around composable flows + steps.
Open Source Claude Code Offer to Help (news.ycombinator.com)
Hello I have some time after work and would like to offer to help use Claude Code for 1 specific (TBD) open source project that needs some help. No charge.
Might be a dumb question , but I have no coding experience while trying to build a project using Claude. However, I would still like to know why it codes what it does and why.
Tamil Nadu had state elections on May 4. I wanted to see if I could build a better results site than what exists (everything out there is ad-ridden, slow, and unusable on mobile).
-
140 items
model roundup
Qwen 3.5Qwen3.5-9B is a post-trained model with 9 billion parameters that integrates multimodal learning and efficient hybrid architecture for enhanced performance. Community highlights include speculative decoding on Apple Silicon boosting Qwen3.5-9B's throughput by 4.1x, and the model outperforming others in coding tasks while addressing overthinking issues through tool usage.
- 32m How to disable reasoning for Qwen3.5 4b 9b unsloth ggufs?
- 4h Vulkan or CPU llama cpp backend for local llm for coding/code assist
- 1d prompt caching, but for rl training - 7.5x speedup on long-prompt/short-response workloads
- 3d The Quantization Method Apple Silicon Actually Rewards | by Alexandru Vasile | Mar, 2026
- 3d Does llama-swap actually work with mlx_lm.server / MLX models on macOS?
224 itemsevent
CoworkIssues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
Nearly Optimal Attention Coresets (www.pinecone.io via hn)
Abstract We consider the problem of estimating the Attention mechanism in small space, and prove the existence of coresets for it of nearly optimal size. Specifically, we show that for any set of unit-norm keys and values in , there exists…
Cryzo: Go from an idea to business just by chatting with Ai (www.reddit.com)
Imagine Building and running your entire business… through a single chat. No dashboards.
I just automated my own law office (8 employees). The Agent does everyhing - putting incoming docs in the right folder and renaming, handing it out to the AI Agent which does the main work: checking the whole case file (document / image ll…
EDIT: Edited to provide more clarity It occurred to me, that perhaps the same draft model used for speculative decoding would be completely adequate if we just used it's output as-is for reasoning, without validating the results against th…
SpaceX backs Anthropic with data centre deal amidst Musk's OpenAI lawsuit (www.aljazeera.com via hn)
SpaceX backs Anthropic with data centre deal amidst Musk’s OpenAI lawsuit SpaceX pens a new deal with Anthropic as Elon Musk sues competitor OpenAI for allegedly backtracking on its mission. Anthropic has reached a deal to tap the computin…
OpenAI Hit with Overdose Suit Targeting ChatGPT Drug Advice (1) (news.bloomberglaw.com via hn)
The family of a college student said ChatGPT caused their son’s fatal overdose after he followed medical advice about mixing substances from the chatbot, according to a newly filed lawsuit against OpenAI Foundation and Sam Altman. On the d…
How can I handoff from one agent to another? (www.reddit.com)
I often end up hitting my limit in say claude code. Id love to just continue the conversation in cursor/ codex.
what can you do with claude for 100 usd per month No flair (www.reddit.com)
is this enough to run agents? or you run out of tokens?
Anthropic's Computer Use API: How AI Is Navigating Your Desktop Now (www.aigridnews.com via hn)
As a developer, I’ve always said that the "Holy Grail" of productivity isn't just an AI that writes code, but an AI that can actually execute the workflow. We’ve spent decades building APIs to make software talk to each other, but what abo…
Can't get the date right from agents, even with the latest models, have you observed similar issues? If you are trying to create a booking agent, what will you do to make sure that the agent books the appointment for the correct date, beca…