I’ve been building Uisato Studio, a workflow-based AI creation platform for audiovisual work. This is the Music Video mode: upload an image + audio, and the system analyzes the input, generates visual direction, creates clips, handles b-ro…
Ask HN: Is this the SWE workflow of the future? (news.ycombinator.com)
Internally transferred to a new team in a top-10 F500 company. This team is pushing incredibly hard to be seen as "AI-First" & is very opinionated about what other teams should be doing.
Akamai surges on big LLM deal as Cloudflare dims (www.theregister.com via hn)
Cancelling Claude subscription renewal immediately revokes Design access (news.ycombinator.com)
Starting today, Anthropic now immediately revokes Claude Design access if you cancel your subscription plan renewal, even while you're still in a valid period you've already paid for. I had a Claude 20x max plan and cancelled my automatic…
Got parented by Claude (www.reddit.com)
Bomboclat, haven't seen an AI be this brutal.
Owl Alpha – A free model for agentic workloads (prompts logged / closed-source) (openrouter.ai via hn)
Owl Alpha (openrouter/owl-alpha). Released Apr 28, 2026. 1,048,756 context, $0/M input tokens, $0/M output tokens. OpenRouter provides an OpenAI-compatible completion API to 400+ models & providers that you can call directly, or using the OpenAI SDK…
AI Agent Passport – an open identity standard for AI agents (github.com via hn)
🌐 AI Agent Passport A verified identity standard for AI agents. AI agents are showing up at websites, booking flights, paying bills, buying groceries — with no proof of who sent them, what they're allowed to do, or whether to trust them.
Academic Research Skills for Claude Code (github.com via hn)
A comprehensive suite of Claude Code skills for academic research, covering the full pipeline from research to publication. Install in 30 seconds (Claude Code CLI / VS Code / JetBrains, v3.7.0…
- Composing Claude Code Skills (gist.github.com via hn)
- Claude Code->Desktop Skills (www.reddit.com)
78 items
model roundup
Sonnet 4.6
Sonnet 4.6, a new release noted for its "unhinged" behavior, has sparked discussions among users about unexpected changes in software performance and cost management strategies involving Cursor and Claude APIs.
331 items
model roundup
Opus 4.7
Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 50m Here is the current "Free-Tier AI Stack" for 2026
- 5h It's me, not Opus 4.7, who can't stay in the guardrails
- 7h Opus 4.7 High to Composer 2 fast
- 9h Attention - Opus 4.7 is English only. Using foreign languages (here German) burns tokens
- 12h Opus 4.7 and DeepSeek V4-Pro select Buddhism as preferred religion
I have a 16GB VRAM GPU and I'm looking for a reliable local OCR model. Ideally it should stay under ~60% VRAM usage, so around 9–10GB max, because I want to keep it available on-demand rather than loading a huge model only for occasional b…
Designing, Refining, and Maintaining Agent Skills at Perplexity (research.perplexity.ai via hn)
Perplexity’s frontier agent products rest on a foundation of know-how and domain expertise packaged in modular Agent Skills. We maintain a carefully curated library o…
I know Claude Code runs in the cloud, but in real-world use how much difference does local hardware actually make? I’m using Claude max and local llms just cannot compete for real dev work.
Original plan was to use Kimi/GLM for planning and DeepSeek for implementation, but seeing a lot of love for MiMo and Minimax lately. Anyone running a planner + coder split on Opencode?
I am too worried about installing a Skill with a virus, so I made a tool to check skills and ran it across ~60k Skills on Clawhub and it surfaced almost 1,000 high-risk ones, but the results show that high-risk viruses often disguise thems…
b9095 finally makes -sm tensor work on dual consumer Blackwell PCIe GPUs without NCCL. If you're on dual Blackwell GPUs, this looks like it could be big. I'll have my own results for 2x5060ti asap
Hello everyone, I've officially started building .agtx which is a new low-level, declarative language designed specifically for building, routing, and sandboxing AI agents with zero boilerplate. The goal is to completely ditch the heavy OO…
Show HN merit ranking: a four-stage pipeline that ranks Show HN posts by estimated merit using TrueSkill + LLM-as-judge, then surfaces where that ranking disagrees with actual HN points. The core claim: HN upvotes correlate with how easy a…
126 items
model roundup
GPT 5.5
On [Date], a significant leak of the OpenAI Codex model, referred to as GPT-5.5, was captured on video before it was patched. The incident involved models named Arcanine and Glacier-alpha.
- 55m GPT 5.5 kept calling me a goblin
- 1d Stop picking LLMs by reputation. Run the eval first.
- 1d GPT-5.5 low vs. medium vs. high vs. xhigh: the reasoning curve on 26 real tasks
- 2d GPT-5.5 correcting obvious typos really kills the vibe
- 2d ARC AGI is kind of BS (and I outlined an experiment that could prove it)
376 items
model roundup
Qwen 3.6
Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 1h Building out my tool library, any recommendations? I just added email capability and I'm starting to get hyped!
- 1h Speeding up local LLM for usable coding agent
- 5h Hello from 10KM high! - Thanks to Qwen 3.6 35b a3b!
- 5h Has anyone bought a 3080 20GB mod recently?
- 9h am I running this llama-bench of Qwen3.6-27B on these V100s right?
Trojan in "claude code" google search first result (www.reddit.com)
I never thought I would fall for this shit. I have been on the internet since 1996.
Use Boring Languages with LLMs (jry.io via hn)
I'm Jacob. I run Sancho Studio, a software consulting group; we help companies with technical leadership, strategy, and security. I keep coming back to this idea that consistency compounds.
Newbie needs help on the best tools to use (www.reddit.com)
Hi everyone. I’m (almost) a complete newbie when it comes to LLMs and personal productivity tools.
Google readies ‘AI Ultra Lite’ plan and explicit ‘usage limits’ for Gemini (9to5google.com via reddit)
Google is quietly preparing a new “AI Ultra Lite” subscription tier to slot between its $20 Pro and $250 Ultra plans, plus a dedicated dashboard for subscribers to see their remaining token budget. If you’ve been following AI news in recen…
Distraction FREE YouTube using AI, Gemini and Vector Embedding in MV3 in Browser (chromewebstore.google.com via hn)
Enter your interests. AI scores every YouTube video for relevance and fades the rest.
Cyber.md: AI-native posture that speaks agent (baz.co via hn)
Agents have changed how I think about secure development [1], and specifically the way security knowledge moves through a repository. Today, developers and agents at Baz are generating more code than ever, and the amount of code being writ…
Agentic Hooks - Stream Deck plugin (www.reddit.com)
I had an itch to address long-running tasks with Claude, where I wanted to see when it's done working. And I wanted a separate context flow for these alerts instead of using an existing flow (phone, Discord, Telegram, etc.). This is when the idea was born, sh…
I built my own GTA 6 (but it's 2d pixelart and 100% AI) with Claude (www.reddit.com)
Working on a fully AI-native online game similar to GTA Online but in Habbo Hotel style, and all content is live AI-generated! Players can create their own characters, weapons, and buildings in the shared universe and raid other players' homes!
Retainer: Autonomous agent for extended, independent operation (github.com via hn)
A persistent personal AI agent that runs on your laptop. Memory lives on disk in a folder you own, every cycle is appended to a JSONL audit log, and the persona is an editable markdown file rendered into the system prompt each cyc…
LLM Inference Throughput Rises 4.5x with Parallel Verification (presciente.com via hn)
Edition 74: LLM Inference Throughput Rises 4.5x with Parallel Verification | Presciente Intelligence Briefing