As of mid Apr 2026, I have noticed every model has had a major intelligence drop. And no I'm not talking about just ChatGPT.
#gemini
85 items
Major drop in intelligence across most major models. www.reddit.com €54k spike in 13h from unrestricted Firebase browser key accessing Gemini APIs discuss.ai.google.dev Hello, We are looking for guidance regarding an unexpected €54,000+ Gemini API charge that occurred within a few hours after enabling Firebase AI Logic on an existing Firebase project. Background: We created the project over a year ago a…
Gemini 3.1 Pro #1 at METR Timeline 80% Success Rate (1.5H) www.reddit.com #2 at 50% success rate (task length: 6H 24M)
I’ve used enough AI models to realize they all have wildly different personalities At this point I’m convinced AI models are just coworkers with different levels of talent, ego, and criminal energy. www.reddit.com - Claude Opus 4.6 - absolute rogue AI. Does what I want like it’s breaking at least 3 internal policies to make it happen.
Gemini Robotics ER-1.6 enhances reasoning to help robots navigate real-world tasks blog.google For robots to be truly helpful, they need to understand the physical world like we do. That’s why today we're introducing Gemini Robotics-ER 1.6, an upgrade to our reasoning-first model that enables robots to understand their environments…
Guys we have to change the pelican test www.reddit.com So i have been seeing more of those pelican on a bike svg tests and while they work i feel like (and maybe you guys do too) they are getting kinda benchmaxxed so we should switch things up soon and this is my idea generate me a html svg of…
We benchmarked TranslateGemma-12b against 5 frontier LLMs on subtitle translation - it won across the board, with one significant catch www.reddit.com OpenAI continues to lose market share in GenAI website traffic, while Gemini, and Claude are gaining: www.reddit.com - ChatGPT 56.72% vs 77.43% 12 months ago - Gemini 25.46% vs 6% 12 months ago - Claude 6.02% vs 1.4% 12 months ago At this point in the race its all about distribution & the cost of serving these models.
What's the cheapest way to access multiple frontier AI models? www.reddit.com Google Launches Gemini 3.1 Flash TTS Text-to-Speech Model x.com Logan Kilpatrick on X: "Introducing Gemini 3.1 Flash TTS 🗣️, our latest text to speech model with scene direction, speaker level specificity, audio tags, more natural + expressive voices, and support for 70 different languages. Available v…
Docker sandbox templates for running Claude Code with a web/mobile UI (CloudCLI) www.reddit.com I maintain CloudCLI, an open source web/mobile UI for AI Coding agents like Claude Code, Gemini and Codex (https://github.com/siteboon/claudecodeui if you are not aware) We recently added Docker Sandbox support and I wanted to share it her…
Which is the strongest reasoning model according to you? www.reddit.com My experience with testing all frontier open-weight models against GPT and Claude www.reddit.com I spent about a week testing open-weight models for real work, comparing them against what I already know from ChatGPT, Gemini, and Claude. The gap between what benchmarks suggest and what happens when you give these models something to ve…
Why does ChatGPT freeze with 1000 messages but Claude and Gemini don't www.reddit.com Single question llm comparison www.reddit.com for those of you also using CLI tools alongside cursor, claude code vs codex vs gemini benchmarked www.reddit.com Tool that auto-generates .cursor/rules from your actual CI and keeps it in sync with AGENTS.md, CLAUDE.md, and 10 others www.reddit.com How do you handle Front End? Delegate to Gemini? www.reddit.com Gemini 3.1 Flash TTS – with directed prompts simonwillison.net Google released Gemini 3.1 Flash TTS today, a new text-to-speech model that can be directed using prompts. It's presented via the standard Gemini API using gemini-3.1-flash-tts-preview as the model ID, …
Gemini 3.1 Flash TTS: the next generation of expressive AI speech blog.google Gemini 3.1 Flash TTS: the next generation of expressive AI speech Today, we’re introducing Gemini 3.1 Flash TTS, the latest text-to-speech model that delivers improved controllability, expressivity and quality — empowering developers, ente…
Gemini Robotics-ER 1.6: Embodied reasoning for real-world robotics tasks deepmind.google what model is good for inspecting and extracting data from large set of spreadsheets www.reddit.com as per title - i need to extract some data from a set of spreadsheets and wondering what would be the best method locally? I think I can utilise gemini-cli for that but can a local model work better?
I need some advice, learn about agents + regular use for a handmade business product descriptions www.reddit.com They Hacked Claude, Gemini, and Copilot (and No One Told You) grith.ai They Hacked Claude, Gemini, and Copilot (And No One Told You) A security proxy for AI coding agents, enforced at the OS level. Register your interest to be notified when we go live.
Show HN: Do Thought Streams Matter? A Benchmark of VLM Reasoning in Gemini 2.5 arxiv.org We benchmark how internal reasoning traces, which we call thought streams, affect video scene understanding in vision-language models. Using four configurations of Google's Gemini 2.5 Flash and Flash Lite across scenes extracted from 100 h…
Google Launches Native Gemini AI App for Mac www.macrumors.com Google is bringing Gemini to the Mac with a new native macOS app that's available starting today. Gemini for Mac can be activated with a keyboard shortcut, and it has built-in tools for generating images, analyzing what's on your screen, r…
The Gemini app is now on Mac blog.google The Gemini app is now on Mac Today, we’re bringing the Gemini app to macOS as a native desktop experience, designed to live right where you work. It’s always just a keyboard shortcut away, so you can quickly get the help you need without l…
Gemini App on Mac gemini.google Gemini app, now on Mac Access Gemini from any screen on your desktop to clarify a topic, recall a formula, or brainstorm on the fly without opening a tab. It’s help on demand.
Show HN: Hormuz Trail - Oregon Trail parody/black-box AI coding exercise hormuztrail.com I jokingly told a co-worker Iran might make a good Oregon Trail parody. Then I built it.
Ask HN: What's with the Wargames-like UX lately? news.ycombinator.com For a while, anything with a purple gradient was likely a claude inspired design. I think there was a period where Gemini(?) also seemed to produce blue/purple retro sci-fi designs?
AI tools are getting dumber www.reddit.com I despair ... I've been using ChatGPT Plus, Gemini and Claude Pro for a while now.
Tracking in Claude, ChatGPT and Gemini Chatbots infosec.exchange Agent Skills for Software Test Automation news.ycombinator.com NVIDIA + UMD released AF-Next: open audio-language model that outperforms Gemini-2.5-Pro on MMAU-Pro (75.01% vs 57.4%). Temporal Audio Chain-of-Thought anchors reasoning to timestamps. www.aiuniverse.news If you know how to set up OpenAI & Gemini API keys, this tool can save your hours of work on social media www.reddit.com Which AI chat is better for daily chatting? www.reddit.com Is Gemini 3.1 pro really that bad?? www.reddit.com Google launches native Gemini app for Mac 9to5google.com Gemini now has a native Mac app in the first expansion beyond Android and iOS. This “native desktop experience” is launched via an Option + Space keyboard shortcut.
How I feel when I Claude Code.. www.reddit.com Made with Gemini
Gemini 3.1 Flash Live blog.google Gemini 3.1 Flash Live: Making audio AI more natural and reliable Today, we’re advancing Gemini’s real-time dialogue capabilities with Gemini 3.1 Flash Live, our highest-quality audio and voice model yet. It delivers the speed and natural r…
Comment and Control: Prompt Injection in Claude Code, Gemini CLI, and Copilot oddguan.com Anthropic Claude Code Security Review, Google Gemini CLI Action, and GitHub Copilot Agent are vulnerable to prompt injection via GitHub comments — turning PR titles, issue bodies, and issue comments into attack vectors for API key and toke…
Gemini Plugin for Claude Code github.com Gemini Plugin for Claude Code Use Google's Gemini CLI from inside Claude Code to review code or delegate tasks. Why bring Gemini into Claude Code?
Show HN: Cyber Pulse. AI pipeline for triage and alerting on cyber news/intel play.google.com I had 11 AI agents try to book a flight. Average satisfaction: 3.4 out of 10 www.reddit.com Show HN: Zero-identity messaging app with physics-based post-quantum encryption news.ycombinator.com Commitgen – AI-generated Conventional Commit messages from your staged diff news.ycombinator.com I stumped all frontier models with a ~400 word logic puzzle. www.reddit.com Why ChatGPT eats all my RAM? www.reddit.com A Simple Coding Agent in a Loop with LangChain4j, Jbang, and Gemini glaforge.dev Show HN: Android AI agent-assistant operating your apps (no adb,PC,root,etc.) news.ycombinator.com Need help with automating my editing workflow www.reddit.com I run a very small YouTube channel I used to edit my videos using CapCut (Free editing software), but at some point I realized my editing process is very formulaic or algorithmic. so I decided to use AI to help me automate my editing workf…
Is Auto just Composer now? www.reddit.com I've run out of API usage and most of my queries use "Auto" now and I notice that they all go to Composer. Trying to select any other model even the supposedly cheap Kimi K2.5, Gemini Flash 2.5, etc., triggers a notification saying that I'…
PDF Analysis/Splitting Agent www.reddit.com Hi all, I'm fairly new to building AI agents and would like to build a functional POC as a learning experience. We have an enteprise Gemini license, so that'd be the ideal tool to use, but I would be open to suggestions.
Complex, parallel, long-running claude/agentic sessions - what is the point? where is the value? www.reddit.com Here is how I view AI Agents field (with focus on SWE/research) right now: - "chats online" gpt/gemini/claude --> general use - "vscode like extensions" cursor/antigravity/cline vs code extension/cc vs code extension etc. --> for coding, b…
Complex, parallel, long-running claude/agentic sessions - what is the point? where is the value? www.reddit.com Here is how I view AI Agents field (with focus on SWE/research) right now: - "chats online" gpt/gemini/claude --> general use - "vscode like extensions" cursor/antigravity/cline vs code extension/cc vs code extension etc. --> for coding, b…
AI as an attorney? Student uses ChatGPT, Gemini to sue UW www.kuow.org AI as an attorney? Student uses ChatGPT, Gemini to sue UW over alleged racial discrimination Stanley Zhong graduated from his Bay Area high school with a 4.42 GPA, a 1590 SAT score, and high rankings in several international coding competi…
Show HN: Made a tool where "make it feel like cold metal" is a valid instruction cast.bsct.so I built https://cast.bsct.so with Biscuit! Chat with Claude, GPT or Gemini.
Gemini on Mac twitter.com Post Conversation Sundar Pichai @sundarpichai Introducing Gemini on Mac. It’s the first time we’re bringing the @Geminiapp to desktop.
Subagents have arrived in Gemini CLI developers.googleblog.com Learn how subagents in Gemini CLI act as specialized experts to handle complex, high-volume tasks in isolated context windows. This new feature enables parallel execution, reduces context rot, and allows for custom agent definitions using…
Claude Sucks news.ycombinator.com Claude is constantly down. I think it’s a great tool, and in a lot of ways, its capabilities are better than rivals like ChatGPT or Gemini.
Gemini Folders – A local, open-source Chrome extension for Gemini chromewebstore.google.com Overview Organize your Gemini conversations into custom folders. Do you use Google Gemini daily for work, coding, research, or creation, but constantly lose your important prompts in an endless history?
Local models capabilities www.reddit.com Claude CLI, Codex CLI and Gemini CLI, all have agentic capabilities that it is capable of editing files or folders in my local machine directly or the apps that I have integrated using MCPs when working on my request like coding task or re…
Show HN: FlipAEO – Get your SaaS cited by Perplexity and AI search news.ycombinator.com Hey HN. I am a solo dev.
I built Fixy Code — a multi-agent coding terminal built with Claude Code www.reddit.com Built this with Claude Code. Free to try.
I've been building Nest by RAVEN with Claude Code for the past few months. Claude has been part of the process from day one — and it ended up being one of the core AIs the product is built around. www.reddit.com Nest is a desktop workspace (Mac + Windows) that runs multiple AI CLIs in a resizable grid. Each pane is a fully independent session with its own account, history, and environment.
Show HN: ContextPack – CLI that maps any codebase into ranked context github.com I'm a sophomore CS student and built this as a side project to solve a problem I kept running into — jumping into an unfamiliar codebase with no idea where to start. It's a static analysis engine that: - Walks the repo in two passes (no un…
I built an MCP server that gives Claude Code image/video generation, web search, and smart multi-model routing www.reddit.com Show HN: Get Hired with AI, a free book I wrote on using LLMs for a job search www.careervectorhq.com Extracted System Prompts from ChatGPT, Claude, Gemini, Grok, Perplexity and More github.com Show HN: RememberMap remembermap.com Show HN: Twins, a Gemini Server in Ada github.com Local coding agents. Am I missing something? www.reddit.com Claude Code with Pro subscription + OpenRouter in parallel — what's the cleanest setup? www.reddit.com Hi there, I have a Claude Pro subscription and use Claude Code daily. I'd also like to use Claude Code routed through my OpenRouter API key so I can experiment with other models (GLM-5.1, DeepSeek, Kimi, Gemini, etc.) — without giving up m…
First vibe coding and failed. Help!!? www.reddit.com I tested an idea for an app I want written. Claude wrote the code for the android app.
Sonnet is expensive, so I built a free open-source Sheets agent on Haiku that outperform the same prompt claude/gemini, here is what I learnt. www.reddit.com I live in Google Sheets. Financial models, projections, scenario planning — that's most of my working day.
Made a skill that actually scores and fixes your prompts www.reddit.com So I got tired of manually tweaking prompts over and over, so I made a Claude Code skill (Works with any LLM) that does it for me. You give it a prompt, it breaks it down, scores it 1-5, then rewrites it.
Claude seriously screwed me tonight, so i gave him the 3-pathway conversation. www.reddit.com As part of the management team, I've given this conversation more often than I'd like to admit. I usually have the support of my HR department.
Mon premier site 100% IA : Quand l’artisanat rencontre l’Antigravity www.reddit.com Fiers de vous présenter un nouveau site d’artisan peintre en région bordelaise. Ici, pas d’agence de communication, mais une équipe de choc pilotée par l’intelligence artificielle : Claude : Mon bras droit pour la structure, un peu farfelu…
Using Claude to plan triathlon/running workouts? www.reddit.com Model-Agnostic Continuity in LLMs www.reddit.com A director of engineering built a memory protocol across six coding agents in a week, and I think the findings are worth sharing www.reddit.com Why most open-source models can't answer this question while most closed-source models can answer most of the time? www.reddit.com The golden age is over www.reddit.com Thinking of buying Pro for a month www.reddit.com Frustrated with the big 3, anyone else in the same boat? www.reddit.com