Show HN: Do Thought Streams Matter? A Benchmark of VLM Reasoning in Gemini 2.5

hn · arxiv.org · 3 pts · 14h

We benchmark how internal reasoning traces, which we call thought streams, affect video scene understanding in vision-language models. Using four configurations of Google's Gemini 2.5 Flash and Flash Lite across scenes extracted from 100 h…

gemini
