Show HN: Do Thought Streams Matter? A Benchmark of VLM Reasoning in Gemini 2.5
We benchmark how internal reasoning traces, which we call thought streams, affect video scene understanding in vision-language models. Using four configurations of Google's Gemini 2.5 Flash and Flash Lite across scenes extracted from 100 h…