Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing | Signal Canvas | ScienceToStartup