Temporal Gains, Spatial Costs: Revisiting Video Fine-Tuning in Multimodal Large Language Models | ScienceToStartup | ScienceToStartup