Humans vs Vision-Language Models: A Unified Measure of Narrative Coherence | ScienceToStartup | ScienceToStartup