Multimodal Emotion Recognition via Bi-directional Cross-Attention and Temporal Modeling | ScienceToStartup