What are the advantages of CAViT's dynamic feature interaction for video analysis using Vision Transformers?Answer not yet generated.