How to Utilize Complementary Vision-Text Information for 2D Structure Understanding | ScienceToStartup | ScienceToStartup