What are the challenges in distilling complex datasets like | ScienceToStartup