How do new NLP evaluation frameworks provide deeper insights | ScienceToStartup | ScienceToStartup