Common TF-IDF variants arise as key components in the test statistic of a penalized likelihood-ratio test for word burstiness | ScienceToStartup | ScienceToStartup