What is the trend in LLM evaluation? | ScienceToStartup