The Validity Gap in Health AI Evaluation: A Cross-Sectional Analysis of Benchmark Composition | ScienceToStartup | ScienceToStartup