When Chain-of-Thought Backfires: Evaluating Prompt Sensitivity in Medical Language Models | ScienceToStartup | ScienceToStartup