SelfGrader: Stable Jailbreak Detection for Large Language Models using Token-Level Logits | ScienceToStartup | ScienceToStartup