Context Sensitivity Fingerprints (CSF) are compact profiles that quantify how model bias shifts across varying contextual framings in prompts. They measure per-dimension dispersion and paired contrasts, revealing that bias scores from fixed-condition tests may not generalize to real-world deployment.
Context Sensitivity Fingerprints (CSF) help identify how AI model biases change depending on the context of the input, like when a prompt mentions a specific time or place. This tool shows that models might seem fair in simple tests but can become biased in real-world situations, offering a more thorough way to check for fairness.
CSF
Was this definition helpful?