Translation-Aware Contamination Detection is a method designed to identify data contamination in Large Language Models across multilingual contexts. It extends existing techniques like Tested Slot Guessing with choice-reordering and integrates Min-K% probability analysis to capture subtle behavioral and distributional signals, even when translation suppresses conventional indicators.
Translation-Aware Contamination Detection is a new way to find hidden cheating in AI language models, especially when they're used with multiple languages. It helps uncover when a model has secretly memorized test answers, even if those answers were translated, ensuring that we can trust how well these models truly understand and generate language.
TACD, Multilingual Contamination Detection, Cross-lingual Contamination Detection
Was this definition helpful?