Contrastive Attribution in the Wild: An Interpretability Analysis of LLM Failures on Realistic Benchmarks