What are the specific advantages of Step-Decomposed Influence for understanding attention mechanisms in transformers?Answer not yet generated.