What are the trade-offs between interpretability and performance in LLM alignment?Answer not yet generated.