What are the specific efficiency gains offered by memory-augmented attention in LLMs?Answer not yet generated.