How does grounding dialogue synthesis in reasoning scenarios improve model evaluation?

Answer not yet generated.