How can Omanic's structured annotations enhance the assessment of LLM reasoning capabilities?Answer not yet generated.