Hit-RAG: Learning to Reason with Long Contexts via Preference Alignment | ScienceToStartup | ScienceToStartup