On Information Self-Locking in Reinforcement Learning for Active Reasoning of LLM agents | ScienceToStartup | ScienceToStartup