In-Context Reinforcement Learning (ICRL) reframes sequential decision-making, enabling systems to learn and adapt from rich trajectory information without explicit model retraining. It treats action sequences and performance feedback as primary learning signals, facilitating efficient experience reuse in complex optimization tasks.
In-Context Reinforcement Learning (ICRL) allows AI systems, especially those powered by large language models, to learn and improve from past experiences without needing to be retrained. It works by feeding sequences of actions and feedback directly into the model's input, enabling it to adapt and make better decisions in complex, iterative tasks like optimizing computer code.
Prompt-based RL, Trajectory-conditioned learning, In-Context Learning for RL, LLM-driven RL
Was this definition helpful?