Multi-Persona Thinking (MPT) is an inference-time framework for Large Language Models (LLMs) that reduces social biases. It guides models to adopt contrasting social identities and a neutral viewpoint, using iterative dialectical reasoning to expose and correct biases.
Multi-Persona Thinking (MPT) is a new method that helps large AI models reduce their social biases. It works by having the AI consider a problem from several different viewpoints, including contrasting social identities and a neutral stance, then using this multi-perspective thinking to correct biased outputs. This approach significantly lowers bias while keeping the AI's ability to reason intact.
MPT, Multi-Persona Prompting, Dialectical Bias Mitigation
Was this definition helpful?