AI RESEARCH
Diverse reasoning traces teach LLMs to make better decisions
Amazon Science
•
How to train language models to generate diverse, accurate reasoning paths using tokens that control distinct reasoning strategies.