Diverse reasoning traces teach LLMs to make better decisions

Amazon Science

How to train language models to generate diverse, accurate reasoning paths using tokens that control distinct reasoning strategies.