Improve your agent’s tool-calling accuracy with SFT and DPO on Amazon SageMaker AI

About This Tutorial

In this post, you learn how to use Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO) together to improve the tool-calling accuracy of a small language model (SLM). The example uses Amazon SageMaker jobs, so you can focus on