Your repo is a preference dataset: extracting taste from merge history

r/LocalLLaMA
AI Research

You're spending less time thinking 'Can we build this?' but asking 'Which of all the possibilities should we build?' Now taste bottlenecks execution. And eliciting preferences from experts is expensive but what if you could extract them from the versioned artifacts you've been maintaining all along? Under a mild structural assumption that your team's trajectory of accepted revisions is directionally improving in expectation, you can distill preferences into your agents. Implicit Preference Distillation facilitates cheaply aligning your AI with your institutional practices.