AI RESEARCH
Pairwise Reference Alignment as a Model-Level Ordinal Observable
arXiv CS.LG
•
ArXi:2605.30758v1 Announce Type: cross Pairwise preference data is widely used in language-model evaluation and alignment, often for model ranking, reward modeling, or preference optimization. This note formulates a basic measurement question: given a reference distribution of pairwise preferences, what model-level quantity is estimated when we test whether a model ranks preferred responses above rejected responses? We define pairwise reference alignment as an ordinal observable induced by a model scoring function.