AI RESEARCH

Support-aware offline policy selection for advertising marketplaces

arXiv CS.AI

ArXi:2605.21736v1 Announce Type: cross Logged advertising auctions make offline reserve-price evaluation attractive but risky. Replay tables can identify policies with large apparent yield gains, yet they can also hide weak threshold, multiple-comparison effects, subgroup harm, and bidder-response uncertainty. Existing replay and off-policy evaluation methods estimate or rank policy values, but they do not directly answer the operational question of whether the available evidence is strong enough to justify validation.