Tetris: Tile-level Sampling for Efficient and High-Fidelity Video Object Tracking
arXiv cs.CV
arXiv:2605.25538v2
Track materialization converts raw video into reusable object tracks that downstream queries can run against without rerunning tracking, but extracting those tracks efficiently and with high fidelity remains expensive. Prior systems reduce cost through temporal frame sampling, which erases the inter-frame motion that fine-grained tracking requires. In stationary video, however, large portions of each frame contain no objects of interest, and the remaining regions tolerate different sampling rates.
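
The observation that different regions of a frame tolerate different sampling rates can be made concrete with a small sketch. The code below is an illustration only and is not the paper's algorithm: the tile grid, the per-tile stride values, and the function names `tiles_to_process` and `processed_fraction` are all hypothetical. It assumes each tile is either skipped entirely (no objects of interest) or processed on every k-th frame, and it counts how much tile-level work remains under that schedule.

```python
from typing import List, Optional, Tuple

# Hypothetical per-tile strides for a 2x3 tile grid (assumed values).
# None  -> the tile never contains objects of interest and is skipped.
# int k -> the tile is processed on every k-th frame.
TileStrides = List[List[Optional[int]]]


def tiles_to_process(strides: TileStrides, frame_idx: int) -> List[Tuple[int, int]]:
    """Return the (row, col) positions of tiles sampled at this frame."""
    return [
        (r, c)
        for r, row in enumerate(strides)
        for c, k in enumerate(row)
        if k is not None and frame_idx % k == 0
    ]


def processed_fraction(strides: TileStrides, num_frames: int) -> float:
    """Fraction of all (tile, frame) pairs that are actually processed."""
    total = num_frames * sum(len(row) for row in strides)
    done = sum(len(tiles_to_process(strides, t)) for t in range(num_frames))
    return done / total


strides: TileStrides = [
    [None, None, 4],  # top row: mostly empty background, one slow-moving region
    [1, 2, None],     # bottom row: one busy tile, one moderate tile, one empty
]

for t in range(4):
    print(t, tiles_to_process(strides, t))
print(processed_fraction(strides, 8))
```

In this toy schedule the busy tile keeps every frame, so its inter-frame motion is preserved, while the empty tiles cost nothing. Uniform temporal frame sampling cannot make that distinction because it applies a single rate to the whole frame.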