AI RESEARCH

Online 3D Multi-Camera Perception through Robust 2D Tracking and Depth-based Late Aggregation

arXiv CS.CV

arXiv:2509.09946v2 Announce Type: replace Multi-Target Multi-Camera Tracking (MTMC) is an essential computer vision task for automating large-scale surveillance. With camera calibration and depth information, the targets in the scene can be projected into 3D space, offering unparalleled levels of automatic perception of a 3D environment. However, tracking in 3D space requires rebuilding all 2D tracking components from the ground up, which may be infeasible for existing MTMC systems.
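To make the projection step concrete, here is a minimal sketch of lifting a 2D detection into 3D with calibration and depth. It assumes a standard pinhole camera model with depth measured along the optical axis. It is not taken from the paper: the function names `backproject_pixel` and `camera_to_world`, the intrinsics (`fx`, `fy`, `cx`, `cy`), and the camera-to-world pose (`R`, `t`) are all hypothetical, chosen only for illustration.

```python
from typing import Sequence, Tuple

Vec3 = Tuple[float, float, float]


def backproject_pixel(u: float, v: float, depth: float,
                      fx: float, fy: float, cx: float, cy: float) -> Vec3:
    """Lift pixel (u, v) with metric depth into camera coordinates.

    Assumes a pinhole model: fx, fy are focal lengths in pixels and
    (cx, cy) is the principal point. Depth is taken along the optical axis.
    """
    x = (u - cx) * depth / fx
    y = (v - cy) * depth / fy
    return (x, y, depth)


def camera_to_world(p_cam: Vec3,
                    R: Sequence[Sequence[float]],
                    t: Sequence[float]) -> Vec3:
    """Apply X_world = R @ X_cam + t.

    R is an assumed 3x3 camera-to-world rotation (list of rows) and
    t is the camera position in world coordinates.
    """
    return tuple(
        sum(R[i][j] * p_cam[j] for j in range(3)) + t[i]
        for i in range(3)
    )


# Hypothetical example: a target's image point at pixel (1160, 540),
# 5 m from a camera placed 10 m along the world x-axis, axes aligned.
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
p_cam = backproject_pixel(1160, 540, 5.0, fx=1000.0, fy=1000.0, cx=960.0, cy=540.0)
p_world = camera_to_world(p_cam, identity, [10.0, 0.0, 0.0])
print(p_cam)    # (1.0, 0.0, 5.0)
print(p_world)  # (11.0, 0.0, 5.0)
```

Once each camera's detections are expressed in a shared world frame like this, targets seen from different views can be compared by 3D position, which is the general idea behind aggregating 2D tracks after the fact rather than tracking in 3D from the start.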