AI RESEARCH

Multi-Channel Replay Speech Detection using Acoustic Maps

arXiv CS.LG

ArXi:2602.16399v2 Announce Type: replace-cross Replay attacks remain a critical vulnerability for automatic speaker verification systems, particularly in real-time voice assistant applications. In this work, we propose acoustic maps as a novel spatial feature representation for replay speech detection from multi-channel recordings. Derived from classical beamforming over discrete azimuth and elevation grids, acoustic maps encode directional energy distributions that reflect physical differences between human speech radiation and loudspeaker-based replay.