Evaluating Generalization and Robustness in Russian Anti-Spoofing: The RuASD Initiative

digitado ⋅ 7 de April de 2026

arXiv:2604.02374v1 Announce Type: new
Abstract: RuASD (Russian AntiSpoofing Dataset) is a dedicated, reproducible benchmark for Russian-language speech anti-spoofing designed to evaluate both in-domain discrimination and robustness to deployment-style distribution shifts. It combines a large spoof subset synthesized using 37 modern Russian-capable TTS and voice-cloning systems with a bona fide subset curated from multiple heterogeneous open Russian speech corpora, enabling systematic evaluation across diverse data sources. To emulate typical dissemination and channel effects in a controlled and reproducible manner, RuASD includes configurable simulations of platform and transmission distortions, including room reverberation, additive noise/music, and a range of speech-codec transcodings implemented via a unified processing chain. We benchmark a diverse set of publicly available anti-spoofing countermeasures spanning lightweight supervised architectures, graph-attention models, SSL-based detectors, and large-scale pretrained systems, and report reference results on both clean and simulated conditions to characterize robustness under realistic perturbation pipelines. The dataset is publickly available at href{https://huggingface.co/datasets/MTUCI/RuASD}{underline{Hugging Face}} and href{https://modelscope.cn/datasets/lab260/RuASD}{underline{ModelScope}}.

Like 0

Liked Liked