DeepASMR-DB is a significant contribution to the field of speech synthesis, specifically designed to address the challenges of generating Autonomous Sensory Meridian Response (ASMR) speech. It is a comprehensive, multi-speaker speech corpus spanning 670 hours of audio in both English and Chinese. The primary purpose of DeepASMR-DB is to serve as a foundational dataset for training and evaluating advanced ASMR generation systems, such as the DeepASMR framework. It provides the diverse and specialized data needed to develop models capable of zero-shot speaker adaptation, meaning they can synthesize ASMR in a target speaker's voice using only a short snippet of their ordinary speech. This corpus is vital for researchers and ML engineers working on Text-to-Speech (TTS) systems, particularly those focused on specialized speech styles, low-resource scenarios, and personalized audio experiences, enabling the creation of high-fidelity, nuanced ASMR content.
DeepASMR-DB is a large collection of ASMR speech recordings in English and Chinese, totaling 670 hours. It's used to train AI systems, like DeepASMR, to create realistic ASMR sounds in different voices, even if the AI hasn't heard that voice make ASMR before.
ASMR speech corpus, DeepASMR corpus
Was this definition helpful?