Raj, D., Povey, D., Khudanpur, S. (2023) GPU-accelerated Guided Source Separation for Meeting Transcription. Proc. Interspeech 2023, 3507-3511, doi: 10.21437/Interspeech.2023-42
Wu S, Wang C, Chen H, et al. The multimodal information based speech processing (misp) 2023 challenge: Audio-visual target speaker extraction[C]//ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2024: 8351-8355.
Luo L, Li T, Li L, et al. The XMUSpeech system for audio-visual target speaker extraction in MISP 2023 challenge[C]//2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW). IEEE, 2024: 39-40.