Audio samples from paper "Neural Multi-Channel and Multi-Microphone Acoustic Echo Cancellation"
Authors: Chenggang Zhang, Jinjiang Liu, Hao Li and Xueliang Zhang
Multi-Channel Multi-Microphone (MCMM) AEC results at SER = -5 dB.
Simulation room scenario
Signals
Sample 1
Sample 2
Sample 3
Sample 4
Reference microphone signal
Target signal
Yang [1]
Cheng et al. [2]
Zhang [3]
ICRN-S
ICRN
Near-end signal is speech, while far-end signal is music
Signals
Sample 1
Sample 2
Sample 3
Sample 4
Reference microphone signal
Target signal
Yang [1]
Cheng et al. [2]
Zhang [3]
ICRN-S
ICRN
Near-end signal is music, while far-end signal is speech
Signals
Sample 1
Sample 2
Sample 3
Sample 4
Reference microphone signal
Target signal
Yang [1]
Cheng et al. [2]
Zhang [3]
ICRN-S
ICRN
Real record stereo signal using a laptop equipped with two loudspeakers and two microphones. This recording includes single-talk (the reference signal is a stereo song),
double-talk (four speeches by the near-end speaker), and the echo path is changed scenarios.
Signals
Sample
Reference microphone signal
ICRN-S estimated signal
References
[1]Feiran Yang, Ming Wu, and Jun Yang. "Stereophonic acoustic echo suppression based on Wiener filter in the short-time fourier transform domain." IEEE Signal Processing Letters 19.4 (2012): 227-230.
[2]Linjuan Cheng, et al. "Deep learning-based stereophonic acoustic echo suppression without decorrelation." The Journal of the Acoustical Society of America 150.2 (2021): 816-829.
[3]Hao Zhang, and DeLiang Wang. "A Deep Learning Approach to Multi-Channel and Multi-Microphone Acoustic Echo Cancellation." Proc. Interspeech 2021 (2021): 1139-1143.