Author

Xiong-Xiao

Principal Applied Scientist, Microsoft - Cited by 3,834 - Deep learning based signal processing - speech recognition - keyword search.

Biography

Dr. Xiong-Xiao is currently working in the Department of Respiratory, Xiamen Branch of Zhongshan Hospital Affiliated to Fudan University, Xiamen, Fujian, China. He has published numerous research papers and articles in reputed journals and has various other achievements in related studies. He has extended his valuable service to the scientific community through his extensive research work.
Title
Cited by
Year
WavLM: Large-scale self-supervised pre-training for full stack speech processing
S Chen, C Wang, Z Chen, Y Wu, S Liu, Z Chen, J Li, N Kanda, T Yoshioka, ...
IEEE Journal of Selected Topics in Signal Processing 16 (6), 1505-1518, 2022
456
2022
Continuous speech separation: Dataset and analysis
Z Chen, T Yoshioka, L Lu, T Zhou, Z Meng, Y Luo, J Wu, X Xiao, J Li
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
147
2020
Multi-channel overlapped speech recognition with location guided speech extraction network
Z Chen, X Xiao, T Yoshioka, H Erdogan, J Li, Y Gong
2018 IEEE Spoken Language Technology Workshop (SLT), 558-565, 2018
112
2018
Unified architecture for multichannel end-to-end speech recognition with neural beamforming
T Ochiai, S Watanabe, T Hori, JR Hershey, X Xiao
IEEE Journal of Selected Topics in Signal Processing 11 (8), 1274-12, 2017
88
2017
Recognizing overlapped speech in meetings: A multichannel separation approach using neural networks
T Yoshioka, H Erdogan, Z Chen, X Xiao, F Alleva
arXiv preprint arXiv:1810.03655, 2018
82
2018
Computerized intelligent assistant for conferences
A Diamant, KM Ben-Dor, E Krupka, R Halaly, Y Smolin, I Gurvich, ...
US Patent 10,867,610, 2020
75
2020
Single channel speech separation with constrained utterance level permutation invariant training using grid LSTM
C Xu, W Rao, X Xiao, ES Chng, H Li
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
74
2018
Advances in online audio-visual meeting transcription
T Yoshioka, I Abramovski, C Aksoylar, Z Chen, M David, D Dimitriadis, ...
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
74
2019
Developing far-field speaker system via teacher-student learning
J Li, R Zhao, Z Chen, C Liu, X Xiao, G Ye, Y Gong
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
62
2018
Single-channel speech extraction using speaker inventory and attention network
X Xiao, Z Chen, T Yoshioka, H Erdogan, C Liu, D Dimitriadis, J Droppo, ...
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
60
2019
Microsoft speaker diarization system for the VoxCeleb speaker recognition challenge 2020
X Xiao, N Kanda, Z Chen, T Zhou, T Yoshioka, S Chen, Y Zhao, G Liu, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
55
2021
Cracking the cocktail party problem by multi-beam deep attractor network
Z Chen, J Li, X Xiao, T Yoshioka, H Wang, Z Wang, Y Gong
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017
45
2017
Multi-channel speech separation
Z Chen, J Li, X Xiao, T Yoshioka, H Wang, Z Wang, Y Gong
US Patent 10,8,822, 2020
39
2020
Speech separation using speaker inventory
P Wang, Z Chen, X Xiao, Z Meng, T Yoshioka, T Zhou, L Lu, J Li
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
29
2019
Multi-microphone speech separation
Z Chen, H Erdogan, T Yoshioka, FA Alleva, X Xiao
US Patent 10,957,337, 2021
29
2021
Efficient integration of fixed beamformers and speech separation networks for multi-channel far-field speech separation
Z Chen, T Yoshioka, X Xiao, L Li, ML Seltzer, Y Gong
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
29
2018
Low-latency speaker-independent continuous speech separation
T Yoshioka, Z Chen, C Liu, X Xiao, H Erdogan, D Dimitriadis
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
25
2019
A bidirectional LSTM approach with word embeddings for sentence boundary detection
C Xu, L Xie, X Xiao
Journal of Signal Processing Systems 90, 1063-1075, 2018
21
2018
Streaming multi-talker ASR with token-level serialized output training
N Kanda, J Wu, Y Wu, X Xiao, Z Meng, X Wang, Y Gaur, Z Chen, J Li, ...
arXiv preprint arXiv:2202.00842, 2022
18
2022
PyKaldi2: Yet another speech toolkit based on Kaldi and PyTorch
L Lu, X Xiao, Z Chen, Y Gong
arXiv preprint arXiv:1907.05955, 2019
18
2019