Obserwuj
Zhenyu Tang
Zhenyu Tang
Senior Research Scientist at Meta; PhD, University of Maryland
Zweryfikowany adres z cs.umd.edu - Strona główna
Tytuł
Cytowane przez
Cytowane przez
Rok
FAST-RIR: Fast neural diffuse room impulse response generator
A Ratnarajah, SX Zhang, M Yu, Z Tang, D Manocha, D Yu
ICASSP 2022, 2022
722022
IR-GAN: Room impulse response generator for far-field speech recognition
A Ratnarajah, Z Tang, D Manocha
Interspeech 2021, 286-290, 2021
702021
Regression and Classification for Direction-of-Arrival Estimation with Convolutional Recurrent Neural Networks
Z Tang, JD Kanu, K Hogan, D Manocha
Interspeech 2019, 654--658, 0
60*
Scene-Aware Audio Rendering via Deep Acoustic Analysis
Z Tang, NJ Bryan, D Li, TR Langlois, D Manocha
IEEE Transactions on Visualization and Computer Graphics, 2020
482020
Improving reverberant speech training using diffuse acoustic simulation
Z Tang, L Chen, B Wu, D Yu, D Manocha
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
452020
MESH2IR: Neural Acoustic Impulse Response Generator for Complex 3D Scenes
A Ratnarajah, Z Tang, RC Aralikatti, D Manocha
ACM Multimedia, 2022
352022
GWA: A Large High-Quality Acoustic Dataset for Audio Processing
Z Tang, R Aralikatti, A Ratnarajah, D Manocha
SIGGRAPH Conference Proceedings 2022, 2022
322022
Ts-rir: Translated synthetic room impulse responses for speech augmentation
A Ratnarajah, Z Tang, D Manocha
2021 IEEE automatic speech recognition and understanding workshop (ASRU …, 2021
252021
Learning Acoustic Scattering Fields for Dynamic Interactive Sound Propagation
Z Tang, HY Meng, D Manocha
IEEE Conference on Virtual Reality and 3D User Interfaces (IEEE VR), 2021
212021
Low-frequency compensated synthetic impulse responses for improved far-field speech recognition
Z Tang, HY Meng, D Manocha
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
152020
Receiver placement for speech enhancement using sound propagation optimization
N Morales, Z Tang, D Manocha
Applied Acoustics 155, 53-62, 2019
142019
HeadPager: page turning with computer vision based head interaction
Z Tang, C Yan, S Ren, H Wan
Computer Vision–ACCV 2016 Workshops: ACCV 2016 International Workshops …, 2017
122017
Improving reverberant speech separation with synthetic room impulse responses
R Aralikatti, A Ratnarajah, Z Tang, D Manocha
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
11*2021
Point-based Acoustic Scattering for Interactive Sound Propagation via Surface Encoding
HY Meng, Z Tang, D Manocha
Proceedings of the Thirtieth International Joint Conference on Artificial …, 2021
72021
Scene-aware sound rendering in virtual and real worlds
Z Tang, D Manocha
2020 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and …, 2020
52020
Dynamic sound field synthesis for speech and music optimization
Z Tang, N Morales, D Manocha
Proceedings of the 26th ACM international conference on Multimedia, 1901-1909, 2018
52018
VoiceShop: A Unified Speech-to-Speech Framework for Identity-Preserving Zero-Shot Voice Editing
P Anastassiou, Z Tang, K Peng, D Jia, J Li, M Tu, Y Wang, Y Wang, M Ma
arXiv preprint arXiv:2404.06674, 2024
42024
Synthetic wave-geometric impulse responses for improved speech dereverberation
R Aralikatti, Z Tang, D Manocha
arXiv preprint arXiv:2212.05360, 2022
32022
Scene-aware far-field automatic speech recognition
Z Tang, D Manocha
US Patent App. 17/723,339, 2022
32022
Online self-attentive gated RNNs for real-time speaker separation
O Kabeli, Y Adi, Z Tang, B Xu, A Kumar
arXiv preprint arXiv:2106.13493, 2021
22021
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–20