Joint time-frequency and time domain learning for speech enhancement C Tang, C Luo, Z Zhao, W Xie, W Zeng Proceedings of the twenty-ninth international conference on international …, 2021 | 81 | 2021 |
Pinnsformer: A transformer-based framework for physics-informed neural networks Z Zhao, X Ding, BA Prakash arXiv preprint arXiv:2307.11833, 2023 | 44 | 2023 |
Art-v: Auto-regressive text-to-video generation with diffusion models W Weng, R Feng, Y Wang, Q Dai, C Wang, D Yin, Z Zhao, K Qiu, J Bao, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 29 | 2024 |
Lstprompt: Large language models as zero-shot time series forecasters by long-short-term prompting H Liu, Z Zhao, J Wang, H Kamarthi, BA Prakash arXiv preprint arXiv:2402.16132, 2024 | 24 | 2024 |
Tsi-bench: Benchmarking time series imputation W Du, J Wang, L Qian, Y Yang, Z Ibrahim, F Liu, Z Wang, H Liu, Z Zhao, ... arXiv preprint arXiv:2406.12747, 2024 | 17 | 2024 |
TridentSE: Guiding speech enhancement with 32 global tokens D Yin, Z Zhao, C Tang, Z Xiong, C Luo arXiv preprint arXiv:2210.12995, 2022 | 15 | 2022 |
Microcinema: A divide-and-conquer approach for text-to-video generation Y Wang, J Bao, W Weng, R Feng, D Yin, T Yang, J Zhang, Q Dai, Z Zhao, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 14 | 2024 |
RetrieverTTS: Modeling decomposed factors for text-based speech insertion D Yin, C Tang, Y Liu, X Wang, Z Zhao, Y Zhao, Z Xiong, S Zhao, C Luo arXiv preprint arXiv:2206.13865, 2022 | 12 | 2022 |
Zero-shot text-to-speech for text-based insertion in audio narration C Tang, C Luo, Z Zhao, D Yin, Y Zhao, W Zeng arXiv preprint arXiv:2109.05426, 2021 | 9 | 2021 |
General-purpose speech representation learning through a self-supervised multi-granularity framework Y Zhao, D Yin, C Luo, Z Zhao, C Tang, W Zeng, ZJ Zha arXiv preprint arXiv:2102.01930, 2021 | 8 | 2021 |
Performative time-series forecasting Z Zhao, A Rodriguez, BA Prakash arXiv preprint arXiv:2310.06077, 2023 | 4 | 2023 |
Time-mmd: A new multi-domain multimodal dataset for time series analysis H Liu, S Xu, Z Zhao, L Kong, H Kamarthi, AB Sasanur, M Sharma, J Cui, ... arXiv preprint arXiv:2406.08627, 2024 | 3 | 2024 |
Time-series forecasting for out-of-distribution generalization using invariant learning H Liu, H Kamarthi, L Kong, Z Zhao, C Zhang, BA Prakash arXiv preprint arXiv:2406.09130, 2024 | 2 | 2024 |
An anchor-free detector for continuous speech keyword spotting Z Zhao, C Tang, C Yao, C Luo arXiv preprint arXiv:2208.04622, 2022 | 2 | 2022 |
Time-mmd: Multi-domain multimodal dataset for time series analysis H Liu, S Xu, Z Zhao, L Kong, H Prabhakar Kamarthi, A Sasanur, ... Advances in Neural Information Processing Systems 37, 77888-77933, 2024 | 1 | 2024 |
Speech enhancement T Chuanxin, Z Zhao, C Luo, W Zeng US Patent App. 17/927,861, 2023 | | 2023 |
Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss Z Zhao, L Wu, C Tang, D Yin, Y Zhao, C Luo ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | | 2023 |
ART• V: Auto-Regressive Text-to-Video Generation with Diffusion Models Supplementary Material W Weng, R Feng, Y Wang, Q Dai, C Wang, D Yin, Z Zhao, K Qiu, J Bao, ... context 1024, 1024, 0 | | |