SALMONN: Towards generic hearing abilities for large language models C Tang, W Yu, G Sun, X Chen, T Tan, W Li, L Lu, Z Ma, C Zhang arXiv preprint arXiv:2310.13289, 2023 | 261 | 2023 |
A novel robust detection algorithm for spectrum sensing L Lu, HC Wu, SS Iyengar IEEE Journal on selected areas in communications 29 (2), 305-315, 2011 | 76 | 2011 |
Novel Robust Direction-of-Arrival-Based Source Localization Algorithm for Wideband Signals L Lu, H Wu IEEE, 2012 | 68 | 2012 |
Seed-TTS: A family of high-quality versatile speech generation models P Anastassiou, J Chen, J Chen, Y Chen, Z Chen, Z Chen, J Cong, L Deng, ... arXiv preprint arXiv:2406.02430, 2024 | 61 | 2024 |
Connecting speech encoder and large language model for ASR W Yu, C Tang, G Sun, X Chen, T Tan, W Li, L Lu, Z Ma, C Zhang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 54 | 2024 |
Novel energy-based localization technique for multiple sources L Lu, H Zhang, HC Wu IEEE Systems Journal 8 (1), 142-150, 2013 | 37 | 2013 |
Robust expectation-maximization algorithm for multiple wideband acoustic source localization in the presence of nonuniform noise variances L Lu, HC Wu, K Yan, SS Iyengar IEEE Sensors Journal 11 (3), 536-544, 2010 | 37 | 2010 |
Unleashing infinite-length input capacity for large-scale language models with self-controlled memory system X Liang, B Wang, H Huang, S Wu, P Wu, L Lu, Z Ma, Z Li arXiv preprint arXiv:2304.13343, 2023 | 30 | 2023 |
Robust expectation–maximization direction-of-arrival estimation algorithm for wideband source signals L Lu, HC Wu IEEE Transactions on Vehicular Technology 60 (5), 2395-2400, 2011 | 26 | 2011 |
PolyVoice: Language models for speech to speech translation Q Dong, Z Huang, Q Tian, C Xu, T Ko, Y Zhao, S Feng, T Li, K Wang, ... arXiv preprint arXiv:2306.02982, 2023 | 24 | 2023 |
Systems and methods for accessing codewords in parallel using a three sensor reader L Lu, H Xia US Patent 9,007,707, 2015 | 24 | 2015 |
Systems and methods for three reader storage access L Lu, H Xia US Patent 9,245,580, 2016 | 22 | 2016 |
Adaptive cooperative spectrum sensing based on a novel robust detection algorithm H Zhang, HC Wu, L Lu, SS Iyengar 2012 IEEE International Conference on Communications (ICC), 3511-3515, 2012 | 21 | 2012 |
video-SALMONN: Speech-enhanced audio-visual large language models G Sun, W Yu, C Tang, X Chen, T Tan, W Li, L Lu, Z Ma, Y Wang, C Zhang arXiv preprint arXiv:2406.15704, 2024 | 20 | 2024 |
Seed-ASR: Understanding diverse speech and contexts with LLM-based speech recognition Y Bai, J Chen, J Chen, W Chen, Z Chen, C Ding, L Dong, Q Dong, Y Du, ... arXiv preprint arXiv:2407.04675, 2024 | 16 | 2024 |
SD-Eval: A benchmark dataset for spoken dialogue understanding beyond words J Ao, Y Wang, X Tian, D Chen, J Zhang, L Lu, Y Wang, H Li, Z Wu arXiv preprint arXiv:2406.13340, 2024 | 16 | 2024 |
Systems and methods for multi-head coefficient based scaling AR Nayak, L Lu, R Rauschmayer, H Xia US Patent 9,542,972, 2017 | 16 | 2017 |
Analysis and algorithm for robust adaptive cooperative spectrum-sensing H Zhang, HC Wu, L Lu IEEE Transactions on Wireless Communications 13 (2), 618-629, 2014 | 14 | 2014 |
Spatial attention for far-field speech recognition with deep beamforming neural networks W He, L Lu, B Zhang, J Mahadeokar, K Kalgaonkar, C Fuegen ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 13 | 2020 |
Fine-grained audio-visual joint representations for multimodal large language models G Sun, W Yu, C Tang, X Chen, T Tan, W Li, L Lu, Z Ma, C Zhang arXiv preprint arXiv:2310.05863, 2023 | 12 | 2023 |