Obserwuj
Xingjian Du (杜行健)
Xingjian Du (杜行健)
University of Rochester | Bytedance AI Lab
Zweryfikowany adres z bytedance.com
Tytuł
Cytowane przez
Cytowane przez
Rok
RWKV: Reinventing RNNs for the Transformer Era
P Bo, A Eric, A Quentin, A Alon, A Samuel, B Stella, C Huanqi, C Xin, ...
Findings of the Association for Computational Linguistics: EMNLP 2023, 14048 …, 2023
502*2023
Hts-at: A hierarchical token-semantic audio transformer for sound classification and detection
SD Chen Ke, Xingjian Du, Bilei Zhu, Zejun Ma, Taylor Berg-Kirkpatrick
ICASSP 2022, 646-650, 2022
344*2022
Eagle and finch: Rwkv with matrix-valued states and dynamic recurrence
B Peng, D Goldstein, Q Anthony, A Albalak, E Alcaide, S Biderman, ...
arXiv preprint arXiv:2404.05892 3, 2024
512024
Zero-shot audio source separation through query-based learning from weakly-labeled data
K Chen, X Du, B Zhu, Z Ma, T Berg-Kirkpatrick, S Dubnov
Proceedings of the AAAI Conference on Artificial Intelligence 36 (4), 4441-4449, 2022
412022
Bytecover: Cover song identification via multi-loss training
X Du, Z Yu, B Zhu, X Chen, Z Ma
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
332021
Graph contrastive learning with implicit augmentations
H Liang, X Du, B Zhu, Z Ma, K Chen, J Gao
Neural Networks 163, 156-164, 2023
302023
Bytecover2: Towards dimensionality reduction of latent embedding for efficient cover song identification
X Du, K Chen, Z Wang, B Zhu, Z Ma
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
282022
Aag-stega: Automatic audio generation-based steganography
Z Yang, X Du, Y Tan, Y Huang, YJ Zhang
arXiv preprint arXiv:1809.03463, 2018
232018
Universal source separation with weakly labelled data
Q Kong, K Chen, H Liu, X Du, T Berg-Kirkpatrick, S Dubnov, MD Plumbley
arXiv preprint arXiv:2305.07447, 2023
192023
Speech enhancement with weakly labelled data from audioset
Q Kong, H Liu, X Du, L Chen, R Xia, Y Wang
arXiv preprint arXiv:2102.09971, 2021
192021
Foundation models for music: A survey
Y Ma, A Øland, A Ragni, BMS Del Sette, C Saitis, C Donahue, C Lin, ...
arXiv preprint arXiv:2408.14340, 2024
142024
Contrastive unsupervised learning for audio fingerprinting
Z Yu, X Du, B Zhu, Z Ma
arXiv preprint arXiv:2010.13540, 2020
122020
CatNet: Music source separation system with mix-audio augmentation
X Song, Q Kong, X Du, Y Wang
arXiv preprint arXiv:2102.09966, 2021
112021
Repgn: Object detection with relational proposal graph network
X Du, X Shi, R Huang
arXiv preprint arXiv:1904.08959, 2019
102019
Joint music and language attention models for zero-shot music tagging
X Du, Z Yu, J Lin, B Zhu, Q Kong
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
82024
Bytecover3: Accurate cover song identification on short queries
X Du, Z Wang, X Liang, H Liang, B Zhu, Z Ma
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
82023
ICASSP 2022–2022 IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP)
K Chen, X Du, B Zhu, Z Ma, T Berg-Kirkpatrick, S Dubnov
IEEE, Singapore, 2022
82022
End-to-end model for speech enhancement by consistent spectrogram masking
X Du, M Zhu, X Shi, X Zhang, W Zhang, J Chen
arXiv preprint arXiv:1901.00295, 2019
72019
Singing melody extraction from polyphonic music based on spectral correlation modeling
X Du, B Zhu, Q Kong, Z Ma
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
52021
Bytecomposer: a human-like melody composition method based on language model agent
X Liang, X Du, J Lin, P Zou, Y Wan, B Zhu
arXiv preprint arXiv:2402.17785, 2024
42024
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–20