Anurag Kumar
Meta Research, Carnegie Mellon University (CMU)
Verified email at ieee.org
Title
Cited by
Year
Ego4d: Around the world in 3,000 hours of egocentric video
K Grauman, A Westbury, E Byrne, Z Chavis, A Furnari, R Girdhar, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 877, Year: 2022
Audio Event Detection using Weakly Labeled Data
A Kumar, B Raj
24th ACM International Conference on Multimedia (ACM MM), 2016
Cited by: 207, Year: 2016
Knowledge Transfer from Weakly Labeled Audio using Convolutional Neural Network for Sound Events and Scenes
A Kumar, M Khadkevich, C Fügen
IEEE International Conference on Acoustics, Speech and Signal Processing …, 2018
Cited by: 174, Year: 2018
Speech Enhancement In Multiple-Noise Conditions using Deep Neural Networks
A Kumar, D Florencio
Interspeech, 2016
Cited by: 146, Year: 2016
Audio event detection from acoustic unit occurrence patterns
A Kumar, P Dighe, R Singh, S Chaudhuri, B Raj
2012 IEEE international conference on acoustics, speech and signal …, 2012
Cited by: 76, Year: 2012
A closer look at weak label learning for audio events
A Shah, A Kumar, AG Hauptmann, B Raj
arXiv preprint arXiv:1804.09288, 2018
Cited by: 67, Year: 2018
Experiments on the DCASE Challenge 2016: Acoustic Scene Classification and Sound Event Detection in Real Life Recording
B Elizalde, A Kumar, A Shah, A Badlani, E Vincent, B Raj, I Lane
Workshop on Detection and Classification of Acoustic Scenes and Events …, 2016
Cited by: 60*, Year: 2016
Content Based Representations Of Audio Using Siamese Neural Networks
P Manocha, R Badlani, A Kumar, A Shah, B Elizalde, B Raj
IEEE International Conference on Acoustics, Speech and Signal Processing …, 2018
Cited by: 59, Year: 2018
Deep CNN framework for audio event recognition using weakly labeled web data
A Kumar, B Raj
NIPS Workshop on Machine Learning for Audio, 2017
Cited by: 56, Year: 2017
Remixit: Continual self-training of speech enhancement models via bootstrapped remixing
E Tzinis, Y Adi, VK Ithapu, B Xu, P Smaragdis, A Kumar
IEEE Journal of Selected Topics in Signal Processing 16 (6), 1329-1341, 2022
Cited by: 54, Year: 2022
Multi-Channel Speech Enhancement using Graph Neural Networks
P Tzirakis, A Kumar, J Donley
IEEE International Conference on Acoustics, Speech and Signal Processing …, 2021
Cited by: 51, Year: 2021
A Sequential Self Teaching Approach for Improving Generalization in Sound Event Recognition
A Kumar, VK Ithapu
International Conference on Machine Learning (ICML), 2020
Cited by: 50, Year: 2020
Audio Event and Scene Recognition: A Unified Approach using Strongly and Weakly Labeled Data
A Kumar, B Raj
International Joint Conference on Neural Networks (IJCNN), 2017
Cited by: 49, Year: 2017
NORESQA: A Framework for Speech Quality Assessment using Non-Matching References
P Manocha, B Xu, A Kumar
Advances in neural information processing systems, 2021
Cited by: 46, Year: 2021
Informedia@ TrecVID 2014: MED and MER
SI Yu, L Jiang, Z Xu, Z Lan, S Xu, X Chang, X Li, Z Mao, C Gan, Y Miao, ...
TREC Video Retrieval Evaluation 2014, 2014
Cited by: 44, Year: 2014
SAGRNN: Self-Attentive Gated RNN for Binaural Speaker Separation with Interaural Cue Preservation
K Tan, B Xu, A Kumar, E Nachmani, Y Adi
IEEE Signal Processing Letters, 2020
Cited by: 43, Year: 2020
TPARN: Triple-path attentive recurrent network for time-domain multichannel speech enhancement
A Pandey, B Xu, A Kumar, J Donley, P Calamia, DL Wang
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 41, Year: 2022
Large Scale Audiovisual Learning of Sounds with Weakly Labeled Data
H Fayek, A Kumar
29th International Joint Conference on Artificial Intelligence (IJCAI), 2020
Cited by: 40, Year: 2020
Torchaudio-squim: Reference-less speech quality and intelligibility measures in torchaudio
A Kumar, K Tan, Z Ni, P Manocha, X Zhang, E Henderson, B Xu
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 37, Year: 2023