Xinyu Li

Cited by

	All	Since 2020
Citations	2351	2134
h-index	23	23
i10-index	38	33

Citations per year

Year	Citations
2016	6
2017	38
2018	65
2019	103
2020	150
2021	230
2022	474
2023	581
2024	606
2025	92

Public access

18 articles available
1 article not available

Based on funding mandates

Co-authors

Ivan Marsic	Rutgers University, Department of Electrical and Computer Engineering	Verified email at rutgers.edu
Joseph Tighe	Amazon	Verified email at cs.unc.edu
Bing Shuai	Research scientist at Amazon	Verified email at amazon.com
Yuanjun Xiong	Adobe Firefly	Verified email at adobe.com
Davide Modolo	Senior Manager of Applied Science, Amazon AGI	Verified email at amazon.com
Yi Zhu	Boson AI	Verified email at ucmerced.edu
David Fan	Meta FAIR Labs	Verified email at cs.princeton.edu
Wei Xia 夏威	Sr. Principal Scientist of AWS AI	Verified email at amazon.com
Jue Wang	Amazon AGI, ANU	Verified email at amazon.com

Xinyu Li

Amazon AGI

Verified email at amazon.com

Video understanding, Multimedia understanding, AGI


Title	Authors	Venue	Cited by	Year
VidTr: Video Transformer Without Convolutions	Y Zhang, X Li, C Liu, B Shuai, Y Zhu, B Brattoli, H Chen, I Marsic, J Tighe	2021 IEEE/CVF International Conference on Computer Vision (ICCV), 13557-13567, 2021	247	2021
A comprehensive study of deep video action recognition	Y Zhu, X Li, C Liu, M Zolfaghari, Y Xiong, C Wu, Z Zhang, J Tighe, ...	arXiv preprint arXiv:2012.06567, 2020	247	2020
SiamMOT: Siamese Multi-Object Tracking	B Shuai, A Berneshawi, X Li, D Modolo, J Tighe	arXiv preprint arXiv:2105.11595, 2021	185	2021
Deep Learning for RFID-Based Activity Recognition	X Li, Y Zhang, I Marsic, A Sarcevic, RS Burd	The 14th ACM Conference on Embedded Networked Sensor Systems (SenSys 2016), 2016	177	2016
Multimodal affective analysis using hierarchical attention strategy with word-level alignment	Y Gu, K Yang, S Fu, S Chen, X Li, I Marsic	Proceedings of the conference. Association for Computational Linguistics …, 2018	170	2018
Long short-term transformer for online action detection	M Xu, Y Xiong, H Chen, X Li, W Xia, Z Tu, S Soatto	Advances in Neural Information Processing Systems 34, 1086-1099, 2021	150	2021
Tuber: Tubelet transformer for video action detection	J Zhao, Y Zhang, X Li, H Chen, B Shuai, M Xu, C Liu, K Kundu, Y Xiong, ...	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	115	2022
Multi-stream network with temporal attention for environmental sound classification	X Li, V Chebiyyam, K Kirchhoff	arXiv preprint arXiv:1901.08608, 2019	90	2019
Video contrastive learning with global context	H Kuang, Y Zhu, Z Zhang, X Li, J Tighe, S Schwertfeger, C Stachniss, M Li	Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021	78	2021
Hybrid attention based multimodal network for spoken language classification	Y Gu, K Yang, S Fu, S Chen, X Li, I Marsic	Proceedings of the Conference. Association for Computational Linguistics …, 2018	73	2018
Speech intention classification with multimodal deep learning	Y Gu, X Li, S Chen, J Zhang, I Marsic	Advances in Artificial Intelligence: 30th Canadian Conference on Artificial …, 2017	65	2017
What to look at and where: Semantic and spatial refined transformer for detecting human-object interactions	ASM Iftekhar, H Chen, K Kundu, X Li, J Tighe, D Modolo	Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022	64	2022
Directional temporal modeling for action recognition	X Li, B Shuai, J Tighe	Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020	61	2020
Concurrent activity recognition with multimodal CNN-LSTM structure	X Li, Y Zhang, J Zhang, S Chen, I Marsic, RA Farneth, RS Burd	arXiv preprint arXiv:1702.01638, 2017	56	2017
Revisiting multimodal representation in contrastive learning: from patch and token embeddings to finite discrete tokens	Y Chen, J Yuan, Y Tian, S Geng, X Li, D Zhou, DN Metaxas, H Yang	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	41	2023
Human conversation analysis using attentive multimodal networks with hierarchical encoder-decoder	Y Gu, X Li, K Huang, S Fu, K Yang, S Chen, M Zhou, I Marsic	Proceedings of the 26th ACM international conference on Multimedia, 537-545, 2018	41	2018
Activity Recognition for Medical Teamwork Based on Passive RFID	X Li, D Yao, X Pan, J Johannaman, JW Yang, R Webman, A Sarcevic, ...	2016 IEEE International Conference on RFID (RFID), 1-9, 2016	41	2016
Mutual correlation attentive factors in dyadic fusion networks for speech emotion recognition	Y Gu, X Lyu, W Sun, W Li, S Chen, X Li, I Marsic	Proceedings of the 27th ACM International Conference on Multimedia, 157-166, 2019	34	2019
Cat: Causal audio transformer for audio classification	X Liu, H Lu, J Yuan, X Li	ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	31	2023
Deep neural network for RFID-based activity recognition	X Li, Y Zhang, M Li, I Marsic, JW Yang, RS Burd	Proceedings of the Eighth Wireless of the Students, by the Students, and for …, 2016	31	2016
