Theo dõi
Xinyu Li
Xinyu Li
Amazon AGI
Email được xác minh tại amazon.com - Trang chủ
Tiêu đề
Trích dẫn bởi
Trích dẫn bởi
Năm
VidTr: Video Transformer Without Convolutions
Y Zhang, X Li, C Liu, B Shuai, Y Zhu, B Brattoli, H Chen, I Marsic, J Tighe
2021 IEEE/CVF International Conference on Computer Vision (ICCV), 13557-13567, 2021
2472021
A comprehensive study of deep video action recognition
Y Zhu, X Li, C Liu, M Zolfaghari, Y Xiong, C Wu, Z Zhang, J Tighe, ...
arXiv preprint arXiv:2012.06567, 2020
2472020
SiamMOT: Siamese Multi-Object Tracking
B Shuai, A Berneshawi, X Li, D Modolo, J Tighe
arXiv preprint arXiv:2105.11595, 2021
1852021
Deep Learning for RFID-Based Activity Recognition
X Li, Y Zhang, I Marsic, A Sarcevic, RS Burd
The 14th ACM Conference on Embedded Networked Sensor Systems (SenSys 2016), 2016
1772016
Multimodal affective analysis using hierarchical attention strategy with word-level alignment
Y Gu, K Yang, S Fu, S Chen, X Li, I Marsic
Proceedings of the conference. Association for Computational Linguistics …, 2018
1702018
Long short-term transformer for online action detection
M Xu, Y Xiong, H Chen, X Li, W Xia, Z Tu, S Soatto
Advances in Neural Information Processing Systems 34, 1086-1099, 2021
1502021
Tuber: Tubelet transformer for video action detection
J Zhao, Y Zhang, X Li, H Chen, B Shuai, M Xu, C Liu, K Kundu, Y Xiong, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
1152022
Multi-stream network with temporal attention for environmental sound classification
X Li, V Chebiyyam, K Kirchhoff
arXiv preprint arXiv:1901.08608, 2019
902019
Video contrastive learning with global context
H Kuang, Y Zhu, Z Zhang, X Li, J Tighe, S Schwertfeger, C Stachniss, M Li
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
782021
Hybrid attention based multimodal network for spoken language classification
Y Gu, K Yang, S Fu, S Chen, X Li, I Marsic
Proceedings of the Conference. association for Computational Linguistics …, 2018
732018
Speech intention classification with multimodal deep learning
Y Gu, X Li, S Chen, J Zhang, I Marsic
Advances in Artificial Intelligence: 30th Canadian Conference on Artificial …, 2017
652017
What to look at and where: Semantic and spatial refined transformer for detecting human-object interactions
ASM Iftekhar, H Chen, K Kundu, X Li, J Tighe, D Modolo
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
642022
Directional temporal modeling for action recognition
X Li, B Shuai, J Tighe
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
612020
Concurrent activity recognition with multimodal CNN-LSTM structure
X Li, Y Zhang, J Zhang, S Chen, I Marsic, RA Farneth, RS Burd
arXiv preprint arXiv:1702.01638, 2017
562017
Revisiting multimodal representation in contrastive learning: from patch and token embeddings to finite discrete tokens
Y Chen, J Yuan, Y Tian, S Geng, X Li, D Zhou, DN Metaxas, H Yang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
412023
Human conversation analysis using attentive multimodal networks with hierarchical encoder-decoder
Y Gu, X Li, K Huang, S Fu, K Yang, S Chen, M Zhou, I Marsic
Proceedings of the 26th ACM international conference on Multimedia, 537-545, 2018
412018
Activity Recognition for Medical Teamwork Based on Passive RFID
X Li, D Yao, X Pan, J Johannaman, JW Yang, R Webman, A Sarcevic, ...
2016 IEEE International Conference on RFID (RFID), 1-9, 2016
412016
Mutual correlation attentive factors in dyadic fusion networks for speech emotion recognition
Y Gu, X Lyu, W Sun, W Li, S Chen, X Li, I Marsic
Proceedings of the 27th ACM International Conference on Multimedia, 157-166, 2019
342019
Cat: Causal audio transformer for audio classification
X Liu, H Lu, J Yuan, X Li
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
312023
Deep neural network for RFID-based activity recognition
X Li, Y Zhang, M Li, I Marsic, JW Yang, RS Burd
Proceedings of the Eighth Wireless of the Students, by the Students, and for …, 2016
312016
Hệ thống không thể thực hiện thao tác ngay bây giờ. Hãy thử lại sau.
Bài viết 1–20