Segui
Jun Chen
Jun Chen
Email verificata su kaust.edu.sa - Home page
Titolo
Citata da
Citata da
Anno
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models
D Zhu*, J Chen*, X Shen, X Li, M Elhoseiny
ICLR 2024, 2023
20562023
MiniGPT-v2: Large Language Model As a Unified Interface For Vision-Language Multi-task Learning
J Chen, D Zhu, X Shen, X Li, Z Liu, P Zhang, R Krishnamoorthi, ...
arXiv preprint arXiv:2310.09478, 2023
4252023
VisualGPT: Data-efficient adaptation of pretrained language models for image captioning
J Chen, H Guo, K Yi, B Li, M Elhoseiny
CVPR 2022, 2021
2282021
Chatgpt asks, blip-2 answers: Automatic questioning towards enriched visual descriptions
D Zhu, J Chen, K Haydarov, X Shen, W Zhang, M Elhoseiny
Transactions on Machine Learning Research (TMLR), 2023
912023
Predicting candidate genes from phenotypes, functions, and anatomical site of expression
J Chen, AT Althagafi, R Hoehndorf
Bioinformatics 2020, 2020
572020
DeepViral: prediction of novel virus-host interactions from protein sequences and infectious disease phenotypes.
W Liu-Wei, S Kafkas, J Chen, NJ Dimonaco, J Tegnér, R Hoehndorf
Bioinformatics 2021, 2021
502021
Exploring long tail visual relationship recognition with large vocabulary
S Abdelkarim, A Agarwal, P Achlioptas, J Chen, J Huang, B Li, K Church, ...
ICCV 2021, 15921-15930, 2021
39*2021
Exploring open-vocabulary semantic segmentation from clip vision encoder distillation only
J Chen, D Zhu, G Qian, B Ghanem, Z Yan, C Zhu, F Xiao, SC Culatana, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision, 699-710, 2023
38*2023
Efficient self-supervised vision pretraining with local masked reconstruction
J Chen, M Hu, B Li, M Elhoseiny
WACV 2025, 2022
372022
Video ChatCaptioner: Towards Enriched Spatiotemporal Descriptions
J Chen, D Zhu, K Haydarov, X Li, M Elhoseiny
arXiv preprint arXiv:2304.04227, 2023
362023
Llm as a robotic brain: Unifying egocentric memory and control
J Mai, J Chen, G Qian, M Elhoseiny, B Ghanem
arXiv, 2023
312023
An introduction to vision-language modeling
F Bordes, RY Pang, A Ajay, AC Li, A Bardes, S Petryk, O Mañas, Z Lin, ...
arXiv preprint arXiv:2405.17247, 2024
292024
RelTransformer: A Transformer-Based Long-Tail Visual Relationship Recognition
J Chen, A Agarwal, S Abdelkarim, D Zhu, M Elhoseiny
CVPR 2022, 19507-19517, 2022
18*2022
MammalNet: A Large-scale Video Benchmark for Mammal Recognition and Behavior Understanding
J Chen, M Hu, DJ Coker, ML Berumen, B Costelloe, S Beery, A Rohrbach, ...
CVPR 2023, 13052-13061, 2023
142023
Temporal Positive-unlabeled Learning for Biomedical Hypothesis Generation via Risk Estimation
U Akujuobi, J Chen, M Elhoseiny, M Spranger, X Zhang
NeurIPS 2020, 2020
132020
Efficient long-distance relation extraction with DG-SpanBERT
J Chen, R Hoehndorf, M Elhoseiny, X Zhang
Technical Report, 2020
102020
Minigpt-med: Large language model as a general interface for radiology diagnosis
A Alkhaldi, R Alnajim, L Alabdullatef, R Alyahya, J Chen, D Zhu, A Alsinan, ...
arXiv, 2024
62024
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding
X Shen, Y Xiong, C Zhao, L Wu, J Chen, C Zhu, Z Liu, F Xiao, ...
arXiv preprint arXiv:2410.17434, 2024
2024
Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time
S Chowdhury, S Nag, S Dasgupta, J Chen, M Elhoseiny, R Gao, ...
ECCV 2024, 2024
2024
Il sistema al momento non può eseguire l'operazione. Riprova più tardi.
Articoli 1–19