Yue Fan
Verified email at ucsc.edu
Title
Cited by
Year
VisDrone-DET2018: The vision meets drone object detection in image challenge results
P Zhu, L Wen, D Du, X Bian, H Ling, Q Hu, Q Nie, H Cheng, C Liu, X Liu, ...
Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2018
169 2018
LLM-Coordination: evaluating and analyzing multi-agent coordination abilities in large language models
S Agashe, Y Fan, A Reyna, XE Wang
Findings of NAACL 2025, 2024
45* 2024
JARVIS: A neuro-symbolic commonsense reasoning framework for conversational embodied agents
K Zheng, K Zhou, J Gu, Y Fan, J Wang, Z Di, X He, XE Wang
arXiv preprint arXiv:2208.13266, 2022
33 2022
Muffin or chihuahua? Challenging multimodal large language models with multipanel VQA
Y Fan, J Gu, K Zhou, Q Yan, S Jiang, CC Kuo, X Guan, XE Wang
arXiv preprint arXiv:2401.15847, 2024
21 2024
Aerial vision-and-dialog navigation
Y Fan, W Chen, T Jiang, C Zhou, Y Zhang, XE Wang
Findings of ACL 2023, 2022
17 2022
Learn by observation: Imitation learning for drone patrolling from videos of a human navigator
Y Fan, S Chu, W Zhang, R Song, Y Li
2020 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2020
13 2020
Read anywhere pointed: Layout-aware GUI screen reading with tree-of-lens grounding
Y Fan, L Ding, CC Kuo, S Jiang, Y Zhao, X Guan, J Yang, Y Zhang, ...
Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024
8 2024
MMWorld: Towards multi-discipline multi-faceted world model evaluation in videos
X He, W Feng, K Zheng, Y Lu, W Zhu, J Li, Y Fan, J Wang, L Li, Z Yang, ...
arXiv preprint arXiv:2406.08407, 2024
7 2024
Athena 3.0: Personalized Multimodal ChatBot with Neuro-Symbolic Dialogue Generators
Y Fan, KK Bowden, W Cui, W Chen, V Harrison, A Ramirez, S Agashe, ...
Alexa Prize SocialBot Grand Challenge 5, 2023
5 2023
R2H: Building Multimodal Navigation Helpers that Respond to Help Requests
Y Fan, J Gu, K Zheng, X Wang
Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023
4 2023
Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models
Q Yan, Y Fan, H Li, S Jiang, Y Zhao, X Guan, CC Kuo, XE Wang
arXiv preprint arXiv:2502.16033, 2025
2025
GUI-Bee: Align GUI Action Grounding to Novel Environments via Autonomous Exploration
Y Fan, H Zhao, R Zhang, Y Shen, XE Wang, G Wu
arXiv preprint arXiv:2501.13896, 2025
2025
Active Listening: Personalized Question Generation in Open-Domain Social Conversation with User Model Based Prompting
K Bowden, Y Fan, W Chen, W Cui, D Harrison, X Wang, M Walker
Findings of the Association for Computational Linguistics: EMNLP 2024, 14120 …, 2024
2024
SlugJARVIS: Multimodal Commonsense Knowledge-based Embodied AI for SimBot Challenge
J Gu, K Zheng, K Zhou, Y Fan, X He, J Wang, Z Di, XE Wang
Alexa Prize Simbot Challenge, 2022
2022
Diagnosing Hallucination Problem in Object Navigation
K Zhou, K Lee, Y Fan, XE Wang
Causal and Object-Centric Representations for Robotics Workshop at CVPR 2024