Suivre
Shengyi Qian
Shengyi Qian
Research Scientist, Meta FAIR
Adresse e-mail validée de meta.com - Page d'accueil
Titre
Citée par
Citée par
Année
LLM-Grounder: Open-vocabulary 3d visual grounding with large language model as an agent
J Yang, X Chen, S Qian, N Madaan, M Iyengar, DF Fouhey, J Chai
2024 IEEE International Conference on Robotics and Automation (ICRA), 7694-7701, 2024
812024
Learning single-image depth from videos using quality assessment networks
W Chen, S Qian, J Deng
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019
802019
OASIS: A Large-Scale Dataset for Single Image 3D in the Wild
W Chen, S Qian, D Fan, N Kojima, M Hamilton, J Deng
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020
672020
Planar Surface Reconstruction from Sparse Views
L Jin, S Qian, A Owens, DF Fouhey
International Conference on Computer Vision (ICCV), 2021
402021
Affordancellm: Grounding affordance from vision language models
S Qian, W Chen, M Bai, X Zhou, Z Tu, LE Li
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
322024
Associative3D: Volumetric Reconstruction from Sparse Views
S Qian, L Jin, D Fouhey
European Conference on Computer Vision (ECCV), 2020
242020
Understanding 3D Object Articulation in Internet Videos
S Qian, L Jin, C Rockwell, S Chen, DF Fouhey
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022
222022
Multi-object hallucination in vision language models
X Chen, Z Ma, X Zhang, S Xu, S Qian, J Yang, D Fouhey, J Chai
Advances in Neural Information Processing Systems 37, 44393-44418, 2024
202024
Understanding 3D Object Interaction from a Single Image
S Qian, DF Fouhey
International Conference on Computer Vision (ICCV), 2023
132023
Pitfalls in link prediction with graph neural networks: Understanding the impact of target-link inclusion & better practices
J Zhu, Y Zhou, VN Ioannidis, S Qian, W Ai, X Song, D Koutra
Proceedings of the 17th ACM International Conference on Web Search and Data …, 2024
112024
Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation
Z Chen, S Qian, A Owens
International Conference on Computer Vision (ICCV), 2023
112023
3d-grand: A million-scale dataset for 3d-llms with better grounding and less hallucination
J Yang, X Chen, N Madaan, M Iyengar, S Qian, DF Fouhey, J Chai
arXiv preprint arXiv:2406.05132, 2024
92024
Multimodal graph benchmark
J Zhu, Y Zhou, S Qian, Z He, T Zhao, N Shah, D Koutra
arXiv preprint arXiv:2406.16321, 2024
72024
Recognizing scenes from novel viewpoints
S Qian, A Kirillov, N Ravi, DS Chaplot, J Johnson, DF Fouhey, G Gkioxari
arXiv preprint arXiv:2112.01520, 2021
72021
3d-mvp: 3d multiview pretraining for robotic manipulation
S Qian, K Mo, V Blukis, DF Fouhey, D Fox, A Goyal
arXiv preprint arXiv:2406.18158, 2024
32024
Linkgpt: Teaching large language models to predict missing links
Z He, J Zhu, S Qian, J Chai, D Koutra
arXiv preprint arXiv:2406.04640, 2024
32024
SpotTarget: Rethinking the effect of target edges for link prediction in graph neural networks
J Zhu, Y Zhou, VN Ioannidis, S Qian, W Ai, X Song, D Koutra
32023
3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs
J Yang, X Chen, N Madaan, M Iyengar, S Qian, DF Fouhey, J Chai
arXiv e-prints, arXiv: 2406.05132, 2024
2024
Learning to Interact with the 3D World
S Qian
University of Michigan (PhD Dissertation), 2024
2024
Multimodal Attributed Graphs: Benchmarking and Rethinking
J Zhu, Y Zhou, S Qian, Z He, T Zhao, N Shah, D Koutra
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–20