Shengyi Qian
Research Scientist, Meta FAIR
Verified email at meta.com - Homepage
Title
Cited by
Year
Learning single-image depth from videos using quality assessment networks
W Chen, S Qian, J Deng
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019
Cited by 79, 2019
OASIS: A Large-Scale Dataset for Single Image 3D in the Wild
W Chen, S Qian, D Fan, N Kojima, M Hamilton, J Deng
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020
Cited by 63, 2020
LLM-Grounder: Open-vocabulary 3D visual grounding with large language model as an agent
J Yang, X Chen, S Qian, N Madaan, M Iyengar, DF Fouhey, J Chai
2024 IEEE International Conference on Robotics and Automation (ICRA), 7694-7701, 2024
Cited by 62, 2024
Planar Surface Reconstruction from Sparse Views
L Jin, S Qian, A Owens, DF Fouhey
International Conference on Computer Vision (ICCV), 2021
Cited by 34, 2021
AffordanceLLM: Grounding affordance from vision language models
S Qian, W Chen, M Bai, X Zhou, Z Tu, LE Li
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by 24, 2024
Associative3D: Volumetric Reconstruction from Sparse Views
S Qian, L Jin, D Fouhey
European Conference on Computer Vision (ECCV), 2020
Cited by 22, 2020
Understanding 3D Object Articulation in Internet Videos
S Qian, L Jin, C Rockwell, S Chen, DF Fouhey
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022
Cited by 20, 2022
Pitfalls in link prediction with graph neural networks: Understanding the impact of target-link inclusion & better practices
J Zhu, Y Zhou, VN Ioannidis, S Qian, W Ai, X Song, D Koutra
Proceedings of the 17th ACM International Conference on Web Search and Data …, 2024
Cited by 10, 2024
Understanding 3D Object Interaction from a Single Image
S Qian, DF Fouhey
International Conference on Computer Vision (ICCV), 2023
Cited by 9, 2023
Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation
Z Chen, S Qian, A Owens
International Conference on Computer Vision (ICCV), 2023
Cited by 8, 2023
Multi-object hallucination in vision-language models
X Chen, Z Ma, X Zhang, S Xu, S Qian, J Yang, DF Fouhey, J Chai
arXiv preprint arXiv:2407.06192, 2024
Cited by 7, 2024
Recognizing scenes from novel viewpoints
S Qian, A Kirillov, N Ravi, DS Chaplot, J Johnson, DF Fouhey, G Gkioxari
arXiv preprint arXiv:2112.01520, 2021
Cited by 7, 2021
Multimodal graph benchmark
J Zhu, Y Zhou, S Qian, Z He, T Zhao, N Shah, D Koutra
arXiv preprint arXiv:2406.16321, 2024
Cited by 3, 2024
3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination
J Yang, X Chen, N Madaan, M Iyengar, S Qian, DF Fouhey, J Chai
arXiv preprint arXiv:2406.05132, 2024
Cited by 3, 2024
3D-MVP: 3D multiview pretraining for robotic manipulation
S Qian, K Mo, V Blukis, DF Fouhey, D Fox, A Goyal
arXiv preprint arXiv:2406.18158, 2024
Cited by 1, 2024
SpotTarget: Rethinking the Effect of Target Edges for Link Prediction in Graph Neural Networks
J Zhu, Y Zhou, VN Ioannidis, S Qian, W Ai, X Song, D Koutra
arXiv preprint arXiv:2306.00899, 2023
Cited by 1, 2023
LinkGPT: Teaching Large Language Models To Predict Missing Links
Z He, J Zhu, S Qian, J Chai, D Koutra
arXiv preprint arXiv:2406.04640, 2024
2024
3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs
J Yang, X Chen, N Madaan, M Iyengar, S Qian, DF Fouhey, J Chai
arXiv e-prints, arXiv: 2406.05132, 2024
2024
Learning to Interact with the 3D World
S Qian
University of Michigan (PhD Dissertation), 2024
2024
Multimodal Attributed Graphs: Benchmarking and Rethinking
J Zhu, Y Zhou, S Qian, Z He, T Zhao, N Shah, D Koutra