MMCode: Benchmarking Multimodal Large Language Models for Code Generation with Visually Rich Programming Problems K Li, Y Tian, Q Hu, Z Luo, Z Huang, J Ma EMNLP 2024 (Findings), ICLR 2025 Workshop on Reasoning and Planning for …, 2024 | 18 | 2024 |
Instructcoder: Instruction tuning large language models for code editing K Li, Q Hu, J Zhao, H Chen, Y Xie, T Liu, M Shieh, J He ACL 2024 Oral (srw), 2024 | 13* | 2024 |
Not All Samples Should Be Utilized Equally: Towards Understanding and Improving Dataset Distillation S Wang, Y Yang, Q Wang, K Li, L Zhang, J Yan arXiv preprint arXiv:2408.12483, 2024 | 5 | 2024 |
Robi Butler: Remote Multimodal Interactions with Household Robot Assistant A Xiao, N Janaka, T Hu, A Gupta, K Li, C Yu, D Hsu ICRA 2025, 2024 | 3* | 2024 |
Towards Better Text-to-Image Generation Alignment via Attention Modulation Y Wu, X Cao, K Li, Z Chen, H Wang, L Meng, Z Huang ICONIP 2024, 2024 | 3 | 2024 |
ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use K Li, Z Meng, H Lin, Z Luo, Y Tian, J Ma, Z Huang, TS Chua ICLR 2025 Workshop on Reasoning and Planning for Large Language Models, 2025 | 1 | 2025 |
MCTS-Judge: Test-Time Scaling in LLM-as-a-Judge for Code Correctness Evaluation Y Wang, P Ji, C Yang, K Li, M Hu, J Li, G Sartoretti arXiv preprint arXiv:2502.12468, 2025 | | 2025 |