Faithscore: Evaluating hallucinations in large vision-language models L Jing, R Li, Y Chen, M Jia, X Du arXiv preprint arXiv:2311.01477, 2023 | 41 | 2023 |
Multi-source semantic graph-based multimodal sarcasm explanation generation L Jing, X Song, K Ouyang, M Jia, L Nie arXiv preprint arXiv:2306.16650, 2023 | 12 | 2023 |
Multimodal activation: Awakening dialog robots without wake words L Nie, M Jia, X Song, G Wu, H Cheng, J Gu Proceedings of the 44th International ACM SIGIR Conference on Research and …, 2021 | 12 | 2021 |
Debiasing Multimodal Sarcasm Detection with Contrastive Learning M Jia, C Xie, L Jing Proceedings of the AAAI Conference on Artificial Intelligence 38 (16), 18354 …, 2024 | 7 | 2024 |
Plug: Leveraging pivot language in cross-lingual instruction tuning Z Zhang, DH Lee, Y Fang, W Yu, M Jia, M Jiang, F Barbieri arXiv preprint arXiv:2311.08711, 2023 | 7 | 2023 |
Describe-then-Reason: Improving Multimodal Mathematical Reasoning through Visual Comprehension Training M Jia, Z Zhang, W Yu, F Jiao, M Jiang arXiv preprint arXiv:2404.14604, 2024 | 5 | 2024 |
Knowledge-enhanced Memory Model for Emotional Support Conversation M Jia, Q Chen, L Jing, D Fu, R Li arXiv preprint arXiv:2310.07700, 2023 | 5 | 2023 |
RepoGraph: Enhancing AI Software Engineering with Repository-level Code Graph S Ouyang, W Yu, K Ma, Z Xiao, Z Zhang, M Jia, J Han, H Zhang, D Yu arXiv preprint arXiv:2410.14684, 2024 | 1 | 2024 |
Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks M Jia, W Yu, K Ma, T Fang, Z Zhang, S Ouyang, H Zhang, M Jiang, D Yu arXiv preprint arXiv:2410.01744, 2024 | 1 | 2024 |
Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning Z Zhang, T Ge, Z Liang, W Yu, D Yu, M Jia, D Yu, M Jiang arXiv preprint arXiv:2406.12050, 2024 | 1 | 2024 |
Query-Oriented Micro-Video Summarization M Jia, Y Wei, X Song, T Sun, M Zhang, L Nie IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024 | 1 | 2024 |
Protecting Privacy in Multimodal Large Language Models with MLLMU-Bench Z Liu, G Dou, M Jia, Z Tan, Q Zeng, Y Yuan, M Jiang arXiv preprint arXiv:2410.22108, 2024 | | 2024 |
MultiChartQA: Benchmarking Vision-Language Models on Multi-Chart Problems Z Zhu, M Jia, Z Zhang, L Li, M Jiang arXiv preprint arXiv:2410.14179, 2024 | | 2024 |
Multimodal Interaction Modeling via Self-Supervised Multi-Task Learning for Review Helpfulness Prediction HL Gong, M Jia, L Jing arXiv preprint arXiv:2402.18107, 2024 | | 2024 |