Muirbench: A comprehensive benchmark for robust multi-image understanding F Wang, X Fu, JY Huang, Z Li, Q Liu, X Liu, MD Ma, N Xu, W Zhou, ... arXiv preprint arXiv:2406.09411, 2024 | 40 | 2024 |
Robust natural language understanding with residual attention debiasing F Wang, JY Huang, T Yan, W Zhou, M Chen ACL 2023, 2023 | 13 | 2023 |
Contrastive Instruction Tuning TL Yan, F Wang, JY Huang, W Zhou, F Yin, A Galstyan, W Yin, M Chen ACL 2024, 2024 | 10 | 2024 |
Monotonic paraphrasing improves generalization of language model prompting Q Liu, F Wang, N Xu, T Yan, T Meng, M Chen EMNLP 2024, 2024 | 6 | 2024 |
Promote, Suppress, Iterate: How Language Models Answer One-to-Many Factual Queries TL Yan, R Jia arXiv preprint arXiv:2502.20475, 2025 | | 2025 |