Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards A Rame, G Couairon, M Shukor, C Dancette, JB Gaya, L Soulier, M Cord NeurIPS 2023, 2023 | 87 | 2023 |
UnIVAL: Unified Model for Image, Video, Audio and Language Tasks M Shukor, C Dancette, A Rame, M Cord Transactions on Machine Learning Research (TMLR), 2023, 2023 | 27* | 2023 |
Transformer decoders with multimodal regularization for cross-modal food retrieval M Shukor, G Couairon, A Grechka, M Cord Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 22 | 2022 |
eP-ALM: Efficient Perceptual Augmentation of Language Models M Shukor, C Dancette, M Cord ICCV 2023, 2023 | 21 | 2023 |
Synthetic training data generation for deep learning based quality inspection P Gutierrez, M Luschkova, A Cordier, M Shukor, M Schappert, T Dahmen International Conference on Quality Control by Artificial Vision (QCAV 2021 …, 2021 | 19 | 2021 |
Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment M Shukor, G Couairon, M Cord BMVC 2022, 2022 | 18 | 2022 |
Beyond task performance: Evaluating and reducing the flaws of large multimodal models with in-context learning M Shukor, A Rame, C Dancette, M Cord ICLR 2024, 2023 | 11 | 2023 |
Semantic unfolding of stylegan latent space M Shukor, X Yao, BB Damodaran, P Hellier 2022 IEEE International Conference on Image Processing (ICIP), 221-225, 2022 | 10* | 2022 |
Vision and structured-language pretraining for cross-modal food retrieval M Shukor, N Thome, M Cord Computer Vision and Image Understanding 247, 104071, 2024 | 9* | 2024 |
What Makes Multimodal In-Context Learning Work? FB Baldassini, M Shukor, M Cord, L Soulier, B Piwowarski Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 7 | 2024 |
Video coding using learned latent gan compression M Shukor, BB Damodaran, X Yao, P Hellier Proceedings of the 30th ACM International Conference on Multimedia, 2239-2248, 2022 | 7 | 2022 |
Improved baselines for data-efficient perceptual augmentation of llms T Vallaeys, M Shukor, M Cord, J Verbeek arXiv preprint arXiv:2403.13499, 2024 | 6 | 2024 |
A Concept-Based Explainability Framework for Large Multimodal Models J Parekh, P Khayatan, M Shukor, A Newson, M Cord NeurIPS 2024, 2024 | 4 | 2024 |
FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models BT Corradini, M Shukor, P Couairon, G Couairon, F Scarselli, M Cord arXiv preprint arXiv:2403.20105, 2024 | 2 | 2024 |
Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs M Shukor, M Cord NeurIPS 2024, 2024 | 1 | 2024 |
Skipping Computations in Multimodal LLMs M Shukor, M Cord NeurIPSw 2024, 2024 | | 2024 |
Methods and apparatuses for encoding/decoding an image or a video P Hellier, M Shukor, BB Damodaran, YAO Xu US Patent App. 18/573,260, 2024 | | 2024 |
Zero-Shot Image Segmentation via Recursive Normalized Cut on Diffusion Features P Couairon, M Shukor, JE Haugeard, M Cord, N Thome NeurIPS 2024, 2024 | | 2024 |
Supplementary material for eP-ALM: Efficient Perceptual Augmentation of Language Models M Shukor, C Dancette, M Cord | | |