Follow
Mustafa Shukor
Mustafa Shukor
PhD Student at Sorbonne University
Verified email at sorbonne-universite.fr - Homepage
Title
Cited by
Cited by
Year
Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards
A Rame, G Couairon, M Shukor, C Dancette, JB Gaya, L Soulier, M Cord
NeurIPS 2023, 2023
872023
UnIVAL: Unified Model for Image, Video, Audio and Language Tasks
M Shukor, C Dancette, A Rame, M Cord
Transactions on Machine Learning Research (TMLR), 2023, 2023
27*2023
Transformer decoders with multimodal regularization for cross-modal food retrieval
M Shukor, G Couairon, A Grechka, M Cord
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
222022
eP-ALM: Efficient Perceptual Augmentation of Language Models
M Shukor, C Dancette, M Cord
ICCV 2023, 2023
212023
Synthetic training data generation for deep learning based quality inspection
P Gutierrez, M Luschkova, A Cordier, M Shukor, M Schappert, T Dahmen
International Conference on Quality Control by Artificial Vision (QCAV 2021 …, 2021
192021
Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment
M Shukor, G Couairon, M Cord
BMVC 2022, 2022
182022
Beyond task performance: Evaluating and reducing the flaws of large multimodal models with in-context learning
M Shukor, A Rame, C Dancette, M Cord
ICLR 2024, 2023
112023
Semantic unfolding of stylegan latent space
M Shukor, X Yao, BB Damodaran, P Hellier
2022 IEEE International Conference on Image Processing (ICIP), 221-225, 2022
10*2022
Vision and structured-language pretraining for cross-modal food retrieval
M Shukor, N Thome, M Cord
Computer Vision and Image Understanding 247, 104071, 2024
9*2024
What Makes Multimodal In-Context Learning Work?
FB Baldassini, M Shukor, M Cord, L Soulier, B Piwowarski
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
72024
Video coding using learned latent gan compression
M Shukor, BB Damodaran, X Yao, P Hellier
Proceedings of the 30th ACM International Conference on Multimedia, 2239-2248, 2022
72022
Improved baselines for data-efficient perceptual augmentation of llms
T Vallaeys, M Shukor, M Cord, J Verbeek
arXiv preprint arXiv:2403.13499, 2024
62024
A Concept-Based Explainability Framework for Large Multimodal Models
J Parekh, P Khayatan, M Shukor, A Newson, M Cord
NeurIPS 2024, 2024
42024
FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models
BT Corradini, M Shukor, P Couairon, G Couairon, F Scarselli, M Cord
arXiv preprint arXiv:2403.20105, 2024
22024
Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs
M Shukor, M Cord
NeurIPS 2024, 2024
12024
Skipping Computations in Multimodal LLMs
M Shukor, M Cord
NeurIPSw 2024, 2024
2024
Methods and apparatuses for encoding/decoding an image or a video
P Hellier, M Shukor, BB Damodaran, YAO Xu
US Patent App. 18/573,260, 2024
2024
Zero-Shot Image Segmentation via Recursive Normalized Cut on Diffusion Features
P Couairon, M Shukor, JE Haugeard, M Cord, N Thome
NeurIPS 2024, 2024
2024
Supplementary material for eP-ALM: Efficient Perceptual Augmentation of Language Models
M Shukor, C Dancette, M Cord
The system can't perform the operation now. Try again later.
Articles 1–19