Valentin Gabeur

I am a final year PhD student at Inria (Thoth team), advised by Cordelia Schmid and Karteek Alahari. My research focuses on multi-modal learning for video understanding, at the intersection of computer vision, audio processing, speech recognition and NLP. During my PhD, I have worked as a Student Researcher at Google AI Research. I received a MS in Robotics from Toulouse University in 2018.

Prior to that, I have worked in industrial automation and machine design for 6 years, mostly as a mechanical engineer. I received a MS in Engineering from ICAM Lille.

Email  /  CV  /  Google Scholar  /  LinkedIn  /  Twitter  /  GitHub

profile photo
Research
mmcvr

Masking Modalities for Cross-modal Video Retrieval
Valentin Gabeur, Arsha Nagrani, Chen Sun, Karteek Alahari, Cordelia Schmid
WACV, 2022  
arXiv / bibtex
Pre-training strategy for learning multi-modal fusion from unlabelled videos.

mmt

Multi-modal Transformer for Video Retrieval
Valentin Gabeur, Chen Sun, Karteek Alahari, Cordelia Schmid
ECCV, 2020 (Spotlight paper)  
arXiv / code, models, data / bibtex
Cross-modal architecture to encode language captions and videos in a common embedding space.

video-pent

CVPR 2020 Video Pentathlon Challenge: Multi-modal Transformer for Video Retrieval
Valentin Gabeur, Chen Sun, Karteek Alahari, Cordelia Schmid
CVPR Video Pentathlon Workshop, 2020 (First place)  
report / paper / challenge / recording
Winning approach for the CVPR 2020 Video Pentathlon Challenge, a video retrieval competition.

moulding

Moulding Humans: Non-parametric 3D Human Shape Estimation from Single Images
Valentin Gabeur, Jean-Sebastien Franco, Xavier Martin, Cordelia Schmid, Gregory Rogez
ICCV, 2019  
arXiv / bibtex
Efficient 3D shape representation through the combination of depth maps.


The code for this page is available here.
Credits to Jon Barron for the template.