I’m an AI Scientist at Mistral AI, Paris. My research focuses on multimodal foundation models for vision and language. Previously, I was a postdoctoral researcher at FAIR in Meta AI, where I contributed to the development of DINOv3, a state-of-the-art self-supervised vision foundation model, and DINO-world, a latent video world model. During my PhD at KTH, Stockholm, I worked on the explainability of deep learning models, applied to computer vision and bioinformatics.
Learn more about me and my research on my website.
Connect with me: