📄

Publications

2026

Vision-Speech Models: Teaching Speech Models to Converse About Images

Amélie Royer*, Moritz Böhle*, Gabriel de Marmiesse, Laurent Mazaré, Neil Zeghidour, Alexandre Défossez, Patrick Pérez

Conference on Computer Vision and Pattern Recognition (CVPR) 2026

2025

CASA: Cross-Attention over Self-Attention for Efficient Vision-Language Fusion

Moritz Böhle*, Amélie Royer*, Juliette Marrie*, Edouard Grave, Patrick Pérez

arXiv preprint 2025

2024

Moshi: A Speech-Text Foundation Model for Real-Time Dialogue

Alexandre Défossez, Laurent Mazaré, Manu Orsini, Amélie Royer, Patrick Pérez, Hervé Jégou, Edouard Grave, Neil Zeghidour

Technical Report 2024

InterroGate: Learning to Share, Specialize, and Prune Representations for Multi-Task Learning

Babak Ehteshami Bejnordi, Gaurav Kumar, Amélie Royer, Christos Louizos, Tijmen Blankevoort, Mohsen Ghafoorian

British Machine Vision Conference (BMVC) 2024

Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding

Benjamin Bergner, Andrii Skliar, Amélie Royer, Tijmen Blankevoort, Yuki Asano, Babak Ehteshami Bejnordi

ES-FoMo II Workshop, ICML 2024

2023

MSViT: Dynamic Mixed-Scale Tokenization for Vision Transformers

Jakob Drachmann Havtorn*, Amélie Royer*, Tijmen Blankevoort, Babak Ehteshami Bejnordi

ICCV Workshop on New Ideas in Vision Transformers (NViT) 2023

Scalarization for Multi-Task and Multi-Domain Learning at Scale

Amélie Royer, Tijmen Blankevoort, Babak Ehteshami Bejnordi

Conference on Neural Information Processing Systems (NeurIPS) 2023

2022

Knowledge Distillation: A good teacher is patient and consistent

Lucas Beyer*, Xiaohua Zhai*, Amélie Royer*, Larisa Markeeva*, Rohan Anil, Alexander Kolesnikov

Conference on Computer Vision and Pattern Recognition (CVPR) (oral) 2022

Revisiting single-gated Mixtures of Experts

Amélie Royer, Ilia Karmanov, Andrii Skliar, Babak Ehteshami Bejnordi, Tijmen Blankevoort

British Machine Vision Conference (BMVC) 2022

2020

A Flexible Selection Scheme for Minimum-Effort Transfer Learning

Amélie Royer and Christoph Lampert

Winter Conference on Applications of Computer Vision (WACV) 2020

Localizing Grouped Instances for Efficient Detection in Low-Resource Scenarios

Amélie Royer and Christoph Lampert

Winter Conference on Applications of Computer Vision (WACV) 2020

Multiple-Environment Markov Decision Processes: Efficient Analysis and Applications

Krishnendu Chatterjee, Martin Chmelík, Deep Karkhanis, Petr Novotný and Amélie Royer

International Conference on Automated Planning and Scheduling (ICAPS) 2020

2018

XGAN: Unsupervised Image-to-Image Translation for Many-to-Many Mappings

Amélie Royer, Konstantinos Bousmalis, Stephan Gouws, Fred Bertsch, Inbar Mosseri, Forrester Cole, Kevin Murphy

Domain Adaptation for Visual Understanding Workshop at ICML/IJCAI/EJCAI 2018 2018

2017

Probabilistic Image Colorization

Amélie Royer*, Alexander Kolesnikov*, Christoph Lampert

British Machine Vision Conference (BMVC) 2017

2016

Audio Word Similarity for Clustering with zero Resources based on iterative HMM Classification

Amélie Royer, Guillaume Gravier, Vincent Claveau

International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2016

2015

Classifier Adaptation at Prediction Time

Amélie Royer and Christoph Lampert

Conference on Computer Vision and Pattern Recognition (CVPR) 2015