- 
Visually Grounded Interaction and Language, NeurIPS 2019, NeurIPS 2018
 
- 
Emergent Communication: Towards Natural Language, NeurIPS 2019
 
- 
Workshop on Multimodal Understanding and Learning for Embodied Applications, ACM Multimedia 2019
 
- 
Beyond Vision and Language: Integrating Real-World Knowledge, EMNLP 2019
 
- 
The How2 Challenge: New Tasks for Vision & Language, ICML 2019
 
- 
Visual Question Answering and Dialog, CVPR 2019, CVPR 2017
 
- 
Multi-modal Learning from Videos, CVPR 2019
 
- 
Multimodal Learning and Applications Workshop, CVPR 2019, ECCV 2018
 
- 
Habitat: Embodied Agents Challenge and Workshop, CVPR 2019
 
- 
Closing the Loop Between Vision and Language & LSMD Challenge, ICCV 2019
 
- 
Multi-modal Video Analysis and Moments in Time Challenge, ICCV 2019
 
- 
Cross-Modal Learning in Real World, ICCV 2019
 
- 
Spatial Language Understanding and Grounded Communication for Robotics, NAACL 2019
 
- 
YouTube-8M Large-Scale Video Understanding, ICCV 2019, ECCV 2018, CVPR 2017
 
- 
Language and Vision Workshop, CVPR 2019, CVPR 2018, CVPR 2017, CVPR 2015
 
- 
Sight and Sound, CVPR 2019, CVPR 2018
 
- 
The Large Scale Movie Description Challenge (LSMDC), ICCV 2019, ICCV 2017
 
- 
Wordplay: Reinforcement and Language Learning in Text-based Games, NeurIPS 2018
 
- 
Interpretability and Robustness in Audio, Speech, and Language, NeurIPS 2018
 
- 
Multimodal Robot Perception, ICRA 2018
 
- 
WMT18: Shared Task on Multimodal Machine Translation, EMNLP 2018
 
- 
Shortcomings in Vision and Language, ECCV 2018
 
- 
Grand Challenge and Workshop on Human Multimodal Language, ACL 2018
 
- 
Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, EMNLP 2018, EMNLP 2017, NAACL-HLT 2016, EMNLP 2015, ACL 2014, NAACL-HLT 2013
 
- 
Visual Understanding Across Modalities, CVPR 2017
 
- 
International Workshop on Computer Vision for Audio-Visual Media, ICCV 2017
 
- 
Language Grounding for Robotics, ACL 2017
 
- 
Computer Vision for Audio-visual Media, ECCV 2016
 
- 
Language and Vision, ACL 2016, EMNLP 2015