Research

ICCV

Helping computer vision and language models understand what they see

MIT News

AI model speeds up high-resolution computer vision

MIT News

AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition

ICCV Multimodal Learning

Dynamic Video Quantization for Efficient Inference

Curious Representation Learning for Embodied Intelligence

Reasoning about Human-Object Interactions through Dual Attention Networks

ICCV Computer Vision