Toggle Menu
MIT-IBM Watson AI Lab
Research
News
Inside the lab
MIT-IBM Watson AI Lab
Research
Featured
Papers + Code
Search
News
News
Inside the lab
Inside the lab
Membership
People
Careers
Contact
MIT
IBM Research
IBM Watson
X
Medium
314 Main St.
Cambridge, MA
02141
MIT-IBM Watson AI Lab
Research
Featured
Papers + Code
Search
News
News
Inside the lab
Inside the lab
Membership
People
Careers
Contact
MIT
IBM Research
IBM Watson
X
Medium
314 Main St.
Cambridge, MA
02141
Search
Research
Research
Papers + Code
What’s Next in AI
Search
↳ Enter
research
Understanding and Improving Visual Prompting: A Label-Mapping Perspective
Understanding and Improving Visual Prompting: A Label-Mapping Perspective
CVPR
signSGD via Zeroth-Order Oracle
signSGD via Zeroth-Order Oracle
Optimization
Robustness
Compressed Decentralized Proximal Stochastic Gradient Method for Nonconvex Composite Problems with Heterogeneous Data
Compressed Decentralized Proximal Stochastic Gradient Method for Nonconvex Composite Problems with Heterogeneous Data
ICML
Attacking Visual Language Grounding with Adversarial Examples: A Case Study on Neural Image Captioning.
Attacking Visual Language Grounding with Adversarial Examples: A Case Study on Neural Image Captioning.
Attacking Visual Language Grounding with Adversarial Examples: A Case Study on Neural Image Captioning
Attacking Visual Language Grounding with Adversarial Examples: A Case Study on Neural Image Captioning
Robustness
Natural Language Processing
Topology Attack and Defense for Graph Neural Networks: An Optimization Perspective
Topology Attack and Defense for Graph Neural Networks: An Optimization Perspective
IJCAI
Adversarial T-shirt! Evading Person Detectors in A Physical World
Adversarial T-shirt! Evading Person Detectors in A Physical World
ECCV
Computer Vision
AutoZOOM: Autoencoder-based Zeroth Order Optimization Method for Attacking Black-box Neural Networks
AutoZOOM: Autoencoder-based Zeroth Order Optimization Method for Attacking Black-box Neural Networks
Robustness
Optimization
Mod-Squad: Designing Mixtures of Experts As Modular Multi-Task Learners
Mod-Squad: Designing Mixtures of Experts As Modular Multi-Task Learners
CVPR
Fast Training of Provably Robust Neural Networks by SingleProp
Fast Training of Provably Robust Neural Networks by SingleProp
AAAI
Towards Certificated Model Robustness Against Weight Perturbations
Towards Certificated Model Robustness Against Weight Perturbations
AAAI
Sign-OPT: Defending the hard-label black-box cyber attack
Sign-OPT: Defending the hard-label black-box cyber attack
Cybersecurity
Adversarial Machine Learning
S3-NeRF: Neural Reflectance Field from Shading and Shadow under a Single Viewpoint
S3-NeRF: Neural Reflectance Field from Shading and Shadow under a Single Viewpoint
NeurIPS
Computer Vision
Structured Adversarial Attack: Towards General Implementation and Better Interpretability
Structured Adversarial Attack: Towards General Implementation and Better Interpretability
Robustness
Explainability
On the Design of Black-box Adversarial Examples by Leveraging Gradient-free Optimization and Operator Splitting Method
On the Design of Black-box Adversarial Examples by Leveraging Gradient-free Optimization and Operator Splitting Method
Optimization
Robustness
Is There a Trade-Off Between Fairness and Accuracy? A Perspective Using Mismatched Hypothesis Testing
Is There a Trade-Off Between Fairness and Accuracy? A Perspective Using Mismatched Hypothesis Testing
ICML
Machine Learning
Fast Incremental von Neumann Graph Entropy Computation: Theory, Algorithm, and Applications
Fast Incremental von Neumann Graph Entropy Computation: Theory, Algorithm, and Applications
ICML
Hidden Cost of Randomized Smoothing
Hidden Cost of Randomized Smoothing
AISTATS
Machine Learning
Self-Progressing Robust Training
Self-Progressing Robust Training
AAAI
Machine Learning
Practical Detection of Trojan Neural Networks: Data-Limited and Data-Free Cases
Practical Detection of Trojan Neural Networks: Data-Limited and Data-Free Cases
ECCV
Fast Learning of Graph Neural Networks with Guaranteed Generalizability: One-hidden-layer Case
Fast Learning of Graph Neural Networks with Guaranteed Generalizability: One-hidden-layer Case
ICML
Machine Learning
On Fast Adversarial Robustness Adaptation in Model-Agnostic Meta-Learning
On Fast Adversarial Robustness Adaptation in Model-Agnostic Meta-Learning
ICLR
Machine Learning
CNN-Cert: An Efficient Framework for Certifying Robustness of Convolutional Neural Networks
CNN-Cert: An Efficient Framework for Certifying Robustness of Convolutional Neural Networks
Robustness
Proper Network Interpretability Helps Adversarial Robustness in Classification
Proper Network Interpretability Helps Adversarial Robustness in Classification
ICML
When does Contrastive Learning Preserve Adversarial Robustness from Pretraining to Finetuning?
When does Contrastive Learning Preserve Adversarial Robustness from Pretraining to Finetuning?
NeurIPS
Zeroth-Order Stochastic Variance Reduction for Nonconvex Optimization
Zeroth-Order Stochastic Variance Reduction for Nonconvex Optimization
Optimization
Deep Learning
Evaluating the Robustness of Neural Networks: An Extreme Value Theory Approach
Evaluating the Robustness of Neural Networks: An Extreme Value Theory Approach
Robustness
Higher-Order Certification For Randomized Smoothing
Higher-Order Certification For Randomized Smoothing
Optimization
Joint Edge-Model Sparse Learning is Provably Efficient for Graph Neural Networks
Joint Edge-Model Sparse Learning is Provably Efficient for Graph Neural Networks
ICLR
A Theoretical Understanding of Shallow Vision Transformers: Learning, Generalization, and Sample Complexity
A Theoretical Understanding of Shallow Vision Transformers: Learning, Generalization, and Sample Complexity
ICLR
Efficient Neural Network Robustness Certification with General Activation Functions
Efficient Neural Network Robustness Certification with General Activation Functions
NeurIPS
Adversarial Attack Generation Empowered by Min-Max Optimization
Adversarial Attack Generation Empowered by Min-Max Optimization
NeurIPS
Neural Network Robustness Certification with General Activation Functions
Neural Network Robustness Certification with General Activation Functions
Robustness
Deep Learning
How unlabeled data improve generalization in self-training? A one-hidden-layer theoretical analysis
How unlabeled data improve generalization in self-training? A one-hidden-layer theoretical analysis
ICLR
PROVEN: Verifying Robustness of Neural Networks with a Probabilistic Approach
PROVEN: Verifying Robustness of Neural Networks with a Probabilistic Approach
ICML
Why Lottery Ticket Wins? A Theoretical Perspective of Sample Complexity on Sparse Neural Networks
Why Lottery Ticket Wins? A Theoretical Perspective of Sample Complexity on Sparse Neural Networks
NeurIPS
Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention
Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention
CVPR
Masked Motion Encoding for Self-Supervised Video Representation Learning
Masked Motion Encoding for Self-Supervised Video Representation Learning
CVPR
On-Device Training Under 256KB Memory
On-Device Training Under 256KB Memory
NeurIPS
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer
CVPR
A Gromov-Wasserstein Geometric View of Spectrum-Preserving Graph Coarsening
A Gromov-Wasserstein Geometric View of Spectrum-Preserving Graph Coarsening
ICML
Topological Experience Replay
Topological Experience Replay
ICLR
Multimodal Clustering Networks for Self-Supervised Learning From Unlabeled Videos
Multimodal Clustering Networks for Self-Supervised Learning From Unlabeled Videos
ICCV
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
ICCV
Adversarial Robustness: From Self-Supervised Pre-Training to Fine-Tuning
Adversarial Robustness: From Self-Supervised Pre-Training to Fine-Tuning
CVPR
Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition
Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition
CVPR
Computer Vision
The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models
The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models
CVPR
Computer Vision
Location-aware Graph Convolutional Networks for Video Question Answering
Location-aware Graph Convolutional Networks for Video Question Answering
AAAI
Experiences and Insights for Collaborative Industry-Academic Research in Artificial Intelligence
Experiences and Insights for Collaborative Industry-Academic Research in Artificial Intelligence
Entrepreneurship
Future of Work
A Sequential Set Generation Method for Predicting Set-Valued Outputs
A Sequential Set Generation Method for Predicting Set-Valued Outputs
AAAI
Training Stronger Baselines for Learning to Optimize
Training Stronger Baselines for Learning to Optimize
Optimization
MCUNet: Tiny Deep Learning on IoT Devices
MCUNet: Tiny Deep Learning on IoT Devices
Deep Learning
Efficient AI
Towards Verifying Robustness of Neural Networks against Semantic Perturbations
Towards Verifying Robustness of Neural Networks against Semantic Perturbations
CVPR
Rate-improved inexact augmented Lagrangian method for constrained nonconvex optimization
Rate-improved inexact augmented Lagrangian method for constrained nonconvex optimization
AISTATS
Revisiting Contrastive Learning through the Lens of Neighborhood Component Analysis: an Integrated Framework
Revisiting Contrastive Learning through the Lens of Neighborhood Component Analysis: an Integrated Framework
ICML
Detector-Free Weakly Supervised Grounding by Separation
Detector-Free Weakly Supervised Grounding by Separation
ICCV
Data-Efficient Double-Win Lottery Tickets from Robust Pre-training
Data-Efficient Double-Win Lottery Tickets from Robust Pre-training
ICML
Procedural Image Programs for Representation Learning
Procedural Image Programs for Representation Learning
NeurIPS
SNAKE: Shape-aware Neural 3D Keypoint Field
SNAKE: Shape-aware Neural 3D Keypoint Field
NeurIPS
A Broad Study on the Transferability of Visual Representations With Contrastive Learning
A Broad Study on the Transferability of Visual Representations With Contrastive Learning
ICCV
Learning Active Camera for Multi-Object Navigation
Learning Active Camera for Multi-Object Navigation
NeurIPS
Dense Regression Network for Video Grounding
Dense Regression Network for Video Grounding
CVPR
VALHALLA: Visual Hallucination for Machine Translation
VALHALLA: Visual Hallucination for Machine Translation
CVPR
Task2Sim: Towards Effective Pre-training and Transfer from Synthetic Data
Task2Sim: Towards Effective Pre-training and Transfer from Synthetic Data
CVPR
LitePose: Efficient Architecture Design for 2D Human Pose Estimation
LitePose: Efficient Architecture Design for 2D Human Pose Estimation
CVPR
Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation
Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation
NeurIPS
Optimizer Amalgamation
Optimizer Amalgamation
ICLR
ComPhy: Compositional Physical Reasoning of Objects and Events from Videos
ComPhy: Compositional Physical Reasoning of Objects and Events from Videos
ICLR
Planning with Large Language Models for Code Generation
Planning with Large Language Models for Code Generation
ICLR
Discrete Graph Structure Learning for Forecasting Multiple Time Series
Discrete Graph Structure Learning for Forecasting Multiple Time Series
ICLR
Time Series
Min-Max Optimization without Gradients: Convergence and Applications to Black-Box Evasion and Poisoning Attacks
Min-Max Optimization without Gradients: Convergence and Applications to Black-Box Evasion and Poisoning Attacks
ICML
DAG-GNN: DAG Structure Learning with Graph Neural Networks
DAG-GNN: DAG Structure Learning with Graph Neural Networks
ICML
ZO-AdaMM: Zeroth-Order Adaptive Momentum Method for Black-Box Optimization
ZO-AdaMM: Zeroth-Order Adaptive Momentum Method for Black-Box Optimization
NeurIPS
CAG: A Real-Time Low-Cost Enhanced-Robustness High-Transferability Content-Aware Adversarial Attack Generator
CAG: A Real-Time Low-Cost Enhanced-Robustness High-Transferability Content-Aware Adversarial Attack Generator
AAAI
Proximal Stochastic Recursive Momentum Methods for Nonconvex Composite Decentralized Optimization
Proximal Stochastic Recursive Momentum Methods for Nonconvex Composite Decentralized Optimization
AAAI
Redeeming Intrinsic Rewards via Constrained Optimization
Redeeming Intrinsic Rewards via Constrained Optimization
NeurIPS
Optimization
DexDeform: Dexterous Deformable Object Manipulation with Human Demonstrations and Differentiable Physics
DexDeform: Dexterous Deformable Object Manipulation with Human Demonstrations and Differentiable Physics
ICLR
PAC-NeRF: Physics Augmented Continuum Neural Radiance Fields for Geometry-Agnostic System Identification
PAC-NeRF: Physics Augmented Continuum Neural Radiance Fields for Geometry-Agnostic System Identification
ICLR
Graph Neural Network-Inspired Kernels for Gaussian Processes in Semi-Supervised Learning
Graph Neural Network-Inspired Kernels for Gaussian Processes in Semi-Supervised Learning
ICLR
3D Concept Learning and Reasoning from Multi-View Images
3D Concept Learning and Reasoning from Multi-View Images
CVPR
Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval
Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval
CVPR
GC-Flow: A Graph-Based Flow Network for Effective Clustering
GC-Flow: A Graph-Based Flow Network for Effective Clustering
ICML
Hierarchical Grammar-Induced Geometry for Data-Efficient Molecular Property Prediction
Hierarchical Grammar-Induced Geometry for Data-Efficient Molecular Property Prediction
ICML
Advancing Model Pruning via Bi-level Optimization
Advancing Model Pruning via Bi-level Optimization
NeurIPS
Optimization
EvolveGCN: Evolving Graph Convolutional Networks for Dynamic Graphs
EvolveGCN: Evolving Graph Convolutional Networks for Dynamic Graphs
Graph Deep Learning
AAAI
Exponentially Improving the Complexity of Simulating the Weisfeiler-Lehman Test with Graph Neural Networks
Exponentially Improving the Complexity of Simulating the Weisfeiler-Lehman Test with Graph Neural Networks
NeurIPS
RT3D: Achieving Real-Time Execution of 3D Convolutional Neural Networks on Mobile Devices
RT3D: Achieving Real-Time Execution of 3D Convolutional Neural Networks on Mobile Devices
AAAI
Machine Learning
Constrained Generation of Semantically Valid Graphs via Regularizing Variational Autoencoders
Constrained Generation of Semantically Valid Graphs via Regularizing Variational Autoencoders
Generative Models
Graph Deep Learning
Scalable Graph Learning for Anti-Money Laundering: A First Look
Scalable Graph Learning for Anti-Money Laundering: A First Look
AI in Finance
Graph Deep Learning
On the Convergence of A Class of Adam-Type Algorithms for Non-Convex Optimization
On the Convergence of A Class of Adam-Type Algorithms for Non-Convex Optimization
Optimization
Anti-Money Laundering in Bitcoin: Experimenting with Graph Convolutional Networks for Financial Forensics
Anti-Money Laundering in Bitcoin: Experimenting with Graph Convolutional Networks for Financial Forensics
AI in Finance
Graph Deep Learning
Self-supervised Moving Vehicle Tracking with Stereo Sound
Self-supervised Moving Vehicle Tracking with Stereo Sound
Multimodal Learning
Computer Vision
Big-Little-Video-Net: Work smarter, not harder, for video understanding
Big-Little-Video-Net: Work smarter, not harder, for video understanding
Computer Vision
ZO-AdaMM: Derivative-free optimization for black-box problems
ZO-AdaMM: Derivative-free optimization for black-box problems
Optimization
NeurIPS
Deep Symbolic Superoptimization Without Human Knowledge
Deep Symbolic Superoptimization Without Human Knowledge
Neuro-Symbolic AI
Optimization
The Lottery Ticket Hypothesis for the Pre-trained BERT Networks
The Lottery Ticket Hypothesis for the Pre-trained BERT Networks
Efficient AI
AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition
AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition
ICCV
Multimodal Learning
RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning.
RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning.
AAAI
Computer Vision
Grounding Physical Concepts of Objects and Events Through Dynamic Visual Reasoning
Grounding Physical Concepts of Objects and Events Through Dynamic Visual Reasoning
ICLR
Computer Vision
Robust Overfitting may be mitigated by properly learned smoothening
Robust Overfitting may be mitigated by properly learned smoothening
ICLR
Debiased Contrastive Learning
Debiased Contrastive Learning
NeurIPS
Foley Music: Learning to Generate Music from Videos
Foley Music: Learning to Generate Music from Videos
ECCV
Computer Vision
CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks
CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks
NeurIPS
Dynamic Video Quantization for Efficient Inference
Dynamic Video Quantization for Efficient Inference
ICCV
ICCV
NASTransfer: Analyzing Architecture Transferability in Large Scale Neural Architecture Search
NASTransfer: Analyzing Architecture Transferability in Large Scale Neural Architecture Search
AAAI
Computer Vision
STAR: A Benchmark for Situated Reasoning in Real-World Videos
STAR: A Benchmark for Situated Reasoning in Real-World Videos
NeurIPS
Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language
Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language
NeurIPS
Memory-efficient Patch-based Inference for Tiny Deep Learning
Memory-efficient Patch-based Inference for Tiny Deep Learning
NeurIPS
Everything at Once – Multi-modal Fusion Transformer for Video Retrieval
Everything at Once – Multi-modal Fusion Transformer for Video Retrieval
CVPR
On the Equivalence between Neural Network and Support Vector Machine
On the Equivalence between Neural Network and Support Vector Machine
NeurIPS
Dynamic Distillation Network for Cross-Domain Few-Shot Recognition with Unlabeled Data
Dynamic Distillation Network for Cross-Domain Few-Shot Recognition with Unlabeled Data
NeurIPS
Graph Universal Adversarial Attacks: A Few Bad Actors Ruin Graph Learning Models
Graph Universal Adversarial Attacks: A Few Bad Actors Ruin Graph Learning Models
Machine Learning
Social Networks
Unsupervised Learning of Graph Hierarchical Abstractions with Differentiable Coarsening and Optimal Transport
Unsupervised Learning of Graph Hierarchical Abstractions with Differentiable Coarsening and Optimal Transport
AAAI
Machine Learning
An Image Enhancing Pattern-based Sparsity for Real-time Inference on Mobile Devices
An Image Enhancing Pattern-based Sparsity for Real-time Inference on Mobile Devices
ECCV
Computer Vision
Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning
Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning
ICLR
A content-aware attack generator for AI cybersecurity
A content-aware attack generator for AI cybersecurity
Adversarial Machine Learning
Cybersecurity
Embedding Compression with Isotropic Iterative Quantization
Embedding Compression with Isotropic Iterative Quantization
Natural Language Processing
Online AI planning with graph neural networks and adaptive scheduling
Online AI planning with graph neural networks and adaptive scheduling
Automated Planning
Graph Deep Learning
Learning to learn with distributional signatures for text data
Learning to learn with distributional signatures for text data
ICLR
Transfer Learning
Fast and efficient black-box testing for AI cybersecurity
Fast and efficient black-box testing for AI cybersecurity
Cybersecurity
Robustness
Graph-Augmented Normalizing Flows for Anomaly Detection of Multiple Time Series
Graph-Augmented Normalizing Flows for Anomaly Detection of Multiple Time Series
ICLR
RegionViT: Regional-to-Local Attention for Vision Transformers
RegionViT: Regional-to-Local Attention for Vision Transformers
ICLR
Can an Image Classifier Suffice For Action Recognition?
Can an Image Classifier Suffice For Action Recognition?
ICLR
Data-Efficient Graph Grammar Learning for Molecular Generation
Data-Efficient Graph Grammar Learning for Molecular Generation
ICLR
Directed Acyclic Graph Neural Networks
Directed Acyclic Graph Neural Networks
ICLR
Graph Deep Learning