Papers + Code

Peer-review is the lifeblood of scientific validation and a guardrail against runaway hype in AI. Our commitment to publishing in the top venues reflects our grounding in what is real, reproducible, and truly innovative.

Sort by Newest ↑
Ensemble Estimation of Information Divergence
Ensemble Estimation of Information Divergence
 
Gradient Descent for Spiking Neural Networks
Gradient Descent for Spiking Neural Networks
 
Domain Adaptation by Using Causal Inference to Predict Invariant Conditional Distributions
Domain Adaptation by Using Causal Inference to Predict Invariant Conditional Distributions
 
Action Centered Contextual Bandits
Action Centered Contextual Bandits
 
BlockDrop: Dynamic Inference Paths in Residual Networks
BlockDrop: Dynamic Inference Paths in Residual Networks
 
Attacking Visual Language Grounding with Adversarial Examples: A Case Study on Neural Image Captioning.
Attacking Visual Language Grounding with Adversarial Examples: A Case Study on Neural Image Captioning.
 
Attacking Visual Language Grounding with Adversarial Examples: A Case Study on Neural Image Captioning
Attacking Visual Language Grounding with Adversarial Examples: A Case Study on Neural Image Captioning
 
The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
 
Learning to Separate Object Sounds by Watching Unlabeled Video
Learning to Separate Object Sounds by Watching Unlabeled Video
 
Learning to Teach in Cooperative Multiagent Reinforcement Learning
Learning to Teach in Cooperative Multiagent Reinforcement Learning
 
Dialog-based Interactive Image Retrieval
Dialog-based Interactive Image Retrieval
 
Jointly Optimize Data Augmentation and Network Training: Adversarial Data Augmentation in Human Pose Estimation
Jointly Optimize Data Augmentation and Network Training: Adversarial Data Augmentation in Human Pose Estimation
 
Zeroth-Order Stochastic Variance Reduction for Nonconvex Optimization
Zeroth-Order Stochastic Variance Reduction for Nonconvex Optimization
 
AutoZOOM: Autoencoder-based Zeroth Order Optimization Method for Attacking Black-box Neural Networks
AutoZOOM: Autoencoder-based Zeroth Order Optimization Method for Attacking Black-box Neural Networks
 
Delta-encoder: an effective sample synthesis method for few-shot object recognition
Delta-encoder: an effective sample synthesis method for few-shot object recognition
 
Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification
Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification
 
Sparse, Smart Contours to Represent and Edit Images
Sparse, Smart Contours to Represent and Edit Images
 
Geometry Guided Convolutional Neural Networks for Self-Supervised Video Representation Learning
Geometry Guided Convolutional Neural Networks for Self-Supervised Video Representation Learning
 
Characterizing and Learning Equivalence Classes of Causal DAGs under Interventions
Characterizing and Learning Equivalence Classes of Causal DAGs under Interventions
 
Structured Adversarial Attack: Towards General Implementation and Better Interpretability
Structured Adversarial Attack: Towards General Implementation and Better Interpretability
 
Controllable Image-to-Video Translation: A Case Study on Facial Expression Generation
Controllable Image-to-Video Translation: A Case Study on Facial Expression Generation
 
Deriving Machine Attention from Human Rationales
Deriving Machine Attention from Human Rationales
 
emrQA: A Large Corpus for Question Answering on Electronic Medical Records
emrQA: A Large Corpus for Question Answering on Electronic Medical Records
 
Constrained Generation of Semantically Valid Graphs via Regularizing Variational Autoencoders
Constrained Generation of Semantically Valid Graphs via Regularizing Variational Autoencoders
 
Unsupervised Domain Adaptation for 3D Keypoint Estimation via View Consistency
Unsupervised Domain Adaptation for 3D Keypoint Estimation via View Consistency
 
Interpretable Basis Decomposition for Visual Explanation
Interpretable Basis Decomposition for Visual Explanation
 
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input
 
signSGD via Zeroth-Order Oracle
signSGD via Zeroth-Order Oracle
 
Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding
Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding
 
MiME: Multilevel Medical Embedding of Electronic Health Records for Predictive Healthcare
MiME: Multilevel Medical Embedding of Electronic Health Records for Predictive Healthcare
 
Experimental Design for Cost-Aware Learning of Causal Graphs
Experimental Design for Cost-Aware Learning of Causal Graphs
 
StNet: Local and Global Spatial-Temporal Modeling for Action Recognition
StNet: Local and Global Spatial-Temporal Modeling for Action Recognition
 
Co-regularized Alignment for Unsupervised Domain Adaptation
Co-regularized Alignment for Unsupervised Domain Adaptation
 
Unsupervised learning with contrastive latent variable models
Unsupervised learning with contrastive latent variable models
 
Collective Online Learning of Gaussian Processes in Massive Multi-Agent Systems
Collective Online Learning of Gaussian Processes in Massive Multi-Agent Systems
 
GAN Dissection: Visualizing and Understanding Generative Adversarial Networks
GAN Dissection: Visualizing and Understanding Generative Adversarial Networks
 
CNN-Cert: An Efficient Framework for Certifying Robustness of Convolutional Neural Networks
CNN-Cert: An Efficient Framework for Certifying Robustness of Convolutional Neural Networks
 
Scalable Graph Learning for Anti-Money Laundering: A First Look
Scalable Graph Learning for Anti-Money Laundering: A First Look
 
Efficient Neural Network Robustness Certification with General Activation Functions
Efficient Neural Network Robustness Certification with General Activation Functions
 
HOGWILD!-Gibbs Can Be PanAccurate
HOGWILD!-Gibbs Can Be PanAccurate
 
How Does Batch Normalization Help Optimization?
How Does Batch Normalization Help Optimization?
 
Learning Libraries of Subroutines for Neurally–Guided Bayesian Program Induction
Learning Libraries of Subroutines for Neurally–Guided Bayesian Program Induction
 
Direct Estimation of Differences in Causal Graphs
Direct Estimation of Differences in Causal Graphs
 
The Limit Points of (Optimistic) Gradient Descent in Min-Max Optimization
The Limit Points of (Optimistic) Gradient Descent in Min-Max Optimization
 
Learning and Testing Causal Models with Interventions
Learning and Testing Causal Models with Interventions
 
Neural Network Robustness Certification with General Activation Functions
Neural Network Robustness Certification with General Activation Functions
 
Weakly Supervised Dense Event Captioning in Videos
Weakly Supervised Dense Event Captioning in Videos
 
Evaluating the Robustness of Neural Networks: An Extreme Value Theory Approach
Evaluating the Robustness of Neural Networks: An Extreme Value Theory Approach
 
PROVEN: Verifying Robustness of Neural Networks with a Probabilistic Approach
PROVEN: Verifying Robustness of Neural Networks with a Probabilistic Approach
 
A Sequential Set Generation Method for Predicting Set-Valued Outputs
A Sequential Set Generation Method for Predicting Set-Valued Outputs
 
MAi : An Intelligent Model Acquisition Interface for Interactive Specification of Dialogue Agents
MAi : An Intelligent Model Acquisition Interface for Interactive Specification of Dialogue Agents
 
Tight Certificates of Adversarial Robustness
Tight Certificates of Adversarial Robustness
 
Moments in Time Dataset: one million videos for event understanding
Moments in Time Dataset: one million videos for event understanding
 
Structural Supervision Improves Learning of Non-Local Grammatical Dependencies
Structural Supervision Improves Learning of Non-Local Grammatical Dependencies
 
Revisiting RCNN: On Awakening the Classification Power of Faster RCNN
Revisiting RCNN: On Awakening the Classification Power of Faster RCNN
 
Unsupervised learning by competing hidden units
Unsupervised learning by competing hidden units
 
Towards Optimal Transport with Global Invariances
Towards Optimal Transport with Global Invariances
 
Support and Invertibility in Domain-Invariant Representations
Support and Invertibility in Domain-Invariant Representations
 
Size of Interventional Markov Equivalence Classes in Random DAG Models
Size of Interventional Markov Equivalence Classes in Random DAG Models
 
ABCD-Strategy: Budgeted Experimental Design for Targeted Causal Structure Discovery
ABCD-Strategy: Budgeted Experimental Design for Targeted Causal Structure Discovery
 
The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences From Natural Supervision
The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences From Natural Supervision
 
Learning Embeddings into Entropic Wasserstein Spaces
Learning Embeddings into Entropic Wasserstein Spaces
 
ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
 
Bayesian Nonparametric Federated Learning of Neural Networks
Bayesian Nonparametric Federated Learning of Neural Networks
 
Imposing Label-Relational Inductive Bias for Extremely Fine-Grained Entity Typing
Imposing Label-Relational Inductive Bias for Extremely Fine-Grained Entity Typing
 
Variational Russian Roulette for Deep Bayesian Nonparametrics
Variational Russian Roulette for Deep Bayesian Nonparametrics
 
Write, Execute, Assess: Program Synthesis with a REPL
Write, Execute, Assess: Program Synthesis with a REPL
 
Molecular Hypergraph Grammar with Its Application to Molecular Optimization
Molecular Hypergraph Grammar with Its Application to Molecular Optimization
 
Estimating Information Flow in Deep Neural Networks
Estimating Information Flow in Deep Neural Networks
 
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
 
Scalable Fair Clustering
Scalable Fair Clustering
 
Collective Model Fusion for Multiple Black-Box Experts
Collective Model Fusion for Multiple Black-Box Experts
 
Dirichlet Simplex Nest and Geometric Inference
Dirichlet Simplex Nest and Geometric Inference
 
Fast Incremental von Neumann Graph Entropy Computation: Theory, Algorithm, and Applications
Fast Incremental von Neumann Graph Entropy Computation: Theory, Algorithm, and Applications
 
Grounding Spoken Words in Unlabeled Video
Grounding Spoken Words in Unlabeled Video
 
Additive Adversarial Learning for Unbiased Authentication
Additive Adversarial Learning for Unbiased Authentication
 
Unsupervised learning of action classes with continuous temporal embedding
Unsupervised learning of action classes with continuous temporal embedding
 
HAQ: Hardware-Aware Automated Quantization With Mixed Precision
HAQ: Hardware-Aware Automated Quantization With Mixed Precision
 
Identifying Interpretable Action Concepts in Deep Networks
Identifying Interpretable Action Concepts in Deep Networks
 
Deep Leakage from Gradients
Deep Leakage from Gradients
 
Unsupervised Grounding of Plannable First-Order Logic Representation from Images
Unsupervised Grounding of Plannable First-Order Logic Representation from Images
 
Towards Stable Symbol Grounding with Zero-Suppressed State AutoEncoder
Towards Stable Symbol Grounding with Zero-Suppressed State AutoEncoder
 
Beyond RNNs: Positional Self-Attention with Co-Attention for Video Question Answering
Beyond RNNs: Positional Self-Attention with Co-Attention for Video Question Answering
 
Attributed Description Logics: Reasoning on Knowledge Graphs
Attributed Description Logics: Reasoning on Knowledge Graphs
 
Neural language models as psycholinguistic subjects: Representations of syntactic state
Neural language models as psycholinguistic subjects: Representations of syntactic state
 
Unsupervised Clinical Language Translation
Unsupervised Clinical Language Translation
 
Adversarially Robust Submodular Maximization under Knapsack Constraints
Adversarially Robust Submodular Maximization under Knapsack Constraints
 
Retaining Privileged Information for Multi-Task Learning
Retaining Privileged Information for Multi-Task Learning
 
Coupled Variational Recurrent Collaborative Filtering
Coupled Variational Recurrent Collaborative Filtering
 
Bayesian Inference of Linear Temporal Logic Specifications for Contrastive Explanations
Bayesian Inference of Linear Temporal Logic Specifications for Contrastive Explanations
 
Evaluating the Interpretability of the Knowledge Compilation Map: Communicating Logical Statements Effectively
Evaluating the Interpretability of the Knowledge Compilation Map: Communicating Logical Statements Effectively
 
Topology Attack and Defense for Graph Neural Networks: An Optimization Perspective
Topology Attack and Defense for Graph Neural Networks: An Optimization Perspective
 
DDL: Deep Dictionary Learning for Predictive Phenotyping
DDL: Deep Dictionary Learning for Predictive Phenotyping
 
RDPD: Rich Data Helps Poor Data via Imitation
RDPD: Rich Data Helps Poor Data via Imitation
 
The Algonauts Project: A Platform for Communication between the Sciences of Biological and Artificial Intelligence
The Algonauts Project: A Platform for Communication between the Sciences of Biological and Artificial Intelligence
 
Defensive Quantization: When Efficiency Meets Robustness
Defensive Quantization: When Efficiency Meets Robustness
 
Watch, Reason and Code: Learning to Represent Videos Using Program
Watch, Reason and Code: Learning to Represent Videos Using Program
 
Seeing What a GAN Cannot Generate
Seeing What a GAN Cannot Generate
 
Learning to Learn without Forgetting By Maximizing Transfer and Minimizing Interference
Learning to Learn without Forgetting By Maximizing Transfer and Minimizing Interference
 
The Future of Work: How New Technologies Are Transforming Tasks
The Future of Work: How New Technologies Are Transforming Tasks
 
AutoGAN: Neural Architecture Search for Generative Adversarial Networks
AutoGAN: Neural Architecture Search for Generative Adversarial Networks
 
Reasoning about Human-Object Interactions through Dual Attention Networks
Reasoning about Human-Object Interactions through Dual Attention Networks
 
Cross-channel Communication Networks
Cross-channel Communication Networks
 
Out-of-Domain Detection for Low-Resource Text Classification Tasks
Out-of-Domain Detection for Low-Resource Text Classification Tasks
 
Rethinking Cooperative Rationalization: Introspective Extraction and Complement Control
Rethinking Cooperative Rationalization: Introspective Extraction and Complement Control
 
Context-Aware Conversation Thread Detection in Multi-Party Chat
Context-Aware Conversation Thread Detection in Multi-Party Chat
 
Selection Bias Explorations and Debias Methods for Natural Language Sentence Matching Datasets
Selection Bias Explorations and Debias Methods for Natural Language Sentence Matching Datasets
 
TWEETQA: A Social Media Focused Question Answering Dataset
TWEETQA: A Social Media Focused Question Answering Dataset
 
Self-Supervised Learning for Contextualized Extractive Summarization
Self-Supervised Learning for Contextualized Extractive Summarization
 
Extracting Multiple-Relations in One-Pass with Pre-Trained Transformers
Extracting Multiple-Relations in One-Pass with Pre-Trained Transformers
 
Improving Question Answering over Incomplete KBs with Knowledge-Aware Reader
Improving Question Answering over Incomplete KBs with Knowledge-Aware Reader
 
SimVAE: Simulator-Assisted Training for Interpretable Generative Models
SimVAE: Simulator-Assisted Training for Interpretable Generative Models
 
Reverse-engineering causal graphs with soft interventions
Reverse-engineering causal graphs with soft interventions
 
New tricks from old dogs: multi-source transfer learning
New tricks from old dogs: multi-source transfer learning
 
Causal inference is expensive. Here’s an algorithm for fixing that.
Causal inference is expensive. Here’s an algorithm for fixing that.
 
Visual Concept-Metaconcept Learning
Visual Concept-Metaconcept Learning
 
Alleviating label switching with optimal transport
Alleviating label switching with optimal transport
 
Topics are more meaningful than words. AI for comparative literature.
Topics are more meaningful than words. AI for comparative literature.
 
Imitation learning from observations
Imitation learning from observations
 
Using geometry to understand documents
Using geometry to understand documents
 
SPAHM: Parameter matching for model fusion
SPAHM: Parameter matching for model fusion
 
Big-Little-Video-Net: Work smarter, not harder, for video understanding
Big-Little-Video-Net: Work smarter, not harder, for video understanding
 
ZO-AdaMM: Derivative-free optimization for black-box problems
ZO-AdaMM: Derivative-free optimization for black-box problems
 
Scalable inference of topic evolution via models for latent geometric structures
Scalable inference of topic evolution via models for latent geometric structures
 
Sample Efficient Active Learning of Causal Trees
Sample Efficient Active Learning of Causal Trees
 
Adversarial Examples Are Not Bugs, They Are Features
Adversarial Examples Are Not Bugs, They Are Features
 
ObjectNet: A large-scale bias-controlled dataset for pushing the limits of object recognition models
ObjectNet: A large-scale bias-controlled dataset for pushing the limits of object recognition models
 
Imitation Learning from Observations by Minimizing Inverse Dynamics Disagreement
Imitation Learning from Observations by Minimizing Inverse Dynamics Disagreement
 
Statistical Model Aggregation via Parameter Matching
Statistical Model Aggregation via Parameter Matching
 
Tight Certificates of Adversarial Robustness for Randomly Smoothed Classifiers
Tight Certificates of Adversarial Robustness for Randomly Smoothed Classifiers
 
Characterization and Learning of Causal Graphs with Latent Variables from Soft Interventions
Characterization and Learning of Causal Graphs with Latent Variables from Soft Interventions
 
Hierarchical Optimal Transport for Document Representation
Hierarchical Optimal Transport for Document Representation
 
A Game Theoretic Approach to Class-wise Selective Rationalization
A Game Theoretic Approach to Class-wise Selective Rationalization
 
Learning New Tricks From Old Dogs: Multi-Source Transfer Learning From Pre-Trained Networks
Learning New Tricks From Old Dogs: Multi-Source Transfer Learning From Pre-Trained Networks
 
Image Synthesis with a Single (Robust) Classifier
Image Synthesis with a Single (Robust) Classifier
 
Private Testing of Distributions via Sample Permutations
Private Testing of Distributions via Sample Permutations
 
Sobolev Independence Criterion
Sobolev Independence Criterion
 
ZO-AdaMM: Zeroth-Order Adaptive Momentum Method for Black-Box Optimization
ZO-AdaMM: Zeroth-Order Adaptive Momentum Method for Black-Box Optimization
 
Scalable Spike Source Localization in Extracellular Recordings using Amortized Variational Inference
Scalable Spike Source Localization in Extracellular Recordings using Amortized Variational Inference
 
Point-Voxel CNN for Efficient 3D Deep Learning
Point-Voxel CNN for Efficient 3D Deep Learning
 
Embedding Compression with Isotropic Iterative Quantization
Embedding Compression with Isotropic Iterative Quantization
 
Automating machine learning with a joint selection framework
Automating machine learning with a joint selection framework
 
EvolveGCN: Evolving Graph Convolutional Networks for Dynamic Graphs
EvolveGCN: Evolving Graph Convolutional Networks for Dynamic Graphs
 
MULE: Multimodal Universal Language Embedding
MULE: Multimodal Universal Language Embedding
 
Robust Low-Rank Discovery of Data-Driven Partial Differential Equations
Robust Low-Rank Discovery of Data-Driven Partial Differential Equations
 
Top-Quality Planning: Finding Practically Useful Sets of Best Plans
Top-Quality Planning: Finding Practically Useful Sets of Best Plans
 
Towards Certificated Model Robustness Against Weight Perturbations
Towards Certificated Model Robustness Against Weight Perturbations
 
CASTER: Predicting drug interactions with chemical substructure representation
CASTER: Predicting drug interactions with chemical substructure representation
 
CAG: A Real-Time Low-Cost Enhanced-Robustness High-Transferability Content-Aware Adversarial Attack Generator
CAG: A Real-Time Low-Cost Enhanced-Robustness High-Transferability Content-Aware Adversarial Attack Generator
 
Fastened crown: Tightened neural network robustness certificates
Fastened crown: Tightened neural network robustness certificates
 
A unifying framework for expectation-aware AI planning
A unifying framework for expectation-aware AI planning
 
Reading between the lines with graph deep learning for NLP
Reading between the lines with graph deep learning for NLP
 
Online AI planning with graph neural networks and adaptive scheduling
Online AI planning with graph neural networks and adaptive scheduling
 
On the Convergence of A Class of Adam-Type Algorithms for Non-Convex Optimization
On the Convergence of A Class of Adam-Type Algorithms for Non-Convex Optimization
 
Anti-Money Laundering in Bitcoin: Experimenting with Graph Convolutional Networks for Financial Forensics
Anti-Money Laundering in Bitcoin: Experimenting with Graph Convolutional Networks for Financial Forensics
 
Self-supervised Moving Vehicle Tracking with Stereo Sound
Self-supervised Moving Vehicle Tracking with Stereo Sound
 
The sound of motions
The sound of motions
 
TSM: Temporal Shift Module for Efficient Video Understanding
TSM: Temporal Shift Module for Efficient Video Understanding
 
Graph Convolutional Networks for Temporal Action Localization
Graph Convolutional Networks for Temporal Action Localization
 
On the Design of Black-box Adversarial Examples by Leveraging Gradient-free Optimization and Operator Splitting Method
On the Design of Black-box Adversarial Examples by Leveraging Gradient-free Optimization and Operator Splitting Method
 
LaSO: Label-Set Operations networks for multi-label few-shot learning
LaSO: Label-Set Operations networks for multi-label few-shot learning
 
SpotTune: Transfer Learning through Adaptive Fine-tuning
SpotTune: Transfer Learning through Adaptive Fine-tuning
 
RepMet: Representative-based metric learning for classification and one-shot object detection
RepMet: Representative-based metric learning for classification and one-shot object detection
 
ObjectNet: A bias-controlled dataset object recognition
ObjectNet: A bias-controlled dataset object recognition
 
CASTER: An AI framework for preventing adverse reactions to medication
CASTER: An AI framework for preventing adverse reactions to medication
 
Adversarial Robustness vs Model Compression, or Both?
Adversarial Robustness vs Model Compression, or Both?
 
Reshaping Diverse Planning
Reshaping Diverse Planning
 
Location-aware Graph Convolutional Networks for Video Question Answering
Location-aware Graph Convolutional Networks for Video Question Answering
 
Deep Symbolic Superoptimization Without Human Knowledge
Deep Symbolic Superoptimization Without Human Knowledge
 
A Closer Look at Deep Policy Gradients
A Closer Look at Deep Policy Gradients
 
Deep Audio Priors Emerge From Harmonic Convolutional Networks
Deep Audio Priors Emerge From Harmonic Convolutional Networks
 
Once for All: Train One Network and Specialize it for Efficient Deployment
Once for All: Train One Network and Specialize it for Efficient Deployment
 
Implementation Matters in Deep RL: A Case Study on PPO and TRPO
Implementation Matters in Deep RL: A Case Study on PPO and TRPO
 
Experiences and Insights for Collaborative Industry-Academic Research in Artificial Intelligence
Experiences and Insights for Collaborative Industry-Academic Research in Artificial Intelligence
 
Learning Rate Rewinding for elegant neural network pruning
Learning Rate Rewinding for elegant neural network pruning
 
Layer-wise federated learning with FedMA
Layer-wise federated learning with FedMA
 
Learning to learn with distributional signatures for text data
Learning to learn with distributional signatures for text data
 
Fast and efficient black-box testing for AI cybersecurity
Fast and efficient black-box testing for AI cybersecurity
 
CLEVRER: The first video dataset for neuro-symbolic reasoning
CLEVRER: The first video dataset for neuro-symbolic reasoning
 
Why Gradient Clipping accelerates training for neural networks
Why Gradient Clipping accelerates training for neural networks
 
SenSR: the first practical algorithm for individual fairness
SenSR: the first practical algorithm for individual fairness
 
Federated Adversarial Domain Adaptation
Federated Adversarial Domain Adaptation
 
Why Gradient Clipping Accelerates Training: A Theoretical Justification For Adaptivity
Why Gradient Clipping Accelerates Training: A Theoretical Justification For Adaptivity
 
Sign Bits Are All You Need For Black-box Attacks
Sign Bits Are All You Need For Black-box Attacks
 
APQ: Joint Search for Network Architecture, Pruning and Quantization Policy
APQ: Joint Search for Network Architecture, Pruning and Quantization Policy
 
Adversarial Robustness: From Self-Supervised Pre-Training to Fine-Tuning
Adversarial Robustness: From Self-Supervised Pre-Training to Fine-Tuning
 
Towards Verifying Robustness of Neural Networks against Semantic Perturbations
Towards Verifying Robustness of Neural Networks against Semantic Perturbations
 
Dense Regression Network for Video Grounding
Dense Regression Network for Video Grounding
 
Relationship Matters: Relation Guided Knowledge Transfer for Incremental Learning of Object Detectors
Relationship Matters: Relation Guided Knowledge Transfer for Incremental Learning of Object Detectors
 
Bio-Inspired Hashing for Unsupervised Similarity Search
Bio-Inspired Hashing for Unsupervised Similarity Search
 
Proper Network Interpretability Helps Adversarial Robustness in Classification
Proper Network Interpretability Helps Adversarial Robustness in Classification
 
Unsupervised Speech Decomposition via Triple Information Bottleneck
Unsupervised Speech Decomposition via Triple Information Bottleneck
 
Invariant Rationalization
Invariant Rationalization
 
Min-Max Optimization without Gradients: Convergence and Applications to Black-Box Evasion and Poisoning Attacks
Min-Max Optimization without Gradients: Convergence and Applications to Black-Box Evasion and Poisoning Attacks
 
Learning Task-Agnostic Embedding of Multiple Black-Box Experts for Multi-Task Model Fusion
Learning Task-Agnostic Embedding of Multiple Black-Box Experts for Multi-Task Model Fusion
 
AR-Net: Adaptive Frame Resolution for Efficient Action Recognition
AR-Net: Adaptive Frame Resolution for Efficient Action Recognition
 
Characterization of Overlap in Observational Studies
Characterization of Overlap in Observational Studies
 
COCO-FUNIT: Few-Shot Unsupervised Image Translation with a Content Conditioned Style Encoder
COCO-FUNIT: Few-Shot Unsupervised Image Translation with a Content Conditioned Style Encoder
 
Gaussian-Smoothed Optimal Transport: Metric Structure and Statistical Efficiency
Gaussian-Smoothed Optimal Transport: Metric Structure and Statistical Efficiency
 
Auditing ML Models for Individual Bias and Unfairness
Auditing ML Models for Individual Bias and Unfairness
 
We Have So Much in Common: Modeling Semantic Relational Set Abstractions in Videos
We Have So Much in Common: Modeling Semantic Relational Set Abstractions in Videos
 
Domain2Vec: Domain Embedding for Unsupervised Domain Adaptation
Domain2Vec: Domain Embedding for Unsupervised Domain Adaptation
 
TAFSSL: Task-Adaptive Feature Sub-Space Learning for few-shot classification
TAFSSL: Task-Adaptive Feature Sub-Space Learning for few-shot classification
 
OnlineAugment: Online Data Augmentation with Less Domain Knowledge
OnlineAugment: Online Data Augmentation with Less Domain Knowledge
 
A Broader Study of Cross-Domain Few-Shot Learning
A Broader Study of Cross-Domain Few-Shot Learning
 
DataMix: Efficient Privacy-Preserving Edge-Cloud Inference
DataMix: Efficient Privacy-Preserving Edge-Cloud Inference
 
Foley Music: Learning to Generate Music from Videos
Foley Music: Learning to Generate Music from Videos
 
Adversarial T-shirt! Evading Person Detectors in A Physical World
Adversarial T-shirt! Evading Person Detectors in A Physical World
 
Learning to Scale Multilingual Representations for Vision-Language Tasks
Learning to Scale Multilingual Representations for Vision-Language Tasks
 
Does enforcing fairness mitigate biases caused by subpopulation shift?
Does enforcing fairness mitigate biases caused by subpopulation shift?
 
MCUNet: Tiny Deep Learning on IoT Devices
MCUNet: Tiny Deep Learning on IoT Devices
 
Approximate Cross-Validation for Structured Models
Approximate Cross-Validation for Structured Models
 
AdaShare: Learning What To Share For Efficient Deep Multi-Task Learning
AdaShare: Learning What To Share For Efficient Deep Multi-Task Learning
 
Uncertainty-Aware Learning for Zero-Shot Semantic Segmentation
Uncertainty-Aware Learning for Zero-Shot Semantic Segmentation
 
Revisiting the Sample Complexity of Sparse Spectrum Approximation of Gaussian Processes
Revisiting the Sample Complexity of Sparse Spectrum Approximation of Gaussian Processes
 
The Lottery Ticket Hypothesis for the Pre-trained BERT Networks
The Lottery Ticket Hypothesis for the Pre-trained BERT Networks
 
Continuous Regularized Wasserstein Barycenters
Continuous Regularized Wasserstein Barycenters
 
Tiny Transfer Learning: Towards Memory-Efficient On-Device Learning
Tiny Transfer Learning: Towards Memory-Efficient On-Device Learning
 
Auxiliary Task Reweighting for Minimum-data Learning
Auxiliary Task Reweighting for Minimum-data Learning
 
Log-Likelihood Ratio Minimizing Flows: Towards Robust and Quantifiable Neural Distribution Alignment
Log-Likelihood Ratio Minimizing Flows: Towards Robust and Quantifiable Neural Distribution Alignment
 
Simulating a Primary Visual Cortex at the Front of CNNs Improves Robustness to Image Perturbations
Simulating a Primary Visual Cortex at the Front of CNNs Improves Robustness to Image Perturbations
 
Asymptotic Guarantees for Generative Modeling based on the Smooth Wasserstein Distance
Asymptotic Guarantees for Generative Modeling based on the Smooth Wasserstein Distance
 
Training Stronger Baselines for Learning to Optimize
Training Stronger Baselines for Learning to Optimize
 
Applications of Common Entropy in Causal Inference
Applications of Common Entropy in Causal Inference
 
Causal Discovery from Soft Interventions with Unknown Targets: Characterization and Learning
Causal Discovery from Soft Interventions with Unknown Targets: Characterization and Learning
 
Entropic Causal Inference: Identifiability and Finite Sample Results
Entropic Causal Inference: Identifiability and Finite Sample Results
 
Higher-Order Certification For Randomized Smoothing
Higher-Order Certification For Randomized Smoothing
 
Debiased Contrastive Learning
Debiased Contrastive Learning
 
Learning Restricted Boltzmann Machines With Sparse Latent Variables
Learning Restricted Boltzmann Machines With Sparse Latent Variables
 
Differentiable Augmentation for Data-Efficient GAN Training
Differentiable Augmentation for Data-Efficient GAN Training
 
Robust Federated Learning: The Case of Affine Distribution Shifts
Robust Federated Learning: The Case of Affine Distribution Shifts
 
Sharp Representation Theorems for ReLU Networks with Precise Dependence on Depth
Sharp Representation Theorems for ReLU Networks with Precise Dependence on Depth
 
Online Bayesian Goal Inference for Boundedly-Rational Planning Agents
Online Bayesian Goal Inference for Boundedly-Rational Planning Agents
 
Adversarially-learned Inference via an Ensemble of Discrete Undirected Graphical Models
Adversarially-learned Inference via an Ensemble of Discrete Undirected Graphical Models
 
Active Structure Learning of Causal DAGs via Directed Clique Trees
Active Structure Learning of Causal DAGs via Directed Clique Trees
 
CogMol: Target-Specific and Selective Drug Design for COVID-19 Using Deep Generative Models
CogMol: Target-Specific and Selective Drug Design for COVID-19 Using Deep Generative Models
 
Fairness in Streaming Submodular Maximization: Algorithms and Hardness
Fairness in Streaming Submodular Maximization: Algorithms and Hardness
 
Testing Determinantal Point Processes
Testing Determinantal Point Processes
 
Least Squares Regression with Markovian Data: Fundamental Limits and Algorithms
Least Squares Regression with Markovian Data: Fundamental Limits and Algorithms
 
Learning Physical Graph Representations from Visual Scenes
Learning Physical Graph Representations from Visual Scenes
 
Black Loans Matter: Fighting Bias for AI Fairness in Lending
Black Loans Matter: Fighting Bias for AI Fairness in Lending
 
Simulating a Primary Visual Cortex at the Front of CNNs Improves Robustness to Image Perturbations
Simulating a Primary Visual Cortex at the Front of CNNs Improves Robustness to Image Perturbations
 
Learning Neural-Symbolic Descriptive Planning Models via Cube-Space Priors: The Voyage Home (to STRIPS)
Learning Neural-Symbolic Descriptive Planning Models via Cube-Space Priors: The Voyage Home (to STRIPS)
 
The Computational Limits of Deep Learning
The Computational Limits of Deep Learning
 
Do Neural Networks Really Need to Be So Big?
Do Neural Networks Really Need to Be So Big?
 
Can a Fruit Fly Learn Word Embeddings?
Can a Fruit Fly Learn Word Embeddings?
 
Large Associative Memory Problem in Neurobiology and Machine Learning
Large Associative Memory Problem in Neurobiology and Machine Learning
 
On Learning Continuous Pairwise Markov Random Fields
On Learning Continuous Pairwise Markov Random Fields
 
Rate-improved inexact augmented Lagrangian method for constrained nonconvex optimization
Rate-improved inexact augmented Lagrangian method for constrained nonconvex optimization
 
Directed Acyclic Graph Neural Networks
Directed Acyclic Graph Neural Networks
 
Discrete Graph Structure Learning for Forecasting Multiple Time Series
Discrete Graph Structure Learning for Forecasting Multiple Time Series
 
VA-RED^2: Video Adaptive Redundancy Reduction
VA-RED^2: Video Adaptive Redundancy Reduction
 
AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition
AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition
 
Improved Mutual Information Estimation
Improved Mutual Information Estimation
 
Fast Training of Provably Robust Neural Networks by SingleProp
Fast Training of Provably Robust Neural Networks by SingleProp
 
A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning
A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning
 
Adversarial Option-Aware Hierarchical Imitation Learning
Adversarial Option-Aware Hierarchical Imitation Learning
 
Narrative Question Answering with Cutting-Edge Open-Domain QA Techniques: A Comprehensive Study
Narrative Question Answering with Cutting-Edge Open-Domain QA Techniques: A Comprehensive Study
 
Benchmarking Commercial Intent Detection Services with Practice-Driven Evaluations
Benchmarking Commercial Intent Detection Services with Practice-Driven Evaluations
 
Multilingual BERT Post-Pretraining Alignment
Multilingual BERT Post-Pretraining Alignment
 
Interactive Fiction Game Playing as Multi-Paragraph Reading Comprehension with Reinforcement Learning
Interactive Fiction Game Playing as Multi-Paragraph Reading Comprehension with Reinforcement Learning
 
Outlier-Robust Optimal Transport
Outlier-Robust Optimal Transport
 
Lottery Ticket Preserves Weight Correlation: Is It Desirable or Not?
Lottery Ticket Preserves Weight Correlation: Is It Desirable or Not?
 
Auto-NBA: Efficient and Effective Search Over The Joint Space of Networks, Bitwidths, and Accelerators
Auto-NBA: Efficient and Effective Search Over The Joint Space of Networks, Bitwidths, and Accelerators
 
Global Prosody Style Transfer Without Text Transcriptions
Global Prosody Style Transfer Without Text Transcriptions
 
AGENT: A Benchmark for Core Psychological Reasoning
AGENT: A Benchmark for Core Psychological Reasoning
 
Complementary Evidence Identification in Open-Domain Question Answering
Complementary Evidence Identification in Open-Domain Question Answering
 
Learning-based Support Estimation In Sublinear Time
Learning-based Support Estimation In Sublinear Time
 
Anycost GANs for Interactive Image Synthesis and Editing
Anycost GANs for Interactive Image Synthesis and Editing
 
Black-box Explanation of Object Detectors via Saliency Maps
Black-box Explanation of Object Detectors via Saliency Maps
 
Fair Selective Classification Via Sufficiency
Fair Selective Classification Via Sufficiency
 
Statistical inference for individual fairness
Statistical inference for individual fairness
 
Individually Fair Ranking
Individually Fair Ranking
 
Individually Fair Gradient Boosting
Individually Fair Gradient Boosting
 
SenSeI: Sensitive Set Invariance for Enforcing Individual Fairness
SenSeI: Sensitive Set Invariance for Enforcing Individual Fairness
 
MVFNet: Multi-View Fusion Network for Efficient Video Recognition
MVFNet: Multi-View Fusion Network for Efficient Video Recognition
 
Augmenting Policy Learning with Routines Discovered from a Single Demonstration
Augmenting Policy Learning with Routines Discovered from a Single Demonstration
 
RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning.
RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning.
 
Learning Task Decomposition with Ordered Memory Policy Network
Learning Task Decomposition with Ordered Memory Policy Network
 
Grounding Physical Concepts of Objects and Events Through Dynamic Visual Reasoning
Grounding Physical Concepts of Objects and Events Through Dynamic Visual Reasoning
 
PlasticineLab: A Soft-Body Manipulation Benchmark with Differentiable Physics
PlasticineLab: A Soft-Body Manipulation Benchmark with Differentiable Physics
 
Hidden Cost of Randomized Smoothing
Hidden Cost of Randomized Smoothing
 
Neural Network Control Policy Verification with Persistent Adversarial Perturbations
Neural Network Control Policy Verification with Persistent Adversarial Perturbations
 
Fine-grained Angular Contrastive Learning with Coarse Labels
Fine-grained Angular Contrastive Learning with Coarse Labels
 
Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution
Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution
 
Complexity of Finding Stationary Points of Nonconvex Nonsmooth Functions
Complexity of Finding Stationary Points of Nonconvex Nonsmooth Functions
 
Why Do These Match? Explaining the Behavior of Image Similarity Models
Why Do These Match? Explaining the Behavior of Image Similarity Models
 
Self-Progressing Robust Training
Self-Progressing Robust Training
 
The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models
The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models
 
Found a Reason for me? Weakly-supervised Grounded Visual Question Answering using Capsules
Found a Reason for me? Weakly-supervised Grounded Visual Question Answering using Capsules
 
Separating Skills and Concepts for Novel Visual Question Answering
Separating Skills and Concepts for Novel Visual Question Answering
 
High-Dimensional Feature Selection for Sample Efficient Treatment Effect Estimation
High-Dimensional Feature Selection for Sample Efficient Treatment Effect Estimation
 
Parameter-Efficient Transfer Learning with Diff Pruning
Parameter-Efficient Transfer Learning with Diff Pruning
 
Sequence-Level Mixed Sample Data Augmentation
Sequence-Level Mixed Sample Data Augmentation
 
Generating Adversarial Computer Programs using Optimized Obfuscations
Generating Adversarial Computer Programs using Optimized Obfuscations
 
On Fast Adversarial Robustness Adaptation in Model-Agnostic Meta-Learning
On Fast Adversarial Robustness Adaptation in Model-Agnostic Meta-Learning
 
Robust Overfitting may be mitigated by properly learned smoothening
Robust Overfitting may be mitigated by properly learned smoothening
 
Ordering-based causal structure learning in the presence of latent variables
Ordering-based causal structure learning in the presence of latent variables
 
Causal structure discovery from distributions arising from mixtures of DAGs
Causal structure discovery from distributions arising from mixtures of DAGs
 
Online Optimal Control with Affine Constraints
Online Optimal Control with Affine Constraints
 
A Deep Reinforcement Learning Approach to First-Order Logic Theorem Proving
A Deep Reinforcement Learning Approach to First-Order Logic Theorem Proving
 
RT3D: Achieving Real-Time Execution of 3D Convolutional Neural Networks on Mobile Devices
RT3D: Achieving Real-Time Execution of 3D Convolutional Neural Networks on Mobile Devices
 
Lite Transformer with Long-Short Range Attention
Lite Transformer with Long-Short Range Attention
 
GAN Compression: Efficient Architectures for Interactive Conditional GANs
GAN Compression: Efficient Architectures for Interactive Conditional GANs
 
Wasserstein Style Transfer
Wasserstein Style Transfer
 
Unsupervised Hierarchical Matching with Optimal Transport over Hyperbolic Spaces
Unsupervised Hierarchical Matching with Optimal Transport over Hyperbolic Spaces
 
Heterogeneous Knowledge Transfer via Hierarchical Teaching in Cooperative Multiagent Reinforcement Learning
Heterogeneous Knowledge Transfer via Hierarchical Teaching in Cooperative Multiagent Reinforcement Learning
 
Nano-Material Configuration Design with Deep Surrogate Langevin Dynamics
Nano-Material Configuration Design with Deep Surrogate Langevin Dynamics
 
Building Calibrated Deep Models via Uncertainty Matching with Auxiliary Interval Predictors
Building Calibrated Deep Models via Uncertainty Matching with Auxiliary Interval Predictors
 
Understanding Behavior of Clinical Models under Domain Shifts
Understanding Behavior of Clinical Models under Domain Shifts
 
Fashion IQ: A New Dataset towards Retrieving Images by Natural Language Feedback
Fashion IQ: A New Dataset towards Retrieving Images by Natural Language Feedback
 
StarNet: towards weakly supervised few-shot detection and explainable few-shot classification
StarNet: towards weakly supervised few-shot detection and explainable few-shot classification
 
Do GANs always have Nash equilibria?
Do GANs always have Nash equilibria?
 
Two Simple Ways to Learn Individual Fairness Metrics from Data
Two Simple Ways to Learn Individual Fairness Metrics from Data
 
Semi-Supervised Action Recognition with Temporal Contrastive Learning
Semi-Supervised Action Recognition with Temporal Contrastive Learning
 
HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
 
Camera On-boarding for Person Re-identification using Hypothesis Transfer Learning
Camera On-boarding for Person Re-identification using Hypothesis Transfer Learning
 
Non-Adversarial Video Synthesis with Learned Priors
Non-Adversarial Video Synthesis with Learned Priors
 
NASTransfer: Analyzing Architecture Transferability in Large Scale Neural Architecture Search
NASTransfer: Analyzing Architecture Transferability in Large Scale Neural Architecture Search
 
Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning
Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning
 
An Image Enhancing Pattern-based Sparsity for Real-time Inference on Mobile Devices
An Image Enhancing Pattern-based Sparsity for Real-time Inference on Mobile Devices
 
Practical Detection of Trojan Neural Networks: Data-Limited and Data-Free Cases
Practical Detection of Trojan Neural Networks: Data-Limited and Data-Free Cases
 
Unsupervised Learning of Graph Hierarchical Abstractions with Differentiable Coarsening and Optimal Transport
Unsupervised Learning of Graph Hierarchical Abstractions with Differentiable Coarsening and Optimal Transport
 
Graph Universal Adversarial Attacks: A Few Bad Actors Ruin Graph Learning Models
Graph Universal Adversarial Attacks: A Few Bad Actors Ruin Graph Learning Models
 
Linear Mode Connectivity and The Lottery Ticket Hypothesis
Linear Mode Connectivity and The Lottery Ticket Hypothesis
 
Fast Learning of Graph Neural Networks with Guaranteed Generalizability: One-hidden-layer Case
Fast Learning of Graph Neural Networks with Guaranteed Generalizability: One-hidden-layer Case
 
Curvature-corrected learning dynamics in deep neural networks
Curvature-corrected learning dynamics in deep neural networks
 
Is There a Trade-Off Between Fairness and Accuracy? A Perspective Using Mismatched Hypothesis Testing
Is There a Trade-Off Between Fairness and Accuracy? A Perspective Using Mismatched Hypothesis Testing
 
Model Fusion with Kullback–Leibler Divergence
Model Fusion with Kullback–Leibler Divergence
 
Polygonal Building Extraction by Frame Field Learning
Polygonal Building Extraction by Frame Field Learning
 
Spoken Moments: Learning Joint Audio-Visual Representations from Video Descriptions
Spoken Moments: Learning Joint Audio-Visual Representations from Video Descriptions
 
Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition
Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition
 
Differentiable Sorting Networks for Scalable Sorting and Ranking Supervision
Differentiable Sorting Networks for Scalable Sorting and Ranking Supervision
 
Leveraging Language to Learn Program Abstractions and Search Heuristics
Leveraging Language to Learn Program Abstractions and Search Heuristics
 
Correlation Clustering in Constant Many Parallel Rounds
Correlation Clustering in Constant Many Parallel Rounds
 
A Targeted Assessment of Incremental Processing in Neural Language Models and Humans
A Targeted Assessment of Incremental Processing in Neural Language Models and Humans
 
Lexicon Learning for Few-Shot Neural Sequence Modeling
Lexicon Learning for Few-Shot Neural Sequence Modeling
 
Structural Guidance for Transformer Language Models
Structural Guidance for Transformer Language Models
 
On Sample Based Explanation Methods for NLP: Efficiency, Faithfulness, and Semantic Evaluation
On Sample Based Explanation Methods for NLP: Efficiency, Faithfulness, and Semantic Evaluation
 
MSRP Industry Night: Career Exploration with IBM
MSRP Industry Night: Career Exploration with IBM
 
Decentralized Learning for Overparameterized Problems: A Multi-Agent Kernel Approximation Approach
Decentralized Learning for Overparameterized Problems: A Multi-Agent Kernel Approximation Approach
 
FALCON: Fast Visual Concept Learning by Integrating Images, Linguistic descriptions, and Conceptual Relations
FALCON: Fast Visual Concept Learning by Integrating Images, Linguistic descriptions, and Conceptual Relations
 
The Algonauts Project 2021 Challenge: How the Human Brain Makes Sense of a World in Motion
The Algonauts Project 2021 Challenge: How the Human Brain Makes Sense of a World in Motion
 
On sensitivity of meta-learning to support data
On sensitivity of meta-learning to support data
 
Tune It the Right Way: Unsupervised Validation of Domain Adaptation via Soft Neighborhood Density
Tune It the Right Way: Unsupervised Validation of Domain Adaptation via Soft Neighborhood Density
 
Learning Cross-Modal Contrastive Features for Video Domain Adaptation
Learning Cross-Modal Contrastive Features for Video Domain Adaptation
 
CDS: Cross-domain self-supervised pre-training
CDS: Cross-domain self-supervised pre-training
 
Curious Representation Learning for Embodied Intelligence
Curious Representation Learning for Embodied Intelligence
 
LocTex: Learning Data-Efficient Visual Representations from Localized Textual Supervision
LocTex: Learning Data-Efficient Visual Representations from Localized Textual Supervision
 
Generalized and Incremental Few-Shot Learning by Explicit Learning and Calibration without Forgetting
Generalized and Incremental Few-Shot Learning by Explicit Learning and Calibration without Forgetting
 
Multimodal Clustering Networks for Self-Supervised Learning From Unlabeled Videos
Multimodal Clustering Networks for Self-Supervised Learning From Unlabeled Videos
 
Detector-Free Weakly Supervised Grounding by Separation
Detector-Free Weakly Supervised Grounding by Separation
 
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
 
A Broad Study on the Transferability of Visual Representations With Contrastive Learning
A Broad Study on the Transferability of Visual Representations With Contrastive Learning
 
SACoD: Sensor Algorithm Co-Design Towards Efficient CNN-Powered Intelligent PhlatCam
SACoD: Sensor Algorithm Co-Design Towards Efficient CNN-Powered Intelligent PhlatCam
 
Dynamic Video Quantization for Efficient Inference
Dynamic Video Quantization for Efficient Inference
 
AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition
AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition
 
Post-processing for Individual Fairness
Post-processing for Individual Fairness
 
Multi-Moments in Time: Learning and Interpreting Models for Multi-Action Video Understanding
Multi-Moments in Time: Learning and Interpreting Models for Multi-Action Video Understanding
 
Controlled Evaluation of Grammatical Knowledge in Mandarin Chinese Language Models
Controlled Evaluation of Grammatical Knowledge in Mandarin Chinese Language Models
 
How Do Neural Sequence Models Generalize? Local and Global Cues for Out-of-Distribution Prediction
How Do Neural Sequence Models Generalize? Local and Global Cues for Out-of-Distribution Prediction
 
Adversarial Attack Generation Empowered by Min-Max Optimization
Adversarial Attack Generation Empowered by Min-Max Optimization
 
Contrast and Mix: Temporal Contrastive Video Domain Adaptation with Background Mixing
Contrast and Mix: Temporal Contrastive Video Domain Adaptation with Background Mixing
 
Drawing Robust Scratch Tickets: Subnetworks with Inborn Robustness Are Found within Randomly Initialized Networks
Drawing Robust Scratch Tickets: Subnetworks with Inborn Robustness Are Found within Randomly Initialized Networks
 
Dynamic Distillation Network for Cross-Domain Few-Shot Recognition with Unlabeled Data
Dynamic Distillation Network for Cross-Domain Few-Shot Recognition with Unlabeled Data
 
Learning with Algorithmic Supervision via Continuous Relaxations
Learning with Algorithmic Supervision via Continuous Relaxations
 
On the Equivalence between Neural Network and Support Vector Machine
On the Equivalence between Neural Network and Support Vector Machine
 
OpenMatch: Open-Set Semi-supervised Learning with Open-set Consistency Regularization
OpenMatch: Open-Set Semi-supervised Learning with Open-set Consistency Regularization
 
Targeted Neural Dynamical Modeling
Targeted Neural Dynamical Modeling
 
TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up
TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up
 
When does Contrastive Learning Preserve Adversarial Robustness from Pretraining to Finetuning?
When does Contrastive Learning Preserve Adversarial Robustness from Pretraining to Finetuning?
 
Why Lottery Ticket Wins? A Theoretical Perspective of Sample Complexity on Sparse Neural Networks
Why Lottery Ticket Wins? A Theoretical Perspective of Sample Complexity on Sparse Neural Networks
 
Look at What I’m Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos
Look at What I’m Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos
 
CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks
CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks
 
Robust Deep Reinforcement Learning through Adversarial Loss
Robust Deep Reinforcement Learning through Adversarial Loss
 
Understanding Interlocking Dynamics of Cooperative Rationalization
Understanding Interlocking Dynamics of Cooperative Rationalization
 
PTR: A Benchmark for Part-based Conceptual, Relational, and Physical Reasoning
PTR: A Benchmark for Part-based Conceptual, Relational, and Physical Reasoning
 
Neural Population Geometry Reveals the Role of Stochasticity in Robust Perception
Neural Population Geometry Reveals the Role of Stochasticity in Robust Perception
 
Memory-efficient Patch-based Inference for Tiny Deep Learning
Memory-efficient Patch-based Inference for Tiny Deep Learning
 
Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language
Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language
 
3DP3: 3D Scene Perception via Probabilistic Programming
3DP3: 3D Scene Perception via Probabilistic Programming
 
Measuring Generalization with Optimal Transport
Measuring Generalization with Optimal Transport
 
PARP: Prune Once, Adjust and Re-Prune for Self-Supervised Speech Recognition
PARP: Prune Once, Adjust and Re-Prune for Self-Supervised Speech Recognition
 
Sequence-to-Sequence Learning with Latent Neural Grammars
Sequence-to-Sequence Learning with Latent Neural Grammars
 
Sliced Mutual Information: A Scalable Measure of Statistical Dependence
Sliced Mutual Information: A Scalable Measure of Statistical Dependence
 
ThreeDWorld: A Platform for Interactive Multi-Modal Physical Simulation
ThreeDWorld: A Platform for Interactive Multi-Modal Physical Simulation
 
STAR: A Benchmark for Situated Reasoning in Real-World Videos
STAR: A Benchmark for Situated Reasoning in Real-World Videos
 
Unsupervised Domain Generalization by Learning a Bridge Across Domains
Unsupervised Domain Generalization by Learning a Bridge Across Domains
 
Everything at Once – Multi-modal Fusion Transformer for Video Retrieval
Everything at Once – Multi-modal Fusion Transformer for Video Retrieval
 
IA-RED^2 : Interpretability-Aware Redundancy Reduction for Vision Transformers
IA-RED^2 : Interpretability-Aware Redundancy Reduction for Vision Transformers
 
Noether networks: meta-learning useful conserved quantities
Noether networks: meta-learning useful conserved quantities
 
Delayed Gradient Averaging: Tolerate the Communication Latency for Federated Learning
Delayed Gradient Averaging: Tolerate the Communication Latency for Federated Learning
 
PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators
PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators
 
Change Point Detection via Multivariate Singular Spectrum Analysis
Change Point Detection via Multivariate Singular Spectrum Analysis
 
An Exact Characterization of the Generalization Error for the Gibbs Algorithm
An Exact Characterization of the Generalization Error for the Gibbs Algorithm
 
Grammar-Based Grounded Lexicon Learning
Grammar-Based Grounded Lexicon Learning
 
Do Neural Optimal Transport Solvers Work? A Continuous Wasserstein-2 Benchmark
Do Neural Optimal Transport Solvers Work? A Continuous Wasserstein-2 Benchmark
 
Learning to Delegate for Large-scale Vehicle Routing
Learning to Delegate for Large-scale Vehicle Routing
 
Efficient Generalization with Distributionally Robust Learning
Efficient Generalization with Distributionally Robust Learning
 
Object DGCNN: 3D Object Detection using Dynamic Graphs
Object DGCNN: 3D Object Detection using Dynamic Graphs
 
A Bayesian-Symbolic Approach to Reasoning and Learning in Intuitive Physics
A Bayesian-Symbolic Approach to Reasoning and Learning in Intuitive Physics
 
A large-scale benchmark for few-shot program induction and synthesis
A large-scale benchmark for few-shot program induction and synthesis
 
Tailoring: encoding inductive biases by optimizing unsupervised objectives at prediction time
Tailoring: encoding inductive biases by optimizing unsupervised objectives at prediction time
 
Understanding End-to-End Model-Based Reinforcement Learning Methods as Implicit Parameterization
Understanding End-to-End Model-Based Reinforcement Learning Methods as Implicit Parameterization
 
Discovering State and Action Abstractions for Generalized Task and Motion Planning
Discovering State and Action Abstractions for Generalized Task and Motion Planning
 
Temporal and Object Quantification Networks
Temporal and Object Quantification Networks
 
Reinforcement Learning for Classical Planning: Viewing Heuristics As Dense Reward Generators
Reinforcement Learning for Classical Planning: Viewing Heuristics As Dense Reward Generators
 
Few-Shot Bayesian Imitation Learning with Logical Program Policies
Few-Shot Bayesian Imitation Learning with Logical Program Policies
 
Learning Symbolic Operators for Task and Motion Planning
Learning Symbolic Operators for Task and Motion Planning
 
CAMPS: Learning Context Specific Abstraction for Efficient Planning in Factored MDPs
CAMPS: Learning Context Specific Abstraction for Efficient Planning in Factored MDPs
 
GLIB: Efficient Exploration for Relational Model-Based Reinforcement Learning via Goal-Literal Babbling
GLIB: Efficient Exploration for Relational Model-Based Reinforcement Learning via Goal-Literal Babbling
 
Planning with Learned Object Importance in Large Problem Instances using Graph Neural Networks
Planning with Learned Object Importance in Large Problem Instances using Graph Neural Networks
 
Reverse Engineering of Imperceptible Adversarial Image Perturbations
Reverse Engineering of Imperceptible Adversarial Image Perturbations
 
Finding Valid Adjustments under Non-ignorability with Minimal DAG Knowledge
Finding Valid Adjustments under Non-ignorability with Minimal DAG Knowledge
 
Measuring the robustness of Gaussian processes to kernel choice
Measuring the robustness of Gaussian processes to kernel choice
 
Regret, stability & fairness in matching markets with bandit learners
Regret, stability & fairness in matching markets with bandit learners
 
A Class of Geometric Structures in Transfer Learning: Minimax Bounds and Optimality
A Class of Geometric Structures in Transfer Learning: Minimax Bounds and Optimality
 
Graph-Augmented Normalizing Flows for Anomaly Detection of Multiple Time Series
Graph-Augmented Normalizing Flows for Anomaly Detection of Multiple Time Series
 
Natural Language Descriptions of Deep Visual Features
Natural Language Descriptions of Deep Visual Features
 
DiffSkill: Skill Abstraction from Differentiable Physics for Deformable Object Manipulations with Tools
DiffSkill: Skill Abstraction from Differentiable Physics for Deformable Object Manipulations with Tools
 
Controlling Directions Orthogonal to a Classifier
Controlling Directions Orthogonal to a Classifier
 
Contact Points Discovery for Soft-Body Manipulations with Differentiable Physics
Contact Points Discovery for Soft-Body Manipulations with Differentiable Physics
 
Monotonic Differentiable Sorting Networks
Monotonic Differentiable Sorting Networks
 
Linking Emergent and Natural Languages via Corpus Transfer
Linking Emergent and Natural Languages via Corpus Transfer
 
Topological Experience Replay
Topological Experience Replay
 
RISP: Rendering-Invariant State Predictor with Differentiable Simulation and Rendering for Cross-Domain Parameter Estimation
RISP: Rendering-Invariant State Predictor with Differentiable Simulation and Rendering for Cross-Domain Parameter Estimation
 
How to Robustify Black-Box ML Models? A Zeroth-Order Optimization Perspective
How to Robustify Black-Box ML Models? A Zeroth-Order Optimization Perspective
 
Optimizer Amalgamation
Optimizer Amalgamation
 
How unlabeled data improve generalization in self-training? A one-hidden-layer theoretical analysis
How unlabeled data improve generalization in self-training? A one-hidden-layer theoretical analysis
 
ComPhy: Compositional Physical Reasoning of Objects and Events from Videos
ComPhy: Compositional Physical Reasoning of Objects and Events from Videos
 
Multi-Critic Actor Learning: Teaching RL Policies to Act with Style
Multi-Critic Actor Learning: Teaching RL Policies to Act with Style
 
Network Augmentation for Tiny Deep Learning
Network Augmentation for Tiny Deep Learning
 
Extending the WILDS Benchmark for Unsupervised Adaptation
Extending the WILDS Benchmark for Unsupervised Adaptation
 
Combinatorial Scientific Discovery: Finding New Concept Combinations Beyond Link Prediction
Combinatorial Scientific Discovery: Finding New Concept Combinations Beyond Link Prediction
 
Neural Parameter Allocation Search
Neural Parameter Allocation Search
 
Overcoming The Spectral Bias of Neural Value Approximation
Overcoming The Spectral Bias of Neural Value Approximation
 
Bi-linear Value Networks for Multi-goal Reinforcement Learning
Bi-linear Value Networks for Multi-goal Reinforcement Learning
 
AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning
AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning
 
RegionViT: Regional-to-Local Attention for Vision Transformers
RegionViT: Regional-to-Local Attention for Vision Transformers
 
Can an Image Classifier Suffice For Action Recognition?
Can an Image Classifier Suffice For Action Recognition?
 
Equivariant Self-Supervised Learning: Encouraging Equivariance in Representations
Equivariant Self-Supervised Learning: Encouraging Equivariance in Representations
 
Data-Efficient Graph Grammar Learning for Molecular Generation
Data-Efficient Graph Grammar Learning for Molecular Generation
 
Adversarial Support Alignment
Adversarial Support Alignment
 
Cross-Modal Discrete Representation Learning
Cross-Modal Discrete Representation Learning
 
Is Policy Learning Overrated?: Width-Based Planning and Active Learning for Atari
Is Policy Learning Overrated?: Width-Based Planning and Active Learning for Atari
 
Beyond Fairness: Reparative Algorithms to Address Historical Injustices of Housing Discrimination in the US
Beyond Fairness: Reparative Algorithms to Address Historical Injustices of Housing Discrimination in the US
 
Disentangling Visual and Written Concepts in CLIP
Disentangling Visual and Written Concepts in CLIP
 
LitePose: Efficient Architecture Design for 2D Human Pose Estimation
LitePose: Efficient Architecture Design for 2D Human Pose Estimation
 
AutoGPart: Intermediate Supervision Search for Generalizable 3D Part Segmentation
AutoGPart: Intermediate Supervision Search for Generalizable 3D Part Segmentation
 
Fixing Malfunctional Objects With Learned Physical Simulation and Functional Prediction
Fixing Malfunctional Objects With Learned Physical Simulation and Functional Prediction
 
Targeted Supervised Contrastive Learning for Long-Tailed Recognition
Targeted Supervised Contrastive Learning for Long-Tailed Recognition
 
SimVQA: Exploring Simulated Environments for Visual Question Answering
SimVQA: Exploring Simulated Environments for Visual Question Answering
 
Finding Fallen Objects Via Asynchronous Audio-Visual Integration
Finding Fallen Objects Via Asynchronous Audio-Visual Integration
 
Task2Sim: Towards Effective Pre-training and Transfer from Synthetic Data
Task2Sim: Towards Effective Pre-training and Transfer from Synthetic Data
 
VALHALLA: Visual Hallucination for Machine Translation
VALHALLA: Visual Hallucination for Machine Translation
 
DeepCurrents: Learning Implicit Representations of Shapes With Boundaries
DeepCurrents: Learning Implicit Representations of Shapes With Boundaries
 
ZeroWaste Dataset: Towards Deformable Object Segmentation in Cluttered Scenes
ZeroWaste Dataset: Towards Deformable Object Segmentation in Cluttered Scenes
 
Virtual Correspondence: Humans as a Cue for Extreme-View Geometry
Virtual Correspondence: Humans as a Cue for Extreme-View Geometry
 
A path to AI impact
A path to AI impact
 
Fast Convergence for Unstable Reinforcement Learning Problems by Logarithmic Mapping
Fast Convergence for Unstable Reinforcement Learning Problems by Logarithmic Mapping
 
On Convergence of Gradient Descent Ascent: A Tight Local Analysis
On Convergence of Gradient Descent Ascent: A Tight Local Analysis
 
Data-Efficient Double-Win Lottery Tickets from Robust Pre-training
Data-Efficient Double-Win Lottery Tickets from Robust Pre-training
 
CONTENTVEC: An Improved Self-Supervised Speech Representation by Disentangling Speakers
CONTENTVEC: An Improved Self-Supervised Speech Representation by Disentangling Speakers
 
Prompting Decision Transformer for Few-shot Policy Generalization
Prompting Decision Transformer for Few-shot Policy Generalization
 
Log-Euclidean Signatures for Intrinsic Distances Between Unaligned Datasets
Log-Euclidean Signatures for Intrinsic Distances Between Unaligned Datasets
 
Selective Regression under Fairness Criteria
Selective Regression under Fairness Criteria
 
Entropic Causal Inference: Graph Identifiability
Entropic Causal Inference: Graph Identifiability
 
Differentiable Top-k Classification Learning
Differentiable Top-k Classification Learning
 
Beyond Worst-Case Analysis in Stochastic Approximation: Moment Estimation Improves Instance Complexity
Beyond Worst-Case Analysis in Stochastic Approximation: Moment Estimation Improves Instance Complexity
 
Revisiting Contrastive Learning through the Lens of Neighborhood Component Analysis: an Integrated Framework
Revisiting Contrastive Learning through the Lens of Neighborhood Component Analysis: an Integrated Framework
 
Inductive Link Prediction Using Hyper-Relational Facts
Inductive Link Prediction Using Hyper-Relational Facts
 
Learning to Generate Image Source-Agnostic Universal Adversarial Perturbations
Learning to Generate Image Source-Agnostic Universal Adversarial Perturbations
 
An Adversarial Framework for Generating Unseen Images by Activation Maximization
An Adversarial Framework for Generating Unseen Images by Activation Maximization
 
Music Gesture for Visual Sound Separation
Music Gesture for Visual Sound Separation
 
Boston-area undergraduates develop a taste for careers in tech at Break Through Tech AI
Boston-area undergraduates develop a taste for careers in tech at Break Through Tech AI
 
Contrastive Learning with Complex Heterogeneity
Contrastive Learning with Complex Heterogeneity
 
Context-Specific Representation Abstraction for Deep Option Learning
Context-Specific Representation Abstraction for Deep Option Learning
 
Outlier impact characterization for time series data
Outlier impact characterization for time series data
 
Expanding the MIT-IBM Watson AI Lab’s network of neurons
Expanding the MIT-IBM Watson AI Lab’s network of neurons
 
A unified framework for domain adaptive pose estimation
A unified framework for domain adaptive pose estimation
 
A broad study of pre-training for domain generalization and adaptation
A broad study of pre-training for domain generalization and adaptation
 
Self-supervised classification network
Self-supervised classification network
 
SNAKE: Shape-aware Neural 3D Keypoint Field
SNAKE: Shape-aware Neural 3D Keypoint Field
 
How hard are computer vision datasets? Calibrating dataset difficulty to viewing time
How hard are computer vision datasets? Calibrating dataset difficulty to viewing time
 
On-Device Training Under 256KB Memory
On-Device Training Under 256KB Memory
 
Fair Infinitesimal Jackknife: Mitigating the Influence of Biased Training Data Points Without Refitting
Fair Infinitesimal Jackknife: Mitigating the Influence of Biased Training Data Points Without Refitting
 
Procedural Image Programs for Representation Learning
Procedural Image Programs for Representation Learning
 
Influencing Long-Term Behavior in Multiagent Reinforcement Learning
Influencing Long-Term Behavior in Multiagent Reinforcement Learning
 
FETA: Towards Specializing Foundation Models for Expert Task Applications
FETA: Towards Specializing Foundation Models for Expert Task Applications
 
Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens
Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens
 
k-Sliced Mutual Information: A Quantitative Study of Scalability with Dimension
k-Sliced Mutual Information: A Quantitative Study of Scalability with Dimension
 
Faster Linear Algebra for Distance Matrices
Faster Linear Algebra for Distance Matrices
 
Exponentially Improving the Complexity of Simulating the Weisfeiler-Lehman Test with Graph Neural Networks
Exponentially Improving the Complexity of Simulating the Weisfeiler-Lehman Test with Graph Neural Networks
 
Redeeming Intrinsic Rewards via Constrained Optimization
Redeeming Intrinsic Rewards via Constrained Optimization
 
Learning Physical Dynamics with Subequivariant Graph Neural Networks
Learning Physical Dynamics with Subequivariant Graph Neural Networks
 
Convergent representations of computer programs in human and artificial neural networks
Convergent representations of computer programs in human and artificial neural networks
 
Calibrated Data-Dependent Constraints with Exact Satisfaction Guarantees
Calibrated Data-Dependent Constraints with Exact Satisfaction Guarantees
 
Domain Adaptation meets Individual Fairness. And they get along
Domain Adaptation meets Individual Fairness. And they get along
 
Finding Differences Between Transformers and ConvNets Using Counterfactual Simulation Testing
Finding Differences Between Transformers and ConvNets Using Counterfactual Simulation Testing
 
DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations
DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations
 
Factored Adaptation for Non-stationary Reinforcement Learning
Factored Adaptation for Non-stationary Reinforcement Learning
 
Deep Differentiable Logic Gate Networks
Deep Differentiable Logic Gate Networks
 
Advancing Model Pruning via Bi-level Optimization
Advancing Model Pruning via Bi-level Optimization
 
S3-NeRF: Neural Reflectance Field from Shading and Shadow under a Single Viewpoint
S3-NeRF: Neural Reflectance Field from Shading and Shadow under a Single Viewpoint
 
The Missing Invariance Principle found — the Reciprocal Twin of Invariant Risk Minimization
The Missing Invariance Principle found — the Reciprocal Twin of Invariant Risk Minimization
 
Fairness Reprogramming
Fairness Reprogramming
 
Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing
Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing
 
Learning Active Camera for Multi-Object Navigation
Learning Active Camera for Multi-Object Navigation
 
Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation
Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation
 
Learning Neural Acoustic Fields
Learning Neural Acoustic Fields
 
3D Concept Grounding on Neural Fields
3D Concept Grounding on Neural Fields
 
How Transferable are Video Representations Based on Synthetic Data?
How Transferable are Video Representations Based on Synthetic Data?
 
3 Questions: Innovation in financial services through synthetic data
3 Questions: Innovation in financial services through synthetic data
 
Personalized Dialogue Generation with Persona-Adaptive Attention
Personalized Dialogue Generation with Persona-Adaptive Attention
 
Proximal Stochastic Recursive Momentum Methods for Nonconvex Composite Decentralized Optimization
Proximal Stochastic Recursive Momentum Methods for Nonconvex Composite Decentralized Optimization
 
Zero-shot linear combinations of grounded social interactions with Linear Social MDPs
Zero-shot linear combinations of grounded social interactions with Linear Social MDPs
 
Post-hoc Uncertainty Learning using a Dirichlet Meta-Model
Post-hoc Uncertainty Learning using a Dirichlet Meta-Model
 
DAG-GNN: DAG Structure Learning with Graph Neural Networks
DAG-GNN: DAG Structure Learning with Graph Neural Networks
 
A Family of Robust Stochastic Operators for Reinforcement Learning
A Family of Robust Stochastic Operators for Reinforcement Learning
 
Temperature Schedules for self-supervised contrastive methods on long-tail data
Temperature Schedules for self-supervised contrastive methods on long-tail data
 
Who Should Predict? Exact Algorithms For Learning to Defer to Humans
Who Should Predict? Exact Algorithms For Learning to Defer to Humans
 
Minimum-Entropy Coupling Approximation Guarantees Beyond the Majorization Barrier
Minimum-Entropy Coupling Approximation Guarantees Beyond the Majorization Barrier
 
Planning with Large Language Models for Code Generation
Planning with Large Language Models for Code Generation
 
Understanding new tasks through the lens of training data via exponential tilting
Understanding new tasks through the lens of training data via exponential tilting
 
A Theoretical Understanding of Shallow Vision Transformers: Learning, Generalization, and Sample Complexity
A Theoretical Understanding of Shallow Vision Transformers: Learning, Generalization, and Sample Complexity
 
Joint Edge-Model Sparse Learning is Provably Efficient for Graph Neural Networks
Joint Edge-Model Sparse Learning is Provably Efficient for Graph Neural Networks
 
TextGrad: Advancing Robustness Evaluation in NLP by Gradient-Driven Optimization
TextGrad: Advancing Robustness Evaluation in NLP by Gradient-Driven Optimization
 
Hyper-Decision Transformer for Efficient Online Policy Adaptation
Hyper-Decision Transformer for Efficient Online Policy Adaptation
 
What Is Missing in IRM Training and Evaluation? Challenges and Solutions
What Is Missing in IRM Training and Evaluation? Challenges and Solutions
 
Aligning Model and Macaque Inferior Temporal Cortex Representations Improves Model-to-Human Behavioral Alignment and Adversarial Robustness
Aligning Model and Macaque Inferior Temporal Cortex Representations Improves Model-to-Human Behavioral Alignment and Adversarial Robustness
 
DexDeform: Dexterous Deformable Object Manipulation with Human Demonstrations and Differentiable Physics
DexDeform: Dexterous Deformable Object Manipulation with Human Demonstrations and Differentiable Physics
 
SoftZoo: A Soft Robot Co-design Benchmark For Locomotion In Diverse Environments
SoftZoo: A Soft Robot Co-design Benchmark For Locomotion In Diverse Environments
 
ISAAC Newton: Input-based Approximate Curvature for Newton’s Method
ISAAC Newton: Input-based Approximate Curvature for Newton’s Method
 
Contrastive Audio-Visual Masked Autoencoder
Contrastive Audio-Visual Masked Autoencoder
 
FluidLab: A Differentiable Environment for Benchmarking Complex Fluid Manipulation
FluidLab: A Differentiable Environment for Benchmarking Complex Fluid Manipulation
 
Graph Neural Network-Inspired Kernels for Gaussian Processes in Semi-Supervised Learning
Graph Neural Network-Inspired Kernels for Gaussian Processes in Semi-Supervised Learning
 
Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning
Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning
 
PAC-NeRF: Physics Augmented Continuum Neural Radiance Fields for Geometry-Agnostic System Identification
PAC-NeRF: Physics Augmented Continuum Neural Radiance Fields for Geometry-Agnostic System Identification
 
Learning to Grow Pretrained Models for Efficient Transformer Training
Learning to Grow Pretrained Models for Efficient Transformer Training
 
Label-free Concept Bottleneck Models
Label-free Concept Bottleneck Models
 
AnyDA: Anytime Domain Adaptation
AnyDA: Anytime Domain Adaptation
 
Sampling with Mollified Interaction Energy Descent
Sampling with Mollified Interaction Energy Descent
 
Learning Proximal Operators to Discover Multiple Optima
Learning Proximal Operators to Discover Multiple Optima
 
Is conditional generative modeling all you need for decision-making?
Is conditional generative modeling all you need for decision-making?
 
Creating space for the evolution of generative and trustworthy AI
Creating space for the evolution of generative and trustworthy AI
 
CODA-Prompt: COntinual Decomposed Attention-based Prompting for Rehearsal-Free Continual Learning
CODA-Prompt: COntinual Decomposed Attention-based Prompting for Rehearsal-Free Continual Learning
 
Teaching Structured Vision & Language Concepts to Vision & Language Models
Teaching Structured Vision & Language Concepts to Vision & Language Models
 
Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention
Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention
 
3D Concept Learning and Reasoning from Multi-View Images
3D Concept Learning and Reasoning from Multi-View Images
 
Mod-Squad: Designing Mixtures of Experts As Modular Multi-Task Learners
Mod-Squad: Designing Mixtures of Experts As Modular Multi-Task Learners
 
Physics-Driven Diffusion Models for Impact Sound Synthesis from Videos
Physics-Driven Diffusion Models for Impact Sound Synthesis from Videos
 
Learning Situation Hyper-Graphs for Video Question Answering
Learning Situation Hyper-Graphs for Video Question Answering
 
EC^2 : Emergent Communication for Embodied Control
EC^2 : Emergent Communication for Embodied Control
 
Masked Motion Encoding for Self-Supervised Video Representation Learning
Masked Motion Encoding for Self-Supervised Video Representation Learning
 
FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer
FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer
 
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer
 
Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models
Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models
 
Video Test-Time Adaptation for Action Recognition
Video Test-Time Adaptation for Action Recognition
 
Understanding and Improving Visual Prompting: A Label-Mapping Perspective
Understanding and Improving Visual Prompting: A Label-Mapping Perspective
 
ConStruct-VL: Data-Free Continual Structured VL Concepts Learning
ConStruct-VL: Data-Free Continual Structured VL Concepts Learning
 
Predicate Invention for Bilevel Planning
Predicate Invention for Bilevel Planning
 
Learning Rational Subgoals from Demonstrations and Instructions
Learning Rational Subgoals from Demonstrations and Instructions
 
MaskSketch: Unpaired Structure-guided Masked Image Generation
MaskSketch: Unpaired Structure-guided Masked Image Generation
 
Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval
Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval
 
Bias Mimicking: A Simple Sampling Approach for Bias Mitigation
Bias Mimicking: A Simple Sampling Approach for Bias Mitigation
 
Language-Guided Audio-Visual Source Separation via Trimodal Consistency
Language-Guided Audio-Visual Source Separation via Trimodal Consistency
 
TGRL: Teacher Guided Reinforcement Learning Algorithm for POMDPs
TGRL: Teacher Guided Reinforcement Learning Algorithm for POMDPs
 
PFGM++: Unlocking the Potential of Physics-Inspired Generative Models
PFGM++: Unlocking the Potential of Physics-Inspired Generative Models
 
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
 
Change is Hard: A Closer Look at Subpopulation Shift
Change is Hard: A Closer Look at Subpopulation Shift
 
Towards Coherent Image Inpainting Using Denoising Diffusion Implicit Models
Towards Coherent Image Inpainting Using Denoising Diffusion Implicit Models
 
PromptBoosting: Black-Box Text Classification with Ten Forward Passes
PromptBoosting: Black-Box Text Classification with Ten Forward Passes
 
A Gromov-Wasserstein Geometric View of Spectrum-Preserving Graph Coarsening
A Gromov-Wasserstein Geometric View of Spectrum-Preserving Graph Coarsening
 
Compressed Decentralized Proximal Stochastic Gradient Method for Nonconvex Composite Problems with Heterogeneous Data
Compressed Decentralized Proximal Stochastic Gradient Method for Nonconvex Composite Problems with Heterogeneous Data
 
GC-Flow: A Graph-Based Flow Network for Effective Clustering
GC-Flow: A Graph-Based Flow Network for Effective Clustering
 
Multi-Symmetry Ensembles: Improving Diversity and Generalization via Opposing Symmetries
Multi-Symmetry Ensembles: Improving Diversity and Generalization via Opposing Symmetries
 
ConCerNet: A Contrastive Learning Based Framework for Automated Conservation Law Discovery and Trustworthy Dynamical System Prediction
ConCerNet: A Contrastive Learning Based Framework for Automated Conservation Law Discovery and Trustworthy Dynamical System Prediction
 
Hierarchical Grammar-Induced Geometry for Data-Efficient Molecular Property Prediction
Hierarchical Grammar-Induced Geometry for Data-Efficient Molecular Property Prediction