Understanding and Improving Visual Prompting: A Label-Mapping Perspective
Understanding and Improving Visual Prompting: A Label-Mapping Perspective
 
signSGD via Zeroth-Order Oracle
signSGD via Zeroth-Order Oracle
 
Compressed Decentralized Proximal Stochastic Gradient Method for Nonconvex Composite Problems with Heterogeneous Data
Compressed Decentralized Proximal Stochastic Gradient Method for Nonconvex Composite Problems with Heterogeneous Data
 
Attacking Visual Language Grounding with Adversarial Examples: A Case Study on Neural Image Captioning.
Attacking Visual Language Grounding with Adversarial Examples: A Case Study on Neural Image Captioning.
 
Attacking Visual Language Grounding with Adversarial Examples: A Case Study on Neural Image Captioning
Attacking Visual Language Grounding with Adversarial Examples: A Case Study on Neural Image Captioning
 
Topology Attack and Defense for Graph Neural Networks: An Optimization Perspective
Topology Attack and Defense for Graph Neural Networks: An Optimization Perspective
 
Adversarial T-shirt! Evading Person Detectors in A Physical World
Adversarial T-shirt! Evading Person Detectors in A Physical World
 
AutoZOOM: Autoencoder-based Zeroth Order Optimization Method for Attacking Black-box Neural Networks
AutoZOOM: Autoencoder-based Zeroth Order Optimization Method for Attacking Black-box Neural Networks
 
Mod-Squad: Designing Mixtures of Experts As Modular Multi-Task Learners
Mod-Squad: Designing Mixtures of Experts As Modular Multi-Task Learners
 
Fast Training of Provably Robust Neural Networks by SingleProp
Fast Training of Provably Robust Neural Networks by SingleProp
 
Towards Certificated Model Robustness Against Weight Perturbations
Towards Certificated Model Robustness Against Weight Perturbations
 
S3-NeRF: Neural Reflectance Field from Shading and Shadow under a Single Viewpoint
S3-NeRF: Neural Reflectance Field from Shading and Shadow under a Single Viewpoint
 
Structured Adversarial Attack: Towards General Implementation and Better Interpretability
Structured Adversarial Attack: Towards General Implementation and Better Interpretability
 
On the Design of Black-box Adversarial Examples by Leveraging Gradient-free Optimization and Operator Splitting Method
On the Design of Black-box Adversarial Examples by Leveraging Gradient-free Optimization and Operator Splitting Method
 
Is There a Trade-Off Between Fairness and Accuracy? A Perspective Using Mismatched Hypothesis Testing
Is There a Trade-Off Between Fairness and Accuracy? A Perspective Using Mismatched Hypothesis Testing
 
Fast Incremental von Neumann Graph Entropy Computation: Theory, Algorithm, and Applications
Fast Incremental von Neumann Graph Entropy Computation: Theory, Algorithm, and Applications
 
Hidden Cost of Randomized Smoothing
Hidden Cost of Randomized Smoothing
 
Self-Progressing Robust Training
Self-Progressing Robust Training
 
Practical Detection of Trojan Neural Networks: Data-Limited and Data-Free Cases
Practical Detection of Trojan Neural Networks: Data-Limited and Data-Free Cases
 
Fast Learning of Graph Neural Networks with Guaranteed Generalizability: One-hidden-layer Case
Fast Learning of Graph Neural Networks with Guaranteed Generalizability: One-hidden-layer Case
 
On Fast Adversarial Robustness Adaptation in Model-Agnostic Meta-Learning
On Fast Adversarial Robustness Adaptation in Model-Agnostic Meta-Learning
 
CNN-Cert: An Efficient Framework for Certifying Robustness of Convolutional Neural Networks
CNN-Cert: An Efficient Framework for Certifying Robustness of Convolutional Neural Networks
 
Proper Network Interpretability Helps Adversarial Robustness in Classification
Proper Network Interpretability Helps Adversarial Robustness in Classification
 
When does Contrastive Learning Preserve Adversarial Robustness from Pretraining to Finetuning?
When does Contrastive Learning Preserve Adversarial Robustness from Pretraining to Finetuning?
 
Zeroth-Order Stochastic Variance Reduction for Nonconvex Optimization
Zeroth-Order Stochastic Variance Reduction for Nonconvex Optimization
 
Evaluating the Robustness of Neural Networks: An Extreme Value Theory Approach
Evaluating the Robustness of Neural Networks: An Extreme Value Theory Approach
 
Higher-Order Certification For Randomized Smoothing
Higher-Order Certification For Randomized Smoothing
 
Joint Edge-Model Sparse Learning is Provably Efficient for Graph Neural Networks
Joint Edge-Model Sparse Learning is Provably Efficient for Graph Neural Networks
 
A Theoretical Understanding of Shallow Vision Transformers: Learning, Generalization, and Sample Complexity
A Theoretical Understanding of Shallow Vision Transformers: Learning, Generalization, and Sample Complexity
 
Efficient Neural Network Robustness Certification with General Activation Functions
Efficient Neural Network Robustness Certification with General Activation Functions
 
Adversarial Attack Generation Empowered by Min-Max Optimization
Adversarial Attack Generation Empowered by Min-Max Optimization
 
Neural Network Robustness Certification with General Activation Functions
Neural Network Robustness Certification with General Activation Functions
 
How unlabeled data improve generalization in self-training? A one-hidden-layer theoretical analysis
How unlabeled data improve generalization in self-training? A one-hidden-layer theoretical analysis
 
PROVEN: Verifying Robustness of Neural Networks with a Probabilistic Approach
PROVEN: Verifying Robustness of Neural Networks with a Probabilistic Approach
 
Why Lottery Ticket Wins? A Theoretical Perspective of Sample Complexity on Sparse Neural Networks
Why Lottery Ticket Wins? A Theoretical Perspective of Sample Complexity on Sparse Neural Networks
 
Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention
Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention
 
Masked Motion Encoding for Self-Supervised Video Representation Learning
Masked Motion Encoding for Self-Supervised Video Representation Learning
 
On-Device Training Under 256KB Memory
On-Device Training Under 256KB Memory
 
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer
 
A Gromov-Wasserstein Geometric View of Spectrum-Preserving Graph Coarsening
A Gromov-Wasserstein Geometric View of Spectrum-Preserving Graph Coarsening
 
Topological Experience Replay
Topological Experience Replay
 
Multimodal Clustering Networks for Self-Supervised Learning From Unlabeled Videos
Multimodal Clustering Networks for Self-Supervised Learning From Unlabeled Videos
 
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
 
Adversarial Robustness: From Self-Supervised Pre-Training to Fine-Tuning
Adversarial Robustness: From Self-Supervised Pre-Training to Fine-Tuning
 
Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition
Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition
 
The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models
The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models
 
Location-aware Graph Convolutional Networks for Video Question Answering
Location-aware Graph Convolutional Networks for Video Question Answering
 
Experiences and Insights for Collaborative Industry-Academic Research in Artificial Intelligence
Experiences and Insights for Collaborative Industry-Academic Research in Artificial Intelligence
 
A Sequential Set Generation Method for Predicting Set-Valued Outputs
A Sequential Set Generation Method for Predicting Set-Valued Outputs
 
Training Stronger Baselines for Learning to Optimize
Training Stronger Baselines for Learning to Optimize
 
MCUNet: Tiny Deep Learning on IoT Devices
MCUNet: Tiny Deep Learning on IoT Devices
 
Towards Verifying Robustness of Neural Networks against Semantic Perturbations
Towards Verifying Robustness of Neural Networks against Semantic Perturbations
 
Rate-improved inexact augmented Lagrangian method for constrained nonconvex optimization
Rate-improved inexact augmented Lagrangian method for constrained nonconvex optimization
 
Revisiting Contrastive Learning through the Lens of Neighborhood Component Analysis: an Integrated Framework
Revisiting Contrastive Learning through the Lens of Neighborhood Component Analysis: an Integrated Framework
 
Detector-Free Weakly Supervised Grounding by Separation
Detector-Free Weakly Supervised Grounding by Separation
 
Data-Efficient Double-Win Lottery Tickets from Robust Pre-training
Data-Efficient Double-Win Lottery Tickets from Robust Pre-training
 
Procedural Image Programs for Representation Learning
Procedural Image Programs for Representation Learning
 
SNAKE: Shape-aware Neural 3D Keypoint Field
SNAKE: Shape-aware Neural 3D Keypoint Field
 
A Broad Study on the Transferability of Visual Representations With Contrastive Learning
A Broad Study on the Transferability of Visual Representations With Contrastive Learning
 
Learning Active Camera for Multi-Object Navigation
Learning Active Camera for Multi-Object Navigation
 
Dense Regression Network for Video Grounding
Dense Regression Network for Video Grounding
 
VALHALLA: Visual Hallucination for Machine Translation
VALHALLA: Visual Hallucination for Machine Translation
 
Task2Sim: Towards Effective Pre-training and Transfer from Synthetic Data
Task2Sim: Towards Effective Pre-training and Transfer from Synthetic Data
 
LitePose: Efficient Architecture Design for 2D Human Pose Estimation
LitePose: Efficient Architecture Design for 2D Human Pose Estimation
 
Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation
Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation
 
Optimizer Amalgamation
Optimizer Amalgamation
 
ComPhy: Compositional Physical Reasoning of Objects and Events from Videos
ComPhy: Compositional Physical Reasoning of Objects and Events from Videos
 
Planning with Large Language Models for Code Generation
Planning with Large Language Models for Code Generation
 
Discrete Graph Structure Learning for Forecasting Multiple Time Series
Discrete Graph Structure Learning for Forecasting Multiple Time Series
 
Min-Max Optimization without Gradients: Convergence and Applications to Black-Box Evasion and Poisoning Attacks
Min-Max Optimization without Gradients: Convergence and Applications to Black-Box Evasion and Poisoning Attacks
 
DAG-GNN: DAG Structure Learning with Graph Neural Networks
DAG-GNN: DAG Structure Learning with Graph Neural Networks
 
ZO-AdaMM: Zeroth-Order Adaptive Momentum Method for Black-Box Optimization
ZO-AdaMM: Zeroth-Order Adaptive Momentum Method for Black-Box Optimization
 
CAG: A Real-Time Low-Cost Enhanced-Robustness High-Transferability Content-Aware Adversarial Attack Generator
CAG: A Real-Time Low-Cost Enhanced-Robustness High-Transferability Content-Aware Adversarial Attack Generator
 
Proximal Stochastic Recursive Momentum Methods for Nonconvex Composite Decentralized Optimization
Proximal Stochastic Recursive Momentum Methods for Nonconvex Composite Decentralized Optimization
 
Redeeming Intrinsic Rewards via Constrained Optimization
Redeeming Intrinsic Rewards via Constrained Optimization
 
DexDeform: Dexterous Deformable Object Manipulation with Human Demonstrations and Differentiable Physics
DexDeform: Dexterous Deformable Object Manipulation with Human Demonstrations and Differentiable Physics
 
PAC-NeRF: Physics Augmented Continuum Neural Radiance Fields for Geometry-Agnostic System Identification
PAC-NeRF: Physics Augmented Continuum Neural Radiance Fields for Geometry-Agnostic System Identification
 
Graph Neural Network-Inspired Kernels for Gaussian Processes in Semi-Supervised Learning
Graph Neural Network-Inspired Kernels for Gaussian Processes in Semi-Supervised Learning
 
3D Concept Learning and Reasoning from Multi-View Images
3D Concept Learning and Reasoning from Multi-View Images
 
Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval
Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval
 
GC-Flow: A Graph-Based Flow Network for Effective Clustering
GC-Flow: A Graph-Based Flow Network for Effective Clustering
 
Hierarchical Grammar-Induced Geometry for Data-Efficient Molecular Property Prediction
Hierarchical Grammar-Induced Geometry for Data-Efficient Molecular Property Prediction
 
Advancing Model Pruning via Bi-level Optimization
Advancing Model Pruning via Bi-level Optimization
 
EvolveGCN: Evolving Graph Convolutional Networks for Dynamic Graphs
EvolveGCN: Evolving Graph Convolutional Networks for Dynamic Graphs
 
Exponentially Improving the Complexity of Simulating the Weisfeiler-Lehman Test with Graph Neural Networks
Exponentially Improving the Complexity of Simulating the Weisfeiler-Lehman Test with Graph Neural Networks
 
RT3D: Achieving Real-Time Execution of 3D Convolutional Neural Networks on Mobile Devices
RT3D: Achieving Real-Time Execution of 3D Convolutional Neural Networks on Mobile Devices
 
Constrained Generation of Semantically Valid Graphs via Regularizing Variational Autoencoders
Constrained Generation of Semantically Valid Graphs via Regularizing Variational Autoencoders
 
Scalable Graph Learning for Anti-Money Laundering: A First Look
Scalable Graph Learning for Anti-Money Laundering: A First Look
 
On the Convergence of A Class of Adam-Type Algorithms for Non-Convex Optimization
On the Convergence of A Class of Adam-Type Algorithms for Non-Convex Optimization
 
Anti-Money Laundering in Bitcoin: Experimenting with Graph Convolutional Networks for Financial Forensics
Anti-Money Laundering in Bitcoin: Experimenting with Graph Convolutional Networks for Financial Forensics
 
Self-supervised Moving Vehicle Tracking with Stereo Sound
Self-supervised Moving Vehicle Tracking with Stereo Sound
 
Big-Little-Video-Net: Work smarter, not harder, for video understanding
Big-Little-Video-Net: Work smarter, not harder, for video understanding
 
ZO-AdaMM: Derivative-free optimization for black-box problems
ZO-AdaMM: Derivative-free optimization for black-box problems
 
Deep Symbolic Superoptimization Without Human Knowledge
Deep Symbolic Superoptimization Without Human Knowledge
 
The Lottery Ticket Hypothesis for the Pre-trained BERT Networks
The Lottery Ticket Hypothesis for the Pre-trained BERT Networks
 
AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition
AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition
 
RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning.
RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning.
 
Grounding Physical Concepts of Objects and Events Through Dynamic Visual Reasoning
Grounding Physical Concepts of Objects and Events Through Dynamic Visual Reasoning
 
Robust Overfitting may be mitigated by properly learned smoothening
Robust Overfitting may be mitigated by properly learned smoothening
 
Debiased Contrastive Learning
Debiased Contrastive Learning
 
Foley Music: Learning to Generate Music from Videos
Foley Music: Learning to Generate Music from Videos
 
CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks
CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks
 
Dynamic Video Quantization for Efficient Inference
Dynamic Video Quantization for Efficient Inference
 
NASTransfer: Analyzing Architecture Transferability in Large Scale Neural Architecture Search
NASTransfer: Analyzing Architecture Transferability in Large Scale Neural Architecture Search
 
STAR: A Benchmark for Situated Reasoning in Real-World Videos
STAR: A Benchmark for Situated Reasoning in Real-World Videos
 
Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language
Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language
 
Memory-efficient Patch-based Inference for Tiny Deep Learning
Memory-efficient Patch-based Inference for Tiny Deep Learning
 
Everything at Once – Multi-modal Fusion Transformer for Video Retrieval
Everything at Once – Multi-modal Fusion Transformer for Video Retrieval
 
On the Equivalence between Neural Network and Support Vector Machine
On the Equivalence between Neural Network and Support Vector Machine
 
Dynamic Distillation Network for Cross-Domain Few-Shot Recognition with Unlabeled Data
Dynamic Distillation Network for Cross-Domain Few-Shot Recognition with Unlabeled Data
 
Graph Universal Adversarial Attacks: A Few Bad Actors Ruin Graph Learning Models
Graph Universal Adversarial Attacks: A Few Bad Actors Ruin Graph Learning Models
 
Unsupervised Learning of Graph Hierarchical Abstractions with Differentiable Coarsening and Optimal Transport
Unsupervised Learning of Graph Hierarchical Abstractions with Differentiable Coarsening and Optimal Transport
 
An Image Enhancing Pattern-based Sparsity for Real-time Inference on Mobile Devices
An Image Enhancing Pattern-based Sparsity for Real-time Inference on Mobile Devices
 
Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning
Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning
 
Embedding Compression with Isotropic Iterative Quantization
Embedding Compression with Isotropic Iterative Quantization
 
Online AI planning with graph neural networks and adaptive scheduling
Online AI planning with graph neural networks and adaptive scheduling
 
Learning to learn with distributional signatures for text data
Learning to learn with distributional signatures for text data
 
Fast and efficient black-box testing for AI cybersecurity
Fast and efficient black-box testing for AI cybersecurity
 
Graph-Augmented Normalizing Flows for Anomaly Detection of Multiple Time Series
Graph-Augmented Normalizing Flows for Anomaly Detection of Multiple Time Series
 
RegionViT: Regional-to-Local Attention for Vision Transformers
RegionViT: Regional-to-Local Attention for Vision Transformers
 
Can an Image Classifier Suffice For Action Recognition?
Can an Image Classifier Suffice For Action Recognition?
 
Data-Efficient Graph Grammar Learning for Molecular Generation
Data-Efficient Graph Grammar Learning for Molecular Generation
 
Directed Acyclic Graph Neural Networks
Directed Acyclic Graph Neural Networks