Shiyu Chang

Research Staff Member

Shiyu Chang is a research scientist at the MIT-IBM Watson AI Lab, working closely with Prof. Regina Barzilay and Prof. Tommi S. Jaakkola. His research focuses on machine learning and its applications in natural language processing and computer vision.

Most recently, he has been studying how machine predictions can be made more interpretable to humans, and how human intuition and rationalization can improve AI transferability, data efficiency, and adversarial robustness.

Prior to his current position, Shiyu was a research scientist at the IBM T.J. Watson Research Center. He received his B.S. and Ph.D. from the University of Illinois at Urbana-Champaign, where his Ph.D. advisor was Prof. Thomas S. Huang.

Some words that keep me moving forward:

“A job well done is its own reward. You take pride in the things you do, not for others to see, not for the respect, or glory, or any other rewards it might bring. You take pride in what you do, because you’re doing your best. If you believe in something, you stick with it. When things get difficult, you try harder.”

Top Work

Class-wise rationalization: teaching AI to weigh pros and cons

Natural Language Processing

Publications with the MIT-IBM Watson AI Lab

PromptBoosting: Black-Box Text Classification with Ten Forward Passes

Towards Coherent Image Inpainting Using Denoising Diffusion Implicit Models

Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models

TextGrad: Advancing Robustness Evaluation in NLP by Gradient-Driven Optimization

Fairness Reprogramming

An Adversarial Framework for Generating Unseen Images by Activation Maximization

CONTENTVEC: An Improved Self-Supervised Speech Representation by Disentangling Speakers

Data-Efficient Double-Win Lottery Tickets from Robust Pre-training

Adversarial Support Alignment

Optimizer Amalgamation

How to Robustify Black-Box ML Models? A Zeroth-Order Optimization Perspective

PARP: Prune Once, Adjust and Re-Prune for Self-Supervised Speech Recognition

Understanding Interlocking Dynamics of Cooperative Rationalization

TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up

Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning

Robust Overfitting may be mitigated by properly learned smoothening

Generating Adversarial Computer Programs using Optimized Obfuscations

The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models

Self-Progressing Robust Training

Complementary Evidence Identification in Open-Domain Question Answering

Global Prosody Style Transfer Without Text Transcriptions

Interactive Fiction Game Playing as Multi-Paragraph Reading Comprehension with Reinforcement Learning

Training Stronger Baselines for Learning to Optimize

The Lottery Ticket Hypothesis for the Pre-trained BERT Networks

Invariant Rationalization

Unsupervised Speech Decomposition via Triple Information Bottleneck

Proper Network Interpretability Helps Adversarial Robustness in Classification

Adversarial Robustness: From Self-Supervised Pre-Training to Fine-Tuning

Learning to learn with distributional signatures for text data

A Game Theoretic Approach to Class-wise Selective Rationalization

Tight Certificates of Adversarial Robustness for Randomly Smoothed Classifiers

Improving Question Answering over Incomplete KBs with Knowledge-Aware Reader

Extracting Multiple-Relations in One-Pass with Pre-Trained Transformers

Self-Supervised Learning for Contextualized Extractive Summarization

TWEETQA: A Social Media Focused Question Answering Dataset

Selection Bias Explorations and Debias Methods for Natural Language Sentence Matching Datasets

Context-Aware Conversation Thread Detection in Multi-Party Chat

Rethinking Cooperative Rationalization: Introspective Extraction and Complement Control

Out-of-Domain Detection for Low-Resource Text Classification Tasks

AutoGAN: Neural Architecture Search for Generative Adversarial Networks

Coupled Variational Recurrent Collaborative Filtering

Additive Adversarial Learning for Unbiased Authentication

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

Imposing Label-Relational Inductive Bias for Extremely Fine-Grained Entity Typing

Tight Certificates of Adversarial Robustness

Deriving Machine Attention from Human Rationales

Zeroth-Order Stochastic Variance Reduction for Nonconvex Optimization