James Glass
Senior Research Scientist, Computer Science and Artificial Intelligence Laboratory

Who they work with
Categories
James Glass is a senior research scientist and heads the Spoken Language Systems Group in MIT’s Computer Science and Artificial Intelligence Laboratory. He is also a faculty member of the Harvard-MIT Division of Health Sciences and Technology program. His research focuses on automatic speech recognition, unsupervised speech processing, and spoken language understanding. His group is focused on finding answers to three questions: who is talking; what is said; and what is meant. The first area focuses on paralinguistic issues like speaker verification, language and dialect identification, and speaker diarization, or who spoke when. The group is also analyzing health markers embedded in speech, an area that addresses speech recognition capabilities and challenges related to noise robustness, limited linguistic resources, and unsupervised language acquisition. The third and final area focuses on the boundary between speech and natural language processing, and includes topics related to speech understanding such as sentiment analysis and dialogue. Some research also focuses on the user-generated text in social forums.
Glass is a fellow of the Institute of Electrical and Electronics Engineers (IEEE) and the International Speech Communication Association, and is currently an associate editor for the IEEE Transactions on Pattern Analysis and Machine Intelligence. He earned an MS and PhD in electrical engineering and computer science at MIT.
Publications
- Lai, C-I. J., Zhang, Y., Liu, A. H., Chang, S., Liao, Y-L., Chuang, Y-S., Qian, K., Khurana, S., Cox, D., Glass, J. (2021). PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition. Conference on Neural Information Processing Systems (NeurIPS).
- Baly, R., Karadzhov, G., Saleh, A., Glass, J., and Nakov, P. (2019). Multi-Task Ordinal Regression for Jointly Predicting the Trustworthiness and the Leading Political Ideology of News Media. Proc. NAACL-HLT, Minneapolis.
- Nadeem, M., Fang, W., Xu, B., Mohtarami, M., and Glass, J. (2019). FAKTA: An Automatic End-to-End Fact Checking System, Proc. NAACL-HLT, Minneapolis.
- Harwath, D. and Glass, J. (2019). Towards Visually Grounded Sub-Word Speech Unit Discovery. Proc. ICASSP, Brighton.
- Boggust, A., Audhkhasi, K., Joshi, D., Harwath, D., Thomas, S., Feris, R., Gutfreund, D., Zhang, Y., Torralba, A., Picheny, M., Glass, J. (2019). Grounding Spoken Words in Unlabeled Video. Conference on Computer Vision and Pattern Recognition.
Media
- November 4, 2021: MIT News, Toward speech recognition for uncommon spoken languages.
- February 21, 2019: MIT News, Exploring the nature of intelligence.
- October 4, 2018: MIT News, Detecting fake news at its source.
- September 18, 2018: MIT News, Machine-learning system tackles speech and object recognition, all at once.
- Augugust 29, 2018: MIT News, Model can more naturally detect depression in conversations.