Joshua Tenenbaum

Co-Scientific Director, MIT Quest for Intelligence; Professor, Brain and Cognitive Sciences; MacArthur Fellow

Joshua Tenenbaum is a professor of computational cognitive science in MIT’s Department of Brain and Cognitive Sciences and a co-scientific director of the MIT Quest for Intelligence. He is also an investigator at the Center for Brains, Minds and Machines and the Computer Science and Artificial Intelligence Laboratory. Tenenbaum’s research straddles cognitive science and artificial intelligence, with two goals: to reverse engineer human intelligence and to build machines that behave in human-like ways and are more useful to society. Through a combination of mathematical modeling, computer simulation, and behavioral experiments, Tenenbaum tries to uncover the logic behind our everyday inductive leaps: constructing perceptual representations, separating “style” and “content” in perception, learning concepts and words, judging similarity or representativeness, inferring causal connections, noticing coincidences, and predicting the future. Tenenbaum is a MacArthur Fellow and has received the National Academy of Sciences’ Troland Research Award. He earned a BA from Yale University and a PhD in brain and cognitive sciences from MIT.

Top Work

The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences From Natural Supervision

Neuro-Symbolic AI

Publications with the MIT-IBM Watson AI Lab

Learning Rational Subgoals from Demonstrations and Instructions
 
Predicate Invention for Bilevel Planning
 
3D Concept Learning and Reasoning from Multi-View Images
 
Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention
 
Is conditional generative modeling all you need for decision-making?
 
SoftZoo: A Soft Robot Co-design Benchmark For Locomotion In Diverse Environments
 
DexDeform: Dexterous Deformable Object Manipulation with Human Demonstrations and Differentiable Physics
 
Planning with Large Language Models for Code Generation
 
Zero-shot linear combinations of grounded social interactions with Linear Social MDPs
 
3D Concept Grounding on Neural Fields
 
Learning Neural Acoustic Fields
 
Learning Physical Dynamics with Subequivariant Graph Neural Networks
 
Music Gesture for Visual Sound Separation
 
Prompting Decision Transformer for Few-shot Policy Generalization
 
Finding Fallen Objects Via Asynchronous Audio-Visual Integration
 
Fixing Malfunctional Objects With Learned Physical Simulation and Functional Prediction
 
RISP: Rendering-Invariant State Predictor with Differentiable Simulation and Rendering for Cross-Domain Parameter Estimation
 
Linking Emergent and Natural Languages via Corpus Transfer
 
Contact Points Discovery for Soft-Body Manipulations with Differentiable Physics
 
DiffSkill: Skill Abstraction from Differentiable Physics for Deformable Object Manipulations with Tools
 
Planning with Learned Object Importance in Large Problem Instances using Graph Neural Networks
 
GLIB: Efficient Exploration for Relational Model-Based Reinforcement Learning via Goal-Literal Babbling
 
Learning Symbolic Operators for Task and Motion Planning
 
Few-Shot Bayesian Imitation Learning with Logical Program Policies
 
Temporal and Object Quantification Networks
 
Discovering State and Action Abstractions for Generalized Task and Motion Planning
 
A large-scale benchmark for few-shot program induction and synthesis
 
A Bayesian-Symbolic Approach to Reasoning and Learning in Intuitive Physics
 
Grammar-Based Grounded Lexicon Learning
 
Noether networks: meta-learning useful conserved quantities
 
STAR: A Benchmark for Situated Reasoning in Real-World Videos
 
ThreeDWorld: A Platform for Interactive Multi-Modal Physical Simulation
 
3DP3: 3D Scene Perception via Probabilistic Programming
 
Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language
 
PTR: A Benchmark for Part-based Conceptual, Relational, and Physical Reasoning
 
FALCON: Fast Visual Concept Learning by Integrating Images, Linguistic descriptions, and Conceptual Relations
 
Leveraging Language to Learn Program Abstractions and Search Heuristics
 
Learning Physical Graph Representations from Visual Scenes
 
Online Bayesian Goal Inference for Boundedly-Rational Planning Agents
 
Foley Music: Learning to Generate Music from Videos
 
CLEVRER: The first video dataset for neuro-symbolic reasoning
 
Deep Audio Priors Emerge From Harmonic Convolutional Networks
 
ObjectNet: A bias-controlled dataset for object recognition
 
ObjectNet: A large-scale bias-controlled dataset for pushing the limits of object recognition models
 
Visual Concept-Metaconcept Learning
 
Write, Execute, Assess: Program Synthesis with a REPL
 
The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences From Natural Supervision
 
Learning Libraries of Subroutines for Neurally–Guided Bayesian Program Induction
 
GAN Dissection: Visualizing and Understanding Generative Adversarial Networks
 
Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding