MiME: Multilevel Medical Embedding of Electronic Health Records for Predictive Healthcare
Authors
Authors
- Edward Choi
- Cao Xiao
- Walter F. Stewart
- Jimeng Sun
MiME: Multilevel Medical Embedding of Electronic Health Records for Predictive Healthcare
Authors
- Edward Choi
- Cao Xiao
- Walter F. Stewart
- Jimeng Sun
Published on
10/22/2018
Categories
Deep learning models exhibit state-of-the-art performance for many predictive healthcare tasks using electronic health records (EHR) data, but these models typically require training data volume that exceeds the capacity of most healthcare systems. External resources such as medical ontologies are used to bridge the data volume constraint, but this approach is often not directly applicable or useful because of inconsistencies with terminology. To solve the data insufficiency challenge, we leverage the inherent multilevel structure of EHR data and, in particular, the encoded relationships among medical codes. We propose Multilevel Medical Embedding (MiME) which learns the multilevel embedding of EHR data while jointly performing auxiliary prediction tasks that rely on this inherent EHR structure without the need for external labels. We conducted two prediction tasks, heart failure prediction and sequential disease prediction, where MiME outperformed baseline methods in diverse evaluation settings. In particular, MiME consistently outperformed all baselines when predicting heart failure on datasets of different volumes, especially demonstrating the greatest performance improvement (15% relative gain in PR-AUC over the best baseline) on the smallest dataset, demonstrating its ability to effectively model the multilevel structure of EHR data.
Please cite our work using the BibTeX below.
@article{DBLP:journals/corr/abs-1810-09593,
author = {Edward Choi and
Cao Xiao and
Walter F. Stewart and
Jimeng Sun},
title = {MiME: Multilevel Medical Embedding of Electronic Health Records for
Predictive Healthcare},
journal = {CoRR},
volume = {abs/1810.09593},
year = {2018},
url = {http://arxiv.org/abs/1810.09593},
archivePrefix = {arXiv},
eprint = {1810.09593},
timestamp = {Wed, 31 Oct 2018 14:24:29 +0100},
biburl = {https://dblp.org/rec/journals/corr/abs-1810-09593.bib},
bibsource = {dblp computer science bibliography, https://dblp.org}
}