Data science has revolutionized industry and academic fields including marketing, 1 astronomy, 2 and computer vision. 3 It has not yet impacted medicine and biomedical research to the same degree. However, many observers 4-6 believe that data science will improve the ability of health care systems to deliver personalized medicine, 7 population health, 8 and public health. 9 The US National Institutes of Health (NIH) Strategic Plan for Data Science defines it as "the interdisciplinary field of inquiry in which quantitative and analytical approaches, processes, and systems are developed and used to extract knowledge and insights from increasingly large and/or complex sets of data." 10 One interpretation of this definition is that data science is possible because of relatively cheap data storage and computing, efficient algorithms and computational tools for complex tasks such as optimizing neural network models and analyzing whole exome sequence (WES) data, and the availability of enormous amounts of data. This includes biological (eg, genomic, proteomic, and metabolomic), clinical (eg, laboratory results, digital images, text notes, and monitor signals), lifestyle (mobile sensor output, interactions with applications), and other data with which to train and evaluate models.