Abstracts
2024 Annual Symposium on Risks and Opportunities of AI in Pharmaceutical Medicine
Day 2 Plenary Session 1
Title:
Empowering Insights in the All of Us Research Program: A Statistical Perspective on the Transformational Role of AI and ML, Qingxia ‘Cindy’ Chen, Biomedical Informatics, and Ophthalmology & Visual Sciences at Vanderbilt University Medical Center
Abstract:
The All of Us Research Program is building a diverse cohort of a million or more Americans engaged longitudinally in data sharing including biospecimens, electronic health records, surveys, and digital health technology to advance precision medicine research and fuel new insights into human health. This data is at the fingertips of researchers through the Researcher Workbench—a cloud-based analytical platform equipped with graphical interface tools and bolstered by Jupyter notebooks. Underpinned by the FAIR principles, these resources render data and discoveries Findable, Accessible, Interoperable, and Reproducible in the domain of biomedical research. The All of UsResearch Program leverages cutting-edge AI and ML methods. These endeavors include devising models to enhance missing data with external resources and various data types, forecasting participant engagement, and refining polygenetic risk scores for underrepresented populations within biomedicine. At its core, the All of Us Researcher Workbench stands as a dynamic hub, fostering methodological innovation and catalyzing outcome research.
Title:
“The Generalist Medical AI Will See You Now”, Pranav Rajpurkar, Ph.D., Harvard University
Abstract:
Accurate interpretation of medical images is crucial for disease diagnosis and treatment, and AI has the potential to minimize errors, reduce delays, and improve accessibility. The focal point of this presentation lies in a grand ambition: the development of ‘Generalist Medical AI’ systems that can closely resemble doctors in their ability to reason through a wide range of medical tasks, incorporate multiple data modalities, and communicate in natural language. Starting with pioneering algorithms that have already demonstrated their potential in diagnosing diseases from chest X-rays or electrocardiograms, matching the proficiency of expert radiologists and cardiologists, I will delve into the core challenges and advancements in the field. The discussion will navigate towards the topic of label-efficient AI models: with a scarcity of meticulously annotated data in healthcare, the development of AI systems capable of learning effectively from limited labels has become a key concern. In this vein, I’ll delve into how the innovative use of self-supervision and pre-training methods has led to algorithmic advancements that can perform high-level diagnostic tasks using significantly less annotated data. Additionally, I will talk about initiatives in data curation, human-AI collaboration, and the creation of open benchmarks to evaluate the generalizability of medical AI algorithms. In summary, this talk aims to deliver a comprehensive picture of the state of ‘Generalist Medical AI,’ the advancements made, the challenges faced, and the prospects lying ahead.
Title:
“Generative AI for Case Adjudication in OHDSI”, Marc Suchard, UCLA
Abstract:
Observational Health Data Science and Informatics (OHDSI) aims to improve health by empowering an open-science community to collaboratively generate the evidence that promotes better health decisions and better care. OHDSI is actively exploring ways in which large language models (LLMs) can help deliver this aim through the Assessment of Pre-trained Observational Large Longitudinal models in OHDSI (APOLLO) project. This talk highlights two APOLLO directions. First, medical case validation stands as a necessary element of regulatory-grade evidence, but currently requires human adjudication that is time and resource-intensive. APOLLO provides open-source development of LLMs to improve the reliability and scalability of case validation. Across 10 diseases in 2 different data sources, we find substantial heterogeneity in agreement between human reviewers and that LLMs agree with humans as much as humans agree with each other. Second, APOLLO explores the use of LLMs pre-trained to the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) data sources to perform patient-level prediction, where a pre-trained model may prove more accurate with less training data than current non-pre-trained models, missing value imputation, almost identical to a bidirectional pre-training task, and counterfactual prediction. In this latter task, the LLM provides a prediction of what happens to a patient in the future given both treatment options. We suspect further applications will continue to emerge across the OHDSI community.
Day 2 Plenary Session 2
Title:
“Generative Mixed-Response State-Space Model for Analyzing Multi-Dimensional Digital Phenotypes”, Yuanjia Wang, Columbia University
Abstract:
Digital technologies (e.g., mobile phones) can be used to obtain objective, frequent, and real-world digital phenotypes from individuals. However, modeling these data poses substantial challenges since observational data are subject to confounding and various sources of variabilities. For example, signals on patients’ underlying health status and treatment effects are mixed with variation due to the living environment and measurement noises. The digital phenotype data thus shows extensive variabilities between and within patient as well as across different health domains (e.g., motor, cognitive, and speaking). Motivated by a mobile health study of Parkinson’s disease (PD), we develop a mixed-response state-space (MRSS) model to jointly capture multi-dimensional, multi-modal digital phenotypes and their measurement processes by a finite number of latent state time series. These latent states reflect the dynamic health status and personalized time-varying treatment effects and can be used to adjust for informative measurements. We conduct comprehensive simulation studies and demonstrate the advantage of MRSS in modeling a mobile health study that remotely collects real-time digital phenotypes from PD patients. We discuss extensions to deep latent state-space models for generating digital phenotype time-series data to learn optimal treatment strategies.
Title:
Abstract:
Title:
Abstract:
Day 2 Plenary Session 3
Title:
Abstract:
Title:
“The multitude of group affiliations: Algorithmic Fairness, Loss Minimization and Outcome Indistinguishability Omer Reingold, Stanford University
Abstract:
We will discuss a rather recent and very fruitful line of research in algorithmic fairness, coined multi-group fairness. We will focus on risk prediction, where a machine learning algorithm tries to learn a predictor to answer questions of the form “what is the probability that patient x will have a particular adverse reaction to a drug?” Training a risk predictor to minimize a loss function fixed in advance is the dominant paradigm in machine learning. However, global loss minimization may create predictions that are mis-calibrated on sub-populations, causing harm to individuals of these populations. Multi-group fairness tries to prevent forms of discrimination to a rich (possibly exponential) collection of arbitrarily intersecting groups. In a sense, it provides a computational perspective on the meaning of individual risks and the classical tension between clinical prediction, which uses individual-level traits, and actuarial prediction, which uses group-level traits. Surprisingly, it also provides unique robustness properties to the learned risk predictors, such as robustness to distributional shifts and to changing objectives. Multi-group fairness has found practical usages in health care, which we will discuss as well.
Based on a sequence of works joint with (subsets of) Cynthia Dwork, Shafi Goldwasser, Parikshit Gopalan, Úrsula Hébert-Johnson, Lunjia Hu, Adam Kalai, Christoph Kern, Michael P. Kim, Frauke Kreuter, Guy N. Rothblum, Vatsal Sharan, Udi Wieder, Gal Yona and others.
Modeling Covid-19 Immunological Reactions and Clinical Susceptibility in the Context of Long-Term Follow-up of a Prospective Cohort of Healthcare Workers, Noam Barda, Ben-Gurion University. (25 min)
Title:
Abstract:
Title: