Abstract
Alzheimers Disease and Related Dementias (ADRD) afflict almost 7 million people in the USA alone. The majority of research in ADRD is conducted using post-mortem samples of brain tissue or carefully recruited clinical trial patients. While these resources are excellent, they suffer from lack of sex/gender, and racial/ethnic inclusiveness. Electronic Health Records (EHR) data has the potential to bridge this gap by including real-world ADRD patients treated during routine clinical care. In this study, we utilize EHR data from a cohort of 70,420 ADRD patients diagnosed and treated at Penn Medicine. Our goal is to uncover important risk features leading to three types of Neuro-Degenerative Disorders (NDD), including Alzheimers Disease (AD), Parkinsons Disease (PD) and Other Dementias (OD). We employ a variety of Machine Learning (ML) Methods, including uni-variate and multivariate ML approaches and compare accuracies across the ML methods. We also investigate the types of features identified by each method, the overlapping features and the unique features to highlight important advantages and disadvantages of each approach specific for certain NDD types. Our study is important for those interested in studying ADRD and NDD in EHRs as it highlights the strengths and limitations of popular approaches employed in the ML community. We found that the uni-variate approach was able to uncover features that were important and rare for specific types of NDD (AD, PD, OD), which is important from a clinical perspective. Features that were found across all methods represent features that are the most robust.
| Original language | English |
|---|---|
| Title of host publication | Pacific Symposium on Biocomputing, PSB 2025 |
| Editors | Russ B. Altman, Lawrence Hunter, Marylyn D. Ritchie, Teri E. Klein |
| Publisher | World Scientific |
| Pages | 631-646 |
| Number of pages | 16 |
| ISBN (Electronic) | 9789819807017 |
| DOIs | |
| State | Published - 2025 |
| Event | 30th Pacific Symposium on Biocomputing, PSB 2025 - Kohala Cost, United States Duration: Jan 4 2025 → Jan 8 2025 |
Conference
| Conference | 30th Pacific Symposium on Biocomputing, PSB 2025 |
|---|---|
| Country/Territory | United States |
| City | Kohala Cost |
| Period | 01/4/25 → 01/8/25 |
Keywords
- Alzheimers Disease and Related Dementias
- Data Mining
- Electronic Health Records
- Machine Learning
Fingerprint
Dive into the research topics of 'Uncovering important diagnostic features for alzheimer s, parkinson s and other dementias using interpretable association mining methods'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver