Back to Journals » Vascular Health and Risk Management » Volume 22
Artificial Intelligence for Cardiovascular Risk Prediction: An Umbrella Review of Applications and Translational Challenges
Authors Parizad R, Hatwal J
, Brar A, Desai R, Batta A
, Mohan B
Received 20 December 2025
Accepted for publication 12 March 2026
Published 28 March 2026 Volume 2026:22 590502
DOI https://doi.org/10.2147/VHRM.S590502
Checked for plagiarism Yes
Review by Single anonymous peer review
Peer reviewer comments 4
Editor who approved publication: Professor Roland Asmar
Artificial Intelligence for Cardiovascular Risk Prediction – Video abstract [590502]
Views: 40
Razieh Parizad,1 Juniali Hatwal,2 Ajit Brar,3 Rupak Desai,4 Akash Batta,5 Bishav Mohan5
1Cardiovascular Research Center, Tabriz University of Medical Sciences, Tabriz, Iran; 2Department of Internal Medicine, Advanced Cardiac Centre, Post Graduate Institute of Medical Education & Research (PGIMER), Chandigarh, India; 3Department of Internal Medicine, Michigan State University at Hurley Medical Center, Flint, MI, USA; 4Independent Researcher, Outcomes Research, Atlanta, GA, USA; 5Department of Cardiology, Dayanand Medical College and Hospital (DMCH), Ludhiana, India
Correspondence: Akash Batta, Department of Cardiology, Dayanand Medical College and Hospital (DMCH), Ludhiana, India, Tel +91 9815496786, Email [email protected]
Background: Cardiovascular diseases (CVDs) remain the leading cause of mortality worldwide. Conventional risk prediction models often demonstrate suboptimal calibration and limited generalizability across populations. Artificial intelligence (AI) approaches, including machine learning (ML) and deep learning (DL), enable integration of multimodal clinical and imaging data for individualized cardiovascular risk estimation.
Objective: To evaluate the applications, predictive performance, and translational limitations of AI models for cardiovascular risk prediction within an umbrella review framework.
Methods: PubMed, Scopus, and Web of Science were systematically searched for studies published between January 2015 and October 2025 investigating AI-based prediction of cardiovascular outcomes. Eligible designs included randomized controlled trials (RCTs), cohort studies, systematic reviews, and meta-analyses. Predictive performance was the primary outcome, mainly assessed using the area under the receiver operating characteristic curve (AUC). Methodological quality was evaluated using established risk-of-bias tools. From 3500 identified records, 48 studies (8 RCTs, 28 cohort studies, and 12 systematic reviews or meta-analyses) were included in the final analysis.
Results: AI models achieved AUC values greater than 0.90 in more than 70% of imaging-based studies. Evidence synthesis showed predominant reliance on internal validation, inconsistent calibration reporting, and limited evaluation of algorithmic fairness. Multimodal data integration improved detection of coronary artery disease (CAD) and heart failure (HF). Wearable monitoring was associated with 18– 25% lower hospitalization rates compared with usual care.
Conclusion: AI improves predictive accuracy in cardiovascular risk assessment. Despite strong discrimination performance (AUC), methodological heterogeneity, insufficient calibration assessment, algorithmic bias, limited external validation, and regulatory uncertainty remain major barriers to implementation. Clinical translation requires multicenter RCTs, explainable AI frameworks, and standardized reporting guidelines such as Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis Artificial Intelligence (TRIPOD-AI).
Plain Language Summary: Cardiovascular diseases (CVDs) remain the leading cause of death worldwide, yet commonly used clinical risk prediction tools do not perform equally well across populations. This umbrella review shows that artificial intelligence (AI) has the potential to improve cardiovascular risk prediction.
By analyzing nearly fifty high-quality studies published over the past decade, we found that AI-based prediction models often outperform traditional risk scores in estimating future cardiovascular events. This umbrella review integrated evidence from original research studies and previously published systematic reviews while minimizing duplication of data. In many investigations, particularly those using cardiovascular imaging, AI models demonstrated substantially higher predictive accuracy. Studies combining multiple data sources, including electronic health records, imaging data, genetic information, and wearable device monitoring, demonstrated improved diagnostic performance coronary artery disease (CAD) and heart failure (HF). Continuous monitoring using wearable technologies was associated with a reduction in hospitalization rates in prospective comparisons with usual care.
Despite these promising findings, several challenges remain before AI can be routinely implemented in clinical practice. Variation in study design, potential algorithmic bias, and evolving regulatory requirements continue to limit widespread adoption. Overall, AI exhibits strong potential strong potential to support more personalized cardiovascular care; however, large prospective clinical trials and transparent reporting standards are necessary to confirm safety, fairness, and reliability before broad clinical integration.
Keywords: artificial intelligence, machine learning, deep learning, cardiovascular diseases, risk assessment, precision medicine
Introduction
Cardiovascular diseases (CVDs) remain the leading cause of mortality globally. Updated estimates from the Global Burden of Disease (GBD) collaboration reported approximately 20.5 million CVD-related deaths in 2023, with the majority occurring in low- and middle-income countries (LMICs) where diagnostic and therapeutic resources remain limited.1,2 In the United States (US), the American Heart Association (AHA) reports one CVD-related death every 33 seconds, corresponding to nearly 700,000 deaths annually and generating an economic burden exceeding USD 400 billion in direct and indirect healthcare costs.3 According to the European Society of Cardiology (ESC), CVD accounts for 47% of female and 39% of male deaths across Europe, with pronounced regional disparities favoring Western over Eastern regions due to differences in risk factor management and healthcare infrastructure.4 In Asia, projections indicate a 91.2% increase in CVD mortality between 2025 and 2050, largely driven by population aging and escalating metabolic risk factors.5 Moreover, a recent global report confirmed that cardiovascular deaths attributable to metabolic risk factors increased by more than 18% between 2010 and 2021, underscoring the urgent need for predictive and preventive strategies supported by advanced data-driven technologies.6
Conventional risk stratification tools, such as the Framingham Risk Score (FRS), estimate the 10-year probability of coronary events using a limited set of clinical variables, including age, systolic blood pressure (SBP), total cholesterol, and smoking status.7 In Europe, the Systematic COronary Risk Evaluation 2 (SCORE2) and its older-adult extension (SCORE2-OP) provide risk estimation for fatal and non-fatal cardiovascular events with improved calibration across European risk regions.8 However, LMICs account for over 80% of CVD-related deaths, primarily due to restricted access to diagnostic and therapeutic tools. Systematic evaluations indicate that conventional scores, including FRS and SCORE2, frequently exhibit miscalibration in non-Caucasian populations, overestimating risk in low-incidence groups and underestimating it in high-risk cohorts. These models also fail to incorporate emerging contributors such as physical inactivity, psychosocial stress, and novel biomarkers, limiting their generalizability and predictive precision.9 More recently, contemporary risk scores such as the Predicting Bleeding Complications in Patients Undergoing Stent Implantation–High Bleeding Risk (PRECISE-HBR) score, a well-validated seven-item bleeding risk prediction tool for patients undergoing percutaneous coronary intervention, have demonstrated improved discrimination compared with earlier bleeding risk models and are increasingly incorporated into data-driven algorithmic decision-support systems.10 Artificial intelligence (AI) approaches not only augment preventive strategies but also hold promise in acute cardiovascular care, overcoming limitations of conventional methods and providing complementary or integrative pathways to advance precision in cardiovascular risk assessment.11–13
AI techniques, including machine learning (ML) and deep learning (DL) algorithms, possess superior capacity to analyze high-dimensional, multimodal datasets derived from electronic health records (EHRs), cardiac imaging, genomic profiling, and wearable sensor data. By identifying nonlinear and latent relationships among diverse risk determinants, AI models have demonstrated predictive performance that exceeds conventional statistical frameworks.11,14 Meta-analytic evidence further indicates that AI-enhanced prediction systems achieve significantly higher discrimination for myocardial infarction (MI) and heart failure (HF) onset compared with guideline-endorsed risk scores.12
Despite these promising results, the clinical translation of AI-driven CVD prediction remains limited. Key challenges include heterogeneity of training datasets, algorithmic opacity, bias amplification in underrepresented populations, and uncertainties regarding regulatory frameworks governing data privacy, transparency, and model accountability.15 Current clinical guidelines, including the 2021 ESC Guidelines on cardiovascular disease prevention, acknowledge the emerging role of AI in risk prediction but emphasize the necessity of rigorous external validation, transparency, and regulatory oversight prior to routine clinical implementation.7 Accordingly, this umbrella review assesses the practical applications and limitations of AI in cardiovascular risk prediction, with a focus on clinical implementation barriers and translational challenges, such as under-evaluation of model calibration, external validity, and algorithmic fairness, rather than solely technical or theoretical performance. The review also proposes targeted strategies to overcome these barriers and to promote interdisciplinary collaboration for equitable, safe, and effective integration of AI into routine precision cardiology practice.
Methods
Search Strategy and Data Sources
A comprehensive systematic search was conducted to identify evidence on the application of AI in CVD prediction. Three major electronic databases (PubMed, Scopus, and Web of Science) were searched for studies published between January 2015 and October 2025 to capture recent advances in the field.
The search strategy combined Medical Subject Headings (MeSH) with free-text keywords, including artificial intelligence, machine learning, deep learning, cardiovascular risk prediction, predictive modeling, cardiac imaging analysis, wearable sensors, and precision cardiovascular care. Boolean operators (AND, OR, NOT) were applied to refine the search and improve retrieval accuracy.
Reference lists of included studies, relevant review articles, and American College of Cardiology (ACC) guideline documents were manually screened to identify additional sources. Non-peer-reviewed materials, including conference proceedings, preprints, and trial registry records, were excluded.
Eligibility Criteria
Studies were selected according to predefined inclusion and exclusion criteria aligned with the objectives of this systematic review.
Inclusion Criteria
Eligible studies were required to:
- Be published between January 2015 and October 2025.
- Apply AI techniques such as ML, DL, or natural language processing (NLP) to predict CVD risk, events, or outcomes.
- Report quantitative performance metrics, including the area under the receiver operating characteristic curve (AUC), sensitivity, specificity, or comparisons with conventional risk scores.
- Use robust study designs such as randomized controlled trials (RCTs), prospective cohort studies, or meta-analyses with verifiable primary data.
- For secondary evidence (systematic reviews and meta-analyses), include high-quality syntheses reporting pooled performance metrics; these were used only for contextual interpretation and not for quantitative pooling.
Exclusion Criteria
Studies were excluded if they:
- Were published in languages other than English.
- Examined AI applications unrelated to CVDs.
- Were editorials, letters, or abstracts without original data.
- Lacked accessible full text.
Study Selection Process
Two independent reviewers screened titles and abstracts to minimize selection bias. Disagreements were resolved through discussion or adjudication by a third reviewer. The selection process followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) 2020 guidelines to ensure transparency and reproducibility.16 The review protocol was not prospectively registered in the International Prospective Register of Systematic Reviews (PROSPERO); however, the review followed a predefined methodological framework to maintain transparency and reproducibility (Figure 1).
All retrieved records were exported to EndNote version 21 for reference management. After duplicate removal, full texts of potentially eligible studies were assessed. Ultimately, 48 studies met the inclusion criteria and were included in the final synthesis.
Quality Assessment
Methodological quality was evaluated using validated tools appropriate to study design. RCTs were assessed using the Cochrane Risk of Bias 2 (RoB 2) tool.17 Cohort studies were evaluated using the Newcastle–Ottawa Scale (NOS).18
All included studies were retained regardless of quality rating. Quality assessment informed interpretation through:
- Stratified reporting by risk-of-bias level (where feasible),
- Narrative discussion of methodological limitations,
- Cautious interpretation of findings from lower-quality studies.
Detailed quality appraisal results are presented in Supplementary Table S1.
Risk-of-Bias and Methodological Quality Assessment Summary
The majority of included RCTs were rated as having low risk of bias or some concerns using the Cochrane RoB 2 tool, primarily due to limitations in blinding of outcome assessment and incomplete reporting of model performance metrics. Cohort studies mostly received 7–9 stars on the NOS, indicating moderate-to-good methodological quality.
The quality assessment revealed three recurring critical limitations: (1) predominant reliance on internal validation with minimal external validation, (2) inconsistent or absent reporting of calibration metrics (despite frequent use of AUC), and (3) near absence of fairness assessment or subgroup analyses to detect algorithmic bias. These issues are discussed further in the narrative synthesis.
Data Extraction and Analysis
Data were extracted using a standardized form capturing study design, sample size, population characteristics, AI methodology, predictive performance metrics, and clinical outcomes.
Because of heterogeneity in populations, AI architectures, and endpoints, a narrative synthesis was conducted instead of meta-analysis. Findings were categorized into thematic domains, including risk prediction, imaging interpretation, and real-time monitoring. Descriptive statistics were employed to summarize methodological characteristics and performance trends. Any discrepancies during data extraction were resolved through consensus among reviewers.
Synthesis and Interpretation of Quantitative Findings
Owing to substantial heterogeneity across studies, quantitative improvements, such as increases in AUC or reductions in hospitalization rates, are reported as ranges from individual primary studies rather than pooled estimates. These values should be interpreted as indicative trends rather than definitive effect sizes. Greater interpretive weight was assigned to studies reporting comprehensive performance metrics and those with lower risk of bias.
Outcomes
The primary outcome of this umbrella review was the predictive performance of AI-based models, assessed mainly through discrimination AUC, calibration, and reclassification metrics, including net reclassification improvement and integrated discrimination improvement when reported.
Secondary outcomes comprised clinical endpoints such as hospitalization, mortality, emergency department visits, and other major cardiovascular events. These outcomes were summarized narratively without drawing causal inferences due to the observational and heterogeneous nature of most included studies.
In the narrative synthesis, findings from primary studies were prioritized, whereas secondary evidence was used solely to provide contextual pooled estimates without integration into the primary analyses.
Heterogeneity
Substantial heterogeneity was evident across the included studies in populations, data sources (electronic health records, imaging modalities, and wearable devices), AI architectures, validation strategies, and outcome definitions. This variability precluded quantitative meta-analysis and necessitates cautious interpretation of findings. Key contributors included demographic and clinical setting differences, multimodal data inputs, and predominant internal validation. These factors should be considered when evaluating generalizability.
Results
A total of 48 studies, including RCTs, 28 cohort studies, and 12 systematic reviews or meta-analyses, met predefined inclusion and quality criteria. Investigations were conducted within large collaborative programs in North America, Europe, and the Asia-Pacific region, with participant mean ages ranging from 50 to 80 years and female representation between 40% and 60%.
Reporting Characteristics of Included Prediction Models
To facilitate comparison of predictive performance across studies, key reporting characteristics of AI-based prediction models were summarized narratively, including validation approach (internal versus external), training and testing procedures (random split, chronological split, or cross-validation), calibration reporting alongside discrimination metrics, and assessment of fairness or algorithmic bias. Most studies relied exclusively on internal validation, with external validation—temporal or geographic—performed in only a small minority of investigations.19–21 Calibration metrics were inconsistently reported, with most studies presenting discrimination via AUC without corresponding calibration assessment. Only a minority of studies reported comprehensive calibration evaluation, such as calibration plots, Hosmer–Lemeshow (HL) test, calibration slope or intercept, or expected calibration error (ECE).11,12,22–24
Systematic fairness evaluation and subgroup analyses, stratified by race/ethnicity, sex, age, socioeconomic status, or geographic region, were rarely conducted, despite evidence that unmitigated algorithmic bias can substantially reduce predictive accuracy and lead to inequitable outcomes in minority, female, older, or socioeconomically disadvantaged populations.25–27 Systematic fairness evaluation and subgroup validation therefore remain important priorities for future AI-based cardiovascular prediction research.25–27
Heterogeneity and Sensitivity Considerations
Substantial clinical and methodological heterogeneity across studies precluded quantitative meta-analysis and necessitates cautious interpretation of performance estimates. Major sources of heterogeneity included differences in population characteristics, AI model architectures and feature selection, validation strategies, outcome definitions, and follow-up duration.
Imaging-based AI models generally demonstrated higher discrimination, with pooled AUC values frequently exceeding 0.90 in systematic reviews,28 whereas electronic health record-based prediction models typically showed pooled AUC values between 0.82 and 0.88.13 Sensitivity analyses, when reported, indicated that model performance was generally stable across internal validation approaches but often declined in external or temporal validation cohorts.11,21 These findings underscore the importance of external validation for reliable clinical translation.
These methodological variations significantly limit direct comparability of AUC values across studies and reduce confidence in the generalizability and reliability of the reported models. This prevailing focus on discrimination at the expense of calibration, validation, and fairness assessment constitutes a key gap in the translational readiness of AI models for CVD prediction.
Predictive Performance and Clinical Outcomes
AI techniques evaluated across studies included convolutional neural networks (CNNs), random forests (RFs), support vector machines (SVMs), and gradient boosting machines (GBMs). Across cohort studies and meta-analyses, AI-based models demonstrated predictive accuracy ranging from 80% to 95% in identifying individuals at elevated cardiovascular risk, depending on dataset characteristics and model architecture.14,15 Imaging-based models generally achieved higher discrimination (pooled AUC ~0.91) compared with EHR-based models (pooled AUC ~0.86), with meta-analytic evidence reporting 12–25% improvements over conventional risk scores such as QResearch Risk Prediction Algorithm 3 (QRISK3).28,29
Analyses of large-scale databases, including the UK Biobank AI initiatives (UK Biobank) and the EchoNet-Dynamic echocardiography study (EchoNet-Dynamic), supported improved identification of CAD, atrial fibrillation (AF), and HF-related hospitalization risk.11,30 Cohort studies such as the Multi-Ethnic Study of Atherosclerosis (MESA) and the Cardiovascular Health Study (CHS) demonstrated that integrating AI into clinical workflows enhanced risk stratification, reduced diagnostic errors, and optimized cardiac imaging efficiency.22,31 Evidence from controlled trials, including AI-electrocardiography approaches for AF detection and deep learning-based computed tomography angiography analysis, further confirmed strong predictive performance while illustrating trade-offs between model complexity, interpretability, and computational demands.23,32
Studies in Asian populations, particularly in China and Japan, emphasized the integration of wearable technologies and reported greater performance gains in urban compared with rural healthcare settings.20,33 These results suggest a consistent contribution of AI to cardiovascular risk prediction and clinical decision-making across diverse populations and care environments. However, gaps remain in calibration reporting, external validation, and fairness assessment. Mitigating these limitations through multicenter validation, explainable AI frameworks, comprehensive calibration evaluation, and fairness-aware reporting standards is essential for safe and equitable clinical implementation.
AI Applications in Cardiovascular Prediction
AI has been implemented across multiple clinical domains in cardiovascular disease, including risk stratification, diagnostic imaging, and real-time monitoring. Models integrating multimodal data sources such as electronic health records, imaging modalities, and wearable sensor data consistently outperformed single-modality approaches in detecting early subclinical disease signals.14,15 Mitigation of potential bias in AI models, including race- or ethnicity-related bias, is essential for achieving equitable clinical outcomes. Evidence indicates that unaddressed bias can compromise predictive accuracy in underrepresented populations, underscoring the importance of robust external validation and fairness-aware algorithm development.25 Furthermore, emerging observational evidence associates AI-driven monitoring strategies, including wearable-based detection systems, with reduced hospitalization rates for AF through earlier identification of arrhythmic events.34 These models leverage large-scale datasets to improve risk classification while simultaneously revealing challenges in integration into established clinical workflows (Figure 2).
Secondary evidence from meta-analyses supports these findings, demonstrating consistent improvements in predictive performance, including AUC gains of approximately 15–25% across studies, without overlapping primary data extraction.12,29
Machine Learning in Risk Assessment
ML algorithms, including random forests and gradient boosting methods, have enhanced cardiovascular disease risk prediction by capturing nonlinear interactions among traditional risk factors. In cohort studies comparing AI-based models with conventional risk scores, ML models achieved AUC values ranging from 0.82 to 0.91, representing improvements of approximately 10–18% over traditional tools such as the FRS.11,35 Figure 3 illustrates the analytical workflow through which ML models process complex datasets to improve predictive performance relative to conventional statistical approaches.
Deep Learning for Cardiac Imaging
DL models, particularly CNNs, have markedly advanced cardiac imaging analysis by automating the interpretation of echocardiography, computed tomography (CT) angiography, and magnetic resonance imaging (MRI). Beyond anatomical evaluation, these models increasingly extract prognostic imaging biomarkers capable of forecasting future cardiovascular events. Reported diagnostic accuracy ranges from 85% to 95% for conditions such as left ventricular dysfunction (LVD) and CAD, while also facilitating prediction of incident HF, MI, and cardiovascular mortality through detection of subtle subclinical patterns not discernible via conventional interpretation.23,28,36,37 These systems offer advantages in processing speed, reproducibility, and long-term risk stratification. Figure 4 depicts the workflow for DL-based imaging analysis and prediction.
Natural Language Processing in EHR Analysis
Natural language processing (NLP) methods enable extraction of clinically relevant information from unstructured EHR components, including physician notes and discharge summaries, capturing risk signals not available in structured data fields. When combined with structured EHR variables, NLP-enhanced models demonstrate improved discrimination for predicting MI and HF onset. Meta-analytic evidence indicates approximately 12–20% improvement in predictive accuracy, primarily measured by AUC, compared with traditional risk scores.12 Specifically, predictive performance for MI improved in coronary artery disease populations,29 while HF onset prediction, including hospital readmission risk, showed approximately 12–18% improvement.12 These results underscore the broad applicability of NLP-based approaches across cardiovascular risk prediction tasks. Figure 5 depicts the integration of NLP-derived features with structured EHR data for enhanced prediction of MI and HF outcomes.
Personalized Medicine and AI in Cardiovascular Risk
AI has enabled the development of patient-specific risk models by integrating genetic, clinical, and lifestyle data to support individualized preventive strategies. Studies report an 18–25% improvement in calibration of individualized risk estimates compared with population-based risk scores, facilitating more precise therapeutic targeting in high-risk subgroups.15,38 Figure 6 illustrates the key components of this approach, including predictive analytics for HF, genomic data integration, and wearable-based monitoring systems that enhance clinical decision-making.
Predictive Modeling for Heart Failure
ML and DL algorithms have been applied to predict HF onset and progression using multimodal inputs such as echocardiographic parameters and circulating biomarkers. These models illustrated AUC values ranging from 0.88 to 0.94 for prediction of one-year readmission risk, outperforming conventional prognostic scores by around 15–22% and enabling earlier clinical intervention.30,39
Integration of Genomic Data
AI-based models that incorporate genomic variants alongside clinical risk profiles have further refined cardiovascular risk prediction. Polygenic risk scores enhanced through ML-based modeling improved prediction of coronary events by 20–30% beyond traditional clinical risk factors, with the greatest incremental value observed in younger populations.38,40
Wearable Technology and Real-Time Monitoring
Wearable devices integrated with AI algorithms enable continuous monitoring of heart rate (HR) variability, physical activity patterns, and early arrhythmia detection. Reported sensitivity for AF detection ranged from 90% to 96%, while RCTs and cohort studies documented reductions in emergency department utilization ranging from 18% to 25% among individuals receiving proactive algorithm-driven alerts compared with usual care.32,34 Table 1 summarizes the current evidence and practical applications of artificial intelligence in cardiovascular disease prediction.
|
Table 1 Key Applications of Artificial Intelligence in Cardiovascular Disease Prediction: a Stratified Overview of Landmark Studies |
Discussion
This umbrella review evaluated the clinical applications and translational limitations AI in CVD risk prediction, with particular emphasis on real-world implementation rather than purely theoretical performance metrics. In addition to synthesizing existing evidence, this review proposes targeted strategies to address current barriers and promote interdisciplinary collaboration, thereby extending prior reviews that primarily focused on algorithmic performance.
The findings indicate that AI approaches, including ML, DL, and multimodal data integration, are consistently associated with improved predictive discrimination in cardiovascular risk assessment. More than 70% of imaging-based studies reported AUC values exceeding 0.90, and several individual studies described improvements in risk stratification ranging from approximately 15% to 25% compared with traditional tools such as the FRS and QRISK.11,15 These improvements represent ranges reported across individual studies rather than pooled meta-analytic estimates and should therefore be interpreted as indicative rather than definitive.
Performance gains were particularly evident in real-time monitoring using wearable devices and in models integrating EHRs with genomic data, enabling earlier detection of AF, CAD, and HF across heterogeneous populations. One prospective study reported approximately 19% fewer unplanned cardiovascular hospitalizations with AI-assisted wearable monitoring compared with usual care;51 however, this observation derives from a single study and warrants cautious interpretation.
An important consideration for clinical translation is the uneven maturity of evidence across AI applications. Imaging-based AI models, including automated echocardiographic view classification, ejection fraction quantification, and coronary stenosis detection, have undergone extensive technical validation and, in some cases, prospective clinical evaluation.20,23,28,46 Several tools have received regulatory clearance (eg, FDA or CE marking) and are primarily designed to enhance diagnostic accuracy and workflow efficiency, placing them closer to routine clinical adoption.15,28 In contrast, multimodal and wearable-based AI models integrating EHR data, genomics, and continuous sensor inputs for long-term individualized risk prediction remain in a more exploratory phase.12,15,38 Although these approaches hold substantial preventive potential, they frequently lack robust external validation across diverse populations, prospective trials demonstrating clinical utility, and clearly defined regulatory pathways.19,25,33
Compared with prior high-impact publications, this umbrella review expands beyond analyses of deep learning (DL)-based cardiac imaging reported in Journal of the American Medical Association Cardiology (JAMA Cardiol)28 by incorporating wearable technologies and polygenic risk scores (PRSs). By explicitly stratifying primary and secondary evidence, the present synthesis provides a more transparent comparison across levels of evidence. Additionally, studies from Asian cohorts reported up to 20% higher predictive performance in urban settings, potentially reflecting greater adoption of wearable technologies; these findings should be interpreted as context-specific observations rather than generalizable effects.
Whereas European Heart Journal (EHJ) meta-analyses have emphasized the limitations of traditional risk algorithms,9 the present synthesis underscores the capacity of AI models to capture nonlinear relationships within EHR data and unstructured clinical narratives through NLP, outperforming guideline-endorsed risk models by roughly 18–22% in multiethnic datasets.30 Even more recent conventional scores, such as the PRECISE-HBR score for post-percutaneous coronary intervention (PCI) bleeding risk prediction, largely represent incremental improvements in discrimination within specific procedural contexts.10 By comparison, AI-based approaches offer broader multimodal integration, improved detection of nonlinear patterns, and adaptability across diverse cardiovascular conditions, representing a more fundamental methodological advancement.
Despite the promise of AI in precision medicine,15 clinical translation remains slower than anticipated.15 A central finding of this umbrella review is that progress has been uneven, with imaging-based tools approaching clinical readiness while multimodal and wearable-based models require further maturation in validation, interpretability, and regulatory pathways.
Limitations
The included studies showed substantial methodological heterogeneity in populations, AI algorithms, data sources, and outcome definitions, which precluded quantitative meta-analysis. The evidence is disproportionately focused on discrimination metrics (AUC), with comparatively less attention to other essential performance aspects.
Further limitations arise from restriction to English-language publications and datasets primarily from high-income countries, potentially restricting generalizability to low-resource settings and underrepresented populations. Additional barriers to clinical translation include data privacy considerations, the need for large high-quality datasets, variability in acute care delivery and provider behavior, and evolving regulatory requirements. These factors collectively underscore the need for rigorous multicenter validation, prospective implementation studies, and standardized reporting to facilitate safe and equitable adoption of AI-based cardiovascular risk prediction models.
Future Directions
Future research should prioritize large, multicenter randomized controlled trials using diverse longitudinal datasets to reduce selection and spectrum bias and improve global generalizability. Development of explainable AI (XAI) frameworks will be essential to enhance interpretability and clinician trust, notably in real-time clinical decision-support contexts. Health-economic analyses should also be incorporated to evaluate cost-effectiveness and equity in real-world implementation.
Federated learning architectures, together with adherence to reporting standards such as TRIPOD-AI, will be critical for accelerating regulatory approval, improving transparency, and facilitating clinical integration. Future studies synthesizing both primary and secondary evidence may further clarify the rapidly evolving landscape of AI applications in cardiovascular medicine. AI-based CVD prediction studies should routinely report both discrimination and comprehensive calibration metrics, including calibration plots, calibration-in-the-large statistics, and expected calibration error (ECE).
Conclusion
This umbrella review demonstrates that AI-based models show substantial promise in improving cardiovascular risk prediction, frequently achieving high discriminatory performance, particularly in imaging-based applications.
Bridging the gap between algorithmic performance and clinical implementation will require a fundamental shift in research priorities toward multicenter validation, explainable AI frameworks, comprehensive performance reporting, and equity-focused evaluation. A balanced evaluation framework that equally prioritizes accuracy, reliability, fairness, and clinical utility will be essential for the safe, effective, and equitable integration of AI into cardiovascular care.
AI Statement
AI-assisted tools, including Grammarly, were used exclusively for language editing and readability enhancement. All scientific content, analyses, and interpretations were performed by the authors.
Author Contributions
All authors made a significant contribution to the work reported, whether that is in the conception, study design, execution, acquisition of data, analysis and interpretation, or in all these areas; took part in drafting, revising or critically reviewing the article; gave final approval of the version to be published; have agreed on the journal to which the article has been submitted; and agree to be accountable for all aspects of the work.
Funding
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
Disclosure
The authors report no conflicts of interest in this work.
References
1. George AM, Fuster V, Murray CJ, et al. Global burden of cardiovascular diseases and risks, 1990–2022; 2023.
2. Roth GA; Global Burden of Cardiovascular Diseases and Risks 2023 Collaborators. Global, regional, and national burden of cardiovascular diseases and risk factors in 204 countries and territories, 1990–2023. Available at SSRN 5392535; 2025.
3. Martin S, Aday A, Almarzooq Z. Heart disease and stroke statistics-2024 update: a report from the American Heart Association. Circulation. 2024;149(8):e347–19. doi:10.1161/CIR.0000000000001209
4. Timmis A, Vardas P, Townsend N, et al. European Society of Cardiology: cardiovascular disease statistics 2021. Eur Heart J. 2022;43(8):716–799. doi:10.1093/eurheartj/ehab892
5. Chen H, Liu L, Wang Y, et al. Burden of cardiovascular disease attributable to metabolic risks in 204 countries and territories from 1990 to 2021. Eur Heart J. 2025;11(4):467–476. doi:10.1093/ehjqcco/qcae090
6. Mensah GA, Roth GA, Fuster V. The Global Burden of Cardiovascular Diseases and Risk Factors: 2020 and Beyond. Washington, DC: American College of Cardiology Foundation; 2019:2529–2532.
7. Visseren FL, Mach F, Smulders YM, et al. 2021 ESC Guidelines on cardiovascular disease prevention in clinical practice: developed by the Task Force for cardiovascular disease prevention in clinical practice with representatives of the European Society of Cardiology and 12 medical societies With the special contribution of the European Association of Preventive Cardiology (EAPC). Eur Heart J. 2021;42(34):3227–3337. doi:10.1093/eurheartj/ehab484
8. Hageman S, Pennells L, Ojeda F. SCORE2 risk prediction algorithms: new models to estimate 10-year risk of cardiovascular disease in Europe. Eur Heart J. 2021;42(25):2439–2454. doi:10.1093/eurheartj/ehab309
9. Damen JA, Hooft L, Schuit E, et al. Prediction models for cardiovascular disease risk in the general population: systematic review. BMJ. 2016;353. doi:10.1136/bmj.i2416
10. Gragnano F, van Klaveren D, Heg D, et al. Derivation and validation of the PRECISE-HBR score to predict bleeding after percutaneous coronary intervention. Circulation. 2025;151(6):343–355. doi:10.1161/CIRCULATIONAHA.124.072009
11. Alaa AM, Bolton T, Di Angelantonio E, Rudd JH, Van der Schaar M, Aalto-Setala K. Cardiovascular disease risk prediction using automated machine learning: a prospective study of 423,604 UK Biobank participants. PLoS One. 2019;14(5):e0213653. doi:10.1371/journal.pone.0213653
12. Liu T, Krentz A, Lu L, Curcin V. Machine learning based prediction models for cardiovascular disease risk using electronic health records data: systematic review and meta-analysis. Eur Heart J Digit Health. 2025;6(1):7–22. doi:10.1093/ehjdh/ztae080
13. Al-Khero KN, Al-Kheroo MK, Hasan HB. Integrating artificial intelligence into cardiovascular risk prediction: a comprehensive review of models, predictors, and limitations: a review. Centr Asian J Med Natur Sci. 2025;6(4):1404–1412.
14. Krittanawong C, Virk HUH, Bangalore S, et al. Machine learning prediction in cardiovascular diseases: a meta-analysis. Sci Rep. 2020;10(1):16057. doi:10.1038/s41598-020-72685-1
15. Topol EJ. High-performance medicine: the convergence of human and artificial intelligence. Nature Med. 2019;25(1):44–56. doi:10.1038/s41591-018-0300-7
16. Page MJ, McKenzie JE, Bossuyt PM, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ. 2021;372: e112.
17. Sterne JA, Savović J, Page MJ, et al. RoB 2: a revised tool for assessing risk of bias in randomised trials. BMJ. 2019;2019:366.
18. Wells GA, Shea B, O’Connell D, et al. The Newcastle-Ottawa Scale (NOS) for assessing the quality of nonrandomised studies in meta-analyses; 2000.
19. Liu T, Krentz AJ, Huo Z, Ćurčin V. Opportunities and challenges of cardiovascular disease risk prediction for primary prevention using machine learning and electronic health records: a systematic review. Rev cardiovasc med. 2025;26(4):37443. doi:10.31083/RCM37443
20. Zhang J, Gajjala S, Agrawal P, et al. Fully automated echocardiogram interpretation in clinical practice: feasibility and diagnostic accuracy. Circulation. 2018;138(16):1623–1635. doi:10.1161/CIRCULATIONAHA.118.034338
21. Kim M-N, Lee YS, Park Y, et al. Deep learning for predicting rehospitalization in acute heart failure: model foundation and external validation. ESC Heart Fail. 2024;11(6):3702–3712. doi:10.1002/ehf2.14918
22. Ambale-Venkatesh B, Yang X, Wu CO, et al. Cardiovascular event prediction by machine learning: the multi-ethnic study of atherosclerosis. Circ Res. 2017;121(9):1092–1101. doi:10.1161/CIRCRESAHA.117.311312
23. Madani A, Arnaout R, Mofrad M, Arnaout R. Fast and accurate view classification of echocardiograms using deep learning. NPJ Digital Med. 2018;1(1):6. doi:10.1038/s41746-017-0013-1
24. Raghunath S, Pfeifer JM, Ulloa-Cerna AE, et al. Deep neural networks can predict new-onset atrial fibrillation from the 12-lead ECG and help identify those at risk of atrial fibrillation–related stroke. Circulation. 2021;143(13):1287–1298. doi:10.1161/CIRCULATIONAHA.120.047829
25. Noseworthy PA, Attia ZI, Brewer LC, et al. Assessing and mitigating bias in medical artificial intelligence: the effects of race and ethnicity on a deep learning model for ECG analysis. Circulation. 2020;13(3):e007988. doi:10.1161/CIRCEP.119.007988
26. Obermeyer Z, Powers B, Vogeli C, Mullainathan S. Dissecting racial bias in an algorithm used to manage the health of populations. Science. 2019;366(6464):447–453. doi:10.1126/science.aax2342
27. Seyyed-Kalantari L, Zhang H, McDermott MB, Chen IY, Ghassemi M. Underdiagnosis bias of artificial intelligence algorithms applied to chest radiographs in under-served patient populations. Nature Med. 2021;27(12):2176–2182. doi:10.1038/s41591-021-01595-0
28. Wehbe RM, Katsaggelos AK, Hammond KJ, et al. Deep learning for cardiovascular imaging: a review. JAMA Cardiol. 2023;8(11):1089–1098. doi:10.1001/jamacardio.2023.3142
29. Cicek V, Cikirikci EHK, Babaoğlu M, et al. Machine learning for prognostic prediction in coronary artery disease with SPECT data: a systematic review and meta-analysis. EJNMMI Res. 2024;14(1):117. doi:10.1186/s13550-024-01179-2
30. Ouyang D, He B, Ghorbani A, et al. Video-based AI for beat-to-beat assessment of cardiac function. Nature. 2020;580(7802):252–256. doi:10.1038/s41586-020-2145-8
31. Bello GA, Dawes TJ, Duan J, et al. Deep-learning cardiac motion analysis for human survival prediction. Nature Mach Intell. 2019;1(2):95–104. doi:10.1038/s42256-019-0019-2
32. Attia ZI, Noseworthy PA, Lopez-Jimenez F, et al. An artificial intelligence-enabled ECG algorithm for the identification of patients with atrial fibrillation during sinus rhythm: a retrospective analysis of outcome prediction. Lancet. 2019;394(10201):861–867. doi:10.1016/S0140-6736(19)31721-0
33. Cai Y, Cai Y-Q, Tang L-Y, et al. Artificial intelligence in the risk prediction models of cardiovascular disease and development of an independent validation screening tool: a systematic review. BMC Med. 2024;22(1):56. doi:10.1186/s12916-024-03273-7
34. Steinhubl SR, Waalen J, Edwards AM, et al. Effect of a home-based wearable continuous ECG monitoring patch on detection of undiagnosed atrial fibrillation: the mSToPS randomized clinical trial. JAMA. 2018;320(2):146–155. doi:10.1001/jama.2018.8102
35. Weng SF, Reps J, Kai J, Garibaldi JM, Qureshi N, Liu B. Can machine-learning improve cardiovascular risk prediction using routine clinical data? PLoS One. 2017;12(4):e0174944. doi:10.1371/journal.pone.0174944
36. Juarez-Orozco LE, Martinez-Manzanera O, van der Zant FM, Knol RJ, Knuuti J. Deep learning in quantitative PET myocardial perfusion imaging: a study on cardiovascular event prediction. Cardiovasc Imaging. 2020;13(1_Part_1):180–182. doi:10.1016/j.jcmg.2019.08.009
37. Li Y-L, Leu H-B, Ting C-H, et al. Predicting long-term time to cardiovascular incidents using myocardial perfusion imaging and deep convolutional neural networks. Sci Rep. 2024;14(1):3802. doi:10.1038/s41598-024-54139-0
38. Krittanawong C, Johnson KW, Choi E, et al. Artificial intelligence and cardiovascular genetics. Life. 2022;12(2):279. doi:10.3390/life12020279
39. Farajidavar N, O’Gallagher K, Bean D, et al. Diagnostic signature for heart failure with preserved ejection fraction (HFpEF): a machine learning approach using multi-modality electronic health record data. BMC Cardiovasc Disord. 2022;22(1):567. doi:10.1186/s12872-022-03005-w
40. Aragam KG, Jiang T, Goel A, et al. Discovery and systematic characterization of risk variants and genes for coronary artery disease in over a million participants. Nature Genet. 2022;54(12):1803–1815. doi:10.1038/s41588-022-01233-6
41. Narula S, Shameer K, Salem Omar AM, Dudley JT, Sengupta PP. Machine-learning algorithms to automate morphological and functional assessments in 2D echocardiography. J Am Coll Cardiol. 2016;68(21):2287–2295. doi:10.1016/j.jacc.2016.08.062
42. Krittanawong C, Zhang H, Wang Z, Aydar M, Kitai T. Artificial intelligence in precision cardiovascular medicine. J Am Coll Cardiol. 2017;69(21):2657–2664. doi:10.1016/j.jacc.2017.03.571
43. Dimopoulos AC, Nikolaidou M, Caballero FF, et al. Machine learning methodologies versus cardiovascular risk scores, in predicting disease risk. BMC Med Res Method. 2018;18(1):179. doi:10.1186/s12874-018-0644-1
44. Oikonomou EK, Williams MC, Kotanidis CP, et al. A novel machine learning-derived radiotranscriptomic signature of perivascular fat improves cardiac risk prediction using coronary CT angiography. Eur Heart J. 2019;40(43):3529–3543. doi:10.1093/eurheartj/ehz592
45. Sengupta PP, Huang Y-M, Bansal M, et al. Cognitive machine-learning algorithm for cardiac imaging: a pilot study for differentiating constrictive pericarditis from restrictive cardiomyopathy. Circulation. 2016;9(6):e004330. doi:10.1161/CIRCIMAGING.115.004330
46. Ghorbani A, Ouyang D, Abid A, et al. Deep learning interpretation of echocardiograms. NPJ Digital Med. 2020;3(1):10. doi:10.1038/s41746-019-0216-8
47. Cho S-Y, Kim S-H, Kang S-H, et al. Pre-existing and machine learning-based models for cardiovascular risk prediction. Sci Rep. 2021;11(1):8886. doi:10.1038/s41598-021-88257-w
48. Khurshid S, Friedman S, Reeder C, et al. ECG-based deep learning and clinical risk factors to predict atrial fibrillation. Circulation. 2022;145(2):122–133. doi:10.1161/CIRCULATIONAHA.121.057480
49. Mannhart D, Lischer M, Knecht S, et al. Clinical validation of 5 direct-to-consumer wearable smart devices to detect atrial fibrillation: BASEL wearable study. Clin Electrophysiol. 2023;9(2):232–242. doi:10.1016/j.jacep.2022.09.011
50. Subramani S, Varshney N, Anand MV, et al. Cardiovascular diseases prediction by machine learning incorporation with deep learning. Front Med. 2023;10:1150933. doi:10.3389/fmed.2023.1150933
51. Perez MV, Mahaffey KW, Hedlin H, et al. Large-scale assessment of a smartwatch to identify atrial fibrillation. N Engl J Med. 2019;381(20):1909–1917. doi:10.1056/NEJMoa1901183
52. Hu W, Yii FS, Chen R, et al. A systematic review and meta-analysis of applying deep learning in the prediction of the risk of cardiovascular diseases from retinal images. Trans Vision Sci Technol. 2023;12(7):14. doi:10.1167/tvst.12.7.14
53. Yuan N, Duffy G, Dhruva SS, et al. Deep learning of electrocardiograms in sinus rhythm from US veterans to predict atrial fibrillation. JAMA Cardiol. 2023;8(12):1131–1139. doi:10.1001/jamacardio.2023.3701
54. Ding C, Xiao R, Do DH, et al. Log-spectral matching GAN: PPG-based atrial fibrillation detection can be enhanced by GAN-based data augmentation with integration of spectral loss. IEEE J Biomed Health Inform. 2023;27(3):1331–1341. doi:10.1109/JBHI.2023.3234557
55. Oikonomou EK, Khera R. Machine learning in precision diabetes care and cardiovascular risk prediction. Cardiovasc Diabetol. 2023;22(1):259. doi:10.1186/s12933-023-01985-3
56. Weiss J, Raghu VK, Paruchuri K, et al. Deep learning to estimate cardiovascular risk from chest radiographs: a risk prediction study. Ann Internal Med. 2024;177(4):409–417. doi:10.7326/M23-1898
57. Al-Alshaikh HA, P P, Poonia RC, et al. Comprehensive evaluation and performance analysis of machine learning in heart disease prediction. Sci Rep. 2024;14(1):7819. doi:10.1038/s41598-024-58489-7
58. Teshale AB, Htun HL, Vered M, Owen AJ, Freak-Poli R. A systematic review of artificial intelligence models for Time-to-Event outcome applied in cardiovascular disease risk prediction. J Med Syst. 2024;48(1):68. doi:10.1007/s10916-024-02087-7
59. Singh M, Kumar A, Khanna NN, et al. Artificial intelligence for cardiovascular disease risk assessment in personalised framework: a scoping review. EClinicalMedicine. 2024;73:102660. doi:10.1016/j.eclinm.2024.102660
60. Dorraki M, Liao Z, Abbott D, et al. Improving cardiovascular disease prediction with machine learning using mental health data: a prospective UK Biobank study. JACC. 2024;3(9_Part_2):101180. doi:10.1016/j.jacadv.2024.101180
61. El-Sofany H, Bouallegue B, El-Latif YMA. A proposed technique for predicting heart disease using machine learning algorithms and an explainable AI method. Sci Rep. 2024;14(1):23277. doi:10.1038/s41598-024-74656-2
62. Jiang X, Wang B. Enhancing Clinical decision making by predicting readmission risk in patients with heart failure using machine learning: predictive model development study. JMIR Med Inform. 2024;12(1):e58812. doi:10.2196/58812
63. Rehman MU, Naseem S, Butt AUR, et al. Predicting coronary heart disease with advanced machine learning classifiers for improved cardiovascular risk assessment. Sci Rep. 2025;15(1):13361. doi:10.1038/s41598-025-96437-1
64. Karim SR, Helseth HC, Baker PO, et al. Artificial Intelligence detection of occlusive myocardial infarction from electrocardiograms interpreted as “Normal” by conventional algorithms. J Personal Med. 2025;15(4):130. doi:10.3390/jpm15040130
65. Elvas LB, Almeida A, Ferreira JC. The role of AI in cardiovascular event monitoring and early detection: scoping literature review. JMIR Med Inform. 2025;13(1):e64349. doi:10.2196/64349
66. Meder B, Asselbergs FW, Ashley E. Artificial intelligence to improve cardiovascular population health. Eur Heart J. 2025;46(20):1907–1916. doi:10.1093/eurheartj/ehaf125
67. Rohan D, Reddy GP, Kumar YP, Prakash KP, Reddy CP. An extensive experimental analysis for heart disease prediction using artificial intelligence techniques. Sci Rep. 2025;15(1):6132. doi:10.1038/s41598-025-90530-1
68. Shojaei S, Mousavi A, Kazemian S, et al. Artificial intelligence in risk stratification and outcome prediction for transcatheter aortic valve replacement: a systematic review and meta-analysis. J Personal Med. 2025;15(7):302. doi:10.3390/jpm15070302
69. Bdir S, Jaber M, Tanbouz O, et al. Artificial intelligence for myocardial infarction detection via electrocardiogram: a scoping review. J Clin Med. 2025;14(19):6792. doi:10.3390/jcm14196792
70. Majumder S, Sen K, Karanjai R. Artificial intelligence-based target for personalized interventions of atherosclerosis from gut microbiota signature. SynBio. 2025;3(1):2. doi:10.3390/synbio3010002
71. Kasartzian D-I, Tsiampalis T. Transforming cardiovascular risk prediction: a review of machine learning and artificial intelligence innovations. Life. 2025;15(1):94. doi:10.3390/life15010094
© 2026 The Author(s). This work is published and licensed by Dove Medical Press Limited. The
full terms of this license are available at https://www.dovepress.com/terms
and incorporate the Creative Commons Attribution
- Non Commercial (unported, 4.0) License.
By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted
without any further permission from Dove Medical Press Limited, provided the work is properly
attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms.
Recommended articles
Awareness and Predictors of the Use of Bioinformatics in Genome Research in Saudi Arabia
Alomair L, Abolfotouh MA
International Journal of General Medicine 2023, 16:3413-3425
Published Date: 11 August 2023
A Systematic Review of Real-Time Deep Learning Methods for Image-Based Cancer Diagnostics
Sriraman H, Badarudeen S, Vats S, Balasubramanian P
Journal of Multidisciplinary Healthcare 2024, 17:4411-4425
Published Date: 9 September 2024
An Artificial Intelligence Pipeline for Hepatocellular Carcinoma: From Data to Treatment Recommendations
Zhang X, Yang L, Liu C, Yuan X, Zhang Y
International Journal of General Medicine 2025, 18:3581-3595
Published Date: 2 July 2025
Artificial Intelligence in Neuro-Ophthalmology for Optic Disc Pathologies and Neurodegenerative Disease
Ahuja AS, Paredes III AA, Eisel MLS, Miller C, Truong N, Falardeau J
Eye and Brain 2026, 18:555894
Published Date: 13 March 2026
Advancements in Image-Based Artificial Intelligence in the Diagnosis and Treatment of Head and Neck Squamous Cell Carcinoma: A Narrative Review
Wu B, Gu J, Chen T
International Journal of General Medicine 2026, 19:593911
Published Date: 9 May 2026
