AI boosts predictive healthcare yet struggles with data, ethics and workflow adoption

In oncology and cardiology, AI’s strength lies in deep learning techniques applied to imaging and genomic data. Convolutional neural networks and recurrent architectures have been deployed to interpret radiology scans, pathology slides, and genetic markers, supporting early detection and treatment personalization. These domains showcase AI’s ability to process complex, high-dimensional data that far exceeds human capacity.


CO-EDP, VisionRICO-EDP, VisionRI | Updated: 29-09-2025 09:28 IST | Created: 29-09-2025 09:28 IST
AI boosts predictive healthcare yet struggles with data, ethics and workflow adoption
Representative Image. Credit: ChatGPT

Artificial intelligence is rapidly reshaping predictive healthcare, offering powerful tools to forecast disease outcomes and guide clinical decisions. A new systematic review by researchers from Imam Mohammad Ibn Saud Islamic University in Riyadh provides a detailed assessment of where AI has made the biggest impact and where challenges continue to limit its widespread adoption.

The study, titled Artificial Intelligence in Predictive Healthcare: A Systematic Review and published in the Journal of Clinical Medicine, synthesizes findings from 22 peer-reviewed papers published between 2021 and 2025. It maps out how machine learning models have been deployed across intensive care, oncology, cardiology, diabetes, chronic disease management, and COVID-19, while also identifying the most common algorithms, evaluation metrics, limitations, and future directions for the field.

Where is predictive AI making the strongest impact?

The review highlights that the most significant applications of predictive AI have emerged in intensive care units (ICUs) and critical care settings. Rich structured datasets such as electronic health records and continuous monitoring of vital signs provide fertile ground for machine learning. Models such as Random Forest and XGBoost have been used effectively to predict life-threatening outcomes including sepsis, mortality, and readmissions.

In oncology and cardiology, AI’s strength lies in deep learning techniques applied to imaging and genomic data. Convolutional neural networks and recurrent architectures have been deployed to interpret radiology scans, pathology slides, and genetic markers, supporting early detection and treatment personalization. These domains showcase AI’s ability to process complex, high-dimensional data that far exceeds human capacity.

For chronic disease management, including diabetes, predictive models have increasingly integrated Internet of Things (IoT) devices such as wearables and remote sensors. These systems enable real-time monitoring of blood sugar, heart rate, and other biomarkers, enhancing patient engagement and long-term care planning.

The review also addresses the surge of research around COVID-19, where AI was leveraged to predict hospital admissions, severity progression, and patient outcomes. While promising, these models often lacked robust validation, as many were trained on limited datasets under the urgency of the pandemic.

Finally, in primary care and general preventive health, AI has been explored for risk scoring and early-warning systems, offering potential to shift healthcare from a reactive to a proactive model.

Which models deliver the most reliable predictions?

The analysis shows that no single model consistently outperforms across all healthcare domains. Instead, effectiveness depends heavily on the type of data and the clinical context.

Tree-based ensemble methods, including Random Forest, XGBoost, and LightGBM, dominated applications using structured tabular data such as patient records and laboratory results. Their interpretability and robustness make them well-suited to high-stakes clinical environments.

Deep learning architectures, particularly convolutional neural networks, excelled in handling imaging data, while recurrent and long short-term memory (LSTM) models were most effective for sequential signals like electrocardiograms. These models outperformed traditional approaches by capturing subtle patterns across time and space.

Hybrid strategies emerged as particularly promising. By combining ensemble methods with deep learning, researchers achieved superior performance compared to single-model approaches. This reflects the diversity of healthcare data, where structured records, imaging, and time-series inputs often need to be analyzed together to achieve accurate predictions.

The review also documents the evaluation metrics most commonly employed. Area under the receiver operating characteristic curve (AUROC), F1-score, accuracy, and sensitivity were frequently used in critical care to ensure high-risk patients were not missed. Oncology and survival analysis studies leaned more on C-index and Brier scores to assess long-term outcomes. The choice of metrics varied by context, reflecting the stakes of clinical decision-making in different specialties.

What barriers prevent wider clinical adoption?

Despite clear progress, the study underscores multiple barriers that limit AI’s real-world impact in predictive healthcare.

A key concern is data generalizability. Many studies relied on single-center datasets such as MIMIC and PhysioNet. While these repositories are invaluable, their demographic and geographic limitations mean that models trained on them may not perform reliably in diverse healthcare systems or populations. Without multi-center validation, scalability remains uncertain.

Interpretability also remains a major hurdle. Clinicians are hesitant to adopt black-box systems without clear explanations of how predictions are made. Tools such as SHAP and LIME, which offer transparency into model decisions, are still underutilized. Without interpretability, trust and accountability remain fragile.

Integration into clinical workflows is another weak point. Few models have been embedded into hospital systems or linked seamlessly with electronic health records. This gap means that even strong algorithms remain in research settings rather than reaching the bedside.

The review highlights growing concerns over privacy and ethics. With sensitive medical data at the core of predictive models, safeguarding patient information is critical. Federated learning, which allows algorithms to be trained across multiple institutions without data sharing, is identified as a promising solution but is not yet widely implemented. Regulatory compliance and standardized frameworks also lag behind technical advances.

  • FIRST PUBLISHED IN:
  • Devdiscourse
Give Feedback