Back to Journals » International Journal of Chronic Obstructive Pulmonary Disease » Volume 21

Development and External Validation of a Machine Learning Model for 90-Day Readmission in Hospitalized Older Patients with AECOPD: A Two-Center Study

Authors Zhang G, Wang D, Chen H, Dai W, Fan X, Liu Y, Jiang L

Received 21 February 2026

Accepted for publication 30 April 2026

Published 5 May 2026 Volume 2026:21 602126

DOI https://doi.org/10.2147/COPD.S602126

Checked for plagiarism Yes

Review by Single anonymous peer review

Peer reviewer comments 3

Editor who approved publication: Prof. Dr. Zijing Zhou



Guibin Zhang,1 Dan Wang,1 Hang Chen,1 Wenchao Dai,1 Xingfu Fan,2 Yulian Liu,3 Li Jiang1

1Department of Respiratory and Critical Care Medicine, Affiliated Hospital of North Sichuan Medical College, Nanchong, Sichuan, 637000, People’s Republic of China; 2Department of General Medicine, Affiliated Hospital of North Sichuan Medical College, Nanchong, Sichuan, 637000, People’s Republic of China; 3Department of Respiratory and Critical Care Medicine, Dazhou Hospital of Integrated Traditional and Western Medicine, Dazhou, Sichuan, 635000, People’s Republic of China

Correspondence: Li Jiang, Email [email protected]

Background: Short-term readmission after hospitalization for acute exacerbation of chronic obstructive pulmonary disease (AECOPD) is common in older adults, yet early risk stratification remains limited. We aimed to develop and validate a 90-day readmission model using early admission data.
Methods: This retrospective two-center study used a development cohort from the Affiliated Hospital of North Sichuan Medical College and an external validation cohort from Dazhou Integrated Traditional Chinese and Western Medicine Hospital. Predictors were limited to early admission variables harmonized across sites. The neutrophil-to-lymphocyte ratio (NLR), platelet-to-lymphocyte ratio (PLR), and systemic immune-inflammation index (SII) were derived, with component blood counts removed to reduce collinearity. Feature selection used stability selection with Elastic Net regularization. Five models were trained and compared: multivariable logistic regression, naïve Bayes (NB), linear discriminant analysis (LDA), gradient boosting machine (GBM), and extreme gradient boosting (XGBoost). Discrimination was assessed by area under the receiver operating characteristic curve (AUC), with internal validation using bootstrap-derived optimism-corrected AUC and Brier score for overall error. Interpretability was examined with Shapley additive explanations (SHAP).
Results: A total of 692 patients were included (development, n=513; external validation, n=179). Six predictors were retained: NLR, SII, D-dimer, frequent exacerbations (FE), body mass index (BMI), and albumin (ALB). No strong multicollinearity was detected (|r|< 0.90; variance inflation factors (VIFs) < 5). XGBoost showed the best discrimination in the development cohort (AUC=0.892) and remained stable after internal validation (optimism corrected AUC=0.864). In the external cohort, XGBoost again achieved the highest AUC (0.847) with a lower Brier score than alternative models. SHAP analyses indicated D-dimer, NLR, and FE as major contributors and suggested non-linear effects.
Conclusion: Using early admission data, we developed and externally validated a 90-day readmission prediction model for older adults hospitalized with AECOPD. XGBoost showed stable performance and clinically interpretable risk patterns, supporting its potential for early risk stratification.

Keywords: acute exacerbation of chronic obstructive pulmonary disease, 90-day readmission, extreme gradient boosting, external validation, shapley additive explanations, multicohort study

Introduction

Chronic obstructive pulmonary disease (COPD) remains one of the leading chronic respiratory diseases worldwide and continues to exert a heavy, long-term burden on public health and healthcare resources. The Global Burden of Disease (GBD) study estimated that approximately 212 million people were living with COPD in 2019, with the overall burden still increasing.1 COPD demonstrates a strong age-related pattern: as individuals grow older, airway remodeling, immunosenescence, and accumulating comorbidities make the condition both more prevalent and more challenging to manage. A systematic review suggests that COPD prevalence among adults aged ≥65 years is roughly 15%, far exceeding that seen in younger populations.2 As populations age, the number of older adults with COPD is expected to rise further, and their higher comorbidity burden and reduced physiological reserve are likely to translate into greater hospitalization risk and healthcare utilization.1,2

Across the course of COPD, acute exacerbations (AECOPD) are key turning points. They can trigger sudden worsening of symptoms, drive emergency department visits and hospitalizations, and often make longer-term disease control more difficult.3 Short-term readmission after hospitalization is among the most frequent and challenging problems in AECOPD management and is commonly regarded as a clinical signal of inadequate disease control and heightened patient vulnerability. Prior systematic reviews suggest that 30–90-day readmission following hospitalization for COPD exacerbation is not uncommon; although substantial heterogeneity exists across regions and studies, the overall readmission level remains high.4 In older adults (≥65 years), diminished physiological reserve, greater comorbidity burden, and more pronounced inflammatory/coagulation stress responses may delay recovery and increase the likelihood of recurrent decompensation after discharge, thereby elevating short-term readmission risk.5 Therefore, developing a risk assessment tool that can be applied early in the hospitalization course—so as to identify patients at high risk of readmission within 90 days and to inform post-discharge management decisions—has clear clinical value.4

Inflammation is central to the onset and progression of COPD as well as to acute exacerbations, and it is closely associated with adverse outcomes.6 During AECOPD, airway and systemic inflammation may worsen simultaneously, impairing gas exchange and exercise tolerance and hindering overall recovery, which may in turn increase the risk of short-term adverse events after discharge.6 In practice, single inflammatory markers can be sensitive to sampling time and short-term fluctuations. Consequently, composite inflammatory indices derived from routine blood cell counts have attracted increasing attention in recent years, including the neutrophil-to-lymphocyte ratio (NLR), platelet-to-lymphocyte ratio (PLR), and the systemic immune-inflammation index (SII).7–9 These measures are readily available, low-cost, and obtainable early after admission, offering practical biological signals for early risk stratification in older patients hospitalized with AECOPD.7,9

As electronic health records and other clinical datasets have expanded, machine learning (ML) has become a common approach for building readmission risk models. Compared with conventional regression, ML can more readily combine many predictors and accommodate non-linear patterns and interactions, which may enhance the accuracy of patient-level risk stratification.10–12 However, existing ML studies on COPD/AECOPD readmission still face barriers to clinical translation, including limited external validation, reliance on in-hospital course or post-discharge variables that hinder early implementation at admission, and insufficient interpretability of key risk drivers.12–14

These limitations reflect broader methodological concerns in the field. Recent systematic reviews have categorized artificial intelligence applications in COPD into three major modalities: imaging-based diagnosis, physiological signal monitoring where respiratory sound analysis has emerged as particularly effective, and multi-modal fusion frameworks that integrate clinical, imaging, and genomic data.15 However, the task of short-term prognostic stratification after acute exacerbation remains comparatively underexplored. A recent meta-analysis of 71 prediction models found that 57.9% exhibited high risk of bias and none had undergone formal clinical utility evaluation.16 Furthermore, while candidate predictors for AECOPD have expanded considerably, promising laboratory indicators such as the neutrophil-to-lymphocyte ratio and D-dimer have yet to be incorporated into a rigorously externally validated prediction framework.17 This gap between the expanding predictor landscape and the scarcity of valid, clinically tested tools highlights the need for models designed from the outset for early bedside deployment and rigorous external evaluation.

Against this background, the present study focuses on hospitalized older adults (≥65 years) with AECOPD. By integrating routinely available early-admission clinical information with inflammation-related indicators, we will compare multiple models, conduct external validation, and perform interpretability analyses. Our aim is to develop a generalizable, explainable, and clinically practical tool for predicting 90-day readmission risk.

Methods

Ethics Statement

This retrospective study used a development cohort from the Affiliated Hospital of North Sichuan Medical College and received approval from the hospital’s Ethics Committee (Approval No. 2025ER707-1). Given the retrospective nature of the study and the use of de-identified data, the requirement for informed consent was waived by the hospital in accordance with local ethical oversight requirements and institutional policies. In line with these requirements and policies, the Dazhou Hospital of Integrated Traditional Chinese and Western Medicine (external validation cohort) conducted only secondary analyses of previously collected, de-identified clinical data. As the study met the criteria for an ethics exemption, no additional ethical review was required. All data were anonymized before statistical analysis. The study was conducted in accordance with the Declaration of Helsinki and applicable local laws and regulations.

Study Population

This retrospective, multicenter study enrolled patients aged ≥65 years with COPD who were hospitalized in the Department of Respiratory and Critical Care Medicine at the Affiliated Hospital of North Sichuan Medical College between January 2023 and June 2024, as well as patients admitted to Dazhou Hospital of Integrated Traditional Chinese and Western Medicine between January and June 2024. COPD was defined as a post-bronchodilator FEV1/FVC ratio <0.70 confirmed by spirometry, or a previously documented diagnosis recorded in the medical chart.18 Acute exacerbation was characterized by worsening respiratory symptoms requiring additional pharmacological treatment beyond the patient’s regular maintenance therapy.

Exclusion criteria were: (1) coexisting asthma, active pulmonary tuberculosis, or any malignancy; (2) hematologic diseases (eg., acute myeloid leukemia); (3) autoimmune disorders; (4) other acute illnesses (eg., acute myocardial infarction); (5) severe psychiatric or psychological conditions that could affect data reliability or outcome ascertainment; (6) in-hospital death; and (7) missing key outcome data or essential covariates that could not be reasonably imputed.

Outcome Definition and Ascertainment

The primary outcome was readmission for AECOPD within 90 days of discharge from the index hospitalization. A readmission was counted when a new AECOPD-related inpatient stay was registered during the 90-day period after the index discharge date. Events were identified by querying and confirming inpatient records in each center’s hospital information system (HIS) and/or electronic medical record (EMR). Both centers followed the same extraction rules and used the same method to define the 90-day window to keep outcome ascertainment consistent. Because only in-house information systems were available, admissions to other hospitals could not be captured.

Data Collection and Processing

Candidate predictors were drawn from information routinely available at admission: (1) demographics (age, sex, smoking history, and body mass index (BMI); (2) comorbidities (hypertension, cardiovascular disease, and diabetes); (3) laboratory tests (complete blood count with differential, platelet count, hemoglobin (Hb), red cell distribution width (RDW), coagulation markers including D-dimer, and biochemistry such as albumin and creatinine); (4) respiratory failure (RF), defined as respiratory failure indicated by arterial blood gas analysis at admission; and (5) frequent exacerbations (FE), defined as ≥2 hospitalizations for acute exacerbations in the past year. To improve bedside usability and minimize information leakage, we restricted predictors to early admission data. Laboratory values were taken from the first results obtained within 24 hours of admission; if multiple measurements were available in that period, the earliest was used. Comorbidity data were derived from prior medical history and discharge diagnoses, and were cross-checked against supporting examinations when available.

Data from the development cohort and the external validation cohort were cleaned and preprocessed separately. Variable derivation was performed independently within each cohort using the same prespecified definitions to ensure consistency in variable meaning. For categorical variables, factor levels were harmonized across the two cohorts. To reduce multicollinearity introduced by component variables and to improve interpretability, inflammatory composite indices were calculated and included when the relevant blood cell counts were available: the neutrophil-to-lymphocyte ratio (NLR; absolute neutrophil count/absolute lymphocyte count), the platelet-to-lymphocyte ratio (PLR; platelet count/absolute lymphocyte count), and the systemic immune-inflammation index (SII; platelet count × absolute neutrophil count / absolute lymphocyte count).7–9 In subsequent modeling, the component variables—absolute neutrophil count, absolute lymphocyte count, and platelet count —were excluded.

Missing data were handled separately in the development cohort and the external validation cohort to preserve each center’s missingness patterns as much as possible and to avoid bias that could arise from cross-center imputation. To enhance the model’s usability in multicenter external validation and to limit uncertainty introduced by variables with substantial missingness, we prespecified an inclusion rule before modeling: any variable with a missing rate >30% in either cohort was excluded from the candidate predictor set. For the remaining variables, missing values were imputed within each cohort using multiple imputation by chained equations (MICE). Continuous variables were imputed using predictive mean matching, whereas binary variables were imputed using logistic regression.

Statistical Analysis

All statistical analyses were conducted using R (version 4.5.1). Data from the Affiliated Hospital of North Sichuan Medical College served as the development cohort, and data from Dazhou Hospital of Integrated Traditional Chinese and Western Medicine served as the external validation cohort. Baseline characteristics were summarized separately for the development and validation cohorts. Continuous variables are reported as mean ± standard deviation, and categorical variables as number (percentage). The baseline table was intended to describe differences in case mix between cohorts and was not used for hypothesis testing, feature selection, or model-development decisions. Candidate predictors entered into modeling were based on the results of the data-processing stage (cleaning, harmonized coding, and variable construction). Variable definitions and factor levels for categorical variables were kept consistent across cohorts to ensure comparability between model development and external validation. The overall study workflow and the modeling–calibration–validation framework are shown in Figure 1.

A flowchart of a study workflow from data collection to model evaluation.

Figure 1 Study workflow. After preprocessing and multiple imputation, patients were divided into a development cohort (n=513) and an external validation cohort (n=179). Predictors were selected using stability selection with Elastic Net. Five models were developed, internally validated, and externally tested. Performance was evaluated by ROC, calibration, decision curve analysis, and SHAP.

To mitigate multicollinearity and improve model stability, collinearity diagnostics were performed in the development cohort before model building. Variance inflation factors (VIFs) were calculated from a design matrix that included one-hot encoding for categorical variables; the maximum and mean VIF were summarized at the variable level. In addition, the condition number (κ) of the standardized design matrix was computed, and pairwise correlation coefficients were examined to identify highly correlated terms (|r| ≥ 0.9 indicating strong correlation).19 When substantial collinearity was suspected, variables were handled based on clinical relevance and data completeness to support robust downstream model fitting.

Next, predictor selection in the development cohort was performed using stability selection combined with Elastic Net. Under outcome stratification, 50% subsamples were randomly drawn, repeated 500 times. Within each subsample, an Elastic Net logistic regression model (α = 0.5) was fitted, and the penalty parameter was chosen by 10-fold cross-validation using lambda.1se Predictor stability was quantified as the proportion of resamples in which a variable was selected.20 A prespecified selection-frequency threshold of ≥0.90 was applied, yielding a fixed set of predictors for subsequent model development.

The five candidate models were selected to represent both conventional statistical classifiers and machine-learning algorithms with different assumptions. Logistic regression was included as a clinically interpretable benchmark model. Naïve Bayes and linear discriminant analysis were included as simple probabilistic and linear classification methods, respectively, whereas GBM and XGBoost were selected to capture potential non-linear associations and feature interactions. Elastic Net was used for feature selection because it combines L1 and L2 penalties and is suitable for clinical datasets with potentially correlated predictors. The Elastic Net mixing parameter was set at α = 0.5 to balance variable sparsity and coefficient shrinkage, and the penalty parameter λ was determined by 10-fold cross-validation using the lambda.1se criterion. Stability selection was performed with 500 repeated stratified 50% subsamples, and predictors with a selection frequency ≥0.90 were retained.

For tree-based models, hyperparameters were tuned in the development cohort using stratified cross-validation. For XGBoost, the tuned parameters included the number of boosting rounds, maximum tree depth, learning rate, minimum child weight, subsampling ratio, and column subsampling ratio. For GBM, the tuned parameters included the number of trees, interaction depth, shrinkage, and minimum observations per terminal node. Early stopping or optimal tree-number selection was applied to reduce overfitting. The final hyperparameters were fixed after model development and were not re-estimated in the external validation cohort.

After the final predictor set was established, five types of predictive models were developed and compared in the development cohort: multivariable logistic regression, naïve Bayes, extreme gradient boosting (XGBoost), linear discriminant analysis (LDA), and gradient boosting machine (GBM). For XGBoost and GBM, models were trained on the one-hot–encoded design matrix, with complexity controlled through cross-validation and early stopping or optimal tree-number selection to reduce overfitting.21 To obtain prediction probabilities that better reflect expected generalization performance and to support consistent calibration, we used 5-fold stratified cross-validation to generate out-of-fold (OOF) predicted probabilities for each model. Platt scaling (estimating an intercept and slope) was then applied to produce calibrated probabilities. Calibration parameters were estimated in the development cohort, fixed thereafter, and applied unchanged in the external validation cohort.22

Discrimination was assessed using the area under the receiver operating characteristic curve (AUC), alongside accuracy, sensitivity, specificity, and the Brier score. The classification threshold was determined in the development cohort based on calibrated OOF probabilities and was then carried forward to the external validation cohort. Internal validation was performed using stratified bootstrap resampling (500 iterations) to estimate optimism and compute an optimism-corrected AUC. We validated the model’s generalizability in the validation cohort using the same metrics, visualizing results with ROC and calibration curves, and assessing clinical net benefit via decision curve analysis. To enhance interpretability, we applied SHAP to the final model, calculating SHAP values for each feature. We summarized feature importance by mean absolute SHAP values and used SHAP summary plots, heatmaps, and dependence plots to analyze feature contributions, individual patterns, and interactions, clarifying model behavior and identifying key risk drivers.

Results

Baseline Characteristics

Table 1 summarizes the baseline characteristics of the two cohorts. A total of 692 patients were included in this study, comprising 513 in the development cohort and 179 in the external validation cohort. The two cohorts were broadly comparable in age and sex distribution (age: 76.1 ± 6.31 vs. 76.3 ± 4.98 years; female: 34.7% vs. 34.1%). Patients in the external validation cohort had a slightly higher BMI (21.9 ± 2.41 vs. 21.3 ± 2.86 kg/m2), a higher proportion of smokers (60.3% vs. 47.6%), and a higher prevalence of frequent exacerbations (FE) (45.3% vs. 29.4%). The 90-day readmission rate was also modestly higher in the external validation cohort (24.6% vs. 20.7%). The distribution of comorbidities was largely similar between cohorts (hypertension: 32.4% vs. 31.0%; cardiovascular disease: 45.8% vs. 50.5%; diabetes: 16.8% vs. 16.8%). Likewise, the proportions of non-invasive ventilation (NIV) use and respiratory failure (RF) were comparable (NIV: 63.1% vs. 60.8%; RF: 55.9% vs. 50.7%). Laboratory and arterial blood gas measures were generally consistent across cohorts, although the external validation cohort showed slightly lower PaO2 and higher PaCO2, along with modestly higher D-dimer and creatinine levels. The composite inflammatory indices (NLR, PLR, and SII) were overall similar between the two cohorts.

Table 1 Demographic and Clinical Characteristics of the Cohort

Variable Screening

In the development cohort, predictor selection was performed using stability selection (stratified subsampling) combined with Elastic Net. This process resulted in a fixed set of six predictors: NLR, SII, D, FE, BMI, and ALB. After the predictor set was finalized, we further examined multicollinearity. The absolute values of pairwise correlation coefficients among all continuous variables were below 0.90, and variance inflation factors (VIFs) for all predictors in the multivariable logistic regression framework were <5. These findings indicate no meaningful multicollinearity, and all six predictors were therefore retained for subsequent model development and validation.

Model Performance Comparison

After fixing the final predictor set and completing probability calibration, we compared the discriminative performance, internal robustness, and external transportability of five models in both the development and external validation cohorts (Figures 2 and 3; Table 2). In the development cohort, the ROC curves of all five models lay clearly above the diagonal reference line (Figure 2A), indicating good overall discrimination. Among them, XGBoost achieved the best performance (AUC = 0.892). GBM, logistic regression, and LDA showed similar discrimination (AUC approximately 0.86–0.87), whereas naïve Bayes yielded a comparatively lower AUC (0.836). In addition to discrimination, we assessed overall probabilistic error using the Brier score. The results suggested meaningful differences across models, with XGBoost showing a lower overall prediction error and a more balanced performance profile (Table 2).

Table 2 Performance of Five Models in Development and External Cohorts

Two line graphs showing receiver operating characteristic curves for five prediction models in two cohorts.

Figure 2 Receiver operating characteristic (ROC) curves of the five prediction models. (A) Development cohort. (B) External validation cohort. The diagonal dashed line indicates no discrimination. XGBoost achieved the highest area under the curve (AUC) in both cohorts.

Two line graphs showing model calibration curves and decision curve analysis for a prediction model.

Figure 3 Calibration and clinical utility in the external validation cohort. (A) Calibration curves for logistic regression, naive Bayes, XGBoost, linear discriminant analysis, and gradient boosting machine; the dashed line indicates ideal calibration. (B) Decision curve analysis of the XGBoost model compared with the treat-all and treat-none strategies.

To evaluate overfitting and internal reproducibility, we used stratified bootstrap resampling for internal validation and calculated optimism-corrected AUCs. The corrected AUCs closely matched the apparent AUCs, indicating minimal overfitting. XGB was the best performer (corrected AUC = 0.864), followed by logistic regression and LDA, with NB and GBM having lower values.

In the external validation cohort, all models showed acceptable AUC values, demonstrating good generalizability (Figure 2B). XGB had the highest AUC at 0.847, followed by LDA and logistic regression with AUCs of 0.843 and 0.836, respectively. NB and GBM had slightly lower AUCs at 0.833 and 0.829. While AUCs were slightly lower than in the development cohort, the models’ relative rankings remained consistent, indicating stable performance.

In summary, the calibration plots indicated that all models demonstrated satisfactory concordance between predicted and observed risks within the external cohort (Figure 3A). Regarding clinical utility, decision curve analysis revealed that the XGB model offered a greater net benefit compared to the “treat-all” and “treat-none” strategies across various threshold probabilities in the external cohort (Figure 3B), underscoring its potential clinical applicability. Consequently, we selected the XGB model as the primary model for subsequent interpretability analyses based on SHAP values.

Model Interpretation

After identifying XGB as the primary model, we used SHAP to clarify how each predictor contributed to the model output and to examine possible non-linear patterns (Figures 4 and 5). In the global importance ranking, all six predictors contributed consistently, with D-dimer, NLR, and frequent exacerbations (FE) showing the highest mean absolute SHAP values, indicating that they were the main drivers of prediction. BMI and ALB also had notable influence, whereas SII contributed less overall (Figure 4A).

Three SHAP plots: importance bars, beeswarm and heatmap with columns SII, ALB, BMI, FE, NLR, D; no cell numbers.

Figure 4 SHAP-based interpretation of the XGBoost model. (A) Global feature importance ranked by mean absolute SHAP value. (B) SHAP beeswarm plot showing the direction and magnitude of feature effects across patients. (C) SHAP heatmap showing patient-level contribution patterns. D-dimer, neutrophil-to-lymphocyte ratio, and frequent exacerbations were the main contributors.

Three SHAP dependence plots for D-dimer, neutrophil-to-lymphocyte ratio and albumin in the XGBoost model.

Figure 5 SHAP dependence plots for key predictors in the XGBoost model. (A) D-dimer, colored by albumin. (B) Neutrophil-to-lymphocyte ratio, colored by D-dimer. (C) Albumin, colored by D-dimer. Positive SHAP values indicate higher predicted risk of 90-day readmission, and negative values indicate lower predicted risk.

The SHAP beeswarm plot further showed clear between-patient variation in how features affected risk. In general, higher D-dimer and NLR values were more likely to push SHAP values upward and increase predicted risk, while lower values tended to pull risk downward. FE (yes) was associated with a positive contribution in most patients, consistent with a higher readmission risk among those with frequent prior exacerbations. For ALB and BMI, the direction of contribution changed across value ranges, suggesting non-linear effects (Figure 4B). Patient-level SHAP heatmaps also indicated that contribution patterns were not uniform across individuals, supporting patient-specific explanation pathways for risk stratification (Figure 4C).

To explore non-linear effects and interactions among key variables, we created SHAP dependence plots (Figure 5). D-dimer showed a threshold effect: its risk contribution was negative or near zero at low levels but increased rapidly before plateauing as levels rose, with variations potentially influenced by ALB levels (Figure 5A). NLR also had a non-linear association, with minimal or negative impact at low values, a sharp increase beyond a certain point, and stabilization at higher levels, possibly interacting with D-dimer (Figure 5B). ALB generally had an inverse relationship with SHAP values, where lower ALB increased risk contribution and higher ALB reduced it, suggesting higher albumin might lower readmission risk, with this pattern possibly affected by D-dimer levels (Figure 5C).

In summary, the SHAP analysis reveals that the predictions generated by the XGB model are collectively influenced by markers related to inflammation, history of prior exacerbations, and indicators of coagulation and nutrition. Furthermore, the findings underscore the presence of non-linear effects and potential interactions among principal predictors, thereby offering a comprehensible foundation for clinical interpretation and risk stratification.

Discussion

We developed and externally validated a 90-day readmission risk prediction model using two hospital cohorts of patients aged ≥65 years who were admitted for AECOPD. Predictor selection was carried out with stability selection (stratified subsampling) combined with Elastic Net, yielding a fixed set of six key variables (NLR, SII, D-dimer, FE, BMI, and ALB). We then compared the performance of five modeling approaches: logistic regression, naïve Bayes, LDA, GBM, and XGBoost. Overall, XGBoost delivered the best performance in the development cohort, remained robust after bootstrap internal validation, and showed acceptable discrimination and calibration in the external cohort. It also demonstrated lower overall prediction error (Brier score) and potential clinical utility. SHAP analyses further indicated that model predictions were mainly driven by D-dimer, NLR, and FE, with evidence of non-linear effects and possible interactions. Together, these findings support the use of the model for early, admission-stage identification of patients at high risk of 90-day readmission.

Building on the interpretability findings, we propose that the three leading predictors map onto three key pathophysiological dimensions underlying 90-day readmission risk in older patients with AECOPD: inflammatory burden (NLR), an exacerbation-prone phenotype (FE), and coagulation/endothelial stress related to injury (D-dimer).

First, NLR, derived from routine complete blood counts, reflects both an intensified neutrophil-driven pro-inflammatory response and the immunoregulatory suppression suggested by lymphopenia. As such, it is often regarded as a simple and practical biomarker capturing the balance between inflammatory burden and immune status.23,24 In the context of AECOPD, an elevated NLR typically corresponds to more pronounced systemic inflammatory activation and a less favorable acute clinical course, and has been linked to a higher risk of short-term adverse outcomes. This provides a physiologically plausible explanation for delayed post-discharge recovery and a greater likelihood of subsequent decompensation.24,25 Moreover, in cohorts of older adults (≥65 years) hospitalized with AECOPD, NLR has shown a positive association with 90-day readmission risk (multivariable OR ≈ 1.57; reaching ≈ 1.75 in higher-risk subgroups). Systematic reviews have likewise reported a markedly increased readmission risk when NLR exceeds 7 (pooled OR ≈ 1.93).26,27 These findings align with the dominant risk-stratifying direction of NLR observed in our model.

Second, FE reflects a relatively stable frequent-exacerbator phenotype, indicating poorer long-term disease control and greater baseline vulnerability. This phenotype likely arises from multiple, overlapping factors, including persistent airway inflammation and structural remodeling, bacterial colonization and a propensity for recurrent infections, a heavier burden of comorbidities with reduced functional reserve, and differences in adherence and self-management capacity after discharge. Accordingly, the consistently positive risk contribution of FE observed in our model is clinically plausible.28–30 Prior work also supports a clear “dose–response” pattern in effect size: patients with ≥2 hospitalizations for AECOPD in the previous year have a markedly higher risk of subsequent readmission (crude OR ≈ 4.10), and the risk increases further among those with ≥3 hospitalizations (crude OR ≈ 7.51).31 These findings suggest that a history of exacerbation-related admissions is among the most informative clinical markers for predicting short-term readmission.

Finally, D-dimer, a marker of increased fibrin formation and breakdown, may reflect joint activation of the inflammation–endothelial activation–coagulation axis during AECOPD.32,33 Prior studies have reported higher coagulation/fibrinolysis markers during exacerbations, supporting D-dimer as a signal of systemic stress and possible microcirculatory impairment.32,33 Most evidence links D-dimer to acute severity or longer-term outcomes; for example, Hu et al found that higher D-dimer was associated with increased in-hospital events and 1-year adverse outcomes.34 In our study, we interpret D-dimer as a measurable indicator of acute systemic stress and disease-severity burden: elevated levels may imply slower recovery and greater susceptibility to post-discharge decompensation, aligning with its SHAP-derived risk contribution to 90-day readmission.

It should be emphasized that the incremental AUC improvement of XGBoost over logistic regression was modest, especially in the external validation cohort. Therefore, the value of XGBoost in this study should not be interpreted solely as a large gain in discrimination. Rather, its selection as the primary model was based on a more comprehensive performance profile, including discrimination, calibration, Brier score, decision-curve-derived net benefit, and interpretability. The relatively similar AUCs across models also suggest that part of the predictive information was captured by a small set of clinically meaningful variables, rather than by algorithmic complexity alone. This finding supports the robustness of the selected predictor panel and indicates that the model may be practically usable in clinical settings. The potential clinical application of this model lies in early risk stratification during hospitalization. Because all predictors were derived from routinely available admission data, the model could be embedded into the hospital information system or electronic medical record to automatically estimate 90-day readmission risk after the first laboratory results are available. Patients classified as high risk could be considered for intensified discharge planning, medication and inhaler-technique optimization, nutritional and rehabilitation assessment, earlier outpatient follow-up, and closer post-discharge monitoring. Nevertheless, the model should be used as a clinical decision-support tool rather than a substitute for physician judgment. Future prospective implementation studies are needed to determine whether model-guided management can reduce readmission rates and improve patient outcomes.

Compared with prior work, the practical value of this study lies in its closer alignment with the real-world need for short-term post-discharge risk control in older patients hospitalized for AECOPD. Many existing scoring systems and prediction studies in COPD/AECOPD focus on in-hospital severe outcomes (eg., in-hospital mortality, need for invasive ventilation, or ICU care) or on longer-term endpoints. For example, Zhang et al developed an explainable machine-learning model to predict hospitalization death in hospitalized patients with AECOPD.35 In contrast, our study targets 90-day readmission after discharge in older (≥65 years) patients and emphasizes routinely available early-admission variables with two-center development and external validation. These targets are useful for assessing acute severity, but they do not fully match the clinical meaning of 90-day readmission, a short-term outcome that more directly reflects inadequate disease control and patient vulnerability after discharge. For example, DECAF is primarily designed for stratifying the risk of in-hospital mortality, whereas BAP-65 places greater emphasis on acute severity and adverse in-hospital outcomes.36–38 In addition, a systematic review of COPD readmission prediction models and AI-based studies has highlighted recurring limitations—such as reliance on single-center data, insufficient external validation, and substantial methodological heterogeneity—which can undermine model transportability and bedside usability.16 Against this background, we prespecified 90-day readmission as the prediction target and deliberately restricted predictors to routine information available early after admission, aiming to minimize information leakage while improving clinical deployability.

Several limitations should be acknowledged. First, as a retrospective observational study, selection bias and residual confounding cannot be fully eliminated despite both internal and external validation. Moreover, readmissions were identified only through rehospitalization records within each hospital’s own information system; readmissions occurring at other institutions were not captured, which may underestimate the true readmission rate and limit generalizability. Accordingly, our findings should be interpreted as predictive associations rather than causal effects. Second, because participants were drawn from two hospitals within the same region, broader validation in larger, multicenter prospective cohorts across different areas is still needed. Third, key laboratory measurements were taken from the first test within 24 hours of admission, but differences in laboratory platforms and pre-analytical procedures across centers may introduce measurement variability. Fourth, some post-discharge management factors (eg., adherence, rehabilitation, and access to follow-up) were incompletely documented in retrospective records, potentially limiting the model’s coverage of modifiable risks. Finally, SHAP provides contribution-based explanations under the current data distribution and should not be interpreted as effect sizes or evidence of causal mechanisms; prospective impact studies are required before routine clinical implementation.

In summary, we developed and externally validated a 90-day readmission prediction model in two cohorts of hospitalized patients aged ≥65 years with AECOPD, using routinely available clinical indicators obtained early after admission. The model maintained acceptable discrimination and calibration in the external cohort and showed a low overall prediction error. Interpretability analyses suggested that readmission risk was primarily driven by coagulation- and inflammation-related burden, together with a frequent-exacerbator phenotype. This tool may support early in-hospital risk stratification and guide post-discharge management decisions, although larger, multicenter prospective studies across diverse regions are still needed to confirm its clinical impact.

Data Sharing Statement

The datasets used and/or analyzed in this study were derived from multiple cohorts and are currently being used for ongoing analyses. Due to institutional policies and ethics-related restrictions, the datasets are not publicly available. De-identified data may be provided by the corresponding author upon reasonable request, subject to appropriate institutional/ethics approvals and applicable regulations.

Author Contributions

All authors made a significant contribution to the work reported, whether that is in the conception, study design, execution, acquisition of data, analysis and interpretation, or in all these areas; took part in drafting, revising or critically reviewing the article; gave final approval of the version to be published; have agreed on the journal to which the article has been submitted; and agree to be accountable for all aspects of the work.

Funding

This study was jointly supported by the Sichuan Medical Technology Innovation Research Association (YCH-KY-YCZD2024-287), the National Key R&D Program of China (2023ZD0506106), Nanchong Municipal Science and Technology Bureau (22ZXKTYJ0001), Guang’an District Bureau of Education, Science and Technology, and Sports (2024SYF15),the Industry–Academia–Research Innovation Fund for Chinese Universities—Smart Healthcare Innovation Program (Industry Association Special Fund) (2025XH022) and the Affiliated Hospital of North Sichuan Medical College Intra-Hospital Research Project (2024PTZK002).

Disclosure

The authors declare no conflicts of interest in this study.

References

1. Safiri S, Carson-Chahhoud K, Noori M, et al. Burden of chronic obstructive pulmonary disease and its attributable risk factors in 204 countries and territories, 1990-2019: results from the Global Burden of Disease Study 2019. BMJ. 2022;378:e069679. doi:10.1136/bmj-2021-069679

2. Witt LJ, Wroblewski KE, Pinto JM, et al. Beyond the lung: geriatric conditions afflict community-dwelling older adults with self-reported chronic obstructive pulmonary disease. Front Med Lausanne. 2022;9:814606. doi:10.3389/fmed.2022.814606

3. Global Initiative for Chronic Obstructive Lung Disease (GOLD). Global strategy for the diagnosis, management, and prevention of COPD: 2024 report. GOLD; 2024. Available from: https://goldcopd.org/2024-gold-report/. Accessed February 7, 2026.

4. Alqahtani JS, Njoku CM, Bereznicki B, et al. Risk factors for all-cause hospital readmission following exacerbation of COPD: a systematic review and meta-analysis. Eur Respir Rev. 2020;29(156):190166. doi:10.1183/16000617.0166-2019

5. Genao L, Durheim MT, Mi X, Todd JL, Whitson HE, Curtis LH. Early and long-term outcomes of older adults after acute care encounters for chronic obstructive pulmonary disease exacerbation. Ann Am Thorac Soc. 2015;12(12):1805–16. doi:10.1513/AnnalsATS.201504-250OC

6. Hurst JR, Perera WR, Wilkinson TM, Donaldson GC, Wedzicha JA. Systemic and upper and lower airway inflammation at exacerbation of chronic obstructive pulmonary disease. Am J Respir Crit Care Med. 2006;173(1):71–78. doi:10.1164/rccm.200505-704OC

7. Fang L, Zhu J, Fu D. Predictive value of neutrophil-lymphocyte ratio for all-cause mortality in patients with chronic obstructive pulmonary disease: a systematic review and meta-analysis. BMC Pulm Med. 2025;25(1):206. doi:10.1186/s12890-025-03677-y

8. Huang Y, Feng T, Song X, et al. Explainable machine learning model for 1-year readmission risk prediction in AECOPD patients: integrating relief feature selection with sample augmentation. BMC Infect Dis. 2025;25(1):1748. doi:10.1186/s12879-025-12010-1

9. Yi Q, Chen X, Liu S, et al. Systemic immune-inflammation index is associated with adverse outcomes in patients hospitalized for AECOPD: a multicenter cohort study. Respiration. 2025;104(9):648–666. doi:10.1159/000545267

10. Min X, Yu B, Wang F. Predictive modeling of the hospital readmission risk from patients’ claims data using machine learning: a case study on COPD. Sci Rep. 2019;9(1):2362. doi:10.1038/s41598-019-39071-y

11. Alvarez-Romero C, Martinez-Garcia A, Ternero Vega J, et al. Predicting 30-day readmission risk for patients with chronic obstructive pulmonary disease through a federated machine learning architecture on Findable, Accessible, Interoperable, and Reusable (FAIR) data: development and validation study. JMIR Med Inform. 2022;10(6):e35307. doi:10.2196/35307

12. Li M, Cheng K, Ku K, Li J, Hu H, Ung COL. Modelling 30-day hospital readmission after discharge for COPD patients based on electronic health records. NPJ Prim Care Respir Med. 2023;33(1):16. doi:10.1038/s41533-023-00339-6

13. Niraula P, Upreti M, Kadariya S, Poudel B, Kadariya S, Kunwar S. AI/ML driven prediction of COPD exacerbations and readmissions: a systematic review and meta-analysis. Front Digit Health. 2025;7:1641356. doi:10.3389/fdgth.2025.1641356

14. Collins GS, Moons KGM, Dhiman P, et al. TRIPOD+AI statement: updated guidance for reporting clinical prediction models that use regression or machine learning methods. BMJ. 2024;385:e078378. doi:10.1136/bmj-2023-078378

15. Sait ARW, Shaikh MA. Artificial intelligence-powered chronic obstructive pulmonary disease detection techniques–a review. Diagnostics. 2025;15(20):2562. doi:10.3390/diagnostics15202562

16. Joseph T, Raveendra K, Haris M, et al. Community-based prediction models of cardiovascular events, acute exacerbations and all-cause mortality in individuals with chronic obstructive pulmonary disease: a systematic review and meta-analysis on behalf of the International Cardiovascular and Respiratory Alliance. BMJ Open Respir Res. 2026;13(1). doi:10.1136/bmjresp-2025-003752

17. Zhu Y, Wang M, Gu XN, Wang C, Deng SM. Development and validation of the machine learning model for acute exacerbation of chronic obstructive pulmonary disease prediction based on inflammatory biomarkers. Front Med Lausanne. 2025;12:1616712. doi:10.3389/fmed.2025.1616712

18. Bonomo M, Hermsen MG, Kaskovich S, et al. Using machine learning to predict likelihood and cause of readmission after hospitalization for chronic obstructive pulmonary disease exacerbation. Int J Chron Obstruct Pulmon Dis. 2022;17:2701–2709. doi:10.2147/COPD.S379700

19. Agustí A, Celli BR, Criner GJ, et al. Global initiative for chronic obstructive lung disease 2023 report: GOLD executive summary. Am J Respir Crit Care Med. 2023;207(7):819–837. doi:10.1164/rccm.202301-0106PP

20. Gregorich M, Strohmaier S, Dunkler D, Heinze G. Regression with highly correlated predictors: variable omission is not the solution. Int J Environ Res Public Health. 2021;18(8):4259. doi:10.3390/ijerph18084259

21. Khan MHR, Bhadra A, Howlader T. Stability selection for lasso, ridge and elastic net implemented with AFT models. Stat Appl Genet Mol Biol. 2019;18(5). doi:10.1515/sagmb-2017-0001

22. Walsh CG, Sharman K, Hripcsak G. Beyond discrimination: a comparison of calibration methods and clinical usefulness of predictive models of readmission risk. J Biomed Inform. 2017;76:9–18. doi:10.1016/j.jbi.2017.10.008

23. Paliogiannis P, Fois AG, Sotgia S, et al. Neutrophil to lymphocyte ratio and clinical outcomes in COPD: recent evidence and future perspectives. Eur Respir Rev. 2018;27(147):170113. doi:10.1183/16000617.0113-2017

24. Zinellu A, Zinellu E, Mangoni AA, et al. Clinical significance of the neutrophil-to-lymphocyte ratio and platelet-to-lymphocyte ratio in acute exacerbations of COPD: present and future. Eur Respir Rev. 2022;31(166):220095. doi:10.1183/16000617.0095-2022

25. Çoban Ağca M, Aksoy E, Duman D, et al. Does eosinophilia and neutrophil to lymphocyte ratio affect hospital re-admission in cases of COPD exacerbation? KOAH alevlenmesinde eozinofil ve nötrofil/lenfosit oranı hastaneye tekrar başvuruyu etkiler mi? Tuberk Toraks. 2017;65(4):282–290. doi:10.5578/tt.57278

26. Li L, Feng Q, Yang C. The D-dimer to albumin ratio could predict hospital readmission within one year in patients with acute exacerbation of chronic obstructive pulmonary disease. Int J Chron Obstruct Pulmon Dis. 2024;19:2587–2597. doi:10.2147/COPD.S481483

27. Lin P, Shen C, Li Q, et al. A systematic review and meta-analysis of chronic obstructive pulmonary disease in Asia: risk factors for readmission and readmission rate. BMC Pulm Med. 2024;24(1):388. doi:10.1186/s12890-024-03203-6

28. Hurst JR, Vestbo J, Anzueto A, et al. Susceptibility to exacerbation in chronic obstructive pulmonary disease. N Engl J Med. 2010;363(12):1128–1138. doi:10.1056/NEJMoa0909883

29. Wedzicha JA, Brill SE, Allinson JP, Donaldson GC. Mechanisms and impact of the frequent exacerbator phenotype in chronic obstructive pulmonary disease. BMC Med. 2013;11:181. doi:10.1186/1741-7015-11-181

30. Le Rouzic O, Roche N, Cortot AB, et al. Defining the “Frequent exacerbator” phenotype in COPD: a hypothesis-free approach. Chest. 2018;153(5):1106–1115. doi:10.1016/j.chest.2017.10.009

31. Lu L, Chen M, Xing X, et al. Is the incident of once chronic obstructive pulmonary disease related admission a high risk for readmission in the future? J Thorac Dis. 2023;15(6):3133–3142. doi:10.21037/jtd-22-1504

32. Polosa R, Cacciola RR, Prosperini G, Spicuzza L, Morjaria JB, Di Maria GU. Endothelial-coagulative activation during chronic obstructive pulmonary disease exacerbations. Haematologica. 2008;93(8):1275–1276. doi:10.3324/haematol.12473

33. Polosa R, Malerba M, Cacciola RR, et al. Effect of acute exacerbations on circulating endothelial, clotting and fibrinolytic markers in COPD patients. Intern Emerg Med. 2013;8(7):567–574. doi:10.1007/s11739-011-0636-1

34. Hu G, Wu Y, Zhou Y, et al. Prognostic role of D-dimer for in-hospital and 1-year mortality in exacerbations of COPD. Int J Chron Obstruct Pulmon Dis. 2016;11:2729–2736. doi:10.2147/COPD.S112882

35. Zhang Y, Zheng S, Wang D, et al. Development and validation of a machine learning-based model to predict the risk of hospitalization death in hospitalized patients with AECOPD. Sci Rep. 2025;15(1):35918. doi:10.1038/s41598-025-19810-0

36. Steer J, Gibson J, Bourke SC. The DECAF Score: predicting hospital mortality in exacerbations of chronic obstructive pulmonary disease. Thorax. 2012;67(11):970–976. doi:10.1136/thoraxjnl-2012-202103

37. Echevarria C, Steer J, Heslop-Marshall K, et al. Validation of the DECAF score to predict hospital mortality in acute exacerbations of COPD. Thorax. 2016;71(2):133–140. doi:10.1136/thoraxjnl-2015-207775

38. Shorr AF, Sun X, Johannes RS, Derby KG, Tabak YP. Predicting the need for mechanical ventilation in acute exacerbations of chronic obstructive pulmonary disease: comparing the CURB-65 and BAP-65 scores. J Crit Care. 2012;27(6):564–570. doi:10.1016/j.jcrc.2012.02.015

Creative Commons License © 2026 The Author(s). This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms and incorporate the Creative Commons Attribution - Non Commercial (unported, 4.0) License. By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms.