Back to Journals » Clinical Interventions in Aging » Volume 21

Association Between Preoperative Activities of Daily Living and Postoperative Delirium in Elderly Patients Undergoing Non-Cardiac Surgery: A Prospective Cohort Study

Authors Liu Y ORCID logo, Zhu Z, Dang X, Qu T, Zhang W, Su X ORCID logo

Received 31 October 2025

Accepted for publication 3 March 2026

Published 7 March 2026 Volume 2026:21 578062

DOI https://doi.org/10.2147/CIA.S578062

Checked for plagiarism Yes

Review by Single anonymous peer review

Peer reviewer comments 2

Editor who approved publication: Dr Maddalena Illario



Yuting Liu,1 Zhuan Zhu,1 Xuzhou Dang,1 Tingwei Qu,2 Wenjie Zhang,2 Xuesen Su3,4

1The College of Anesthesia, Shanxi Medical University, Taiyuan, Shanxi, People’s Republic of China; 2Department of Anesthesiology, First Hospital of Shanxi Medical University, Taiyuan, Shanxi, People’s Republic of China; 3The Department of Anesthesiology, The Fifth Clinical Medical College of Shanxi Medical University, Taiyuan, Shanxi, People’s Republic of China; 4Shanxi Key Laboratory of Geriatric Precision Anesthesia and Complication Prevention, the Fifth Clinical Medical College of Shanxi Medical University, Taiyuan, Shanxi, People’s Republic of China

Correspondence: Xuesen Su, The Department of Anesthesiology, The Fifth Clinical Medical College of Shanxi Medical University, No. 29 Shuangtasi Street, Taiyuan, Shanxi, 030012, People’s Republic of China, Email [email protected] Wenjie Zhang, Department of Anesthesiology, First Hospital of Shanxi Medical University, No. 85 Jiefang South Road, Taiyuan, Shanxi, 030001, People’s Republic of China, Email [email protected]

Background: Postoperative delirium (POD) is a common complication in elderly surgical patients associated with adverse outcomes. However, the relationship between low preoperative Barthel Index (BI) and POD remains unclear.
Purpose: This study investigated the correlation between preoperative BI and POD incidence in elderly patients undergoing non-cardiac surgery.
Patients and Methods: This prospective cohort study enrolled 298 patients (≥ 65 years) undergoing elective non-cardiac surgery under general anesthesia. Preoperative BI assessed activities of daily living (ADL) within 24 hours before surgery. POD was screened twice daily until postoperative day 7 or discharge using the 3-Minute Diagnostic Interview for Confusion Assessment Method–defined Delirium (3D-CAM). Univariate and multivariate logistic regression analyzed associations between BI and POD. Restricted cubic spline (RCS) and two-piecewise linear regression explored non-linearity and identified inflection points. Findings were validated using inverse probability of treatment weighting (IPTW), and BI’s incremental predictive value was evaluated by receiver operating characteristic (ROC) curve analysis.
Results: POD incidence was 25.8% (77/298). Multivariate analysis showed lower preoperative BI (OR=0.956, 95% CI: 0.940– 0.973, p< 0.001), lower BMI (OR=0.860, 95% CI: 0.757– 0.977, p=0.020), and advanced age (OR=1.174, 95% CI: 1.102– 1.250, p< 0.001) independently associated with POD. RCS revealed a nonlinear relationship with inflection at BI=75. Patients with BI< 75 had significantly higher POD risk than those with BI≥ 75 (OR=4.615, 95% CI: 2.377– 8.963, p< 0.001), robust after IPTW adjustment (42.9% vs 14.5%; adjusted OR=4.445, 95% CI: 2.564– 7.707, p< 0.001). This association was pronounced in patients with fewer comorbidities, shorter anesthesia duration, and shorter preoperative stays. Incorporating the BI score significantly improved the discrimination of the predictive model for POD.
Conclusion: Preoperative ADL impairment (BI< 75) independently predicts POD in elderly non-cardiac surgery patients. Preoperative BI assessment is recommended for perioperative risk stratification, identifying low-BI patients as targets for preventive interventions.

Keywords: activities of daily living, barthel index, postoperative delirium, elderly, non-cardiac surgery, 3d-cam, neuropsychological tests

Introduction

Postoperative delirium (POD) is an acute-onset, temporary brain dysfunction occurring primarily within the first week after surgery. Its clinical features include attention deficits, fluctuations in consciousness, and acute changes in cognitive function.1–3 This condition significantly increases the risk of postoperative complications, prolongs hospital stay, exacerbates healthcare burdens, and elevates 30-day readmission rates, thereby adversely affecting patient outcomes.4–6

With the accelerating aging of the population, the number of elderly patients is increasing year by year. Concurrently, alongside advancements in medical technology and the broadening of surgical indications, the rate of non-cardiac surgery in elderly patients is also continuously rising.7 Each year, over 310 million surgeries are performed globally, and elderly patients, who constitute about a quarter of these cases, face a heightened risk of POD.8 Existing research indicates that preoperative cognitive impairment, reduced physiological reserve, multimorbidity, and frailty are important risk factors for postoperative delirium in elderly patients undergoing non-cardiac surgery.9 Simple screening tools such as the GFI, FRAIL scale, or Clinical Frailty Scale can effectively identify high-risk populations,10–12 while intraoperative EEG suppression partially mediates the association between frailty and postoperative delirium.13 Furthermore, research has confirmed that preoperative administration of corticosteroids can mitigate surgery-induced inflammatory responses, thereby reducing the incidence and severity of postoperative cognitive dysfunction (POCD). This suggests that perioperative inflammatory responses may also constitute a significant shared pathological basis for postoperative neurocognitive complications.14

Functional status represents a core dimension of geriatric vulnerability. Activities of Daily Living (ADL) reflect an individual’s ability to perform basic self-care tasks and serve as an important indicator of fundamental functional reserve.15 Impaired ADL is often associated with physical frailty, reduced physiological resilience, and underlying cognitive vulnerability, which may increase a patient’s susceptibility to the impact of surgical trauma and stress during the perioperative period, thereby elevating the risk of neurocognitive disturbances.16

Although existing research has explored the relationship between functional status and POD from various perspectives, indicating that improving functional status through multicomponent interventions (such as a tailored program validated in a Chinese population17) can help reduce the risk of POD, current evidence still has certain limitations. First, in most studies, ADL indicators are typically incorporated into comprehensive geriatric assessments or composite frailty indices,18 making it difficult to independently clarify the specific association between basic daily living function itself and POD. Second, significant heterogeneity exists across studies in terms of assessment tools, surgical types, and study populations, limiting the comparability and generalizability of findings. The Barthel Index (BI) is currently one of the most widely used and thoroughly validated tools internationally for assessing basic ADL.19 This scale offers advantages such as a clear structure, ease of administration, objective and quantifiable results, and provides a relatively direct reflection of the basic functional status of elderly patients, thus demonstrating good feasibility and potential for widespread adoption in clinical practice.20,21

Therefore, this study uses the BI score to quantitatively evaluate ADL, aiming to explore the correlation between preoperative BI score and POD in elderly patients undergoing non-cardiac surgery, based on the hypothesis that impaired preoperative basic ADL, as reflected by lower BI scores, is associated with an increased risk of POD. The findings are expected to provide a scientific basis for the early identification of high-risk patients and the development of individualized intervention strategies.

Patients and Methods

Study Design and Participants

This prospective cohort study was approved by the Scientific Research Ethics Review Committee of the First Hospital of Shanxi Medical University (Approval No. KYLL-2025-101) and was conducted in accordance with the ethical principles of the Declaration of Helsinki. The study was registered at http://www.chictr.org.cn/ (Clinical Trial Registration No. ChiCTR2500101542). A total of 329 elderly patients with complete clinical data who underwent elective non-cardiac surgery under general anesthesia at the First Hospital of Shanxi Medical University between March 2025 and May 2025 were selected.

Patients meeting all the following inclusion criteria were considered for enrollment: 1. age ≥ 65 years; 2. scheduled for elective non-cardiac surgery; 3. planned for general anesthesia; 4. American Society of Anesthesiologists (ASA) physical status classification II–III; 5. preoperative Mini-Mental State Examination (MMSE) score ≥ 24, Hospital Anxiety and Depression Scale (HADS) anxiety score ≤ 7 and depression score ≤ 7; 6. approved by the hospital ethics committee, and informed consent obtained from the patients. Patients were excluded if they met any of the following criteria: 1. history of traumatic brain injury or severe central nervous system disorders; 2. significant auditory/visual impairment, communication barriers, physical disabilities, or inability to complete assessments; 3. preoperative cognitive impairment, dementia, mental disorders or predisposition, or recent use of antipsychotic medications; 4. undergoing emergency surgery; 5. postoperative Intensive Care Unit (ICU) admission; 6. postoperative hospital stay < 3 days; 7. inability to comprehend informed consent documents or declined study participation.

Data Collection

Demographic and Clinical Characteristics

The baseline characteristics of the enrolled subjects were obtained through a comprehensive review of electronic medical records, including demographic characteristics (eg, sex, age, body mass index [BMI], and smoking status) and clinical laboratory test results.

Preoperative ADL

Within 24 hours before surgery, two assessors trained for consistency used the Chinese version of the BI scale to assess the patients’ ADL. The assessment included 10 functions such as feeding, dressing, and toileting, with a total score range of 0–100 (higher scores indicate greater independence).22

Other Potential Risk Factors of POD

Based on current state-of-the-art evidence and clinical guidelines,2 the following data were collected: ASA physical status classification, Charlson Comorbidity Index (CCI), and preoperative length of stay. In addition, ten intraoperative and postoperative variables were recorded: type of surgery (gastrointestinal, urologic, orthopedic, gynecologic, neurosurgical, otorhinolaryngologic [ear, nose, and throat; ENT], thoracic, hernia, oral, and hepatobiliary surgery), type of anesthesia (total intravenous anesthesia or combined intravenous–inhalational anesthesia), duration of anesthesia (from induction to extubation), duration of surgery (from skin incision to skin closure), mean arterial pressure difference (highest minus lowest intraoperative value), oxygen saturation difference (highest minus lowest intraoperative value), intraoperative blood loss, total fluid input, use of patient-controlled intravenous analgesia (PCIA), and administration of postoperative nerve block. All variables were obtained through a review of surgical records, anesthesia records, and intraoperative monitoring data.

Observation Indicators

The primary outcome of this study was the occurrence of POD within 7 days after surgery or before hospital discharge, whichever occurred first. Trained researchers assessed patients twice daily (08:00–10:00 and 18:00–20:00)23 using the validated Chinese version24 of the 3-Minute Diagnostic Interview for Confusion Assessment Method–defined Delirium (3D-CAM) from postoperative day 1 until day 7 or discharge.25,26 The 3D-CAM, whose diagnostic algorithm operationalizes the core domains of the DSM-5 criteria,27 was used as the primary instrument for delirium diagnosis. In cases of complex or ambiguous clinical presentations where scale item interpretation was uncertain, final diagnostic judgment was made with direct reference to the original DSM-5 diagnostic domains. Delirium occurrence was recorded as a binary variable (yes/no).

The secondary outcome was postoperative agitation, assessed using the Riker Sedation–Agitation Scale (SAS)28 immediately after emergence from anesthesia and at 1, 2, 4, and 6 hours thereafter. An SAS score ≥5 was defined as agitation and recorded as a binary outcome. All data were collected using standardized case report forms and independently verified by two researchers to ensure data accuracy.29

Perioperative Management

All patients received standardized monitoring upon entry into the operating room, including electrocardiography, invasive or noninvasive blood pressure monitoring, and pulse oximetry. All surgeries were performed under general anesthesia, either alone or in combination with regional anesthesia, at the discretion of the attending anesthesiologist based on the type of surgery. General anesthesia was induced using sufentanil and etomidate. After tracheal intubation, mechanical ventilation was initiated to maintain an end-tidal carbon dioxide partial pressure of 35–45 mmHg. Anesthesia was maintained with continuous intravenous infusions of propofol and remifentanil, with or without sevoflurane inhalation, and neuromuscular blockade was routinely achieved using muscle relaxants such as cisatracurium. The depth of anesthesia was adjusted to maintain the bispectral index (BIS) between 40 and 60 throughout the procedure. Hemodynamic stability was maintained using vasoactive agents (vasopressors and/or inotropes), fluid administration, and red blood cell transfusion when clinically indicated.

Sample Size Calculation

According to the Chinese Expert Consensus on Prevention and Treatment of Postoperative Delirium in Elderly Patients (2022 Edition), data from China show that the incidence of POD in patients over 65 years undergoing non-cardiac surgery ranges from 6.1% to 57.1%, with an overall incidence of 11.1%.30 Assuming an incidence rate of 11.1% in our study, an AUC of no less than 80%, and 4 candidate predictor variables, calculation using the R software pmsampsize package yielded a sample size of 279, requiring at least 31 outcome events. Considering a 15% loss to follow-up rate, the final required sample size was 329 patients.

Statistical Analysis

The distribution of continuous variables was visually assessed using histograms to determine normality. Normally distributed continuous variables are expressed as mean ± standard deviation (Mean ± SD), and compared between groups using the independent samples T-test. Non-normally distributed continuous variables are expressed as median and interquartile range [M (P25, P75)], and compared between groups using the Mann–Whitney U-test. Categorical variables are described using counts (percentages), and compared between groups using the Chi-square (χ2) test or Fisher’s exact test (when the number of cells with expected count <5 exceeds 20% or the total number of cases is <40).

First, univariate logistic regression analysis was performed. Then, preoperative BI score and variables with P<0.05 in the univariate analysis were simultaneously entered into a multivariate logistic regression model using the enter method to analyze the association between preoperative BI score and the risk of POD, calculating the odds ratio (OR) and its 95% confidence interval (CI).

Furthermore, restricted cubic spline (RCS) nested within a multivariate logistic regression model was employed to visualize and analyze the dose-response relationship between preoperative BI scores and the risk of POD. To achieve a balance between model fit and smoothness, four knots were pre-specified for the BI score distribution, as recommended by Harrell.31 The model was adjusted for potential confounders. A likelihood ratio test was performed to test the null hypothesis of linearity (P for nonlinear). If a significant non-linear association was identified, a two-piecewise linear regression model was subsequently applied to determine the threshold effect (inflection point) using the maximum likelihood method. Based on the identified inflection point, patients were stratified into High BI and Low BI groups. Subsequently, univariate and multivariate logistic regression analyses were repeated to further validate the association between BI categories and POD risk, with ORs and 95% CIs calculated.

To enhance the reliability of the results, this study further employed Inverse Probability of Treatment Weighting (IPTW) analysis. To reduce confounding effects due to the non-randomized nature of this study, the preoperative BI score group (High BI group/Low BI group) was used as the outcome variable, and the demographic and clinical characteristics listed in Table 1 (excluding preoperative BI score) were used as predictor variables to construct a Logistic regression model for calculating the propensity score. The stabilized weight for each patient was calculated based on their marginal probability and the propensity score. Furthermore, we trimmed the top and bottom 1% of the stabilized weights to reduce the influence of outliers.32,33 The Absolute Standardized Difference (ASD) was used to assess the balance between the two groups before and after weighting. An ASD less than 0.10 was considered indicative of good balance between the two groups. Variables with ASD still ≥0.10 after IPTW adjustment were considered insufficiently balanced and were included as covariates in the outcome analysis to control for residual confounding. Specifically, relevant variables were included as additional covariates in the weighted Logistic regression models for POD and postoperative agitation, respectively. Results are presented as ORs and their 95% CIs.

Table 1 Comparison of Variables Between High and Low BI Groups Before and After IPTW Adjustment

Additionally, to explore potential effect modifiers, subgroup analyses were performed stratified by age (≥75 years vs <75 years), gender (male vs female), CCI (≥3 points vs <3 points), anesthesia duration (≥120 min vs <120 min), and preoperative hospital stay (≥5 days vs <5 days). The presence of interaction between each stratification factor and preoperative BI score in influencing the risk of POD was tested by constructing a single Logistic regression model containing the treatment-covariate interaction term. The interaction was tested using the likelihood ratio test, with P<0.05 indicating a statistically significant interaction. We estimated stratum-specific ORs.

To assess the incremental predictive value of the BI score for POD, four logistic regression models were developed: Model 1 included variables expect BI score that were significant in univariate analysis; Model 2 incorporated the BI score along with those univariate-significant variables; Model 3 included variables expect BI score that were significant in multivariate analysis; and Model 4 incorporated the BI score alongside those multivariate-significant variables. The predictive performance of these models was evaluated using receiver operating characteristic (ROC) curve analysis, and the improvement attributed to the BI score was quantified by comparing the area under the curve (AUC) with its corresponding 95% confidence interval (CI).

This study used professional statistical analysis software SPSS 25.0 and Zstats for data processing and analysis. A two-sided P-value < 0.05 was considered statistically significant. The calculation of inverse probability weights was achieved by obtaining propensity scores and performing calculations using SPSS 25.0 software.

Results

Patient Characteristics

A total of 329 patients voluntarily participated in this study and provided written informed consent. After excluding 31 individuals based on the exclusion criteria, 298 patients were ultimately included for analysis (Figure 1). The baseline characteristics and perioperative data of the patients are detailed in Table 2. Among the preoperative indicators, compared to the non-delirium group, the delirium group had a significantly lower preoperative BI score (Z=−5.673, P<0.001), lower BMI (t=3.149, P=0.002), older age (Z=−6.757, P<0.001), and longer preoperative hospital stay (Z=−3.086, P=0.002). Among the intraoperative variables, delirium patients had a greater mean arterial pressure difference (t=−2.000, P=0.046) and greater fluctuation in oxygen saturation difference (Z=−2.253, P=0.024) compared to non-delirium patients. The differences in the remaining variables between the two groups did not reach statistical significance.

Table 2 Comparison of Variables Between the Two Patient Groups

Figure 1 Flow chart of patient selection.

Analysis Results of Influencing Factors for POD in Elderly Non-Cardiac Surgery Patients

The results showed that several variables were significantly associated with POD (P<0.05): Preoperative BI score (OR=0.959, 95% CI: 0.946–0.973) and BMI (OR=0.852, 95% CI: 0.769–0.944) were significantly negatively correlated with the outcome. Age (OR=1.196, 95% CI: 1.131–1.264), preoperative hospital stays (OR=1.179, 95% CI: 1.072–1.297), and mean arterial pressure difference (OR=1.027, 95% CI: 1.001–1.055) were significantly positively correlated with the outcome. The associations of variables such as hemoglobin and albumin with the outcome did not reach significant statistical significance. For categorical variables, taking smoking as an example, the association with the outcome varied by smoking status; “occasional smoking” (OR=3.649, 95% CI: 1.314–10.131) was significantly associated with the outcome, while other categories were not, as shown in Table 3. These results provided a preliminary basis for subsequent multivariate analysis.

Table 3 Univariate and Multivariate Logistic Regression Analysis of Factors Influencing POD

Using POD as the dependent variable (assignment: 1=occurred, 0=not occurred) and the 6 variables with P<0.05 from the univariate logistic regression analysis as independent variables, multivariate logistic regression analysis was performed. No significant multicollinearity was detected among the variables (Supplementary Table 1), and the model showed good fit (Supplementary Table 2). The final results showed that 3 variables were significantly associated with POD (P<0.05). Among them, a higher preoperative BI score (OR=0.956, 95% CI: 0.940–0.973) and a higher BMI (OR=0.860, 95% CI: 0.757–0.977) were protective factors for POD in elderly patients undergoing non-cardiac surgery under general anesthesia, while older age (OR=1.174, 95% CI: 1.102–1.250) was an independent risk factor, as shown in Table 3.

Dose-Response Relationship Between Preoperative BI Score and POD Risk and Threshold Identification

We performed RCS analysis to examine the nonlinear dose–response relationship between preoperative BI score and the risk of POD in elderly non-cardiac surgery patients. The RCS curve revealed a significant negative nonlinear association: as the BI score increased, POD risk gradually decreased. Notably, in the range below a BI score of 70, the risk declined more steeply, indicating greater sensitivity of POD risk to BI score changes among patients with lower functional status. Beyond a specific threshold, the decline in risk slowed and the curve flattened, suggesting a diminished protective effect of further increasing the BI score (Figure 2).

Figure 2 RCS curve depicting the relationship between BI and the risk of POD. The solid red line represents the trend of the OR for POD as the BI increases, the red shaded area indicates the 95% confidence interval, and the black dashed line represents the line of null effect (OR = 1). The plot was derived from a multivariable logistic regression model incorporating restricted cubic splines with four knots, adjusted for covariates. The reference point was set at the median BI value.

Abbreviations: RCS, restricted cubic spline; BI, Barthel Index; POD, postoperative delirium; OR, odds ratio.

To quantitatively identify the threshold, a two-piecewise linear regression model was applied. The analysis indicated an inflection point at 74.795 for the BI score (Supplementary Table 3). A likelihood ratio test comparing the two-piecewise model with a simple linear model yielded P = 0.034, supporting the presence of a threshold effect. On either side of this value, the association between BI score and POD risk differed substantially: at BI scores < 74.795, each 1-point increase was associated with a 7.5% reduction in POD risk (OR = 0.925, 95% CI: 0.885–0.966, P < 0.001), whereas above this point, the association was not statistically significant (OR = 1.003, 95% CI: 0.956–1.052, P = 0.903).

POD Risk Analysis Based on Preoperative BI Score Grouping

To enhance clinical applicability and validate the observed threshold effect, this study categorized patients into two groups using a cutoff value of 75 points based on the calculated threshold and BI score characteristics: the High BI group (BI ≥ 75 points, n = 179, 60.1%) and the Low BI group (BI < 75 points, n = 119, 39.9%). Detailed baseline characteristics and perioperative data are presented in Supplementary Table 4. Overall, the median preoperative BI scores were 95 in the High BI group and 55 in the Low BI group. Compared with the High BI group, patients in the Low BI group were significantly older (P = 0.033), while no statistically significant differences were observed in the remaining variables between the two groups.

The overall incidence of POD within 7 days after surgery was 25.8% (77/298), with distinct rates between the High BI and Low BI groups (14.5% vs 42.9%, respectively). Both univariate and multivariate logistic regression analyses were conducted on the grouped data, with results shown in Table 4. The multivariate model confirmed that the low BI group was independently associated with an increased risk of POD (OR = 4.615, 95% CI: 2.377–8.963, P < 0.001). To enhance the reliability of the findings, IPTW was further employed as a supplementary analysis. After IPTW adjustment, all variables achieved good balance between the two groups (ASD < 0.10; Table 1). The IPTW-adjusted analysis corroborated the primary results, demonstrating a persistently elevated risk of POD in the Low BI group (adjusted OR = 4.445, 95% CI: 2.564–7.707, P < 0.001). However, no significant differences were observed between the two groups in secondary outcomes such as agitation (Table 5).

Table 4 Univariate and Multivariate Logistic Regression Analysis of the Impact of BI-Based Grouping on POD

Table 5 Comparison of Postoperative Outcomes Based on BI Before and After IPTW Adjustment

Subgroup Analysis

Subgroup analysis showed that the association between preoperative BI score and POD had significant interaction effects with three variables: CCI (interaction P < 0.001), anesthesia duration (interaction P = 0.004), and preoperative hospital stay (interaction P < 0.001). The significant association between preoperative BI score and POD was more prominent in patients with shorter anesthesia duration (OR=0.938; 95% CI: 0.917–0.960; P < 0.001), and was only applicable to patient groups with a low CCI (OR=0.931; 95% CI: 0.911–0.952; P < 0.001) and shorter preoperative hospital stay (OR=0.919; 95% CI: 0.894–0.946; P < 0.001) (Figure 3).

Figure 3 Forest plot of ORs by patient subgroups.

Abbreviations: OR, odds ratio; CCI, Charlson Comorbidity Index; Preop LOS, preoperative length of stay.

The Incremental Predictive Value of BI Score on POD

ROC curve analysis was performed to evaluate the incremental predictive value of the preoperative BI score for POD in elderly patients undergoing non-cardiac surgery. To assess the specific contribution of the BI score, models were developed and evaluated separately, with and without it. Incorporating the BI score into the model with univariable-significant predictors increased the AUC from 0.779 (95% CI: 0.723–0.835) to 0.840 (95% CI: 0.790–0.890) (Figure 4A and B). Similarly, when the score was added to the multivariate-significant predictors, the AUC improved from 0.766 (95% CI: 0.710–0.823) to 0.833 (95% CI: 0.782–0.884) (Figures 4C and D). Furthermore, integrating the BI score markedly improved model specificity, positive predictive value, and overall diagnostic accuracy, as detailed in Table 6. These findings underscore the BI score’s role as a robust, continuous measure of functional status that provides critical incremental information, thereby optimizing the identification of high-risk patients.

Table 6 Comparison of Predictive Performance for POD Between Models with and without the BI Score

Figure 4 (A) ROC curve analysis of Model 1 for predicting POD; (B) ROC curve analysis of Model 2 for predicting POD; (C) ROC curve analysis of Model 3 for predicting POD; (D) ROC curve analysis of Model 4 for predicting POD. Model 1 included variables other than the BI score that were significant in univariate analysis; Model 2 added the BI score to Model 1; Model 3 included variables other than the BI score that were significant in multivariate analysis; Model 4 added the BI score to Model 3. The incremental predictive value of the BI score was assessed by comparing the area under the curve (AUC) among these models.

Abbreviations: ROC, receiver operating characteristic; POD, postoperative delirium; BI, Barthel Index; AUC, area under the curve.

Discussion

This study focused on the intrinsic relationship between the preoperative BI score and POD in elderly patients undergoing non-cardiac surgery. Through systematic analysis of prospectively collected single-center data, we found that a lower preoperative BI score was associated with a higher risk of POD, and there was a significant nonlinear relationship between the BI score and POD risk, with an optimal critical value of 75 points. Using this value as a cutoff, a low BI score status was a strong independent risk factor for POD. Furthermore, ROC curve analysis demonstrated that the inclusion of the BI score significantly improved the model’s discrimination and diagnostic performance. This confirms that the BI captures a dimension of patient vulnerability distinct from traditional risk factors, providing quantitative evidence to support its integration into routine perioperative risk stratification.

In recent years, exercise interventions have emerged as a promising non-pharmacological approach to enhance functional and cognitive outcomes in elderly patients. Accumulating evidence demonstrates that tailored exercise regimens not only effectively counteract hospitalization-associated functional decline34 but also confer multidomain benefits in patients with Alzheimer’s disease.35 Notably, exercise has shown potential in delirium management, with studies reporting improved delirium trajectory and preserved functional capacity after hospital discharge.36 These consistent observations underscore the importance of maintained physical function in preventing neurocognitive complications. Nevertheless, existing research has primarily addressed post-delirium interventions or focused on specific clinical populations, whereas the relationship between preoperative functional status and POD risk during the elective surgical period remains inadequately explored.

Against this background, the current study systematically examines the association between preoperative ADL, as quantified by the BI, and POD risk in elderly non-cardiac surgical patients. Our analysis identifies a preoperative BI score <75 as a strong independent predictor for POD development. This observation extends the established connection between physical function and cognitive preservation into the preoperative context, while simultaneously supporting the integration of BI assessment into routine preoperative evaluation. Such implementation could facilitate early identification of high-risk patients and inform targeted prehabilitation strategies aimed at POD prevention.

Patients with low preoperative BI scores often exhibit insufficient compensatory reserve of organ function, impaired integrity of neuroregulatory networks, and are frequently accompanied by nutritional and metabolic imbalances and immune dysfunction.37 This frail physical state leaves them lacking the effective capacity to resist surgical trauma stress, anesthetic drug effects, and postoperative environmental disturbances. The brain, as an organ with extremely high demands for energy metabolism and internal homeostasis, is more prone to functional disorders during this process, manifesting as delirium.38 From a clinical practice perspective, traditional POD prediction indicators mostly focus on physiological parameters (such as age, BMI, hemoglobin) or disease states (such as comorbidity index, ASA classification).39,40 These indicators can only reflect the patient’s current pathophysiological state but cannot quantify their potential compensatory capacity to cope with surgical trauma and anesthetic stress. In contrast, the BI score, which uses ADL (such as eating, dressing, toileting, walking, transferring) as its core assessment dimension, essentially quantifies the patient’s physical functional reserve. This scoring system integrates multiple dimensions reflecting physical functional status, including muscle strength, balance ability, and neural coordination function, providing a relatively comprehensive assessment of the patient’s overall functional reserve.41 Moreover, the BI score can be obtained through a simple assessment, offering the advantages of easy operation, strong repeatability, and freedom from medical facility limitations, making it particularly suitable for primary hospitals or rapid perioperative risk screening scenarios.

Through RCS curve analysis, we revealed a significant nonlinear correlation between the preoperative BI score and POD risk and identified BI=75 as the risk threshold. For elderly non-cardiac surgery patients with a BI score <75 points, close attention should be paid to the high POD risk associated with their deficits in ADL. Interventions such as preoperative rehabilitation training can be implemented to improve the BI score and reduce the incidence of POD. For patients with a BI score ≥75 points, the focus of POD risk prevention and control should shift to other risk factors (such as age, underlying diseases, et).9,42–44 This conclusion provides an important reference for clinical individualized risk stratification and precise intervention.

Simultaneously, the subgroup analysis in this study showed that the association between preoperative BI score and POD had significant interaction effects in patients with different CCI scores, anesthesia durations, and preoperative hospital stays, indicating that the impact of patient characteristics on POD varies. Furthermore, this association was only significant in patients with a lower CCI, shorter anesthesia duration, and shorter preoperative hospital stay. This suggests that in patients with fewer comorbidities, the predictive value of physical functional reserve (BI score) for POD is more prominent, whereas in patients with more comorbidities, POD risk is more dominated by organ function damage,45 diminishing the relative predictive performance of the BI score. Shorter anesthesia duration causes less interference with neurological function, making the patient’s functional reserve (BI score) a key indicator for identifying POD risk; whereas prolonged anesthesia itself increases neuronal stress,46,47 thereby weakening the predictive ability of the BI score for POD risk. Patients with short preoperative stays experience less negative impact of the hospital environment (eg, noise, disrupted routines) on cognitive function, making their POD risk more dependent on assessment of their preoperative functional status; patients with long preoperative stays may already have hospital-acquired frailty or cognitive decline,48 reducing the predictive value of the BI score. These results suggest that when assessing POD risk and formulating perioperative strategies to improve activity capacity, individual patient characteristics should be fully considered.

Despite the rigorous design and analytical methods employed in this study, several limitations remain: First, the study was single-center with a relatively limited sample size, potentially introducing selection bias, and the generalizability of the results requires validation through multi-center, large-sample prospective studies. Second, the specific molecular mechanisms through which the BI score is associated with POD (eg, whether by modulating neuroinflammatory factors or cholinergic neurotransmitter levels) were not explored, preventing a complete pathophysiological explanation for their association. Third, the effects of targeted preoperative interventions (such as preoperative functional rehabilitation training, individualized nutritional support, preoperative cognitive enhancement training) on improving the BI score were not assessed. This precludes determining specific intervention protocols for improving the BI score and their effectiveness, meaning the study conclusions can only reflect the predictive value of the preoperative BI score but cannot provide evidence for BI score-based preoperative targeted interventions. Future research should further investigate the preventive effects of these interventions on POD in elderly non-cardiac surgery patients.

Conclusion

This study demonstrates that a low preoperative BI score is an independent risk factor for POD in elderly patients undergoing non-cardiac surgery. RCS analysis revealed a significant nonlinear association between BI scores and POD risk, identifying a critical inflection point around 75 points, below which the risk of POD increases substantially. Furthermore, incorporating the BI score into clinical prediction models significantly enhances their discriminative performance for high-risk individuals. These findings suggest that preoperative BI assessment serves as a valuable tool for risk stratification. Future multicenter prospective studies are warranted to evaluate whether targeted interventions to optimize preoperative functional status can effectively reduce POD incidence, thereby improving long-term cognitive outcomes and quality of life in the elderly surgical population.

Compliance Statement on MMSE Usage

An unauthorized version of the Chinese MMSE was used by the study team without permission, however this has now been rectified with PAR. The MMSE is a copyrighted instrument and may not be used or reproduced in whole or in part, in any form or language, or by any means without written permission of PAR (www.parinc.com).

Data Sharing Statement

All data relevant to the study are included in the article or uploaded as Supplementary Information.

Disclosure

The authors have no conflict of interest to declare.

References

1. Hughes CG, Boncyk CS, Culley DJ. et al. American Society for Enhanced Recovery and Perioperative Quality Initiative Joint Consensus Statement on Postoperative Delirium Prevention. Anesth Analg. 2020;130(6):1572–17. doi:10.1213/ANE.0000000000004641

2. Aldecoa C, Bettelli G, Bilotta F, et al. European Society of Anaesthesiology evidence-based and consensus-based guideline on postoperative delirium. Eur J Anaesthesiol. 2017;34(4):192–214. doi:10.1097/EJA.0000000000000594

3. Evered L, Silbert B, Knopman DS, et al. Recommendations for the nomenclature of cognitive change associated with anaesthesia and surgery-2018. Br J Anaesth. 2018;121(5):1005–1012. doi:10.1016/j.bja.2017.11.087

4. Steinmetz J, Christensen KB, Lund T, Lohse N, Rasmussen LS, ISPOCD Group. Long-term consequences of postoperative cognitive dysfunction. Anesthesiology. 2009;110(3):548–555. doi:10.1097/ALN.0b013e318195b569

5. Newman MF, Kirchner JL, Phillips-Bute B, et al. Longitudinal assessment of neurocognitive function after coronary-artery bypass surgery. N Engl J Med. 2001;344(6):395–402. doi:10.1056/NEJM200102083440601

6. Gou RY, Hshieh TT, Marcantonio ER, et al. One-Year Medicare Costs Associated With Delirium in Older Patients Undergoing Major Elective Surgery. JAMA Surg. 2021;156(5):430–442. doi:10.1001/jamasurg.2020.7260

7. Thedim M, Vacas S. Postoperative Delirium and the Older Adult: untangling the Confusion. J Neurosurg Anesthesiol. 2024;36(3):184–189. doi:10.1097/ANA.0000000000000971

8. Li H, Liu C, Yang Y, et al. Effect of Intraoperative Midazolam on Postoperative Delirium in Older Surgical Patients: a Prospective, Multicenter Cohort Study. Anesthesiology. 2025;142(2):268–277. doi:10.1097/ALN.0000000000005276

9. Ahmadzadeh S, Duplechin DP, Haynes AT, et al. Evolving Clinical Management of Postoperative Delirium: a Narrative Review. Cureus. 2025;17(9):e92927. doi:10.7759/cureus.92927

10. Susano MJ, Grasfield RH, Friese M, et al. Brief Preoperative Screening for Frailty and Cognitive Impairment Predicts Delirium after Spine Surgery. Anesthesiology. 2020;133(6):1184–1191. doi:10.1097/ALN.0000000000003523

11. SNAP-3 collaborators, Swarbrick CJ, Williams K, Evans B, et al. Postoperative outcomes in older patients living with frailty and multimorbidity in the UK: SNAP-3, a snapshot observational study. Br J Anaesth. 2025;135(1):166–176. doi:10.1016/j.bja.2025.04.026.

12. Pol RA, van Leeuwen BL, Visser L, et al. Standardised frailty indicator as predictor for postoperative delirium after vascular surgery: a prospective cohort study. Eur J Vasc Endovasc Surg. 2011;42(6):824–830. doi:10.1016/j.ejvs.2011.07.006

13. Fang PP, Shang ZX, Xu J, et al. Contribution of intraoperative electroencephalogram suppression to frailty-associated postoperative delirium: mediation analysis of a prospective surgical cohort. Br J Anaesth. 2023;130(2):e263–e271. doi:10.1016/j.bja.2022.11.002

14. Glumac S, Kardum G, Sodic L, Supe-Domic D, Karanovic N. Effects of dexamethasone on early cognitive decline after cardiac surgery: a randomized controlled trial. Eur J Anaesthesiol. 2017;34(11):776–784. doi:10.1097/EJA.0000000000000647

15. Jitapunkul S, Kamolratanakul P, Ebrahim S. The meaning of activities of daily living in a Thai elderly population: development of a new index. Age Ageing. 1994;23(2):97–101. doi:10.1093/ageing/23.2.97

16. Falandry C, Bitker L, Abraham P, et al. Senior-COVID-Rea Cohort Study: a Geriatric Prediction Model of 30-day Mortality in Patients Aged over 60 Years in ICU for Severe COVID-19. Aging Dis. 2022;13(2):614–623. doi:10.14336/AD.2021.1004

17. Wang YY, Yue JR, Xie DM, et al. Effect of the Tailored, Family-Involved Hospital Elder Life Program on Postoperative Delirium and Function in Older Adults: a Randomized Clinical Trial. JAMA Intern Med. 2020;180(1):17–25. doi:10.1001/jamainternmed.2019.4446

18. Stortecky S, Schoenenberger AW, Moser A, et al. Evaluation of multidimensional geriatric assessment as a predictor of mortality and cardiovascular events after transcatheter aortic valve implantation. JACC: Cardiovasc Interv. 2012;5(5):489–496. doi:10.1016/j.jcin.2012.02.012

19. Collin C, Wade DT, Davies S, Horne V. The Barthel ADL Index: a reliability study. Int Disabil Stud. 1988;10(2):61–63. doi:10.3109/09638288809164103

20. Shah S, Cooper B, Maas F. The Barthel Index and ADL Evaluation in Stroke Rehabilitation in Australia, Japan, the UK and the USA. Aust Occup Ther J. 1992;39(1):5–13. doi:10.1111/j.1440-1630.1992.tb01729.x

21. Charlson ME, Pompei P, Ales KL, MacKenzie CR. A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. J Chronic Dis. 1987;40(5):373–383. doi:10.1016/0021-9681(87)90171-8

22. Oberhaus J, Wang W, Mickle AM, et al. Evaluation of the 3-Minute Diagnostic Confusion Assessment Method for Identification of Postoperative Delirium in Older Patients. JAMA Network Open. 2021;4(12):e2137267. doi:10.1001/jamanetworkopen.2021.37267

23. Wang Y, Zhao L, Zhang C, et al. Identification of risk factors for postoperative delirium in elderly patients with Hip fractures by a risk stratification index model: a retrospective study. Brain Behav. 2021;11(12):e32420. doi:10.1002/brb3.2420

24. Mu DL, Ding PP, Zhou SZ, et al. Cross-cultural adaptation and validation of the 3D-CAM Chinese version in surgical ICU patients. BMC Psychiatry. 2020;20(1):133. doi:10.1186/s12888-020-02544-w

25. Evered LA, Chan MTV, Han R, et al. Anaesthetic depth and delirium after major surgery: a randomised clinical trial. Br J Anaesth. 2021;127(5):704–712. doi:10.1016/j.bja.2021.07.021

26. Ma R, Zhao J, Li C, et al. Diagnostic accuracy of the 3-minute diagnostic interview for confusion assessment method-defined delirium in delirium detection: a systematic review and meta-analysis. Age Ageing. 2023;52(5):afad074. doi:10.1093/ageing/afad074

27. Marcantonio ER, Ngo LH, O’Connor M, et al. 3D-CAM: derivation and validation of a 3-minute diagnostic interview for CAM-defined delirium: a cross-sectional diagnostic test study. Ann Intern Med. 2014;161(8):554–561. doi:10.7326/M14-0865

28. Al-Taie SF, Fedwi MM, Merza M, et al. Evaluation of different sedation scales in the ICU management of COVID-19 patients. Sci Rep. 2025;15(1):28782. doi:10.1038/s41598-025-14421-1

29. Idänpään-Heikkilä JE. WHO guidelines for good clinical practice (GCP) for trials on pharmaceutical products: responsibilities of the investigator. Ann Med. 1994;26(2):89–94. doi:10.3109/07853899409147334

30. China Geriatrics Society Anesthesiology Branch. Chinese expert consensus on the prevention and treatment of postoperative delirium in elderly patients (in Chinese). Int J Anesthesiol Resuscit. 2023;44(1):1–27.

31. Harrell Jr FE. Regression Modeling Strategies: With Applications to Linear Models, Logistic and Ordinal Regression, and Survival Analysis. 2nd edn ed. Springer; 2015.

32. Shiba K, Kawahara T. Using Propensity Scores for Causal Inference: pitfalls and Tips. J Epidemiol. 2021;31(8):457–463. doi:10.2188/jea.JE20210145

33. Austin PC, Stuart EA. Moving towards best practice when using inverse probability of treatment weighting (IPTW) using the propensity score to estimate causal treatment effects in observational studies. Stat Med. 2015;34(28):3661–3679. doi:10.1002/sim.6607

34. Martínez-Velilla N, Casas-Herrero A, Zambom-Ferraresi F, et al. Effect of Exercise Intervention on Functional Decline in Very Elderly Patients During Acute Hospitalization: a Randomized Clinical Trial. JAMA Intern Med. 2019;179(1):28–36. doi:10.1001/jamainternmed.2018.4869

35. López-Ortiz S, Valenzuela PL, Seisdedos MM, et al. Exercise interventions in Alzheimer’s disease: a systematic review and meta-analysis of randomized controlled trials. Ageing Res Rev. 2021;72:101479. doi:10.1016/j.arr.2021.101479

36. Lozano-Vicario L, Zambom-Ferraresi F, Zambom-Ferraresi F, et al. Effects of Exercise Intervention for the Management of Delirium in Hospitalized Older Adults: a Randomized Clinical Trial. J Am Med Dir Assoc. 2024;25(8):104980. doi:10.1016/j.jamda.2024.02.018

37. Pilotto A, Cella A, Pilotto A, et al. Three Decades of Comprehensive Geriatric Assessment: evidence Coming From Different Healthcare Settings and Specific Clinical Conditions. J Am Med Dir Assoc. 2017;18(2):192.e1–192.e11. doi:10.1016/j.jamda.2016.11.004

38. Stone SP, Ali B, Auberleek I, Thompsell A, Young A. The Barthel index in clinical practice: use on a rehabilitation ward for elderly people. J R Coll Physicians Lond. 1994;28(5):419–423.

39. Oh AR, Park J, Kim CS, Lee SM, Yoo SY. Association between preoperative anemia and postoperative delirium in elderly patients undergoing non-cardiac surgery: a retrospective observational study. Korean J Anesthesiol. 2025;78(5):462–470. doi:10.4097/kja.24701

40. Sadeghirad B, Dodsworth BT, Schmutz Gelsomino N, et al. Perioperative Factors Associated With Postoperative Delirium in Patients Undergoing Noncardiac Surgery: an Individual Patient Data Meta-Analysis. JAMA Network Open. 2023;6(10):e2337239. doi:10.1001/jamanetworkopen.2023.37239

41. Van den Beld AW, Blum WF, Pols HA, Grobbee DE, Lamberts SW. Serum insulin-like growth factor binding protein-2 levels as an indicator of functional ability in elderly men. Eur J Endocrinol. 2003;148(6):627–634. doi:10.1530/eje.0.1480627

42. Vacas S, Degos V, Feng X, Maze M. The neuroinflammatory response of postoperative cognitive decline. Br Med Bull. 2013;106(1):161–178. doi:10.1093/bmb/ldt006

43. Patel A, Zhang M, Liao G, et al. A Systematic Review and Meta-analysis Examining the Impact of Age on Perioperative Inflammatory Biomarkers. Anesth Analg. 2022;134(4):751–764. doi:10.1213/ANE.0000000000005832

44. Vacas S, Canales C, Deiner SG, Cole DJ. Perioperative Brain Health in the Older Adult: a Patient Safety Imperative. Anesth Analg. 2022;135(2):316–328. doi:10.1213/ANE.0000000000006090

45. Kalyan P, Parulekar M. Impact of the Charlson Comorbidity Index on Delirium Outcomes. Cureus. 2024;16(9):e70006. doi:10.7759/cureus.70006

46. Moller JT, Cluitmans P, Rasmussen LS, et al. Long-term postoperative cognitive dysfunction in the elderly ISPOCD1 study. ISPOCD investigators. International Study of Post-Operative Cognitive Dysfunction. Lancet. 1998;351(9106):857–861. doi:10.1016/S0140-6736(97)07382-0

47. Ravi B, Pincus D, Choi S, Jenkinson R, Wasserstein DN, Redelmeier DA. Association of Duration of Surgery With Postoperative Delirium Among Patients Receiving Hip Fracture Repair. JAMA Network Open. 2019;2(2):e190111. doi:10.1001/jamanetworkopen.2019.0111

48. Jain S, Gan S, Nguyen OK, et al. Survival, Function, and Cognition After Hospitalization in Long-Term Acute Care Hospitals. JAMA Network Open. 2024;7(5):e2413309. doi:10.1001/jamanetworkopen.2024.13309

Creative Commons License © 2026 The Author(s). This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms and incorporate the Creative Commons Attribution - Non Commercial (unported, 4.0) License. By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms.