Skip to main content

Longitudinal phenotypes in patients with acute respiratory distress syndrome: a multi-database study

Abstract

Background

Previously identified phenotypes of acute respiratory distress syndrome (ARDS) have been limited by a disregard for temporal dynamics. We aimed to identify longitudinal phenotypes in ARDS to test the prognostic and predictive enrichment of longitudinal phenotypes, and to develop simplified models for phenotype identification.

Methods

We conducted a multi-database study based on the Chinese Database in Intensive Care (CDIC) and four ARDS randomized clinical trials (RCTs). We employed latent class analysis (LCA) to identify longitudinal phenotypes using 24-hourly data from the first four days of invasive ventilation. We used the Cox regression model to explore the association between time-varying respiratory parameters and 28-day mortality across phenotypes. Phenotypes were validated in four RCTs, and the heterogeneity of treatment effect (HTE) was investigated. We also constructed two multinomial logistical regression analyses to develop the probabilistic models.

Findings

A total of 605 ARDS patients in CDIC were enrolled. The three-class LCA model was identified and had the optimal fit, as follows: Class 1 (n = 400, 66.1% of the cohort) was the largest phenotype over all study days, and had fewer abnormal values, less organ dysfunction and the lowest 28-day mortality rate (30.5%). Class 2 (n = 102, 16.9% of the cohort) was characterized by pulmonary mechanical dysfunction and had the highest proportion of poorly aerated lung volume, the 28-day mortality rate was 47.1%. Class 3 (n = 103, 17% of the cohort) was correlated with extra-pulmonary dysfunction and had the highest 28-day mortality rate (56.3%). Time-varying mechanical power was more significantly associated with 28-day mortality in Class 2 patients compared to other phenotypes. Similar phenotypes were identified in four RCTs. A significant HTE between phenotypes and treatment strategies was observed in the ALVEOLI (high PEEP vs. low PEEP) and the FACTT trials (conservative vs. liberal fluid management). Two parsimonious probabilistic models were constructed to identify longitudinal phenotypes.

Interpretation

We identified and validated three novel longitudinal phenotypes for ARDS patients, with both prognostic and predictive enrichment. The phenotypes of ARDS can be accurately identified with simple classifier models, except for Class 3.

Introduction

Acute respiratory distress syndrome (ARDS), clinically defined by the Berlin definition [1], is a heterogenous syndrome characterized by acute hypoxic respiratory failure that can be caused by a wide variety of insults [2]. ARDS is a clinically heterogenous syndrome with diverse populations, multiple etiologies, and a broad definition which might explain the absence of benefit in most randomized controlled trials (RCTs) assessing various treatment strategies [3]. Identifying specific ARDS phenotypes could lead to more favorable clinical trials and personalized ARDS management [4,5,6].

Several ARDS phenotypes have been documented. In the past few years, Calfee and colleagues used latent class analysis (LCA) using cross-sectional data at baseline and identified two phenotypes: a hypo-inflammatory, and a hyper-inflammatory phenotype [7]. The latter had higher levels of pro-inflammatory biomarkers and poorer outcomes. Subsequent analyses demonstrated that patients with the hyper-inflammatory phenotype might benefit more from higher positive end-expiratory pressure (PEEP) and a conservative fluid strategy [8, 9]. According to 54 respiratory and CT-derived variables, a two-class model was identified as best fitting: non-recruitable phenotype and recruitable phenotype, and the recruitable phenotype presented with an increased PaO2/FiO2 ratio, compliance, and decreased alveolar dead space in response to a standardized recruitment maneuver [10].

However, these phenotypes do not capture the complexity and diversity of ARDS, and were derived only based on cross-sectional data collected within one day. Using more diverse data with a longitudinal approach might be more informative in identifying phenotypes. The longitudinal phenotypes of traditional ARDS have never been addressed. Phenotypes may be dynamic and change throughout the course of a patient’s illness. A retrospective study divided septic patients into four illness categories based on the severity of laboratory and vital sign abnormalities, and demonstrated that almost 60% of them changed their illness category at least once during hospitalization [11]. To our best knowledge, only one prior study has evaluated the dynamic change of ARDS phenotypes [12].

Therefore, we designed this study based on a multi-database to identify longitudinal phenotypes of ARDS. We hypothesized that using diverse data with a longitudinal approach could identify novel longitudinal phenotypes, with different clinical characteristics, mortality rates, and most importantly, responding differently to treatments. We also aimed to explore the dynamic change of ARDS phenotypes across days. Finally, we aimed to derive and validate simplified probabilistic models for phenotype assignment.

Method

Study design and participants

We conducted a multi-database study based on the Chinese Database in Intensive Care (CDIC) and RCTs from the National Heart, Lung, and Blood Institute (NHLBI) ARDS Network. The CDIC collected data from 11,560 patients admitted to the Department of Critical Care Medicine, Zhongda Hospital, Southeast University, China, from January 2014 to March 2021, and was classified as a derivation cohort in the present study. Patients in CDIC who fulfilled the berlin definition of ARDS (detail in Additional file 1: Additional methods) [1] and received mechanical ventilation for at least 24 h were eligible for inclusion, we excluded patients younger than 18 years. We only included the first Intensive Care Unit (ICU) admission of each patient. ARDSNet trials including ALVEOLI [13], FACTT [14], EDEN [15] and SAILS [16] were classified as validation cohorts. Involved patients in RCTs were all intubated and received mechanical ventilation. The details of CDIC and ARDSNet trials were presented in Additional file 1: Table S1.

The present study was approved by the Research Ethics Commission of Zhongda Hospital Southeast University (2022ZDSYLL082-P01). For ARDSNet trials, all data were approved by Biologic Specimen and Data Repository Information Coordinating Center (BioLINCC, https://biolincc.nhlbi.nih.gov). STROBE recommendations were followed.

Data collection and outcomes

The detail of data collection was presented in Additional file 1: Additional methods. The primary outcome in the CDIC derivation cohort was 28-day mortality. The primary outcome was 60-day mortality in ALVEOLI, FACTT and SAILS trials, and was ventilator-free days (VFDs) to 28 days in the EDEN trial. Other outcomes included the use of neuromuscular blocking agents (NMBAs), ICU mortality, hospital mortality and 90-day mortality were also investigated.

Quantitative CT analysis

For each patient in CDIC, the whole lung CTs were collected within three days before the initiation of mechanical ventilation, and were treated as lung CTs on Day 0. The cross-sectional lung images were processed and analyzed by a custom-designed software package [17]. After excluding hilar structures, the total lung parenchyma was divided into four regions with different inflated status based on CT attenuation, as follows: non-aerated (voxel density + 100 to − 100 Hounsfield Units, HU), poorly-aerated (− 101 to − 500 HU), well-aerated (− 501 to − 900 HU), and over-inflated lung tissue (− 901 to − 1000 HU) [18]. Fours masks in the total lung were gained as the output, the volume and volumetric percentage of each mask were calculated. The image analysis process was displayed in Additional file 1: Fig. S1. All analyses were performed in Python version 3.7.

Phenotype derivation

In the CDIC derivation cohort, phenotypes were first studied longitudinally using time-dependent analysis with 24-hourly data from the first four days of invasive ventilation, which were identified by longitudinal LCA [19]. Clinical variables were selected based on their association with the severity or outcome of ARDS, and were used as inputs for the identification of latent classes (Additional file 1: Table S2), including age, minute ventilation, PEEP, driving pressure, mechanical power, ventilatory ratio, PaO2/FiO2 ratio, heart rate, MAP, pH, creatinine, bicarbonate, lactate and fluid balance. We recorded the most abnormal value if a variable was recorded more than once.

Before longitudinal LCA, we first assessed the distributions and missingness (Additional file 1: Table S3) of candidate variables. Multiple imputations with chained equations (MICE) were used to account for missing data (detail in Additional file 1: Additional methods). Standardized transformation was used for the dataset, and non–normally distributed variables were log-transformed prior to standardized transformation. Longitudinal LCA was fit to the combined datasets of candidate variables from all patients across Days 0, 1, 2 and 3, while allowing phenotype transition across ICU days. We estimated models ranging from two to six classes, and the optimal number of latent classes were selected using the lowest AIC, SABIC and highest values of entropy. The minimum number of patients should be over 5% of the entire study population. The probability of class membership was used to evaluate the robustness of class membership, and the minimum probability should be over 80%. Sensitivity analysis included only the patients who remained on mechanical ventilation for more than 96 h. Longitudinal phenotypes were also validated in ARDSNet trials. LCA was performed using the tidyLPA package in R, and codes are available in Additional file 2.

We then employed group-based trajectory modeling (GBTM) [20] to assess if the trajectory of a single variable could be used to identify trajectory phenotypes with similar dynamics to those identified by longitudinal LCA. Since longitudinal phenotypes differed most on mechanical power and ventilatory ratio in present study, we applied GBTM on 24-hourly data from the first four days of invasive ventilation to identify trajectories for mechanical power and ventilatory ratio in CDIC (detail in Additional file 1: Additional methods). GBTM was performed using the traj package in Stata.

Statistical analyses

Values are presented as proportions for categorical variables and means (standard deviations) or medians [interquartile ranges (IQRs)] for continuous variables. For comparisons, we used analysis of variance and the Kruskal–Wallis test for continuous data and the X2 test for categorical data.

After the derivation of phenotypes of ARDS, key variables between phenotypes were compared and visualized with rank plots, class membership transition over days 0, 1, 2, and 3 were visualized with alluvial plots. We then assessed the correlation of the longitudinal phenotypes with pre-selected respiratory variables in CDIC, which included mechanical power, ventilatory ratio and driving pressure. We employed the Cox proportional hazards model to estimate the effect of a time-varying parameter on a time-to-event outcome in longitudinal phenotypes [21]. Based on the prior knowledge, baseline variables were selected into the Cox model and included pH, PaO2/FiO2 ratio, PaCO2 and dynamic compliance. Since patients with different phenotypes might have different inflated status of lung parenchyma, we also compared the volumetric percentage of each mask across phenotypes on Day 0 in CDIC.

We then compared 28-day mortality or 60-day mortality of patients in different phenotypes (Day 0) in CDIC cohort and ARDSNet trials using Kaplan–Meier curves and log-rank tests; we also performed a multivariate Cox regression model to explore the association and adjusted for age, gender and BMI. In each of the four ARDSNet trials, heterogeneity of treatment effect (HTE) was also evaluated by the interaction test to determine if treatment effects were differential across phenotypes (Day 0) in existing trials. HTE was assessed by the interaction term (Class × Treatment strategy) of the Cox regression for mortality and Poisson regression for VFDs.

Finally, we attempted to construct two parsimonious models to predict phenotypes on Day 0 and Day 2 using baseline variables, respectively. Machine learning algorithms included extreme gradient boosting (XGBoost) and gradient boosted model (GBM) were used to identify the most critical classifier variables. To select the most important variables, variable importance was used for the XGBoost, relative influence factor of the variable was used for GBM (detail in Additional file 1: Additional methods). The common variables in the top five variables of the two machine learning models were selected, and therefore were used to develop final multinomial logistical regression models to identify phenotypes on Day 0 and Day 2, respectively. The ability of the final model to predict the phenotypes was determined by calculating the area under the receiver operating characteristic curves (AUROC) for the phenotypes, both in the CDIC and ARDSNet trials.

The p-value was calculated to evaluate the differences between phenotypes, and P < 0.05 was considered statistically significant. The level of significance for the test of interaction was adjusted to 0.0167 according to Bonferroni correction. All statistical analyses were performed using R (version 4.0.3), Stata (16.0) and Python (3.7).

Results

Patients in study

In the CDIC derivation cohort, a total of 605 patients met inclusion and exclusion criteria and were enrolled in the final analyses (Additional file 1: Fig. S2). The mean sofa score was 9 (IQR: 6–12) and the PaO2/FiO2 ratio was 159 mmHg (IQR: 117–210). Pneumonia was the leading cause of ARDS (66.1%). The 28-day all-cause mortality was 37.7%. In the four ARDSNet trials, a total of 3294 patients were included as validation cohorts (549 in the ALVEOLI trial, 1000 in the FACTT trial, 1000 in the EDEN trial and 745 in the SAILS trial), the 60-day mortality across the four trials ranged from 22.7 to 26.9%.

Derivation of phenotypes for ARDS in CDIC

Using the longitudinal data from the first four days of mechanical ventilation, a three-class model was identified and had the optimal fit in the CDIC derivation cohort (Additional file 1: Table S4). Entropy was 90.6% and the probability of class membership ranged from 87.4 to 97.6%. The longitudinal phenotypes differed broadly in clinical characteristics and organ dysfunction patterns (Table 1 and Additional file 1: Table S5). The standardized mean differences of main clinical characteristics between phenotypes over time were shown in Fig. 1. Class 1 (n = 400, 66.1% of the cohort) was the largest phenotype, and had fewer abnormal values and less organ dysfunction. Class 2 (n = 102, 16.9% of the cohort) was characterized by the highest minute ventilation, driving pressure, mechanical power, ventilatory ratio and the lowest PaO2/FiO2 during the first four days of mechanical ventilation, which can be called pulmonary mechanical dysfunction phenotype. Class 3 (n = 103, 17% of the cohort) was characterized by the highest creatinine, lactate and the lowest bicarbonate, MAP, and a higher proportion of patients received vasopressors compared to other phenotypes, which can be called extra-pulmonary dysfunction phenotype (Additional file 1: Figs. S3–S5). Interleukin-6 (IL-6) was highest in Class 3 compared to other phenotypes (Additional file 1: Fig. S6). As for the causes of ARDS, Class 3 had the lowest proportion of pneumonia and the highest proportion of sepsis compared to other phenotypes.

Table 1 Clinical characteristics and outcomes of the longitudinal phenotypes on Day 0 in the CDIC
Fig. 1
figure 1

Standardized mean differences between three longitudinal phenotypes in CDIC cohort. A Class 1 vs. Class 2. B Class 1 vs. Class 3, C Class 2 vs Class 3. Fourteen variables were used of phenotyping, but seventeen variables are displayed to give a comprehensive clinical characteristic of the phenotypes. MAP = mean arterial blood pressure; PaCO2 = partial pressure of Carbon Dioxide; PaO2 = partial pressure of oxygen; FiO2 = fraction of inspired oxygen; PEEP = positive end-expiratory pressure

Most patients (56.9%) changed their phenotypes at least once during the first four days of mechanical ventilation (Fig. 2). Class 1 had the largest number over all study days, while the number of Class 3 gradually decreased. Sensitivity analysis including only the patients who remained on mechanical ventilation for more than 96 h (n = 459, 75.8% of the cohort) and yielded a similar result, with only 4.4% of patients on Day 0, 3.3% on Day 1, 4.3% on Day 2 and 4.8% on Day 3 changing class membership (Additional file 1: Fig. S7).

Fig. 2
figure 2

ARDS state transition over days 0, 1, 2 and 3 in CDIC cohort. A ARDS state transition over days in whole patients. B ARDS state transition over days based on the survival state on Day 28. C ARDS state transition over days for patients in Class 1; D ARDS state transition over days for patients in Class 2; E ARDS state transition over days for patients in Class 3. ARDS = acute respiratory distress syndrome; ICU = Intensive care unit

After GBTM, three distinct trajectories of mechanical power and ventilatory ratio were observed and had the optimal fit, as follows: a sustained low value, or a sustained moderate value, or a sustained high value for mechanical power or ventilatory ratio over the first four days of invasive ventilation (Additional file 1: Fig. S8). While the trajectories and longitudinal phenotypes did not overlap much in mechanical power and ventilatory ratio (Additional file 1: Fig. S9). The clinical characteristics and outcomes between trajectories were shown in Additional file 1: Table S6.

Interaction between phenotypes and respiratory parameters on mortality

In the Cox proportional hazards model, after adjusting for pH, PaCO2, PaO2/FiO2 ratio and respiratory system compliance, there was a significant interaction between mechanical power and phenotypes (Fig. 3 and Additional file 1: Tables S7–S9). While time-varying mechanical power was more significantly associated with 28-day mortality in Class 2 patients compared to other phenotypes in the CDIC cohort (HR 1.04, 95% CI 1.01–1.07; p for interaction = 0.0051). No significant interaction was detected between time-varying ventilatory ratio or driving pressure and phenotypes.

Fig. 3
figure 3

Interaction between longitudinal phenotypes of ARDS with time-varying mechanical power (A), ventilatory ratio (B) and driving pressure (C) on 28-day mortality in CDIC cohort. P values represent p values for interaction

Correlation of phenotypes with lung CT features in CDIC

A total of 427 patients performed lung CT within three days before the initiation of mechanical ventilation: 328 (82%) patients in Class 1, 72 (70.6%) patients in Class 2 and 27 (26.2%) patients in Class 3. Compared to other phenotypes, patients in Class 2 had the lowest proportion of normally aerated lung volume, and the highest proportion of poorly-aerated lung volume on Day 0 (Additional file 1: Table S10).

Validation of longitudinal phenotypes for ARDS in ARDSNet trials

Longitudinal phenotypes were also validated in the four ARDSNet trials, and showed the same optimal phenotype numbers as observed in the derivation cohort (Additional file 1: Fig. S10). The phenotype sizes varied across the trials: Class 1 ranged from 41.8 to 62.0%, Class 2 ranged from 14.4 to 34.9%, and Class 3 ranged from 9.9 to 37.2%. The clinical characteristics of the phenotypes were largely similar to those of the derivation cohort (Additional file 1: Tables S11–S14 and Additional file 1: Figs. S11–S14). Specifically, Class 1 was characterized by less organ dysfunction, Class 2 was predominantly characterized by pulmonary mechanical dysfunction phenotype and Class 3 was mainly characterized by extra-pulmonary organ dysfunction. IL-6 and Soluble intercellular adhesion molecule-1 (sICAM-1) were assessed in the ALVEOLI trial, compared to Class 1 and Class 2, both IL-6 and sICAM-1 were highest in Class 3 (Additional file 1: Fig. S6).

Relationship between phenotypes and clinical outcomes

In the CDIC derivation cohort, the 28-day mortality rates were highest in Class 3 (56.3%), followed by Class 2 and Class 1 (47.1% and 30.5%, respectively). Kaplan–Meier survival curves showed the 28-day mortality was highest in Class 3 (P < 0.0001) compared with other phenotypes. Across all ARDSNet trials, the highest 60-day mortality occurred in Class 3 compared with other phenotypes (P < 0.001). In the ALVEOLI trial, 60-day mortality was 18.9% for Class 1, 32.9% for Class 2 and 35.6% for Class 3. In the FACTT trial, 60-day mortality was 17.7% for Class 1, 35.4% for Class 2 and 47.9% for Class 3. In the EDEN trial, the 60-day mortality was 18.2% for Class 1, 24.2% for Class 2 and 27% for Class 3. In the SAILS trial, the 60-day mortality was 24.2% for Class 1, 27.5% for Class 2 and 36.6% for Class 3 (Fig. 4). The derived phenotypes demonstrated significant differences in VFDs across the cohort and trials. Specifically, patients assigned to Class 1 had the most VFDs and patients assigned to Class 3 had the least VFDs (In the EDEN trial, patients in Class 2 had the least VFDs).

Fig. 4
figure 4

Mortality by longitudinal phenotypes on Day 0. A 28-day mortality stratified by phenotype assignment on Day 0 in CDIC. After adjusting for age, gender and BMI, compared with Class 1, the adjusted hazard ratio of 28-day mortality for Class 2 and Class 3 were 1.86 (95% CI 1.33–2.62), 2.48 (95% CI 1.81–3.41), respectively. BE 60-day mortality stratified by phenotype assignment on Day 0 in ARDSNet Trials. In ALVEOLI trial, compared with Class 1, the adjusted hazard ratio of 60-day mortality for Class 2 and Class 3 were 2.58 (95% CI 1.61–5.12), 2.56 (95% CI 1.76–3.72), respectively. In FACTT trial, compared with Class 1, the adjusted hazard ratio of 60-day mortality for Class 2 and Class 3 were 2.72 (95% CI 2.05–3.61), 3.02 (95% CI 2.07–4.40), respectively. In EDEN trial, compared with Class 1, the adjusted hazard ratio of 60-day mortality for Class 2 and Class 3 were 1.83 (95% CI 1.26–2.64), 1.53 (95% CI 1.13–2.07), respectively. In SAILS trial, compared with Class 1, the adjusted hazard ratio of 60-day mortality for Class 2 and Class 3 were 1.29 (95% CI 0.92–1.81), 2.09 (95% CI 1.43–3.04), respectively

Heterogeneity of treatment effect within phenotypes

We assessed HTE in four ARDSNet trials based on the phenotypes on Day 0. In the ALVEOLI trial, a significant interaction between phenotypes and PEEP strategy on 60-day mortality was detected, patients classified to Class 2 had a 60-day mortality of 23.6% when received lower PEEP strategy, versus 54.2% when received higher PEEP strategy. In contrast, patients classified to Class 3 had a 60-day mortality of 41.6% when received lower PEEP strategy, versus 30.9% when received higher PEEP strategy (P for interaction = 0.0016) (Additional file 1: Fig. S15 and Table 2). In the FACTT trial, we also identified a significant effect of the interaction between phenotypes and fluid management strategy on 60-day mortality. Specifically, mortality among Class 2 patients was 30.8% with the fluid conservative strategy compared to 39.3% with the fluid liberal strategy. While mortality among Class 3 patients was 58.5% with the fluid conservative strategy compared to 35.6% with the fluid liberal strategy (P for interaction = 0.0068) (Additional file 1: Fig. S16 and Table 3). No significant HTE was observed in the EDEN and SAILS trials (Additional file 1: Figs. S17–S18).

Table 2 Heterogeneity of treatment effect to PEEP strategy within phenotypes in ALVEOLI Trial
Table 3 Heterogeneity of treatment effect to Fluid management strategy within phenotypes in FACTT Trial

Parsimonious probabilistic models to identify phenotypes

The most important classifier variables for predicting phenotypes were shown in Additional file 1: Fig. S19–S20. Ultimately, model A included mechanical power, ventilatory ratio, respiratory rate and pH was constructed to predict phenotypes on Day 0, and model B included ventilatory ratio, mechanical power, and creatinine was constructed to predict phenotypes on Day 2 (Additional file 1: Table S15). The AUROC for model A to predict phenotypes on Day 0 was 0.86 (95% CI 0.82–0.90) for Class 1, 0.97 (95% CI 0.95–0.98) for Class 2, and 0.67 (95% CI 0.62–0.73) for Class 3. While for the prediction of phenotypes on Day 2, the AUROC of model B was 0.78 (95% CI 0.72–0.83) for Class 1, 0.80 (95% CI 0.74–0.86) for Class 2, and 0.70 (95% CI 0.60–0.79) for Class 3 (Fig. 5). For the external validation cohorts, the predictive ability of the parsimonious models was shown in Additional file 1: Table S16.

Fig. 5
figure 5

Receiver operating curves for the multinomial regression models for predicting phenotypes on Day 0 (A) and Day 2 (B) in CDIC cohort

Discussion

The novel findings of our analyses can be summarized as follows. First, based on a multi-database, we derived and validated three novel longitudinal phenotypes of ARDS, with different severity of pulmonary mechanics, organ dysfunction, chest CT features, and outcomes. Second, we detected a significant HTE between phenotypes and treatment strategies in the ALVEOLI and FACTT trials. Third, we developed two simplified probabilistic models to predict ARDS phenotypes, potentially applicable to other cohorts.

Longitudinal phenotypes in present study differed from the prior ARDS phenotypes in several critical aspects. First, different variables were applied to identify the phenotypes. Calfee and colleagues focused mainly on the degree of inflammatory conditions of ARDS, and first derived two inflammatory phenotypes using numerous inflammatory factors in the ALVEOLI trial [7]. The two inflammatory phenotypes were subsequently identified in other ARDSNet trials and cohorts [22, 23]. Recently, two studies declared that inflammatory phenotypes could also be accurately predicted without biomarker data using supervised learning approaches [24, 25]. Another study identified three phenotypes of ARDS based on routine medical data in the eICU database [26]. From another perspective, we concentrated primarily on pulmonary mechanics and organ function, including mechanical power and ventilatory ratio, which were both significantly associated with mortality in ARDS [27, 28], and had never been employed to identify phenotypes in traditional ARDS. Second, the previous ARDS phenotypes described above were largely used cross-sectional data, unlike in our study using longitudinal data. While longitudinal data might be more informative in identifying phenotypes. In a prospective cohort study including COVID-19 related ARDS patients,  Bos LDJ et al. [29] did not yield any latent classes using cross-sectional data, but found that a two-class model best fit the cohort using the longitudinal data.

Our phenotypes were partially consistent with the two longitudinal respiratory subphenotypes in COVID-19 patients [29]. Specifically, subphenotype 1 was characterized by the least abnormalities of pulmonary mechanics, while subphenotype 2 was characterized by increasing minute ventilation, mechanical power and ventilatory ratio. The characteristics of Class 1 and Class 2 in our study were similar to subphenotype 1 and subphenotype 2, respectively. Additionally, we discovered a novel longitudinal phenotype (Class 3) which was predominantly characterized by extra-pulmonary dysfunction, since we enrolled patients with other causes-related ARDS and the model included more markers of organ dysfunction.

Phenotypes of disease could change dynamically accompanied by treatment response and disease progression. Limited studies have addressed the dynamic change of ARDS phenotypes. Delucchi and colleagues [12] performed a secondary analysis of the ARMA and ALVEOLI trials to determine the stability of ARDS subphenotypes over time. LCA was conducted separately at day 0 and day 3 using inflammatory factors, and a two-class model was identified in both day 0 and day 3. Latent transition analysis demonstrated that most patients (> 94%) stayed in the same class from day 0 to day 3 in both trials. However, whether the levels of inflammatory factors in Class 2 on day 3 were similar to Class 2 on day 0 was unclear, since they performed LCA independently on days 0 and 3. That said, Class 2 on day 3 might differ from Class 2 on day 0. In comparison, we found that 56.9% of the patients changed their phenotypes at least once during the study days. Except for the different variables included in the model, the different approaches might explain the difference in proportions. In present study, LCA was employed using the longitudinal data, which was fit to the combined datasets from all patients across all study days. The values of pulmonary mechanics or organ dysfunction markers were similar in same class across all study days.

The radiographic severity differed in phenotypes in our study. Previous research provided conflicting evidence concerning the association between radiographic severity and disease severity of ARDS patients. A prospective cohort study employed a radiographic assessment of lung edema (RALE) score to reflect the radiographic severity, and found that the RALE score was neither associated with ARDS severity grouped by PaO2/FiO2 ratio nor pulmonary mechanics [30]. Whereas a secondary analysis of the FACTT trial showed that a lower baseline RALE score was independently associated with a higher PaO2/FiO2 ratio [31]. RALE score was calculated based on the chest X-ray, which might limit the diagnostic accuracy of radiographic severity. We employed quantitative CT analysis to assess the radiographic severity directly, and found that patients with pulmonary mechanical dysfunction phenotype had the lowest proportion of normally aerated lung volume and the highest proportion of poorly-aerated lung volume compared to other phenotypes.

Accurate and precise phenotypes will more effectively guide individualized treatment strategies. The original FACTT trial found no difference in 60-day mortality between conservative and liberal fluid management [14]. After that, Famous et al. discovered that the conservative strategy was associated with improved mortality in patients with hyperinflammatory phenotype but had the opposite effect in patients with hypoinflammatory phenotype [9]. Inconsistent with the previous study, we found that fluid management strategies had no effect in Class 1 patients, while Class 2 patients can particularly benefit from conservative fluid strategy and Class 3 patients (similar to hyperinflammatory phenotype) can more strongly benefit from the liberal fluid strategy. Several explanations exist. Compared to other phenotypes in the FACTT trial, Class 3 was more likely to use vasopressor, had the lowest pH, serum bicarbonate and mean arterial blood pressure. Together these observations strongly implied inadequate effective circulating blood volume in Class 3 patients, whose are required early aggressive fluid resuscitation. A retrospective study included septic patients with ARDS and declared that patients who received adequate early fluid administration followed by later conservative fluid management had the lowest mortality [32]. More research regarding the effect of fluid management strategy on mortality in various phenotypes of ARDS is needed.

We also detected a significant interaction between phenotypes and PEEP strategy on 60-day mortality in the ALVEOLI trial [13]. The effect of PEEP is primarily related to the balance between the number of alveoli that are recruited to participate in ventilation and the amount of lung that is overdistended when PEEP is applied. Unlike inflammatory response-guided PEEP strategy in the previous study [8], we chose PEEP strategy from a physiological perspective. Class 2 in the ALVEOLI trial was characterized by the highest driving pressure, ventilatory ratio and PaCO2, which suggested a higher proportion of dead space in Class 2 patients. This can interpret the beneficial effect of lower PEEP in Class 2. No significant interaction between phenotypes and treatment strategies was observed in the EDEN and SAILS trials. Possible reasons have been discussed elsewhere [33].

Our study is the first to identify longitudinal phenotypes for various etiologies ARDS based on comprehensive metrics and multi-database. Unlike previous inflammatory phenotypes, we also detected novel HTE between longitudinal phenotypes and treatment strategies (PEEP and Fluid management strategy). This study also has several limitations. First, although we identified and validated the longitudinal phenotypes in five separate cohorts, several factors might affect the robustness of phenotypes, such as the missing data, the treatments and ventilator settings. In four ARDSNet trials, the phenotypes can be affected by the randomization arms, although we excluded PEEP in the ALVEOLI trial and fluid balance in the FACTT trial when performing LCA, we cannot exclude the impact of the randomization arms since the respiratory variables are both mathematically and physiologically coupled. Meanwhile, such variations can create inconsistencies in the construct of phenotypes between studies. Second, most patients changed their states at least once during the study days, which may limit the translational premise of our approach for therapeutic targeting in future ARDS clinical trials. Third, the inflammatory factors are limited in CDIC and ARDSNet trials. Therefore, our phenotypes cannot compare to the inflammatory phenotypes directly and precisely. Meanwhile, the performance of parsimonious models was poor for predicting Class 3, since Class 3 was characterized as the pro-inflammatory ARDS, and we did not include any inflammatory factors in the model. More studies are required further to investigate the predictive performance of the model after including inflammatory factors. Fourth, we found that the longitudinal phenotypes responded differently to treatment, which was derived from a secondary analysis of ARDSNet trials. Whether the results reflect the true biology was unclear, and the treatment benefits need prospective validation. Additionally, the significant HTE was only against Day 0 phenotypes and may not against longitudinal assignments. Finally, the longitudinal phenotypes are dynamic and patients can switch classes during the first four days of invasive ventilation, although we developed models to predict phenotypes on Day 0 and Day 2, we can neither predict the phenotypes at any time nor the change of the phenotypes.

Conclusion

In this retrospective analysis of a multi-database from patients with ARDS, three novel longitudinal phenotypes were identified, with various sites and severity of organ dysfunction, and different clinical outcomes. Most patients changed their phenotypes at least once during the first four days of invasive ventilation. Besides, the analysis suggested heterogeneity of treatment effects within phenotypes on Day 0 in the ALVEOLI and FACTT trials.

Availability of data and material

For data in CDIC cohort, data are available upon reasonable request and with the approval from the Department of Critical Care Medicine, Zhongda Hospital, School of Medicine, Southeast University. The datasets of ARDSNets are available in the BioLINCC website (https://biolincc.nhlbi.nih.gov).

Abbreviations

ARDS:

Acute respiratory distress syndrome

RCTs:

Randomized controlled trials

CDIC:

Chinese Database in Intensive Care

NHLBI:

National Heart, Lung, and Blood Institute

ICU:

Intensive Care Unit

VFDs:

Ventilator-free days

NMBAs:

Neuromuscular blocking agents

LCA:

Latent class analysis

PEEP:

Positive end-expiratory pressure

MICE:

Multiple imputations with chained equations

GBTM:

Group-based trajectory modeling

HTE:

Heterogeneity of treatment effect

XGBoost:

Extreme gradient boosting

GBM:

Gradient boosted model

AUROC:

Area under the receiver operating characteristic curves

IL-6:

Interleukin-6

SICAM-1:

Soluble intercellular adhesion molecule-1

References

  1. Ranieri VM, Rubenfeld GD, Thompson BT, Ferguson ND, Caldwell E, Fan E, Camporota L, Slutsky AS. Acute respiratory distress syndrome: the Berlin Definition. JAMA. 2012;307(23):2526–33.

    PubMed  Google Scholar 

  2. Meyer NJ, Gattinoni L, Calfee CS. Acute respiratory distress syndrome. Lancet. 2021;398(10300):622–37.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. McNicholas B, Madden MG, Laffey JG. Machine learning classifier models: the future for acute respiratory distress syndrome phenotyping? Am J Respir Crit Care Med. 2020;202(7):919–20.

    Article  PubMed  PubMed Central  Google Scholar 

  4. Matthay MA, Arabi YM, Siegel ER, Ware LB, Bos LDJ, Sinha P, Beitler JR, Wick KD, Curley MAQ, Constantin JM, et al. Phenotypes and personalized medicine in the acute respiratory distress syndrome. Intensive Care Med. 2020;46(12):2136–52.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  5. Sinha P, Calfee CS. Phenotypes in acute respiratory distress syndrome: moving towards precision medicine. Curr Opin Crit Care. 2019;25(1):12–20.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Reddy K, Sinha P, O’Kane CM, Gordon AC, Calfee CS, McAuley DF. Subphenotypes in critical care: translation into clinical practice. Lancet Respir Med. 2020;8(6):631–43.

    Article  PubMed  Google Scholar 

  7. Calfee CS, Delucchi K, Parsons PE, Thompson BT, Ware LB, Matthay MA, Network NA. Subphenotypes in acute respiratory distress syndrome: latent class analysis of data from two randomised controlled trials. Lancet Respir Med. 2014;2(8):611–20.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Calfee CS, Delucchi K, Parsons PE, Thompson BT, Ware LB, Matthay MA. Subphenotypes in acute respiratory distress syndrome: latent class analysis of data from two randomised controlled trials. Lancet Respir Med. 2014;2(8):611–20.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Famous KR, Delucchi K, Ware LB, Kangelaris KN, Liu KD, Thompson BT, Calfee CS. Acute respiratory distress syndrome subphenotypes respond differently to randomized fluid management strategy. Am J Respir Crit Care Med. 2017;195(3):331–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Wendel Garcia PD, Caccioppola A, Coppola S, Pozzi T, Ciabattoni A, Cenci S, Chiumello D. Latent class analysis to predict intensive care outcomes in acute respiratory distress syndrome: a proposal of two pulmonary phenotypes. Crit Care. 2021;25(1):154.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Mayr F, Tang L, Ou Y, Chang CH, Wang S, Kennedy JN, Seymour CW, Angus DC, Yende S. Sepsis phenotypes are dynamic and associated with long-term outcomes. Am J Respir Crit Care Med. 2020;201:7755.

    Google Scholar 

  12. Delucchi K, Famous KR, Ware LB, Parsons PE, Thompson BT, Calfee CS, Network A. Stability of ARDS subphenotypes over time in two randomised controlled trials. Thorax. 2018;73(5):439–45.

    Article  PubMed  Google Scholar 

  13. Brower RG, Lanken PN, MacIntyre N, Matthay MA, Morris A, Ancukiewicz M, Schoenfeld D, Thompson BT, National Heart L, Blood Institute ACTN. Higher versus lower positive end-expiratory pressures in patients with the acute respiratory distress syndrome. N Engl J Med. 2004;351(4):327–36.

    Article  PubMed  Google Scholar 

  14. Wiedemann HP, Wheeler AP, Bernard GR, Thompson BT, Hayden D, deBoisblanc B, Connors AF Jr, Hite RD, Harabin AL. Comparison of two fluid-management strategies in acute lung injury. N Engl J Med. 2006;354(24):2564–75.

    Article  CAS  PubMed  Google Scholar 

  15. Rice TW, Wheeler AP, Thompson BT, Steingrub J, Hite RD, Moss M, Morris A, Dong N, Rock P. Initial trophic vs full enteral feeding in patients with acute lung injury: the EDEN randomized trial. JAMA. 2012;307(8):795–803.

    Article  PubMed  Google Scholar 

  16. McAuley DF, Laffey JG, O’Kane CM, Perkins GD, Mullan B, Trinder TJ, Johnston P, Hopkins PA, Johnston AJ, McDowell C, et al. Simvastatin in the acute respiratory distress syndrome. N Engl J Med. 2014;371(18):1695–703.

    Article  PubMed  Google Scholar 

  17. Hofmanninger J, Prayer F, Pan J, Rohrich S, Prosch H, Langs G. Automatic lung segmentation in routine imaging is primarily a data diversity problem, not a methodology problem. Eur Radiol Exp. 2020;4(1):50.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Gattinoni L, Caironi P, Pelosi P, Goodman LR. What has computed tomography taught us about the acute respiratory distress syndrome? Am J Respir Crit Care Med. 2001;164(9):1701–11.

    Article  CAS  PubMed  Google Scholar 

  19. Ma P, Liu J, Shen F, Liao X, Xiu M, Zhao H, Zhao M, Xie J, Wang P, Huang M, et al. Individualized resuscitation strategy for septic shock formalized by finite mixture modeling and dynamic treatment regimen. Crit Care. 2021;25(1):243.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Nagin DS, Jones BL, Passos VL, Tremblay RE. Group-based multi-trajectory modeling. Stat Methods Med Res. 2018;27(7):2015–23.

    Article  PubMed  Google Scholar 

  21. Zhang Z, Reinikainen J, Adeleke KA, Pieterse ME, Groothuis-Oudshoorn CGM. Time-varying covariates and coefficients in Cox regression models. Ann Transl Med. 2018;6(7):121.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Bos LD, Schouten LR, van Vught LA, Wiewel MA, Ong DSY, Cremer O, Artigas A, Martin-Loeches I, Hoogendijk AJ, van der Poll T, et al. Identification and validation of distinct biological phenotypes in patients with acute respiratory distress syndrome by cluster analysis. Thorax. 2017;72(10):876–83.

    Article  CAS  PubMed  Google Scholar 

  23. Sinha P, Delucchi KL, McAuley DF, O’Kane CM, Matthay MA, Calfee CS. Development and validation of parsimonious algorithms to classify acute respiratory distress syndrome phenotypes: a secondary analysis of randomised controlled trials. Lancet Respir Med. 2020;8(3):247–57.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Maddali MV, Churpek M, Pham T, Rezoagli E, Zhuo H, Zhao W, He J, Delucchi KL, Wang C, Wickersham N, et al. Validation and utility of ARDS subphenotypes identified by machine-learning models using clinical data: an observational, multicohort, retrospective analysis. Lancet Respir Med. 2022;10(4):367–77.

    Article  PubMed  Google Scholar 

  25. Sinha P, Churpek MM, Calfee CS. Machine learning classifier models can identify acute respiratory distress syndrome phenotypes using readily available clinical data. Am J Respir Crit Care Med. 2020;202(7):996–1004.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Liu X, Jiang Y, Jia X, Ma X, Han C, Guo N, Peng Y, Liu H, Ju Y, Luo X, et al. Identification of distinct clinical phenotypes of acute respiratory distress syndrome with differential responses to treatment. Crit Care. 2021;25(1):320.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Serpa Neto A, Deliberato RO, Johnson AEW, Bos LD, Amorim P, Pereira SM, Cazati DC, Cordioli RL, Correa TD, Pollard TJ, et al. Mechanical power of ventilation is associated with mortality in critically ill patients: an analysis of patients in two observational cohorts. Intensive Care Med. 2018;44(11):1914–22.

    Article  CAS  PubMed  Google Scholar 

  28. Sinha P, Calfee CS, Beitler JR, Soni N, Ho K, Matthay MA, Kallet RH. Physiologic analysis and clinical performance of the ventilatory ratio in acute respiratory distress syndrome. Am J Respir Crit Care Med. 2019;199(3):333–41.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Bos LDJ, Sjoding M, Sinha P, Bhavani SV, Lyons PG, Bewley AF, Botta M, Tsonas AM, Serpa Neto A, Schultz MJ, et al. Longitudinal respiratory subphenotypes in patients with COVID-19-related acute respiratory distress syndrome: results from three observational cohorts. Lancet Respir Med. 2021;9(12):1377–86.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Kotok D, Yang L, Evankovich JW, Bain W, Dunlap DG, Shah F, Zhang Y, Manatakis DV, Benos PV, Barbash IJ, et al. The evolution of radiographic edema in ARDS and its association with clinical outcomes: a prospective cohort study in adult patients. J Crit Care. 2020;56:222–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  31. Warren MA, Zhao Z, Koyama T, Bastarache JA, Shaver CM, Semler MW, Rice TW, Matthay MA, Calfee CS, Ware LB. Severity scoring of lung oedema on the chest radiograph is associated with clinical outcomes in ARDS. Thorax. 2018;73(9):840–6.

    Article  PubMed  Google Scholar 

  32. Murphy CV, Schramm GE, Doherty JA, Reichley RM, Gajic O, Afessa B, Micek ST, Kollef MH. The importance of fluid management in acute lung injury secondary to septic shock. Chest. 2009;136(1):102–9.

    Article  PubMed  Google Scholar 

  33. Sinha P, Delucchi KL, Thompson BT, McAuley DF, Matthay MA, Calfee CS, Network NA. Latent class analysis of ARDS subphenotypes: a secondary analysis of the statins for acutely injured lungs from sepsis (SAILS) study. Intensive Care Med. 2018;44(11):1859–69.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

None.

Funding

The present study is supported by the Key Research and Development Plan of Jiangsu Province (BE2021734) and Yilu “Gexin”—Fluid Therapy Research Fund Project (YLGX-ZZ-2020002).

Author information

Authors and Affiliations

Authors

Contributions

YY, LL, YH, FG and QH had the idea of the study, conceptualized the research aims, YY and HC design the study, and take responsibility for the integrity of the data and the accuracy of the data analysis. HC, CP, JX and SL contributed to the acquisition of data, HC and JX doing the statistical analysis, QY performed the CT analyses. HC wrote the first draft of the paper and other authors provided comments and approved the final manuscript.

Corresponding author

Correspondence to Yi Yang.

Ethics declarations

Ethic approval and consent to participate

The present study was approved by the Research Ethics Commission of Zhongda Hospital Southeast University (2022ZDSYLL082-P01). For ARDSNet trials, all data were approved by Biologic Specimen and Data Repository Information Coordinating Center (BioLINCC, https://biolincc.nhlbi.nih.gov).

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1

. Supplemental methods, Figures and Tables.

Additional file 2

. R code for data analysis.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chen, H., Yu, Q., Xie, J. et al. Longitudinal phenotypes in patients with acute respiratory distress syndrome: a multi-database study. Crit Care 26, 340 (2022). https://doi.org/10.1186/s13054-022-04211-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13054-022-04211-w

Keywords