Skip to main content

Predictors for extubation failure in COVID-19 patients using a machine learning approach



Determining the optimal timing for extubation can be challenging in the intensive care. In this study, we aim to identify predictors for extubation failure in critically ill patients with COVID-19.


We used highly granular data from 3464 adult critically ill COVID patients in the multicenter Dutch Data Warehouse, including demographics, clinical observations, medications, fluid balance, laboratory values, vital signs, and data from life support devices. All intubated patients with at least one extubation attempt were eligible for analysis. Transferred patients, patients admitted for less than 24 h, and patients still admitted at the time of data extraction were excluded. Potential predictors were selected by a team of intensive care physicians. The primary and secondary outcomes were extubation without reintubation or death within the next 7 days and within 48 h, respectively. We trained and validated multiple machine learning algorithms using fivefold nested cross-validation. Predictor importance was estimated using Shapley additive explanations, while cutoff values for the relative probability of failed extubation were estimated through partial dependence plots.


A total of 883 patients were included in the model derivation. The reintubation rate was 13.4% within 48 h and 18.9% at day 7, with a mortality rate of 0.6% and 1.0% respectively. The grandient-boost model performed best (area under the curve of 0.70) and was used to calculate predictor importance. Ventilatory characteristics and settings were the most important predictors. More specifically, a controlled mode duration longer than 4 days, a last fraction of inspired oxygen higher than 35%, a mean tidal volume per kg ideal body weight above 8 ml/kg in the day before extubation, and a shorter duration in assisted mode (< 2 days) compared to their median values. Additionally, a higher C-reactive protein and leukocyte count, a lower thrombocyte count, a lower Glasgow coma scale and a lower body mass index compared to their medians were associated with extubation failure.


The most important predictors for extubation failure in critically ill COVID-19 patients include ventilatory settings, inflammatory parameters, neurological status, and body mass index. These predictors should therefore be routinely captured in electronic health records.


The decision to extubate a COVID-19 patient can be challenging and a delicate trade-off between early and postponed extubation. In non-COVID patients, extubation failure occurs in 10–20% of intensive care cases and is associated with increased mortality [1]. While postponing extubation and waiting for further clinical improvement appears sensible, unnecessary extubation delays may lead to more ventilator-associated complications and inefficient use of scarce intensive care resources [2, 3].

An understanding of the risk factors for extubation failure will aid the clinician in determining the optimal time point for extubation. Previous studies in non-COVID-19 patients have investigated numerous factors related to extubation outcome, including age, maximum inspiratory pressure, and the rapid shallow breathing index [4]. However, given the complex interplay of many patient and treatment related characteristics in extubation success, a single parameter rarely provides sufficient accuracy to guide decision making [5]. Moreover, it remains largely unclear whether these parameters are similar for COVID-19 patients [6].

The collection of large intensive care datasets that span the entire intensive care admission paves the way for machine learning models to capture this complex interplay of predictors by using machine learning models. Previous non-COVID-19 machine learning work has aimed to predict simple and difficult weaning [7] and extubation failure [8,9,10,11,12,13,14,15]. However, data was frequently from over a decade ago, mechanical ventilator data was usually lacking, and no data was included from COVID-19 patients. Taken together, we identify an opportunity for machine learning models to predict unsuccessful extubation in critically ill COVID-19 patients.

We created the Dutch Data Warehouse (DDW), a multicenter database with critically ill COVID-19 patients [16]. All structured electronic health record (EHR) data for these patients have been combined and cleaned for research purposes. These data therefore represent the structured EHR data readily available to the intensivist at the bedside. In this study, we aim to identify and validate the most important predictors for extubation failure in COVID-19 patients.


This study follows the transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD) guidelines [17].

Data source

All data came from the DDW, a large, multicenter, full-admission, electronic health record data warehouse with data from critically ill COVID-19 patients in the Netherlands [16]. The data warehouse currently contains 3464 patients admitted between the beginning of the crisis in March 2020 and March 2021. Data spans both the first and second wave of ICU admissions from 25 hospitals in the Netherlands. The institutional review board of Amsterdam University Medical Center location VUmc waived the need for informed consent from individual patients and approved of an opt out procedure.


All critically ill patients extubated after more than 24 h of invasive mechanical ventilation were eligible for inclusion. Transferred patients were included if the transfer destination data were available. We excluded patients transferred before extubation or within 1 day after extubation in case the transfer destination data were not available. Patients transferred more than 24 h after extubation were assumed to be fit for transport and classified as successful extubations. Patients still admitted at the time of data collection were excluded.


The primary outcome was unsuccessful separation from invasive mechanical ventilation defined according to the WIND criteria [18], which mandate an extubation without reintubation or death within the next 7 days, or discharge from the ICU without invasive mechanical ventilation within 7 days [18]. The use of non-invasive ventilation is disregarded in this definition. As secondary outcomes, we applied the same criteria to a 48 h’ time window after extubation. The definition of extubation in EHR data has been published previously and reasonably excludes palliative care patients [16]. We did not distinguish between accidental and elective extubations as the reason for extubation is not routinely recorded.

Predictors and scoping literature search

Potential predictors for modeling were selected by a team of intensivists. Notably, the list included medication and fluid balance. To facilitate the selection process, machine learning studies that predict extubation failure were identified in the literature. Each of the identified articles was scanned full-text and included predictors were extracted. The total list of studies can be found in Additional file 1: Table S1. In addition, to account for the wide variety of ventilator settings in the DDW, the parameters from the landmark paper by Amato et al. on the association between ventilator parameters and outcome were included in the selection [19]. The mean or last value from the last 24 h before extubation as specified by the team of intensivists were included to facilitate interpretation of the model. The total dose in the last 24 h was included for the medications. For any predictor pair with an interpredictor correlation higher than 0.5, the most clinically insightful predictor was selected. The full list of predictors can be found in Table 1.

Table 1 Included parameters


Across all 25 hospitals, a nested cross validation was performed to assess model performance. First, the data was split into five equally large sets called outer folds. These outer folds were then each split into a train and test set. Each of the train sets was again divided into five subsets called the inner folds. A model was trained on these 5 inner folds with a randomized hyperparameter search. Model performance after training on these inner folds was then tested on the corresponding outer fold test set. Importantly, observations belonging to the same patient were always kept in the same split to prevent leakage of information. The overall model performance was the average of all outer fold test set performances.

We trained a logistic regression model, decision trees, and an XGBoost algorithm. These models were selected for their ease of determining predictor importance. Model performance was gauged with the area under the receiver operating characteristic (AUROC), Brier score, average precision, and calibration loss. Data imputation, standardization and automated feature selection were carried out on each outer fold separately. Missing values were imputed with the median and predictors were standardized to have a mean of 0 and a standard deviation of 1. Lasso regression was performed for automatic feature selection, and the L1 regularization term was optimized together with the other hyperparameters [20].

Predictor importance was estimated with the Shapley additive explanation (SHAP) framework. SHAP values represent a predictor’s marginal contribution to the overall prediction [21] and are state of the art in machine learning explainability. Moreover, Partial Dependence Plots (PDPs) were created to visualize the average change in probability of successful extubation for all values of a predictor while keeping all other predictors constant [22]. Standard deviations represent the distribution of the data. All analyses were carried out in Python 3.8 (Python software foundation).


Population and outcome

A total of 2.421 patients were mechanically ventilated during their ICU stay. In case of a patient transfer, data from the transferring and receiving hospital were merged when available. We excluded 517 transfers for which outcome or admission data were lacking, 123 patients that were still intubated when data were extracted, and 139 patients that were intubated less than 24 h. 568 patients died on the mechanical ventilator before their first extubation attempt and 191 patients received a tracheostomy. As a result, a total of 883 patients were included in the modeling. The reintubation rate in this COVID-19 population was 18.9% within 7 days and 13.4% within 48 h. The mortality rate was 1.0% within 7 days and 0.6% in the first 48 h after extubation. Patient characteristics are outlined in Table 2.

Table 2 Patient characteristics


Model performance for the primary outcome is shown in Additional file 1: Table S2 for each of the models. The XGBoost algorithm yielded the highest performance with an AUROC of 0.70, outperforming logistic regression (AUROC 0.67) and a decision tree (AUROC 0.59). Model performance for the prediction of unsuccessful extubation 48 h after extubation is presented in Additional file 1: Table S2. All algorithms, XGBoost (AUROC 0.67), logistic regression (0.66), and a decision tree (AUROC 0.54), performed worse compared to the primary outcome.

Predictor importance

Predictor importance was calculated with the XGBoost model since it yielded the highest performance. The SHAP values for the highest predictors are shown in Fig. 1. The most important predictive feature of extubation failure was the last FiO2 value before extubation. The majority of important predictors can be grouped into ventilatory characteristics, inflammation markers, neurological status and body mass index.

Fig. 1
figure 1

SHAP values for most important predictors of extubation failure. Overview of SHAP values for the top 20 predictors of successful extubation (negative SHAP values) or unsuccessful extubation (positive SHAP values). Features are ordered according to importance. FiO2: fraction of inspired oxygen, IBW: ideal body weight, PEEP: positive end expiratory pressure, P/F ratio: PaO2/FiO2 ratio

Ventilatory characteristics

Ventilatory characteristics are shown in Table 2. A short time-period between the last controlled mode and extubation, and a longer duration in controlled mode throughout the course of mechanical ventilation were associated with unsuccessful extubation. The PD-plots depict the difference in predicted probability of extubation failure compared to the median value for all of the observed values. The PD-plot shows a time since the last controlled mode shorter than 2 days and a controlled mode duration longer than 4 days are associated with increased chances of unsuccessful extubation compared to the median value.

For the ventilator settings, a higher fraction of inspired oxygen and a higher average tidal volume in the last 24 h are predictive of extubation failure. The PD-plot in Fig. 2 shows that an FiO2 above 35% or a tidal volume per kg ideal body weight above 8 ml/kg compared to their median values increases the probability of unsuccessful extubation. The median PEEP was 8 cmH2O (IQR 5–8 cmH2O) before extubation, with a median pressure support of 6 cmH2O (IQR 5–9 cmH2O). No patients received PEEP levels below 5 cmH2O, while pressure above PEEP was below 5 cmH2O in 7.3% of patients.

Fig. 2
figure 2

Partial dependence plots. PD-plot for the last FiO2 recording, mean glasgow coma score and tidal volume per kg ideal body weight in the last 24 h, and duration of the controlled mode

Inflammation markers, neurological scores and body mass index

Both a higher CRP, an elevated leukocyte count and higher thrombocyte count in the 24 h preceding extubation are predictors of an unsuccessful extubation attempt, while temperature was not in the top predicting features. For neurological scores, on the other hand, low EMV scores predict unsuccessful extubation. Lastly, BMI showed an inverse relationship with extubation failure; patients with a higher BMI had a lower probability of extubation failure. An increase in the chances of unsuccessful extubation is observed below 28 kg/m2 compared to the median in the PD-plot (shown in Additional file 1: Fig. S1).


To the best of our knowledge, this is the first study that identifies predictors for extubation failure in critically ill COVID-19 patients from a large and multicenter cohort that contains a wide variety of routinely collected clinical predictors. The most important predictors of extubation failure are ventilatory characteristics, inflammatory parameters, GCS score, and body mass index. These risk factors may aid intensive care professionals in selecting the optimal time point for extubation.

This study is unique as it provides predictive modeling of extubation failure across twenty-five hospitals. All previous machine learning studies in non-COVID patients for predicting extubation failure have been single center [7,8,9,10,11,12,13,14,15]. Model performance was higher in these studies, presumably due to overfitting resulting from the sole use of local data. Algorithms may be biased towards local extubation practices and extubation readiness assessments, making these models less generalizable to other clinical settings.

In our study, ventilatory characteristics, including ventilator settings, are the most important risk factors for extubation failure. These factors are systematically and frequently recorded by the ventilators, and are potentially modifiable. Two of the most important predictors associated with higher chances of failed extubation are the duration of the controlled and assisted ventilation modes prior to extubation. A longer time in a controlled mode was a stronger predictor than the total duration of mechanical ventilation. Moreover, a longer time in assisted mode was associated with improved chances of successful extubation. A possible explanation may be the reduced activity and consequent atrophy of the diaphragm or other skeletal muscles in controlled modes [23, 24]. Of note, none of the previous machine learning studies included the duration of controlled ventilation as a predictor. Our results show that the duration of ventilation modes should be recorded and taken into account when assessing extubation readiness.

For the ventilator settings, a higher FiO2 before extubation was associated with an increased risk of extubation failure. A higher FiO2 may indicate incomplete resolution of pulmonary pathology. Higher PEEP levels, on the other hand, were associated with better extubation success. The interquartile ranges of PEEP are low, however, indicating low PEEP is common practice before an extubation attempt. In addition, we observed that higher mean tidal volumes corrected for the ideal body weight in the last day before extubation were an important predictor of extubation failure. Patients with high average tidal volumes may suffer from more lung injury that may increase the risk of unsuccessful extubation [25]. While most of the ventilator settings are readily available, relevant respiratory system maneuvers such as spontaneous breathing trials, tracheobronchial suctioning and maximum inspiratory pressure that would ideally be included, were inconsistently recorded in the EHR systems and therefore not included in modelling. To evaluate their predicting importance in extubation failure, data of these maneuvers need to be incorporated systematically in the EHR.

Other important predictors included signals of ongoing or developing inflammation, poorer neurological status, and body mass index. Inflammation parameters are routinely determined in most intensive care units when extubation decisions are made. Conversely, neurological scores can be ambivalently scored in the intensive care unit. The Glasgow Coma Scale was originally designed for brain damage patients [26], but is used for the general intensive care patient. Unequivocal interpretation of sedated states, however, may hamper the use of this scale in the context of extubation readiness. Based on these results, we would recommend systematically recording and evaluating the predictive value of other scores like the Richmond agitation sedation scales.

Lastly, body mass index upon admission had an inverse relationship with extubation failure. Apart from one small study that found an association between BMI and post extubation stridor [27], no other studies were identified that found BMI to be an important predictor. As in any predictive study, the effect of BMI may be explained by an unmeasured predictor or a selection bias. That means, a low-BMI patient would have to be sicker to be admitted to the ICU. A negligible correlation was found between BMI and SOFA score, however, as an indicator of illness severity. Previous studies have also shown that BMI is uncorrelated with immunological responses or adverse outcomes [28]. Overall, once in the ICU, BMI is not related to higher chances of unsuccessful extubation and may not be a valid reason to postpone extubation.

Our study has several limitations. We aim to apply a holistic set of predictors across centers to assess extubation readiness. In routine practice, however, individualized treatment and diagnostic decisions result in variation of available parameters [29], and predictors may be unavailable in the 24 h prior to extubation. For example, it is not possible to conclude that cardiac markers like NT-pro-BNP or troponin do not aid in the prediction of extubation failure, because these markers were not routinely determined. Along the same line, we had to merge groups of medications, because individual drugs may not be administered frequently enough to be useful in the modeling. To truly exploit the predictive power of machine learning models, we should strive to systematically record the predictors of interest and determine which algorithms work in what clinical circumstances [30].

A further limitation is the missing outcome data because of patient transfers to centers not included in this project. The potential bias is considered small, as we connected all patients’ stays whenever available and transferred patients had similar baseline characteristics as the study population as a whole [31]. Lastly, the relationships identified in this study are associations and do not equal causation. As with any clinical observational dataset, we cannot observe counterfactual states; once a patient is extubated we irretrievably lose the outcome in case the patient would have been kept on mechanical ventilation. While many of the ventilatory settings are predictive of extubation failure, we would ultimately be interested in the effects of continuing mechanical ventilation for another day on extubation success. We believe that these results will provide a crucial step for other study designs to investigate the causal relation between modifiable predictors and successful extubation.


This is the first study to identify risk factors of extubation failure in a large multi-center cohort of critically ill COVID-19 patients. The large number of hospitals included limits the risk of overfitting due to specific local practices. From a large set of clinically important predictors, ventilatory characteristics, inflammatory markers, neurological status and BMI were most important predictors for failed extubation. These predictors should be taken into account to determine extubation readiness.

Availability of data and materials

The Dutch Data Warehouse is available for global collaboration, within the restrictions imposed by privacy laws and ethics. Information on how to access the data warehouse may be found at



Acute physiology and chronic health evaluation iI


Area under the receiver operating characteristics curve


Body mass index


C-reactive protein


Dutch data warehouse


Electronic health record

EMV score:

Eye motor verbal score

FiO2 :

Fraction of inspired oxygen


Glasgow coma scale


Interquartile range


Partial dependence plot


Positive end expiratory pressure


Richmond agitation and sedation scales


Shapley additive explanation


Sequential organ failure assessment


  1. 1.

    Thille AW, Richard J-CM, Brochard L. The decision to extubate in the intensive care unit. Am J Respir Crit Care Med. 2013;187:1294–302.

    Article  Google Scholar 

  2. 2.

    Slutsky AS, Ranieri VM. Ventilator-induced lung injury. N Engl J Med. 2013;369:2126–36.

    CAS  Article  Google Scholar 

  3. 3.

    Ventilator-Associated Events: Prevalence, Outcome, and Relat ionship With Ventilator-Associated Pneumonia. Critical Care Medicine [Internet]. [cited 2021 Jun 17]. Available from:,_Outcome,.3.aspx

  4. 4.

    Baptistella AR, Sarmento FJ, da Silva KR, Baptistella SF, Taglietti M, Zuquello RÁ, et al. Predictive factors of weaning from mechanical ventilation and extubation outcome: a systematic review. J Crit Care. 2018;48:56–62.

    Article  Google Scholar 

  5. 5.

    Heunks LM, van der Hoeven JG. Clinical review: The ABC of weaning failure—a structured approach. Crit Care. 2010;14:245.

    Article  Google Scholar 

  6. 6.

    Ionescu F, Zimmer MS, Petrescu I, Castillo E, Bozyk P, Abbas A, et al. Extubation failure in critically ill COVID-19 patients: risk factors and impact on in-hospital mortality. J Intensive Care Med. 2021;36:1018–24.

    Article  Google Scholar 

  7. 7.

    Hsieh MH, Hsieh MJ, Cheng A-C, Chen C-M, Hsieh C-C, Chao C-M, et al. Predicting weaning difficulty for planned extubation patients with an artificial neural network. Medicine (Baltimore). 2019;98:e17392.

    Article  Google Scholar 

  8. 8.

    Fabregat A, Magret M, Ferré JA, Vernet A, Guasch N, Rodríguez A, et al. A machine learning decision-making tool for extubation in Intensive care unit patients. Comput Methods Programs Biomed. 2021;200:105869.

    Article  Google Scholar 

  9. 9.

    Hsieh M-H, Hsieh M-J, Chen C-M, Hsieh C-C, Chao C-M, Lai C-C. An artificial neural network model for predicting successful extubation in intensive care units. J Clin Med. 2018;7:240.

    Article  Google Scholar 

  10. 10.

    Tsai T-L, Huang M-H, Lee C-Y, Lai W-W. Data science for extubation prediction and value of information in surgical intensive care unit. J Clin Med. 2019;8:1709.

    Article  Google Scholar 

  11. 11.

    Lin M-Y, Li C-C, Lin P-H, Wang J-L, Chan M-C, Wu C-L, et al. Explainable machine learning to predict successful weaning among patients requiring prolonged mechanical ventilation: a retrospective cohort study in central Taiwan. Front Med. 2021;8:663739.

    Article  Google Scholar 

  12. 12.

    Jia Y, Kaul C, Lawton T, Murray-Smith R, Habli I. Prediction of weaning from mechanical ventilation using convolutional neural networks. Artif Intell Med. 2021;117:102087.

    Article  Google Scholar 

  13. 13.

    Zhao Q-Y, Wang H, Luo J-C, Luo M-H, Liu L-P, Yu S-J, et al. Development and validation of a machine-learning model for prediction of extubation failure in intensive care units. Front Med. 2021;8:676343.

    Article  Google Scholar 

  14. 14.

    Kuo H-J, Chiu H-W, Lee C-N, Chen T-T, Chang C-C, Bien M-Y. Improvement in the prediction of ventilator weaning outcomes by an artificial neural network in a medical ICU. Respir Care. 2015;60:1560–9.

    Article  Google Scholar 

  15. 15.

    Otaguro T, Tanaka H, Igarashi Y, Tagami T, Masuno T, Yokobori S, et al. Machine learning for the prediction of successful extubation among patients with mechanical ventilation in the intensive care unit: a retrospective observational study. J Nippon Med Sch Nippon Ika Daigaku Zasshi. 2021.

    Article  PubMed  Google Scholar 

  16. 16.

    Fleuren LM, Dam TA, Tonutti M, de Bruin DP, Lalisang RCA, Gommers D, et al. The Dutch data warehouse, a multicenter and full-admission electronic health records database for critically ill COVID-19 patients. Crit Care. 2021;25:304.

    Article  Google Scholar 

  17. 17.

    Collins GS, Reitsma JB, Altman DG, Moons KGM. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMJ. 2015;350:g7594.

    Article  Google Scholar 

  18. 18.

    Béduneau G, Pham T, Schortgen F, Piquilloud L, Zogheib E, Jonas M, et al. Epidemiology of weaning outcome according to a new definition. The WIND study. Am J Respir Crit Care Med. 2016;195:772–83.

    Article  Google Scholar 

  19. 19.

    Amato MBP, Meade MO, Slutsky AS, Brochard L, Costa ELV, Schoenfeld DA, et al. Driving pressure and survival in the acute respiratory distress syndrome. N Engl J Med. 2015;372:747–55.

    CAS  Article  Google Scholar 

  20. 20.

    Tibshirani R. Regression Shrinkage and Selection via the Lasso. J R Stat Soc Ser B Methodol. 1996;58:267–88.

    Google Scholar 

  21. 21.

    Chen H, Janizek JD, Lundberg S, Lee S-I. True to the Model or True to the Data? ArXiv200616234 Cs Stat [Internet]. 2020 [cited 2021 Jan 28]; Available from:

  22. 22.

    Friedman JH. Greedy function approximation: a gradient boosting machine. Ann Stat. 2001;29:1189–232.

    Article  Google Scholar 

  23. 23.

    Rapid Disuse Atrophy of Diaphragm Fibers in Mechanically Ventilated Humans | NEJM [Internet]. [cited 2021 Sep 21]. Available from:

  24. 24.

    Mechanical Ventilation–induced Diaphragm Atrophy Strongly Impacts Clinical Outcomes | American Journal of Respiratory and Critical Care Medicine [Internet]. [cited 2021 Sep 21]. Available from:

  25. 25.

    Yoshida T, Amato MBP, Kavanagh BP, Fujino Y. Impact of spontaneous breathing during mechanical ventilation in acute respiratory distress syndrome. Curr Opin Crit Care. 2019;25:192–8.

    Article  Google Scholar 

  26. 26.

    Teasdale G, Jennett B. Assessment of coma and impaired consciousness: a practical scale. The Lancet. 1974;304:81–4.

    Article  Google Scholar 

  27. 27.

    Erginel S, Ucgun I, Yildirim H, Metintas M, Parspour S. High body mass index and long duration of intubation increase post-extubation stridor in patients with mechanical ventilation. Tohoku J Exp Med. 2005;207:125–32.

    Article  Google Scholar 

  28. 28.

    Kooistra EJ, de Nooijer AH, Claassen WJ, Grondman I, Janssen NAF, Netea MG, et al. A higher BMI is not associated with a different immune response and disease course in critically ill COVID-19 patients. Int J Obes. 2021;45:687–94.

    CAS  Article  Google Scholar 

  29. 29.

    Qian Z, Alaa AM, van der Schaar M, Ercole A. Between-centre differences for COVID-19 ICU mortality from early data in England. Intensive Care Med. 2020.

    Article  PubMed  PubMed Central  Google Scholar 

  30. 30.

    Eaneff S, Obermeyer Z, Butte AJ. The case for algorithmic stewardship for artificial intelligence and machine learning technologies. JAMA. 2020.

    Article  PubMed  Google Scholar 

  31. 31.

    Fleuren LM, de Bruin DP, Tonutti M, Lalisang RCA, Elbers PWG, Gommers D, et al. Large-scale ICU data sharing for global collaboration: the first 1633 critically ill COVID-19 patients in the Dutch Data Warehouse. Intensive Care Med. 2021.

    Article  PubMed  PubMed Central  Google Scholar 

Download references


The Dutch ICU Data Sharing Against COVID-19 Collaborators: From collaborating hospitals having shared data: Julia Koeter, MD, Intensive Care, Canisius Wilhelmina Ziekenhuis, Nijmegen, The Netherlands. Roger van Rietschote, Business Intelligence, Haaglanden MC, Den Haag, The Netherlands. M.C. Reuland, MD, Department of Intensive Care Medicine, Amsterdam UMC, Universiteit van Amsterdam, Amsterdam, The Netherlands. Laura van Manen, MD, Department of Intensive Care, BovenIJ Ziekenhuis, Amsterdam, The Netherlands. Leon Montenij, MD, PhD, Department of Anesthesiology, Pain Management and Intensive Care, Catharina Ziekenhuis Eindhoven, Eindhoven, The Netherlands. Jasper van Bommel, MD, PhD, Department of Intensive Care, Erasmus Medical Center, Rotterdam, The Netherlands. Roy van den Berg, Department of Intensive Care, ETZ Tilburg, Tilburg, The Netherlands. Ellen van Geest, Department of ICMT, Haga Ziekenhuis, Den Haag, The Netherlands. Anisa Hana, MD, PhD, Intensive Care, Laurentius Ziekenhuis, Roermond, The Netherlands. B. van den Bogaard, MD, PhD, ICU, OLVG, Amsterdam, The Netherlands. Prof. Peter Pickkers, Department of Intensive Care Medicine, Radboud University Medical Centre, Nijmegen, The Netherlands. Pim van der Heiden, MD, PhD, Intensive Care, Reinier de Graaf Gasthuis, Delft, The Netherlands. Claudia (C.W.) van Gemeren, MD, Intensive Care, Spaarne Gasthuis, Haarlem en Hoofddorp, The Netherlands. Arend Jan Meinders, MD, Department of Internal Medicine and Intensive Care, St Antonius Hospital, Nieuwegein, The Netherlands. Martha de Bruin, MD, Department of Intensive Care, Franciscus Gasthuis & Vlietland, Rotterdam, The Netherlands. Emma Rademaker, MD, MSc, Department of Intensive Care, UMC Utrecht, Utrecht, The Netherlands. Frits H.M. van Osch, PhD, Department of Clinical Epidemiology, VieCuri Medisch Centrum, Venlo, The Netherlands. Martijn de Kruif, MD, PhD, Department of Pulmonology, Zuyderland MC, Heerlen, The Netherlands. Nicolas Schroten, MD, Intensive Care, Albert Schweitzerziekenhuis, Dordrecht, The Netherlands. Klaas Sierk Arnold, MD, Anesthesiology, Antonius Ziekenhuis Sneek, Sneek, The Netherlands. J. W. Fijen, MD, PhD, Department of Intensive Care, Diakonessenhuis Hospital, Utrecht, The Netherland. Jacomar J. M. van Koesveld, MD, ICU, IJsselland Ziekenhuis, Capelle aan den IJssel, The Netherlands. Koen S. Simons, MD, PhD, Department of Intensive Care, Jeroen Bosch Ziekenhuis, Den Bosch, The Netherlands. Joost Labout, MD, PhD, ICU, Maasstad Ziekenhuis Rotterdam, The Netherlands. Bart van de Gaauw, MD, Martiniziekenhuis, Groningen, The Netherlands. Michael Kuiper, Intensive Care, Medisch Centrum Leeuwarden, Leeuwarden, The Netherlands. Albertus Beishuizen, MD, PhD, Department of Intensive Care, Medisch Spectrum Twente, Enschede, The Netherlands. Dennis Geutjes, Department of Information Technology, Slingeland Ziekenhuis, Doetinchem, The Netherlands. Johan Lutisan, MD, ICU, WZA, Assen, The Netherlands. Bart P. Grady, MD, PhD, Department of Intensive Care, Ziekenhuisgroep Twente, Almelo, The Netherlands. Remko van den Akker, Intensive Care, Adrz, Goes, The Netherlands. Tom A. Rijpstra, MD, Department of Anesthesiology, Intensive Care and Pain Medicine, Amphia Ziekenhuis, Breda, The Netherlands. W.G. Boersma, MD, PhD, Department of Pulmonology, Northwest Clinics, Alkmaar, the Netherlands. From collaborating hospitals having signed the data sharing agreement: Daniel Pretorius, MD, Department of Intensive Care Medicine, Hospital St Jansdal, Harderwijk, The Netherlands. Menno Beukema, MD, Department of Intensive Care, Streekziekenhuis Koningin Beatrix, Winterswijk, The Netherlands. Bram Simons, MD, Intensive Care, Bravis Ziekenhuis, Bergen op Zoom en Roosendaal, The Netherlands. A.A. Rijkeboer, MD, ICU, Flevoziekenhuis, Almere, The Netherlands. Marcel Aries, MD, PhD, MUMC+, University Maastricht, Maastricht, The Netherlands. Niels C. Gritters van den Oever, MD, Intensive Care, Treant Zorggroep, Emmen, The Netherlands. Martijn van Tellingen, MD, EDIC, Department of Intensive Care Medicine, afdeling Intensive Care, ziekenhuis Tjongerschans, Heerenveen, The Netherlands. Annemieke Dijkstra, MD, Department of Intensive Care Medicine, Het Van Weel-Bethesda Ziekenhuis, Dirksland, The Netherlands. Rutger van Raalte, Department of Intensive Care, Tergooi hospital, Hilversum, The Netherlands.


Partially funded by grants from ZonMw (Project 10430012010003, file 50-55700-98-908), Zorgverzekeraars Nederland and the Corona Research Fund. The sponsors had no role in any part of the study.

Author information





LF drafted the manuscript. TD, DB, RL, MF, TM, MS, SV, AB, DQ, RN, TH, PT, WH and PE were involved in data processing and analytics. All authors contributed to data collection and critically reviewed the manuscript. All authors have full access to the data. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Lucas M. Fleuren.

Ethics declarations

Ethics approval and consent to participate

The Medical Ethics Committee at Amsterdam UMC, location VUmc waived the need for patient informed consent and approved of an opt-out procedure for the collection of COVID-19 patient data during the COVID-19 crisis.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1

. Table S1 Overview of machine learning studies that investigate extubation readiness. Table S2 Model performance. Figure S1 PD-plot for BMI

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Fleuren, L.M., Dam, T.A., Tonutti, M. et al. Predictors for extubation failure in COVID-19 patients using a machine learning approach. Crit Care 25, 448 (2021).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Extubation
  • Prediction
  • Risk factors
  • Extubation failure