Skip to main content

Natural language processing diagnosed behavioural disturbance phenotypes in the intensive care unit: characteristics, prevalence, trajectory, treatment, and outcomes

Abstract

Background

Natural language processing (NLP) may help evaluate the characteristics, prevalence, trajectory, treatment, and outcomes of behavioural disturbance phenotypes in critically ill patients.

Methods

We obtained electronic clinical notes, demographic information, outcomes, and treatment data from three medical-surgical ICUs. Using NLP, we screened for behavioural disturbance phenotypes based on words suggestive of an agitated state, a non-agitated state, or a combination of both.

Results

We studied 2931 patients. Of these, 225 (7.7%) were NLP-Dx-BD positive for the agitated phenotype, 544 (18.6%) for the non-agitated phenotype and 667 (22.7%) for the combined phenotype. Patients with these phenotypes carried multiple clinical baseline differences. On time-dependent multivariable analysis to compensate for immortal time bias and after adjustment for key outcome predictors, agitated phenotype patients were more likely to receive antipsychotic medications (odds ratio [OR] 1.84, 1.35–2.51, p < 0.001) compared to non-agitated phenotype patients but not compared to combined phenotype patients (OR 1.27, 0.86–1.89, p = 0.229). Moreover, agitated phenotype patients were more likely to die than other phenotypes patients (OR 1.57, 1.10–2.25, p = 0.012 vs non-agitated phenotype; OR 4.61, 2.14–9.90, p < 0.001 vs. combined phenotype). This association was strongest in patients receiving mechanical ventilation when compared with the combined phenotype (OR 7.03, 2.07–23.79, p = 0.002). A similar increased risk was also seen for patients with the non-agitated phenotype compared with the combined phenotype (OR 6.10, 1.80–20.64, p = 0.004).

Conclusions

NLP-Dx-BD screening enabled identification of three behavioural disturbance phenotypes with different characteristics, prevalence, trajectory, treatment, and outcome. Such phenotype identification appears relevant to prognostication and trial design.

Graphical abstract

Background

The study of the epidemiology of disturbed behavioural phenotypes in critically ill patients is both novel and challenging [1]. In part, this is because such phenotypes are multifaceted manifestations of a poorly understood underlying neurocognitive state. Thus, they cannot be measured and can only be described by words. Moreover, the frequently cited reference for their description [2] is itself an aggregation of words, provides limited guidance for the systematic identification of behavioural phenotypes, and was not intended for use in the intensive care unit (ICU). Furthermore, the widely used screening tools for cognitive and behavioural dysfunction in critically ill patients, the Confusion Assessment Method for the Intensive Care Unit (CAM-ICU) [3] and the Intensive Care Delirium Screening Checklist (ICDSC) [4] only include limited guidance for the identification of behavioural phenotypes.

Notwithstanding the above difficulties, using CAM-ICU and ICDSC, several studies have reported that phenotypes of delirium do exist and may be associated with different morbidity and mortality rates [5,6,7].

Such studies have also suggested that there might be clinical value in further investigating the characteristics, prevalence, trajectory, treatment, and outcomes of disturbed behavioural phenotypes, a similar but not identical concept to delirium [8,9,10,11,12,13,14]. In this regard, natural language processing (NLP) of caregivers’ notes has recently emerged as a screening tool for behaviour in critically ill patients [8,9,10,11,12,13,14]. NLP has also been recently used to describe the syndrome of NLP-diagnosed behaviour disturbance (NLP-Dx-BD) [15]. Different from delirium screening tools, the identification of this syndrome requires the application of NLP to screen caregiver notes for the purpose of detecting words or phrases that describe disturbed behaviour (see “Methods” section). Such NLP-Dx-BD is a condition observed in critically ill patients and has high sensitivity for the identification of patients who later go on to be treated with antipsychotic medications in the ICU [8].

Moreover, although NLP can be used to read caregiver’s clinical notes for the purpose of identifying words and phrases such as “combative” or “confused”, these words may also be indicative of different phenotypes of NLP-Dx-BD. Thus, “combative” implies an agitated phenotype, while, in the absence of words to indicate psychomotor agitation, “confused” implies a non-agitated phenotype, and, finally, the presence of both phenotypes within a defined reporting period (e.g., 24 h) implies a combined phenotype.

We hypothesised that NLP could be used to detect words and phrases associated with distinct behavioural phenotypes and that such NLP-defined phenotypes would be associated with clinically important differences in other patient characteristics. Accordingly, we used NLP to study the characteristics, prevalence, trajectory, treatment, and outcomes of phenotypes of NLP-Dx-BD in a cohort of critically ill patients.

Methods

Study design

We performed a non-interventional, retrospective study of a cohort of critically ill adult patients (≥ 18 years old) admitted to the three ICUs of a university affiliated hospital in Melbourne, Australia between 1 May 2019 and 31 December 2020. The study was approved by the Austin Hospital Human Research Ethics Committee (LNR/19/Austin/38), which waived the requirement for informed consent. For patients with multiple admissions, only the first was included. No other exclusion criteria were applied. During the study period, all patients received care designed to reduce the risk of developing delirium including family visits, dimmed lights at night, minimal interaction to facilitate night-time sleep, and the use of visual and auditory aids as required.

Data collection and manipulation

We obtained the patient clinical progress notes entered into the electronic health record (EHR) by doctors, nurses, physiotherapists, and other allied health professionals. We analysed these notes using NLP techniques. As previously described [8, 15], each progress note was converted into sentence vectors, tokenised and searched for the presence of words indicative of agitated or non-agitated behaviour (Natural Language Toolkit; NLTK 3.5) [16], a process equivalent to the first step of Large Language Model Generative Pre-trained Transformer strategies recently popularised by OpenAI.

The words used in our NLP model were derived from the results of a previously published survey of clinical staff who were asked to identify words, terms or expressions that they would associate with disturbed behaviour and possible delirium [17].

In this study, we further categorised each of these words as being suggestive of agitated or a non-agitated behavioural disturbance state, while the presence of negation or resolution words was also determined (Table 1). Accordingly, all notes for the same patient were aggregated and categorised into four groups of NLP diagnosed behavioural disturbance (NLP-Ex-BD): (1) agitated, when only agitated words were present in available notes; (2) non-agitated, when only non-agitated words were present in available notes; (3) combined, when both agitated and non-agitated words were present in available notes; and (4) no disturbance, when no agitated or non-agitated words were present in any note available.

Table 1 Words suggestive of behavioral disturbance

We use the term NLP-Dx-BD instead of delirium for the sake of accuracy. Importantly, we wish to emphasise that the words in Table 1 are intuitively associated with cognitive deterioration, which is recognised with delirium as well. However, we are also aware that we do not have sufficient data, for example, to exclude the fact that agitated behaviour was not, in fact, due (in a very few individuals) to underlying severe pain and/or dementia.

Baseline and outcome data were obtained from the Australian and New Zealand Intensive Care Society Adult ICU Patient Database run by the Centre for Outcome and Resource Evaluation [18]. Data detailing the use of antipsychotic medications were obtained from the hospital’s electronic medication management system.

Exposure

The primary exposure of this study for a given patient was the occurrence at any time of having a word/s indicating agitated or non-agitated behaviour, or combined behavioural disturbance, within the notes recorded during their ICU stay.

Outcomes

The primary outcome of this study was the use of antipsychotic medications.

Although such treatment is controversial [19,20,21,22], antipsychotic medication use was chosen as the primary outcome measure because we were studying behaviour and considered it likely that such medications would be used differently according to the presence or absence of an agitated behavioural phenotype [23, 24].

Secondary outcomes included ICU and hospital mortality and 28-day mortality censored at hospital discharge as well as ICU and hospital length of stay and duration of mechanical ventilation.

Statistical analysis

All continuous data are reported as median (quartile 25%–quartile 75%) and categorical data as numbers and percentage. Baseline, clinical characteristics and outcomes of the patients were compared among the groups using the Fisher exact test and Kruskal–Wallis test.

Multivariable logistic regression models were used to assess the impact of the exposure on the use of antipsychotic medications and hospital clinical outcomes, according to each phenotype. The model was adjusted by age, type of admission, and by the Australian and New Zealand (ANZ) Risk of Death (ANZROD) after log transformation [25]. ANZROD is a local recalibration of the APACHE III score which adjusts for the persistent lower than expected mortality in ANZ and contains variables such as admission diagnosis and pre-existing comorbidities [25]. As previously demonstrated, ANZROD is an accurate outcome predictor and explains most of the mortality found in ICUs in Australia and New Zealand [26]. Effect estimates were reported as odds ratio (OR) with its 95% confidence interval (CI). To account for immortal time bias, we conducted a time-dependent Cox proportional hazard model for the primary outcome and hospital mortality that considered all measurements available in each note. For the primary outcome, only exposure variables happening before the first outcome (first time the patient received an antipsychotic) were included in the model to avoid exposures measured after the medication had been given. Due to pairwise comparisons, the significance level was adjusted using a Bonferroni method and a p < 0.01 was considered statistically significant. All analyses were case complete analyses and were conducted in R v.4.0.2 (R Foundation, Vienna, Austria) [27]

Results

Characteristics and prevalence of NLP-Dx-BD phenotypes

We studied 2931 patients, 79,807 progress notes, and 17,110,747 words. Among the study patients, 1436 (49.0%) were Natural Language Processing Diagnosed Behavioural Disturbance (NLP-Dx-BD) positive. Of the positive patients, 225 (15.7% of positive and 7.7% of all patients) were categorised into the agitated phenotype; 544 (38.0% of positive and 18.6% of all patients) into the non-agitated phenotype and 667 (46.4% of positive and 22.7% of all patients) into the combined phenotype (Additional file 1: Fig. S1). Patients categorised into the combined phenotype displayed the characteristics of the agitated and non-agitated phenotypes at the same or different times during their admission.

The words ‘agitated’, ‘agitation’ and ‘restraint’ were the most common words detected from the NLP-Dx-BD agitated word list (Additional file 1: Fig. S2a) and ‘delirium’, ‘confused’ and ‘fluctuating’ the most common words from the non-agitated word list (Additional file 1: Fig. S2b).

The baseline (first 24 h of ICU admission) characteristics of the study patients are summarised in Table 2, which shows multiple significant differences according to the presence of NLP-Dx-BD and of its three different phenotypes. Compared to patients with the agitated phenotype, those with the non-agitated phenotype were older, had higher APACHE III scores, different admission diagnoses, and higher serum creatinine levels. However, they were also less likely to receive invasive ventilation and vasopressor drugs.

Table 2 Baseline characteristics of the included patients

Compared with agitated phenotype patients, those with a combined phenotype were older, had higher APACHE III scores, a different source of admission, and were more likely to receive vasopressor drugs and renal replacement therapy. Finally, compared with the non-agitated phenotype those with the combined phenotype were younger, had higher APACHE III scores, different types of admission, were more likely to have acute kidney injury, had different admission diagnoses and source of admission, and were more likely to receive mechanical ventilation, vasopressor drugs and renal replacement therapy. A pairwise comparison of these three different phenotypes is presented in Additional file 1: Table S1, which confirmed the findings of Table 2 and identified the relationship between phenotypes.

Early trajectory of NLP-Dx-BD phenotypes

Most episodes of NLP-Dx-BD occurred in the first 2 days (Additional file 1: Fig. S3). Thus, we studied the trajectory of NLP-Dx-BD phenotypes within these first 48 h among patients who, on day one or day two, had developed NLP-Dx-BD (Fig. 1, Additional file 1: Table S6). We found that behavioural phenotypes within this population changed dynamically.

Fig. 1
figure 1

Alluvial plot illustrating the changes in NLP-Dx-BD phenotype over time in patients who were positive for the condition at some stage in the first 48 h. Most patients transitioned from one phenotype to another or achieved resolution indicating that not only is NLP-Dx-BD a potentially fluctuating condition but that phenotypes are similarly fluctuating

More than half (59.55%) of the patients diagnosed with the combined phenotype of NLP-Dx-BD on the day of admission were negative (resolution) for NLP-Dx-BD by the second day. Approximately a third (31.46%) continued to display the combined phenotype and a small number transitioned to either the agitated (4.49%) or non-agitated (4.49%) phenotype. Patients with the non-agitated phenotype on the day of admission were similarly likely to achieve resolution of their NLP-Dx-BD (70.21%). However, approximately one third (17.02%) remained in a non-agitated phenotype state with a smaller number (12.77%) transitioning to the combined phenotype and no patients in the non-agitated phenotype transitioned to an agitated phenotype on day two. Finally, almost a half (45.72%) of the patients with the agitated phenotype on admission achieved resolution of NLP-Dx-BD, approximately one in two (51.42%) transitioned to the combined phenotype or remained agitated and only one (2.86%) became non-agitated.

Primary outcome: antipsychotic medication use

The raw numbers for each antipsychotic drug used in each phenotype are presented in Table 3, which shows significant differences in their use according to phenotype. In the unadjusted pairwise comparison (Additional file 1: Table S2), antipsychotic drug use was significantly different only when the combined phenotype was the comparator. Thus, their use was significantly greater in the combined vs both other phenotypes (p < 0.001). This was also true for each individual agent (p < 0.001).

Table 3 Antipsychotic drug use for study patients according to phenotype

When analysing the use of antipsychotic drugs as outcome and considering all patients and groups as time-dependent variables (Additional file 1: Table S3), the univariable model showed that the odds ratio for receiving such drugs was greatest for the agitated phenotype. This was confirmed on multivariable modelling considering each phenotype as time dependent variable (Table 4) and was true for invasively ventilated and non-ventilated patients. Moreover, compared with patients without disturbance, the impact of NLP-Dx-BD with an agitated component (agitated or combined) on the prescription of anti-psychotic medications was strong (Additional file 1: Table S4).

Table 4 Multivariable models with use of anti-psychotic drugs as outcome and according to the use of invasive ventilation and considering the phenotypes as time-dependent variable

Secondary outcomes

The raw data for the secondary outcomes of the study patients are shown in Additional file 1: Table S4 and the unadjusted pairwise comparison is presented in Additional file 1: Table S5. The duration of ventilation, ICU length of stay and hospital length of stay were significantly greater for the combined phenotype vs. the agitated and non-agitated phenotype groups (p < 0.001). The unadjusted ICU mortality, hospital mortality and 28-day mortality rates were not significantly different across the phenotypes. However, once multivariable models were applied with hospital mortality as outcome and groups as time-dependent variables (Table 5), patients with the agitated phenotype were more likely to die than patients with other phenotypes (OR 1.57, 1.10–2.25, p = 0.012 vs. non-agitated phenotype; OR 4.61, 2.14–9.90, p < 0.001 vs. combined phenotype). This association was strongest in those patients receiving mechanical ventilation when compared with the combined phenotype (OR 7.03, 2.07–23.79, p = 0.002). A similar increased risk was also seen for patients with the non-agitated phenotype compared with the combined phenotype (OR 6.10, 1.80–20.64, p = 0.004).

Table 5 Multivariable models with hospital mortality as outcome and considering the groups as time-dependent variable

Discussion

Key findings

We applied Natural Language Processing (NLP) to evaluate the characteristics, prevalence, trajectory, treatment, and outcomes of behavioural disturbance phenotypes in critically ill patients. We identified three major phenotypes: agitated, non-agitated, and combined. We found that each of these three phenotypes was associated with different patient characteristics. Of these, the combined phenotype was the most common, followed by the agitated phenotype. However, the trajectory of patients was such that movement from one phenotype to another in the first 48 h was common (65.66%). Moreover, during this time, it was uncommon for non-agitated patients to develop an agitated phenotype, many agitated patients achieved resolution of their NLP-Dx-BD, and both non-agitated and agitated phenotype patients typically transitioned to a combined phenotype. We also found significant differences in the use of antipsychotic medication. Thus, in time-variant multivariable models and after adjustment, patients with an agitated component to their NLP-Dx-BD were significantly more likely to receive antipsychotic medications overall compared with those patients with non-agitated NLP-Dx-BD. Finally, we found that patients with the combined phenotype had longer unadjusted duration of invasive ventilation, ICU stay, and hospital stay and greater mortality. However, once multivariable models were applied with hospital mortality as the outcome and groups as time-dependent variables, it was patients with the agitated phenotype who were more likely to die than patients with other phenotypes, especially among those patients receiving mechanical ventilation.

Relationship to previous studies

Characterisation

Behavioural disturbance phenotypes based on NLP analysis of caregiver notes have not been previously defined. However in a parallel fashion, three phenotypes of delirium have been previously described: hyperactive, hypoactive and mixed [28,29,30,31]. Although there is no gold standard for the identification of these phenotypes, the linguistic constructs and psychomotor descriptors used in previous studies appear broadly consistent with the categorisation of the NLP-Dx-BD search terms used in our study [32]. Nonetheless, NLP-Dx-BD phenotypes are focussed on unstructured continuous observation of behaviour by care givers and their relationship with structured intermittent screening tool-based assessment is unknown [3, 33].

Prevalence

The prevalence of the phenotypes of NLP-Dx-BD phenotypes in critically ill patients is unknown and ours is the first study to explore this concept. However, studies using structured intermittent screening tools reported that between 0.3 and 45.9% of patients displayed agitation, 0.5–91% displayed a non-agitated state and 1–69.5% displayed both [7, 20, 22]. The prevalence of the NLP-Dx-BD phenotypes identified in our study falls within these ranges.

Trajectory

To our knowledge, the trajectory of behavioural phenotypes of critically ill patients has not been reported. Moreover, studies of the epidemiology and treatment of delirium using intermittent structured screening tool-based assessments have not reported on their dynamic nature, thus implying stability [34, 35]. In contrast, and for the first time, we have demonstrated in detail that behavioural phenotypes, as identified by NLP, are unstable within the first 48 h of critical care admission.

Treatment and outcomes

Antipsychotic medications remain widely used for the treatment of disturbed behaviour in critically ill patients [36, 37] with 70% of antipsychotic medication used in acute care prescribed for its treatment [38]. Further, several studies have reported higher rates of administration for agitated patients [23, 39]. In our study, we also found antipsychotic medication use was significantly more likely in the combined NLP-Dx-BD group followed by the agitated groups (both groups having an agitated component). This implies that it is the presence of agitation that drives antipsychotic drugs prescription.

Using intermittently applied structured-assessment tools, a previous study of non-ventilated patients suggested that mortality did not differ between the agitated, non-agitated and combined states [40]. However, in our study, after appropriate adjustments and assessing the group as a time-dependent variable (an adjustment not applied to previous studies), ICU mortality was highest for the NLP-Dx-BD agitated group. This suggests that NLP assessment of continuous caregiver observation may identify a cohort of patients that differs from that identified by intermittently applied structured-assessment tools.

Implications of study findings

Our findings imply that NLP-Dx-BD may be used to identify three clinically relevant phenotypes. Further, our findings suggest that the combined phenotype may be dominant. However, they also suggest that the early trajectory of these phenotypes is complex with dynamic changes from one phenotype to the other, indicating that such phenotypes are unstable. Moreover, on multivariable analysis and overall, patients with an agitated component to their NLP-Dx-BD (agitated phenotype or combined phenotype) appear significantly more likely to receive antipsychotic medications. These observations imply that future studies of pharmacologic intervention should primarily focus on patients who present an agitated component to their phenotype, either in isolation or in a combination (combined phenotype).

Strengths and limitations

Our study has several strengths. It is the first to use NLP to study the prevalence of behavioural phenotypes in critically ill patients. It is also the first to describe the early trajectory of these phenotypes in critically ill patients and to demonstrate their dynamic and unstable nature. Moreover, to the best of our knowledge, our study presents the first detailed information of the use of antipsychotic medications according to phenotype, thus providing further evidence of the validity of the NLP-Dx-BD construct. Finally, with current software, as soon as a key word describing an agitated state is entered into the electronic notes, such entry can be used to trigger alerts to clinicians, thus facilitating randomisation into interventional trials.

We acknowledge several limitations. First, our study was undertaken in a large intensive care unit system involving three ICUs within a university affiliated tertiary hospital in a resource-rich country. Therefore, our findings may not apply to other intensive care units in low or middle-income countries or to other ICUs with a different approach to the management of disturbed behaviour. Moreover, patients were not assessed for the presence or absence of delirium by independent adjudication personnel. However, we investigated behavioural disturbances. This is different in focus, concept, and technique from the assessment of delirium [8]. Thus, we cannot comment of the relationship between our findings and their relevance to intermittently applied delirium screening tools. Future investigations of such relationship may be of interest. The use of medications in the treatment of behavioural disturbance in critically ill patients is controversial. However, antipsychotic medications remain an important tool for moderating behaviour, are a reasonable proxy for disturbed behaviour (the target of our investigation), and are widely used in critically ill patients [22, 41]. Finally, clinicians may be unfamiliar with NLP techniques, which might generate scepticisms about our findings. However, clinicians are the very generators of the words we used in our study; NLP-based technology is fast becoming accepted in response to the arrival of Large Language Model Generative Pre-trained Transformer strategies, and our study is the first step toward machine learning approaches to the behavioural management of critically ill patients.

Conclusions

For the first time, we demonstrated that Natural Language Processing of electronic caregiver notes enables the identification and characterisation of NLP-Dx-BD phenotypes in critically ill patients. Moreover, we found that the combined phenotype was dominant and that, in the first 48 h, it was common for critically ill patients within the study cohort to transition between behavioural phenotypes of NLP-Dx-BD. Importantly, after adjustment, patients with the agitated phenotype either in isolation or within a combined phenotype were significantly more likely to receive antipsychotic medications. These findings have important implications for our understanding of the epidemiology of phenotypes of disturbed behaviour in critically ill patients and for trial design in this field.

Availability of data and materials

Will individual participant data be available (including data dictionaries)? Yes. What data in particular will be shared? Individual deidentified records. What Other documents will be available? Ethics approval. When will the data be available (start and end dates)? From 1 year after publication for a period of 1 year. With whom? Researchers with a study protocol approved by an institutional ethics review board. For what types of analysis? Additional observational statistical analysis. By what mechanism will data be made available? By submission of a research protocol to the corresponding author.

References

  1. Bowman EML, Cunningham EL, Page VJ, McAuley DF. Phenotypes and subphenotypes of delirium: a review of current categorisations and suggestions for progression. Crit Care. 2021;25(1):334.

    Article  PubMed  PubMed Central  Google Scholar 

  2. American Psychiatric Association. Diagnostic and statistical manual of mental disorders: DSM-5. 5th ed. Washington: American Psychiatric Association; 2013. p. 947.

    Book  Google Scholar 

  3. Ely EW, Inouye SK, Bernard GR, Gordon S, Francis J, May L, et al. Delirium in mechanically ventilated patients: validity and reliability of the confusion assessment method for the intensive care unit (CAM-ICU). JAMA. 2001;286(21):2703.

    Article  CAS  PubMed  Google Scholar 

  4. Inouye SK. Clarifying confusion: the confusion assessment method: a new method for detection of delirium. Ann Intern Med. 1990;113(12):941.

    Article  CAS  PubMed  Google Scholar 

  5. Krewulak KD, Stelfox HT, Ely EW, Fiest KM. Risk factors and outcomes among delirium subtypes in adult ICUs: a systematic review. J Crit Care. 2020;56:257–64.

    Article  PubMed  Google Scholar 

  6. Meagher DJ, Leonard M, Donnelly S, Conroy M, Adamis D, Trzepacz PT. A longitudinal study of motor subtypes in delirium: relationship with other phenomenology, etiology, medication exposure and prognosis. J Psychosom Res. 2011;71(6):395–403.

    Article  PubMed  Google Scholar 

  7. Krewulak KD, Stelfox HT, Leigh JP, Ely EW, Fiest KM. Incidence and prevalence of delirium subtypes in an adult ICU: a systematic review and meta-analysis*. Crit Care Med. 2018;46(12):2029–35.

    Article  PubMed  Google Scholar 

  8. Young M, Holmes N, Kishore K, Marhoon N, Amjad S, Serpa-Neto A, et al. Natural language processing diagnosed behavioral disturbance vs confusion assessment method for the intensive care unit: prevalence, patient characteristics, overlap, and association with treatment and outcome. Intensive Care Med. 2022;23(48):559–69.

    Article  Google Scholar 

  9. Young M, Holmes N, Kishore K, Serpa-Neto A, Bellomo R. Confusion regarding the use of natural language processing in ICU delirium assessment. Author’s reply. Intensive Care Med. 2022;48(7):983–4.

    Article  PubMed  Google Scholar 

  10. Devlin JW, Skrobik Y. What language conveys distress and reassurance? Intensive Care Med. 2022;48(5):599–601.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Burki T. Natural language processing and detecting delirium. Lancet Respir Med. 2022;10(7):639.

    Article  PubMed  Google Scholar 

  12. Ge W, Alabsi H, Jain A, Ye E, Sun H, Fernandes M, et al. Identifying patients with delirium based on unstructured clinical notes: observational study. JMIR Form Res. 2022;6(6):e33834.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Hart KL, Pellegrini AM, Forester BP, Berretta S, Murphy SN, Perlis RH, et al. Distribution of agitation and related symptoms among hospitalized patients using a scalable natural language processing method. Gen Hosp Psychiatry. 2021;68:46–51.

    Article  PubMed  Google Scholar 

  14. Fu S, Lopes GS, Pagali SR, Thorsteinsdottir B, LeBrasseur NK, Wen A, et al. Ascertainment of delirium status using natural language processing from electronic health records. J Gerontol Ser A. 2022;77(3):524–30.

    Article  Google Scholar 

  15. Young M, Holmes N, Robbins R, Marhoon N, Amjad S, Neto AS, et al. Natural language processing to assess the epidemiology of delirium-suggestive behavioural disturbances in critically ill patients. Crit Care Resusc. 2021;23(2):145–54.

    Google Scholar 

  16. Bird S, Klein E, Loper E. Natural language processing with python. Sebastopol: O’Reilly; 2009.

    Google Scholar 

  17. Holmes NE, Amjad S, Young M, Berlowitz DJ, Bellomo R. Using language descriptors to recognise delirium: a survey of clinicians and medical coders to identify delirium-suggestive words. Crit Care Resusc. 2019;21(4):4.

    Google Scholar 

  18. Stow PJ, Hart GK, Higlett T, George C, Herkes R, McWilliam D, et al. Development and implementation of a high-quality clinical database: the Australian and New Zealand Intensive Care Society adult patient database. J Crit Care. 2006;21(2):133–41.

    Article  PubMed  Google Scholar 

  19. Nikooie R, Neufeld KJ, Oh ES, Wilson LM, Zhang A, Robinson KA, et al. Antipsychotics for treating delirium in hospitalized adults: a systematic review. Ann Intern Med. 2019;171(7):485.

    Article  PubMed  Google Scholar 

  20. Girard TD, Exline MC, Carson SS, Hough CL, Rock P, Gong MN, et al. Haloperidol and ziprasidone for treatment of delirium in critical illness. N Engl J Med. 2018;379(26):2506–16.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Carrasco G, Baeza N, Cabré L, Portillo E, Gimeno G, Manzanedo D, et al. Dexmedetomidine for the treatment of hyperactive delirium refractory to haloperidol in nonintubated ICU patients: a nonrandomized controlled trial*. Crit Care Med. 2016;44(7):1295–306.

    Article  CAS  PubMed  Google Scholar 

  22. Andersen-Ranberg NC, Poulsen LM, Perner A, Wetterslev J, Estrup S, Hästbacka J, et al. Haloperidol for the treatment of delirium in ICU patients. N Engl J Med. 2022;867:2425–35.

    Article  Google Scholar 

  23. Mac Sweeney R, Barber V, Page V, Ely EW, Perkins GD, Young JD, et al. A national survey of the management of delirium in UK intensive care units. QJM. 2010;103(4):243–51.

    Article  CAS  PubMed  Google Scholar 

  24. Collet MO, Caballero J, Sonneville R, Bozza FA, Nydahl P, et al. Prevalence and risk factors related to haloperidol use for delirium in adult intensive care patients: the multinational AID-ICU inception cohort study. Intensive Care Med. 2018;44(7):1081–9.

    Article  CAS  PubMed  Google Scholar 

  25. Paul E, Bailey M, Pilcher D. Risk prediction of hospital mortality for adult patients admitted to Australian and New Zealand intensive care units: development and validation of the Australian and New Zealand Risk of Death model. J Crit Care. 2013;28(6):935–41.

    Article  PubMed  Google Scholar 

  26. Eldho P, Bailey M, Kasza J, Pilcher D. The ANZROD model: better benchmarking of ICU outcomes and detection of outliers. Crit Care Resusc. 2016;18(1):12.

    Google Scholar 

  27. R Core Team. R: a language and environment for statistical computing [Internet]. Vienna, Austria: R Foundation for Statistical Computing; 2019. Available from: https://www.R-project.org.

  28. Liptzin B, Levkoff SE. An empirical study of delirium subtypes. Br J Psychiatry. 1992;161(6):843–5.

    Article  CAS  PubMed  Google Scholar 

  29. O’Keeffe S. Clinical significance of delirium subtypes in older people. Age Ageing. 1999;28(2):115–9.

    Article  PubMed  Google Scholar 

  30. Boettger S, Breitbart W. Phenomenology of the subtypes of delirium: phenomenological differences between hyperactive and hypoactive delirium. Palliat Support Care. 2011;9(2):129–35.

    Article  PubMed  Google Scholar 

  31. Camus V, Gonthier R, Dubos G, Schwed P, Simeone I. Etiologic and outcome profiles in hypoactive and hyperactive subtypes of delirium. J Geriatr Psychiatry Neurol. 2000;13(1):38–42.

    Article  CAS  PubMed  Google Scholar 

  32. Meagher DJ, Moran M, Raju B, Gibbons D, Donnelly S, Saunders J, et al. Motor symptoms in 100 patients with delirium versus control subjects: comparison of subtyping methods. Psychosomatics. 2008;49(4):300–8.

    Article  PubMed  Google Scholar 

  33. Bergeron N, Dubois MJ, Dumont M, Dial S, Skrobik Y. Intensive care delirium screening checklist: evaluation of a new screening tool. Intensive Care Med. 2001;27(5):859–64.

    Article  CAS  PubMed  Google Scholar 

  34. Hayhurst CJ, Marra A, Han JH, Patel MB, Brummel NE, Thompson JL, et al. Association of hypoactive and hyperactive delirium with cognitive function after critical illness. Crit Care Med. 2020;48(6):e480–8.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Rengel KF, Hayhurst CJ, Jackson JC, Boncyk CS, Patel MB, Brummel NE, et al. Motoric subtypes of delirium and long-term functional and mental health outcomes in adults after critical illness. Crit Care Med. 2021;49(5):e521–32.

    Article  PubMed  PubMed Central  Google Scholar 

  36. Meagher D, Agar MR, Teodorczuk A. Debate article: antipsychotic medications are clinically useful for the treatment of delirium: antipsychotic prescribing in delirium. Int J Geriatr Psychiatry. 2018;33(11):1420–7.

    Article  PubMed  Google Scholar 

  37. Burry LD, Williamson DR, Mehta S, Perreault MM, Mantas I, Mallick R, et al. Delirium and exposure to psychoactive medications in critically ill adults: a multi-centre observational study. J Crit Care. 2017;42:268–74.

    Article  PubMed  Google Scholar 

  38. Jaworska N, Moss SJ, Krewulak KD, Stelfox Z, Niven DJ, Ismail Z, et al. A scoping review of perceptions from healthcare professionals on antipsychotic prescribing practices in acute care settings. BMC Health Serv Res. 2022;22(1):1272.

    Article  PubMed  PubMed Central  Google Scholar 

  39. Hui D, Reddy A, Palla S, Bruera E. Neuroleptic prescription pattern for delirium in patients with advanced cancer. J Palliat Care. 2011;27(2):141–7.

    Article  PubMed  Google Scholar 

  40. Zhang R, Bai L, Han X, Huang S, Zhou L, Duan J. Incidence, characteristics, and outcomes of delirium in patients with noninvasive ventilation: a prospective observational study. BMC Pulm Med. 2021;21(1):157.

    Article  PubMed  PubMed Central  Google Scholar 

  41. Hughes CG, Mailloux PT, Devlin JW, Swan JT, Sanders RD, Anzueto A, et al. Dexmedetomidine or propofol for sedation in mechanically ventilated adults with sepsis. N Engl J Med. 2021;384(15):1424–36.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

The authors wish to acknowledge contribution of the Austin Hospital medical, nursing and allied health staff for their detailed and timely entry of shift information regarding the neurocognitive state of their patients. The authors also acknowledge Dr Nikola Kolundzic for contributing the visual abstract.

Funding

Not applicable.

Author information

Authors and Affiliations

Authors

Contributions

MY contributed to conceptualisation, data curation, methodology, investigation, software, visualisation, and writing—original draft. NH contributed to conceptualisation, project administration, and writing—review and editing. KK contributed to data curation. SA contributed to methodology and writing—review and editing. MG contributed to writing—review and editing. ASN contributed to data curation, formal analysis, software, and writing—review and editing. MR contributed to writing—review and editing. RB contributed to conceptualisation, investigation, methodology, writing original draft, and supervision.

Corresponding author

Correspondence to Rinaldo Bellomo.

Ethics declarations

Ethics approval and consent to participate.

This study was approved by the Austin Hospital Human Research Ethics Committee (LNR/19/Austin/38). The committee waived the requirement for informed consent.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests..

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1:

Online Supplement.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Young, M., Holmes, N.E., Kishore, K. et al. Natural language processing diagnosed behavioural disturbance phenotypes in the intensive care unit: characteristics, prevalence, trajectory, treatment, and outcomes. Crit Care 27, 425 (2023). https://doi.org/10.1186/s13054-023-04695-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13054-023-04695-0

Keywords