Skip to main content

Predictive models for kidney disease: improving global outcomes (KDIGO) defined acute kidney injury in UK cardiac surgery



Acute kidney injury (AKI) risk prediction scores are an objective and transparent means to enable cohort enrichment in clinical trials or to risk stratify patients preoperatively. Existing scores are limited in that they have been designed to predict only severe, or non-consensus AKI definitions and not less severe stages of AKI, which also have prognostic significance. The aim of this study was to develop and validate novel risk scores that could identify all patients at risk of AKI.


Prospective routinely collected clinical data (n = 30,854) were obtained from 3 UK cardiac surgical centres (Bristol, Birmingham and Wolverhampton). AKI was defined as per the Kidney Disease: Improving Global Outcomes (KDIGO) Guidelines. The model was developed using the Bristol and Birmingham datasets, and externally validated using the Wolverhampton data. Model discrimination was estimated using the area under the ROC curve (AUC). Model calibration was assessed using the Hosmer–Lemeshow test and calibration plots. Diagnostic utility was also compared to existing scores.


The risk prediction score for any stage AKI (AUC = 0.74 (95% confidence intervals (CI) 0.72, 0.76)) demonstrated better discrimination compared to the Euroscore and the Cleveland Clinic Score, and equivalent discrimination to the Mehta and Ng scores. The any stage AKI score demonstrated better calibration than the four comparison scores. A stage 3 AKI risk prediction score also demonstrated good discrimination (AUC = 0.78 (95% CI 0.75, 0.80)) as did the four comparison risk scores, but stage 3 AKI scores were less well calibrated.


This is the first risk score that accurately identifies patients at risk of any stage AKI. This score will be useful in the perioperative management of high risk patients as well as in clinical trial design.


Acute kidney injury (AKI) is a common and severe complication of cardiac surgery affecting up to 30% of all patients and increasing mortality up to fourfold [1,2]. No effective treatment has been identified despite over 70 randomised controlled trials (RCTs) of proposed renoprotective interventions [3,4]. Systematic reviews have documented important limitations in these trials such as the enrolment of low or mixed AKI-risk patient cohorts, small sample sizes, and low statistical power [3,4]. These are important sources of bias that will have increased the likelihood of negative trial results. Recent recommendations on the design of trials in AKI suggest that these limitations may be countered by cohort enrichment [5], whereby the enrolment of patients with higher event rates will permit targeting of interventions to those patients populations most likely to benefit, smaller sample sizes, and increased study power. AKI risk prediction scores are an objective, transparent means of cohort enrichment but are not widely used. This is because existing scores have been developed principally to identify patients at risk of renal replacement therapy (RRT) which is rare and not less severe AKI, which is also associated with poor prognosis but occurs with greater frequency [6]. There is also uncertainty as to the generalizability of these scores. Only two published scores, The Cleveland clinic score [7], and the Mehta score [8] have been independently validated, and neither has demonstrated adequate discrimination and calibration in non-North American patient populations [9,10].

The aim of this study was to develop and externally validate two novel risk scores that could be used for cohort enrichment in clinical trials of renoprotective interventions in cardiac surgery. We used the Kidney Disease: Improving Global Outcomes (KDIGO) guidelines AKI definition [11] as this reconciles important differences between the two earlier consensus definitions; Acute Kidney Injury Network (AKIN) [12] and risk, injury, failure, loss, end stage (RIFLE) [13]. The prognostic utility of this score has only been demonstrated thus far in a single centre [14]. To support the validity and utility of the risk scores our first objective was therefore to demonstrate the utility of the KDIGO definition in a large multicentre cohort. Next, we developed two new scores based on preoperative variables; one to identify all patients at risk of AKI (KDIGO stages 1 to 3), and the second to identify only those patients at risk of severe AKI (KDIGO stage 3). Finally we compared the diagnostic utility of these new scores with four published risk scores.


This study was approved by the South West research ethics committee under reference 11/SW/0075. The requirement for written informed consent was waived. Our research objectives and methods were specified prior to execution.

Study population

This retrospective cohort study used routinely collected data from the patient analysis and tracking system (PATS), from three UK hospitals. We obtained prospectively collected data on all adult cardiac patients for the following periods: 1996 to 2010 at University Hospitals of Bristol, 2002 to 2010 at University Hospitals of Birmingham National Health Service (NHS) Foundation Trust, and 2004 to 2010 at Wolverhampton Heart and Lung Centre. Our analyses included all patients aged ≥16 years, who underwent cardiac surgery, with or without cardiopulmonary bypass (CBP), and including those who underwent surgery to the thoracic aorta; we excluded patients already on renal replacement therapy (RRT) or who had received a kidney transplants, and patients who died in theatre.

Definitions of perioperative variables used were consistent across the three sites, and are as specified by the UK National Adult Cardiac Surgery Audit (NACSA) [15]. NACSA is a component of the UK cardiac surgery national quality assurance programme, whereby a defined set of perioperative data are collected prospectively by the anaesthetist, surgeon, and ICU, high-dependency unit (HDU), and ward nurses on the PATS databases. These data are submitted to the National Institute Clinical Outcome Research (NICOR) for analysis and the generation of risk-adjusted outcome data [16]. The data undergo routine internal quality assurance prior to submission as well as external data quality assessment by NICOR. The NACSA Annual Report for 2010 indicated that these three centres are among the best units nationally for data quality [17]. PATS data were linked using name, hospital number, and date of birth, to institutional biochemistry databases that record serial serum creatinine results, to identify those patients who developed AKI. During the study serum creatinine was measured using the Jaffe method at all three centres. Wolverhampton and Birmingham used the Roche Modular system (Roche Diagnostics, Ltd, Lewes, UK). Bristol used the Olympus Diagnostics System AU640 or AU2700 (Olympus Diagnostic Systems, Southall, UK). The assays were calibrated using the manufacturers’ controls with inter-institutional diagnostic accuracy monitored by a national (UK) quality assessment scheme.


Measurement of acute kidney injury

AKI was classified according to the KDIGO guidelines [11]. Stage-1 AKI was defined as an increase from baseline of ≥26 μmol/L of postoperative creatinine or an increase of 1.5 to 1.9 times the preoperative creatinine within 7 days; stage 2 was an increase of 2.0 to 2.9 times the preoperative creatinine; stage-3 AKI was an increase ≥3 times the preoperative creatinine or an increase to ≥354 μmol/L or when the patient commenced RRT. The RRT was administered for uraemia, volume overload, or biochemical abnormalities, according to institutional protocols. Urine output data were not available and therefore not used in our AKI definition. The baseline creatinine value was defined as the preoperative value obtained closest to the date of the operation. Cases where the baseline data were missing, most commonly emergency and salvage patients, were not included in the complete case analyses.

Demographic factors

Information was extracted from PATS on age (grouped into three categories: <60; 60 to 74; or ≥75 years), sex, body mass index (BMI) (grouped into five categories: <20.0; 20.0 to 24.9; 25.0 to 29.9; 30.0 to 34.9; or ≥35.0 kg/m2) and smoking status (never smoked; ex-smoker; or current smoker).

Preoperative factors

Information on prespecified factors was obtained from PATS for the three centres. The presence of angina was grouped according to the Canadian Cardiovascular Society (CCS) categories: no angina, CCS 1 (ordinary activity does not cause angina); CCS 2 (slight limitation); CCS 3 (moderate limitation); or CCS 4 (inability to carry out any physical activity without discomfort). Dyspnoea was grouped according to the New York Heart Association (NYHA) functional classification: NYHA 1 (no symptoms and no limitation in ordinary physical activity); NHYA 2 (mild symptoms and slight limitation during ordinary activity); NYHA 3 (marked limitation in activity due to symptoms, comfortable only at rest); or NYHA 4 (severe limitations, experiences symptoms even while at rest). Previous myocardial infarction (MI) was categorised as; none; 1; or ≥2. Information was obtained on previous cardiac, vascular or thoracic surgery (0 and ≥1), the presence of diabetes, peripheral vascular disease, pulmonary disease, neurological disease, hypertension (treated or blood pressure (BP) >140/90) and preoperative haemoglobin (<10.0; 10.0 to 11.9; or ≥12.0 g/dL). Glomerular filtration rate (GFR) was estimated from the Cockcroft-Gault equation using the preoperative creatinine value obtained closest to the day of surgery, and grouped as: <30.0; 30.0 to 59.9; 60.0 to 89.9; or ≥90 μmol/L. Heparin or nitrates usage was grouped according to: none; within a week; or at operation. Any critical preoperative event was considered to be cardiogenic shock; preoperative intravenouse (IV) inotropes; or preoperative ventilation or intra-aortic balloon pump (IABP). The time between catheterisation and surgery was grouped as: within 24 hours; >24 hours for this admission; or >24 hours for a previous admission. Information was obtained on triple vessel disease, left main stem disease, ejection fraction (grouped as: good ≥50%; fair 30 to 49%; or poor <30%), operative priority (elective, urgent and emergency/salvage). Finally the cardiac procedures being undertaken were classified as coronary artery bypass graft surgery (CAGB) only; valve only; or CABG and valve and other/multiple. Procedures classed as other included major aortic surgery; left ventricular aneurysmectomy; atrial myxoma surgery; pulmonary embolectomy; epicardial pacemaker placement; pericardectomy; atrial septal defect closure; procedure for congenital conditions; acquired ventricular septal defect closure; pulmonary endarterectomy; atrial fibrillation ablation; myomectomy; cardiac surgery plus carotid endarterectomy or peripheral vascular procedures; or any other cardiothoracic procedure not listed above.

Outcomes following surgery

Information on pulmonary complications (reintubation and ventilation; full tracheostomy; need for CPAP or BIPAP; or prolonged ventilation >48 hours), infectious complications (sternal wound infection; leg wound infection; chest infection; or septicaemia or other infection), length of stay in hospital (in days) and death in hospital was obtained from PATS.

Statistical analyses

The association between AKI and outcomes following surgery were modelled using logistic regression models. The data from Bristol and Birmingham were used to develop the prognostic models (development sample) and the models were validated on the data from Wolverhampton (validation sample). Any-stage AKI and stage-3 AKI were modelled using separate logistic regression models. Univariable associations were examined for all demographic and preoperative factors. Mutivariable associations were examined by entering all demographic and preoperative factors into a single model, which controlled for all these main effects, in a full model. Factors selected for the initial prognostic model were those from the multivariable full model with P <0.001. Factors selected for the more inclusive prognostic model were those from the full model with P <0.05. The area under the receiver operating characteristic (ROC) curve (AUC) was calculated for each model in the development and validation samples to quantify diagnostic utility. Model calibration was assessed using the Hosmer-Lemeshow test and calibration plots. Calibration plots of observed versus predicted values for AKI were analysed using linear regression to provide the slope and intercept where the closer the intercept to 0, and the closer the slope to 1, the better the calibration. The discrimination and calibration of the models were compared to two published AKI risk scores from US data [7,8], a recently published risk score from Australia [10] as well as a mortality risk score - the logistic Euroscore [18]. We were able to match all the variables for the Cleveland clinic score [7] except congestive heart failure and left ventricular ejection fraction <35% (data cut at <30%). For the Ng score [10] we were able to match all the variables with the exception of infective endocarditis. For the Mehta score [8] we were unable to match ethnicity in the development sample, and were unable to precisely match the field MI within 3 weeks, or the type of valve procedure (mitral versus aortic).

We conducted sensitivity analyses, first by considering the inclusion of two-way interactions in our score, and second, by using multiple imputations by chained equations [19,20] to account for missing data. The analysis assumes any systematic difference between the missing values and the observed values can be explained by differences in observed data. We used the ice command [21] in Stata to impute confounder and outcome missing data. Variables that may help explain the missing data (for example, demographic information, preoperative factors, AKI status and outcomes following surgery) were included in the imputation model. The missing values were sampled from their predictive distribution, based on the observed data. Standard regression analyses were used to fit the model of interest to each of the imputed datasets. Ten cycles of regression were carried out and 20 datasets imputed. All 20 estimated associations were combined to give one overall estimated association of interest. Standard error was calculated using the Rubin rules [22,23]. These rules took into account the variability in results between the imputed datasets, indicating the uncertainty of the missing values. All analyses were carried out in StataTM version 13.

Online calculator

We constructed a web-based calculator for the any-stage AKI risk score [24].


Characteristics of patients

Data were available from 18,686 patients from Bristol and 7,306 patients from Birmingham (the developmental sample) and 4,862 patients from Wolverhampton (the validation sample). Missing data for each centre are reported in Table 1. The most common field that was missing was the baseline Hb. This was missing in 39% of the Birmingham data, and predominantly in the years prior to 2004. Emergency/salvage surgery were more likely to have missing preoperative creatinine (Bristol 10%, Birmingham 14%, Wolverhampton 3% missing) compared to elective or urgent surgery (Bristol 1%, Birmingham 1%, Wolverhampton 0% missing). The complete case analysis included 20,995 patients, of whom 12,435 were from Bristol, 4,092 were from Birmingham and 4,468 were from Wolverhampton (Table 2).

Table 1 Number and percentage missing for each variable by centre
Table 2 Characteristics of patients by centre in the complete case data

In the complete case data median age by centre was: Bristol 68 years (IQR 60, 74), Birmingham 67 years (59, 73) and Wolverhampton 67 years (60, 74). The majority of operations were performed on men (76.2% in Bristol, 73.1% in Birmingham and 76.5% in Wolverhampton). The most common procedure was CABG only (69.1% in Bristol, 54.8% in Birmingham and 66.6% in Wolverhampton). Mean (SD) baseline creatinine values by centre were Bristol 108.9 μmol/L (28.5), Birmingham 107.1 μmol/L (34.4) and Wolverhampton 94.8μmol/L (28.4) Any stage of AKI affected 23.7% patients in the Bristol dataset, 20.9% patients in Birmingham and 24.6% patients in Wolverhampton. Stage-3 AKI was slightly more prevalent in the validation dataset (6.0%) compared to the development sample (3.6% in Bristol and 5.0% in Birmingham). The criteria by which patients were defined as having AKI are described in Table 3.

Table 3 Total patients who met different acute kidney injury (AKI) criteria in the complete case data

KDIGO-defined AKI severity and outcome

AKI was associated with increased hospital stay (Table 4). Patients without AKI had a median postoperative hospital stay of 6 days in Bristol (IQR 5, 8) and Wolverhampton (5, 7), and 9 days (6, 13) in Birmingham (Table 4). For patients with stage-1 AKI, median postoperative stay increased to 9 days (IQR 7, 13) in Bristol, 13 (9, 19) in Birmingham and 8 (6, 13) in Wolverhampton. Patients with the most severe AKI (stages 2 and 3) had the longest postoperative stay. In the complete case data (n = 20,995) stage-1 AKI was associated with higher odds of infective and pulmonary complications and an almost five-fold increase in the odds of death in hospital (Table 5). Odds ratios for stage-3 AKI compared to no AKI were 7.6 (95% CI 6.6, 8.8) for infective complications, 9.4 (8.2, 10.8) for pulmonary complications and 81.2 (62.3, 106.0) for death in hospital. Over 80% of all deaths in hospital were preceded by AKI.

Table 4 Length of stay following surgery by acute kidney injury (AKI) status
Table 5 Outcomes following surgery by acute kidney injury (AKI) status, data from all centres

Model building for the clinical prediction score

A total of 23 factors were examined for their associations with AKI. Odds ratios for unadjusted and fully adjusted models are presented in Table 6 (for models of any-stage AKI) and Table 7 (for models of stage-3 AKI). In fully adjusted models, 15 factors were strongly associated with any-stage AKI (P <0.001) and were included in the initial prognostic model. A further five factors were associated with any-stage AKI at the level of P <0.05, and were added to the more inclusive model. For stage-3 AKI, eight factors were included in the initial prognostic model (P <0.001), a further three factors were included in the more inclusive prognostic model (P <0.05).

Table 6 Associations of preoperative factors with any-stage acute kidney injury (AKI) in the development sample (Bristol and Birmingham)
Table 7 Associations of preoperative factors with stage-3 acute kidney injury (AKI) in the Bristol and Birmingham development sample (complete case data)

Diagnostic utility

Any-stage AKI score

The ROC AUC for the initial model for any-stage AKI in the development sample was 0.73 (95% CI 0.72, 0.74; Figure 1 and Table 8) and was very similar in the validation sample (AUC 0.74; 95% CI 0.72, 0.76). The Hosmer Lemshow test (P = 0.490, Table 9) and plot of observed versus predicted any-stage AKI (Figure 2A and Table 9) demonstrated good calibration in the development sample. The initial model was less well-calibrated in the validation sample (Hosmer Lemshow P = 0.192, and Figure 2B). The more inclusive model did not demonstrate better discrimination when compared to the initial model (Table 8), however, it did demonstrate better calibration as evidenced by the Hosmer-Lemeshow test (P = 0.406 in the validation sample) and plots of observed versus predicted any-stage AKI (Figure 2B and Table 9).

Figure 1
figure 1

Receiver operating characteristic curves for the any-stage acute kidney injury (AKI) and stage-3 AKI risk models in the external validation dataset (Wolverhampton data).

Table 8 Discrimination: area under ROC curves for the different scores and for different procedures (complete case data)
Table 9 Calibration: Hosmer Lemshow tests and results of linear regression analysis of observed versus expected calibration plots
Figure 2
figure 2

Calibration plots of observed versus expected values for the any-stage acute kidney injury (AKI), stage-3 AKI, Cleveland, Mehta, Ng score and EuroScore in the development (Bristol, Birmingham) and validation (Wolverhampton) datasets. (A) Any-stage AKI, Bristol and Birmingham. (B) Any-stage AKI, Wolverhampton. (C) Stage-3 AKI, Bristol and Birmingham. (D) Stage-3 AKI, Wolverhampton.

For the validation sample discrimination by the any-stage AKI score was better than the Euroscore (AUC 0.68; 95% CI 0.67, 0.70) and Cleveland clinic (AUC 0.70; 95% CI 0.69, 0.72), and equivalent to the Mehta Score (AUC 0.74; 0.72, 0.76) and Ng score (AUC 0.73; 95% CI 0.71, 0.75) (Table 8). Hosmer-Lemshow tests and calibration plots (Table 9 and Figure 2B) demonstrated that the more inclusive model demonstrated better calibration than each of the four comparison scores.

Stage-AKI score

In the validation sample both the initial prognostic stage-3 AKI model (0.78, 95% CI 0.75, 0.80) and the more inclusive stage-3 AKI model (0.79, 95% CI 0.76, 0.81) demonstrated good discrimination (Figure 1 and Table 8). The stage-3 AKI score had better discrimination than the Euroscore (0.73; 95% CI 0.70, 0.76) and similar discrimination to the Cleveland Clinic (0.78; 95% CI 0.75, 0.81), Metha score (0.79; 95% CI 0.77, 0.82) and Ng Score (0.79, 95% CI 0.76, 0.82) (Table 8). Plots of observed versus expected stage-3 AKI indicated that the initial stage-3 AKI model had better calibration that each of the four comparison scores, however Hosmer-Lemshow tests suggested that calibration of the stage-3 score, as well as the four comparison scores was poor (P <0.001) in both development and validation samples (Figure 2C, D).

Final model coefficients

The model coefficients for the initial model for any-stage AKI and the initial model for stage-3 AKI are shown in Table 10. Factors associated with any-stage AKI in the final model were older age, male sex, BMI >35 kg/m2, current smokers, higher dyspnoea categories, diabetes, peripheral vascular disease, hypertension, lower haemoglobin, lower estimated GFR, catheter to surgery within 24 hours, triple vessel disease, poor ejection fraction, emergency/salvage operations and more complex surgery. The only factor included in the stage-3 model that was not in the any-stage model was a critical preoperative event. Sensitivity analyses showed that adding two-way interactions at P <0.05 did not improve the diagnostic utility in the validation sample (data not shown). Further sensitivity analysis using only more recent data (after 2002) did not improve discrimination. Coefficients from imputed data for the final model were very similar to the complete case analysis (data not shown).

Table 10 Model coefficients for final models (complete case data)


This study has developed two new risk scores for the preoperative identification of cardiac surgery patients who are at increased risk of developing AKI. These scores were developed using a large cohort of patients from two UK cardiac centres and externally validated using data from a third UK centre. We compared the diagnostic utility of these scores to four previously published scores; Euroscore, Mehta Score, Cleveland Clinic score and the Ng score. Our risk prediction score for any-stage AKI has demonstrated better discrimination compared to the Euroscore and the Cleveland Clinic Score, and equivalent discrimination to the Mehta and Ng scores. The any-stage AKI score demonstrated better calibration than the four comparison scores. The stage-3 AKI risk prediction score demonstrated good discrimination, as did the four comparison risk scores, but these scores were less well-calibrated.

The study has important strengths. It has demonstrated the prognostic utility of the KDIGO AKI definition in a large multicentre cohort, and has confirmed the prognostic importance of milder forms of AKI, as has been identified by earlier consensus definitions [12,13]. The two risk scores we have developed are the first to our knowledge that have been designed to predict a consensus definition of AKI. The use of consensus AKI definitions as endpoints is a key element of study design that is important for standardisation of reporting and comparative analyses of trials. Furthermore, the any-stage AKI risk score is the first externally validated AKI risk prediction score that has been designed to include patients at risk of KDIGO stage-1 AKI, which we have shown to have prognostic utility in the current study. The any-stage AKI score is available as a web-based calculator [24] that is freely available to any researcher or clinician. It can be accessed by any smart phone, tablet or tabletop computer, and can be completed in less than 1 minute. Using a cutoff for the any-stage AKI score of 30% will select patients for interventional studies with a positive predictive value of 44% and a negative predictive value of 85% for AKI. We suggest that this score may be used to identify an enriched patient cohort for inclusion in clinical trials. We did not detect any advantage for our stage-3 AKI score in relation to existing scores. The stage-3 AKI score may have greater utility as a risk adjustment tool for quality assurance, or clinically to assist with informed consent. However it did not demonstrate clear advantages beyond existing score and we have not developed a web-based calculator for this score.

The study has several limitations. First, retrospective analyses of routinely collected data have limitations with respect to data quality, specifically missing data, misclassification and inconsistent data definitions between individuals and sites. To minimise these we used prospectively collected data from three clinical databases that use common, standardised definitions for clinical risk factors. The data had undergone both internal and external quality checking, had low levels of missing data, and the three sites contributing to the study are listed as among the top for data quality within the UK NACSA programme. Importantly, baseline creatinine values, defined in this study as the preoperative value obtained closest to the date of operation, were present in over 98% of patients, as would be expected in a cardiac surgery cohort where preoperative bloods are routinely taken in all but the very sickest patients. This is important; alternate definitions of baseline change the reported frequency of stage-1 AKI [25] - a key consideration in this study. The only variable for which there was a significant proportion of missing data was baseline haemoglobin, and this was restricted to a single centre. To address the limitations posed by missing data we performed our primary analysis in the complete case data. We confirmed the robustness of the models developed in the complete case analysis in imputed data. Model coefficients do not differ substantially in the imputed data, which suggests that the missing data are unlikely to have introduced bias into our model.

Second, unmeasured confounders are an important consideration in any retrospective analysis. For example, the Bristol and Birmingham databases do not routinely record patient race, and this has been found to be an important variable in some risk scores but not others [6]. This may also have affected the estimated GFR (eGFR) calculation, a key component of the score. Equations to calculate eGFR that include race, such as, for example, the modified diet in renal disease equation [26] have greater accuracy.

Third, intra and postoperative events that affect the incidence of AKI also represent confounders [10,27]. However, it was our intention to design a score that will identify patients at risk of AKI preoperatively on the basis that the most effective prevention strategies are likely to be those applied before or at the commencement of surgery rather than after injury (surgery) has occurred. Fourth, the AKI definition used did not incorporate urine output data, as defined in the KDIGO definition, as these data were not recorded. Urine output data are known to significantly alter the estimates in patients with AKI, although whether this improves the prognostic utility of the scores is unclear particularly in cardiac surgery, where perioperative urine output is closely monitored, and oliguria aggressively treated [28]. The use of creatinine-based definitions of AKI has other limitations in cardiac surgery. In the validation sample in this study 48% of patients had undergone coronary angiography within the same hospital admission and 39% were undergoing urgent inpatient surgery. Many of these patients would have sustained significant renal insults prior to surgery and this may have increased the baseline serum creatinine value.

A final limitation is the applicability of our findings to non-UK populations. We used well-defined variables that are routinely collected in UK databases, however, differences in variable definitions that occur between databases and countries may limit the wider utility of the model. This was a limitation of the two North American comparison scores. The Mehta score, in particular was not composed of variables that were routinely collected in the UK data, including MI <3 weeks, and ethnicity. The Mehta score was also developed in a selected population and both American scores excluded patients undergoing surgery without CPB, but were subsequently tested in a UK population that included a significant proportion of off-pump procedures. Conversely variable definitions were largely comparable between the UK and Australian cohorts, and the Ng score demonstrated good discrimination but poor calibration. This may reflect the non-consensus definition of AKI used in the Ng score or differences in patient populations and clinical practice between the two countries. These findings highlight the problems of risks scores developed in distinct geographic populations. The UK AKI risk scores described here may suffer from similar limitations, and we conclude that their wider utility requires independent external validation.


This study has used a large multicentre cohort to develop and validate a risk prediction score for AKI stages 1 to 3. This is the only published score that predicts less severe AKI, as currently defined by the consensus KDIGO definition. We suggest that this new score core will have clinical utility for risk stratification and facilitate cohort enrichment for clinical trials of novel renoprotective interventions.

Key messages

AKI is a common and severe complication of cardiac surgery that contributes to morbidity, mortality and increased healthcare costs

Previous trials of renoprotective interventions have been limited by the enrolment of low- or mixed-risk AKI cohorts, and no effective treatment has been identified thus far

AKI risk scores are an objective and transparent way of identifying cohorts of patients at increased risk of AKI for clinical trials; however, existing scores identify only those patients who develop severe AKI requiring renal replacement therapy

This study used data from two large cardiac surgery centres to develop a risk score that identifies all patients at risk of AKI (stages 1 to 3) with high discrimination and good calibration in an external validation dataset from a third centre

The utility of this score is currently being prospectively validated as a cohort enrichment toll in several ongoing clinical trials

Authors' contributions

GJM, JAS and VV conceived the study and established the funding for the study. GJM, MB and DP were principal investigators for the three institutions and helped design the study. KB, KT and JAS undertook the analyses. KB, GJM and JAS wrote the manuscript. All the authors have reviewed and approved the manuscript.



acute kidney injury


Acute Kidney Injury Network


area under the curve


British Heart Foundation


bi-level positive airway pressure


body mass index


coronary artery bypass graft surgery


Canadian Cardiovascular Society


continuous positive airway pressure


glomerular filtration rate




High Dependency Unit


intra-aortic balloon pump


Interquartile range


Kidney Disease Improving Global Outcomes


myocardial Infarction


National Adult Cardiac Surgery Audit


National Health Service


National Institute Clinical Outcome Research


New York Heart Association


patient analysis and tracking system


randomised controlled trial


risk injury, failure, loss, end stage


receiver operating characteristic


renal replacement therapy


  1. Lassnigg A, Schmidlin D, Mouhieddine M, Bachmann LM, Druml W, Bauer P, Hiesmayr M: Minimal changes of serum creatinine predict prognosis in patients after cardiothoracic surgery: a prospective cohort study. J Am Soc Nephrol. 2004, 15: 1597-1605. 10.1097/01.ASN.0000130340.93930.DD.

    Article  CAS  Google Scholar 

  2. Howell NJ, Freemantle N, Bonser RS, Graham TR, Mascaro J, Rooney SJ, Wilson IC, Pagano D: Subtle changes in renal function are associated with differences in late survival following adult cardiac surgery. Eur J Cardiothorac Surg. 2012, 41: e38-e42. 10.1093/ejcts/ezr329.

    Article  Google Scholar 

  3. Park M, Coca SG, Nigwekar SU, Garg AX, Garwood S, Parikh CR: Prevention and treatment of acute kidney injury in patients undergoing cardiac surgery: a systematic review. Am J Nephrol. 2010, 31: 408-418. 10.1159/000296277.

    Article  Google Scholar 

  4. Zacharias M, Mugawar M, Herbison GP, Walker RJ, Hovhannisyan K, Sivalingam P, Conlon NP: Interventions for protecting renal function in the perioperative period. Cochrane Database Syst Rev. 2013, 9: CD003590-

    Google Scholar 

  5. Palevsky PM, Molitoris BA, Okusa MD, Levin A, Waikar SS, Wald R, Chertow GM, Murray PT, Parikh CR, Shaw AD, Go AS, Faubel SG, Kellum JA, Chinchilli VM, Liu KD, Cheung AK, Weisbord SD, Chawla LS, Kaufman JS, Devarajan P, Toto RM, Hsu CY, Greene T, Mehta RL, Stokes JB, Thompson AM, Thompson BT, Westenfelder CS, Tumlin JA, Warnock DG, et al: Design of clinical trials in acute kidney injury: report from an NIDDK workshop on trial methodology. Clin J Am Soc Nephrol. 2012, 7: 844-850. 10.2215/CJN.12791211.

    Article  Google Scholar 

  6. Huen SC, Parikh CR: Predicting acute kidney injury after cardiac surgery: a systematic review. Ann Thorac Surg. 2012, 93: 337-347.

    Google Scholar 

  7. Thakar CV, Arrigain S, Worley S, Yared J-P, Paganini EP: A clinical score to predict acute renal failure after cardiac surgery. J Am Soc Nephrol. 2005, 16: 162-168. 10.1681/ASN.2004040331.

    Article  Google Scholar 

  8. Mehta RH, Grab JD, O'Brien SM, Glower DD, Haan CK, Gammie JS, Peterson ED: Bedside tool for predicting the risk of postoperative dialysis in patients undergoing cardiac surgery. Circulation. 2006, 114: 2208-2216. 10.1161/CIRCULATIONAHA.106.635573.

    Article  Google Scholar 

  9. Englberger L, Suri RM, Li Z, Casey ET, Daly RC, Dearani JA, Schaff HV: Validation of clinical scores predicting severe acute kidney injury after cardiac surgery. Am J Kidney Dis. 2010, 56: 623-631. 10.1053/j.ajkd.2010.04.017.

    Article  Google Scholar 

  10. Ng SY, Sanagou M, Wolfe R, Cochrane A, Smith JA, Reid CM: Prediction of acute kidney injury within 30 days of cardiac surgery. J Thorac Cardiovasc Surg. 2014, 147: 1875-1883. 10.1016/j.jtcvs.2013.06.049.

    Article  Google Scholar 

  11. KDIGO clinical practice guideline for acute kidney injury. Kidney Int. 2012, Suppl2: 1-138.

  12. Mehta RL, Kellum JA, Shah SV, Molitoris BA, Ronco C, Warnock DG, Levin A: Acute Kidney Injury Network: report of an initiative to improve outcomes in acute kidney injury. Crit Care. 2007, 11: R31-10.1186/cc5713.

    Article  Google Scholar 

  13. Bellomo R, Ronco C, Kellum JA, Mehta RL, Palevsky P: Acute renal failure–definition, outcome measures, animal models, fluid therapy and information technology needs: the Second International Consensus Conference of the Acute Dialysis Quality Initiative (ADQI) Group. Crit Care. 2004, 8: R204-R212. 10.1186/cc2872.

    Article  Google Scholar 

  14. Bastin AJ, Ostermann M, Slack AJ, Diller GP, Finney SJ, Evans TW: Acute kidney injury after cardiac surgery according to Risk/ Injury/ Failure/ Loss/End-stage, Acute Kidney Injury Network, and Kidney Disease: Improving Global Outcomes classifications. J Crit Care. 2013, 28: 389-396. 10.1016/j.jcrc.2012.12.008.

    Article  Google Scholar 

  15. Bridgewater B, Kinsman R, Walton P, Keogh B: Demonstrating quality: The Sixth National Adult Cardiac Surgery database report. 2008, Henley-on-Thames, Oxfordshire, United Kingdom, Dendrite Clinical Systems Ltd

    Google Scholar 

  16. Blue Book Online: The Society for Cardiothoracic Surgery in Great Britain & Ireland. [], []

  17. National Adult Cardiac Surgery Audit Report 2011-12. [], []

  18. Nashef SA, Roques F, Hammill BG, Peterson ED, Michel P, Grover FL, Wyse RK, Ferguson TB: Validation of European system for cardiac operative risk evaluation (EuroSCORE) in North American cardiac surgery. Eur J Cardiothorac Surg. 2002, 22: 101-105. 10.1016/S1010-7940(02)00208-7.

    Article  Google Scholar 

  19. Raghunathan TE, Lepkowski JM, Van Hoewyk J, Solenberger P: A multivariate technique for multiply imputing missing values using a sequence of regression models. Survey Methodology. 2009, 27: 85-95.

    Google Scholar 

  20. Royston P: Multiple imputation of missing value. Stata Journal. 2004, 4: 227-241.

    Google Scholar 

  21. Royston P: Multiple imputation of missing values: update of ice. Stata Journal. 2005, 5: 527-536.

    Google Scholar 

  22. Vergouwe Y, Royston P, Moons KGM, Altman DG: Development and validation of a prediction model with missing predictor data: a practical approach. J Clin Epidemiol. 2010, 63: 205-214. 10.1016/j.jclinepi.2009.03.017.

    Article  Google Scholar 

  23. Marshall A, Altman DG, Holder RL, Royston P: Combining estimates of interest in prognostic modelling studies after multiple imputation: current practice and guidelines. BMC Med Res Methodol. 2009, 9: 57-10.1186/1471-2288-9-57.

    Article  Google Scholar 

  24. Leicester cardiac Surgery Research Acute Kidney Injury Risk Score Calculator. [], []

  25. Siew ED, Matheny ME, Ikizler TA, Lewis JB, Miller RA, Waitman LR, Go AS, Parikh CR, Peterson JF: Commonly used surrogates for baseline renal function affect the classification and prognosis of acute kidney injury. Kidney Int. 2010, 77: 536-542. 10.1038/ki.2009.479.

    Article  Google Scholar 

  26. Lamb EJ, Stevens PE: Estimating and measuring glomerular filtration rate: methods of measurement and markers for estimation. Curr Opin Nephrol Hypertens. 2014, 23: 258-266. 10.1097/01.mnh.0000444813.72626.88.

    Article  CAS  Google Scholar 

  27. Demirjian S, Schold JD, Navia J, Mastracci TM, Paganini EP, Yared JP, Bashour CA: Predictive models for acute kidney injury following cardiac surgery. Am J Kidney Dis. 2012, 59: 382-389. 10.1053/j.ajkd.2011.10.046.

    Article  Google Scholar 

  28. Md Ralib A, Pickering JW, Shaw GM, Endre ZH: The urine output definition of acute kidney injury is too liberal. Crit Care. 2013, 17: R112-10.1186/cc12784.

    Article  Google Scholar 

Download references


Thanks to Jessica Harris and Chris Rogers for matching clinical and laboratory data in the Bristol dataset (University of Bristol). Collaborators who contributed data to the study: Birmingham: Mr TR Graham, Mr SJ Rooney, Mr IC Wilson, Mr RS Bonser, Mr J Mascaro, Database manager Ms Vivian Barnett; Bristol: Mr M Yeatman, Mr G Asimakopoulos, Mr M Caputo, Mr S Stoica, Mr A Parry, Mr GD Angelini, Mr F Ciulli, Mr R Ascione, Mr J A Hutter, Database manager Mr Alan Davies; Wolverhampton: Mr M Matuszewski, Mr P Yiu, Mr J S Billing, Mr I S Morgan, Mr H Luckraz, Database manager Ms J Sear.


This study was funded by a Flexibility and Sustainability Grant from University Hospitals Bristol NHS Foundation Trust, and BHF Grants CH/12/1/29419 and RG/13/6/29947, and the Leicester NIHR Cardiovascular Biomedical Research Unit. KB is funded Medical Research Council UK (grant RD1826).

Author information

Authors and Affiliations


Corresponding author

Correspondence to Gavin J Murphy.

Additional information

Competing interests

GJ Murphy has served as a consultant to the AKI divisions of AbbVie and Thrasos Innovation Inc. The remaining authors have no disclosures.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Birnie, K., Verheyden, V., Pagano, D. et al. Predictive models for kidney disease: improving global outcomes (KDIGO) defined acute kidney injury in UK cardiac surgery. Crit Care 18, 606 (2014).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: