- Open Access
Furosemide stress test as a predictive marker of acute kidney injury progression or renal replacement therapy: a systemic review and meta-analysis
Critical Care volume 24, Article number: 202 (2020)
The use of the furosemide stress test (FST) as an acute kidney injury (AKI) severity marker has been described in several trials. However, the diagnostic performance of the FST in predicting AKI progression has not yet been fully discussed.
In accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines, we searched the PubMed, Embase, and Cochrane databases up to March 2020. The diagnostic performance of the FST (in terms of sensitivity, specificity, number of events, true positive, false positive) was extracted and evaluated.
We identified eleven trials that enrolled a total of 1366 patients, including 517 patients and 1017 patients for whom the outcomes in terms of AKI stage progression and renal replacement therapy (RRT), respectively, were reported. The pooled sensitivity and specificity results of the FST for AKI progression prediction were 0.81 (95% CI 0.74–0.87) and 0.88 (95% CI 0.82–0.92), respectively. The pooled positive likelihood ratio (LR) was 5.45 (95% CI 3.96–7.50), the pooled negative LR was 0.26 (95% CI 0.19–0.36), and the pooled diagnostic odds ratio (DOR) was 29.69 (95% CI 17.00–51.85). The summary receiver operating characteristics (SROC) with pooled diagnostic accuracy was 0.88. The diagnostic performance of the FST in predicting AKI progression was not affected by different AKI criteria or underlying chronic kidney disease. The pooled sensitivity and specificity results of the FST for RRT prediction were 0.84 (95% CI 0.72–0.91) and 0.77 (95% CI 0.64–0.87), respectively. The pooled positive LR and pooled negative LR were 3.16 (95% CI 2.06–4.86) and 0.25 (95% CI 0.14–0.44), respectively. The pooled diagnostic odds ratio (DOR) was 13.59 (95% CI 5.74–32.17), and SROC with pooled diagnostic accuracy was 0.86. The diagnostic performance of FST for RRT prediction is better in stage 1–2 AKI compared to stage 3 AKI (relative DOR 5.75, 95% CI 2.51–13.33).
The FST is a simple tool for the identification of AKI populations at high risk of AKI progression and the need for RRT, and the diagnostic performance of FST in RRT prediction is better in early AKI population.
The incidence of in-hospital acute kidney injury (AKI), depending on the different AKI criteria used, ranges from 7.0–18.3%  among hospitalized patients in general and up to 20–50% in critically ill populations . The progression of AKI with multiple organ failure can result in poor prognosis. Because of the high morbidities and mortalities associated with AKI, many investigators have focused on several novel biomarkers for earlier detection of AKI, discrimination of etiologies, and prediction of outcomes [3,4,5,6,7]. However, the availability of these novel biomarkers may be limited by its expense or reimbursement issues in different countries. In addition to the therapeutic role of furosemide on fluid balance, blood pressure control, and the management of hypercalcemia, Chawla et al. proposed furosemide stress test (FST) as a tool for predicting AKI progression . Several following studies also utilized FST to predict AKI progression or RRT prediction, but with heterogeneity in AKI criteria, cutoff value of urine output, duration of monitor, or study designs. A few recent studies used FST to predict delayed graft function after kidney transplant [9, 10], and others focused on child populations [11, 12]. As such, in order to more effectively explore the diagnostic accuracy of the FST to predict AKI progression and renal replacement therapy (RRT) initiation, we conducted this meta-analysis according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) diagnostic test accuracy guidelines .
In accordance with the PRISMA guidelines, two investigators (JJ-C, G-K) systematically and independently conducted a review of the relevant published data. A computerized search of the Pubmed, Embase, and Cochrane electronic databases was performed using the keywords “furosemide,” “acute kidney injury,” “acute kidney failure,” and “renal insufficiency,” and medical subject heading (MeSH) terms “Furosemide” [Mesh], “renal insufficiency”[Mesh] AND “Acute Kidney Injury” [Mesh] in order to identify all the relevant studies up to March 2020. Review articles or meta-analyses were not included for analysis, but their citations and references were searched for additional relevant studies. The detail results of literature search were provided in Additional profile 1: Supplementary Table 1A and 1B. We also performed search of gray literature, and the detail is provided in Additional profile 2: Supplementary document.
After the initial screening, the two investigators Jia Jin Chen (JJ-C) and George Kuo (G-K) independently determined the eligibility of the identified studies based on evaluations of their titles, abstracts, and, subsequently, full texts. Any differences in opinion regarding eligibility were resolved by consensus through discussion with Chih-Hsiang Chang. The full text of any article that was deemed potentially relevant was retrieved online. A study was included if it met the criteria of adult humans as its population, and reported the protocol and cutoff point of the FST. We enrolled studies with primary or secondary outcomes reporting the diagnostic value of the FST for AKI progression, RRT, or mortality. Studies were excluded if they met one or more of the following criteria: (1) focused on a population with solid organ or hematopoietic stem cell transplantation, (2) used duplicate cohorts, (3) contained insufficient information for analysis, (4) included pediatric patients, or (5) did not report outcome of interest. Detailed results regarding excluded studies and the reasons for their exclusion are available in Additional profile 1: Supplementary Table 2. We have registered our work in PROSPERO. However, till we finished our work, the registration was still under assessed by the editorial team of PROSPERO; therefore, we provided our initial registered protocol as Additional profile 3.
The two investigators independently extracted relevant information from each study. The extracted data elements included the first author, year of publication, study location, study design, diagnostic criteria of AKI, total sample size, protocol of the FST (that is, furosemide dose, duration of monitor, cutoff value of urine output), patients’ AKI stages, outcomes of interest, whether or not the enrolled population had high plasma neutrophil gelatinase-associated lipocalin (NGAL) levels, and whether patients with chronic kidney disease were excluded or not (Table 1). As for diagnostic test performance, the extracted data included the cutoff value of urine output based on the Youden index or pre-defined criteria, sensitivity, specificity, number of true positive, number of false positive, and the event number of AKI progression, RRT, or mortality (Table 1 and Table 2).
The diagnostic criteria for AKI were different in the eleven enrolled studies. Five of the studies (Elsaegh, Lumlertgul, Martínez, Matsuura, Vairakkani) [14,15,16,17, 22] used the Kidney Disease: Improving Global Outcomes (KDIGO) criteria . Other studies used the Acute Kidney Injury Network (AKIN) criteria . The reference test used in each study was based on the different AKI criteria in each trial or on whether the patients received RRT or mortality during the follow-up period. Four studies (Chawla, Pérez-Cruz, Rewa, Venugopal) [8, 18, 19, 23] used the AKIN stage 3 AKI as primary outcome. Three studies (Martínez, Matsuura, Vairakkani) [16, 17, 22] used the KDIGO stage 3 AKI as primary outcome. Two studies (Elsaegh, Saber) reported primary composite outcome consist of AKI progression and RRT [14, 20]. Six studies (Martínez, Lumlertgul, Matsuura, Pérez-Cruz, Sakhuja, Venugopal) reported outcome of RRT, and two studies (Martínez, Venugopal) reported outcome of mortality [15,16,17,18, 21, 23] (Table 2). Most studies reporting outcome of RRT did not mention the indications of renal replacement therapy except one (Lumlertgul) . In this study, the patient received RRT within 6 h after randomization in early group or received RRT based on conventional indications in standard group.
Risk of bias assessment
The risk of bias for each of the included studies was assessed using the Quality Assessment of Diagnostic Accuracy Studies 2 (QUADAS-2) tool and Review Manager version 5.3 to identify the quality of the included studies . The QUADAS-2 tool is based on four domains (patient selection, index test, reference standard, and flow and timing), which are used to judge the risk of bias. Each study was reviewed independently by JJ-C and G-K, with each investigator assigning a rating of high, low, or unclear risk for all four domains. The judgment principle of “applicability” was the same as the bias section, but there were no signaling questions. Disagreements between the reviewers were resolved by discussion with another author, Chih-Hsiang Chang. If the answer to all the signaling questions for a given domain was “yes,” then the domain was considered to entail a low risk of bias. If the answer to any of the signaling questions for a domain was “no,” then the domain was considered to entail a high risk of bias. The quality of evidence for the diagnostic performance of the FST in this meta-analysis was assessed based on the guidelines of the GRADE Working Group methodology . We summarized the results in a table, which was constructed using the online GRADE Profiler (see Additional profile 4).
We extracted the event number, total sample size, and true positive (TP), true negative (TN), false positive (FP), and false negative (FN) rates for each study or calculated these values according to the reported sensitivity and specificity. Based on these data, the positive likelihood ratio (+LR), negative likelihood ratio (−LR), and diagnostic odds ratio (DOR) could be obtained for each study. The summary measures were calculated using a bivariate model for the pooled sensitivity and specificity. We used a random-effects model with maximum likelihood estimation to calculate the pooled DOR and LR. The above two tests were conducted by the “metabin” function in the “meta” package . To assess the diagnostic performance of the FST regarding AKI progression for FST non-responders, a summary receiver operating characteristics (SROC) curve was constructed by the “restima” function with restricted maximum likelihood estimation in the “mada” package . The threshold effect was examined by using the Spearman correlation coefficient between the logit of sensitivity and logit of “1 – specificity,” and P < 0.05 indicated the existence of a threshold effect. If there is no significant threshold effect, subgroup analysis or meta-regression analysis is warranted to clarify the sources of heterogeneity . Heterogeneity from covariates other than the threshold effect among studies was evaluated using the I2 index, with I2 < 25%, 25–50%, and > 50% indicating mild, moderate, and high heterogeneity, respectively. The LRs indicate whether the accuracy of a particular test would be more accurate for patients with a disease than for subjects without the disease. Several relevant variables were identified, and these variables are summarized in Table 1, Table 2, and Additional profile 1: Supplementary Table 3 (with the specific variables including the diagnostic criteria of AKI, whether the enrolled patients had high plasma NGAL, whether or not the enrolled patients had a clinical diagnosis of AKI, the use of a pre-specified cutoff value of urine output, the used FST protocol, prospective or retrospective study design, and whether the patients with chronic kidney disease were excluded). To explore possible sources of heterogeneity, these variables were applied as moderators in meta-regression weighted by the inverse of the study variance. We performed the meta-regression by using “metareg” function in the “meta” package. A sensitivity analysis was performed after excluding studies using a composite outcome consist of AKI progression and RRT. All analyses were conducted using R version 3.6.2 (2019-12-12) . A two-sided P value of < 0.05 was considered statistically significant.
The initial search retrieved 1902 records. After excluding duplicate articles, the remaining 1679 articles were screened based on their titles and abstracts in order to identify the potentially relevant articles, the full texts of which then were downloaded and reviewed to further determine their eligibility for inclusion in the final analysis. Of the 29 articles, two [32, 33] were suspected of using a duplicate cohort from another study , five were focused on child populations [11, 12, 34,35,36], and three were based on kidney transplant outcomes [9, 10, 37]. Meanwhile, five studies reported different outcomes of interest and the remaining three did not report sufficient information for analysis [38,39,40,41,42,43,44,45] (Additional profile 1: Supplementary Table 2). As such, eleven studies were ultimately included in this meta-analytic study (Fig. 1).
The eleven included trials enrolled a total of 1366 patients with clinical AKI or a risk of AKI. Among those patients, 517 patients and 1017 patients, respectively, had reported outcomes of AKI progression (including the need for RRT) or RRT. Most of the enrolled studies used prospective cohorts, and the remaining four studies used non-prospective study designs or insufficient information about study designs (Chawla, Matsuura, Sakhuja, Vairakkani) [8, 17, 21, 22]. All of the studies, except the two by Matsuura et al. and Sakhuja et al., used a standard furosemide dose, which is 1 mg/kg for the furosemide naive patients and 1.5 mg/kg for those patients exposed to furosemide within 7 days prior to FST [17, 21]. Matsuura et al. used a complex cutoff value, which presented as urine volume divided by the administered furosemide dose (specifically, 3.9 ml of urine output 2 h after per milligram of furosemide administration) . In the study by Sakhuja et al., the used dose of furosemide was at least 1 mg/kg . Most of the studies used a 2-h time interval to determine the FST responsiveness; only one study (Saber) used a 6-h time interval . Most studies used 200 ml urine output within 2 h after furosemide stress test as cutoff value except four studies (Matsuura; Saber; Sakhuja; Vairakkani) [17, 20,21,22] (Table 1). Three studies enrolled populations with high plasma NGAL levels (Chawla, Lumlertgul, Matsuura) [8, 15, 17]. Most studies did not report serum albumin level, which might be an important factor for diuresis response after furosemide administration. Only two studies reported serum albumin level (Matsuura average serum albumin 2.8 g/dl and Sakhuja average serum albumin level 2.9 g/dl) [17, 21]. Besides, the study by Lumlertgul et al. excluded patients with serum albumin level less than 2 g/dl  (Additional profile 1: Supplementary Table 3).
Risk of bias
With the QUADAS-2 tool, study characteristics or designs that might increase the risk of bias were identified. Domain 1 of the QUADAS-2 tool focuses on patient selection. One study (Elsaegh)  enrolled septic ICU patients with normal renal function, and we considered this to entail a high risk of applicability concern. Another study (Matsuura)  enrolled patients with clinical AKI or subclinical AKI (that is, those with high biomarker levels that still did not meet the clinical AKI criteria). Two trials (Vairakkani, Venugopal) [22, 23] provided insufficient information about their study designs; therefore, the domain 1 aspects of the study populations for these two studies were considered to entail unclear risks. Domain 2 of the QUADAS-2 tool addresses the aspect of index tests. Six trials (Chawla; Matsuura; Rewa; Saber; Sakhuja; Vairakkani) [8, 17, 19,20,21,22] selected the urine output threshold to optimize sensitivity and/or specificity; therefore, these six studies were considered to have a high risk of bias regarding domain 2. All of the studies that used the AKIN or KDIGO AKI criteria or RRT as reference standard were considered to have low risk of bias. Four studies (Elsaegh; Pérez-Cruz; Saber; Venugopal) [14, 18, 20, 23] did not report a follow-up period for the primary or secondary outcomes. Therefore, these four studies were considered to have unclear risk of bias regarding domain 4. Because of the reasons mentioned above, we considered one study (Elsaegh)  to have high applicability concern regarding patient selection and another one (Matsuura)  to have unclear concern. The other two domains of applicability concern in the included studies were all rated as low risk. We conducted the risk of bias analysis for all the included studies using Review Manager (RevMan) version 5.3 , and the results are summarized in Fig. 2.
Furosemide stress test for acute kidney injury stage progression prediction
The diagnostic values, cutoffs, and key results are summarized in Table 2. The pooled sensitivity and specificity values were 0.81 (95% CI 0.74–0.87) and 0.88 (95% CI 0.82–0.92), respectively. The pooled positive LR was 5.45 (95% CI 3.96–7.50), and the negative LR was 0.26 (95% CI 0.19–0.36) (Fig. 3). The heterogeneity of the aforementioned four pooled indices ranged from low to moderate (I2 ranged from 0.0 to 42%) (Fig. 3). The pooled DOR was 29.69 (95% CI 17.00–51.85), with low heterogeneity (I2 = 0) (Supplementary Fig. 1). The area under the curve (AUC) for SROC to summarize diagnostic accuracy was 0.88 (Supplementary Fig. 2).
Furosemide stress test for renal replacement therapy prediction
Six studies reported the diagnostic value of FST in predicting RRT in AKI populations. Four studies (Lumlertgul, Martínez, Pérez-Cruz, Venugopal) used FST protocol identical to that used by Chawla et al. (1 mg/kg for the furosemide-naive patients or 1.5 mg/kg for patients who have exposure to furosemide and 200 ml urine output after furosemide administration as cutoff value) [8, 15, 16, 18, 23]. One study (Matsuura) used complex cutoff value as abovementioned . In one retrospective study (Sakhuja) , the patient received at least 1 mg/kg furosemide and the cutoff value of urine output was 600 ml at 6 h after FST (Table 1). The pooled sensitivity and specificity values were 0.84 (95% CI 0.72–0.91) and 0.77 (95% CI 0.64–0.87), respectively. The pooled positive LR was 3.16 (95% CI 2.06–4.86), and the negative LR was 0.25 (95% CI 0.14–0.44). The heterogeneity of the aforementioned four pooled indices was high (I2 ranged from 55 to 83%) (Fig. 4). The pooled DOR was 13.59 (95% CI 5.74–32.17), with high heterogeneity (I2 = 76%) (Supplementary Fig. 3). The area under the curve (AUC) for SROC to summarize diagnostic accuracy was 0.86 (Supplementary Fig. 4).
Furosemide stress test for mortality prediction
Two studies (Martínez, Venugopal) reported the diagnostic value of FST for predicting mortality [16, 23]. Martínez et al. reported the prediction ability of FST for 30-day mortality. The follow-up period was unclear in the study by Venugopal et al. The pooled sensitivity and specificity values were 0.48 (95% CI 0.18–0.79) and 0.78 (95% CI 0.67–0.86), respectively. The pooled positive LR was 2.64 (95% CI 1.39–5.03), and the negative LR was 0.83 (95% CI 0.53–1.29) (Supplementary Figure 5). The heterogeneity of the aforementioned four pooled indices was low to high (I2 ranged from 0 to 58%). The pooled DOR was 4.09 (95% CI 1.11–15.12), with moderate heterogeneity (I2 = 38%) (Supplementary Figure 6). The area under the curve (AUC) for SROC to summarize diagnostic accuracy was 0.78 (Supplementary Figure 7).
Subgroup analysis and sensitivity analysis
To explore the source of heterogeneity, we perform subgroup analysis in regard to the diagnostic criteria of AKI, prospective or non-prospective design, use of a pre-specified cutoff value of urine output, enrolled high NGAL population, different FST protocols, exclusion or inclusion of patients with baseline CKD, and whether the primary outcome was a pure outcome. The analysis of threshold effect was performed with Spearman rank correlations (ρ = 0.197; P = 0.62). The results implied that there was no significant threshold effect and subgroup analysis was required. The diagnostic performance of FST for AKI progression was not affected by different diagnostic criteria of AKI, exclusion or inclusion of CKD, different duration to monitor urine output, different FST protocol, or the purity of primary outcome. The results of the subgroup analysis and sensitivity analysis are summarized and presented in Table 3.
There were 2 studies that provided a composite outcome consisting of diagnostic performance of FST for AKI progression and RRT prediction (Elsaegh; Saber) [14, 20]. A sensitivity analysis was conducted after excluding these two trials. The pooled sensitivity and specificity values of the remaining 7 studies were 0.79 (95% CI 0.71–0.85) and 0.88 (95% CI 0.83–0.91), respectively. The pooled positive LR was 6.07 (95% CI 4.45–8.29), and the negative LR was 0.27 (95% CI 0.20–0.38) (Supplementary Figure 8). The pooled DOR was 30.26 (95% CI 16.67–54.94) (Supplementary Figure 9). The SROC with pooled diagnostic accuracy was 0.90 (Supplementary Figure 10).
We also performed Spearman rank correlations (ρ = 0.579; P = 0.23) and then subgroup analysis for FST as an RRT prediction tool. The RRT incidence is different in enrolled studies (from 15.6% in the study by Matsuura et al. to 66.6% in the study by Lumlertgul et al.). These six studies are also with variable follow-up period (from 1 day to 30 days) and enrolled patients of different AKI severity (stage 1–2 or stage 3). Subgroup analysis showed that the diagnostic performance was not affected by study population with different RRT incidences (RRT incidence < 20% vs. ≥ 20%; the relative diagnostic odds ratio 1.19 with 95% CI 0.37–3.78) or different follow-up durations (follow-up duration not reported or < 7 days vs. ≥ 7 days; the relative diagnostic odds ratio 3.71 with 95% CI 0.80–71.11). However, the diagnostic performance was better in early AKI stage population (stage 1–2) than in stage 3 (relative diagnostic odds ratio 5.75 with 95% CI 2.51–13.33) (Table 4).
Furosemide has been used for decades. Its pharmacodynamics, pharmacokinetics, and adverse effects are well described in patients with chronic kidney disease or nephrotic syndrome, but less data is available regarding its effects in AKI populations. Because of its low cost and availability, using diuretic response as a preserved renal functional marker has been proposed. In 1973, Baek et al. reported that the urinary free water excretion following intravenous furosemide administration could serve as a diagnostic tool for acute tubular necrosis (ATN) . Pandit and colleagues found that, while on furosemide therapy, patients who had urine output less than 1200 ml 1 day after coronary artery bypass graft surgery were more likely to experience AKI, with a specificity of 97.93% . It has been no one until 2013, Chawla et al. proposed a standard FST protocol, in which diuretic-naive patients receive 1 mg/kg of furosemide and patients who were exposed to furosemide within 7 days received 1.5 mg/kg of furosemide . They use 200 ml urine output at 2 h after furosemide administration to serve as a cutoff value. In subjects with normal renal function or mild AKI, the infusion dose and creatinine clearance are major determinants of diuretic response [47, 48]. After AKI, several tubular function alterations could affect diuretic response, including a decrease of Na-K-Cl cotransporter 2 expression, Na-K-ATPase redistribution , and organic acid transporter mistargeting . Therefore, the FST seems to provide a quick and easy method for the assessment of glomerular filtration and tubular damage. Despite this aforementioned role in diagnostics, furosemide is unlikely to reduce mortality or decrease the risk of RRT in AKI populations . We thus performed this systematic review and meta-analysis to clarify the predictive value of the FST on AKI progression, the need for RRT, and in-hospital mortality. First, the analysis of the diagnostic accuracy of the FST for AKI progression yielded an AUROC of 0.88, with pooled sensitivity and specificity values of 0.81 and 0.88, respectively. Although there are no studies directly comparing the diagnostic accuracy of FST with other biomarkers, the AUROC of FST for AKI progression is not inferior to that of biomarkers, which ranged from 0.70 to 0.85 in previous reports [3, 52, 53]. The diagnostic performance of FST was not affected by whether the enrolled patients have high plasma NGAL or not. Koyner et al., by using the same cohort with Chawla et al., reported the AUROC of FST was higher than that of each biomarker alone. Compared to the overall cohort, the diagnostic accuracy of FST improved in patients with elevated biomarkers . The aforementioned studies and our work imply that FST could serve as a simple risk triage tool combined with or without novel biomarker in early AKI patients.
Second, our work demonstrated that use of the FST as a tool for RRT prediction had an AUROC of 0.86, with high heterogeneity in regarding pooled diagnostic indices. The pooled specificity and positive LR values of the FST for RRT prediction were relatively low. The subgroup analysis showed that diagnostic performance is better in early AKI population. According to the study by Lumlertgul et al., 25% of the FST non-responder eventually did not undergo RRT because these patients did not meet the conventional criteria to start RRT. Lumlertgul et al. also demonstrated that in FST non-responders, whether early or late RRT initiation did not affect short-term mortality or renal recovery . On the other hand, the FST responders are less likely to receive RRT. Matsuura et al. showed that only 5.6% (2/36) FST responders underwent RRT, whereas up to 40% (6/15) of FST non-responders requires RRT . The major problems of RRT prediction lie in the optimal time for RRT initiation. Recently, several randomized controlled trials regarding the optimal timing of RRT initiation were published. The ELAIN trial enrolled KDIGO stage 2 AKI and demonstrated survival benefit from early initiation of RRT. This trial was criticized for its single center designs, the enrollment of post-surgery population, and some patients with significant fluid overload . The AKIKI trial enrolled ICU patients with KDIGO stage 3 AKI and demonstrated no benefit with earlier RRT initiation in regard to 60-day mortality . The IDEAL-ICU trial enrolled patients with septic shock who achieved a “failure” stage of AKI by RIFLE criteria but without life-threatening conditions, and found that there was no survival benefit with “early” RRT . Despite these large trials, we still have no conclusive answers about the optimal timing to start RRT. A recent published meta-analysis demonstrated that early RRT may be beneficial for a shorter duration on mechanical ventilation. However, a watchful waiting strategy based on conventional indications for RRT initiation was generally safe in regard to all-cause mortality . FST non-responsiveness alone might not be a good indicator for RRT initiation. We should also take clinical condition, patient’s demand, and residual renal capacity into consideration as suggested by Acute Disease Quality Initiative XVII conference (ADQI) . Overall, because of the inconsistency of timing of RRT initiation, FST non-responsiveness is not a good predictor for RRT; nevertheless, FST responsiveness might serve as a negative predictor for RRT, especially in early AKI stage.
Our study had several limitations. First, the risks of bias in the investigated studies were not low because of the existence of non-prospective study design, inconsistent diagnostic cutoff values, and mixed patient populations. Second, the serum albumin level has been considered as a factor of diuretic resistance based on early experimental data , and recent studies have shown that the co-administration of albumin and loop diuretics might transiently increase urine water and sodium excretion [60, 61]. However, we did not have information about the serum albumin level in most studies and whether loop diuretics were co-administered with albumin in the enrolled studies. Third, the indications for RRT initiation were not precisely reported in most studies. Further prospective studies with standard RRT initiation protocol are needed for further evaluation the ability of FST for RRT prediction. Due to the lack of large prospective studies meeting our criteria for inclusion, the total number of enrolled patients was relatively small. Two completed but not published trial (NCT02730117, NCT04215419) and another ongoing trial (NCT 01275729) were identified in the process of systematic research. Further results from these larger clinical studies are required in the future for validation the diagnostic role of FST in AKI severity.
In conclusion, FST non-responsiveness has a good predictive ability for AKI progression. The diagnostic performance of FST for RRT prediction is suboptimal and is better in early AKI population. Further trials with larger sample sizes with a high-quality study design are warranted to clarify the benefit of FST in the clinical setting.
Availability of data and materials
Acute kidney injury
Acute Kidney Injury Network
Furosemide stress test
Kidney Disease: Improving Global Outcomes
Neutrophil gelatinase-associated lipocalin
Zeng X, McMahon GM, Brunelli SM, Bates DW, Waikar SS. Incidence, outcomes, and comparisons across definitions of AKI in hospitalized individuals. Clin J Am Soc Nephrol. 2014;9(1):12–20.
Case J, Khan S, Khalid R, Khan A. Epidemiology of acute kidney injury in the intensive care unit. Crit Care Res Pract. 2013;2013:479730.
Klein SJ, Brandtner AK, Lehner GF, Ulmer H, Bagshaw SM, Wiedermann CJ, et al. Biomarkers for prediction of renal replacement therapy in acute kidney injury: a systematic review and meta-analysis. Intensive Care Med. 2018;44(3):323–36.
Zhang A, Cai Y, Wang PF, Qu JN, Luo ZC, Chen XD, et al. Diagnosis and prognosis of neutrophil gelatinase-associated lipocalin for acute kidney injury with sepsis: a systematic review and meta-analysis. Crit Care. 2016;20:41.
Chang CH, Yang CH, Yang HY, Chen TH, Lin CY, Chang SW, et al. Urinary biomarkers improve the diagnosis of intrinsic acute kidney injury in coronary care units. Medicine (Baltimore). 2015;94(40):e1703.
Chen JJ, Fan PC, Kou G, Chang SW, Chen YT, Lee CC, et al. Meta-analysis: urinary calprotectin for discrimination of intrinsic and prerenal acute kidney injury. J Clin Med. 2019;8(1).
Jia HM, Huang LF, Zheng Y, Li WX. Diagnostic value of urinary tissue inhibitor of metalloproteinase-2 and insulin-like growth factor binding protein 7 for acute kidney injury: a meta-analysis. Crit Care. 2017;21(1):77.
Chawla LS, Davison DL, Brasha-Mitchell E, Koyner JL, Arthur JM, Shaw AD, et al. Development and standardization of a furosemide stress test to predict the severity of acute kidney injury. Crit Care. 2013;17(5):R207.
McMahon BA, Koyner JL, Novick T, Menez S, Moran RA, Lonze BE, et al. The prognostic value of the furosemide stress test in predicting delayed graft function following deceased donor kidney transplantation. Biomarkers. 2018;23(1):61–9.
Udomkarnjananun S, Townamchai N, Iampenkhae K, Petchlorlian A, Srisawat N, Katavetin P, et al. Furosemide stress test as a predicting biomarker for delayed graft function in kidney transplantation. Nephron. 2019;141(4):236–48.
Borasino S, Wall KM, Crawford JH, Hock KM, Cleveland DC, Rahman F, et al. Furosemide response predicts acute kidney injury after cardiac surgery in infants and neonates. Pediatr Crit Care Med. 2018;19(4):310–7.
Kakajiwala A, Kim JY, Hughes JZ, Costarino A, Ferguson J, Gaynor JW, et al. Lack of furosemide responsiveness predicts acute kidney injury in infants after cardiac surgery. Ann Thorac Surg. 2017;104(4):1388–94.
McInnes MDF, Moher D, Thombs BD, McGrath TA, Bossuyt PM. And the P-DTAG, et al. preferred reporting items for a systematic review and meta-analysis of diagnostic test accuracy studies: the PRISMA-DTA statement. JAMA. 2018;319(4):388–96.
Elsaegh HKNY, Elsayed HE, Elbasha AM. The role of furosemide stress test in the prediction of severity and outcome of sepsis-induced acute kidney injury. J Egypt Soc Nephrol Transplant. 2018;18:86–95.
Lumlertgul N, Peerapornratana S, Trakarnvanich T, Pongsittisak W, Surasit K, Chuasuwan A, et al. Early versus standard initiation of renal replacement therapy in furosemide stress test non-responsive acute kidney injury patients (the FST trial). Crit Care. 2018;22(1):101.
Diana Vega Martínez GAÁ, Iñiguez JSC, Lozano LEC, Gil FCE. Precisión diagnóstica de prueba de estrés con furosemida para predicción de daño renal agudo severo. Rev Asoc Mex Med Crit Ter Int. 2016;30(4):230–4.
Matsuura R, Komaru Y, Miyamoto Y, Yoshida T, Yoshimoto K, Isshiki R, et al. Response to different furosemide doses predicts AKI progression in ICU patients with elevated plasma NGAL levels. Ann Intensive Care. 2018;8(1):8.
Elizabeth Pérez-Cruz AM-C, José Manuel Conde-Mercado,Eugenia Méndez-Calderillo. Comparación de la prueba de estrés con furosemida y biomarcadores séricos como predictores de la lesión renal aguda. Rev Hosp Jua Mex 2017;84(4):196–202.
Rewa OG, Bagshaw SM, Wang X, Wald R, Smith O, Shapiro J, et al. The furosemide stress test for prediction of worsening acute kidney injury in critically ill patients: a multicenter, prospective, observational study. J Crit Care. 2019;52:109–14.
Saber HMMW, Khaled H, Awad MA. Furosemide stress test, a novel assessment tool for tubular function in critically ill patients with acute kidney injury: potential therapeutic and prognostic values. Res Opin Anesth Intensive Care. 2019;6:273–81.
Sakhuja A, Bandak G, Barreto EF, Vallabhajosyula S, Jentzer J, Albright R, et al. Role of loop diuretic challenge in stage 3 acute kidney injury. Mayo Clin Proc. 2019;94(8):1509–15.
R Vairakkani PAG, M Edwin Fernando, N D Srinivasa Prasad, S Sujit, K Thirumal Valavan, C Hariharan. Furosemide stress test to predict the severity of acute kidney injury. Indian J Nephrol. 2019;29(Suppl S1):S24-S5.
L Venugopal RP, S Sreedhar, S Krishna Kumar, Arun Kumar Donakonda. Frusemide stress test to predict acute kidney injury progression and dialysis requirement: a prospective study. Indian J Nephrol. 2019;29(Suppl S1):S46-S7.
mada: meta-analysis of diagnostic accuracy. R package version 0.5.9. Philipp Doebler hCR-popm.
Section 2: AKI Definition. Kidney Int Suppl (2011). 2012;2(1):19–36.
Mehta RL, Kellum JA, Shah SV, Molitoris BA, Ronco C, Warnock DG, et al. Acute Kidney Injury Network: report of an initiative to improve outcomes in acute kidney injury. Crit Care. 2007;11(2):R31.
Whiting PF, Rutjes AW, Westwood ME, Mallett S, Deeks JJ, Reitsma JB, et al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med. 2011;155(8):529–36.
GRADEpro GDT: GRADEpro Guideline Development Tool [Software]. McMaster University, 2015 (developed by Evidence Prime, Inc.). http://gradepro.org.
Balduzzi S, Rucker G, Schwarzer G. How to perform a meta-analysis with R: a practical tutorial. Evid Based Ment Health. 2019;22(4):153–60.
Arends LR, Hamza TH, van Houwelingen JC, Heijenbrok-Kal MH, Hunink MG, Stijnen T. Bivariate random effects meta-analysis of ROC curves. Med Decis Mak. 2008;28(5):621–38.
R: a language and environment for statistical computing. R Foundation for Statistical Computing V, Austria. R Core Team (2019). https://www.R-project.org/.
Koyner JL, Davison DL, Brasha-Mitchell E, Chalikonda DM, Arthur JM, Shaw AD, et al. Furosemide stress test and biomarkers for the prediction of AKI severity. J Am Soc Nephrol. 2015;26(8):2023–31.
Danielle L. Davison, Ermira Brasha-Mitchell, Jay L. Koyner, Katrina Hawkins, Divya Chalikond , Michael G. Seneff, Lakhmir S. Chawla. The furosemide stress test in combination with urinary biomarkers to predict the progression and severity of acute kidney injury. Am J Kidney Dis 2014;63(5):B42.
Penk J, Gist KM, Wald EL, Kitzmiller L, Webb TN, Li Y, et al. Furosemide response predicts acute kidney injury in children after cardiac surgery. J Thorac Cardiovasc Surg. 2019;157(6):2444–51.
Vargas R, Cuevas J, Lopez E. Furosemide in the early diagnosis of acute renal insufficiency in the newborn infant. Bol Med Hosp Infant Mex. 1977;34(6):1317–30.
Kalra SSA, Narayan K, Gupta R. Use of furosemide stress test for edema control and predicting acute kidney injury in children with nephrotic syndrome. Indian Journal of Child Health. 2017;4(4):488–91.
Palma I SY, Kabagambe S, Perry A, Palma I, Sageshima J, Perez R. The use of a furosemide stress test (FST) for assessment of discarded deceased donor kidneys in an ex-vivo normothermic perfusion model. [abstract]. Am J Transplant. 2017;17(suppl 3):A158.
Baek SM, Brown RS, Shoemaker WC. Early prediction of acute renal failure and recovery. II. Renal function response to furosemide. Ann Surg. 1973;178(5):605–8.
Amrita S. Pandit CP, FACS, Eugene Fernandes. Response to furosemide as marker of acute kidney injury in post-operative CABG patients. Journal of the American College of Surgeons. 2011;213(3):S46.
Rivero J, Rodriguez F, Soto V, Macedo E, Chawla LS, Mehta RL, et al. Furosemide stress test and interstitial fibrosis in kidney biopsies in chronic kidney disease. BMC Nephrol. 2020;21(1):87.
J. Kataoka YF, Y. Norisue, S. Fujitani. Does the response in urine output to a small dose of furosemide predict organ failure after achievement of negative fluid balance in acute respiratory failure? The interim analysis Intensive Care Medicine Experimental 2017;5(Suppl 2):44.
van der Voort PH, Boerma EC, Pickkers P. The furosemide stress test to predict renal function after continuous renal replacement therapy. Crit Care. 2014;18(3):429.
Rivera SGSD, Beltrán MM, Peniche MKG, Gutiérrez JÁA, Calyeca SMV. Furosemide stress test to predict success or failure to remove continuos slow renal replacement therapy in acute renal injury. Rev Asoc Mex Med Crit y Ter Int. 2018;32(2):85–92.
Arkhipov VV, Papaian AV, Rivkin AM, Levicheva OV. [Functional furosemide loading test. Practical use in children with kidney diseases]. Klin Lab Diagn. 2001(3):20–4, 33.
Arifianto H, Wasyanto T, Purwanto B. Acute kidney injury diagnosis in acute heart failure, does furosemide stress test make sense? Eur Heart J, Supplement 2017;19(Supplement E):E20-E1.
Review Manager (RevMan) [Computer program]. Version 5.3. Copenhagen: The Nordic Cochrane Centre, The Cochrane Collaboration, 2014.
Mariano F, Mella A, Vincenti M, Biancone L. Furosemide as a functional marker of acute kidney injury in ICU patients: a new role for an old drug. J Nephrol. 2019;32(6):883–93.
Chawla LS, Ronco C. Renal stress testing in the assessment of kidney disease. Kidney Int Rep. 2016;1(1):57–63.
Schmidt C, Hocherl K, Schweda F, Kurtz A, Bucher M. Regulation of renal sodium transporters during severe inflammation. J Am Soc Nephrol. 2007;18(4):1072–83.
Kunin M, Holtzman EJ, Melnikov S, Dinour D. Urinary organic anion transporter protein profiles in AKI. Nephrol Dial Transplant. 2012;27(4):1387–95.
Krzych LJ, Czempik PF. Impact of furosemide on mortality and the requirement for renal replacement therapy in acute kidney injury: a systematic review and meta-analysis of randomised trials. Ann Intensive Care. 2019;9(1):85.
Haase M, Bellomo R, Devarajan P, Schlattmann P, Haase-Fielitz A. Group NM-aI. Accuracy of neutrophil gelatinase-associated lipocalin (NGAL) in diagnosis and prognosis in acute kidney injury: a systematic review and meta-analysis. Am J Kidney Dis. 2009;54(6):1012–24.
Greenberg JH, Zappitelli M, Jia Y, Thiessen-Philbrook HR, de Fontnouvelle CA, Wilson FP, et al. Biomarkers of AKI progression after pediatric cardiac surgery. J Am Soc Nephrol. 2018;29(5):1549–56.
Zarbock A, Kellum JA, Schmidt C, Van Aken H, Wempe C, Pavenstadt H, et al. Effect of early vs delayed initiation of renal replacement therapy on mortality in critically ill patients with acute kidney injury: the ELAIN randomized clinical trial. JAMA. 2016;315(20):2190–9.
Gaudry S, Hajage D, Schortgen F, Martin-Lefevre L, Pons B, Boulet E, et al. Initiation strategies for renal-replacement therapy in the intensive care unit. N Engl J Med. 2016;375(2):122–33.
Barbar SD, Clere-Jehl R, Bourredjem A, Hernu R, Montini F, Bruyere R, et al. Timing of renal-replacement therapy in patients with acute kidney injury and sepsis. N Engl J Med. 2018;379(15):1431–42.
Chen JJ, Lee CC, Kuo G, Fan PC, Lin CY, Chang SW, et al. Comparison between watchful waiting strategy and early initiation of renal replacement therapy in the critically ill acute kidney injury population: an updated systematic review and meta-analysis. Ann Intensive Care. 2020;10(1):30.
Ostermann M, Joannidis M, Pani A, Floris M, De Rosa S, Kellum JA, et al. Patient selection and timing of continuous renal replacement therapy. Blood Purif. 2016;42(3):224–37.
Inoue M, Okajima K, Itoh K, Ando Y, Watanabe N, Yasaka T, et al. Mechanism of furosemide resistance in analbuminemic rats and hypoalbuminemic patients. Kidney Int. 1987;32(2):198–203.
Phakdeekitcharoen B, Boonyawat K. The added-up albumin enhances the diuretic effect of furosemide in patients with hypoalbuminemic chronic kidney disease: a randomized controlled study. BMC Nephrol. 2012;13:92.
Kitsios GD, Mascari P, Ettunsi R, Gray AW. Co-administration of furosemide with albumin for overcoming diuretic resistance in patients with hypoalbuminemia: a meta-analysis. J Crit Care. 2014;29(2):253–9.
This research received no external funding.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Diagnostic odd ratio of FST for prediction of AKI progression.
SROC curves of FST for prediction of AKI progression, SROC (summary receiver operating characteristic).
Diagnostic odd ratio of FST for prediction of RRT.
SROC curves of FST for prediction of RRT.
Forest plot of FST diagnostic accuracy for mortality prediction.
Diagnostic odd ratio of FST for prediction of mortality.
SROC curves of FST for prediction of mortality.
Forest plot of FST diagnostic accuracy for AKI stage progression (exclusion of RRT).
Diagnostic odd ratio of FST for prediction of AKI stage progression (exclusion of RRT).
SROC curves of FST for prediction of AKI stage progression (exclusion of RRT).
Registered PROSPERO review protocol.
GRADE Evidence and Summary of Findings Table.
About this article
Cite this article
Chen, J., Chang, C., Huang, Y. et al. Furosemide stress test as a predictive marker of acute kidney injury progression or renal replacement therapy: a systemic review and meta-analysis. Crit Care 24, 202 (2020). https://doi.org/10.1186/s13054-020-02912-8
- Furosemide stress test
- Acute kidney injury
- Severity prediction