BIS 1 indicates Berlin Initiative Study 1 equation; CKD-EPI, chronic kidney disease epidemiology collaboration equation; FAS, full age spectrum equation; GFR, glomerular filtration rate; and LMR, Lund-Malmö revised equation.
The solid line represents the regression line. The dotted lines represent the 95% limits of prediction. The concordance correlation coefficient (CCC) for panel A is 0.832 (95% CI, 0.827-0.838); B, 0.838 (95% CI, 0.832-0.843); C, 0.817 (95% CI, 0.812-0.823); D, 0.826 (95% CI, 0.820-0.832).
eTable 1. Median Bias and P30 Comparisons Between GFR-Estimating Equations
eTable 2. Bias, Precision, and Accuracy of the Four GFR-Estimating Equations According to Measured GFR
eTable 3. Performance Criteria of the Four GFR-Estimating Equations in 466 Obese Patients (BMI≥30 kg/m2)
eTable 4. Performance Criteria of the Four GFR-Estimating Equations in 311 Kidney Transplanted Patients
eTable 5. Performance Criteria of the Four GFR-Estimating Equations According to Categories of Albuminuria
eTable 6. Performance Criteria of the Four GFR-Estimating Equations According to Renal Function
eFigure. ROC Curve Analysis of Diagnostic Accuracy of Calculated Clearance From the CKD-EPI, LMR, FAS and BIS 1 Equations
Customize your JAMA Network experience by selecting one or more topics from the list below.
Identify all potential conflicts of interest that might be relevant to your comment.
Conflicts of interest comprise financial interests, activities, and relationships within the past 3 years including but not limited to employment, affiliation, grants or funding, consultancies, honoraria or payment, speaker's bureaus, stock ownership or options, expert testimony, royalties, donation of medical equipment, or patents planned, pending, or issued.
Err on the side of full disclosure.
If you have no conflicts of interest, check "No potential conflicts of interest" in the box below. The information will be posted with your response.
Not all submitted comments are published. Please see our commenting policy for details.
da Silva Selistre L, Rech DL, de Souza V, Iwaz J, Lemoine S, Dubourg L. Diagnostic Performance of Creatinine-Based Equations for Estimating Glomerular Filtration Rate in Adults 65 Years and Older. JAMA Intern Med. 2019;179(6):796–804. doi:10.1001/jamainternmed.2019.0223
Are there differences in performance between equations that estimate glomerular filtration rate in adults 65 years and older?
This single-center, cross-sectional study included 2247 French adults aged 65 to 90 years and older at a single referral center. When comparing 4 plasma creatinine–based glomerular filtration rate–estimating equations (Chronic Kidney Disease–Epidemiology Collaboration, Lund-Malmö Revised, full age spectrum, and Berlin Initiative Study) with the reference inulin-measuring method, there were no clinically significant differences in terms of bias, precision, or accuracy.
There do not appear to be performance advantages for the use of any of these equations in persons 65 years and older.
Estimating glomerular filtration rate (GFR) is useful in many clinical conditions. However, very few studies have evaluated the performance of GFR-estimating equations in older adults at various degrees of kidney impairment.
To determine the performance of plasma-creatinine-based equations Chronic Kidney Disease–Epidemiology Collaboration (CKD-EPI), Lund-Malmö Revised, (LMR), full age spectrum (FAS), and Berlin Initiative Study (BIS) 1 in older adults across a broad spectrum of GFRs.
Design, Setting, and Participants
Single-center cross-sectional study performed in France including 2247 participants aged 65 to 90 years who underwent inulin GFR measurements from July 1, 2003, to July 30, 2017, for suspected or established renal dysfunction, for renal risk, before kidney donation, or after kidney transplant.
Main Outcomes and Measures
The main outcome measure was GRF measured by inulin clearance. Equation performance criteria considered bias (difference between estimated and measured GFR), precision (interquartile range of the median difference), and accuracy P30 (percentage of estimated GFRs lying between [measured GFR – 30% of measured GFR] and [measured GFR + 30% of measured GFR]).
The mean (SD) age of the 2247 participants was 71.5 (5) years and 1192 (53.0%) were male. The difference in median (95% CI) bias was significant between CKD-EPI vs LMR (−4.0 [–4.0 to –3.5 mL/min/1.73 m2; P < .001]) and CKD-EPI vs FAS (–2.0 [–3.5 to –2.5] mL/min/1.73 m2, P < .001) but not significant between CKD-EPI vs BIS 1 (0.0 [–1.5 to 0.5], P = .07, Mood test). In patients aged 65 to 74 years with measured GFR<45 mL/min/1.73 m2, the difference in median P30 (95% CI) was not significant between CKD-EPI vs LMR (P = .08) and CKD-EPI vs FAS (P = .48) but significant vs BIS 1 (P = .004, McNemar test). In subjects 75 years and older, with measured GFR less than 45 mL/min/1.73 m2, LMR and BIS 1 were more accurate than CKD-EPI and FAS (P30 = 74.5 [70.0-79.5] and 73.0 [68.0-78.0] vs 69.0 [64.5-74.0] and 69.0 [65.5-72.0]). In all patients, despite small statistical differences, the performance of CKD-EPI equation was not clinically different from that of LMR, FAS, or BIS 1.
Conclusions and Relevance
In a referral group of patients 65 years and older who had GFR estimated using CDK-EPI, LMR, BIS 1, and FAS equations, a comparison with renal inulin clearance found that none of the equations had a superior diagnostic performance. Each had limitations regarding accuracy.
In persons older than 65 years with chronic kidney disease (CKD), accurate estimation of glomerular filtration rate (GFR) is important for correct CKD classification, management, and drug dosage.1-4 Determining the GFR by reference methods (eg, clearance of inulin, iohexol, or 51Cr-EDTA) is not possible in everyday practice; the CKD clinical guidelines from the National Kidney Foundation and Kidney Disease: Improving Global Outcomes1,2 have recommended using GFR-estimating equations as noninvasive alternatives.
In 2013, it was estimated that 47% of US adults 70 years and older met criteria for CKD.4 However, in this age group, an estimated GFR less than 60 mL/min/1.73 m2 without albuminuria was not associated with lower life expectancy.5-7 This has led researchers to propose a lower GFR value (<45 mL/min/1.73 m2) to define CKD in older adults.7 In addition, aging is associated with structural and physiological changes in the kidney and the muscle mass that may affect GFR estimation from plasma creatinine.5-9 Concerns about the accuracy of the Chronic Kidney Disease–Epidemiology Collaboration (CKD-EPI) equation in older adults has led to proposals for new equations10-12 and recent studies have reported that the CKD-EPI equation13 might be less reliable than newer equations in estimating GFR in this population.11,12,14
To clarify this issue, we evaluated the performance of 4 plasma creatinine–based equations: the classic CKD-EPI vs Full Age Spectrum (FAS), Lund-Malmö Revised (LMR), and Berlin Initiative Study (BIS 1). The study compared equation-estimated GFRs with measured GFRs (by inulin clearance) in older adults of different age groups and different CKD levels. We also assess the ability of these GFR estimates to detect CKD in older adults as defined by inulin GFR measurement less than 45 mL/min/1.73 m2.
This retrospective cross-sectional study was planned to include all consecutive patients 65 to 90 years old who underwent a GFR measurement by either of 2 reference methods (urinary inulin or plasma iohexol clearance) between July 1, 2003, and July 30, 2017, at a single university hospital, Hôpital Edouard Herriot in Lyon, France, for suspected or established renal dysfunction, for renal risk, before kidney donation, or after kidney transplant. However, because these reference methods are not strictly comparable,15,16 the study had to exclude iohexol clearance measurements (n = 741) and keep only the first measurement of inulin clearance in 2247 patients (Figure 1).
All procedures were carried out in accordance with the ethical standards of the institutional and/or national research committee and with the 2013 Helsinki Declaration17 and its later amendments or with comparable ethical standards. Appropriate informed consent was obtained from each participant or his or her legal representative. The consent form included information on the procedure itself as well as on the possibility of later use of the data for research purposes.
According to the French law applicable at the time of the study, an observational study that did not change routine treatment of patients did not need to be declared or submitted to a research ethics board (Loi Huriet-Sérusclat 88-1138, December 20, 1988, and its subsequent amendments; text available at http://www.chu-toulouse.fr/IMG/pdf/loihuriet.pdf). Furthermore, according to the French Haute Autorité de Santé,18 no correction factor for race and ethnicity in the CKD-EPI equation should be applied in the European population; therefore, data concerning race and ethnicity were not collected and are not available.
Renal inulin clearance measurement was carried out only in patients who could empty their bladder easily and completely; this criterion excluded those who needed bladder catheterization.
The renal clearance of inulin used a polyfructosan-based method (Inutest, Fresenius Kabi). A standard technique was used by trained staff with a continuous infusion after a 30 mg/kg priming dose of polyfructosan. Water diuresis was induced by a first oral administration of 5 mL/kg of water followed by 3 mL/kg every 30 minutes combined with an intravenous infusion of 0.9% sodium chloride. Three to 4 urine samples were collected and a blood sample was drawn midway through each collection period. The retained clearance value was the mean of 3 or 4 values obtained by the usual UV/P formula (U, urinary concentration of polyfructosan; V, urine flow rate; and P, plasma concentration of polyfructosan). The measurements of plasma and urine polyfructosan were performed with the same enzymatic method that demonstrated good specificity and reproducibility (within-run precision, <1%; between-run precision, <3.5%).19 The results were expressed per 1.73 m2 body surface area according to the Dubois equation: body surface area = height0.725 × weight0.425 × 0.007184.
All plasma creatinine measurements were performed with methods traceable to the National Institute of Standards and Technology (isotope-dilution mass spectrometry-calibrated).
From June 8, 2003, to June 8, 2010, plasma creatinine concentration was measured using a kinetic colorimetric compensated Jaffé technique (Roche Modular) whose results were standardized against the concentrations obtained by liquid chromatography mass spectrometry by linear regression adjustment. The calibration equation was as follows: standardized plasma creatinine = 0.9395 × Jaffé compensated serum creatinine in μmol/L + 4.6964. The coefficient of correlation was 0.97.
From June 9, 2010, to July 30, 2017, all plasma creatinine values were obtained by an enzymatic technique. According to the Kidney Disease: Improving Global Outcomes guidelines, the 2 techniques are relatively similar.2 Plasma creatinine was expressed in μmol/L.
The equations used in the study population (with no correction for race and ethnicity) were the following (PCr indicates plasma creatinine):
Female, PCr ≤61.88: Estimated GFR = 144 × [PCr/61.88]–0.329 × [0.993]Age
Female, PCr >61.88: Estimated GFR = 144 × [PCr/61.88]–1.209 × [0.993]Age
Male, PCr ≤79.56: Estimated GFR = 141 × [PCr/79.56]–0.411 × [0.993]Age
Male, PCr >79.56: Estimated GFR = 141 × [PCr/79.56]–1.209 × [0.993]Age
Estimated GFR = e X–0.0158 × Age +0.438 × ln(Age)
Female, PCr <150: X = 2.50 + 0.0121 × (150 – PCr)
Female, PCr ≥150: X = 2.50 – 0.926 × ln(PCr/150)
Male, PCr <150: X = 2.56 + 0.00968 × (180 – PCr)
Male, PCr ≥180: X = 2.56 – 0.926 × ln(PCr/180)
Age ≥40 years: Estimated GFR = (107.3 × Q/PCr) × 0.988(Age-40) with Q = 80 μmol/L in men and 62 μmol/L in women
Men: Estimated GFR = 3736 × PCr –0.87 × Age–0.95
Women: Estimated GFR = 3736 X PCr -0.87 × Age–0.95 × 0.82
The study considered 3 criteria for performance: bias, precision, and accuracy. Bias was defined as the median difference between measured GFR and estimated GFR. Thus, a negative bias indicates that an equation overestimates the GFR and vice versa. Precision was defined as the interquartile range (IQR) of the differences between measured GFR and estimated GFR.
Accuracy was considered under 2 criteria: the root mean square error (RMSE), the square root of (log of measured GFR – log of estimated GFR)2; and P30, percentage of estimates within 30% of the measured value.20 A P30 greater than 90% qualifies an equation as satisfactory for clinical interpretation.1,2,15,21
The concordance correlation coefficient (CCC) was used to assess the strength of theoretical agreement between each estimated GFR and measured GFR (after logarithmic transformation of their values). The CCC ranges between −1 and 1; 1 denotes perfect agreement, greater than 0.990, almost perfect agreement; 0.950 to 0.990, substantial agreement; 0.900 to 0.949, moderate agreement; and less than 0.900, poor agreement.22
The 95% CIs around bias, precision, RMSE, and P30 values were calculated using a bootstrap method (2000 bootstraps).23 To assess and compare the 4 equations, the analysis was carried out in 2 separate age groups (65-74 years and ≥75 years)24 and 2 measured GFR categories (<45 and ≥45 mL/min/1.73 m2).7 Subanalyses were carried out on 2 other measured GFR categories (<60 and ≥60 mL/min/1.73 m2), obese patients, kidney transplant recipients, and patients with various categories of albuminuria.
The area under the receiver operating characteristic (ROC) curves (AUC) was used to determine the ability of the GFR-estimating equations to discriminate between elderly patients with and without CKD (defined as measured GFR <45 mL/min/1.73 m2).7 Median biases were compared using the Mood median test.25 P30 values were compared using Cochran Q with pairwise McNemar test and Holm-Bonferroni correction.26 Areas under the curve were compared using the Delong Clarke-Pearson method.27 Results from kidney transplant recipients were compared with those of nonrecipients using the Wilcoxon signed-rank test. Whenever necessary, the Holm-Bonferroni method was used to correct for multiple comparisons.
The sample size and the measurement precision in this study were high; thus, small changes in any variable could lead to small P values. In such conditions, the American Statistical Association recommends the use of P < .005.28 Differences between measured GFR and estimated GFR were considered clinically meaningful when 2 conditions were fulfilled: the RMSEs differed by more than 2% and the biases differed by more than 5 mL/min/1.73 m2.29
The database used for this study had no missing data. Statistical analyses were performed using R for Windows, version 3.4.4 (R-Cran project, http://cran.r-project.org/).
From an initial population of 3539 participants, 2247 participants met the study criteria (Figure 1). The sociodemographic and clinical characteristics of these participants are shown in Table 1. At inclusion the mean (SD) age of the participants was 71.5 (5.0) years. Among these participants, 1192 (53.0%) were male and 311 (14.0%) were kidney transplant recipients. The mean (SD) of measured GFR was 44.5 (21) mL/min/1.73 m2. Within the measured GFR range of 5 to 147 mL/min/1.73 m2, 43.5% of measurements had values less than 45 mL/min/1.73 m2. There was no significant difference in the mean measured GFR between kidney transplant recipients and the other participants.
In the whole cohort of participants, none of the 3 other equations demonstrated a clinically better performance than CKD-EPI. Nevertheless, several differences, although not clinically significant, were statistically significant.
Regarding bias, the median bias between CKD-EPI and each of LMR and FAS was significant (less bias in the latter) (–4.0 [–4.0 to –3.5] and –2.0 [–3.5 to –2.5] mL/min/1.73 m2;P < .001); the bias vs BIS 1 was not significant (0.0 [–1.5 to 0.5] mL/min/1.73 m2; P = .07) (Table 2).
Regarding accuracy, LMR equation was the most accurate; it had the lowest RMSE (95% CI) 0.185 (0.178-0.190) (Table 3).
In patients aged 65 to 74 years with measured GFR less than 45 mL/min/1.73 m2, the median (95% CI) differences in P30 between CKD-EPI and each of LMR and FAS were not significant (P = .08 and P = .48, respectively) but the median difference in P30 with BIS 1 was significant (P = .004) (eTable 1 in the Supplement).
There were some significant differences regarding accuracy between patients with measured GFR less than 45 mL/min/1.73 m2. In patients aged 65 to 74 years, precision criterion P30 was the highest with LMR equation (72.0 [69.0-76.0]) and the lowest with BIS 1 equation (59.0 [55.0-63.0]) (eTable 2 in the Supplement). In patients 75 years and older, LMR and BIS 1 were more accurate than CKD-EPI and FAS (74.5 [70.0-79.5] and 73.0 [68.0-78.0] vs 69.0 [64.5-74.0] and 69.0 [65.5-72.0]) (eTable 2 in the Supplement).
In the whole population and in all subgroups, no CCC between measured GFR and estimated GFR by any equation was greater than 0.900. BIS 1 had the lowest CCC in the whole cohort and in participants aged 65 to 75 years (Figure 2).
There were no significant differences in the abilities (AUCs) of the equations to detect a measured GFR less than 45 mL/min/1.73 m2; the AUCs (95% CI) were 0.921 (0.916-0.926) for CKD-EPI, 0.922 (0.917-0.927) for LMR, 0.919 (0.915-0.924) for FAS, and 0.918 (0.914-0.924) for BIS 1. There were also no significant differences in equation abilities to detect measured GFR less than 60 mL/min/1.73 m2 (eFigure in the Supplement).
Overall, the 4 equations did not show clinically significant differences between the various subgroups: obese vs nonobese (eTable 3 in the Supplement), transplant recipients vs non recipients (eTable 4 in the Supplement), albumin values in the normal range (<30 mg/g) vs other categories of albuminuria (eTable 5 in the Supplement), and patients with less than 60 vs patients with greater than 60 mL/min/1.73 m2 (eTable 6 in the Supplement).
We found that the performance of the CKD-EPI equation in diagnosing CKD in persons older than 65 years was not significantly better or worse than the newer equations. None of the equations estimated a GFR that was 70% to 130% of the measured GFR more than 80% of the time.
This study demonstrated close performances of the 4 GFR-estimating equations between measured GFR groups in terms of bias, IQR, and RMSE. However, the decrease in P30 in the low-GFR group should be interpreted with caution, because small absolute errors may still indicate equation disagreement in those with lower GFRs.
As previously demonstrated, a GFR-estimating equation performs best in populations that resemble the population within which it was developed.15 For instance, the CKD-EPI equation is recommended for estimating GFR in adults of any age in North America, Europe, and Australia. It was developed in a North American and European population with a wide age range (mean [SD], 50  years) and a mean measured GFR of 68.0 mL/min/1.73 m2; in addition, CKD-EPI equation takes into account age, sex, race, and ethnicity.13 Although the proportion of patients 65 years or older within the CKD-EPI development and internal validation data sets was 13.0%, the present study found adequate performance of CKD-EPI in older adults at different levels of measured GFR.4
The LMR equation was developed in a large cohort of patients referred for GFR measurement in a European population, predominantly (67%) from a single country (Sweden). In this population, 28% were 70 years or older10 and had a mean [SD] measured GFR of 55 (9-21) mL/min/1.73 m2. In similar populations, this equation performed better than CKD-EPI, especially in people 70 years and older, but this did not receive an external validation.14,30-32 In the present study, however, the performance of LMR equation was only slightly more accurate than the CKD-EPI equation (P30) in the subgroup of patients with measured GFR less than 45 mL/min/1.73 m2, which is close to the measured GFR in the development population.
The FAS equation was designed to estimate GFR across a broad range of ages, children to older adults.12 It was developed in a predominantly European population in a multicenter study that included 1764 patients 70 years and older (mean [SD] age, 77 [5.4] years) with mean [SD] measured GFR of 55.7 [20.6] mL/min/1.73 m2. In the present study, the FAS equation did not perform better than the other 3 equations.
The rare equations developed for GFR estimation in old subjects have reasonable performance, usually with poor accuracy in subjects with measured GFR 60 mL/min/1.73 m2 or greater.10-12,32-37 This is the case of BIS 111 that was developed in subjects 70 years and older (mean [range] age, 78.5 [9-121] years) and included 29.4% of healthy older adults with mean [range] measured GFR of 60.3 mL/min/1.73 m2. In comparison with plasma iohexol clearance, the performance of BIS 1 was excellent: absolute bias close to zero (0.11), good precision (IQR, 11.1) and accuracy (P30: 95.1%). The BIS 1 equation appeared then to be a good tool for estimating GFR in older adults.11 However, other studies that attempted to validate this equation reported conflicting results.10,12,14,31,32,35-39 In the present study, BIS 1 had a poor performance in persons aged 65 to 74 years with measured GFR less than 45 mL/min/1.73 m2. This is in agreement with other results37,40 and is probably owing to differences between study and development cohorts.
In older adults, late referrals for CKD management may result in suboptimal outcomes, including increased mortality, higher rates of hospitalization, increased referrals for kidney transplant, and higher rates of catheter use for dialysis. Although plasma creatinine is the most used marker to estimate GFR, the method has many limitations; it is influenced by muscle mass and diet, especially in older adults, women, and children. Cystatin C, a low-molecular-weight protein, does not present such limitations and could be a better marker; this led to equations based on cystatin C or cystatin C plus plasma creatinine. Up to now, several studies have reported that cystatin C was more accurate than creatinine in detecting CKD in older adults at both cutoffs of 45 and 60 mL/min/1.73 m2.8,14,36 A recent study reported that the addition of cystatin C improved all creatinine-based equations.32 Thus, the Kidney Disease: Improving Global Outcomes recommended the recourse to GFR estimation by cystatin C whenever the estimated GFR by creatinine is below 60 mL/min/1.73 m2 and when a confirmation of CKD is required.2 However, cystatin C is not always available, as in this study.
The strengths of the present study are that it used a unique data set regarding size, age range, measured GFR range, use of plasma creatinine assays calibrated on standardized values, use of a reference method for GFR measurement (ie, inulin clearance),16 and rigorous statistical techniques.
This study had several limitations. First, it was a single-center study that included potential kidney donors and patients referred for suspected or confirmed renal disease. Second, the data set is devoid of information about race and ethnicity and therefore did not allow us to assess the effect of this characteristic, which is common to other European studies (LMR, FAS, and BIS 1). However, recent studies have reported that GFR is independent of race and ethnicity.41 Third, the performance of the equations in persons with measured GFR less than 30 mL/min/1.73 m2 could not be examined because of the small number of participants with severe CKD. Fourth, the use of plasma creatinine alone as endogenous marker (without cystatin C) has some well-known limitations, especially in older participants with sarcopenia.9,42,43
This study was conducted in a single institution on a referral population, which might have introduced a selection bias, both in terms of demographic and clinical characteristics. Thus, the overall result of “no differences between the four equations” should be investigated in populations with more complex mixes of races and ethnicities and in populations with lower suspicion of moderate to severe CKD.
Among a referral group of patients 65 years and older who had GFR estimated by CDK-EPI, LMR, BIS 1, and FAS, a comparison with renal inulin clearance found that none of the equations had a superior diagnostic performance. Each equation had limitations regarding accuracy. Thus, any of the 4 equations may be used to estimate GRF in adults 65 years and older, depending on local clinical, technical, or practical criteria.
Accepted for Publication: January 20, 2019.
Corresponding Author: Luciano da Silva Selistre, MD, PhD, Universidade de Caxias do Sul, Rua Francisco Getúlio Vargas, 1130, 95070-560 Caxias do Sul, Brazil (email@example.com).
Published Online: April 29, 2019. doi:10.1001/jamainternmed.2019.0223
Author Contributions: Drs da Silva Selistre and Dubourg had full access to all of the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis.
Concept and design: da Silva Selistre, Rech, Lemoine, Dubourg.
Acquisition, analysis, or interpretation of data: da Silva Selistre, De Souza, Iwaz, Lemoine, Dubourg.
Drafting of the manuscript: da Silva Selistre, Rech, De Souza, Iwaz, Dubourg.
Critical revision of the manuscript for important intellectual content: da Silva Selistre, Rech, Lemoine, Dubourg.
Statistical analysis: da Silva Selistre, De Souza.
Obtained funding: da Silva Selistre.
Administrative, technical, or material support: da Silva Selistre, Iwaz, Lemoine, Dubourg.
Supervision: da Silva Selistre, Rech, De Souza, Dubourg.
Conflict of Interest Disclosures: None reported.
Funding/Support: The study had no specific public or private financial support. During the period of analysis and publication, Dr Selistre benefited from a grant from the Brazilian government (CAPES Foundation, Ministry of Education of Brazil, grant number: 88881.156638/2017-01).
Role of the Funder/Sponsor: The CAPES Foundation had no roles in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.
Additional Contributions: The authors thank Philip Robinson, PhD, of Hospices Civils de Lyon, for his helpful revisions of this manuscript as part of his regular responsibilities.