Fisher linear discriminant function models. The 20 vertical lines represent the 20 best models of 3 retinal nerve fiber layer (RNFL) measures each. The squares indicate which parameter was used in each model. The best model has the highest between variance–within variance ratio and is the model at the far right. The parameters are ordered top to bottom by the frequency with which they were included in a model. The GDx number is determined using GDx software (version 1.0.02; Laser Diagnostic Technologies, San Diego, Calif).
Receiver operator characteristic (ROC) curves based on Fisher linear discriminant function (LDF) for the best subset of 3 variables, Fisher LDF for the 10th best subset of 3 variables, and the GDx number (GDx software, version 1.0.02; Laser Diagnostic Technologies, San Diego, Calif). Numbers in parentheses represent area under the ROC curves.
Weinreb RN, Zangwill L, Berry CC, Bathija R, Sample PA. Detection of Glaucoma With Scanning Laser Polarimetry. Arch Ophthalmol. 1998;116(12):1583-1589. doi:10.1001/archopht.116.12.1583
To determine which retinal nerve fiber layer (RNFL) measures obtained with scanning laser polarimetry are most useful in detecting early to moderate glaucomatous visual field loss.
Subjects and Methods
One eye from 84 healthy individuals and 83 patients with early to moderate glaucomatous visual field loss (167 eyes) was assessed with a scanning laser polarimeter (Laser Diagnostic Technologies, San Diego, Calif). Three separate scans were obtained, and a baseline scan was created and used in the analyses. Integrated software (program GDx, version 1.0.02; Laser Diagnostic Technologies) was evaluated by assessing its sensitivity and specificity for detecting early and moderate glaucomatous visual field loss. Fisher linear discriminant functions also were developed in this population to assess sensitivity and specificity and were compared with the GDx analyses.
There were statistically significant differences between the healthy and glaucomatous eyes for 14 of the 15 RNFL measures (P=.001). However, considerable overlap in measurements between groups was found. With the GDx number, the area under the receiver operator characteristic (ROC) curve was 0.78, and the sensitivity and specificity were 82% and 62%, respectively. Applying the best discriminant function using 3 variables (average thickness, ellipse modulation, and average ellipse thickness) to our study population resulted in an area under the ROC curve of 0.89 and a sensitivity and specificity of 74% and 92%, respectively.
A combination of RNFL measures obtained using the scanning laser polarimeter improved the ability to differentiate between healthy eyes and eyes with early and moderate glaucomatous visual field loss. Analyses using GDx software did not differentiate between healthy and glaucomatous eyes as well as the discriminant analysis function did.
SCANNING LASER polarimetry is a diagnostic method that has been developed to measure quantitatively the thickness of the peripapillary retinal nerve fiber layer (RNFL).1- 3 It is based on the assumption that the parallel arrangement of microtubules within the RNFL causes a change in the state of polarization of an illuminating laser beam, also know as retardation, and that this can be measured in the light reflected from the retina.1 Since retardation is linearly related to the thickness of the RNFL in hemisected, fixated monkey eyes in which the cornea and lens have been removed, it has been assumed that it might also be related to RNFL thickness in intact human eyes.2,3 This technique is independent of a reference plane and magnification effects of the eye and is attractive because it provides rapid, objective data.3
Clinical studies have demonstrated this method to be a promising one for assessing RNFL thickness in clinical practice.3- 12 Weinreb et al3 studied a group of 105 healthy subjects and 64 age-matched healthy patients and patients with glaucoma. They concluded that scanning laser polarimetry provides quantitative measurements that correspond to known properties of the RNFL in healthy and glaucomatous eyes. Moreover, mean retardation was significantly higher among healthy eyes than glaucomatous eyes in inferior and superior regions; however, there was considerable overlap between the healthy and glaucomatous eyes. Other studies have demonstrated that retardation decreases with age,4,5,11 is moderately correlated with glaucomatous visual field loss,4,10 and is less in black than in white subjects.11,13
Recently, Tjon-Fo-Sang and Lemij10 reported the sensitivity of scanning laser polarimetry for detecting glaucoma in a group of patients with a wide range of visual field mean deviation (MD) (−31.5 to 0.76 dB). They calculated mean retardation in 6 selected areas of 256 pixels from each of 4 quadrants (superior, inferior, temporal, and nasal) and calculated ratios relative to the nasal segment for the superior and inferior quadrants. In their study, they found that scanning laser polarimetry had a sensitivity of 96% and a specificity of 93%, and concluded that the method was promising for screening populations for glaucoma.
Based on their encouraging report, a method of examining subsets of the 65,536 pixels (256×256 picture elements) was implemented into the software of the scanning laser polarimeter. We assessed the sensitivity and specificity of this software in another group of patients with glaucoma with only early and moderate visual field loss.
One eye from 84 healthy subjects and 83 patients with early to moderate glaucomatous visual field loss (167 eyes) was included. Forty-nine patients with glaucoma had early visual field loss, and 34 had moderate visual field loss. Intraocular pressure (IOP) or the appearance of the optic disc was not used to determine whether glaucoma was present. The ages (mean±SD) of healthy subjects and patients with early and moderate glaucomatous field loss were 55.4±11.6, 65.7±11.9, and 70.1±10.8 years, respectively. Informed consent was obtained from all participants, and the study was approved by the Human Subjects Committee of the University of California, San Diego.
All subjects underwent an ophthalmologic examination, automated visual field testing, stereoscopic photography, and scanning laser polarimetry of the peripapillary RNFL. The ophthalmologic examination included slitlamp biomicroscopy, applanation tonometry, gonioscopy, and dilated indirect ophthalmoscopy. The optic nerve head was assessed through a dilated pupil using biomicroscopy with a 78-diopter (D) lens. For inclusion in the study, scanning laser polarimetry and visual field testing were completed within 6 months of each other; mean±SD time between testing was 0.7±1.4 months.
Visual field testing was performed using Humphrey Field Analyzer program 24-2 or 30-2 (Humphrey Systems, Dublin, Calif). Only the 24-2 points were used in the analysis to avoid rim artifacts. A pupil diameter of at least 3.0 mm was required for visual field testing, and only visual fields with reliability indices (fixation losses, false-positive rate, and false-negative rate) of no more than 25% were used. One randomly chosen eye per subject was included in the study.
Healthy subjects had no history of ophthalmologic or neurologic disease or surgery, no history of diabetes or other systemic diseases, and no use of medications known to affect visual field sensitivity. A family history of glaucoma was not an exclusion criterion. These subjects had normal results of an eye examination and a normal visual field with automated threshold perimetry. They had a best corrected visual acuity of 20/30 or better; spherical refraction within ±5.0 D; cylinder correction within ±2.5 D; IOP of less than 22 mm Hg; and no history of elevation, open angles, or symmetrical optic discs (asymmetry of vertical cup-disc ratio, <0.2) with no excavation, notches, or hemorrhages based on results of clinical examination and assessment of simultaneous stereoscopic photographs. With visual field testing, the glaucoma hemifield test (GHT), MD, and corrected pattern standard deviation (CPSD) were within normal limits. Healthy subjects were recruited from volunteers in the community, hospital staff, and spouses or friends of patients.
Visual fields of 186 patients with glaucoma enrolled in a longitudinal study evaluating quantitative techniques for diagnosing and monitoring glaucoma were reviewed. Of these, 92 had repeatable early or moderate visual field loss and met each of the following criteria: GHT results outside normal limits or a CPSD outside 95% normal limits; and repeatable cluster of 3 or more points depressed at the 5% level on the pattern deviation plot on at least 2 visual fields (2 with achromatic automated perimetry or 1 each with achromatic and short-wavelength perimetry). Short-wavelength fields were analyzed relative to our normative database of 99 eyes using a statistical analysis package for GHT and pattern deviation. Early visual field loss was defined by an MD of no worse than −6 dB (with at least 50% of test locations within normal limits) and a CPSD no worse than the 1% probability level. Moderate visual field loss was defined by an MD of no worse than −15 dB and a CPSD worse than the 1% probability level.
The scanning laser polarimeter (Nerve Fiber Analyzer [NFA]–GDx or NFA II; Laser Diagnostic Technologies, San Diego, Calif) is a confocal scanning laser ophthalmoscope with an integrated polarization modulator. Details of its operation have been reported previously.3,11 In brief, the light source, a polarization-modulated laser beam (wavelength, 780 nm), is directed to 1 point of the retina by the optical media of the eye, and the reflected light that double-passes the RNFL is detected to obtain the retardation at that point. The laser beam is directed sequentially over each of 256×256 retinal locations to obtain a retardation map in which each retinal location (pixel) has a corresponding retardation value. We used a field of view of approximately 15°. The acquisition time for these 65,536 data sets was 0.7 seconds. Other than the availability of a screening mode (which we did not use) and a normative database with the NFA-GDx, there are no differences between it and the NFA II.
Three retardation maps of the peripapillary retina of each eye were acquired, and a baseline map was automatically created and used for analysis by averaging the 3 retardation values corresponding to each pixel. The disc margin was established by an experienced operator who used a circle or ellipse to outline the inner margin of the peripapillary scleral ring.
Of the 92 eligible glaucomatous eyes, 9 were excluded due to spuriously high reflectivity from zones of peripapillary atrophy (n=6) or an insufficient number of available pixels for analysis (n=3). None of the eligible healthy eyes were excluded. A total of 83 glaucomatous and 84 healthy eyes were included in the study. Measurement of 67 glaucomatous and 29 healthy eyes were obtained using the NFA II. The remaining 16 glaucomatous and 55 healthy eyes were acquired and analyzed using the NFA-GDx. All measurements were analyzed using NFA-GDx software. There were no significant differences between the mean values obtained with the NFA II and the NFA-GDx for each of the 15 RNFL measures in healthy eyes (for GDx number, P=.13; superior ratio, P=.83; inferior ratio, P=.80; superior/nasal ratio, P=.52; maximum modulation, P=.73; superior maximum, P=.96; inferior maximum, P=.49; average thickness, P=.37; volume, P=.43; ellipse modulation, P=.16; ellipse average, P=.85; superior average, P=.55; inferior average, P=.91; superior integral, P=.05).
Program GDx analyzes subsets of the 65,536 pixels in each of 4 quadrants (superior, inferior, temporal, and nasal). The GDx software uses a race- and age-specific normative database of more than 1100 eyes to classify 15 measures of RNFL thickness as within normal limits, borderline, or outside normal limits. According to the manufacturer, healthy patients aged 18 to 80 years were enrolled at 6 sites using a standard protocol for inclusion in the normative database. Healthy eyes could not have a significant visual field defect, unreliable field, pathologic appearance of the optic disc, or asymmetry of the cup-disc ratio of more than 0.2. All patients had best-corrected visual acuity of 20/40 or better and refractive error of less than 5 D of sphere and less than 2 D of cylinder. Patients were excluded if they reported a family history of glaucoma or a history of elevated IOP (>21 mm Hg), retinal disease, retinal surgery, or refractive surgery. With GDx software, outside normal limits is defined as outside 95% confidence limits. Borderline is defined as outside 90% confidence limits. The healthy subjects recruited for this study were not included in the GDx normative database.
Fifteen RNFL measures per patient were automatically calculated using GDx software. For GDx number, a trained neural network assesses all pixels and assigns to an eye a number from 0 to 100 (0 indicates normal; 100, advanced glaucoma). A back-propagation type network with 2 hidden layers is used. The input layer consists of 128 input neurons (number of parameters), and the output layer consists of 1 output neuron (the diagnosis).
A total of 1500 pixels per quadrant peripheral to an ellipse 1.75 disc diameters from the center of the disc was used to calculate ratio and maximum measures. Symmetry was the ratio of the average 1500 thickest pixels in the superior quadrant divided by the average of the 1500 thickest pixels in the inferior quadrant. Superior ratio was the ratio of the average of the 1500 thickest pixels in the superior quadrant divided by the average of the 1500 median pixels in the temporal quadrant. Inferior ratio was the ratio of the average of the 1500 thickest pixels in the inferior quadrant divided by the average of the 1500 median pixels in the temporal quadrant. Superior-nasal ratio was the average of the 1500 thickest pixels in the superior quadrant divided by the average of the 1500 median pixels in the nasal quadrant.
For maximum modulation, first, the average is calculated for the 1500 thickest points in the superior and inferior quadrants. Next, the 1500 median points in the nasal and temporal quadrants are calculated. The lowest of these 4 values is subtracted from the highest, and then divided by the lowest value.
Superior maximum was the average of the 1500 thickest pixels in the superior quadrant; inferior maximum, the average of the 1500 thickest pixels in the inferior quadrant; and average thickness, the average of all pixels outside the disc margin that have a valid reading. Pixel measurements obtained during an eye movement were invalid and excluded from the calculation of average thickness.
Volume was the average thickness multiplied by the number of pixels×0.000295 mm2 (area of a pixel).
Ellipse modulation, ellipse average, superior average, and inferior average were calculated using pixels within the 10-pixel-wide elliptical band that is automatically positioned concentric with the disc margin outline and 1.75 disc diameters from the center of the optic disc. Ellipse modulation was calculated by taking the thickest pixel within the elliptical band, subtracting the thinnest pixel within the band, and dividing the total by the value of the thinnest pixel. Ellipse average was calculated using average thickness of the pixels within the elliptical band surrounding the optic nerve. Superior average was the average thickness of the pixels within the elliptical band in the superior quadrant. Inferior average was the average thickness of the pixels within the elliptical band in the inferior quadrant. Superior integral was the total area under the curve and within the superior portion of the elliptical band surrounding the optic nerve.
Optic disc area was measured using images obtained with a confocal scanning laser ophthalmoscope (Heidelberg Retinal Tomograph, Heidelberg Engineering, Heidelberg, Germany).14,15 Corneal curvature was used to correct for magnification, and the disc margin was outlined on the mean topography image.
Analyses were completed using commercially available software (JMP; SAS Institute, Cary, NC, and Splus, version 3.4, MathSoft Inc, Seattle, Wash). Student t or χ2 test was used to evaluate differences between both study groups. A P value of no more than .05 was considered statistically significant.
Fisher linear discriminant functions (LDFs) were used to develop a classification rule.16 They were calculated to differentiate between groups, using all subsets of the 15 RNFL measures and age. The ratio (between-group variance to within-group variance) of each LDF was examined to determine the relative importance of the different variables. The LDF is a score formed by taking a weighted sum of the predictor variables and is chosen so that the ratio is maximized.16 This ratio, which serves as a criterion for assessing the usefulness of the LDF, was examined to determine the relative importance of the different variables. Since all subsets of predictors were examined, we summarized the results by looking first at the maximum ratio for subsets consisting of 1 variable, then the maximum for subsets of 2 variables, and so on to see whether the ratio would increase markedly if more variables were included.
A receiver operator characteristic (ROC) curve was used to assess the ability to differentiate between healthy and glaucomatous eyes. The ROC curves and the areas beneath them were computed using direct calculation. Differences of these areas were tested using the method of DeLong et al.17 An area under the ROC curve of 1.0 indicated perfect discrimination. An area of 0.5 indicated no discrimination. A k-fold cross-validation on 12 randomly chosen training and test sets was used to estimate the bias in the area under the ROC curves from using the same study population to identify and test the discriminant function.
For the discriminant functions presented, we determined several appropriate cutoff points for sensitivity and specificity estimates using subjective evaluation of the ROC curve and by summarizing the sensitivities and specificities using various cutoffs.
The age, sex, refractive status, and race of the healthy subjects and patients with glaucoma are provided in Table 1. There was no statistically significant difference in sex or race among the study groups, but the healthy subjects were significantly younger than the patients with glaucoma. For this reason, age was included as a candidate variable in the linear discriminant analyses.
There were statistically significant differences (P<.001) between the healthy and glaucomatous groups for all 15 RNFL variables measured except symmetry (P=.08) (Table 2).
Each measure had low sensitivity when compared with the internal normative database (Table 3). We therefore created an all-normal variable that was within normal limits if comparison of each of the 14 measures (excluding the GDx number) with the internal normative database was within normal limits. The GDx number was excluded because there was no normative database. The sensitivity of all-normal was 90% (75/83), but the specificity was 57% (48/84). The proportion of patients correctly identified as having moderate glaucoma using 1 summary measure was at best 41% (superior-nasal ratio). For early glaucoma, the best measure (inferior ratio) correctly identified only 25%.
The estimates of bias under the ROC curve were small; the bias for the best of 1, 2, 3, and 4 predictors was 0.025, 0.041, 0.018, and 0.020, respectively.
The ROC areas were estimated as 0.793, 0.869, 0.905, and 0.909 for 1 through 4 predictor rules, respectively. After correcting for bias, the ROC areas were estimated as 0.768, 0.828, 0.887, and 0.889 for 1 through 4 predictor rules, respectively. Increases beyond size 3 did not increase the area under the ROC curve. We therefore present results of the LDF for 3 variables only. In addition, several LDFs using 3 variables were used to form ROC curves.
The area under the ROC curve for the GDx number was 0.78 (Table 4). With an arbitrarily selected cutoff of greater than 17 as glaucoma, the sensitivity was 82%, whereas the specificity was 62%. If a cutoff of 15 or 23 was used, the sensitivities were 92% and 64%, respectively, whereas the specificities were 56% and 77%, respectively.
With our 84 healthy and 83 glaucomatous eyes, the best LDF discriminant function formula using 3 variables was as follows: LDF=−4.442655−(0.156×Average Thickness)+(0.935×Ellipse Modulation)+ (0.183×Ellipse Average) This discriminant function had a bias-corrected ROC curve area of 0.887. With an arbitrarily selected cutoff of less than −0.289 as normal, the LDF correctly labeled 76 of 84 healthy eyes (specificity, 92%) and 61 of 83 eyes with early to moderate glaucomatous visual field loss (sensitivity, 74%) (Table 4). However, if a value of 0.501 was used to classify healthy, the sensitivity of the formula increased to 88% (73/83), and the specificity to identify healthy eyes decreased to 68% (57/84).
There were several subsets of 3 variables that were competitive with the above formula. The parameters included in the best 20 discriminant functions of 3 variables are presented in Figure 1. Ellipse modulation and ellipse average were included in more subsets than any other variable, at 13 and 12 models, respectively. Age was included in only 6 of the 20 best models.
The LDF identified in the eyes with early to moderate visual field loss was applied to the group of eyes with early loss. Using a cutoff of −0.265, the sensitivity (61%) was lower compared with the combined group with early and moderate glaucomatous visual field loss (74%), whereas the specificity remained the same (93%). In addition, we developed an LDF using the eyes with early visual field loss only. The variables of the best-of-3 model were identical to the model developed using eyes with early and moderate visual field loss.
To further investigate whether the difference in age between the healthy and glaucomatous eyes was influencing our results, we completed the analysis in 2 age groups: no older than 60 and older than 60 years. In the group no older than 60 years consisting of 19 glaucomatous and 56 healthy eyes, the area under the ROC curve for the best-of-3 LDF model was 0.894. In the group older than 60 years consisting of 64 glaucomatous and 28 healthy eyes, the area under the ROC curve for the best-of-3 LDF model was 0.873. The area under the ROC curve for the GDx number for persons no older than 60 years was 0.786, and 0.670 for those older than 60 years.
Optic disc area was not included in any of the best 30 linear discriminant functions of 3 variables. In addition, we stratified our study population based on disc area (<1.5mm2 or ≥1.5 mm2) and found similar results in both groups, with areas under the ROC curve of 0.936 and 0.894, respectively. Therefore, disc area has little or no role in discriminating between healthy subjects and patients with glaucoma in this population.
In our study, there were statistically significant differences in the mean values of RNFL measures between the healthy subjects and patients with glaucoma. As with other glaucoma diagnostic tests used in clinical practice, ie, IOP or ophthalmoscopic assessment of the cup-disc ratio, there was overlap between the healthy and glaucomatous groups in the range of values for all of the measures. This overlap may be related to wide variations in the RNFL among healthy individuals. Such overlap is important, because it limits the usefulness of a test for diagnosing glaucoma without other corroborating findings. Moreover, the use of this instrument, or any other diagnostic instrument, to detect glaucoma depends on the demographic characteristics of the patient population and the severity of the disease.
Although none of the RNFL measures individually identified even half of these patients with early and moderate glaucoma as outside normal limits, our results indicate that the ability to differentiate between healthy and glaucomatous eyes can be improved by using a combination of them in a discriminant analysis function. Using a combination of measures also has been shown to improve the ability to discriminate between healthy and glaucomatous eyes with analysis of stereoscopic photographs18 and topographic optic disc parameters obtained with a confocal scanning laser ophthalmoscope.19,20 Given that the discriminating ability between 2 parameters may be similar, it is probable that 2 or more models will provide similar discriminating power. Our analysis strategy assessed this possibility by evaluating all 560 combinations of 3 variables using the Fisher LDF. We found many competitive subsets of 3 variables that differentiated healthy from glaucomatous eyes. As shown in Figure 2, the area under the ROC curves for the best and 10th-best model of 3 variables were similar (0.91 and 0.87, respectively).
The best such subset consisted of ellipse modulation, average thickness, and ellipse average. Ellipse modulation and ellipse average were included in more of the best models than any other measures. Ellipse modulation is 1 of 2 modulation measures designed to capture the difference between the thickest and thinnest areas of the RNFL. One of these modulation variables was included in 18 of the best 20 models of 3 predictors. These variables may capture features of the RNFL that reflect relatively early loss of retinal nerve fibers. In contrast, the 4 ratio measures (symmetry, superior ratio, inferior ratio, and superior-nasal ratio) were included in only 6 of the best 20 models of 3 predictors.
Given the possibility that RNFL loss is more difficult to detect in relatively small discs, we included disc area as a possible variable to be included in the discriminant analysis. Optic disc area was not included in any of the best 30 LDFs, and stratified analysis into 2 groups based on disc area found similar results. Therefore, disc area had little or no role in discriminating between healthy subjects and patients with glaucoma in this population.
The GDx number is obtained automatically from a trained neural network that assigns to an eye a number from 0 to 100. With a higher number (maximum of 100), the RNFL loss is worse, and the glaucoma is more advanced. With the GDx number, the sensitivity and specificity depend on the value selected to be the cutoff. For a cutoff of 17, differentiation of healthy subjects and patients with glaucoma had a sensitivity and specificity of 82% and 62%, respectively.
The combination of sensitivity and specificity of all analyses in our study are considerably less than those obtained by Tjon-Fo-Sang and Lemij10 in an earlier study. They found a sensitivity and specificity of 96% and 93%, respectively. Although it is important to replicate estimates of sensitivity and specificity of diagnostic techniques in other study populations, it is often the case that initial reports of high sensitivity and specificity, even in well-designed studies, are difficult to replicate in different study populations. Several explanations may account for the differences in our study and that by Tjon-Fo-Sang and Lemij.10
We used NFA II or NFA-GDx, not the original NFA used in the earlier study,10 to reduce interoperator and intraoperator variability. The instruments used in our study contain a polarization detector that allows detection of polarized and unpolarized light. By electronically adding the polarized and unpolarized light components, it was able to recognize the intensity setting chosen by the operator. This feature was included to reduce interoperator variability and to minimize the dependency on the intensity setting of the original instrument.12 With the modified instrument, the dynamic range of the polarization detector was increased by approximately 30%. This allowed a wider range of retardation value to be delineated. These values are displayed in a retardation map, and a fundus image also is displayed, allowing for better visibility of the optic disc.12
The discrepancies between these studies also may be related to differences in data analysis. All images in our study were analyzed using GDx software for analysis, and this was not available when the earlier study was conducted. The GDx software includes a relatively large age- and race-specific normative database to automatically classify measurements as outside normal limits. It also includes the GDx number, which was developed based on the results from a back propagation neural network that included healthy subjects and patients with glaucoma. Moreover, we assessed the mean retardation data from 3 measurements in each eye rather than just 1 measurement, as in the earlier study.
Perhaps the most important reason for the discrepancies in the results of these studies relates to the severity of glaucoma included in each of them. In the earlier study, patients were included in the glaucoma group if they had results of a GHT that were outside normal limits. The MD of the included patients was −10.33 dB (range, −31.5 to 0.76 dB). Therefore, they included patients with early defects and patients with severe visual field loss. In contrast, our study was designed to include patients only with early or moderate visual field loss. Each of them had repeatable early or moderate visual field loss and results of a GHT outside normal limits or a CPSD outside 95% normal limits, and a repeatable cluster of 3 or more points depressed at the 5% level on the pattern deviation plot on at least 2 visual fields. Patients were excluded if MD was worse than −15 dB.
In both studies, the healthy subjects were not a part of the normative data used to determine outside normal limits. As we compared only patients with early or moderate glaucoma with healthy subjects, and Tjon-Fo-Sang and Lemij10 compared patients with advanced glaucoma as well, it is not surprising that they found higher sensitivity and specificity.
One limitation of our study was that we did not test our discriminant function on another cohort of subjects to independently estimate its sensitivity and specificity. However, we provided estimates of the bias in the calculation of the area under the ROC curve by using a k-fold cross-validation on 12 randomly chosen training and test sets. The bias was relatively small, resulting in a reduction of the area under the ROC curve of approximately 1.8% in the best-of-3 LDF model. Roughly the same amount of bias is inherent in the sensitivity and specificity estimates as in the area under the ROC curve. Therefore, it is our best estimate that the sensitivity and specificity values presented using cutoffs of −0.289 and 0.501 are overstated by approximately 1.8%.
In conclusion, we showed that using a combination of RNFL measures obtained using the scanning laser polarimeter improved the ability to differentiate between healthy eyes and eyes with early and moderate glaucomatous visual field loss. The GDx number and normative database provided with the software did not differentiate between healthy and glaucomatous eyes as well as the LDF. Further study is needed to determine whether the automated analysis features of the normative data and neural network can improve the clinical usefulness of the instrument for detection of early and moderate glaucoma.
Accepted for publication August 5, 1998.
Supported by grants EY11008 (Dr Zangwill), EY08208 (Dr Sample), and EY11158 (Dr Weinreb) from the National Eye Institute, Bethesda, Md, and Joseph Drown Foundation, Los Angeles, Calif (Dr Weinreb).
Corresponding author: Robert N. Weinreb, MD, Glaucoma Center, University of California, San Diego, 9500 Gilman Dr, La Jolla, CA 92093-0946.