Receiver operating characteristic curve for nonfollicular neoplasms.
Receiver operating characteristic curve for follicular neoplasms.
A, Irregular and ill-defined solid nodule with hypoisoechogenicity (arrows). The probability ratio is 0.97. B, Regular and well-defined solid nodule with hypoechogenicity (arrows). The probability ratio is 0.83. C, Regular and well-defined solid nodule with hypoisoechogenicity and fine calcification (arrows). The probability ratio is 0.85.
Koike E, Noguchi S, Yamashita H, Murakami T, Ohshima A, Kawamoto H, Yamashita H. Ultrasonographic Characteristics of Thyroid NodulesPrediction of Malignancy. Arch Surg. 2001;136(3):334-337. doi:10.1001/archsurg.136.3.334
Copyright 2001 American Medical Association. All Rights Reserved. Applicable FARS/DFARS Restrictions Apply to Government Use.2001
High-resolution real-time ultrasonography (US) can detect characteristics of thyroid nodules, but the US differentiation between malignant nodules and benign nodules is not well described.
Ultrasonography is useful for predicting malignancy of thyroid nodules.
A retrospective study of 329 thyroid nodules (≥5 mm) in 309 patients comparing US characteristics and pathological results.
A center for the treatment of thyroid diseases where about 1400 thyroid operations are performed per year.
Between January 1 and June 30, 1999, 309 patients were examined by US before thyroidectomy.
Main Outcome Measure
The US characteristics to predict malignancy for both follicular and nonfollicular neoplasms by means of multiple logistic regression analysis.
The sensitivity of preoperative US diagnosis was 86.5% for nonfollicular neoplasms and 18.2% for follicular neoplasms. The specificity was 92.3% and 88.7%, respectively. According to multiple logistic regression analysis, margin, shape, echo structure, echogenicity, and calcification were reliable indication of malignancy in nonfollicular neoplasms. According to a receiver operating characteristic curve constructed from this multiple logistic regression analysis, the best point not to overlook malignancy is the point at which sensitivity is 94% and specificity is 87%. The probability of malignancy at this point is greater than 0.2. For follicular neoplasms, ultrasonographic diagnosis was unreliable, even when multiple logistic regression analysis was applied.
We can predict malignancy of nonfollicular neoplasms of the thyroid by using multiple logistic regression analysis based on only 5 features: margin, shape, echo structure, echogenicity, and calcification.
ULTRASONOGRAPHY (US) is useful for evaluation of thyroid nodules because of its safety, noninvasiveness, nonradioactivity, and effectiveness. Presently, high-resolution real-time US not only detects the presence, site, number, and size of thyroid nodules but also clearly shows the characteristics of thyroid nodules. However, the US differentiation between malignant nodules and benign nodules is not well described.1- 12 Some authors have reported US diagnosis of thyroid nodules of all types with relatively high sensitivity ranging from 74% to 81%,3,6,7,9 but no formulas based on analysis of US characteristics are available to predict malignancy. Any useful formula should be simple and accurate for clinical use. We therefore evaluated US characteristics of thyroid nodules prospectively and analyzed the probability of malignancy on the basis of multiple logistic regression analysis.
This study included 309 consecutive patients with 329 thyroid nodules larger than 5 mm in diameter. All patients underwent US examination before thyroidectomy between January 1 and June 30, 1999, at Noguchi Thyroid Clinic and Hospital Foundation, Oita, Japan. The patients comprised 279 females and 30 males with a mean age (±SD) of 52.8 ± 13.8 years (range, 15-82 years). Solitary nodules were found in 254 patients (82.2%), and multiple nodules were found in 55 (17.8%). For multiple nodular goiters, US characteristics were evaluated on the dominant nodule and/or suspected malignant nodule. Surgery was performed for a preoperative diagnosis of malignant nodule in 152 patients (49.2%), for follicular neoplasms in 52 (16.8%), and for cosmetic reasons and/or tracheal compression by nodular goiter in 77 (24.9%).
The US examination was performed with a US scanner (Logiq TM500MD; GE Yokogawa Medical Systems, Tokyo, Japan) equipped with a 6- to 13-MHz linear probe (LA 39; GE Yokogawa Medical Systems). The spatial resolution for this system is about 2 mm. Margin (well defined or ill defined), shape (regular or irregular), halo sign (present or absent), echo structure (solid, mixed, or cystic), internal echo (homogeneous or heterogeneous), echogenicity (hyperechoic or isoechoic, hypoechoic, or hypoisoechoic), calcification (fine, other, or absent), and invasion to adjacent organs (present or absent) were assessed to make a US diagnosis of either malignant or benign disease. Preoperatively, we considered ill-defined or irregular nodules with heterogeneous internal echo or hypoechogenicity to be malignant. With signs of invasion to adjacent organs, nodules were malignant regardless of the other characteristics. We also considered solitary and solid nodules with homogeneous internal echo to be follicular neoplasms.
The final pathological diagnosis was determined by a single pathologist (Hiroto Yamashita). Pathologically, we found 145 papillary carcinomas, 11 follicular carcinomas, 2 medullary carcinomas, 1 mucoepidermoid carcinoma, 101 adenomatous goiters, 54 follicular adenomas, 14 cysts, and 1 calcified nodule. Nodules were divided into 2 groups on the basis of pathological diagnosis: follicular neoplasms (follicular adenoma and carcinoma) and nonfollicular neoplasms (papillary carcinoma, medullary carcinoma, mucoepidermoid carcinoma, adenomatous goiter, cyst, and calcified nodule).
The numbers of true-positive (TP), true-negative (TN), false-positive (FP), and false-negative (FN) US diagnoses were determined. Sensitivity was calculated as TP/(TP + FN), and specificity was calculated as TN/(TN + FP). Fisher exact 2-tailed test and multiple logistic regression analysis with a JMP program for Macintosh (SAS Institute Inc, Cary, NC) were used for statistical analyses. P<.05 was considered statistically significant. The probability of malignancy was calculated by the following formula:
where exp represents exponent, a0 is the intercept estimate, a1-5 is the estimate of US characteristics, and X is the point of US characteristics.
For preoperative US diagnosis, the overall sensitivity was 81.8% and the overall specificity was 90.6%. For the 265 nonfollicular neoplasms, sensitivity and specificity were 86.5% and 92.3%, respectively. For the 64 follicular neoplasms, sensitivity and specificity were 18.2% and 88.7%, respectively.
For nonfollicular malignant neoplasms, the positivity and the sensitivity of each US characteristic are shown in Table 1. According to Fisher exact 2-tailed test, all characteristics assumed to determine malignancy were statistically significant.
Considerations of internal echo, invasion, and halo sign were excluded from multiple logistic regression analysis for the following reasons. When nodules were studied for echogenicity, nodules showing a heterogeneous internal echo appeared hypoisoechoic; thus, a bias existed. Invasion showed a low positivity (18/148) with 100% sensitivity, and the P value for the halo sign was not significant (P = .60). The remaining 5 characteristics were reanalyzed. The remaining characteristics assumed to determine malignancy were statistically significant (P<.05) and the estimate and point of each characteristic are shown in Table 2. We constructed receiver operating characteristic curves with the use of these data (Figure 1). According to the receiver operating characteristic curve, the best point at which to predict malignancy was when sensitivity was 94% and specificity was 87%. The probability of malignancy at this point was greater than 0.2. Figure 2 shows the receiver operating characteristic curve for follicular neoplasms, which was created in the same manner.
Thyroid nodules are very common, detected on palpation in 2.1% to 4.2% of the population in the United States,13,14 in 5.1% to 6.5% in Europe,15,16 and in 1.3% to 3.9% in Japan.17,18 Thus, it is important to differentiate malignant nodules from benign nodules to avoid unnecessary thyroidectomy. Ultrasonography is the first choice for detecting thyroid abnormalities because of its safety, noninvasiveness, nonradioactivity, and effectiveness. Ultrasound can detect the presence, site, size, and number of thyroid nodules, and there have been reports of US characteristics of malignancy, such as ill-defined margin, irregular shape, hypoechogenicity, heterogeneity, absence of cystic lesion and/or the halo sign, the presence of calcification, and invasion to adjacent organs.1- 4,6- 8,10 However, the characteristics of thyroid nodules on US can vary widely by observer,19 and a formula for predicting malignancy has not yet been reported. The US diagnosis of thyroid nodules is still controversial for these 2 reasons.
For this study, we chose nodules with a diameter greater than 5 mm, because high-resolution real-time US can detect characteristics of even small thyroid nodules in detail, and sometimes patients with minute cancers die of local and/or distant metastasis.20,21 It is important not to overlook malignancy. We also separated thyroid nodules into 2 groups on the basis of pathological diagnosis. Because follicular neoplasms are difficult to diagnose as malignant or benign, they should be considered separately.
As far as we know, this is the first study to formulate statistical prediction of malignancy based on US characteristics of thyroid nodules in a large number of patients in a short time. The US features reported previously to predict malignancy (Table 1) were statistically significant on the basis of univariate analysis. According to multiple logistic regression analysis, 5 features (margin, shape, echo structure, echogenicity, and calcification) were statistically significant for predicting malignancy in nonfollicular neoplasms. Using receiver operating characteristic curves based on the multiple logistic regression analysis, we identified the best point with high true-positive and low false-negative rate. The probability at this point was greater than 0.2. Some typical US characteristics are shown in Figure 3. For follicular neoplasms, we could not differentiate carcinoma from adenoma by the same procedure.
In conclusion, we created a formula for predicting malignancy of thyroid nodules on the basis of multiple logistic regression analysis with the use of only 5 US features. It can be applied to nonfollicular thyroid neoplasms with high accuracy but not to follicular thyroid neoplasms.
Corresponding author and reprints: Shiro Noguchi, MD, PhD, Noguchi Thyroid Clinic and Hospital Foundation, 6-33 Noguchi Naka-machi, Beppu, Oita, 874-0932, Japan.