The 115 patient/tumor sample dendrogram was taken from the hierarchical clustering analysis of the breast intrinsic gene list.15,17 Tissue samples in gray indicate unknown subtype. The gene expression data for estrogen receptor (ER), human epidermal growth factor receptor-2 (HER2), CK5 (cytokeratin), and HER1 are shown with red squares representing the highest average expression, black representing average gene expression, and green representing the lowest below average expression. Gray indicates gene expression data not available. Note that PR (progesterone receptor) was not included in this gene expression analysis because it was not present on these early generation microarrays. Below the gene expression data are the revised immunohistochemical (IHC) classification schema used in this study. PR was added to the IHC profile since it is an ER-regulated gene expressed in most ER+ tumors.
Prevalence was luminal A = 51%, luminal B = 16%, basal-like = 20%, HER2+/ER− = 7%, and unclassified = 6%.
Lisa A. Carey, Charles M. Perou, Chad A. Livasy, Lynn G. Dressler, David Cowan, Kathleen Conway, Gamze Karaca, Melissa A. Troester, Chiu Kit Tse, Sharon Edmiston, Sandra L. Deming, Joseph Geradts, Maggie C. U. Cheang, Torsten O. Nielsen, Patricia G. Moorman, H. Shelton Earp, Robert C. Millikan. Race, Breast Cancer Subtypes, and Survival in the Carolina Breast Cancer Study. JAMA. 2006;295(21):2492–2502. doi:10.1001/jama.295.21.2492
Author Affiliations: Division of Hematology/Oncology (Dr Carey), Departments of Medicine (Drs Dressler and Earp and Mr Cowan), Genetics (Dr Perou and Ms Karaca), and Pathology (Drs Perou and Livasy), School of Public Health, Department of Epidemiology (Drs Conway, Troester, Deming, and Millikan and Mss Tse and Edmiston), University of North Carolina-Lineberger Comprehensive Cancer Center, Chapel Hill; Department of Community and Family Medicine, Duke University Medical Center, Durham, NC (Dr Moorman); Genetic Pathology Evaluation Centre, University of British Columbia, Vancouver (Dr Nielsen and Ms Cheang); and Roswell Park Cancer Institute, Buffalo, NY (Dr Geradts).
Context Gene expression analysis has identified several breast cancer subtypes, including basal-like, human epidermal growth factor receptor-2 positive/estrogen receptor negative (HER2+/ER–), luminal A, and luminal B.
Objectives To determine population-based distributions and clinical associations for breast cancer subtypes.
Design, Setting, and Participants Immunohistochemical surrogates for each subtype were applied to 496 incident cases of invasive breast cancer from the Carolina Breast Cancer Study (ascertained between May 1993 and December 1996), a population-based, case-control study that oversampled premenopausal and African American women. Subtype definitions were as follows: luminal A (ER+ and/or progesterone receptor positive [PR+], HER2−), luminal B (ER+ and/or PR+, HER2+), basal-like (ER−, PR−, HER2−, cytokeratin 5/6 positive, and/or HER1+), HER2+/ER− (ER−, PR−, and HER2+), and unclassified (negative for all 5 markers).
Main Outcome Measures We examined the prevalence of breast cancer subtypes within racial and menopausal subsets and determined their associations with tumor size, axillary nodal status, mitotic index, nuclear pleomorphism, combined grade, p53 mutation status, and breast cancer–specific survival.
Results The basal-like breast cancer subtype was more prevalent among premenopausal African American women (39%) compared with postmenopausal African American women (14%) and non–African American women (16%) of any age (P<.001), whereas the luminal A subtype was less prevalent (36% vs 59% and 54%, respectively). The HER2+/ER− subtype did not vary with race or menopausal status (6%-9%). Compared with luminal A, basal-like tumors had more TP53 mutations (44% vs 15%, P<.001), higher mitotic index (odds ratio [OR], 11.0; 95% confidence interval [CI], 5.6-21.7), more marked nuclear pleomorphism (OR, 9.7; 95% CI, 5.3-18.0), and higher combined grade (OR, 8.3; 95% CI, 4.4-15.6). Breast cancer–specific survival differed by subtype (P<.001), with shortest survival among HER2+/ER− and basal-like subtypes.
Conclusions Basal-like breast tumors occurred at a higher prevalence among premenopausal African American patients compared with postmenopausal African American and non–African American patients in this population-based study. A higher prevalence of basal-like breast tumors and a lower prevalence of luminal A tumors could contribute to the poor prognosis of young African American women with breast cancer.
Breast cancer is a heterogeneous disease composed of a growing number of recognized biological subtypes. The prognostic and etiologic importance of this diversity is complicated by many factors, including the observation that differences in clinical outcomes often correlate with race. Age-adjusted mortality in the United States from breast cancer in white women is 28.3 deaths per 100 000 compared with 36.4 deaths per 100 000 in African American women.1 This disparity is particularly pronounced among women younger than 50 years, in whom mortality is 77% higher among African American women compared with white women (11.0 vs 6.3 deaths per 100 000). Breast cancer in African American women has been characterized by higher grade,2,3 later stage at diagnosis,2,4 and worse survival even after controlling for stage at diagnosis.4- 6 The causes of this observed survival difference are likely multifactorial and include socioeconomic factors,4 differences in access to screening7 and treatment,6 as well as potential biological differences among the cancers themselves.3,8,9 Biological differences among breast cancers may reflect genetic influences, differences in lifestyle, or nutritional or environmental exposures. In addition, studies that include race as a characteristic must take into account that there is significant disagreement as to how race is measured and interpreted in medical research.10- 12
Gene expression studies using DNA microarrays have identified several distinct breast cancer subtypes13 based on an intrinsic gene list that includes 496 genes that differentiate breast cancers into separate groups based only on gene expression patterns. These subtypes differ markedly in prognosis14- 16 and in the repertoire of therapeutic targets they express.17 The intrinsic subtypes include 2 main subtypes of estrogen receptor (ER)–negative tumors (basal-like and human epidermal growth factor receptor-2 positive/ER− [HER2+/ER−] subtype) and at least 2 types of ER+ tumors (luminal A and luminal B).14,15 Basal-like tumors typically show low expression of HER2 and ER and exhibit high expression of genes characteristic of the basal epithelial cell layer, including expression of cytokeratins 5, 6, and 17.13 The HER2+ (ie, gene amplified and/or highly overexpressed protein) tumors fall into at least 2 distinct expression groups: those that are ER− and typically cluster near the basal-like tumors (HER2+/ER− subtype), and those that are ER+ (and may also be progesterone receptor positive [PR+]) and cluster with tumors of luminal cell origins as part of the luminal B subtype.14,15 The luminal subtype A and B tumors express ER, GATA3, and genes regulated by both ER and GATA3.18,19 Compared with luminal B tumors, luminal A tumors express higher levels of ER and GATA3 and show more favorable patient outcomes,15 whereas luminal B tumors more often express human epidermal growth factor receptor-1 (HER1), HER2, and/or cyclin E1.14,15
Previous expression studies examined breast cancer subtypes in small data sets derived from frozen tumor banks.14- 16,20,21 The incidence of any of these molecular subtypes in a large population-based study and their relationship with demographic variables have not been systematically evaluated. The Carolina Breast Cancer Study (CBCS) is a population-based, case-control study of environmental and molecular determinants of breast cancer risk.22 The CBCS is unique in that it oversampled African American and premenopausal women to allow better representation of these 2 subpopulations, making it well-suited for the examination of race- and age-related variables. We used immunohistochemical (IHC) surrogates to identify breast tumor intrinsic subtypes using formalin-fixed, paraffin-embedded tumor blocks collected for CBCS cases, and examined associations between tumor subtypes and race, menopausal status, tumor characteristics, and survival.
Although breast cancer subtypes were originally identified by gene expression analysis using DNA microarrays, large-scale subtyping using gene expression profiling from formalin-fixed, paraffin-embedded samples is not currently feasible. For this reason, we used IHC markers that had been previously verified against gene expression profiles to estimate the prevalence of the intrinsic subtypes in a large population-based epidemiological study of African American and white women. The IHC profiles were developed previously by performing both microarray analysis and IHC for ER, HER2, HER1, and cytokeratin 5/6 on a single series of breast cancers; in that way, we identified combinations of these IHC markers that best matched the gene expression patterns, and then validated these IHC surrogates using a 930-case tissue microarray from the University of British Columbia.17 In that earlier study, the IHC-based definitions were luminal (ER+ and HER2−), HER2+ subtype, and basal-like (ER−, HER2−, cytokeratin 5/6+, and/or HER1+). We updated these IHC-based definitions in 2 ways: first, we included PR, which is another widely used breast tumor marker, in the definition of luminal because PR is an ER-regulated gene expressed in most ER+ tumors and is associated with response to hormonal therapy. Second, we recategorized HER2+ tumors into 2 groups based on their ER status since HER2+/ER− tumors cluster separately from HER2+/ER+ tumors in hierarchical clustering analyses.14,15 In this way, we refined the previous IHC profiles for the breast cancer subtypes and created updated IHC subtype definitions: basal-like (ER−, PR−, HER2−, cytokeratin 5/6+, and/or HER1+), HER2+/ER− subtype (HER2+, ER−, PR−), luminal A (ER+ and/or PR+, HER2−), and luminal B (ER+ and/or PR+, HER2+). This definition for luminal B does not identify all luminal B tumors because only 30% to 50% are HER2+. The other luminal B tumors in this system would be classified with luminal A tumors. Tumors that were negative by IHC for all 5 markers (ER, PR, HER2, HER1, and cytokeratin 5/6) were considered unclassified. These refined IHC profiles are seen in Figure 1. In support of the updated profiles, the HER2+ and ER+ tumors (by gene expression) were found mostly within the ER+ tumor dendrogram branch and within the luminal B subtype, whereas the HER2+ and ER− tumors that represent the HER2+/ER− subtype gene expression pattern were seen within a distant ER− tumor dendrogram branch, which suggests that these 2 groups are different.
The CBCS is a population-based, case-control study conducted in 24 counties of eastern and central North Carolina.22 The goal of the present analysis was to estimate the prevalence of breast cancer subtypes in a population-based sample of breast cancer cases, and to examine correlations with clinico-pathologic variables and patient survival. The analysis was based on breast cancer cases ascertained between May 1993 and December 1996 (phase 1 of the CBCS) and excluded controls. Newly diagnosed (incident) cases of invasive breast cancer in women between the ages of 20 and 74 years were identified using a rapid ascertainment system developed in collaboration with the North Carolina Central Cancer Registry. Cases were selected by randomized recruitment with predetermined probabilities to increase enrollment of African American women and women younger than 50 years so that these otherwise underrepresented subpopulations would represent approximately 50% of the study population. The sampling strategy was intended to balance the 4 patient groups (younger African American, older African American, younger non–African American, older non–African American cases) so that statistically valid comparisons could be made for each of the 4 groups. To this end, the schema sampled 100% of African American cases younger than 50 years, 75% of African American cases at least 50 years old, 67% of non–African American cases younger than age 50 years, and 20% of non–African American cases at least 50 years old.22 Other than the oversampling of younger and African American women by design, the CBCS population is representative of cases reported to the North Carolina Central Cancer Registry in that region of North Carolina during that time, except for a slightly lower proportion of African American cases aged 40 to 59 years with later-stage disease (2.4% vs 10.2%, P = .03).2 Contact rates in the CBCS were lowest among younger women and African American women, while participation rates were lowest among older women and African American women.23 Compared with women who participated in the CBCS, nonparticipants were more likely to be of lower socioeconomic status, to have a lower educational level, and to have a recent history of unemployment.23
The study procedures for recruitment and enrollment were approved by the institutional review board of the University of North Carolina School of Medicine, and all study participants gave written informed consent.
Race was determined by self-identification and for analysis was categorized as African American or non–African American. Non–African American cases were predominantly white but also included 14 women who reported their race as Native American, Hispanic, Asian American, or multiracial. Information on race was obtained since a primary goal of the CBCS was to better understand breast cancer in African American women. Menopausal status was based on in-person interview data. Sampling was done according to age (since menopausal status was not obtained until interviews), but this did not affect the results (presenting by menopausal status rather than by age <50 and ≥50 years). Women younger than 50 years who had undergone natural menopause, bilateral oophorectomy, or irradiation to the ovaries were classified as postmenopausal and were considered together. In women aged 50 years or older, menopausal status was assigned based on cessation of menstruation.24
Centralized review of histology for all tumors was conducted by a single pathologist (J.G.),2 who was blinded to patient demographics and other study variables. Based on histology, tumors were classified into 6 groups: A (invasive ductal carcinomas not otherwise specified, medullary, apocrine, neuroendocrine carcinomas), B (tubular, mucinous, papillary carcinoma, cribriform carcinomas), C (metaplastic, anaplastic, undifferentiated high-grade carcinomas), D (invasive lobular carcinomas), E (mixed ductal and lobular carcinomas), and unknown (unable to classify). Tumor size, lymph node status, and American Joint Committee on Cancer (AJCC, 5th edition) stage at diagnosis were abstracted from the medical records. Nuclear grade, histologic grade, and mitotic index were previously determined2 according to the Nottingham modification of the Scarff-Bloom-Richardson criteria.25 High mitotic index was defined as greater than 10 mitotic figures per 10 high-power fields.
Estrogen receptor and PR status were determined from medical records (80%) or by IHC performed at the University of North Carolina-Lineberger Comprehensive Cancer Center Immunohistochemistry Core Facility in Chapel Hill.26 For the cases in which ER and PR status was obtained from the medical record, various clinical laboratories determined the results. About half used IHC on paraffinized tissue with cutoffs for receptor positivity from more than 0% to more than 20%, and about half used biochemical assays on frozen tissue with cutoffs of 10 to 15 fmol/mg. For the remaining tumors, IHC was performed in the Core laboratory at the University of North Carolina.26 Scoring for IHC was adapted from the method of the the University of North Carolina Hospitals Department of Pathology with 5% invasive breast cancer nuclei-positive cells as the cutoff value for ER or PR status. In a 10% random sample of 23 cases that were ER+ and 24 cases that were ER− based on medical records, comparison of the medical record IHC result with IHC done by the Core Laboratory at the University of North Carolina revealed a κ statistic of 0.62, indicating substantial agreement beyond chance27 with an overall concordance of 81%. The HER2 status was determined using the CB11 antibody (Biogenex, San Ramon, Calif) as previously defined.28 HER2-positivity was defined as membrane or membrane plus cytoplasmic staining with weak or greater intensity in at least 10% of tumor cells. On a subset of 184 patients, a comparison of 2 independent scorers of the HER2 IHC assay, who were blinded to the other clinical variables, yielded a κ statistic of 0.58, indicating moderate agreement beyond chance27 with an overall concordance of 82%. Staining for HER1 was categorized using a 0 to 3 scoring system,17 and our assignment of HER1 positivity was defined as any HER1 staining. Cytokeratin 5/6 was scored positive if any cytoplasmic and/or membranous staining was seen.29
A TP53 mutational analysis was performed at the University of North Carolina-Lineberger Comprehensive Cancer Center Molecular Epidemiology Core Facility using single-strand conformational polymorphism analysis with direct sequencing of positive results as previously described.30 Screening for germline mutations in BRCA1 was accomplished using multiplex single-strand conformation analysis as previously described on the first 211 cases in phase 1 of the CBCS.31
The National Death Index provided vital status on CBCS cases as of May 11, 2004. These data were derived from death certificates and included all causes of death for overall survival and disease-specific cause of death for breast cancer–specific survival. In 1 large epidemiological study, the sensitivity of the National Death Index search was 98% and specificity was approximately 100% for breast cancer.32 Breast cancer-specific survival was determined by the International Classification of Diseases (ICD) breast cancer codes 174.9 (ICD-9) or C50.9 (ICD-10) as the underlying cause of death on the death certificate.
To account for the sampling strategy that systematically overrepresented certain patient groups (eg, younger, African American), analyses are presented stratified by the 4 patient groups. Differences between breast cancer subtypes with regard to clinicopathologic characteristics were examined using 1-way analysis of variance (ANOVA) for age, and χ2 tests for the remaining variables. The Fisher exact test was used when expected cell counts were less than 5 using the Monte Carlo method as implemented in SAS.33 Odds ratios (ORs) and 95% confidence intervals (CIs) were calculated to estimate magnitude and precision of association among breast cancer cases. Odds ratios represent prevalence and were calculated using logistic regression as implemented in SAS version 8.0 (SAS Institute Inc, Cary, NC). P values were similar when prevalence ratios were used as the measure of association, but several models did not converge. The reported P values reflect the β coefficient in the relevant logistic model. Variables were chosen based on clinical interest, and included age, race, and stage at diagnosis. Because of collinearity with stage, lymph node status was not included with stage in logistic models. To test for overfitting, we performed the Hosmer-Lemeshow goodness-of-fit test,34 which did not reveal significant evidence for lack of fit. Likelihood ratio tests for interaction were conducted by comparing models with main effects to models with main effects plus an interaction term. P values were not corrected for multiple comparisons since the variables examined (clinicopathologic variables, definitions of breast cancer subtypes) were not independent and thus do not represent separate statistical tests. Survival curves were generated using the Kaplan-Meier method,35 and the log-rank test36 was used to compare mean survival across the IHC subtypes. To confirm that the assumptions of the log-rank test were fulfilled,36 we determined that censoring due to non–breast cancer causes of death was unrelated to breast cancer subtype (P = .55), and the proportion of patients in each of the breast cancer subtypes did not differ across the years of enrollment in the study (P = .41). Censoring did not differ according to year of enrollment in the study for 5-year breast cancer–specific survival (P = .73) or overall survival (P = .33). Date and cause of death were obtained from the National Death Index and were thus assigned without knowledge of breast cancer subtype.
As a further test for differences in survival among breast cancer subgroups, we performed univariate Cox regression to estimate hazard ratios for basal-like breast cancer vs luminal A, and for HER2+/ER− breast cancer vs luminal A.37 Power calculations were performed using a computer program developed by Dupont and Plummer,38 and concluded that power was very good (70%-80%) or excellent (>80%) for the majority of comparisons in this analysis. Statistical analysis was performed by C.K.T. under the supervision of R.C.M.
A total of 1153 incident cases of invasive breast cancer were identified in phase 1 of the CBCS. Successful contact was obtained in 861 cases (75%), and of these 807 (94%) had tumor blocks or tissue sections for centralized review and IHC. Of the 807 cases, 496 (61%) had both adequate tumor and interpretable IHC data for ER, PR, HER2, cytokeratin 5/6, and HER1, which was a requirement for inclusion in the subtype analysis. These cases included 196 African American and 300 non–African American women. Comparison of these 496 cases included those with the 365 excluded cases (on whom we did not have either adequate tumor tissue or complete IHC data) revealed the following differences: the included cases were more likely to be stage II (51% vs 39%) and less likely to be stage I (39% vs 48%), with little difference seen in stage III (8% vs 10%) or stage IV (3% vs 4%) percentages. The included cases also were more likely to have tumors with high mitotic indices (46% vs 34%, P<.001). These differences likely reflected the fact that tumor blocks from patients with smaller tumors were either unavailable or had insufficient tissue for subtype analysis. There were no differences between the included and excluded cases in age, race, menopausal status, lymph node status, nuclear grade, histologic grade, or survival.
Characteristics of the 496 CBCS cases with IHC data, overall and according to IHC subtypes, are presented in Table 1. The IHC subtypes differed significantly by age (P<.001), race (P = .03), menopausal status (P = .008), combined race and menopausal status (P<.001), axillary lymph node status at time of diagnosis (P = .04), histology group (P<.001), nuclear grade (P<.001), histologic grade (P<.001), and mitotic index (P<. 001). Patients with luminal A and B tumors were older than the other patients, and patients with the HER2+/ER− subtype had the highest prevalence of positive lymph nodes. Patients with basal-like tumors were more likely to be African American, premenopausal, and to have tumors with high nuclear grade, high histologic grade, and high mitotic index. Basal-like tumors also showed the highest prevalence of unfavorable histologies (group C: metaplastic, anaplastic, and undifferentiated high-grade carcinomas).
In the overall study population, the prevalence of the basal-like subtype was 20% (100 cases total). The prevalence of basal-like breast cancer was significantly higher in African American breast cancer cases, comprising 52 of 196 African American women (26%) vs 48 of 300 non–African American cases (16%) (Table 1). Basal-like tumors were also more frequent in premenopausal cases, comprising 64 of 261 (24%) vs 36 of 235 (15%) postmenopausal cases. These prevalence estimates should be interpreted with caution, because they do not reflect the sampling probabilities used to define eligible cases in the CBCS. To account for the sampling strategy, separate estimates were derived for each of the 4 patient groups defined a priori (Table 2). The high prevalence of basal-like tumors in African American women was mostly seen in premenopausal women, in whom the prevalence was 39%. The prevalence of basal-like breast cancer in premenopausal African American women was significantly elevated compared with postmenopausal African American (14%) or non–African American women (16%) of any age (P<.001) (Table 2). The difference in prevalence of basal-like breast cancer between premenopausal and postmenopausal cases was statistically significant among African American cases (P<.001), but not among non–African American cases (P = .94). The luminal A subtype, conversely, was less frequent among premenopausal African American women (36%) compared with postmenopausal African American (59%) or non–African American (54%) women. The higher prevalence of basal-like breast cancers in younger African American patients was maintained when we stratified on stage at diagnosis. For example, among cases with stage I disease, the prevalence of basal-like breast cancer was 40% in premenopausal African American women, 6% in postmenopausal African American women, 10% in premenopausal non–African American women, and 8% in postmenopausal non–African American women (P = .001). This difference by race and menopausal status was not seen in the other ER− subtype (HER2+/ER−), which also was associated with high grade (Table 1).
Odds ratios for the association of breast cancer subtypes with lymph node status, histologic grade, and mitotic index are presented in Table 3, with the luminal A subtype (the most common IHC subtype representing 51% of the cases) serving as the referent group. Odds ratios were adjusted for age, stage, and race. Compared with the luminal A subtype, patients with basal-like tumors were 2.1 times more likely to be African American (P = .004). Likelihood ratio tests showed a significant interaction between race and menopausal status for developing the basal-like subtype (P = .02), but not HER2+/ER− (P = .49), luminal B (P = .62), or unclassified tumors (P = .58) compared with luminal A. In comparison with luminal A tumors and after adjustment for age, race, and stage, the basal-like subtype was 11 times more likely to have high mitotic index (P<.001), 9.7 times more likely to have high nuclear grade (P<.001), and 2.5 times more likely to have high histologic grade (P = .003). The basal-like subtype was not associated with the presence of positive axillary lymph nodes at the time of diagnosis (P = .53), whereas both HER2+ subtypes (HER2+/ER− and luminal B) were significantly more likely to have positive lymph nodes at presentation (P = .04). Notably, a strong association with high histologic, nuclear, and mitotic grade was seen for both subtypes of ER− tumors, namely the basal-like and HER2+/ER− tumors. However, the HER2+/ER− subtype was not significantly associated with race or menopausal status.
The TP53 sequence-based mutation analysis was performed on 330 of the 496 IHC classified breast cancer cases, of which 84 (25%) had TP53 mutations. The presence of TP53 mutations differed significantly with IHC subtype: 44% (28 of 63) of basal-like tumors and 43% (10 of 23) of HER2+/ER− subtype tumors contained TP53 mutations, whereas only 23% (12 of 52) of luminal B and 15% (25 of 175) of luminal A were mutation-positive (P<.001). These findings were in agreement with previous comparisons of the breast tumor intrinsic subtypes and TP53 mutation status14 as well as previous demonstration of a high proportion of p53-mutant tumors in BRCA1 and cytokeratin 5/6–positive tumors.39,40
A subset of CBCS patients were screened for BRCA1 germline mutations.31 Of the 496 cases assayed, 211 were screened for mutations in BRCA1, with 4 carriers and 1 variant of unknown effects being identified. The BRCA1 mutation carriers comprised 1 luminal A tumor, 1 unclassified tumor, and 2 basal-like tumors. Although these numbers were very small, the data were consistent with earlier findings that most BRCA1 mutant tumors show the basal-like phenotype15,41,42 and that most BRCA1 mutant tumors do not show HER2 positivity.43
The maximum duration of follow-up for the CBCS phase 1 cases was 11.2 years (minimum of 8.1 years). During this period of observation, the study patients had 73% overall survival (232 deaths among 861 cases). Of the 232 deaths, 170 were considered breast cancer-specific, giving an overall disease-specific survival of 80% (691 of 861). African American cases had worse breast cancer-specific survival (74%) compared with non–African American cases (84%) (P<.001). Age, race, menopausal status, stage, ER status, PR status, TP53 mutation status, mitotic index, nuclear grade, and histologic grade were also significant predictors of breast cancer-specific survival (P<.001 for each).
The breast cancer subtypes also differed significantly in breast cancer-specific survival (P<.001): basal-like subtype (75%), HER2+/ER− subtype (52%), luminal A (84%), luminal B (87%), and unclassified (77%). Kaplan-Meier survival curves for breast cancer-specific survival are presented in Figure 2. A steep fall in breast cancer–specific survival was observed in the first 4 to 5 years for the basal-like and HER2+/ER− tumors, with particularly poor survival for the HER2+/ER− subtype. A similar early relapse pattern has been described for BRCA1 tumors.44,45 Over the entire observation period, breast cancer–specific survival was significantly worse among basal-like (hazard ratio, 1.8; 95% CI, 1.1-2.9; P = .03) and HER2+/ER− breast cancer patients (hazard ratio, 3.5; 95% CI, 1.9-6.2; P<.001) compared with luminal A as the referent group.
The difference in survival by breast cancer subtype was seen both among lymph node–positive patients (P = .01) and lymph node–negative patients (P = .03). Data were sparse after stratifying on lymph node status and should be interpreted with caution. Breast cancer–specific survival within lymph node–positive patients by subtype was the following: basal-like (51%), HER2+/ER− (39%), luminal A (65%), luminal B (83%), and unclassified (44%). Within the lymph node−negative patients, breast cancer-specific survival was the following: basal-like (93%), HER2+/ER− (71%), luminal A (94%), luminal B (92%), and unclassified (91%).
The outcomes in premenopausal African American cases did not become more similar to the other groups when basal-like cases were removed. The breast cancer−specific survival by racial and menopausal subsets without basal-like breast cancers still differed significantly: premenopausal African American 64%, postmenopausal African American 81%, premenopausal non–African American 81%, and postmenopausal non–African American 91% (P<.001). These data suggest that factors other than subtype, such as access to treatment, could also be influencing survival in younger African American women.
Gene expression profiling has identified breast cancer intrinsic subtypes that predict distinct clinical outcomes14,15 and which have been shown to be present in women of multiple ethnicities.46 The basal-like subtype has been associated with poor clinical outcomes,15,16 which likely reflect this subtype's high proliferative capacity14- 16 as well as the lack of directed therapies since basal-like tumors do not typically express ER− or overexpress HER2.17 To facilitate investigation of the population-based frequencies of the basal-like breast cancer subtype, we refined an IHC-based assay to identify the main breast tumor intrinsic subtypes. We used the IHC method for categorization and determined for the first time the population-based prevalence of these subtypes. Although IHC-based assays do not provide as much biological insight into tumor biology as mRNA-based assays containing thousands of genes, this IHC assay allowed classification of tumors into categories that have demonstrated associations between intrinsic subtypes and proliferation rates, overall survival, TP53 status, and BRCA1 mutation status.14,15,17,29,41,42 The reproducible correlations across different studies and when using different assays (IHC and DNA microarray expression profiles) shows that we are tracking common tumor subtypes with similar biologic characteristics and clinical behaviors across distinct patient sets. The IHC-based classification system also allows analyses of subtypes to be conducted in patient populations where fresh tissue is not available.
In the population-based CBCS, the prevalence of the basal-like and luminal A breast cancer subtypes was strongly influenced by race and menopausal status; the highest prevalence of basal-like and lowest prevalence of luminal A tumors were observed among premenopausal African American breast cancer patients. Because the CBCS is a population-based sample, within defined race and age groups estimates of prevalence are likely to be representative of the underlying North Carolina population.2 Differences between the CBCS and breast cancer patients reported to the North Carolina Central Cancer Registry include a lower proportion of African American women between the ages of 40 and 59 years with higher-stage tumors and lower participation among women from lower socioeconomic and educational strata.2,23 Each of these factors could actually produce an underestimate of the prevalence of more aggressive breast cancer subtypes (basal-like and HER2+/ER−) among younger African American cases enrolled in the CBCS. However, this potential bias may have been partially offset by the fact that the analysis of IHC markers in the CBCS was based on patients with larger tumors.
A high frequency of basal-like tumors was observed in a study of breast cancer in Nigerian women, among whom ER-negative and HER2-negative tumors comprised 87 of 148 women, or 59% of total cases.47 According to gene expression studies, ER-negative breast tumors fall into 1 of 2 categories,14,15 namely basal-like tumors (ER−, PR−, and HER−) and the HER2+/ER− subtype (HER2+/ER−) (Figure 1). The HER2+/ER– group, which is also a high-grade and ER-negative tumor group, did not vary significantly with age or race. These findings suggest that associations between premenopausal breast cancer, race, and hormone status in the CBCS was driven by an excess of the basal-like subtype. Breast cancers that develop among BRCA1 mutation carriers are generally basal-like.15,41,42 However, very few BRCA1 mutation carriers were present in the CBCS, with 2 out of the 4 known carriers falling into the basal-like category. No BRCA1 carriers were identified among the African American cases tested in the CBCS and only a single variant of unknown biological significance was identified.31 Thus BRCA1 variants are unlikely to explain the high prevalence of basal-like breast cancer in younger African American patients in this study.
Basal-like breast cancers in the CBCS exhibited aggressive features, including high proliferative capacity (measured by mitotic index), high histologic grade, high nuclear grade, and frequent TP53 mutations. Even after adjustment for age, race, and stage, the association of basal-like and HER2+/ER− subtypes with aggressive features remained significant. These findings were expected given the high expression of the proliferation cluster of genes in microarray analyses of basal-like and HER2+/ER− subtype tumors.13- 15,48 The association of race with high-grade breast tumors and ER negativity has been previously reported.2,3 However, our study suggests that this association is driven by the increased prevalence of basal-like tumors and not by an increase in HER2+/ER− subtype.
The observation that the intrinsic breast cancer subtypes carry different prognoses was confirmed in the CBCS. Disease-specific survival was significantly lower among breast cancer cases with basal-like and HER2+/ER− tumors, and more favorable among cases with luminal A tumors. The HER2+/ER− subtype appears particularly prone to early and frequent relapse, befitting the clinical experience with HER2 overexpressing tumors49; the CBCS cases in this study were diagnosed between 1993 and 1996 and were not treated with the anti–HER2 monoclonal antibody trastuzumab. Basal-like tumors were more frequent in younger African American women in the CBCS, and could contribute to their poor prognosis compared with other breast cancer patients. However, when cases of basal-like tumors were removed, the breast cancer–specific survival remained significantly worse among premenopausal African American cases. As noted previously, this may reflect the impact on prognosis of access to care, treatment, or other differences. In other words, while the high incidence of the poor-prognosis basal-like subtype may contribute to their relatively worse outcome, it does not entirely explain the poor outcomes seen in younger African Americans. We lacked treatment data in the CBCS, so we could not examine interactions between IHC subtypes and efficacy of cancer therapy. Examination of tumor microarray data using patients treated with surgery alone also suggests that these subtypes are prognostic and reflect the natural history of these tumors.15 Interestingly, unlike HER2+/ER− and luminal B tumors, the basal-like subtype was not associated with involvement of positive axillary lymph nodes, a finding that was previously noted in a study of cytokeratin 5/6–positive tumors that oversampled BRCA1 tumors.40 Since basal-like breast cancers still carried a poor prognosis, it is possible, as suggested by others,40 that this finding reflects a predominantly hematogenous, rather than lymphatic, pattern of dissemination. Further studies are needed to address this issue.
Further research is needed to confirm the finding that the basal-like breast tumor subtype shows a high prevalence in young African American breast cancer patients. In studies of race and breast cancer, it is important that race be evaluated in the context of other variables such as stage at diagnosis and tumor histology. Information on breast cancer risk factors will help to determine whether basal-like tumors have a different underlying etiology compared with other types of breast cancer. Since BRCA1 carriers tend to develop basal-like tumors, there may be other inherited genetic variants that predispose to developing specific subtypes of breast cancer.15,21 The absence of BRCA1 carriers among African American breast cancer patients in the CBCS suggests that genes other than BRCA1 could predispose women to basal-like breast cancers; however, environmental and socioeconomic factors could also play a role in the observed distribution of breast cancer subtypes. Notably, in the CBCS, the prevalence of BRCA1 mutations was 0 in African Americans and low (3.3%) in non–African Americans.31 Most importantly, our data suggest that epidemiological studies of breast cancer in African American women should consider the joint distribution of ER, PR, and HER2 status (ie, subtypes), rather than rely on ER and PR status alone. Previous analyses typically group together HER2+/ER− tumors with basal-like tumors under the ER-negative designation; however, in the CBCS, HER2+/ER− tumors were not associated with race or menopausal status.
The high prevalence of basal-like tumors in younger African American women could contribute to their higher breast cancer mortality. Additional studies of long-term survival among patients with specific breast cancer subtypes are needed. Clinical trials aimed at identifying therapeutic approaches to the management of basal-like breast cancer are also needed, especially for young African American women.
Corresponding Author: Lisa A. Carey, MD, Division of Hematology/Oncology, University of North Carolina-Lineberger Comprehensive Cancer Center, CB 7305, 3009 Old Clinic Bldg, Chapel Hill, NC 27599-7305 (Lisa_Carey@med.unc.edu).
Author Contributions: Dr Millikan had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.
Study concept and design: Carey, Perou, Moorman, Millikan.
Acquisition of data: Livasy, Dressler, Conway, Edmiston, Deming, Geradts, Cheang, Nielsen, Moorman, Millikan.
Analysis and interpretation of data: Carey, Perou, Livasy, Dressler, Cowan, Conway, Karaca, Troester, Tse, Deming, Cheang, Nielsen, Earp, Millikan.
Drafting of the manuscript: Carey, Perou, Tse, Nielsen, Millikan.
Critical revision of the manuscript for important intellectual content: Carey, Perou, Livasy, Dressler, Cowan, Conway, Karaca, Troester, Edmiston, Deming, Geradts, Cheang, Nielsen, Moorman, Earp, Millikan.
Statistical analysis: Tse, Deming, Cheang, Millikan.
Obtained funding: Perou, Dressler, Conway, Earp, Millikan.
Administrative, technical, or material support: Carey, Livasy, Dressler, Cowan, Conway, Karaca, Troester, Edmiston, Nielsen, Moorman, Millikan.
Study supervision: Carey, Perou, Dressler, Conway, Edmiston, Nielsen, Millikan.
Financial Disclosures: None reported.
Funding/Support: This work was supported by an award to the University of North Carolina for a Breast Cancer Specialized Program of Research Excellence (SPORE) from the National Cancer Institute (NIH/NCI P50-CA58223), a grant from the General Clinical Research Centers Program of the Division of Research Resources/National Institutes of Health (M01RR00046 awarded to Dr Carey), and by the NCI (RO1-CA-101227-01 awarded to Dr Perou).
Role of the Sponsor: All study funding was from public grants for scientific research. The funding organizations had no role in the design and conduct of the study; the collection, analysis, and interpretation of the data; or the preparation, review, or approval of the manuscript.
Previous Presentation: This work was presented in part at the 40th Annual Meeting of the American Society of Clinical Oncology; New Orleans, La; June 2004.
Acknowledgment: For their critical review, we thank Barbara Rimer, PhD, School of Public Health; and Paul Godley, MD, PhD, and Matthew G. Ewend, MD, School of Medicine, University of North Carolina. They were not compensated for their time.