
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
1 Departments of Medical Oncology and 2 Biostatistics, Dana-Farber Cancer Institute, Boston, MA;
3 Department of Pathology, Brigham and Women's Hospital, Boston, MA;
4 Department of Pathology, Faulkner and Brigham and Women's Hospital, Boston, MA;
5 Department of Pathology, Beth-Israel Deaconess Medical Center, Boston, MA;
6 Harvard Medical School and 7 Harvard School of Public Health, Boston, MA;
8 Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, MD;
9 National Cancer Institute, Bethesda, MD; and
10 Department of Pathology, Duke University Medical Center, Durham, NC
Requests for reprints: Kornelia Polyak, Dana-Farber Cancer Institute, 44 Binney St. D740C, Boston, MA 02115. Phone: (617) 632-2106; Fax: (617) 632-4005. E-mail: Kornelia_Polyak{at}dfci.harvard.edu
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
Comprehensive gene expression profiling has proven useful for the molecular classification of invasive breast tumors, but no such extensive study has been performed in DCIS largely due to technical difficulties with obtaining DCIS samples suitable for RNA-based analysis (5, 6). Our laboratory uses serial analysis of gene expression (SAGE) to identify genes implicated in breast tumorigenesis (79). SAGE requires a relatively small amount of tissue (50,000 cells or less) for the generation of comprehensive expression profiles without the requirement for RNA/cDNA amplification steps; thus, it is particularly well suited for the analysis of small, preinvasive tumors (10). Although SAGE allows the quantitative measurement of the mRNA levels of thousands of genes simultaneously in one specimen, to establish how frequently an emerging candidate gene is differentially expressed, it is necessary to examine hundreds of breast specimens. The recently developed tissue microarray technology allows rapid profiling of hundreds of specimens on one slide and is therefore ideally suited for the further evaluation of candidate genes emerging from SAGE analysis (11, 12).
Here we report that based on SAGE analyses of breast tumors of different histopathologic stages, we identified several transcripts that belong to one of the following groups: up- or down-regulated in breast cancer regardless of stage; preferentially up-regulated in DCIS; or in invasive carcinomas. The expression of 14 genes was confirmed at the cellular level by mRNA in situ hybridization using a panel of frozen DCIS and invasive breast tumors. Tissue microarrays composed of primary breast cancers of different pathological stages were used to explore the potential clinical usefulness of 10 genes by investigating their relationship to histopathologic features of the tumors and patient outcome.
| Results |
|---|
|
|
|---|
|
|
|
|
Putative Molecular Markers in DCIS
To determine if there are genes that are statistically significantly more likely to be expressed in DCIS or invasive tumors, we performed various statistical tests (see "Materials and Methods"). On the basis of these analyses, we found that the expression of CD74 and a SAGE tag (CTGGGCGCCC) with no database match were significantly more abundant in invasive or metastatic tumors than in DCIS (P = 0.02 and P = 0.05, respectively, Table 3). The expression of MGC2328, DCD, and eight other genes was also more likely to occur in invasive/metastatic tumors than in DCIS, but none of these reached statistical significance (Table 3). Similarly, the expression of S100A7 and keratin 19 (KRT19) was more frequent and at higher levels in DCIS than in invasive/metastatic tumors, but this was also only marginally statistically significant. In a second statistical analysis, receiver operating characteristic (ROC) curve analysis was used to choose the "best" cutoff for values that result in the most samples being correctly classified as DCIS or invasive, weighing both kinds of misclassification equally (Table 3). Tags that do not include 0.50 in the CI might be potentially useful for the differential diagnosis of in situ and invasive carcinomas. This includes all the tags that had P
0.13 using the higher of two normal cutoffs as well as three other high in DCIS tags and three other high in invasive tags (Table 3). Using the best cutoff values, several of the SAGE tags correctly classified most of the DCIS and invasive SAGE libraries. For example, KRT19 classified 75% of the DCIS and 0% of the invasive libraries as DCIS, while MGC23280 identified 78% of the invasive cancer and 0% of the DCIS libraries as invasive. Thus, MGC23280 had 78% sensitivity and 100% specificity to correctly categorize breast tumors as DCIS or invasive/metastatic in this data set.
|
0.001 using the SAGE 2000 software (Table 4). We hypothesized that genes most highly differentially expressed between normal and DCIS tissue or two different types of DCIS tumors could be used as molecular markers for defining biologically and potentially clinically meaningful subgroups of DCIS. This hypothesis was supported by the fact that clustering analysis of the eight DCIS libraries using only these 26 genes gave a nearly identical dendogram to that of using over 500 genes (Fig. 1C).
Confirming Gene Expression by Messenger RNA in Situ Hybridization
mRNA in situ hybridization determines gene expression at the cellular level, which is particularly useful in solid tumors heterogeneous in their cellular composition. We used 18 frozen DCIS and invasive breast cancer samples for this purpose. Whenever possible, tumors were selected to include normal, DCIS, and invasive components on the same slide to obtain expression data in these three stages of breast tumorigenesis. Examples of in situ hybridization results are depicted in Fig. 2A. Interestingly, we found that the expression of several genes up-regulated in DCIS was localized mostly or exclusively to non-epithelial cells. Specifically, CTGF (connective tissue growth factor) and RGS5 (regulator of G protein signaling) were highly expressed in DCIS myoepithelial cells and stromal fibroblasts, although in certain tumors, we detected expression in DCIS epithelial cells as well (Fig. 2A). Cumulative scores for in situ hybridization were used for hierarchical clustering analysis and statistical tests. A dendogram of the 18 different tumors and 5 normal breast tissues determined that using 14 genes, we were able to differentiate between normal and cancer samples and group the tumors into subclasses (Fig. 2B). Confirming our SAGE results, we were not able to differentiate DCIS and invasive tumors based on gene expression profiles. Surprisingly, in the majority of cases within the same tissue sample, the in situ and invasive areas of the tumor did not show the highest similarity to each other (Fig. 2B). Although this result could be due to the use of mRNA in situ hybridization and the selected genes, it may suggest that gene expression profiles are not necessarily maintained during tumor progression. Fisher's exact test revealed significant positive correlation between the expression of TFF3 and IFI-6-16 (P = 0.01), LOC51235, and BEX1 (P = 0.05), while inverse correlation was found between the expression of S100A7 and RGS5Tu (tumor cell specific expression) (P = 0.04), S100A7 and TFF3 (P = 0.04), and CTGF and TM4SF1 (P = 0.01). No statistically significant associations were found between the expression of the genes and histopathologic features of the tumors.
|
|
| Discussion |
|---|
|
|
|---|
Although no genes were absolutely specific for DCIS tumors, two calcium binding proteins of the S100 family, S100A7 (psoriasin) and S100A9, trefoil factor 3 (TFF3), keratin 19 (KRT19), and apolipoprotein D (APOD) were preferentially more abundant in DCIS than in normal or invasive/metastatic cancerous breast tissue. S100A7 has been previously identified as a gene differentially expressed between in situ and invasive breast cancer and it has been demonstrated to bind calcium and act as a chemoattractant for T-lymphocytes and neutrophils (14). Correlating with this, S100A7 expression was particularly high in high-grade comedo DCIS that frequently demonstrate strong lymphocytic infiltration. Similarly, S100A9 is known to influence cell migration (15). However, it is likely that both S100A7 and S100A9 have additional intracellular functions such as the regulation of cell growth, differentiation, or survival. Keeping with this, the expression of S100A7 was shown to be dramatically up-regulated in response to apoptosis inducing stimuli and in psoriatic and premalignant keratinocytes that are unable to differentiate (1618). The high expression of S100A7 in ER negative, poorly differentiated, and lymph node positive invasive breast tumors suggests that its expression may predict bad clinical outcome and high risk of recurrence or progression in DCIS. Kaplan-Meier curve analysis demonstrating a somewhat decreased overall survival of patients with S100A7 positive invasive breast tumors supports this hypothesis, but to conclusively determine if S100A7 could be useful for the prognostication of breast cancer patients requires further studies.
Another gene highly expressed in DCIS is trefoil factor 3, a secreted protein that has been implicated in tumorigenesis, wound healing, and regulation of epithelial integrity (19). Although its role in breast tumorigenesis is unclear, in colon cancer cells, it has been demonstrated to confer apoptosis resistance (20). Keratin 19 and apolipoprotein D were also more abundant in DCIS than in normal epithelial cells and invasive carcinomas. Keratin 19 is the smallest known acidic keratin that is preferentially expressed in breast tumors of ductal origin (21). Apolipoprotein D is a glycoprotein involved in lipid transport that is negatively regulated by estrogen and its expression in invasive breast cancer may correlate with shorter disease free survival (22, 23). Although the differential expression of KRT19, S100A9, TFF3, and APOD between in situ and invasive carcinomas has not been analyzed in detail, the differences we detected by SAGE would be impossible to confirm by mRNA in situ hybridization or immunohistochemistry due to the semiquantitative nature of these latter techniques. Because we believe that the most likely clinical application of molecular markers would be their evaluation by immunohistochemistry, these four genes have limited potential as molecular markers.
In addition to selecting DCIS and invasive/metastatic cancer specific genes, we also identified 26 transcripts most highly differentially expressed between normal and DCIS or between different types of DCIS and demonstrated that these genes are able to classify DCIS as accurately as over 500 genes could (Fig. 1, B and C). The expression of 15 and 10 of these genes was confirmed by mRNA in situ hybridization and immunohistochemistry, respectively (Fig. 2 and Table 5), and the expression of some of them appeared to correlate with the histopathologic features of the tumors suggesting that they may identify subgroups of DCIS.
The majority of genes that are preferentially expressed in invasive/metastatic carcinomas correspond to uncharacterized genes. Four transcripts (MGC23280, MGC14480, FLJ30428, and NUDT8) encode hypothetical proteins with no known functions, while one of the SAGE tags currently has no ESTs match (Table 3). Dermcidin (DCD) was recently identified as a small secreted protein highly expressed in sweat glands of the skin, but its role in breast cancer is unclear (24). Calmodulin-like skin protein is a calcium-binding protein with a putative role in keratinocyte differentiation (25). Similarly, cellular retinol-binding protein is a protein carrier of retinol, a compound essential for normal epithelial cell differentiation (26). The high expression of immunoglobulin
and CD74 antigen may reflect leukocytic infiltration of the tumors. However, CD74, the invariant (
) chain of the MHC class II antigen, was also highly abundant in SAGE libraries generated from purified epithelial cells; thus, it is likely to be expressed by the cancer cells themselves although the functional relevance of this is unclear. We hypothesized that these "invasive cancer specific" genes may be used for the identification of DCIS tumors with the highest risk for recurrence and progression. As a first step toward testing this hypothesis, we confirmed by immunohistochemical analysis that the expression of DCD is very rarely detected in DCIS (data not shown). However, testing this hypothesis will require the examination of hundreds of DCIS tumors with long-term clinical follow-up data.
Although no study identical to ours has been performed using DNA microarrays, several of the genes that were overexpressed in tumors (like trefoil factor 3, X-box binding protein, collagen type I, fibronectin, etc.) were also found to be up-regulated in breast cancer by other groups (5, 6, 27).
In summary, we have identified several genes that could potentially be used for the molecular differential diagnosis of DCIS and invasive breast cancer and for the classification of DCIS tumors. Determining the clinical usefulness of these genes requires further studies.
| Materials and Methods |
|---|
|
|
|---|
Generation and Analysis of SAGE Libraries
SAGE libraries were generated and analyzed essentially as previously described as part of the National Cancer Institute Cancer Gene Anatomy Project (8, 9, 28, 29). SAGE libraries generated from normal (N1 and N2), two DCIS (D1 and D2), two invasive tumors (I1 and I2), and corresponding lymph node metastases (LN1 and LN2) were previously reported (8). Some of the DCIS tumors were pure DCIS (D3 and D6), while others were derived from patients with concurrent invasive breast carcinomas. Epithelial cells from normal breast tissue (N1 and N2) and tumors (D2, D3, D6, and D7) were purified using BerEP4-coated magnetic beads (Dynal, Oslo, Norway), while other tumors were macroscopically dissected based on adjacent hematoxylin-eosin-stained slides. Approximately 50,000 SAGE tags were obtained from each library. For further analyses, libraries were normalized to the library with the highest tag number (89,541 total tags). Hierarchical clustering was applied to data using the Cluster program developed by Eisen et al. (30). Differentially expressed genes were identified based on statistical analysis of comparisons of groups of normal (two cases), DCIS (eight samples), and invasive or metastatic (nine samples) breast cancer SAGE libraries using the SAGE2000 software (7). Similarly, for the identification of genes specific for DCIS or invasive breast cancer, the eight DCIS samples were treated as a group and the nine invasive or metastatic patients were treated as another group. First we used the SAGE tag numbers highest in two normal libraries (N1 and N2) as the cutoff and evaluated tag numbers in the DCIS and invasive libraries above this "normal" value using a two-sided Fisher's exact test without multiple comparisons (see Table 3). In a second test, ROC curve analysis was used to choose the best cutoff for values. A ROC area of 0.50 is no better than chance and a ROC area of 1.00 is the best possible. The bound 2 in the first row means that values greater than or equal to 2 are counted as predicting DCIS.
Messenger RNA in Situ Hybridization
To generate templates for in vitro transcription reactions, 300- to 500-bp fragments derived from the 3' untranslated region of the selected genes were PCR amplified and subcloned into pZERO 1.0 (Invitrogen, Carlsbad, CA). This pZERO 1.0 vector contains a multiple cloning site surrounded by SP6 and T7 RNA polymerase promoters; therefore, the same plasmid was used for the generation of sense and anti-sense riboprobes for mRNA in situ hybridizations. Digitonin-labeled sense and anti-sense riboprobes were generated and mRNA in situ hybridization was performed as previously described (31). The hybridized sections were observed with a NIKON microscope, images were obtained using a SPOT CCD camera, and processed with Adobe Photoshop. Hybridizations were considered successful if the sense probe gave no significant signal. The intensity and distribution of the hybridization signal were scored (03 for intensity and 03 for distribution using a scoring scheme described below for immunohistochemistry) independently by three investigators, and scores in Fig. 2 reflect a consensus of the three independent summary scores.
Immunohistochemistry
The expression of the indicated genes in primary breast tumors was analyzed by the use of immunohistochemistry to eight tissue microarrays that contained evaluatable paraffin-embedded specimens derived from 80 DCIS, 675 primary invasive breast cancer, and 33 distant metastases. Antigen Retrieval Citra solution (Research Genetics, San Ramon, CA) and boiling in microwave (5 min at high power) was used to enhance staining. Isotype control antiserum was used as negative control. A standard indirect immunoperoxidase protocol with 3,3'-diaminobenzidine as a chromogen was used for the visualization of antibody binding (ABC-Elite; Vector Laboratories, Burlingame, CA). Primary antibodies used were as follows: mouse monoclonal anti-psoriasin antibody (17); affinity-purified rabbit polyclonal anti-CTGF (a generous gift of Dr. D. Brigstock, Childrens' Research Institute, Columbus, OH); affinity-purified rabbit polyclonal anti-TFF3 (a kind gift of Prof. Hoffman, Universitaetsklinikum, Magdeburg, Germany); mouse monoclonal anti-IL-8, GRO-1, and GRO-2 were purchased from R&D Systems (Minneapolis, MN); anti-SPARC antibody was from Hematologic Technologies (Essex Junction, VT); and anti-FASN antibody from Transduction Labs. (San Diego, CA). Antibodies were used at 1:100 dilution in PBS containing 10% heat-inactivated goat serum. Antibody staining was subjectively scored by three investigators independently in a scale of 03 for intensity (0 = no staining, 1 = faint signal, 2 = moderate, and 3 = intense staining) and 03 for extent (0 = no, 1 =
30%, 2 = 3070%, and 3 =
70% positive cells) of staining. Cumulative scores were calculated based on the average scores assigned by the three independent observers. For statistical analyses, a cumulative score at or above 3 was considered positive. Relationships between the expression of genes determined by mRNA in situ hybridization or immunohistochemistry were analyzed by Fisher's exact test without correction for multiple comparisons.
Statistical Analyses of Clinical Correlates
The relationship of gene expression to clinicopathologic parameters and the association between the expression of different genes determined by immunohistochemistry were analyzed by the following statistical methods. The eight separate tissue microarray data sets and one combined data set were analyzed for association of gene positivity and prognostic factors using a logistic regression model (with gene positivity as the outcome), and a forward, or step-up, selection procedure to determine the best fitting model. Clinicopathologic factors analyzed were: expression of the estrogen and progesterone receptors and HER2 by immunohistochemistry, histological grade, TNM stage, tumor size, number of positive lymph nodes, patient age, and overall and distant metastasis free survival. If all patients or no patients with a particular level of a covariate had gene positivity, then the logistic regression did not converge and a significance level was obtained using Fisher's exact test. If, however, there remained some patients with and without gene positivity after deleting patients with the particular level of the covariate, then a step-up logistic regression was done on them. The significance of the variables in the logistic regression models was tested using likelihood ratio tests. The cutoff used for entry into the model was
= 0.05. In addition to the analyses described above, Kaplan-Meier curves were generated and Cox models were run for two data sets that contained survival information. Calculated times to distant failure and times to survival were used and were based on the failure/death and accession dates.
| Acknowledgements |
|---|
|
|
|---|
| Notes |
|---|
|
|
|---|
Received December 2, 2002; revised February 10, 2003; accepted February 13, 2003.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
I. Kijima, J. Ye, C. Glackin, and S. Chen CCAAT/Enhancer Binding Protein {delta} Up-regulates Aromatase Promoters I.3/II in Breast Cancer Epithelial Cells Cancer Res., June 1, 2008; 68(11): 4455 - 4464. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Vincent-Salomon, C. Lucchesi, N. Gruel, V. Raynal, G. Pierron, R. Goudefroye, F. Reyal, F. Radvanyi, R. Salmon, J.-P. Thiery, et al. Integrated Genomic and Transcriptomic Analysis of Ductal Carcinoma In situ of the Breast Clin. Cancer Res., April 1, 2008; 14(7): 1956 - 1965. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. C. Allred, Y. Wu, S. Mao, I. D. Nagtegaal, S. Lee, C. M. Perou, S. K. Mohsin, P. O'Connell, A. Tsimelzon, and D. Medina Ductal Carcinoma In situ and the Emergence of Diversity during Breast Cancer Evolution Clin. Cancer Res., January 15, 2008; 14(2): 370 - 378. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. W. Pearson and T. Hunter Real-time imaging reveals that noninvasive mammary epithelial acini can contain motile cells J. Cell Biol., December 31, 2007; 179(7): 1555 - 1567. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Fabian Tamoxifen or Raloxifene in Postmenopausal Women for Prevention of Breast Cancer: A Tale of Two Choices Counterpoint Cancer Epidemiol. Biomarkers Prev., November 1, 2007; 16(11): 2210 - 2212. [Full Text] [PDF] |
||||
![]() |
S. Esseghir, A. Kennedy, P. Seedhar, A. Nerurkar, R. Poulsom, J. S. Reis-Filho, and C. M. Isacke Identification of NTN4, TRA1, and STC2 as Prognostic Markers in Breast Cancer in a Screen for Signal Sequence Encoding Proteins Clin. Cancer Res., June 1, 2007; 13(11): 3164 - 3173. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. A. Hartwell, B. Muir, F. Reinhardt, A. E. Carpenter, D. C. Sgroi, and R. A. Weinberg The Spemann organizer gene, Goosecoid, promotes tumor metastasis PNAS, December 12, 2006; 103(50): 18969 - 18974. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Dolled-Filhart, L. Ryden, M. Cregger, K. Jirstrom, M. Harigopal, R. L. Camp, and D. L. Rimm Classification of breast cancer using genetic algorithms and tissue microarrays. Clin. Cancer Res., November 1, 2006; 12(21): 6459 - 6468. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Lupu and J. A. Menendez Targeting Fatty Acid Synthase in Breast and Endometrial Cancer: An Alternative to Selective Estrogen Receptor Modulators? Endocrinology, September 1, 2006; 147(9): 4056 - 4066. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Okada, M. Kobayashi, H. Tanaka, T. Kobayashi, H. Tazawa, Y. Iuchi, K. Onuma, M. Hosokawa, M. C. Dinauer, and N. H. Hunt The Role of Nicotinamide Adenine Dinucleotide Phosphate Oxidase-Derived Reactive Oxygen Species in the Acquisition of Metastatic Ability of Tumor Cells Am. J. Pathol., July 1, 2006; 169(1): 294 - 302. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. S. Schuetz, M. Bonin, S. E. Clare, K. Nieselt, K. Sotlar, M. Walter, T. Fehm, E. Solomayer, O. Riess, D. Wallwiener, et al. Progression-Specific Genes Identified by Expression Profiling of Matched Ductal Carcinomas In situ and Invasive Breast Tumors, Combining Laser Capture Microdissection and Oligonucleotide Microarray Analysis. Cancer Res., May 15, 2006; 66(10): 5278 - 5286. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Yao, S. Weremowicz, B. Feng, R. C. Gentleman, J. R. Marks, R. Gelman, C. Brennan, and K. Polyak Combined cDNA array comparative genomic hybridization and serial analysis of gene expression analysis of breast tumor progression. Cancer Res., April 15, 2006; 66(8): 4065 - 4078. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Krop, A. Marz, H. Carlsson, X. Li, N. Bloushtain-Qimron, M. Hu, R. Gelman, M. S. Sabel, S. Schnitt, S. Ramaswamy, et al. A Putative Role for Psoriasin in Breast Tumor Progression Cancer Res., December 15, 2005; 65(24): 11326 - 11334. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Thangaraju, M. Rudelius, B. Bierie, M. Raffeld, S. Sharan, L. Hennighausen, A-M. Huang, and E. Sterneck C/EBP{delta} is a crucial regulator of pro-apoptotic gene expression during mammary gland involution Development, November 1, 2005; 132(21): 4675 - 4685. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Zucchi, E. Mento, V. A. Kuznetsov, M. Scotti, V. Valsecchi, B. Simionati, E. Vicinanza, G. Valle, S. Pilotti, R. Reinbold, et al. Gene expression profiles of epithelial cells microscopically isolated from a breast-invasive ductal carcinoma and a nodal metastasis PNAS, December 28, 2004; 101(52): 18147 - 18152. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Alexander, A. L. Stegner, C. Wagner-Mann, G. C. Du Bois, S. Alexander, and E. R. Sauter Proteomic Analysis to Identify Breast Cancer Biomarkers in Nipple Aspirate Fluid Clin. Cancer Res., November 15, 2004; 10(22): 7500 - 7510. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Hu, H. Sun, J. Drake, F. Kittrell, M. C. Abba, L. Deng, S. Gaddis, A. Sahin, K. Baggerly, D. Medina, et al. From Mice to Humans: Identification of Commonly Deregulated Genes in Mammary Cancer via Comparative SAGE Studies Cancer Res., November 1, 2004; 64(21): 7748 - 7755. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Namba, J. E. Maglione, L. J.T. Young, A. D. Borowsky, R. D. Cardiff, C. L. MacLeod, and J. P. Gregg Molecular Characterization of the Transition to Malignancy in a Genetically Engineered Mouse-Based Model of Ductal Carcinoma In situ Mol. Cancer Res., August 1, 2004; 2(8): 453 - 463. |