JTCS Sign the Guestbook
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to Personal Folders
Right arrow Download to citation manager
Right arrow Author home page(s):
Jonathan D'Cunha
Michael A. Maddaus
Right arrow Permission Requests
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Hoang, C. D.
Right arrow Articles by Maddaus, M. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hoang, C. D.
Right arrow Articles by Maddaus, M. A.
Related Collections
Right arrow Lung - cancer

J Thorac Cardiovasc Surg 2004;127:1332-1342
© 2004 The American Association for Thoracic Surgery


General thoracic surgery

Expression profiling of non–small cell lung carcinoma identifies metastatic genotypes based on lymph node tumor burden

Chuong D. Hoang, MDa, Jonathan D'Cunha, MD, PhDa, Sherif H. Tawfic, MD, PhDb, Angelika C. Gruessner, PhDc, Robert A. Kratzke, MDd, Michael A. Maddaus, MDa,*

a Division of Cardiovascular and Thoracic Surgery, University of Minnesota Medical School, Minneapolis, Minn, USA
b Department of Pathology, University of Minnesota Medical School, Minneapolis, Minn, USA
c Division of Surgical Sciences, Statistics, Department of Surgery, University of Minnesota Medical School, Minneapolis, Minn, USA
d Division of Hematology, Oncology, and Transplant, University of Minnesota Medical School, and the Research Service, Veterans Affairs Medical Center, Minneapolis, Minn, USA

Received for publication May 2, 2003; revisions received October 30, 2003; accepted for publication November 10, 2003.

* Address for reprints: Michael A. Maddaus, MD, Section of General Thoracic Surgery, Division of Cardiovascular and Thoracic Surgery, MMC 207, 420 Delaware Street SE, Minneapolis, MN 55455, USA
madda001{at}tc.umn.edu


    Abstract
 Top
 Abstract
 Materials and methods
 Results
 Discussion
 APPENDIX 1:
 References
 
OBJECTIVE: This study hypothesized that non–small cell lung carcinoma cells from primary tumors isolated by laser capture microdissection would exhibit gene expression profiles associated with graded lymph node metastatic cell burden.

METHODS: Non–small cell lung carcinoma tumors (n = 15) were classified on the basis of nodal metastatic cell burden by 2 methods, obtaining 3 groups: no metastasis, micrometastasis, and overt metastasis. We then performed microarray analysis on microdissected primary tumor cells and identified gene expression profiles associated with graded nodal tumor burden using a correlation-based selection algorithm coupled with cross-validation analysis. Hierarchical clustering showed the regrouping of tumor specimens; the classification inference was assessed with Fisher’s exact test. We verified data for certain genes by using another independent assay.

RESULTS: The 15 specimens clustered into 3 groups: cluster A predominated in specimens with overt nodal metastasis; cluster B had more specimens with nodal micrometastases; and cluster C included only specimens without nodal metastases. Cluster assignment was based on a validated 75-gene discriminatory subset. Notably, genes not previously associated with positive non–small cell lung carcinoma lymph node status were encountered in the profiling analysis.

CONCLUSIONS: Microdissection, combined with microarray analysis, is a potentially powerful method to characterize the molecular profile of tumor cells. The 75-gene expression profiles representative of clusters A and B may define genotypes prone to metastasize. Overall, the 3 groups of tumor specimens clustered separately, suggesting that this approach may identify graded metastatic propensity. Further, genes singled out in clustering may yield insights into underlying metastatic mechanisms and may represent new therapeutic targets.



Hoang, Dudek, Kratzke, Maddaus, Dahlberg, D'Cunha (left to right).


Lung cancer remains the most common cause of cancer-related mortality in North America, accounting for 30% of all cancer deaths, exceeding the next 4 cancers (breast, colon, prostate, and pancreas) combined.1 A critical, independent factor affecting the negative prognosis of patients with non–small cell lung carcinoma (NSCLC) is lymph node metastasis, a process with poorly defined molecular mechanisms. Nevertheless, the 5-year survival rate for patients with stage I lymph node–negative NSCLC is only 39% to 67%.2 The relatively high mortality rate of patients with stage I NSCLC is at least in part attributable to metastasis, often occult at the time of initial diagnosis. Occult metastatic tumor cells or micrometastases, routinely missed on histopathologic analysis, require immunohistochemical analysis or polymerase chain reaction (PCR)–based assays for detection. One study showed that the presence of such micrometastases in regional lymph nodes of patients with NSCLC was an independent risk factor associated with a reduced disease-free interval.3 In other solid tumor types, lymph node micrometastases correlated with primary tumor depth of penetration, lymphatic vessel invasion, microscopic venous invasion, and histologic stage.4,5 Taken together, this increasing body of evidence suggests that the presence of micrometastases in NSCLC lymph nodes is associated with a more biologically aggressive primary tumor.3,6 Currently, the basis of this important association is not well understood.

One investigative approach would be to identify molecular characteristics from the primary resected tumor that could predict the degree of lymph node micrometastatic cell burden. Doing so would be advantageous, potentially obviating the cumbersome logistics of lymph node sampling and of micrometastases detection. Microarray gene expression profiling is uniquely suited for molecular characterization: large numbers of genes can be simultaneously investigated, and the resulting gene expression patterns can then be correlated with certain clinical parameters.7 Using microarray technology to analyze lung carcinoma tissue, researchers have identified subgroups of tumors differentiated according to tumor type and histologic subclasses.8 Beer and colleagues9 extended this body of work by demonstrating that microarray gene expression profiles derived from tumor tissue could predict overall survival in patients with early-stage lung adenocarcinomas. Complementary to this ability is the need to identify patients at high risk for recurrent or metastatic disease. Recently, Kikuchi and coworkers10 reported on the feasibility of using gene expression profiling to predict gross, pathologic lymph node status in a subgroup of patients with NSCLC adenocarcinomas.

Because the cellular heterogeneity of whole tumor specimens may confound microarray analysis, we employed laser capture microdissection (LCM)11 to isolate pure cell populations for our molecular studies. Others have concluded that the gene expression signature derived from pure populations of tumor cells yields more precise, biologically meaningful information on specimens of interest.10,12 Thus, we hypothesized that pure populations of NSCLC cells would exhibit a gene expression pattern associated with the presence of not only overt metastatic cells but also, in particular, micrometastases in lymph nodes. This specific interest led us to select tumors with nodal micrometastases to be contrasted against 2 extremes: tumors without evidence (pathologic and molecular) of nodal involvement versus advanced-stage tumors. We isolated tumor cells from these 3 groups of tumors with graded lymph node metastatic cell burdens for subsequent gene expression profiling. In addition to analyzing the resulting gene expression patterns, we also investigated individual component genes for potential insights into novel metastatic mechanisms.


    Materials and methods
 Top
 Abstract
 Materials and methods
 Results
 Discussion
 APPENDIX 1:
 References
 
Patients and tumor specimens
All tumors and lymph nodes used in this study were obtained from a tumor tissue bank at our institution. Clinical outcome results from specimens used in this study were known only after all microarray experiments were performed and analyzed. Surgically resected specimens were collected under approval from the Institutional Review Boards at the University of Minnesota and the Minneapolis Veterans Affairs Medical Center. Patients gave written informed consent to have their tissues stored in a tumor tissue bank beforehand. None of the patients had a history of lung cancer or a concurrent malignancy, nor had they been exposed to any chemoradiotherapy. All lymph node specimens were bivalved: half were reserved for routine histopathologic analysis, and half were immediately snap-frozen in liquid nitrogen. Histopathologic testing verified that all surgical tumor specimens contained malignant cells. From stage I patients, only lymph nodes confirmed as pathologically negative (N0) were used. Some patients were upstaged to stage IIIa after histopathologic testing; they were also identified for our study. Based on previous protocols,10,13 we used pooled, noncancerous, male adult human lung total RNA isolated and purified from tissue (Stratagene, La Jolla, Calif) as a reference "normal" sample for each microarray experiment.

Determination of lymph node micrometastatic cell burden
Our group previously described in detail a carcinoembryonic (CEA)-based real-time reverse transcriptase-PCR (QRT-PCR) assay to quantitatively estimate lymph node micrometastatic cell burden.14 A lymph node station was considered positive by QRT-PCR for CEA if it had >1000 micrometastatic cells (sensitivity of the assay). Hematoxylin and eosin (H&E) status of lymph nodes was determined by surgical pathologic analysis according to standard criteria. With each method, we identified results as positive (+) or negative (–). Patients with 5-year follow-up were identified from our tissue bank, and their lymph nodes were tested until we reached the following tumor specimen numbers: 5 H&E–/(QRT-PCR)–; 5 H&E–/(QRT-PCR)+ (micrometastases); and 5 H&E+.

Laser capture microdissection and RNA processing
Tumor specimen sections were prepared by standard pathologic techniques, then stained with H&E just before microdissection. The PixCell II LCM System (Arcturus Engineering, Mountain View, Calif) was used according to the manufacturer's protocols. A board-certified surgical pathologist (S.H.T.) assisted in the differentiation of malignant cells. Areas of tumor necrosis were identified and excluded from harvesting. In all, 3000 to 5000 cells were laser captured for experiments. Total RNA was extracted from LCM tumor samples using the PicoPure RNA isolation kit (Arcturus Engineering), as outlined by the manufacturer. Adequate RNA starting material was generated using a T7 RNA polymerase-catalyzed linear amplification method.15 The RiboAmp RNA amplification kit (Arcturus Engineering) was used in 2 successive rounds on all LCM-isolated RNA samples, as outlined by the manufacturer. RNA purity was determined by spectrophotometry, and integrity was verified on ethidium bromide-stained 2% agarose-formaldehyde gels. To avoid any potential cDNA labeling bias in the microarray step, we also amplified the reference nonmalignant lung total RNA in an identical manner.

cDNA microarrays
Microarray experiments were performed, with minor modifications, using the MicroMax Human 2400-gene cDNA Microarray System 1.1-TSA (Perkin Elmer LifeSciences, Boston, Mass). Briefly, for every microarray, 1 µg of amplified RNA from each LCM tumor sample and the nonmalignant lung reference sample was reverse-transcribed to target cDNA by adding 280 pmol random hexamers (Promega, Madison, Wis) to the synthesis reaction. We calculated the mean coefficient of variation (CV) among microarray experiments (n = 3) at <31%, suggesting adequate reproducibility (and consistent with the manufacturer's parameters).

Microarray analysis
Each gene was represented on a microarray in duplicate, so that a corresponding raw expression ratio was defined as the mean signal intensity of tumor sample:reference sample replicates (±2 SD). The raw expression ratios for each gene were median-centered normalized16 to facilitate comparison of relative gene expression levels. To identify a set of genes that could distinguish between tumor samples on the basis of lymph node tumor burden, we applied the separate groups, K-ordered groups filtering algorithm (Expressionist v3.1, GeneData, San Francisco, Calif) to our entire data set. This algorithm consists of combining rank-ordering tests with a Pearson correlation coefficient to identify certain genes with expression ratios that are consistent within and distinct between defined groups. We selected a Pearson coefficient of 0.9 and specified 3 groups according to our criteria for lymph node status. This process identified a subset of genes that could be used to classify the tumor cell samples into 3 separate groups of nodal tumor burden. To determine the accuracy of our selection algorithm in finding a discriminatory gene subset, we performed a leave-one-out cross-validation analysis9 and calculated the error rate. After validating the gene subset, we used it in 2-dimensional hierarchical cluster analysis17 to visualize the subsequent clustering of tumor samples.

In another approach, we also selected genes that were greatly overexpressed or underexpressed by tumor cells in comparison with nonmalignant lung tissue by using a published filtering algorithm,13 with modifications. Briefly, the 50th percentile of all measurements per tumor sample was used as a positive control for each gene; the signal intensity of every gene was divided by this synthetic positive control. The bottom 10th percentile of measurements was used as a test for background subtraction. The measurement for each gene was then divided by the corresponding signal intensity of the nonmalignant lung sample. The threshold value used to define significant relative expression changes was set at 3.0 for overexpression and 0.30 for underexpression on the basis of the experimental variability in our data (CV < 31%) and of the manufacturer's established performance criteria. Data filtering with this algorithm identified genes overexpressed by at least threefold and underexpressed by at least 30% across all specimens and within subgroups.

QRT-PCR
To examine the reliability of our microarray data, we selected 3 overexpressed genes (pescadillo, Brk, OB-cadherin) and determined their relative expression in similar samples used for microarray analysis by QRT-PCR, according to our previous protocol.14 For each gene, as well as for an endogenous control gene ß-glucuronidase (GUSB), we designed TaqMan probe and primer sets from the open reading frame by using Primer Express software v2.0 (Applied Biosystems, Foster City, Calif). Details of primer-probe sequences are included in Appendix 1. PCR products of the expected size for each gene were eluted from ethidium bromide-stained 2% agarose gels; their identities were confirmed by sequence analysis.

Relative quantitation of gene transcripts was calculated by the comparative threshold cycle method.18 GUSB was used as an endogenous control for the normalization of sample loading because levels are consistent across diseased and nondiseased lung tissues.19 The calibrator was chosen to be the same reference sample (nonmalignant lung) used for microarrays to facilitate comparison between the different techniques. Relative, normalized quantitative gene expression levels were calculated for the 3 genes of interest among tumor specimens.

Statistics
Microarray analysis was performed on Expressionist Suite v3.1 software (GeneData). The separate groups, K-ordered groups test proceeded in 4 steps: transforming expression ratios to ranks, rank-ordering within defined groups, rank-ordering between groups, and calculating a Pearson correlation statistic. The leave-one-out cross-validation analysis determined the robustness of the classification method. A sample was withheld from the total number of tumor samples; the remaining samples were used to select a set of discriminatory genes, and class prediction was made on the withheld sample. This process was repeated for every sample so that a cumulative error rate could be calculated. Fisher’s exact test was used to assess whether or not cluster membership was associated with physical and molecular characteristics of the tumor samples.

Additional analysis was performed on S-Plus 6.1 software (Insightful, Seattle, Wash). Kaplan-Meier curves were used to estimate cause-specific patient survival (time in months from surgery until death), as well as recurrence rates. The log-rank test was used to assess differences among clusters. Differences of gene expression results between microarray and QRT-PCR techniques were analyzed according to previously described methods.20 Briefly, the distribution of log-transformed expression ratios (tumor specimen:reference sample) for genes of interest was determined.


    Results
 Top
 Abstract
 Materials and methods
 Results
 Discussion
 APPENDIX 1:
 References
 
Tumor specimens
Characteristics of tumor specimens included in our analysis are detailed in Table 1. In all, 10 NSCLC tumor specimens were pathologic stage I (7 T1 N0 and 3 T2 N0) and 5 were stage IIIa (2 T1 N2 and 3 T2 N2). By histologic examination, 12 specimens were adenocarcinomas; 3 were squamous cell carcinomas. We considered that this mixture of histologic types would aid in discovering gene profiles representative of NSCLC nodal metastasis overall because it is unknown whether or not different NSCLC types metastasize via unique mechanisms (and genes).


View this table:
[in this window]
[in a new window]
 
TABLE 1. Molecular and pathologic characteristics of NSCLC tumors

 


View larger version (46K):
[in this window]
[in a new window]
 
Figure 1. Hierarchical cluster analysis. A, Two-dimensional hierarchical clustering of primary tumors per a 75-gene subset with differential expression across tumor specimen groups. Genes lie vertically and the GenBank accession numbers for all 75 genes are shown; specimens lie horizontally. Colors are mean, relative expression signal intensities for each gene. B, An enlarged view of the tumor specimen dendrogram from (A) along with a summary table of tumor specimen lymph node status.

 
We tested lymph nodes (4 to 7 nodes per tumor; 1 lymph node per station) from the 10 stage I specimens to quantitatively estimate micrometastatic cell burden. A total of 53 lymph nodes were analyzed, 14 (26%) of which were positive for CEA mRNA per our previous criteria (see Methods). These 14 lymph nodes were distributed among 5 of 10 stage I tumor specimens: 2 positive lymph nodes each in specimens P8-9, 3 positive lymph nodes each in specimens P1 and P10, and 4 positive lymph nodes in specimen P2. An estimate of lymph node micrometastatic cell burden within this 5-tumor specimen group averaged 4.0 x 104 ± 2.3 x 104 (SD) CEA-positive cells per lymph node station. This process subdivided the stage I tumor specimens into 2 groups: 5 H&E–/(QRT-PCR)– and 5 H&E–/(QRT-PCR)+. To ensure that these QRT-PCR results were valid, we tested both the primary tumors (n = 15) and H&E+ lymph nodes (n = 9) from the stage III tumor specimens. Correspondingly high CEA-positive cell numbers were observed in 12 (80%) primary tumors and in 8 (89%) lymph nodes.

Hierarchical cluster analysis
On average, 300 to 500 genes were excluded from downstream analysis, per the previously discussed criteria, after data normalization across these microarrays. The separate groups, K-ordered groups test was performed with an average of 2000 remaining genes from each microarray experiment per the 3 tumor groups defined above. This analysis identified a subset of 75 genes with significant differential expression patterns between the 3 defined tumor groups (Appendix 2). In other words, these genes displayed marked expression value changes from 1 tumor group to another, thereby facilitating tumor class discrimination. We coupled this gene-filtering method with a cross-validation analysis, calculating an error rate of 7%. The members of this 75-gene subset were diverse and encoded for molecules with various functions such as signal transduction, cellular structural components, transcription factors, cellular growth, adhesion, and trafficking.


View this table:
[in this window]
[in a new window]
 
APPENDIX 2.

Selected genes from the discriminatory 75-gene subset

 
We employed this validated 75-gene subset in 2-dimensional hierarchical clustering (Figure 1, A). We could not identify a clear pattern of segregation in the clustering when we examined each tumor by T status and histologic features (P > .05) as expected (Table 1). However, cluster analysis discriminated between tumor specimens according to graded lymph node metastatic cell burden. Three distinct tumor clusters were identified: cluster A predominated in tumor specimens with H&E+ lymph nodes; cluster B had more specimens with lymph node micrometastases; and cluster C had only specimens without any lymph node metastases (Figure 1, B). A significant association between cluster membership and the 3 groups of lymph node tumor burden was not discernible (P = .13), but this result was not entirely unexpected given the small sample size. Nevertheless, the P value trended in the appropriate direction toward statistical significance, warranting the addition of more study specimens. The number of tumor specimens with any degree of nodal metastasis decreased in a graded fashion across clusters A to C (Figure 1, B). The extent of lymph node metastasis also varied among clusters, as expected. Cluster A had a total of 14 tumor-positive lymph nodes [7 from stage III tumor specimens and 7 from H&E–/(QRT-PCR)+ specimens]; cluster B, an intermediate number: a total of 9 tumor-positive lymph nodes [2 from a stage III specimen and 7 from H&E–/(QRT-PCR)+ specimens]. Cluster C was devoid of any nodal tumor involvement.

Clinical data
Cause-specific 5-year survival (top) and NSCLC recurrence (bottom) data for the 15 patients are detailed in Table 2. The number of deaths secondary to NSCLC increased from cluster C (no deaths) to cluster A (3 deaths). The number of patients with NSCLC recurrence also increased from cluster C to cluster A, whereas the median time to recurrence decreased by more than half from cluster B to cluster A. Kaplan-Meier curves reflected these trends (data not shown), but P values did not reach statistical significance, as expected with this small sample size.


View this table:
[in this window]
[in a new window]
 
TABLE 2. Clinical data

 
Differentially expressed NSCLC genes
Genes with significant overexpression and underexpression in common across tumor specimen groups were identified by systematic sorting. We found 109 genes in common to all specimens with greater than threefold overexpression relative to the reference, and conversely 45 genes with <30% underexpression. Specimen subgroup analysis identified genes as follows: H&E–/(QRT-PCR)–, 157 over- and 242 underexpressed; H&E–/(QRT-PCR)+, 190 and 280; H&E+, 150 and 200.

Multiple genes not commonly associated with NSCLC were identified (Appendixes 2 and 3). The CL100 gene was significantly underexpressed by an average of 0.087. Both VAV2 (14.5-fold) and Brk (7.8-fold) were overexpressed in the specimen group with nodal micrometastases and conversely had low expression levels in the other 2 specimen groups. As the focus of our study was to identify gene expression patterns associated with metastatic potential, subgroup analysis of significantly altered genes in specimens with nodal micrometastases may be revealing (Appendix 3). Relative overexpression in specimens of OB-cadherin mRNA was 43.8-fold; of pescadillo mRNA, 39.5-fold. Concurrently, nidogen transcripts were underexpressed at 0.121 in specimens with nodal micrometastases.


View this table:
[in this window]
[in a new window]
 
APPENDIX 3.

Combined list of selected overexpressed or underexpressed genes and nodal micrometastasis-associated genes

 
Comparison of our results to previous NSCLC microarray profiling studies showed qualitative similarity in certain genes. Notable overexpressed or underexpressed genes in common with the study by Heighway and colleagues21 included osteopontin, insulin-like growth factor binding protein 2, gastrin-releasing peptide, and CL100; in common with the study by Beer and coworkers,9 aldolase A, cathepsin, and protein tyrosine phosphatase; and in common with the study by Bhattacharjee and colleagues,8 high-mobility group protein and bullous pemphigoid antigen. General agreement between different microarray studies using independent lung carcinoma samples suggests that many of these genes are likely relevant to NSCLC.

Verification of microarray data
Using QRT-PCR, we validated microarray expression data for 3 overexpressed genes (Brk, pescadillo, and OB-cadherin). Overall, results for each gene were broadly consistent between the 2 different techniques; for each gene tested, the QRT-PCR–derived median expression ratio changed in the same direction of differential expression as previously identified by the microarray analysis. These data confirmed the reliability of our microarray strategy in identifying differentially expressed genes from tumor specimens with varying nodal metastatic cell burden.


    Discussion
 Top
 Abstract
 Materials and methods
 Results
 Discussion
 APPENDIX 1:
 References
 
Clustering algorithms have revealed subtle, specific patterns of gene expression useful for molecular classification of various cancers.8 Our study demonstrated the feasibility of using expression profiling to differentiate for the presence of lymph node micrometastases within stage I tumors. Our work extends that of Kikuchi and colleagues10: we examined the molecular profile of tumors with nodal micrometastases as compared with advanced stage III tumors. The Kikuchi study focused on microarray identification of overt nodal status in a subset of adenocarcinomas, not reporting the specific clinicopathologic details of the patient group. Thus, in their study, identifying expression profiles specific to early-stage NSCLC tumors (which are associated with a relatively poor patient survival rate and high recurrence rate) was difficult. Gene profiles of stage I NSCLC tumors associated with nodal micrometastatic cell burden may identify a high-risk subgroup prone to develop overt nodal metastases that could benefit from more aggressive initial therapy.

In our study, hierarchical clustering segregated LCM NSCLC specimens by graded lymph node metastatic burden (per CEA QRT-PCR), according to a 75-gene subset. The association, in the literature, of many of these subset genes with the pathogenesis of various epithelial-derived cancers lends indirect support for the validity of our 75-gene selection process. For example, VAV2, a signal transduction gene, is associated with ovarian tumor progression.22 And Brk, a signaling transduction nonreceptor protein tyrosine kinase, is capable of potentiating the mitogenic response to epidermal growth factor in mammary epithelial cells23 and is overexpressed only in metastatic melanoma subtypes.24 Both VAV2 and Brk were overexpressed in our group of tumor specimens with nodal micrometastases; thus, they may be potential tumor markers of early metastases.

In general, the 75-gene signature of cluster A may define a poor prognosis expression profile as cluster A contained mostly stage III tumor specimens. Also, we noted 1 H&E–/(QRT-PCR)– tumor specimen (P3) that grouped into cluster A because it displayed an expression profile more similar to advanced-stage tumors. This observation proved to be biologically relevant: records indicated that patient P3 suffered a relapse at the initial resection site 7 months postoperatively, with a total survival time of only 11 months. Conversely, the gene signature of cluster C may define a subgroup of stage I lymph node–negative [H&E–/(QRT-PCR)–] patients with a favorable prognosis. Further, it may be significant that tumor specimens with lymph node micrometastases were distributed between clusters A and B, possibly consistent with the presence of novel micrometastatic molecular subgroups. Distinct micrometastatic subgroups may have significant implications for intermediate survival and cancer recurrence. Again, in our study, clinical data supported our conclusions. Of patients with nodal micrometastases, 2 of 3 (P8 and P10) in cluster B remain alive and disease-free; however, 1 of 2 (P2) in cluster A suffered a distant NSCLC relapse and survived only 15 months postoperatively.

We also identified significantly altered genes with potential functional links to NSCLC tumorigenesis (Appendixes 2 and 3). CL100, a homolog of mitogen-activated protein kinase phosphatase 1, functions as a protein tyrosine phosphatase negatively regulating tyrosine kinase action.25 Loss of this gene, as displayed in all 15 of our specimens, may perpetuate tyrosine kinase–driven tumor growth, as evidenced in ovarian carcinoma cell lines.25 Among tumor specimens with nodal micrometastases, we identified a 3-gene set suggesting unrecognized mechanisms involved in NSCLC metastasis. OB-cadherin has been associated with more aggressive variants of breast or gastric cancers in which the gene product mediates formation of adherens junction complexes, enhancing the invasive ability of cancer cells.26 Pescadillo has been shown to be overexpressed in malignant astrocytes, colon, and breast carcinoma cell lines; it regulates the cell cycle in a manner permissive for tumor progression.27 Both of those genes were markedly overexpressed in this tumor subgroup; presumably, these have similar function. Interestingly, nidogen, a basement membrane-stabilizing molecule,28 was significantly underexpressed in this subgroup of tumors with nodal micrometastases. Intuitively, we believe that basement membrane instability would be a prerequisite for metastasis. In a comparison with our other 2 tumor subgroups, we found a similar gene expression profile of these 3 genes in stage III tumors but not in stage I tumors with H&E–/(QRT-PCR)– lymph nodes. This finding suggests that some early-stage primary tumors with associated lymph node metastases may behave aggressively because they share a gene profile similar to advanced-stage primary tumors.

Admittedly, the sample sizes of our tumor specimen groups were too small to make broad, definitive generalizations on the lymph node status of NSCLC tumors based on the hierarchical cluster data. With a larger number of specimens, we would anticipate a more distinct and statistically significant delineation of cluster assignment correlating with quantitative lymph node micrometastatic cell burden. Nonetheless, our preliminary results support the concept of investigating this approach further for large-scale profiling of primary tumor cells to identify critical gene expression patterns.

Another limitation of our study was the use of different types of starting RNA source material: LCM tumor cells compared against pooled whole-lung tissue. Despite the potential to introduce experimental bias, Kikuchi and coworkers10 used the identical approach without obvious problems. Additionally, testing the predictive ability of the 75-gene subset on additional specimens would be an important next step toward prospectively classifying tumors by degree of lymph node metastatic cell burden. Currently, it is unknown whether the gene expression profiles defining tumor specimens in clusters A, B, and C have any prognostic impact, but our limited findings suggest differential survival among cluster groups.

In summary, LCM of primary tumors combined with microarray analysis identified a 75-gene expression profile that may identify early-stage NSCLC primary tumors that have increased metastatic propensity. Our approach differentiated between tumor specimens by graded levels of lymph node metastatic burden, identifying specimens with micrometastases separately from those with overt metastases. The individual genes defined by hierarchical clustering may suggest contributing pathogenic mechanisms in NSCLC and may represent new, early therapeutic targets. In future studies, it will be critical to test the ability of our 75-gene subset to predict nodal micrometastases to identify a subgroup of patients who may benefit from additional therapy. To refine this concept, it will also be important to correlate the primary tumor molecular signature with differing quantitative estimates of micrometastatic cell lymph node burden, possibly revealing molecular subgroups of tumors with micrometastases.


    APPENDIX 1:
 Top
 Abstract
 Materials and methods
 Results
 Discussion
 APPENDIX 1:
 References
 
Sequences of TaqMan primers and probes
Brk forward primer 5-CCATGACCACAATATCCCCTACA-3, reverse 5-ACGTCGGATTTGGTGGAGTAA-3, and probe 5-TGGACGGCCCCTGAAGCGC-3; OB-cadherin forward primer 5-AAGACATCAAACCTGAGTATCAGTACATG-3, reverse 5-CATCGACATCCACGCTGTTG-3, and probe 5-TAGACCTGGGCTCCGGCCAGC-3; pescadillo forward primer 5-CCATTCATCCTGGCTTTCCA-3, reverse 5-CCTACCCTCACCCCATCACA-3, and probe 5-CACAGTTGGACCCGTGATTCTCAGGG-3; and GUSB forward primer 5-CTCGGCAGAGACAACCAAAAA-3, reverse 5-GGTTTCATTGGCAATCTTCCA-3, and probe 5-TGCAGCGTTCCTTTTGCGAGAGAGAT-3

Discussion
Dr Bryan Meyers (St Louis, Mo). Those are very elegant experiments and certainly a good step in the right direction with your preliminary work. I would ask you to discuss a little bit about the findings between 2 different clusters, cluster A being the tumors that were T2 and cluster B, the tumors that were T1. One of the challenges with a small number of tumors like this, particularly when they are heterogenous, is that you are drawing some conclusions based on your molecular findings, but there are some other macroscopic findings that are also very associated with the tendency toward lymph node metastases.

Dr Hoang. Thank you for your comments. For clarification, cluster A was not comprised solely of T1 tumors nor was cluster B comprised solely of T2 tumors. The hierarchical clustering was based on a subset of genes that differentiated between tumor specimens by graded lymph node tumor burden. The resulting segregation of specimens into separate groups was therefore determined by nodal status. To ensure that this was the case, we examined the distribution of specimens according to other physical and molecular characteristics, including T status. There were no statistically significant associations, except for nodal status.

Dr Thomas A. D'Amico (Durham, NC). Dr Hoang, that was a wonderful presentation. I admire your work, and you and your colleagues should be congratulated. I wonder if you could discuss why you chose the presence of lymph node micrometastasis as the parameter to distinguish prognostic potential among stage I patients. There are other molecular markers, antigens, proteins, or gene mutations that would have given you a wider scope of tumors to choose from to use this 75-gene discriminant analysis to look at subgroups. Limiting yourself to whether or not a micrometastasis is present maybe wouldn't give you the same latitude. Could you discuss why you chose those subgroups?

Dr Hoang. Thank you for your very insightful question. A main focus of our lab is centered on mechanisms of nodal metastasis in NSCLC. Because micrometastases represent an early form of tumor dissemination, a better understanding of their biology may lead to new insights into underlying mechanisms. This specific interest led us to select tumors with nodal micrometastases for comparison against a true negative, that is, no nodal metastases (determined by H&E and by our CEA QRT-PCR assay) versus a true positive, that is, overt nodal metastasis (advanced-stage tumors).

Dr Larry R. Kaiser (Philadelphia, Pa). Your 2400-gene array is a fairly small array. Was this somehow selected prior to using it?

Dr Hoang. This 2400-gene cDNA microarray was a commercial resource readily available to our laboratory. The 2400 genes spotted on the microarray represent a broad cross section of gene functional categories (eg, transcription factors, cell regulatory factors, cell signaling, cell adhesion, and so on), making it well suited for the broad gene profiling approach we used. Thus, there was no predetermined selection process.

Dr Kaiser. And most genomics data is not particularly useful unless it is validated externally. Did you attempt to validate this externally, for instance, with the data from Beer or from Bhattacharjee, which was the paper from the Proceedings of the National Academy of Sciences?

Dr Hoang. That is an excellent question and important point you raise about microarray studies in general. With respect to your specific question about direct comparison of our data to the public lung cancer databases at Michigan (Beer and colleagues) and Harvard (Bhattacharjee and colleagues), we have not done that. There are significant technical reasons for not doing so that encompass differences in: (1) microarray platform (oligonucleotide vs cDNA), in number of overlapping genes present on each array, (2) specific types of samples being compared (whole tissue vs cells), (3) the reference sample (pooled tissue, pooled cell lines, single samples, tumor vs normal in same patient, etc), (4) normalization protocol, and (5) lack of consensus on a uniform approach to compare data across different studies. Thus, we are not sure of what approach would be taken or how valid it would be to attempt such a direct comparison. A crude method could consist of qualitative comparisons of genes reported as overexpressed or underexpressed.

Dr Kaiser. Another thing is that the statistical validation of any of these is particularly important, and we didn't hear much in the way of any statistical validation of any of these results. For instance, you had osteopontin as 90-fold up-regulated. Was that just due to 1 sample, was it a low expressor, what was the P value? There was some of the data that was a little bit hard to believe in terms of how much some of the gene expression was up-regulated.

Dr Hoang. Again, you bring up another important point about microarray data analysis. This is a rapidly evolving field with a multitude of statistical approaches, all of which have certain advantages and disadvantages. Currently there is no consensus on what the best approach is or if there should be a uniform approach taken. The K groups analysis we used refers to a family of data-filtering algorithms (part of our array analysis software); the specific test we performed to identify our discriminatory 75-gene subset is the separate groups, K groups test. This selection algorithm relies on a combination of rank-ordering tests with a Pearson correlation coefficient. The end result of applying this algorithm is the identification of genes with maximal variation between defined groups (3 in this case) that could be used to classify samples.

Depending on a number of technical factors (such as microarray platform, normalization algorithm, and type of comparison between experimental vs reference samples), a relatively large range of expression values could result. With respect to osteopontin, all 15 of our tumor specimens showed elevated expression with a range of about 6-fold to 260-fold. We did not report a P value because our gene selection process relied on a correlation-based statistic.

Dr Kaiser. I think it is safe to say it is very interesting, but without some external validation, we have to be very careful about looking at genomics data.

Dr Hoang. We agree with your cautionary stance, overall. To address this concern, we used real-time PCR (the most common validation technique in current usage) to externally validate our microarray data for certain genes of interest. Overall, the 2 independent methods showed qualitative agreement.


    Acknowledgments
 
We are grateful to Suzanne Grindle (University of Minnesota Cancer Center, Bioinformatics Core) for technical assistance with microarray analysis software, as well as to Debra Herzan for timely and accurate assistance with clinical and pathologic data reported herein. We also thank Dr Mary Knatterud for expert editorial assistance.


    Footnotes
 
C.D.H. is supported by grants from the Lillehei Heart Institute, the Veterans of Foreign Wars/Ladies Auxiliary Cancer Research Center Endowment Fund, the Cancer and Leukemia Group B Foundation Clinical Research Award and Fellowship, and a training program grant from the National Institutes of Health (T32HL07062) awarded to the Division of Hematology, Oncology, and Transplant, Univeristy of Minnesota.


    References
 Top
 Abstract
 Materials and methods
 Results
 Discussion
 APPENDIX 1:
 References
 

  1. Jemal A, Murray T, Samuels A, Ghafoor A, Ward E, Thun MJ. Cancer statistics, 2003. CA Cancer J Clin. 2003;53:5–26[Abstract/Free Full Text]
  2. Lau CL, Harpole DH Jr. Lung neoplasms. Norton JA. Surgery, basic science and clinical evidence. New York: Springer; 2000. p. 1189–1216
  3. Maruyama R, Sugio K, Mitsudomi T, Saitoh G, Ishida T, Sugimachi K. Relationship between early recurrence and micrometastases in the lymph nodes of patients with stage I non–small cell lung cancer. J Thorac Cardiovasc Surg. 1997;114:535–543[Abstract/Free Full Text]
  4. Greenson JK, Isenhart CE, Rice R, Mojzisik C, Houchens D, Martin EW Jr. Identification of occult micrometastases in pericolic lymph nodes of Duke's B colorectal cancer patients using monoclonal antibodies against cytokeratin and CC49. Correlation with long-term survival. Cancer. 1994;73:563–569[Medline]
  5. Ishida K, Katsuyama T, Sugiyama A, Kawasaki S. Immunohistochemical evaluation of lymph node micrometastases from gastric carcinomas. Cancer. 1997;79:1069–1076[Medline]
  6. Pantel K, Cote RJ, Fodstad O. Detection and clinical importance of micrometastatic disease. J Natl Cancer Inst. 1999;91:1113–1124[Abstract/Free Full Text]
  7. Liotta L, Petricoin E. Molecular profiling of human cancer. Nat Rev Genet. 2000;1:49–56
  8. Bhattacharjee A, Richards WG, Staunton J, Li C, Monti S, Vasa P, et al. Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses. Proc Natl Acad Sci U S A. 2001;98:13790–13795[Abstract/Free Full Text]
  9. Beer DG, Kardia SLR, Huang CC, Giordano TJ, Levin AM, Misek DE, et al. Gene-expression profiles predict survival of patients with lung adenocarcinoma. Nat Med. 2002;8:816–824[Medline]
  10. Kikuchi T, Daigo Y, Katagiri T, Tsunoda T, Okada K, Kakiuchi S, et al. Expression profiles of non-small cell lung cancers on cDNA microarrays: identification of genes for prediction of lymph-node metastasis and sensitivity to anti-cancer drugs. Oncogene. 2003;22:2192–2205[Medline]
  11. Bonner RF, Emmert-Buck M, Cole K, Pohida T, Chuaqui R, Goldstein S, et al. Laser capture microdissection: molecular analysis of tissue. Science. 1997;278:1481–1483[Free Full Text]
  12. Sugiyama Y, Sugiyama K, Hirai Y, Akiyama F, Hasumi K. Microdissection is essential for gene expression profiling of clinically resected cancer tissues. Am J Clin Pathol. 2002;117:109–116[Medline]
  13. Sugita M, Geraci M, Gao B, Powell RL, Hirsch FR, Johnson G, et al. Combined use of oligonucleotide and tissue microarrays identifies cancer/testis antigens as biomarkers in lung carcinoma. Cancer Res. 2002;62:3971–3979[Abstract/Free Full Text]
  14. D'Cunha J, Corfits AL, Herndon JE 2nd, Kern JA, Kohman LJ, Patterson GA, et al. Molecular staging of lung cancer: real-time polymerase chain reaction estimation of lymph node micrometastatic tumor cell burden in stage I non–small cell lung cancer: preliminary results of Cancer and Leukemia Group B Trial 9761. J Thorac Cardiovasc Surg. 2002;123:484–491 (discussion 491)[Abstract/Free Full Text]
  15. Van Gelder RN, von Zastrow ME, Yool A, Dement WC, Barchas JD, Eberwine JH. Amplified RNA synthesized from limited quantities of heterogeneous cDNA. Proc Natl Acad Sci U S A. 1990;87:1663–1667[Abstract/Free Full Text]
  16. Quackenbush J. Computational analysis of microarray data. Nat Rev Genet. 2001;2:418–427[Medline]
  17. Eisen MB, Spellman PT, Brown PO, Botstein D. Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci U S A. 1998;95:14863–14868[Abstract/Free Full Text]
  18. Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods. 2001;25:402–408[Medline]
  19. Gerard CJ, Andrejka LM, Macina RA. Mitochondrial ATP synthase 6 as an endogenous control in the quantitative RT-PCR analysis of clinical cancer samples. Mol Diagn. 2000;5:39–46[Medline]
  20. Lin YM, Furukawa Y, Tsunoda T, Yue CT, Yang KC, Nakamura Y. Molecular diagnosis of colorectal tumors by expression profiles of 50 genes expressed differentially in adenomas and carcinomas. Oncogene. 2002;21:4120–4128[Medline]
  21. Heighway J, Knapp T, Boyce L, Brennand S, Field JK, Betticher DC, et al. Expression profiling of primary non–small cell lung cancer for target identification. Oncogene. 2002;21:7749–7763[Medline]
  22. Bourguignon LY, Zhu H, Zhou B, Diedrich F, Singleton PA, Hung MC. Hyaluronan promotes CD44v3-Vav2 interaction with Grb2-p185(HER2) and induces Rac1 and Ras signaling during ovarian tumor cell migration and growth. J Biol Chem. 2001;276:48679–48692[Abstract/Free Full Text]
  23. Barker KT, Jackson LE, Crompton MR. BRK tyrosine kinase expression in a high proportion of human breast carcinomas. Oncogene. 1997;15:799–805[Medline]
  24. Easty DJ, Mitchell PJ, Patel K, Florenes VA, Spritz RA, Bennett DC. Loss of expression of receptor tyrosine kinase family genes PTK7 and SEK in metastatic melanoma. Int J Cancer. 1997;71:1061–1065[Medline]
  25. Manzano RG, Montuenga LM, Dayton M, Dent P, Kinoshita I, Vicent S, et al. CL100 expression is down-regulated in advanced epithelial ovarian cancer and its re-expression decreases its malignant potential. Oncogene. 2002;21:4435–4447[Medline]
  26. Feltes CM, Kudo A, Blaschuk O, Byers SW. An alternatively spliced cadherin-11 enhances human breast cancer cell invasion. Cancer Res. 2002;62:6688–6697[Abstract/Free Full Text]
  27. Kinoshita Y, Jarell AD, Flaman JM, Foltz G, Schuster J, Sopher BL, et al. Pescadillo, a novel cell cycle regulatory protein abnormally expressed in malignant cells. J Biol Chem. 2001;276:6656–6665[Abstract/Free Full Text]
  28. Miosge N, Holzhausen S, Zelent C, Sprysch P, Herken R. Nidogen-1 and nidogen-2 are found in basement membranes during human embryonic development. Histochem J. 2001;33:523–530[Medline]



This article has been cited by other articles:


Home page
JCOHome page
Z. Sun, D. A. Wigle, and P. Yang
Non-Overlapping and Non-Cell-Type-Specific Gene Expression Signatures Predict Lung Cancer Survival
J. Clin. Oncol., February 20, 2008; 26(6): 877 - 883.
[Abstract] [Full Text] [PDF]


Home page
Ann. Thorac. Surg.Home page
T. A. D'Amico
Molecular Biologic Staging of Lung Cancer
Ann. Thorac. Surg., February 1, 2008; 85(2): S737 - S742.
[Abstract] [Full Text] [PDF]


Home page
Cancer Epidemiol. Biomarkers Prev.Home page
C. M. Tammemagi, M. T. Freedman, T. R. Church, M. M. Oken, W. G. Hocking, P. A. Kvale, P. Hu, T. L. Riley, L. R. Ragard, P. C. Prorok, et al.
Factors Associated with Human Small Aggressive Non Small Cell Lung Cancer
Cancer Epidemiol. Biomarkers Prev., October 1, 2007; 16(10): 2082 - 2089.
[Abstract] [Full Text] [PDF]


Home page
J. Clin. Pathol.Home page
M C Boelens, A van den Berg, I Vogelzang, J Wesseling, D S Postma, W Timens, and H J M Groen
Differential expression and distribution of epithelial adhesion molecules in non-small cell lung cancer and normal bronchus
J. Clin. Pathol., June 1, 2007; 60(6): 608 - 614.
[Abstract] [Full Text] [PDF]


Home page
JCOHome page
D. N. Hayes, S. Monti, G. Parmigiani, C. B. Gilks, K. Naoki, A. Bhattacharjee, M. A. Socinski, C. Perou, and M. Meyerson
Gene Expression Profiling Reveals Reproducible Human Lung Adenocarcinoma Subtypes in Multiple Independent Patient Cohorts
J. Clin. Oncol., November 1, 2006; 24(31): 5079 - 5090.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
V. Nygaard and E. Hovig
Options available for profiling small samples: a review of sample amplification technology when combined with microarray profiling
Nucleic Acids Res., February 9, 2006; 34(3): 996 - 1014.
[Abstract] [Full Text] [PDF]


Home page
JCOHome page
M. Meyerson and D. Carbone
Genomic and Proteomic Profiling of Lung Cancers: Lung Cancer Classification in the Age of Targeted Therapy
J. Clin. Oncol., May 10, 2005; 23(14): 3219 - 3226.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to Personal Folders
Right arrow Download to citation manager
Right arrow Author home page(s):
Jonathan D'Cunha
Michael A. Maddaus
Right arrow Permission Requests
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Hoang, C. D.
Right arrow Articles by Maddaus, M. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hoang, C. D.
Right arrow Articles by Maddaus, M. A.
Related Collections
Right arrow Lung - cancer


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
ANN THORAC SURG ASIAN CARDIOVASC THORAC ANN EUR J CARDIOTHORAC SURG
J THORAC CARDIOVASC SURG ICVTS ALL CTSNet JOURNALS