Defining the genomic signature of the parous breast
- Suraj Peri†1,
- Ricardo López de Cicco†2,
- Julia Santucci-Pereira†2,
- Michael Slifker†1,
- Eric A Ross1,
- Irma H Russo2,
- Patricia A Russo2,
- Alan A Arslan3, 4,
- Ilana Belitskaya-Lévy5,
- Anne Zeleniuch-Jacquotte4,
- Pal Bordas6, 7,
- Per Lenner7,
- Janet Åhman6,
- Yelena Afanasyeva4,
- Robert Johansson8,
- Fathima Sheriff2,
- Göran Hallmans8,
- Paolo Toniolo3, 4, 9 and
- Jose Russo2Email author
© Peri et al.; licensee BioMed Central Ltd. 2012
Received: 24 January 2012
Accepted: 19 September 2012
Published: 11 October 2012
It is accepted that a woman's lifetime risk of developing breast cancer after menopause is reduced by early full term pregnancy and multiparity. This phenomenon is thought to be associated with the development and differentiation of the breast during pregnancy.
In order to understand the underlying molecular mechanisms of pregnancy induced breast cancer protection, we profiled and compared the transcriptomes of normal breast tissue biopsies from 71 parous (P) and 42 nulliparous (NP) healthy postmenopausal women using Affymetrix Human Genome U133 Plus 2.0 arrays. To validate the results, we performed real time PCR and immunohistochemistry.
We identified 305 differentially expressed probesets (208 distinct genes). Of these, 267 probesets were up- and 38 down-regulated in parous breast samples; bioinformatics analysis using gene ontology enrichment revealed that up-regulated genes in the parous breast represented biological processes involving differentiation and development, anchoring of epithelial cells to the basement membrane, hemidesmosome and cell-substrate junction assembly, mRNA and RNA metabolic processes and RNA splicing machinery. The down-regulated genes represented biological processes that comprised cell proliferation, regulation of IGF-like growth factor receptor signaling, somatic stem cell maintenance, muscle cell differentiation and apoptosis.
This study suggests that the differentiation of the breast imprints a genomic signature that is centered in the mRNA processing reactome. These findings indicate that pregnancy may induce a safeguard mechanism at post-transcriptional level that maintains the fidelity of the transcriptional process.
KeywordsGene expression profiling Pregnancy Breast morphology Breast differentiation Parous and nulliparous breast transcriptome Breast cancer risk Normal breast transcriptome Bioinformatics.
Epidemiological data from various parts of the world have consistently shown that early full term pregnancy and multiparity are associated with breast cancer risk reduction in postmenopausal women [1–3], whereas late pregnancy and nulliparity are associated with increased risk . It has been postulated that the mechanism of pregnancy-induced protection is mediated by changes in environmental settings , and/or alterations in the immunological profile of the host . Animal studies of the differentiation of the breast [7–9] under the influence of the complex hormonal milieu created by two newly formed endocrine organs, the placenta and the fetus , have unraveled the morphological, functional, genomic and transcriptomic changes that ultimately result in the induction of a permanent and specific profile that serves as an indicator of reduced cancer risk [11, 12]. There is some evidence supporting the concept that the degree of differentiation acquired through an early pregnancy changes the genomic signature that differentiates the lobular structures of parous from that of nulliparous women [3, 11–18]. Our efforts have been directed towards characterizing the molecular basis underlying the mechanism of pregnancy-induced protection [3, 11, 12, 14, 18].
One way to assess whether a specific genomic fingerprint is permanently imprinted in the breast by a full term pregnancy (FTP) is to compare the transcriptomic profiles of breasts from parous and nulliparous women. We have used a genome-wide approach to identify long-term genomic changes associated with FTP by studying breast core needle biopsies (CNBs) obtained from an ethnically homogeneous population of healthy postmenopausal volunteers residing in Norrbotten County, Sweden. We previously reported on the genes differentially expressed in parous and nulliparous women using a discovery/validation approach . In this paper, we describe the transcriptomic differences that were found between the breasts of parous and nulliparous women. In order to gain more statistical power in understanding the biological meaning of the transcriptomic differences, in this study the data from the discovery and validation phases were pooled and mined. Furthermore to mine the data depending on gravida status, we stratified the analyses depending on gravida status to identify importance of full-term pregnancy. Our results suggest that the differentiation of the breast induced by pregnancy imprints a genomic signature that can be detected in postmenopausal women, thus contributing to the establishment of the molecular basis of the protection against breast cancer conferred by parity.
To determine whether the pattern of gene expression differed between nulliparous and parous postmenopausal women, breast tissue was collected from volunteering healthy women residing in Norrbotten County, Sweden, an ethnically homogeneous population of Swedish or Finnish ancestry . A total of 389 women from a group who had received normal mammograms within the year prior to enrollment were initially interviewed between September 2008 and May 2009. 255 women fulfilled the eligibility criteria and signed an informed consent to participate in the study and to donate breast tissues in the form of core needle biopsies (CNB), and blood. We previously described various criteria that were used in determining the eligibility of those included in this study , such as women between 50 and 69 years of age, postmenopausal; i.e., lack of menstrual periods for 12 preceding months and elevated circulating levels of follicle stimulating hormone (FSH) (40–250 IU/L).
Based on reproductive history, eligible subjects were categorized as either parous or nulliparous. The parous group (P) included all women who had been pregnant (Gravida) one or more times and had delivered (parous) one or more live children. The nulliparous group (NP) included both nulligravida women who had never become pregnant and therefore never had a full term delivery, and women who had become pregnant one or more times (G≥1) but never completed a FTP, identified as nulligravida nulliparous (NN) and gravida nulliparous (GN), respectively. Both NN and GN women were considered as a single group (NP) for most analyses, unless indicated otherwise. The study protocol number 08-020M was approved by the Regional Ethical Review Board at the University of Umeå, Sweden. The study protocol number 02–829 was approved by the Institutional Review Board of Fox Chase Cancer Center, Philadelphia, USA.
Data and sample collection
All eligible subjects signed an informed consent and completed a questionnaire that collected data on reproductive history, medical history, family background of cancer, use of tobacco, oral contraceptive (OC), hormone replacement therapy (HRT), and/or other medications.
Breast core needle biopsies (CNBs) with 14 Gauge BARD® MONOPTY® disposable core biopsy instrument (Bard Biopsy Systems, Tempe, AZ) were performed by an experienced physician at the Mammography Department at Sunderby Hospital, Luleå, Sweden. Three to five CNBs were taken free hand from the upper outer quadrant of either right or left breast; one core was fixed in 70% ethanol for histopathological analysis and the remaining cores were placed in RNAlater® (Ambion, Austin, TX) solution for subsequent RNA extraction for genomic analysis. In addition to breast tissue samples, each participant provided blood and saliva samples that were stored at Umeå University at −20°C for subsequent laboratory analyses .
Total RNA from CNB specimen was isolated using the Qiagen Allprep RNA/DNA Mini Kit according to manufacturer's instructions (Qiagen, Alameda, CA, USA). The quantity of total RNA obtained from every specimen ranged from 150ng to 4μg, as determined using NanoDrop v3.3.0 (NanoDrop Technologies, Wilmington, DE); RNA quality was assessed using an Agilent 2100 Bioanalyzer (Agilent Technologies, CA, USA).
The GeneChip Expression 3’-Amplification Two-Cycle cDNA Synthesis Kit (Affymetrix, Santa Clara, CA) was used to prepare the cRNA for hybridization following the manufacturer’s protocol. The samples were hybridized to Affymetrix HG_U133 Plus 2.0 oligonucleotide arrays. 113 chips (71 parous, 42 nulliparous) satisfied quality control thresholds based on standard Affymetrix quality control measures and graphical criteria based on probe-level model (PLM) analysis as implemented in the Bioconductor affyPLM package. Affymetrix CEL files were pre-processed using RMA . To account for between-batch variability in the arrays, the data were adjusted using ComBat . After filtering, 18,694 probesets remained for further analysis.
Mining for functional categories and pathways
We applied data mining methods to identify enriched biological processes and pathways. Gene ontology (GO) functional categories enriched in differentially expressed genes were identified using conditional hypergeometric tests in the Bioconductor GOstats package. We carried out this analysis independently for up and down regulated genes while selecting the genes represented on U133plus2 chip as gene universe. An enrichment p-value cut off of 0.01 was used to select GO terms.
To identify pathways and associations to other previously described datasets, Gene Set Enrichment Analysis (GSEA)  was performed. Since we were interested in finding pathways and co-regulated genes, we expanded the list of differentially expressed genes by relaxing the p-value to 0.01 and did not apply any fold change filter. Pathways obtained from MSigDB (database of gene sets provided by GSEA) were tested for enrichment. Default parameters were chosen, except that in the case of genes with multiple probesets, only the probeset with maximum expression intensity was considered for analysis.
Validation through real time RT-PCR
Total RNA was reverse-transcribed (RT) using MMLV reverse transcriptase (Ambion, Austin, TX) and anchored oligo-dT. Real-time Taqman PCR Assays-on-Demand were run using Universal PCR master mix (Applied Biosystems, Foster City, CA) on a 7900 HT instrument. For each gene, the log2 fold change between parous and nulliparous samples was estimated as the difference in median Ct values. To assess the statistical significance of the differences, two-sample Wilcoxon tests were performed, and comparisons with p-value <0.05 were considered statistically significant. For comparison with the microarray study, log2 fold changes were estimated as the differences in median batch-adjusted RMA normalized gene expression intensities in the same subset of 28 samples.
Histopathological and immunohistochemical analyses
From each breast biopsy collected, one core was fixed in 70% ethanol and processed for histopathological and immunohistochemical (IHC) analyses following standard procedures. IHC analysis for cyclin L2 (CCNL2) was performed in 21 NP and 29 P CNBs tissues utilizing a polyclonal anti-human cyclin L2 (CCNL2) antibody (Novus Biologicals, Cambridge, UK) at a concentration of 5μg/ml. Reactions were performed using the MultiLink Detection Kit for HRP/DAB and the i6000 automatic stainer (both from Biogenex, San Ramon, CA). CCNL2 positive cells were scored according to the intensity of brown nuclear stain as negative (0), weakly positive (+) or strongly positive (++). Results were expressed as the percentage of positive cells over total number of epithelial cells present in ducts and lobules type 1 in each section.
Volunteers included in the analysis
As described in Methods, the study participants consisted of postmenopausal women that were grouped according to their reproductive history into parous (P) and nulliparous (NP). The nulliparous group included both, nulligravida nulliparous (NN) and gravida nulliparous (GN); both NN and GN women were considered within the NP as a single group for most analyses, unless indicated otherwise. CNBs were first analyzed histopathologically in order to determine the adequacy of tissue, presence of ductal and lobular structures and the characteristics of the stroma. A total of 126 biopsies obtained from 82 P and 44 NP women were eligible for final genomic analysis. The group of nulliparous women was 58.9±5.2 years old and the group of parous women was 59.6±5.8 years old and there were not statistic differences among the two groups. Therefore a variation in age subsets was not detected in this study, and altogether the data reflect a well controlled group of postmenopausal women.
From the 126 samples hybridized to Affymetrix HG_U133 Plus 2.0 oligonucleotide arrays, 113 chips (71 P, 42 NP) satisfied quality control thresholds. Using empirical Bayes moderated t-statistics with p-value less than 0.001 and a minimum log2 fold-change of 0.3 thresholds as criteria of significance, we identified 305 differentially expressed probesets (corresponding to 208 distinct genes) between P and NP women (see Additional file 1: Table S1). Of these, 267 were up-regulated and 38 were down-regulated (Figure 1). In order to test the validity of the list of differentially expressed genes, the parity status labels were randomly permuted and the analysis was repeated 10,000 times. Over half of the permutations yielded either 0 or 1 differentially expressed gene, the 95th percentile of counts was 15 differentially expressed genes, and only 8 out of 10,000 permutations yielded at least 305 differentially expressed genes (i.e. p-value = 0.0008) suggesting the differentially expressed gene signature does not arise by chance. Hierarchical clustering of the differentially expressed probesets (with no sample clustering) shows the pattern of up and down regulated genes within each group (see Additional file 1: Figure S1). To understand the biological theme of the observed gene expression differences, we carried out bioinformatics-based analysis of microarray data.
GO biological processes enriched for both up and down regulated genes between parous and nulliparous breast samples
GO term ID
GO term (p-value)
Genes up-regulated in parous samples
cell-substrate junction assembly (0.006)
KRT5, LAMA3, LAMC2
ectoderm development (0.002)
COL7A1, KRT5, KRT15, LAMA3, LAMC2, NTF4, KLK7
epidermis development (0.001)
COL7A1, KRT5, KRT15, LAMA3, LAMC2, NTF4, KLK7
formation of organ boundary(0.009)
hemidesmosome assembly (0.000)
KRT5, LAMA3, LAMC2
mRNA metabolic process (0.000)
CIRBP, RBMX, HNRNPA1, HNRNPA2B1, HNRNPD, LUC7L3, PNN, PRPF39, RBM25, SFPQ, SFRS1, SFRS5, SFRS7, PABPN1, PRPF4B
mRNA processing (0.000)
RBMX, HNRNPA1, HNRNPA2B1, LUC7L3, PNN, PRPF39, RBM25, SFPQ, SFRS1, SFRS5, SFRS7, PABPN1, PRPF4B
response to anoxia (0.009)
RNA metabolic process (0.006)
DDX17, CHD2,C BX3, CIRBP, ZNF785, EZH2, L3MBTL, GATA3, RBMX, ZNF789, HNRNPA1, HNRNPA2B1, HNRNPD, LUC7L3, PNN, PRPF39, ZNF83, METTL3, CREBZF, RBM25, RBBP8, RPS24, CENPK, SFPQ, SFRS1, SFRS5, SFRS7, ZNF814, ZNF207, PABPN1, RUNX3, FUBP1, PRPF4B, HNRPDL
RNA processing (0.000)
DDX17, RBMX, HNRNPA1, HNRNPA2B1, HNRNPD, LUC7L3, PNN, PRPF39, RBM25, RPS24, SFPQ, SFRS1, SFRS5, SFRS7, PABPN1, PRPF4B, HNRPDL
RNA splicing (0.000)
RBMX, HNRNPA1, HNRNPA2B1, HNRNPD, LUC7L3, PNN, PRPF39, RBM25, SFPQ, SFRS1, SFRS5, SFRS7, PABPN1, PRPF4B
GO term ID
Genes down-regulated in parous samples
activation of Ral GTPase activity (0.003)
cell proliferation in hindbrain (0.005)
glycolate metabolic process (0.001)
muscle cell differentiation (0.006)
myoblast proliferation (0.003)
phosphatidic acid biosynthetic process (0.005)
positive regulation of cellular metabolic process (0.005)
EBF1, IGF1, ABHD5, SOX6
positive regulation of glycolysis (0.006)
positive regulation of lipoprotein lipase activity (0.006)
positive regulation of tyrosine phosphorylation of Stat5 protein (0.009)
regulation of cellular catabolic process (0.002)
regulation of insulin-like growth factor receptor signaling pathway (0.007)
regulation of muscle cell apoptosis (0.006)
regulation of Ral protein signal transduction (0.003)
regulation of sequestering of triglyceride (0.006)
regulation of steroid hormone receptor signaling pathway (0.010)
regulation of triglyceride metabolic process (0.008)
skeletal muscle tissue regeneration (0.009)
small GTPase mediated signal transduction (0.005)
IGF1, RASD1, RALGAPA2
somatic stem cell maintenance (0.010)
Among the down-regulated genes, insulin-like growth factor 1 (IGF-1) was enriched in 19 biological processes that comprised cell proliferation, regulation of IGF-like growth factor receptor signaling, somatic stem cell maintenance, muscle cell differentiation and apoptosis, among others. Other down-regulated genes were RALGAPA2, SOX6, ABHD5, EBF1 and RASD1 (Table 1).
Enriched GSEA pathways and gene sets for both up- and down-regulated genes. ‘NES’ represents normalized enrichment score
Pathways and gene sets enriched by up-regulated genes
Breast cancer estrogen signaling
SCGB2A1, TFF1, SCGB2A2, SCGB1D2, STC2, GATA3, SERPINB5, SERPINA3, AZGP1, TP53, CCNA2, CCNE2, EGFR, PTEN, DLC1, FHL5
HSA01430 cell communication
KRT15, DSC3, DSG3, KRT5, COL4A6, KRT17, KRT14, LAMC2, LAMA3, LAMB3, THBS3, LAMA5,LAMC1, GJA4, LAMA4, VWF, COL4A1, COL4A2
MRNA processing reactome
METTL3, HNRPD, HNRPA2B1, PRPF4B, SFRS7, CLK4, SFRS5, PABPN1, CSTF3, HNRPU, RBM5, SNRP70, SFRS14, SNRPA1, CLK2, NXF1, SFRS8, SFRS2, PTBP2, FUS, SFRS6, SFRS16, SF3B1, HNRPA3, SNRPB, PRPF3, SFRS12, U2AF1, PHF5A, TXNL4A, CUGBP2
Pathways and gene sets enriched by down-regulated genes
HSA04910 insulin signaling pathway
CALML3, SHC4, IKBKB, PKM2, PIK3CD, PHKA1, CALM1, CBL, MAPK9, GSK3B, SKIP, MAP2K1, PIK3R3, CRK, IRS2, SORBS1, SOS1, PDE3A, PDE3B, PRKAR2B, MAPK10, PCK1
HSA04080 neuroactive ligand receptor interaction
NPY2R, OXTR, PARD3, GABBR1, LTB4R, NR3C1, EDNRA, EDNRB, EDG1, CALCRL, ADRB2, AGTRL1, ADRA2A, GHR, PTGER3, ADRA1A
HSA04530 tight junction
CGN, INADL, EXOC3, LLGL2, PARD3, PARD6G, TJP1, EPB41L1, ZAK, PPP2R1B, PTEN, GNAI1, RRAS2, CTNNA1, MAGI1, CLDN10
HSA04010 MAPK signaling pathway
NTF5, FGFR3, CACNA1D, RASGRP1, MAP3K14, TP53, CACNA1G, FAS, PDGFA, IKBKB, CACNA2D2, DAXX, GNA12, MAPK9, EGFR, GADD45A, DUSP10, CHP, DUSP3, MAP2K1, CRK, MEF2C, ZAK, EVI1, TGFBR1, SOS1, FGF10, RAPGEF2, RRAS2, RASGRF2, MAPK10, ACVR1C
HSA05210 colorectal cancer
TP53, IGF1R, FZD8, RALGDS, PIK3CD, FZD6, CYCS, MAPK9, EGFR, GSK3B, MAP2K1, PIK3R3, TGFBR1, SOS1, TCF7L2, FZD4, MAPK10, ACVR1C
HSA04510 focal adhesion
PAK7, COL4A6, LAMC2, LAMA3, SHC4, PAK6, ITGA10, LAMB3, ITGA4, PDGFA, IGF1R, THBS3, LAMA5, ZYX, PPP1R12A, ITGA2, PIK3CD, MAPK9, CAV1, EGFR, PARVA, GSK3B, LAMC1, MAP2K1, PIK3R3, CRK, FLT1, FYN, CCND2, PTEN, LAMA4, SOS1, VWF, IGF1, CAV2, TLN2, COL4A1, PDGFC, COL4A2, MAPK10
Integrin mediated cell adhesion KEGG
CAPN3, PAK6, ITGA10, ITGA4, ZYX, ITGA2, CAV1, MAP2K1, CRK, SORBS1, FYN, SOS1, CAV2, TNS1, MAPK10
HSA04310 WNT signaling pathway
CSNK1A1, NFATC3, TP53, WNT5A, VANGL2, FZD8, FZD6, MAPK9, GSK3B, CHP, DAAM1, DAAM2, SOX17, PPP2R1B, CCND2, TCF7L2, FZD4, MAPK10
ST INTEGRIN SIGNALING PATHWAY
PAK7, PAK6, ITGA10, ITGA4, ZYX, ITGA2, MAPK9, CAV1, CRK, ARHGEF7, FYN, PTEN, SOS1, TLN2, MAPK10
HSA04060 cytokine cytokine receptor interaction
CXCL6, IL28RA, CCL5, IFNGR2, FAS, IL4R, TNFSF15, PF4V1, TNFRSF14, IL2RB, IFNAR1, EGFR, LIFR, FLT1, BMPR2, TGFBR1, PDGFC, GHR, BMP2.
Contribution of full-term pregnancy (FTP) to transcriptomic changes
To investigate whether an incomplete pregnancy could induce transcriptomic changes in the breast tissue, in the nulliparous group (NP) we compared gene expression of GN against NN. We did not find any significant differences in gene expression between these two subgroups. Comparison between P and GN revealed that 12 genes (18 probes) were differentially expressed (see Additional file 1: Table S2). The comparison between P and NN revealed that 125 genes (206 probes) (see Additional file 1: Table S3) were differentially expressed, and among these, 107 genes had been identified in the comparison P vs. NP. These results suggest that in this study population, FTP was required for inducing detectable changes in the transcriptome.
Validation of microarray results
RT-PCR validation results
CREB/ATF bZIP transcription factor
X (inactive)-specific transcript (non-protein coding)
AHA1, activator of heat shock 90kDa protein ATPase homolog 2 (yeast)
Cold inducible RNA binding protein
Paired immunoglobin-like type 2 receptor beta
SRY (sex determining region Y)-box 6
The work reported here demonstrates that differentiation of the breast induced by an early pregnancy imprints a specific genomic signature that can be detected in postmenopausal women. Using bioinformatics methods we found transcriptomic differences between the breasts of parous and nulliparous women. These differentially expressed genes were used to identify enriched biological processes and pathways. Enriched biological processes related to up-regulated genes included RNA related processes, differentiation and development of epidermis and ectoderm, and cell-substrate junction assembly; whereas in the case of down-regulated genes the biological processes that were enriched included IGF-like growth factor signaling, somatic stem cell maintenance and apoptosis. Pathways that were enriched by up-regulated genes included breast cancer estrogen signaling, cell communication and mRNA processing machinery. Numerous pathways were enriched by down-regulated genes; the most significant ones were the insulin, Wnt and integrins signaling pathways, MAPK, cytokine-cytokine receptor interaction, tight junction and focal adhesion, all representing proteins that are highly expressed in malignancies.
The main components of the spliceosome machinery, including RNA and proteins that undergo dynamic changes during the splicing reaction, were up-regulated in the parous breast. Among them were the heterogeneous nuclear ribonucleoproteins (HNRPs) that include HNRPA3, HNRPA2B1, HNRPD and HNRPU , which are implicated in the regulation of mRNA stability, as well as other functions, such as mammary gland involution , negative regulation of telomere length maintenance , and regulation of mRNA trafficking from the nucleus to distal processes in neural cells . Although further studies are needed to define their precise functional role in the postmenopausal breast, we postulate that they may play an important regulatory function as transcriptional regulators. In addition, post-transcriptional methylation of internal adenosine residues in eukaryotic mRNAs by METTL3 (methyltransferase like 3), which is up-regulated in the parous breast, could play a role in the efficiency of mRNA splicing, transport or translation in the differentiated breast epithelium. Other members of the spliceosome complex are the proteins encoded by the genes SF3B1, SFRS2, SFRS7, SFRS8, SFRS14, SFRS16, SNRP70, SNRPB, SNRPA1, PRF3 and PHF5A, all of which are overexpressed in the parous breast. In the case of the small nuclear ribonucleoproteins (snRNPs), there is evidence that they suppress tumor cell growth and may have major implications as cancer therapeutic targets. The pre-mRNA splicing factors are enriched in nuclear domains termed interchromatin granule clusters or nuclear speckles. Among the members of the splicing factor compartment are CCNL1 and CCNL2 that participate in the pre-mRNA splicing process and are located in the nuclear speckles [32, 33]. These two genes are up-regulated in the parous breast and the CCNL2 protein is also overexpressed in the nucleus of breast epithelial cells. CCNL1 and CCNL2 are transcriptional regulators [32, 33] that modulate the expression of critical factors leading to cell apoptosis, possibly through the Wnt signal transduction pathway , a signaling pathway that is enriched by down-regulated genes in the parous breast (Table 2). In our previously published preclinical and clinical studies [3, 7, 9, 11], we have reported that pregnancy confers protection from breast cancer development by inducing gland differentiation, which imprints a specific and permanent genomic signature in this organ. A similar phenomenon was demonstrated in the breast of postmenopausal parous women characterized by fatty involution . We previously described a small case–control study of transcriptomic analysis of normal breast tissues obtained from parous and nulliparous women free of breast pathology and parous and nulliparous women with history of breast cancer that served as controls and cases, respectively . In order to investigate the degree of commonality between the previous case–control and the present study, we applied GO enrichment analysis to the gene lists generated from both studies. We found that processes involved in RNA metabolism and RNA processing were similar in both studies (see Additional file 1: Table S4).
A number of non-coding RNAs that included XIST, MALAT-1 (also called NEAT2) and NEAT1 were up-regulated in the parous breast. XIST, which inactivates X chromosome as an early developmental process, plays an essential role in female mammals by providing dosage equivalence between males and females. Up-regulation of XIST occurs upon differentiation, whereas failure to express XIST is often seen in malignancies and in early embryogenesis . Our findings are supported by recent reports that suggest that XIST is expressed in adult well-differentiated cells in order to maintain gene repression [35–39]. Oxytocin, a neurotransmitter that acts through it specific receptor OXTR and is overexpressed during lactation, up-regulates the expression of MALAT-1, a highly conserved non-coding RNA [40, 41]. Interestingly, both MALAT1 and OXTR remain overexpressed in the breast of postmenopausal parous women. NEAT1 and NEAT2 localize to the periphery and to the interior the spliceosome assembly factor SC35 domains or speckles. Our observation that in breast epithelial cells CCNL2 is highly enriched in nuclear speckles (Figure 2) indicates that CCNL2 might colocalize with NEAT1 and NEAT2. The down-regulation of NEAT1, NEAT2 and XIST in the breast of nulliparous women, in whom this organ never reached a stage of complete differentiation similar to that achieved after completion of pregnancy and lactation , suggests that the undifferentiated breast is not actively involved in the RNA metabolism that is necessary for maintaining a state of differentiation.
Although in this study we did not observe differential expression in estrogen receptor between parous and nulliparous breasts, several genes that are directly or indirectly regulated by estrogen receptor were up- or down-regulated in the parous breast and were found to be enriched in the breast cancer estrogen signaling gene set. Among them, GATA3, an important component of this gene set, is crucial to mammary gland morphogenesis and differentiation of progenitor cells. GATA3 has been suggested to be a tumor suppressor , a fact supported by the observations that induction of its expression in GATA3-negative undifferentiated carcinoma cells is sufficient to induce tumor differentiation and inhibition of tumor dissemination [44–46]. The down-regulation of RASD1 (RAS, dexamethasone-induced 1), a potential miR-375 target that negatively regulates ER alpha expression in breast cancer further confirms that the genes involved in the estrogen receptor regulated pathways could be under permanent transcriptional modification as a manifestation of a higher degree of cell differentiation of the parous breast, in spite of the lack of transcriptomic differences in the levels of the receptor between parous and nulliparous breast tissues.
Cell communication, which is a key element in the process of cell and organ differentiation, is well represented in the breast of parous women. The parous breast exhibits up-regulation of desmocollin (DSC3), a calcium-dependent glycoprotein that is a member of the desmocollin subfamily of the cadherin superfamily. Members of this desmosomal family, along with the desmogleins, are found primarily in epithelial cells where they constitute the adhesive proteins of the desmosome cell-cell junction and are required for cell adhesion and desmosome formation. In addition, the up-regulation of matrix Gla proteins (MGP), laminins (LAMA3 and LAMC2) and keratin 5 (KRT5) in the parous breast reflect the greater differentiated state of the breast epithelial cells . This concept is supported by the observation that the loss of Matrix Gla protein expression may be associated with tumor progression and metastasis .
Our findings that insulin-like growth factor 1 (IGF-1) is down-regulated in the parous breast is consistent with published data reporting overall lower levels of IGF-1 in parous than in nulliparous women  and support the association of IGF1 with increased breast cancer risk . It is known that IGF-1 stimulates mitosis and inhibits apoptosis, playing a significant role in signaling pathways involved in the pathogenesis of breast cancer. The down regulation of IGF-1 in the parous breast, in association with the significant down-regulation of SOX6, EBF1 (early B-cell factor 1), ABHD5, RASD1, a potential miR-375 target that negatively regulates ER alpha expression in breast cancer , and RALGAPA2, could represent a significant driving force in the reduction of breast cancer risk conferred by pregnancy.
In this study using a core needle biopsy of postmenopausal breast parenchyma comprising of stroma and lobular structures, we found a specific genomic signature induced by FTP. This genomic signature suggests that the differentiation process of breast cells is centered in the mRNA processing reactome, which emerges as an important regulatory pathway induced by pregnancy. The biological importance of the differential expression of genes that control the spliceosome could be an indication of a safeguard mechanism at post-transcriptional level that maintains the fidelity of the transcriptional process. In addition, the critical regulatory pre-mRNA splicing mechanism could also regulate the expression of specific genes controlling estrogen signaling pathways, cell communication and differentiation, as well as pathways related to chromatin remodeling, altogether resulting in control of cell differentiation and breast cancer prevention. Future studies are needed to confirm these results, in particular studies focusing specifically on lobular epithelial cells selected using laser capture microdissection (LCM). Finally, digital transcriptome analysis such RNA-Seq methods will help in understanding the precise differentiation paradigms in parous breast tissue.
This work was supported by grant 02-2008-034 from the Avon Foundation for Women Breast Cancer Research Program, NIH core grant CA06927 to Fox Chase Cancer Center and an appropriation from the Commonwealth of Pennsylvania. The authors thank the women of Norrbotten County, Sweden, for their willing contribution to the project, and the staff of the Mammography Department, Sunderby Hospital, Luleå, Sweden.
- Clarke CA, Purdie DM, Glaser SL: Population attributable risk of breast cancer in white women associated with immediately modifiable risk factors. BMC Cancer. 2006, 6: 170-10.1186/1471-2407-6-170.View ArticlePubMedPubMed CentralGoogle Scholar
- Jemal A, Siegel R, Ward E, Murray T, Xu J, Thun MJ: Cancer statistics, 2007. CA Cancer J Clin. 2007, 57: 43-66. 10.3322/canjclin.57.1.43.View ArticlePubMedGoogle Scholar
- Russo J, Balogh GA, Russo IH: Full-term pregnancy induces a specific genomic signature in the human breast. Cancer Epidemiol Biomarkers Prev. 2008, 17: 51-66. 10.1158/1055-9965.EPI-07-0678.View ArticlePubMedGoogle Scholar
- MacMahon B, Cole P, Lin TM, Lowe CR, Mirra AP, Ravnihar B, Salber EJ, Valaoras VG, Yuasa S: Age at first birth and breast cancer risk. Bull World Health Organ. 1970, 43: 209-221.PubMedPubMed CentralGoogle Scholar
- Thordarson G, Jin E, Guzman RC, Swanson SM, Nandi S, Talamantes F: Refractoriness to mammary tumorigenesis in parous rats: is it caused by persistent changes in the hormonal environment or permanent biochemical alterations in the mammary epithelia?. Carcinogenesis. 1995, 16: 2847-2853. 10.1093/carcin/16.11.2847.View ArticlePubMedGoogle Scholar
- Sinha DK, Pazik JE, Dao TL: Prevention of mammary carcinogenesis in rats by pregnancy: effect of full-term and interrupted pregnancy. Br J Cancer. 1988, 57: 390-394. 10.1038/bjc.1988.88.View ArticlePubMedPubMed CentralGoogle Scholar
- Russo J, Russo IH: Influence of differentiation and cell kinetics on the susceptibility of the rat mammary gland to carcinogenesis. Cancer Res. 1980, 40: 2677-2687.PubMedGoogle Scholar
- Tay LK, Russo J: Formation and removal of 7,12-dimethylbenz[a]anthracene–nucleic acid adducts in rat mammary epithelial cells with different susceptibility to carcinogenesis. Carcinogenesis. 1981, 2: 1327-1333. 10.1093/carcin/2.12.1327.View ArticlePubMedGoogle Scholar
- Russo IH, Koszalka M, Russo J: Comparative study of the influence of pregnancy and hormonal treatment on mammary carcinogenesis. Br J Cancer. 1991, 64: 481-484. 10.1038/bjc.1991.335.View ArticlePubMedPubMed CentralGoogle Scholar
- Fisher DA: Fetal and neonatal endocrinology. Endocrinology. Edited by: DeGroot LJ, Jameson JL. 2006, Elsevier Saunders, Philadelphia, PA, 3369-3386. 5Google Scholar
- Russo J, Moral R, Balogh GA, Mailo D, Russo IH: The protective role of pregnancy in breast cancer. Breast Cancer Res. 2005, 7: 131-142. 10.1186/bcr1029.View ArticlePubMedPubMed CentralGoogle Scholar
- Russo J, Russo IH: Role of differentiation in the pathogenesis and prevention of breast cancer. Endocr Relat Cancer. 1997, 4: 7-21. 10.1677/erc.0.0040007.View ArticleGoogle Scholar
- Henry MD, Triplett AA, Oh KB, Smith GH, Wagner KU: Parity-induced mammary epithelial cells facilitate tumorigenesis in MMTV-neu transgenic mice. Oncogene. 2004, 23: 6980-6985. 10.1038/sj.onc.1207827.View ArticlePubMedGoogle Scholar
- Srivastava P, Russo J, Russo IH: Chorionic gonadotropin inhibits rat mammary carcinogenesis through activation of programmed cell death. Carcinogenesis. 1997, 18: 1799-1808. 10.1093/carcin/18.9.1799.View ArticlePubMedGoogle Scholar
- Medina D: Breast cancer: the protective effect of pregnancy. Clin Cancer Res. 2004, 10: 380S-384S. 10.1158/1078-0432.CCR-031211.View ArticlePubMedGoogle Scholar
- Ginger MR, Gonzalez-Rimbau MF, Gay JP, Rosen JM: Persistent changes in gene expression induced by estrogen and progesterone in the rat mammary gland. Mol Endocrinol. 2001, 15: 1993-2009. 10.1210/me.15.11.1993.View ArticlePubMedGoogle Scholar
- D'Cruz CM, Moody SE, Master SR, Hartman JL, Keiper EA, Imielinski MB, Cox JD, Wang JY, Ha SI, Keister BA, Chodosh LA: Persistent parity-induced changes in growth factors, TGF-beta3, and differentiation in the rodent mammary gland. Mol Endocrinol. 2002, 16: 2034-2051. 10.1210/me.2002-0073.View ArticlePubMedGoogle Scholar
- Russo J, Russo IH: Endocrine control of breast development. Molecular basis of breast cancer: prevention and treatment. Edited by: Russo J, Russo IH. 2004, Springer, Berlin, 64-67.View ArticleGoogle Scholar
- Belitskaya-Levy I, Zeleniuch-Jacquotte A, Russo J, Russo IH, Bordas P, Ahman J, Afanasyeva Y, Johansson R, Lenner P, Li X, de Cicco RL, Peri S, Ross E, Russo PA, Santucci-Pereira J, Sheriff FS, Slifker M, Hallmans G, Toniolo P, Arslan AA: Characterization of a genomic signature of pregnancy identified in the breast. Cancer Prev Res. 2011, 4: 1457-1464. 10.1158/1940-6207.CAPR-11-0021.View ArticleGoogle Scholar
- Bolstad BM, Irizarry RA, Astrand M, Speed TP: A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics. 2003, 19: 185-193. 10.1093/bioinformatics/19.2.185.View ArticlePubMedGoogle Scholar
- Johnson WE, Li C, Rabinovic A: Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics. 2007, 8: 118-127. 10.1093/biostatistics/kxj037.View ArticlePubMedGoogle Scholar
- Smyth GK: Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol. 2004, 3: Article3.PubMedGoogle Scholar
- Smyth GK: Limma: linear models for microarray data. Bioinformatics and Computational Biology Solutions using R and Bioconductor. Edited by: Gentleman R, Carey V, Dudoit S, Irizarry R, Huber W. 2005, Springer, New York, 397-420.View ArticleGoogle Scholar
- Gentleman RC, Carey VJ, Bates DM, Bolstad B, Dettling M, Dudoit S, Ellis B, Gautier L, Ge Y, Gentry J, Hornik K, Hothorn T, Huber W, Iacus S, Irizarry R, Leisch F, Li C, Maechler M, Rossini AJ, Sawitzki G, Smith C, Smyth G, Tierney L, Yang JY, Zhang J: Bioconductor: open software development for computational biology and bioinformatics. Genome Biol. 2004, 5: R80-10.1186/gb-2004-5-10-r80.View ArticlePubMedPubMed CentralGoogle Scholar
- Benjamini Y, Drai D, Elmer G, Kafkafi N, Golani I: Controlling the false discovery rate in behavior genetics research. Behav Brain Res. 2001, 125: 279-284. 10.1016/S0166-4328(01)00297-2.View ArticlePubMedGoogle Scholar
- Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA. 2005, 102: 15545-15550. 10.1073/pnas.0506580102.View ArticlePubMedPubMed CentralGoogle Scholar
- de Graaf K, Hekerman P, Spelten O, Herrmann A, Packman LC, Bussow K, Muller-Newen G, Becker W: Characterization of cyclin L2, a novel cyclin with an arginine/serine-rich domain: phosphorylation by DYRK1A and colocalization with splicing factors. J Biol Chem. 2004, 279: 4612-4624.View ArticlePubMedGoogle Scholar
- Wahl MC, Will CL, Luhrmann R: The spliceosome: design principles of a dynamic RNP machine. Cell. 2009, 136: 701-718. 10.1016/j.cell.2009.02.009.View ArticlePubMedGoogle Scholar
- Taga Y, Miyoshi M, Okajima T, Matsuda T, Nadano D: Identification of heterogeneous nuclear ribonucleoprotein A/B as a cytoplasmic mRNA-binding protein in early involution of the mouse mammary gland. Cell Biochem Funct. 2010, 28: 321-328. 10.1002/cbf.1662.View ArticlePubMedGoogle Scholar
- Huang PR, Hung SC, Wang TC: Telomeric DNA-binding activities of heterogeneous nuclear ribonucleoprotein A3 in vitro and in vivo. Biochim Biophys Acta. 2010, 1803: 1164-1174. 10.1016/j.bbamcr.2010.06.003.View ArticlePubMedGoogle Scholar
- Han SP, Friend LR, Carson JH, Korza G, Barbarese E, Maggipinto M, Hatfield JT, Rothnagel JA, Smith R: Differential subcellular distributions and trafficking functions of hnRNP A2/B1 spliceoforms. Traffic. 2010, 11: 886-898. 10.1111/j.1600-0854.2010.01072.x.View ArticlePubMedPubMed CentralGoogle Scholar
- Loyer P, Trembley JH, Grenet JA, Busson A, Corlu A, Zhao W, Kocak M, Kidd VJ, Lahti JM: Characterization of cyclin L1 and L2 interactions with CDK11 and splicing factors: influence of cyclin L isoforms on splice site selection. J Biol Chem. 2008, 283: 7721-7732. 10.1074/jbc.M708188200.View ArticlePubMedGoogle Scholar
- Li HL, Wang TS, Li XY, Li N, Huang DZ, Chen Q, Ba Y: Overexpression of cyclin L2 induces apoptosis and cell-cycle arrest in human lung cancer cells. Chin Med J (Engl). 2007, 120: 905-909.Google Scholar
- Zhuo L, Gong J, Yang R, Sheng Y, Zhou L, Kong X, Cao K: Inhibition of proliferation and differentiation and promotion of apoptosis by cyclin L2 in mouse embryonic carcinoma P19 cells. Biochem Biophys Res Commun. 2009, 390: 451-457. 10.1016/j.bbrc.2009.09.089.View ArticlePubMedGoogle Scholar
- Erwin JA, Lee JT: Characterization of X-chromosome inactivation status in human pluripotent stem cells. Curr Protoc Stem Cell Biol. 2010, Chapter 1: Unit 1B 6.PubMedGoogle Scholar
- Do JT, Han DW, Gentile L, Sobek-Klocke I, Wutz A, Scholer HR: Reprogramming of Xist against the pluripotent state in fusion hybrids. J Cell Sci. 2009, 122: 4122-4129. 10.1242/jcs.056119.View ArticlePubMedGoogle Scholar
- Vincent-Salomon A, Ganem-Elbaz C, Manie E, Raynal V, Sastre-Garau X, Stoppa-Lyonnet D, Stern MH, Heard E: X inactive-specific transcript RNA coating and genetic instability of the X chromosome in BRCA1 breast tumors. Cancer Res. 2007, 67: 5134-5140. 10.1158/0008-5472.CAN-07-0465.View ArticlePubMedGoogle Scholar
- Xiao C, Sharp JA, Kawahara M, Davalos AR, Difilippantonio MJ, Hu Y, Li W, Cao L, Buetow K, Ried T, Chadwick BP, Deng CX, Panning B: The XIST noncoding RNA functions independently of BRCA1 in X inactivation. Cell. 2007, 128: 977-989. 10.1016/j.cell.2007.01.034.View ArticlePubMedGoogle Scholar
- Silver DP, Dimitrov SD, Feunteun J, Gelman R, Drapkin R, Lu SD, Shestakova E, Velmurugan S, Denunzio N, Dragomir S, Mar J, Liu X, Rottenberg S, Jonkers J, Ganesan S, Livingston DM: Further evidence for BRCA1 communication with the inactive X chromosome. Cell. 2007, 128: 991-1002. 10.1016/j.cell.2007.02.025.View ArticlePubMedGoogle Scholar
- Breton C, Di Scala-Guenot D, Zingg HH: Oxytocin receptor gene expression in rat mammary gland: structural characterization and regulation. J Mol Endocrinol. 2001, 27: 175-189. 10.1677/jme.0.0270175.View ArticlePubMedGoogle Scholar
- Koshimizu TA, Fujiwara Y, Sakai N, Shibata K, Tsuchiya H: Oxytocin stimulates expression of a noncoding RNA tumor marker in a human neuroblastoma cell line. Life Sci. 2010, 86: 455-460. 10.1016/j.lfs.2010.02.001.View ArticlePubMedGoogle Scholar
- Russo J, Rivera R, Russo IH: Influence of age and parity on the development of the human breast. Breast Cancer Res Treat. 1992, 23: 211-218. 10.1007/BF01833517.View ArticlePubMedGoogle Scholar
- Wilson BJ, Giguere V: Meta-analysis of human cancer microarrays reveals GATA3 is integral to the estrogen receptor alpha pathway. Mol Cancer. 2008, 7: 49-10.1186/1476-4598-7-49.View ArticlePubMedPubMed CentralGoogle Scholar
- Chou J, Provot S, Werb Z: GATA3 in development and cancer differentiation: cells GATA have it!. J Cell Physiol. 2010, 222: 42-49. 10.1002/jcp.21943.View ArticlePubMedPubMed CentralGoogle Scholar
- Pei XH, Bai F, Smith MD, Usary J, Fan C, Pai SY, Ho IC, Perou CM, Xiong Y: CDK inhibitor p18(INK4c) is a downstream target of GATA3 and restrains mammary luminal progenitor cell proliferation and tumorigenesis. Cancer Cell. 2009, 15: 389-401. 10.1016/j.ccr.2009.03.004.View ArticlePubMedPubMed CentralGoogle Scholar
- Kouros-Mehr H, Bechis SK, Slorach EM, Littlepage LE, Egeblad M, Ewald AJ, Pai SY, Ho IC, Werb Z: GATA-3 links tumor differentiation and dissemination in a luminal breast cancer model. Cancer Cell. 2008, 13: 141-152. 10.1016/j.ccr.2008.01.011.View ArticlePubMedPubMed CentralGoogle Scholar
- Fischer J, Klein PJ, Farrar GH, Hanisch FG, Uhlenbruck G: Isolation and chemical and immunochemical characterization of the peanut-lectin-binding glycoprotein from human milk-fat-globule membranes. Biochem J. 1984, 224: 581-589.View ArticlePubMedPubMed CentralGoogle Scholar
- Chen L, O'Bryan JP, Smith HS, Liu E: Overexpression of matrix Gla protein mRNA in malignant human breast cells: isolation by differential cDNA hybridization. Oncogene. 1990, 5: 1391-1395.PubMedGoogle Scholar
- Holmes MD, Pollak MN, Hankinson SE: Lifestyle correlates of plasma insulin-like growth factor I and insulin-like growth factor binding protein 3 concentrations. Cancer Epidemiol Biomarkers Prev. 2002, 11: 862-867.PubMedGoogle Scholar
- Key TJ, Appleby PN, Reeves GK, Roddam AW: Insulin-like growth factor 1 (IGF1), IGF binding protein 3 (IGFBP3), and breast cancer risk: pooled individual data analysis of 17 prospective studies. Lancet Oncol. 2010, 11: 530-542.View ArticlePubMedGoogle Scholar
- De Souza Rocha Simonini P, Breiling A, Gupta N, Malekpour M, Youns M, Omranipour R, Malekpour F, Volinia S, Croce CM, Najmabadi H, Diederichs S, Sahin O, Mayer D, Lyko F, Hoheisel JD, Riazalhosseini Y: Epigenetically deregulated microRNA-375 is involved in a positive feedback loop with estrogen receptor alpha in breast cancer cells. Cancer Res. 2010, 70: 9175-9184. 10.1158/0008-5472.CAN-10-1318.View ArticlePubMedGoogle Scholar
- The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1755-8794/5/46/prepub
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.