- Research article
- Open Access
- Open Peer Review
Comprehensive expressional analyses of antisense transcripts in colon cancer tissues using artificial antisense probes
BMC Medical Genomicsvolume 4, Article number: 42 (2011)
Recent studies have identified thousands of sense-antisense gene pairs across different genomes by computational mapping of cDNA sequences. These studies have shown that approximately 25% of all transcriptional units in the human and mouse genomes are involved in cis-sense-antisense pairs. However, the number of known sense-antisense pairs remains limited because currently available cDNA sequences represent only a fraction of the total number of transcripts comprising the transcriptome of each cell type.
To discover novel antisense transcripts encoded in the antisense strand of important genes, such as cancer-related genes, we conducted expression analyses of antisense transcripts using our custom microarray platform along with 2376 probes designed specifically to detect the potential antisense transcripts of 501 well-known genes suitable for cancer research.
Using colon cancer tissue and normal tissue surrounding the cancer tissue obtained from 6 patients, we found that antisense transcripts without poly(A) tails are expressed from approximately 80% of these well-known genes. This observation is consistent with our previous finding that many antisense transcripts expressed in a cell are poly(A)-. We also identified 101 and 71 antisense probes displaying a high level of expression specifically in normal and cancer tissues respectively.
Our microarray analysis identified novel antisense transcripts with expression profiles specific to cancer tissue, some of which might play a role in the regulatory networks underlying oncogenesis and thus are potential targets for further experimental validation. Our microarray data are available at http://www.brc.riken.go.jp/ncrna2007/viewer-Saito-01/index.html.
Non-coding RNAs are one class of RNAs that do not encode proteins but have specific cellular activities. Some non-coding RNAs are antisense RNAs encoded on the antisense strand of protein-coding genes. Recent progress in sequencing technologies has allowed the rapid sequence analysis of the large amount of RNAs that are transcribed in the cell. For example, the large-scale cDNA sequencing projects conducted by the FANTOM consortium revealed that a large proportion of the mouse genome is transcribed into RNAs and that many of these RNAs do not have protein-coding potential and thus are considered non-coding RNAs [1, 2]. Studies using tiling arrays further support these observations and have revealed the presence of RNA-encoding regions in the genome by computational mapping . Computational mapping has revealed that many RNAs are located on the antisense strand of the protein-coding genes in the same genomic region; approximately 25% of transcriptional units are involved in cis-sense-antisense gene pairs in the human and mouse genomes, thus suggesting the presence of more sense-antisense gene pairs than previously thought [4–9]. Although the experimentally determined functions of antisense RNAs are limited, some have been shown to regulate transcription from the sense strand . Thus, expressional analyses of sense-antisense transcripts may shed light on the mechanisms underlying the control of these transcripts.
In our previous studies [7, 10], we conducted comprehensive expressional analyses of sense-antisense transcripts in human and mouse cells. In these studies large amounts of transcripts were detected by the random priming method, but not by the oligo-dT priming method, thus suggesting that these transcripts are poly(A)- and not reflected by cDNA sequences, which rely on the existence of poly(A) tails. Comprehensive cluster analysis of the expression ratio of sense and antisense transcripts is a powerful approach to characterize expression patterns of sense-antisense transcripts in various tissues. We found that expression balances of some of these transcripts were altered in specific tissues. Furthermore, Northern hybridization analysis of several selected sense-antisense transcripts showed smeary hybridization patterns in the mouse, thus indicating that the transcribed region in the genome may vary and that the cDNA sequences do not always reflect the actual sizes of the transcripts. Therefore, the design of microarray probes based on cDNA sequences obtained by large-scale cDNA sequencing projects is not appropriate for the detection of all transcripts comprising the transcriptome.
Subsequently, we designed microarray probes referred to as artificial antisense sequence (AFAS) probes specific for sequences in the antisense strand of known transcription units originating from the sense strand. In this way, we were able to detect the expression of novel antisense RNAs, which were not detected by using cDNA sequences . These AFAS probes were spotted onto microarray slides and used in our microarray platform to generate expression data for the analysis of antisense transcripts. Using mouse adult tissues, we previously found that 66.1% of 635 genes targeted for such expressional analysis showed positive expression from the antisense strand, suggesting that our AFAS probes can efficiently detect novel antisense transcripts .
We then chose mouse cancer tissues for expressional analyses of sense-antisense transcripts, because most oncogenesis is caused by abnormalities in gene sequences or in the regulatory systems controlling gene expression, and possibly also by unknown RNAs transcribed from the antisense strand. Uncovering how gene expression is regulated in cancer cells may lead us to a better understanding of the mechanisms and signaling networks underlying oncogenesis and metastasis . Several early studies succeeded in identifying genes expressed specifically in cancer cells; however, most studies have focused on investigating the expression of messenger RNAs, which encode proteins [13–17].
Several studies have identified non-coding RNAs expressed specifically in cancer cells [18, 19]; however, those focused on antisense RNAs are limited [20, 21]. Using AFAS probes, we found that expression of antisense transcripts from 95 well-annotated genes showed altered expression level in cancer tissues . Some of their expressions were validated using RT-PCR, Northern blot analysis, and in situ hybridization, thus suggesting that our platform can efficiently detect novel antisense transcripts, some of which may be regulated in cancer cells.
Here, using the same microarray platform, we analyzed expression patterns of antisense transcripts in human tissue. First we characterized the expression patterns of sense-antisense transcripts, based on available cDNA sequences, in colon (colorectal) cancer tissues and in normal tissues surrounding the cancer tissues. Although expression balances (ratios) of most of sense and antisense transcript pairs did not change between patients or between normal and cancer tissues, we found 68 sense-antisense transcripts whose expression balances were altered specifically in colon cancer tissues. Then, to identify novel antisense RNAs expressed in cancer tissues, we used the AFAS probes to analyze the expression patterns of sequences in the antisense strand of well-known protein-coding genes that are suitable for cancer research as defined in the Atlas Human Cancer 1.2 Array. Sixty-mer probes were designed for every 500 bps of the antisense strand of protein-coding genes. We used the oligo-dT and random priming methods for transcript generation in order to screen both the poly(A)+ and poly(A)- transcript populations in total RNA extracted from colon cancer samples obtained from 6 patients.
Using random priming, we detected gene expression with approximately 40% of the AFAS probes, which correspond to approximately 80% of the well-known genes. Among them, 172 probes detected twofold differences in expression levels of antisense transcripts between cancer tissue and normal tissue, demonstrating the usefulness of our approach to discover novel antisense transcripts, some of which might be related to the regulation of gene expression in cancer cells. Although we used only colon cancer tissues in this study, we believe that our approach is applicable to other types of cancer tissues.
The expression balance of 68 cDNA-based sense-antisense transcripts is altered in colon cancer tissues
We previously established a custom microarray platform (designated as 11 k) to analyze the expression patterns of sense-antisense transcripts in human and mouse [10, 11]. These sense-antisense transcripts were identified on the basis of available cDNA sequences as . For validation of this microarray platform, three sense-antisense pairs were selected for Northern and in situ hybridization analyses to confirm their transcription [7, 10]. Here, we conducted DNA microarray analyses by using the same set of probes designed for 2621 sense-antisense pairs in  to detect transcripts expressed in colon cancer tissues. These probes comprise 2358 pairs for the detection of protein-coding transcripts only, 250 pairs for the detection of protein-coding transcripts paired with non-protein-coding transcripts, and 13 pairs for the detection of non-protein-coding transcripts only. Protein-coding transcripts, as defined in our previous study , are those giving a significant hit to known protein sequences by BLASTX searches (homology search from DNA to protein). Otherwise, we defined the transcripts as non-coding. Total RNAs were extracted from colon cancer tissue and normal tissue surrounding the cancer tissue obtained from 6 patients. Fluorescence labeling of transcripts was performed by using both oligo-dT and random priming methods.
Microarray data analysis revealed that the expression ratios between sense and antisense transcripts were clustered, as shown in our previous studies (Additional file 1: Figure S1) [7, 10]. Although most of the expression ratios between sense and antisense transcripts did not change between patients or between normal and cancer tissues, we found sense-antisense transcripts showing altered expression balances in colon cancer tissues. For example, Figure 1 (also listed in Additional file 2: Table S1) shows that the expression balances of 68 sense-antisense transcript pairs were reversed in cancer tissues. For 29 pairs among them, the sense transcripts showed a >10% increase in expression levels in cancer tissues, whereas the corresponding antisense transcripts showed a similar decrease in at least three of six patients (according to expression data obtained by random priming). For the remaining 39 pairs, the expression levels of the sense transcripts was >10% greater in normal tissues than in cancer tissues, whereas the corresponding antisense transcripts showed a similar level of increased expression in cancer tissues. We analyzed functional annotations (GO, Gene Ontology ) of genes corresponding to these sense-antisense pairs. The cellular processes in which these genes are involved were biased toward signal transduction, cell adhesion, and cell differentiation, which play important roles in oncogenesis and metastasis (Additional file 2: Table S2). The up-regulation and down-regulation of sense-antisense transcripts can be more precisely assessed by analysis of the expression ratios between sense and antisense transcripts in normal and cancer tissues (Additional file 1: Figure S2).
In this analysis, we identified six sense-antisense gene pairs in which the sense gene was protein-coding and the antisense gene was non-coding, as defined previously . Interestingly, two of the sense genes were involved in cell differentiation, which may be related to cancer. Subsequently, we screened for novel non-coding transcripts that might be encoded by the antisense strand. Our previous work showed that vast amounts of transcripts were poly(A)- [7, 10] and thus they may not be detected by a cDNA-based approach. Therefore, we screened for these novel transcripts by using the AFAS probes, which could detect poly(A)- transcripts by using random priming approach.
AFAS transcripts are expressed from about 80% of the well-known genes and most are poly(A)-
Our established microarray platform (custom microarray platform 11 k) comprises 501 probes covering well-known genes suitable for cancer research, including oncogenes and tumor suppressors. We obtained expression data for cancer tissue obtained from 6 patients and validated these data by comparison with previously obtained data, as described in the methods section. Our microarray platform also contains 2376 AFAS probes designed to correspond to every 500 bases of the antisense strand of exonic regions of well-known genes suitable for cancer research to screen for transcripts originating from the antisense strand. These genomic regions, genes encoded in these regions, and transcripts originating from these regions, were designated as AFAS regions, AFAS genes, and AFAS transcripts, respectively. The AFAS probes detected fewer transcripts than the probes corresponding to protein coding genes on the sense strand. Although most of the AFAS regions were not transcribed, or were expressed only at low levels, the detection of transcripts by the AFAS probes implies the existence of antisense genes (Figure 2). For example, for normal and cancer tissue, 14.8% and 8.3% of AFAS probes, respectively, detected expression intensities satisfying a conservative threshold (≥100), according to expression data obtained by oligo-dT priming. As expected, the sense probes detected markedly more transcripts, with 64.1% and 64.2% of the sense probes detecting expression intensities in normal and cancer tissues, respectively, that satisfy the same threshold. By comparison, more transcripts were detected with AFAS probes by using the random priming approach to prepare target cDNAs; approximately 39.4% and 38.2% of the AFAS probes satisfied the conservative threshold (≥100) in normal and cancer tissues, respectively, by using random priming (P = 7.19×10-1061, χ2 test). The rates of probes detecting positive expression according to more moderate criteria provided by Agilent's platform are summarized in Additional file 2: Table S3. Figure 2 shows the higher overall expression levels detected by AFAS probes using random priming compared with oligo dT priming in both normal and cancer tissues. Since oligo-dT priming is expected to label mRNAs, including unspliced pre-mRNAs having poly(A) tails, whereas random priming is expected to label all RNAs in a cell, these results indicate that most of the transcripts originating from the AFAS regions (AFAS transcripts) are poly(A)-. This finding is consistent with the results of a previous study showing that the majority of sense-antisense transcripts are poly(A)- . The possibility that these poly(A)- transcripts are localized in the nucleus must be confirmed by biological experiments. It is noteworthy that publicly available cDNA sequences originating from the antisense strand can be found for only 139 (27.7%) of the 501 well-known genes used in the present study. This calculation is based on the set of cDNA sequences used in our previous work . Approximately 70% of the AFAS probes that detected gene expression did not correspond to these cDNA sequences. Thus, we suggest that these probes correspond to potentially novel antisense RNAs. At least one AFAS probe showed a median expression level (≥100), using random priming, for 78.2% and 77.4% of the well-known genes in normal and cancer tissues, respectively. Although cross-hybridization of AFAS probes with transcripts must be considered, our results reveal that our microarray platform utilizing the AFAS probes and the random priming method to prepare transcripts has the capacity to detect novel transcripts from the antisense strand. Therefore, for subsequent analyses, we focused on the expression data using random priming to investigate the possible regulatory functions of identified poly(A)- RNAs.
The expressional level of 172 transcripts identified with the AFAS probes is altered in colon cancer tissue
Transcripts identified by AFAS probes with altered expression levels in cancer tissue might play a role in oncogenesis. Therefore, we first calculated the ratio of the median expressional level between normal tissues and cancer tissues (cancer to normal ratio) for each sense and AFAS transcript (Additional file 1: Figure S3). We found many putative antisense genes (AFAS genes, detected by the AFAS probes) with altered expression in cancer tissues compared with normal tissue. We identified 71 AFAS transcripts expressed more than twofold higher in colon cancer tissues than in normal tissues and 101 AFAS transcripts expressed in colon cancer tissues at a level less than 50% of the level found in normal tissue. These numbers were much greater than the expected number of transcripts, which was identified as 50.14 ± 6.13 S.D by the random shuffling test.
We chose three AFAS probes that showed alterations in expression level in cancer tissue by microarray analysis and conducted RT-PCR to verify these results. Our RT-PCR results (Figure 3) showed that for all three antisense probes there were notable changes in expression level between cancer tissues and normal tissues.
We also analyzed how the expression level of sense transcript changes in response to the expressional changes of antisense transcripts detected by AFAS probes. When the oligo-dT priming method was used to prepare the transcripts, the number of sense-AFAS pairs that showed a concerted increase or decrease in expression level in cancer and normal tissues was 1451, which is statistically higher than the expected number of 1260.19 (P = 2.0×10-20, χ2 test). This finding is consistent with the results of a previous study showing a positive correlation between the expression of sense and antisense transcripts [4, 20]. However, no such positive correlation was observed for expression data obtained by random priming.
The cancer to normal ratio for the sense and antisense transcript in each pair can be used to detect sense-antisense transcript pairs having correlated or anti-correlated expression, as shown in Additional file 1: Figure S3 for random priming. For example, there are 21 sense-AFAS pairs where expression of the sense transcript is increased twofold or more while the expression of the AFAS transcript is decreased by 50% or more in cancer tissues (plots in the lower right of the fourth quadrant of Additional file 1: Figure S3).
The sense to antisense ratio is a powerful measure to characterize the expression balance of the sense and antisense genes in particular genomic regions as demonstrated by our previous studies [7, 10]. To obtain a global picture of changes in such expression balances for the sense-AFAS pairs, we conducted clustering analyses. In particular, for data obtained by random priming we calculated the sense to AFAS expression ratio for each sense-AFAS pair and conducted hierarchical clustering (Additional file 1: Figure S4). By aligning the clustering results, we then compared the expression balance patterns for data obtained by oligo-dT priming with those obtained by random priming for each sense-antisense pair.
Most of the AFAS probes revealed lower expression levels than did the sense probes in both cancer tissue and normal tissue, especially for the expression data obtained by oligo-dT priming. However, by using the random priming approach we detected 306 sense-AFAS pairs for which the AFAS probes detected expression levels three times (according to the median expression level among the patients) those detected by the corresponding sense probes, in both cancer tissue and normal tissue. In contrast, only 91 sense-AFAS pairs were detected by the same criteria using oligo-dT priming (P = 1.81×10-29, χ2 test). This observation mostly corresponds to the red region in the upper part of the heat map designated as "random" in Additional file 1: Figure S4.
We then observed sense-AFAS probe pairs having altered expression balances among normal and cancer tissues; for example there were 44 sense-AFAS probe pairs for which the sense transcript showed higher expression than AFAS transcript in cancer tissues but the balance was reversed in normal tissues for at least three of the six patients (Figure 4a and Table 1). We noted that for these 44 sense-AFAS pairs, there were at least 10% differences in expression levels detected by sense and AFAS probes, and both sense and AFAS probes showed at least a 10% change in expression level between normal and cancer tissues, for these three patients. The observed number, 44, was much higher than the expected number 12.07 ± 3.11 SD (Random shuffling test). If the threshold of 10% was changed to 20% or 30%, the observed number of pairs decreased to 35 and 21, respectively, although the rate of biologically relevant pairs within these pairs might increase. We also screened for sense-AFAS probe pairs for which the sense transcript showed lower expression than the AFAS transcripts in cancer tissues but for which the balance was reversed in normal tissues. We obtained five pairs consistent with this scenario (Figure 4b and Table 1). We also identified sense-AFAS transcript pairs for which the sense transcript showed higher (lower) expression than that of the AFAS gene, in cancer tissues, where expression levels were based on expression data from oligo-dT priming for sense transcripts and random priming for AFAS transcripts (Additional file 2: Table S4ab). This procedure might be rationalized by the better detection of protein-coding gene expression by using the oligo-dT priming approach and, conversely, the better detection of RNA expression by the random priming approach. Although transcripts detected by oligo-dT priming and those detected by random priming methods may function in different cellular processes, they might be associated at the level of gene expression. The sense-AFAS probe pairs listed in Table 1 are also indicated in Additional file 1: Figure S3 (designated as Sense up, AFAS down, corresponding to the first 44 pairs in Table 1 and Sense down, AFAS up, corresponding to the last five pairs in Table 1).
The biological relevance of sense-AFAS genes can be inferred from their expression patterns
To obtain a global view of the functions of the genes corresponding to the sense-AFAS transcript pairs showing altered expression balances among normal and cancer cells (shown in Table 1 and Additional file 2: Table S4), we classified the known functions of the sense genes on the basis of the annotations given in the Atlas Human Cancer 1.2 Array (Additional file 2: Table S5). Approximately 30% of the sense genes showing altered expression balances with their corresponding AFAS genes were oncogenes, tumor suppressor genes, or genes related to cell cycle function. This finding demonstrates that the design of our AFAS probes had the capacity to efficiently detect transcripts originating from the antisense strand of genes related to oncogenesis.
We generated bar graphs for the sense-AFAS pairs shown in Table 1 and displayed the clearest cases in Figures 5 and 6. One of the examples is the sense-AFAS transcript pair detected with the sense probe for mitogen-activated protein kinase 9 (MAPK9) and the corresponding AFAS probe, L31951-03 (Figure 5). In this case, the overall expression level of the sense gene was higher in cancer tissues than in normal tissues, and vice versa for the AFAS gene, based on the generation of target cDNAs with the random priming approach. Four patients (No. 2, 12, 13, and 15) showed clear expression balance alterations of the sense-AFAS transcripts. The oligo-dT priming approach tended to give higher expression levels of the sense genes than did the random priming approach, presumably because of the higher efficiency of oligo-dT priming for labeling transcripts with poly(A) tails. However, the random priming approach was more efficient than the oligo-dT priming approach in allowing the detection of the putative antisense transcripts (AFAS transcripts) as well as the detection of the changes in expression level of these transcripts in normal tissue and cancer tissues. Furthermore, the expression patterns of AFAS transcripts were more likely to be different with the two priming approaches (average correlation coefficient = 0.143) than those of the sense genes (average correlation coefficient = 0.358, P = 2.2×10-16, Mann-Whitney test). This result suggests that the sets of transcripts originating from the AFAS regions might differ depending on the priming method and that random priming is the more efficient approach for detecting these transcripts.
In some cases, multiple AFAS probes simultaneously detected altered expression balances of specific sense-AFAS transcripts. For example, three AFAS probes designed to detect the transcripts encoded in the antisense strand of BRCA2 (breast cancer 2, early onset, U43746) revealed altered expression balances for the sense-AFAS transcripts originating from this gene (Table 1). This result suggests that for BRCA2, multiple transcriptional units are under the same regulatory control or, alternatively, that a single long transcription unit arises from the antisense strand. On the other hand, the expression levels detected by some AFAS probes were limited to a specific region of a gene. For example, of the five AFAS probes designed to detect the transcripts encoded in the antisense strand of cancer-related gene ADAM metallopeptidase domain 17 (ADAM17, U69611) only U69611-04 revealed an altered expression balance with the sense transcript (Additional file 1: Figure S5). In this case, it is possible that the AFAS probe maps to the transcriptional unit arising from the limited region of the antisense strand. We have preliminary results suggesting that transcripts detected by AFAS probes are more likely to be situated at the 5' end of the corresponding sense genes (Additional file 1: Figure S6), a result that we also reported in our previous study .
In the present study we screened for novel antisense transcripts using our previously established microarray platform with AFAS probes and identified those expressed at different levels in cancer tissues and normal tissues (Table 2). Although many studies have focused on investigating the expression of sense genes with poly(A) tails in cancer tissue, very few studies at a genome-wide level have investigated antisense genes with no poly(A) tail. Our previous [7, 10, 11] and present studies suggest that such antisense genes are transcribed from many of the sense genes and should be included in a thorough investigation of the gene regulatory networks underlying oncogenesis.
Hierarchical clustering of expression ratio of sense and antisense genes followed by heat map creation is a powerful approach to characterize genomic regions from the view point of expression balances among the two strands; it shows genomic regions where either sense or antisense strand is highly expressed compared to the other strand. We expected that for most of the genomic regions, sense genes are highly expressed compared to antisense genes especially because sense genes are well-known whereas most of their antisense counterparts are unknown. This was the case for expression data obtained by oligo-dT priming, but not for those obtained by random priming. We found that 12.9% of sense-AFAS probe pairs have higher expression, by more than threefold, of the AFAS probe compared to the sense probe as clearly observed in heat map (Additional file 1: Figure S4). Also some of AFAS probes showed such high expression specifically in either normal or cancer tissues (Figure 4).
Three antisense transcripts detected by our AFAS probes showed a similar pattern of expression by RT-PCR (Figure 3). More complete validations of antisense transcripts detected by our microarray platform were conducted by RT-PCR, Northern analysis, and in situ hybridization in our previous work . Our observed expression patterns of sense genes among normal and cancer tissues were consistent with those in previously published work .
The reverse transcription step employed to prepare cDNA for microarray analysis was shown to generate antisense artifacts . This could be a potential problem for our random priming approach since we labeled cDNA generated by reverse transcription. If the synthesis of such artifacts was frequent, we would observe a positive correlation between expression levels of sense and antisense transcripts. However, we observed no such correlation in random priming. Therefore, we suggest that the level of artifacts arising from random priming is minimal. We observed many cases that cannot be attributed to artifacts where the expression level of the antisense transcripts is much greater than that of the corresponding sense transcripts (Figures 5 and 6). Although some of these antisense transcripts might be attributed to transcriptional noise , which is possibly enhanced in differentiating cancer cells, we expect that the certain number of these antisense transcripts are functional and include non-coding RNAs. The genomic positions of our detected antisense RNAs are biased towards the 5' end of sense genes (Additional file 1: Figure S6 and ). This observation is consistent with previous reports [5, 26] and transcriptions of some of these antisense RNAs are suggested to be a part of processes to maintain chromatin structures for subsequent gene regulation .
There are several biological interpretations for the function of antisense transcripts detected by AFAS probes showing altered expression patterns between normal and cancer tissues. One interpretation is that they are the result of oncogenesis. For example, during the process of de-differentiation in oncogenesis, sense or antisense RNA expression could be altered. Another interpretation is that the altered expression pattern of the antisense transcripts might be attributed to processes leading to cancer. Some antisense transcripts may function in the cell by expressional control of sense genes through epigenetic regulation [21, 28]. The recently discovered non-coding RNA p15AS, which is encoded by the antisense strand of the tumor suppressor gene p15, was shown to regulate the expression of p15 through heterochromatin formation . This study revealed that p15AS can suppress the expression of p15 and thus p15 and p15AS have altered expression balances among normal and leukemia cells; p15 was down-regulated in leukemia cells, whereas p15AS was up-regulated . Although identification of antisense RNAs which suppress sense transcripts based solely on microarray data is very difficult, we extracted 49 sense-AFAS probes showing such altered expression balances (Table 1).
We suspect that Table 1 contains a number of sense-AFAS gene pairs with an important role in the regulatory networks underlying oncogenesis. One such example is the sense-AFAS probe pair X07876-03, which detects expression of wingless-type MMTV integration site family member 2 (WNT2). WNT2 is known to be up-regulated in colon cancer and this up-regulation may be an underlying cause of oncogenesis [29, 30]. Another example is U58334-02 (Figure 6), which detects expression of tumor protein p53 binding protein 2 (TP53BP2). TP53BP2 stimulates apoptosis by interacting with the tumor suppressor p53. Since this process does not contribute to oncogenesis, the altered expression balance displayed detected by U58334-02 may be the result of oncogenesis. For cadherin 3 (CDH3), the expression of which is detected by the sense-AFAS probe pair X63629-03 (Table 1), a previous study showed that its expression is increased in breast cancer . Thus, it is possible that the expression of CDH3 is also increased in colon cancer. Clues to the function of CDH3 may be gained from investigations of the effect of CDH3 up-regulation on oncogenesis.
Sense-antisense gene pairs that are not directly related to cancer might also show altered expression balances in cancer tissues. Besides the 49 sense-AFAS pairs, we identified 68 sense-antisense transcript pairs among those identified  in colon cancer patients showing altered expression balances as already described (Figure 1, Additional file 1: Figures S1-2 and Additional file 2: Table S1).
Cell population heterogeneity should be taken into account in microarray gene expression analyses since it might cause the expression level of an antisense transcript may to be altered and be masked by the expression patterns of other cell types. Thus, clear expression changes might be observed by transcriptional analysis at a single-cell level. Further investigations involving knock-down or over-expression experiments would contribute to the identification of antisense transcripts with an important regulatory role.
For some sense genes, multiple AFAS probes showed a high level of gene expression, whereas for the other sense genes, only a single AFAS probe showed a high level of expression. In the former case, long transcripts or multiple short transcripts are transcribed from the antisense strand. Since our AFAS probes are designed for every 500 bases, the resolution is low. Further experiments such as Northern blot analysis could be used to accurately determine the size of these transcripts.
Finally, our microarray approach, combined with biological validation, provides an efficient approach to elucidate the regulatory pathways underlying altered expression of antisense transcripts and might uncover important roles for these transcripts in oncogenesis. As a result, this approach has the potential to lead to the identification of novel biomarkers for cancer.
This study analyzed the expression patterns of well-known genes and their putative transcriptional units originating from the corresponding antisense strand using our original probes designed specifically to detect antisense strand transcripts. We discovered that many antisense regions are transcribed and that most of these transcripts are likely to be poly(A)-. Furthermore, we identified antisense transcripts with altered expression in cancer tissues as well as those in sense-antisense transcript pairs displaying altered expression balances in normal tissue and cancer tissue. Taken together, these results provide further insight into the regulation of genes and their antisense transcripts in cancer tissues.
Preparation of tissue samples
For microarray analysis, we prepared colon cancer tissue and normal tissue samples that were ~10 cm away from the cancer tissue, from six patients. Clinical data of the patients are presented in Additional file 2: Table S6. Pathological examinations demonstrated that all tumors included in the study were adenocarcinoma. The extracted normal tissue was composed mostly of normal mucosa. The colon cancer and normal tissue samples were obtained during surgical resection of solitary cancerous lesions in the colon. Each tissue sample was homogenized in liquid nitrogen and total RNA was extracted using TRIzol Reagent (Gibco BRL, Carlsbad, California). Transcripts were labeled with fluorescent dyes by using either oligo-dT or random priming.
The study protocol conformed to the ethical guidelines of the Declaration of Helsinki (1975). All patients provided written informed consent for the analysis of the biopsy specimens and the hospital ethics committee approved the study.
Probe design and microarray analysis of well-known genes and their AFAS
Microarray probes were designed for sense-antisense transcripts identified by genome mapping of cDNA sequences. Using the global mean scaling method, the expression intensities of probes in each array were normalized to those in brain tissue obtained by oligo-dT priming as previously described .
The list of well-known genes suitable for cancer research was obtained from the Atlas Human Cancer 1.2 Array (Cat. No. 634529, Clontech). For each gene, we designed probes for every 500 bps to detect expression from the antisense strand. The optimal position of each probe was determined by using the tools provided by Agilent. We excluded well-known genes if the AFAS probe design failed. Some AFAS probes were designed on the exon-intron boundaries of the sense strand and were designated as "truncated" (Additional file 1: Figure S5). These probes were not included in the analysis shown in Figure 2. We designed a total of 2376 AFAS probes covering 501 well-known genes.
Microarray data used in this study have been deposited in GEO and have been assigned series accession number GSE14397 and GSE14398.
To validate our expression data, we investigated whether well-known genes known to be regulated in colon cancer tissues show similar expression profiles in our system. For each gene, we calculated the median expression level among patients for normal and cancer tissues and assessed whether the median expression level increased or decreased in cancer tissues compared with normal tissues. For this analysis, we used microarray data obtained from oligo-dT primed transcripts because the transcripts of most protein-coding have poly(A) tails. We compared our gene expression data with previously reported gene expression analyses of colon cancer tissues . Of the 245 genes that were previously reported as showing altered expression in colon cancer, 28 genes were included in our list of well-known genes. We showed that 11 of the 14 genes previously shown to be up-regulated in colon cancer had a higher level of expression in cancer tissues compared with normal tissues. We also showed that 12 of the 14 genes previously shown to be down-regulated in colon cancer had a lower level of expression in cancer tissues compared with normal tissues. These results show that our data correlate well with the data of previous studies, thus validating our expression data. Our microarray data obtained from random priming of transcripts were similar, although we did not observe the down-regulation of genes previously shown to be down-regulated in colon cancer.
We note that although there are correlations among expression levels detected by multiple AFAS probes for each well-known gene and also among expression levels for multiple patients detected by each probe, we assumed that there were no such correlations in order to roughly estimate the P-values based on the χ2 tests.
Detection of antisense transcripts by RT-PCR
We performed RT-PCR assays on commercial total RNA: Human Colon Tumor Total RNA (Clontech), Human Colon Total RNA (Clontech). We also prepared total RNA from HUC-Fm (Normal umbilical cord fibroblast, Cell Bank). For RNA isolation and cDNA synthesis, we treated total RNA with RQ1 RNase-free DNase (Promega) to remove contaminating genomic DNA and then further purified the RNA using Trizol reagent (Invitrogen) according to the manufacturer's recommended protocols. RNA concentration and the 260/280 nm ratios were determined with a NanoDrop ND-1000 spectrophotometer (NanoDrop Technologies). Total RNA (1 μg) was reverse transcribed with 50 U of Super ScriptTM III reverse transcriptase (Invitrogen) using 2.5 ng/μl of strand specific primers according to manufacturer's instructions. Forward primers were used to transcribe cDNA from the (-) strand RNA. cDNA samples were diluted 1:5 in DEPC-treated water prior to real-time PCR.
We performed real-time PCR by using the Thermal Cycler Dice® Real Time System (Takara) for quantification of transcripts. Reactions were done in 10 μl volumes containing 200 nM of each primer, 1.0 μl cDNA, and 5 μl SYBER® Premix EX Taq (Takara). We used the following primer sequences. For GAPDH-AS 5'- AGTCCACTGGCGTCTTCACC -3 and 5' - ATGGTTCACACCCATGACGAAC -3'; for Cxcl12-AS 5'- TCCCTCACATGTCAGTACCTTCA - 3 and 5' - GAATCTTCATGTCCAGGATTGG- 3; for BRCA2-AS 5' - GGAAGTCTGTTTCCACACCTGTC -3 and 5' - CAGGTGGAGGTAAAGGCAGTCTA - 3; for TP53BP2-AS 5' - GCTCTTCACAATGCTGTGTGTG - 3 and 5'-AAACCGATTCCCACAACTGAAG -3. Reactions were run using the manufacture's recommended cycling parameters of 95 °C for 10 s and 40 cycles of 95 °C for 5 s and 60 °C for 30 s. The dissociation curve for each sample was then analyzed to verify the specificity of each reaction. The assays were performed in duplicate and the copy number of each RNA was calculated with Thermal Cycler Dice® Real Time System Software (Takara). The data for each sample were normalized to an internal standard (GAPDH).
Artificial Antisense Sequence
Carninci P, Kasukawa T, Katayama S, Gough J, Frith MC, Maeda N, Oyama R, Ravasi T, Lenhard B, Wells C, et al: The transcriptional landscape of the mammalian genome. Science. 2005, 309 (5740): 1559-1563.
Numata K, Kanai A, Saito R, Kondo S, Adachi J, Wilming LG, Hume DA, Hayashizaki Y, Tomita M: Identification of putative noncoding RNAs among the RIKEN mouse full-length cDNA collection. Genome Res. 2003, 13 (6B): 1301-1306.
Kapranov P, Cheng J, Dike S, Nix DA, Duttagupta R, Willingham AT, Stadler PF, Hertel J, Hackermuller J, Hofacker IL, et al: RNA maps reveal new RNA classes and a possible function for pervasive transcription. Science. 2007, 316 (5830): 1484-1488. 10.1126/science.1138341.
Engstrom PG, Suzuki H, Ninomiya N, Akalin A, Sessa L, Lavorgna G, Brozzi A, Luzi L, Tan SL, Yang L, et al: Complex Loci in human and mouse genomes. PLoS Genet. 2006, 2 (4): e47-10.1371/journal.pgen.0020047.
He Y, Vogelstein B, Velculescu VE, Papadopoulos N, Kinzler KW: The Antisense Transcriptomes of Human Cells. Science. 2008, 322 (5909): 1855-1857. 10.1126/science.1163853.
Katayama S, Tomaru Y, Kasukawa T, Waki K, Nakanishi M, Nakamura M, Nishida H, Yap CC, Suzuki M, Kawai J, et al: Antisense transcription in the mammalian transcriptome. Science. 2005, 309 (5740): 1564-1566.
Kiyosawa H, Mise N, Iwase S, Hayashizaki Y, Abe K: Disclosing hidden transcripts: mouse natural sense-antisense transcripts tend to be poly(A) negative and nuclear localized. Genome Res. 2005, 15 (4): 463-474. 10.1101/gr.3155905.
Kiyosawa H, Yamanaka I, Osato N, Kondo S, Hayashizaki Y: Antisense transcripts with FANTOM2 clone set and their implications for gene regulation. Genome Res. 2003, 13 (6B): 1324-1334.
Numata K, Okada Y, Saito R, Kiyosawa H, Kanai A, Tomita M: Comparative analysis of cis-encoded antisense RNAs in eukaryotes. Gene. 2007, 392 (1-2): 134-141. 10.1016/j.gene.2006.12.005.
Okada Y, Tashiro C, Numata K, Watanabe K, Nakaoka H, Yamamoto N, Okubo K, Ikeda R, Saito R, Kanai A, et al: Comparative expression analysis uncovers novel features of endogenous antisense transcription. Hum Mol Genet. 2008, 17 (11): 1631-1640. 10.1093/hmg/ddn051.
Numata K, Osada Y, Okada Y, Saito R, Hiraiwa N, Nakaoka H, Yamamoto N, Watanabe K, Okubo K, Kohama C, et al: Identification of novel endogenous antisense transcripts by DNA microarray analysis targeting complementary strand of annotated genes. BMC Genomics. 2009, 10: 392-10.1186/1471-2164-10-392.
Cui Q, Ma Y, Jaramillo M, Bari H, Awan A, Yang S, Zhang S, Liu L, Lu M, O'Connor-McCourt M, et al: A map of human cancer signaling. Mol Syst Biol. 2007, 3: 152.
Golub TR, Slonim DK, Tamayo P, Huard C, Gaasenbeek M, Mesirov JP, Coller H, Loh ML, Downing JR, Caligiuri MA, et al: Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science. 1999, 286 (5439): 531-537. 10.1126/science.286.5439.531.
Ramaswamy S, Tamayo P, Rifkin R, Mukherjee S, Yeang CH, Angelo M, Ladd C, Reich M, Latulippe E, Mesirov JP, et al: Multiclass cancer diagnosis using tumor gene expression signatures. Proc Natl Acad Sci USA. 2001, 98 (26): 15149-15154. 10.1073/pnas.211566398.
Ross DT, Scherf U, Eisen MB, Perou CM, Rees C, Spellman P, Iyer V, Jeffrey SS, Van de Rijn M, Waltham M, et al: Systematic variation in gene expression patterns in human cancer cell lines. Nature genetics. 2000, 24 (3): 227-235. 10.1038/73432.
Xu L, Geman D, Winslow RL: Large-scale integration of cancer microarray data identifies a robust common cancer signature. BMC Bioinformatics. 2007, 8: 275-10.1186/1471-2105-8-275.
Zhang L, Zhou W, Velculescu VE, Kern SE, Hruban RH, Hamilton SR, Vogelstein B, Kinzler KW: Gene expression profiles in normal and cancer cells. Science. 1997, 276 (5316): 1268-1272. 10.1126/science.276.5316.1268.
Calin GA, Liu CG, Ferracin M, Hyslop T, Spizzo R, Sevignani C, Fabbri M, Cimmino A, Lee EJ, Wojcik SE, et al: Ultraconserved regions encoding ncRNAs are altered in human leukemias and carcinomas. Cancer Cell. 2007, 12 (3): 215-229. 10.1016/j.ccr.2007.07.027.
Lu J, Getz G, Miska EA, Alvarez-Saavedra E, Lamb J, Peck D, Sweet-Cordero A, Ebert BL, Mak RH, Ferrando AA, et al: MicroRNA expression profiles classify human cancers. Nature. 2005, 435 (7043): 834-838. 10.1038/nature03702.
Grigoriadis A, Oliver GR, Tanney A, Kendrick H, Smalley MJ, Jat P, Neville AM: Identification of differentially expressed sense and antisense transcript pairs in breast epithelial tissues. BMC Genomics. 2009, 10: 324-10.1186/1471-2164-10-324.
Yu W, Gius D, Onyango P, Muldoon-Jacobs K, Karp J, Feinberg AP, Cui H: Epigenetic silencing of tumour suppressor gene p15 by its antisense RNA. Nature. 2008, 451 (7175): 202-206. 10.1038/nature06468.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nature genetics. 2000, 25 (1): 25-29. 10.1038/75556.
Bertucci F, Salas S, Eysteries S, Nasser V, Finetti P, Ginestier C, Charafe-Jauffret E, Loriod B, Bachelart L, Montfort J, et al: Gene expression profiling of colon cancer by DNA microarrays and correlation with histoclinical parameters. Oncogene. 2004, 23 (7): 1377-1391. 10.1038/sj.onc.1207262.
Perocchi F, Xu Z, Clauder-Munster S, Steinmetz LM: Antisense artifacts in transcriptome microarray experiments are resolved by actinomycin D. Nucleic acids research. 2007, 35 (19): e128-10.1093/nar/gkm683.
Struhl K: Transcriptional noise and the fidelity of initiation by RNA polymerase II. Nat Struct Mol Biol. 2007, 14 (2): 103-105. 10.1038/nsmb0207-103.
Taft RJ, Glazov EA, Cloonan N, Simons C, Stephen S, Faulkner GJ, Lassmann T, Forrest AR, Grimmond SM, Schroder K, et al: Tiny RNAs associated with transcription start sites in animals. Nature genetics. 2009, 41 (5): 572-578. 10.1038/ng.312.
Seila AC, Calabrese JM, Levine SS, Yeo GW, Rahl PB, Flynn RA, Young RA, Sharp PA: Divergent transcription from active promoters. Science. 2008, 322 (5909): 1849-1851. 10.1126/science.1162253.
Tufarelli C, Stanley JA, Garrick D, Sharpe JA, Ayyub H, Wood WG, Higgs DR: Transcription of antisense RNA leading to gene silencing and methylation as a novel cause of human genetic disease. Nature genetics. 2003, 34 (2): 157-165. 10.1038/ng1157.
Katoh M: Frequent up-regulation of WNT2 in primary gastric cancer and colorectal cancer. Int J Oncol. 2001, 19 (5): 1003-1007.
Shi Y, He B, Kuchenbecker KM, You L, Xu Z, Mikami I, Yagui-Beltran A, Clement G, Lin YC, Okamoto J, et al: Inhibition of Wnt-2 and galectin-3 synergistically destabilizes beta-catenin and induces apoptosis in human colorectal cancer cells. Int J Cancer. 2007, 121 (6): 1175-1181. 10.1002/ijc.22848.
Paredes J, Albergaria A, Oliveira JT, Jeronimo C, Milanezi F, Schmitt FC: P-cadherin overexpression is an indicator of clinical outcome in invasive breast carcinomas and is associated with CDH3 promoter hypomethylation. Clin Cancer Res. 2005, 11 (16): 5869-5877. 10.1158/1078-0432.CCR-05-0059.
The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1755-8794/4/42/prepub
We thank Takahiro Doi's laboratory and the RIKEN BioResource Center for microarray supply and RNA extraction. We also thank Hidemasa Kato (NIKON CORPORATION) for critically reading our manuscript. This research was supported in part by grants from the Non-coding RNA Project of the New Energy and Industrial Technology Development Organization (NEDO) of Japan and by research funds from the Yamagata prefectural government and Tsuruoka City, Japan.
The authors declare that they have no competing interests.
RS wrote the manuscript. RS, YO, YO, and KN conducted bioinformatics analyses. KK, SM, and NO prepared the tissue samples, and KK participated in the design of the experiments. KN, HN, NY, and HK developed the viewer. AK, HY, KA, M.T., and HK supported writing the manuscript. KW and HK conducted microarray experiments. HK designed the experiments and organized the project. All authors have read and approved the final manuscript.
Rintaro Saito, Keisuke Kohno contributed equally to this work.