Skip to main content


A network view of microRNA and gene interactions in different pathological stages of colon cancer



Colon cancer is one of the common cancers in human. Although the number of annual cases has decreased drastically, prognostic screening and translational methods can be improved. Hence, it is critical to understand the molecular mechanisms of disease progression and prognosis.


In this study, we develop a new strategy for integrating microRNA and gene expression profiles together with clinical information toward understanding the regulation of colon cancer. Particularly, we use this approach to identify microRNA and gene expression networks that are specific to certain pathological stages. To demonstrate the application of our method, we apply this approach to identify microRNA and gene interactions that are specific to pathological stages of colon cancer in The Cancer Genome Atlas (TCGA) datasets.


Our results show that there are significant differences in network connections between miRNAs and genes in different pathological stages of colon cancer. These findings point to a hypothesis that these networks signify different roles of microRNA and gene regulation in the pathogenesis and tumorigenesis of colon cancer.


Colon cancer, which is reported to be one of the few curable cancers, is one of the most common cancers around the world. The complex progress of colon cancer stage induces the poor prediction prognosis of colon cancer. According to previous studies, there is a 92% 5-year relative survival rate [1, 2] in stage I colon cancer. For patients with stage II colon cancer, there are two stage subtypes: stage IIA and stage IIB colon cancers [1, 2]. There is an 87% 5-year relative survival rate for stage IIA and 63% for stage IIB. Similarly, for stage III colon cancer there are three subtypes: stage IIIA, IIIB and IIIC colon cancers [1, 2]. In patients with stage IIIA, the 5-year relative survival rate is 89%, for stage IIIB it is 69% and 53% for stage IIIC [1, 2]. When the cancer has reached stage IV and metastasized to other parts of the body, the 5-year relative survival rate is significantly decreased to approximately 12%. The drastic decrease in survival rate in colon cancer speaks to the need for better early diagnostic and prognostic procedures.

The role of pathologic prognostic markers is important in the advancement of personalized medicine and can help reduce the risk of recurrence, especially in high-risk patients with stage II colon cancer [35]. Due to the benefits of personalized medicine, these patients have an increased overall survival with therapies such as adjuvant chemotherapy. Gene expression signatures have shown much promise as prognostic markers [6]. For example, the progression of colon cancer is directly linked to the functional epithelial-mesenchymal transition (EMT) gene expression signatures [7]. Genes ZEB1 and ZEB2 are known repressors that regulate targets in the EMT pathway by changing the phenotype of normal cells to cancerous cells [8]. These genes are also known to be present in the beginning of metastasis.

Cell invasion and migration are also critical components in colon cancer progression. For instance, genes PRKCQ and PRKCZ are members of the protein kinase family and PRKCZ is often involved in cell survival and cell migration in different cancers such as ovarian cancer [9]. It has also been reported that ARID4B is a key player in pathogenesis and is classified as a metastasis modifier gene. Over-expression of this gene is thought to enhance the cell migration process as well as cell invasion. In contrast, the knockdown of ARID4B, causes metastasis of cancer cells to other regions of the body [10, 11].

More recently, microRNA (miRNA) expression profiles have been utilized as predictive markers for survival of colon cancer [12]. Previous studies have reported several miRNAs were relevant with poor survival and therapeutic outcome in colon cancer [1315]. For example, miR−148, miR−26a−2 and miR−130a were identified to be significantly associated with a poor clinical prognosis [16]. Exploiting the downstream neighborhoods of genes with such a critical role in the pathogenesis of colon cancer provide long-term benefits in personalized medicine and adjuvant therapies. Studies have also found that the genetic changes varied among different stages of colon cancer, specially between stage II and stage III [3, 17, 18]. However, the genetic mechanism underlying these genetic changes that drive pathological stage progression are remain poorly studied.

There are some recent studies on delineating miRNAs and genes differentially expressed in different stages of colon cancer [19, 20]. These work started with conducting differentially expressed miRNAs and/or differentially expressed gene analysis to find the pathways that target genes and/or miRNAs are involved in different pathological stages of colon cancer. In our study, we intend to integrate miRNA and gene expression interactions from mining miRNA and gene expression profiles of cancer patients to investigate different network communities and patterns specific to different pathological stages of colon cancer. We hypothesized that as colon cancer progresses, there are unique functional patterns present in early stages that are not present in later stages and vice versa. We believe that identifying these functional signatures in different pathological stages can lead to improved prognosis and better understanding in the stage progression of colon cancer. Ultimately, the identification and analysis of the evolution and dynamics of these miRNA-gene networks will improve our understanding of the etiology and treatment of colon cancer. Using The Cancer Genome Atlas (TCGA) project [21] colon cancer data [22], we developed a novel strategy to integrate miRNA and gene expression data, incorporate known miRNA and gene targets, and protein protein interactions to generate a more comprehensive view of the miRNA-gene interactions.


Our integrative strategy for this work can be summarized as a workflow containing the following five steps (Fig. 1) after preprocessing the data.

Fig. 1

An illustration of our analysis workflow

Step 1. For each pathological stage, we conducted an association analysis between miRNA expression and gene expression quantifications in the same patients respectively using MatrixeQTL [23]. Those miRNA and gene associations that are statistically significant are kept for further analysis.

Step 2. For the miRNAs that are significantly associated with gene expression variations identified in Step 1, we obtained their target genes by retrieving from miRNA target gene databases like miRTarBase[24].

Step 3. We generated a gene set directly associated with or targeted by miRNAs for each pathological stage by merging all genes significantly associated with miRNAs output from Step 1 and all of the targeted genes of those miRNAs significantly associated with gene expression obtained from Step 2. We then conducted the Gene Ontology (GO) Enrichment analysis for the gene set of each pathological stage to compare the functional differences among four stages.

Step 4. We overlapped the gene set directly associated with or targeted by miRNAs to InWeb, a comprehensive interaction database, to generate a preliminary miRNA-gene network in each pathological stage. We then expanded these preliminary miRNA-gene networks by adding interacting genes using two different methods and generated an integrated miRNA-gene network for each stage. The integrated miRNA-gene network for each pathological stage is now composed of miRNAs significantly associated with gene expression changes, the genes associated with or targeted by these miRNAs, and extended genes interacting with these genes.

Step 5. Finally, we performed community identification, network alignment, and statistical test to investigate the differences among the integrated miRNA-gene networks of all of the four pathological stages.

Data preprocessing

The data in this study was first downloaded from TCGA (download date: June 11, 2015). Specifically, we retrieved miRNA expression data, gene expression data and clinical profiles for all colon cancer patients. The miRNA and gene expression data, quantified from miRNA and RNA sequencing respectively, were then preprocessed by first removing samples with missing data. We retained all clinical profiles of patients whose pathologic stage was known. If the sample stage was unknown, we did not include that sample in the study. We hence collected all samples with pathological stages marked as pathological stage I, II, III, and IV. We then conducted the analysis on these samples in each pathological stage respectively. We filtered the miRNA and gene expression matrices by removing those with over 10% of missing values in the samples. To reduce the variation between samples, we utilized inverse quantile normalization to normalize the miRNA and gene expression data separately. The quantile normalized miRNA and gene expression matrices are then used for this study.

Identification of associations between miRNA and gene expression

We performed miRNA-gene association analysis between the miRNA expression and gene expression profiles in four pathological stages separately. We identified significant associations between miRNAs and genes where they are on the same chromosome and located across different chromosomes. The reference genome we used was GRCH38 with the gene reference of gencode.v29.annotation.gff3 downloaded from [25], and the miRNA reference of hsa.gff3 was downloaded from [2630] The association analysis was performed using Matrix eQTL [31], which is an R package that uses matrix operations to identify pairwise associations. We conducted miRNA-gene association analysis by running a linear regression model using Matrix eQTL for each pair of miRNA and gene, and selected significant miRNA-gene associations using a 5% cutoff of false discovery rate (FDR). For each pathological stage, we extracted all of the experimentally validated target genes of those miRNAs asscociated with gene expressions from the miRTarBase target database [24]. At this point, we generated a target and associated gene set of miRNAs for each pathological stage by merging the genes associated with miRNAs and the genes targeted by miRNAs for further analysis.

Gene Ontology analysis of miRNA associated and targeted genes

We conducted GO enrichment analysis for target and associated gene set of miRNAs (GO annotation released 2018-12-01) for each stage using the PANTHER over-representation test (Fisher’s exact test with a Bonferroni-corrected p-value cutoff at 0.05) [32, 33]. Using the 20,996 human whole-genome genes as background, we performed GO enrichment analysis of the genes for three categories including molecular function, biological process and cellular component.

The construction of integrated miRNA-gene networks

To explore the genetic network induced by miRNA expression, we constructed a miRNA-gene interaction network for each pathological stage using the following strategy. First, we overlapped the gene set, composed of miRNA associated genes and targeted genes, with the genes in an interaction database InWeb [34, 35]. InWeb is a comprehensive protein-protein interaction network that integrated various interaction resources and was constructed using a stringent orthology majority-voting scheme [35]. We used InWeb as a template network for our network extension since it is dense and covers a wide spectrum of interaction information among genes. This overlapping process generated a preliminary miRNA-gene network for each pathological stage that is composed of miRNAs, their associated and targeted genes, and direct interactions among these genes.

Next, we expanded our miRNA and gene expression network for each stage using two methods. The first method is called GeNets [36], which uses a randomization approach to choose a new node to connect, based on a known network structure from InWeb [34, 35]. The second method we applied was a spin-glass model [3739] that uses simulated annealing to identify the communities in a network. Our network extension process thus includes two stages. First, we used a scalable spin-glass algorithm in R’s igraph library [40] to identify all communities in the preliminary network including InWeb interactions among the associated and targeted genes of miRNAs. Second, we selected the most connected genes in the communities to find their downstream genes. The final integrated miRNA-gene network for each pathological stage was then constructed by including the miRNAs, the associated and targeted genes of these miRNAs, and the communities and extended neighborhood genes of the these associated and targeted genes. With this strategy, we constructed the final miRNA-gene network for each pathological stage that captures the direct and indirect effect of miRNAs on gene expression and provides a network view of how miRNAs affect gene expression in different pathological stages.

Network analysis and alignment

We conducted comparative network analysis and network alignment among the integrated miRNA-gene networks across different pathological stages of colon cancer to investigate the differences in network properties and topology that may indicate disease progression. We first calculated important network properties of the four miRNA-gene networks for the four pathological stages. Specifically, we calculated the following four network properties, namely edge similarity, centrality, diameter and betweenness. Edge similarity indicates the similarity of edges/connections in two networks. Here we calculated the edge similarity as the proportion of common edges between stage I and all other stages respectively. Network diameter measures the length of the longest path from one vertex to another in a network and indicates the size of a network. Network centrality indicates the importance of vertices within a network. The betweenness of a network denotes the degree to which nodes stand between each other and is an important measurement of network centrality. We also calculated the degree distribution of all nodes in each network. We then performed network alignment to investigate how these miRNA-gene networks in different pathological stages align and differ in network topology. We utilized the “GraphAlignment” package in R [41] that aligns two networks or graphs using the adjacency matrices of two networks. We can thus identify subnetworks specific to each stage based on common nodes in the intersection of two networks.


Significant miRNA-gene associations

From our miRNA and gene expression association analysis, we identified 923, 1964, 838 and 930 significant miRNA-gene associations for stage I, II, III and IV, separately (Table 1). We observed that the distances between miRNAs and their associated genes in different distance windows follow a similar trend but they do differ among different pathological stages. We found that these miRNA-gene associations are stage specific and there is no overlap miRNA-gene pairs among all stages although there 17 miRNAs and one gene shared among these stages (Fig. 2). The numbers of significant associations identified may be confounded by the varied sample sizes in different stages (31 for stage I, 82 for stage II, 59 for stage III and 23 for stage IV). However, this observation that different stages share little in miRNA-gene associations, implies that miRNAs may affect gene expression differently in varied pathological stages and this may indicate disease progression and etiology. We admit that this little sharing of miRNA-gene pairs in different stages might also be partially due to the limited power of association analysis resulted from small sample sizes. Nonetheless, some of the stage-specific miRNAs have been previously reported to affect the progression of colon cancer. For example, hsamir−200c was reported to be related to colon cancer progression [42] that was found to be associated with cancer genes in stage I but not in other stages in our results (Fig. 3a). hsamir−1249 which was shown to up-regulate genes in colon cancer [43] and was found to be associated with OSMR that affect the progression of colon cancer in stage II in our association study [44] (Fig. 3b). hsamir−34a was reported to up-regulate or down-regulate target genes in colon cancer in stage III [45] (Fig. 3c). In stage IV, hsamir−130b was reported to affect the tumor progress in colon cancer. [14, 46, 47] which was associated with cancer gene GPSM2 in our study [48] (Fig. 3d). In the miRNA-gene associations, we also found some stage-specific genes were reported to be associated with colon cancer in literature. For example, PLA2G2A was only identified to be associated with hsamir−129−1 in stage I, and was previously reported to have low expression in colon cancer patients [49] and hsamir−129−1 was reported to act as a tumor suppressor [50]. Another gene MCC was only found to be associated with hsamir−671 in stage II that was reported to have somatic mutations in colon cancer [51]. Moreover, the associated miRNA hsamir−671 was reported to up-regulate the gene expression in colon cancer study [52]. Another gene associated with colon cancer was FGFR3 [53], and the deregulated expression of the associated miRNA hsamir−9, only identified in Stage IV, had an important role in colon cancer progression [54]. All these examples demonstrated that some of these miRNAs and genes specifically associated in different pathological stages are reported to be related to colon cancer, and can thus serve as potential biomarkers and anchor points toward further understanding of the progression and prognosis of colon cancer.

Fig. 2

Venn diagrams of miRNA-gene associations identified in each pathological stage. a Comparison of miRNA-gene associations among four stages; b Comparison of miRNAs associated with gene expression among four stages; c Comparison of genes in association with miRNAs among four stages

Fig. 3

Examples of miRNA-gene subnetworks specific to each stage. Purple nodes represent genes, orange nodes denote cancer genes, and red triangle nodes represent miRNAs that are associated with genes. Gene network expansion are black lines and miRNA-gene associations are solid blue lines. a A subnetwork specific to stage I; b A subnetwork specific to stage II; c A subnetwork specific to stage III; d A subnetwork specific to stage IV

Table 1 The distance distribution between miRNAs and their associated genes

Gene Ontology enrichment analysis

We conducted GO enrichment analysis for the miRNA associated genes and found that 6 molecular functions, 23 biological processes, 24 cellular components were enriched for stage I; 9 molecular functions, 30 biological processes, 23 cellular components were enriched, and 5 biological process depleted for stage II; 2 molecular functions, 16 biological processes, 12 cellular components were enriched, and 2 biological process depleted for stage III; and 26 molecular functions, 78 biological processes, 50 cellular components were enriched for stage IV (Additional file 1). The comparison of molecular function, biological process and cellular component are showed in Fig. 4. Besides the common enriched cellular components, molecular functions and biological processes, there were different functions in these four categories enriched by target and associated genes through our GO analysis. For example, biological processes that were related to colon cancer such as chemotaxis and regulation of cell projection organization, were particularly enriched in stage I; cellular process and cell adhesion were particularly enriched in stage II; gene expression were particularly enriched in stage III and cell differentiation and regulation of immune system process were particularly enriched in stage IV [55]. Molecular functions that were related to colon cancer such as sodium ion transmembrane transporter activity were particularly enriched in stage I; growth factor activity and receptor regulator activity was particularly enriched in stage II; and chemokine activity and ion channel activity was particularly enriched in stage IV [55]. Cellular components that were related to colon cancer such as exocytic vesicle and synaptic vesicle were particularly enriched in stage I; cytoplasmic part and cytoplasm was particularly enriched in stage II; membrane protein complex and integral component of lumenal side of endoplasmic reticulum membrane was particularly enriched in stage IV [55]. It is thus important to investigate the particularly enriched biological processes, molecular functions, and cellular components that are specific to certain pathological stage. Such investigations will help shed light on how miRNAs play a different role in regulating gene expression in various pathological stages and thus can serve as biomarkers or signature profiles to help screen, subtype, diagnose and treat patients.

Fig. 4

GO enrichment analysis of gene associated with miRNAs in four stages. a. Enriched molecular functions; b. Enriched cellular components; c. Enriched biological processes

Network analysis and alignment

First, we checked how the genes are interacting with each other in the genes targeted by or associated with miRNAs in each pathological stage (Additional file 2). We found that 98.87% (12420/12562) of these genes are interacting according to the protein protein interactions with high confidence in the InWeb database where we set the confidence score cutoff at 0.2 (Table 2).

Table 2 The numbers of genes targeted by and associated with miRNAs, and their interactions in each pathological stage

Second, we compared the four integrated miRNA-gene networks of different stages by comparing important network properties of these networks such as the edge similarity, centrality of vertices, diameter of graph, and betweenness of the network in each pathological stage (Table 3). The edge similarity was calculated as the proportion of common edges of the network for stage I and all other networks for the remaining three stages. The centrality of vertices denoted the importance of the vertices in the network [56] with higher values implying that the nodes were closer to the center of the network. The betweenness of vertices denoted the number of the shortest paths passing through vertices [57]. The higher the betweenness of a node, the more of other nodes can reach the center by the shortest path through this node. We use the average of centrality and betweenness of all nodes in the network in Table 3. Another measurement we used was the diameter of each network of each stage which measured the longest path from one vertex to another in the network. We also investigated the degree distribution of vertices of network in each stage (Fig. 5). These network properties showed that there was different pattern between stage I and other stages, which provided evidence on different effect of miRNAs on gene expression in different pathological stages. Particularly, we observed that pathological stage II is the most different comparing with networks of the other stages, and stage III and IV are more similar. The network of stage II is most similar to stage I, has the largest diameter, has the lowest centrality where nodes are less important comparing to other networks, and has the largest betweenness with more nodes reaching to network centers via shortest paths. Hence, these results suggest that we need to investigate why stage II has distinct network property comparing to other stages to better understand disease progression and prognosis.

Fig. 5

The histogram of degree distribution in the miRNA-gene network of each stage

Table 3 Comparison of network properties of the integrated miRNA-gene networks in four stages

Finally, we conducted network alignment to find local network structures that differ in the integrated miRNA-gene networks in the four pathological stages. We found many examples that miRNAs affect gene expression with different patterns among varied pathological stages (Fig. 3). For example, Fig. 3a showed one miRNA hsamir−200c was related to colon cancer progression [42] that was associated with some cancer genes only in stage I. hsamir−1249 which was reported to up-regulate genes in colon cancer [43] was associated with OSMR in stage II and OSMR affects the progression of colon cancer [44]. In addition, another gene CTNNB1 reported to affect colon cancer progression was also involved in this miRNA-gene network in stage II [58]. hsamir−34a was reported to up-regulate or down-regulate target genes in colon cancer and was found to be associated with genes changes only in stage IV [45]. In stage IV, hsamir−130b was reported to affect the tumor progression in colon cancer. [14, 46, 47] which is associated with cancer gene GPSM2 in our study [48]. These four networks are only present in each corresponding stage and do not overlap cross stages which provide the evidence of our assumption that the miRNA-gene network has different patterns in colon cancer pathological stages.

Discussion and conclusion

This study implemented an association and network integration strategy to help understand the underlying mechanism on how miRNAs distinctively affect gene expression in different pathological stages of colon cancer. Our results showed that there is significant difference among the miRNA-gene associations and interactions in different pathological stages. Particularly, pathological stage II was notably different from other stages regarding to their network properties in our miRNA-gene network analysis. We believe that such analysis will point to potential follow-up studies dedicated to investigating molecular mechanisms that lead to these differences. With more samples with clinical information available, together with matching genetic and epigenetic profiles, we expect to apply the strategy designed in this study to find more detailed delineations of regulatory changes in different pathological stages and diseases. It is also worth noting that this method is applicable to any cancer type or even other disease that has clinical profile data available.

In summary, we found that there is significant statistical difference on the miRNA-gene interactions between different pathological stages in colon cancer, which supported by other studies that network signature plays different roles in the pathogenesis [59, 60], metastasis [61] and the tumorigenesis of colon cancer. Our approach for integrating different types of data (including but not limited to miRNA expression, gene expression, protein protein interactions) can reveal the interactions and cross-talks between multi-layers of genetic components in interpreting the etiology, prognosis and progression of various diseases including different cancer types. Given the evidence that these interactions and network communities differ in different disease stages, this study also shed light on potential mitigation or treatment plan that helps deter or redirect the progression of diseases through diverting or changing the network structure in a systems biology way.

Availability of data and materials

The data, code, and results of this study can be publicly accessed via github at



pithelial-mesenchymal transition


Gene Ontology


The Cancer Genome Atlas


  1. 1

    Siegel RL, Miller KD, Jemal A. Cancer statistics, 2017. CA Cancer J Clin. 2017; 67(1):7–30.

  2. 2

    American Society of Clinical Oncology, et al.Understanding statistics used to guide prognosis and evaluate treatment. 2005. Accessed 2018.

  3. 3

    Dotan E, Cohen SJ. Challenges in the management of stage II colon cancer. In: Seminars in oncology, vol. 38. Elsevier: 2011. p. 511–20. WB Saunders.

  4. 4

    Sharif S, O’Connell MJ. Gene signatures in stage ii colon cancer: a clinical review. Curr Color Cancer Rep. 2012; 8(3):225–31.

  5. 5

    Dienstmann R, Salazar R, Tabernero J. Personalizing colon cancer adjuvant therapy: selecting optimal treatments for individual patients. J Clin Oncol. 2015; 33(16):1787–96.

  6. 6

    Dalerba P, Sahoo D, Paik S, Guo X, Yothers G, Song N, Wilcox-Fogel N, Forgó E, Rajendran PS, Miranda SP, et al.Cdx2 as a prognostic biomarker in stage ii and stage iii colon cancer. N Engl J Med. 2016; 374(3):211–22.

  7. 7

    Woischke C, Blaj C, Schmidt EM, Lamprecht S, Engel J, Hermeking H, Kirchner T, Horst D. Cyb5r1 links epithelial-mesenchymal transition and poor prognosis in colorectal cancer. Oncotarget. 2016; 7(21):31350.

  8. 8

    Galván J, Zlobec I, Wartenberg M, Lugli A, Gloor B, Perren A, Karamitopoulou E. Expression of e-cadherin repressors snail, zeb1 and zeb2 by tumour and stromal cells influences tumour-budding phenotype and suggests heterogeneity of stromal cells in pancreatic cancer. Br J Cancer. 2015; 112(12):1944–50.

  9. 9

    Seto KKY, Andrulis IL. Atypical protein kinase c zeta: Potential player in cell survival and cell migration of ovarian cancer. PloS ONE. 2015; 10(4):e0123528.

  10. 10

    Wu M-Y, Eldin KW, Beaudet AL. Identification of chromatin remodeling genes arid4a and arid4b as leukemia suppressor genes. J Natl Cancer Inst. 2008; 100(17):1247–59.

  11. 11

    Winter SF, Lukes L, Walker RC, Welch DR, Hunter KW. Allelic variation and differential expression of the msin3a histone deacetylase complex gene arid4b promote mammary tumor growth and metastasis. PLoS Genet. 2012; 8(5):e1002735.

  12. 12

    Li P, Ou Q, Chen G, Oduncu FS. Microrna as predictive biomarker of survival for stage iiib colon cancer patients. Ann Oncol. 2016; 27(suppl 6):532P.

  13. 13

    Li J, Liu Y, Wang C, Deng T, Liang H, Wang Y, Huang D, Fan Q, Wang X, Ning T, et al.Serum mirna expression profile as a prognostic biomarker of stage ii/iii colorectal adenocarcinoma. Sci Rep. 2015; 5:12921.

  14. 14

    Schetter AJ, Leung SY, Sohn JJ, Zanetti KA, Bowman ED, Yanaihara N, Yuen ST, Chan TL, Kwong DLW, Au GKH, et al.Microrna expression profiles associated with prognosis and therapeutic outcome in colon adenocarcinoma. Jama. 2008; 299(4):425–36.

  15. 15

    Díaz R, Silva J, García JM, Lorenzo Y, García V, Peña C, Rodríguez R, Muñoz C, García F, Bonilla F, et al.Deregulated expression of mir-106a predicts survival in human colon cancer patients. Genes Chromosom Cancer. 2008; 47(9):794–802.

  16. 16

    Xu J, Zhao J, Zhang R. Four micrornas signature for survival prognosis in colon cancer using tcga data. Sci Rep. 2016; 15:38306.

  17. 17

    Frank SA. Multistage progression. In: Dynamics of Cancer: Incidence, Inheritance, and Evolution 2007. Princeton University Press: 2007.

  18. 18

    Kheirelseid EAH, Miller N, Chang KH, Nugent M, Kerin MJ. Clinical applications of gene expression in colorectal cancer. J Gastrointest Oncol. 2013; 4(2):144.

  19. 19

    Falzone L, Scola L, Zanghì A, Biondi A, Cataldo AD, Libra M, Candido S. Integrated analysis of colorectal cancer microrna datasets: identification of micrornas associated with tumor development. Aging (Albany NY). 2018; 10(5):1000.

  20. 20

    Feng D, Yu X, Zhang Y, Liu Y, Zhou L, Han B. Interaction analysis of molecular markers in colorectal cancer by integrated bioinformatics analysis. Med Sci Monit. 2018; 24:6059–69.

  21. 21

    The cancer genome atlas (tcga). Accessed June 2015.

  22. 22

    Cancer Genome Atlas Network, et al.Comprehensive molecular characterization of human colon and rectal cancer. Nature. 2012; 487(7407):330.

  23. 23

    Shabalin AA. Matrix eqtl: ultra fast eqtl analysis via large matrix operations. Bioinformatics. 2012; 28(10):1353–8.

  24. 24

    Chou C-H, Shrestha S, Yang C-D, Chang N-W, Lin Y-L, Liao K-W, Huang W-C, Sun T-H, Tu S-J, Lee W-H, et al.mirtarbase update 2018: a resource for experimentally validated microrna-target interactions. Nucleic Acids Res. 2017; 46(D1):D296–D302.

  25. 25

    Harrow J, Frankish A, Gonzalez JM, Tapanari E, Diekhans M, Kokocinski F, Aken BL, Barrell D, Zadissa A, Searle S, et al.Gencode: the reference human genome annotation for the encode project. Genome Res. 2012; 22(9):1760–74.

  26. 26

    Kozomara A, Griffiths-Jones S. mirbase: annotating high confidence micrornas using deep sequencing data. Nucleic Acids Res. 2013; 42(D1):D68–D73.

  27. 27

    Kozomara A, Griffiths-Jones S. mirbase: integrating microrna annotation and deep-sequencing data. Nucleic Acids Res. 2010; 39(suppl_1):D152–D157.

  28. 28

    Griffiths-Jones S, Saini HK, van Dongen S, Enright AJ. mirbase: tools for microrna genomics. Nucleic Acids Res. 2007; 36(suppl_1):D154–D158.

  29. 29

    Griffiths-Jones S, Grocock RJ, Van Dongen S, Bateman A, Enright AJ. mirbase: microrna sequences, targets and gene nomenclature. Nucleic Acids Res. 2006; 34(suppl_1):D140–D144.

  30. 30

    Griffiths-Jones S. The microrna registry. Nucleic Acids Res. 2004; 32(suppl_1):D109–D111.

  31. 31

    Shabalin AA. Matrix eqtl: Ultra fast eqtl analysis via large matrix operations. Bioinformatics. 2012; 28(10):1353–8.

  32. 32

    Mi H, Huang X, Muruganujan A, Tang H, Mills C, Kang D, Thomas PD. Panther version (11) expanded annotation data from gene ontology and reactome pathways, and data analysis tool enhancements. Nucleic Acids Res. 2016; 45(D1):D183–D189.

  33. 33

    Gene Ontology Consortium. Expansion of the gene ontology knowledgebase and resources. Nucleic Acids Res. 2016; 45(D1):D331–D338.

  34. 34

    Lage K, et al.A human phenome-interactome network of protein complexes implicated in genetic disorders. Nat Biotechnol. 2007; 25:309–16.

  35. 35

    Li T, Wernersson R, Hansen RB, Horn H, Mercer J, Slodkowicz G, Workman CT, Rigina O, Rapacki K, Stærfeldt HH, et al.A scored human protein–protein interaction network to catalyze genomic interpretation. Nat Methods. 2017; 14(1):61.

  36. 36

    Li T, Kim A, Rosenbluh J, Horn H, Greenfeld L, An D, Zimmer A, Liberzon A, Bistline J, Natoli T, Li Y. GeNets: a unified web platform for network-based genomic analyses. Nature methods. 2018; 15(7):543–46.

  37. 37

    Eaton E, Mansbach R. A spin-glass model for semi-supervised community detection. Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence. Toronto; 2012.

  38. 38

    Pan L, Wang C, Xie J. A spin-glass model based local community detection method in social networks. In: 2013 IEEE 25th International Conference on Tools with Artificial Intelligence. IEEE: 108.

  39. 39

    Reichardt J, Bornholdt S. Statistical mechanics of community detection. Phys Rev E. 2006; 74:1.

  40. 40

    Csardi G, Nepusz T. The igraph software package for complex network research. InterJournal, Complex Syst. 2006; 1695(5):1–9.

  41. 41

    Berg J, Lässig M. Cross-species analysis of biological networks by bayesian alignment. Proc Natl Acad Sci. 2006; 103(29):10967–72.

  42. 42

    Schetter AJ, Okayama H, Harris CC. The role of micrornas in colorectal cancer. Cancer J (Sudbury, Mass.) 2012; 18(3):244.

  43. 43

    Li E, Ji P, Ouyang N, Zhang Y, Wang XY, Rubin DC, Davidson NO, Bergamaschi R, Shroyer KR, Burke S, et al.Differential expression of mirnas in colon cancer between african and caucasian americans: Implications for cancer racial health disparities. Int J Oncol. 2014; 45(2):587–94.

  44. 44

    Hofer P, Hagmann M, Brezina S, Dolejsi E, Mach K, Leeb G, Baierl A, Buch S, Sutterlüty-Fall H, Karner-Hanusch J, et al.Bayesian and frequentist analysis of an austrian genome-wide association study of colorectal cancer and advanced adenomas. Oncotarget. 2017; 8(58):98623.

  45. 45

    Colangelo T, Fucci A, Votino C, Sabatino L, Pancione M, Laudanna C, Binaschi M, Bigioni M, Maggi CA, Parente D, et al.Microrna-130b promotes tumor development and is associated with poor prognosis in colorectal cancer. Neoplasia. 2013; 15(9):1086–99.

  46. 46

    Arndt GM, et al.Characterization of global microrna expression reveals oncogenic potential of mir-145 in metastatic colorectal cancer. BMC Cancer. 2009; 9(1):1.

  47. 47

    Tazawa H, et al.Tumor-suppressive mir-34a induces senescence-like growth arrest through modulation of the e2f pathway in human colon cancer cells. Proc Natl Acad Sci. 2007; 104(39):15472–7.

  48. 48

    Tu Z, He D, Deng X, Xiong M, Huang X, Li X, Hao L, Ding Q, Zhang Q. An eight-long non-coding rna signature as a candidate prognostic biomarker for lung cancer. Oncol Rep. 2016; 36(1):215–22.

  49. 49

    Cormier RT, Hong KH, Halberg RB, Hawkins TL, Richardson P, Mulherkar R, Dove WF, Lander ES. Secretory phospholipase pla2g2a confers resistance to intestinal tumorigenesis. Nat Genet. 1997; 17(1):88.

  50. 50

    Gao Y, Feng B, Han S, Lu L, Chen Y, Chu X, Wang R, Chen L. Microrna-129 in human cancers: from tumorigenesis to clinical treatment. Cell Physiol Biochem. 2016; 39(6):2186–202.

  51. 51

    Kinzler KW, Nilbert MC, Vogelstein B, Bryan TM, Levy DB, Smith KJ, Preisinger AC, Hedge P, Markham A, et al.Identification of a gene located at chromosome 5q21 that is mutated in colorectal cancers. Science. 1991; 251(4999):1366–70.

  52. 52

    Zhang J, Luo X, Li H, Deng L, Wang Y. Genome-wide uncovering of STAT3-mediated miRNA expression profiles in colorectal cancer cell lines. BioMed Res Int. 2014; 2014:187105.

  53. 53

    Jang J-H, Shin K-H, Park J-G. Mutations in fibroblast growth factor receptor 2 and fibroblast growth factor receptor 3 genes associated with human gastric and colorectal cancers. Cancer Res. 2001; 61(9):3541–3.

  54. 54

    Cekaite L, Rantala JK, Bruun J, Guriby M, Ågesen TH, Danielsen SA, Lind GE, Nesbakken A, Kallioniemi O, Lothe RA, et al.Mir-9,-31, and-182 deregulation promote proliferation and tumor cell survival in colon cancer. Neoplasia. 2012; 14(9):IN20–IN21.

  55. 55

    Chisanga D, Keerthikumar S, Pathan M, Ariyaratne D, Kalra H, Boukouris S, Mathew NA, Al Saffar H, Gangoda L, Ang C-S, et al.Colorectal cancer atlas: an integrative resource for genomic and proteomic annotations from colorectal cancer cell lines and tissues. Nucleic Acids Res. 2015; 44(D1):D969–D974.

  56. 56

    Freeman LC. Centrality in social networks conceptual clarification. Soc Netw. 1978; 1(3):215–39.

  57. 57

    Freeman LC. A set of measures of centrality based on betweenness. Sociometry. 1977; 1:35–41.

  58. 58

    Lee JJ, Wedow R, Okbay A, Kong E, Maghzian O, Zacher M, Nguyen-Viet TA, Bowers P, Sidorenko J, Linnér RK, et al.Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals. Nat Genet. 2018; 50(8):1112.

  59. 59

    Li W, Cai S, Wang L, Yang C, Zhou B, Wang H. Hint2 downregulation promotes colorectal carcinoma migration and metastasis. Oncotarget. 2017; 8(8):13521.

  60. 60

    Martin J, St-Pierre MV, Dufour J-F. Hit proteins, mitochondria and cancer. Biochim Biophys Acta (BBA) Bioenerg. 2011; 1807(6):626–32.

  61. 61

    Hart M, Rheinheimer S, Leidinger P, Backes C, Menegatti J, Fehlmann T, Grässer F, Keller A, Meese E. Identification of mir-34a-target interactions by a combined network based and experimental approach. Oncotarget. 2016; 7(23):34288.

Download references


Not applicable.

About this supplement

This article has been published as part of BMC Medical Genomics, Volume 12 Supplement 7, 2019: Selected articles from the 14th International Symposium on Bioinformatics Research and Applications (ISBRA-18): medical genomics. The full contents of the supplement are available at


The design of the study, the collection, analysis, and interpretation of data, and the writing of the manuscript is partially supported by National Institutes of Health (R15HG009565 to X.S.). Publication costs are funded by R15HG009565.

Author information

XS planned the study and supervised the work. JW and BH conducted the analysis. JW, BH, and XS wrote the manuscript. All authors have read and approved the final manuscript.

Correspondence to Xinghua Shi.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Wen, J., Hall, B. & Shi, X. A network view of microRNA and gene interactions in different pathological stages of colon cancer. BMC Med Genomics 12, 158 (2019).

Download citation