Skip to main content

Screening and identification of immune-related genes for immunotherapy and prognostic assessment in colorectal cancer patients



Increasing evidence indicates that the immune microenvironment plays a key role in the genesis and progression of colorectal cancer (CRC). This study aimed to establish an immune-related gene (IRG) signature and determine its clinical prognostic value in patients with CRC.


The RNA sequencing and associated clinical data of CRC were downloaded from The Cancer Genome Atlas (TCGA) database. We then screened for differentially expressed IRGs by intersecting with IRGs obtained from the Immunology Database and Analysis Portal. Functional enrichment analyses were carried out to determine the potential biological functions and pathways of the IRGs. We also explored the specific molecular mechanisms of the IRGs by constructing regulatory networks. Prognostic IRGs were obtained by LASSO regression analysis, and subsequently, gene models were constructed in the TCGA dataset to confirm the predictive capacity of these IRGs. Finally, we used the TIMER tool to assess the immune properties of prognostic IRGs and correlate them with immune cells.


We identified 409 differentially expressed IRGs in patients with CRC. Kyoto Encyclopaedia of Genes and Genomes and Gene Ontology enrichment analyses suggested that these differentially expressed IRGs were significantly related to 102 cancer signalling pathways and various biological functions. Based on the prediction and interaction results, we obtained 59 TF–IRG, 48 miRNA–IRG, and 214 drug–IRG interaction networks for CRC. Four prognostic genes (POMC, TNFRSF19, FGF2, and SCG2) were developed by integrating 47 survival-related IRGs and 42 characteristic CRC genes. The results of gene model showed that patients in the low risk group had better survival outcomes compared to those in the high risk group. The expression of POMC, TNFRSF19, FGF2, and SCG2 was significantly correlated with immune cells.


This study identified some valid IRGs, and these findings can provide strong evidence for precision immunotherapy in patients with CRC.

Peer Review reports


Colorectal cancer (CRC) was the third most commonly diagnosed cancer and the second leading cause of cancer death in 2020, with approximately 1.9 million new cases and 935,000 deaths worldwide, representing approximately one in ten cancer cases and deaths [1]. Although the five-year survival rate for CRC patients has improved with early screening in developed countries, the outcome for patients with advanced CRC remains unsatisfactory, with a median five-year survival rate of only 12.5% in the USA [2]. Therefore, it is necessary to identify specific biomarkers for early diagnosis and potential therapeutic targets in CRC.

Immunotherapy is gradually becoming the standard treatment for cancer and is as important as surgery, radiotherapy, and chemotherapy. Cancer immunotherapy is designed to promote the immune response of tumour-specific T cells [3]. When fully reprogrammed, T cells are considered the most powerful anti-cancer immune cells. Immunotherapy has not only produced unprecedented clinical results in patients with refractory tumours but has also brought long-term clinical remission to patients with diseases that were historically considered incurable [4]. In recent years, the advent of immune checkpoint inhibitors (ICIs), such as anti-PD-1, has opened up a new landscape for cancer immunotherapy. Nevertheless, the use of ICIs in CRC is currently limited to patients with high microsatellite instability and is only 5–10% effective in CRC patients with microsatellite stability (approximately 90%) [5]. Hence, it is necessary to explore reliable immune-related genes (IRGs) as important immune signatures to improve efficacy and predict prognosis in patients with CRC.

With the presentation of large-scale publicly available gene expression databases, researchers have been able to quickly and accurately identify potential biomarkers for tumour surveillance [6]. The Cancer Genome Atlas (TCGA) is a commonly used database that contains a large amount of transcriptome data and can provide many tumour samples. Multiple immune-related prognostic signatures for lung adenocarcinoma [7], hepatocellular carcinoma [8], breast cancer [9], and clear cell renal cell carcinoma [10] were established from TCGA.

Our study integrated differentially expressed genes obtained from TCGA with IRGs collected from the Immunology Database and Analysis Portal (IMMPort) and conducted an in-depth mining analysis of CRC data. We then analysed and processed the IRGs further by using functional enrichment analyses and regulatory network construction. In addition, we discovered new immune biomarkers associated with CRC prognosis applying LASSO regression analysis. We hope that these findings will lead to accurate prognostic assessment and effective immunotherapy strategies for patients with CRC.


Original data acquisition

The RNA-sequencing and miRNA data were downloaded from TCGA using the UCSC Xena browser ( [11]. The corresponding clinical data for CRC included 353 samples (342 tumour samples and 11 normal samples). The counts per million values were obtained by transforming the original data.

We downloaded the CRC data set numbered GSE39582 from the National Center for Biotechnology Information GEO (Gene Expression Omnibus) ( database [12]. The data set was processed by the original author and standardized probe expression matrix was downloaded. Meanwhile, the probe annotation information of corresponding platform was downloaded. Convert the probe to gene symbol and eliminate the probe that is not compared to gene symbol. For multiple probes mapped to the same gene symbol, the average value of probes was taken as the expression level of the gene. Then the expression values of four genes FGF2, POMC, SCG2, and TNFRSF19 were selected for subsequent analysis.

Differential immune-related genes and miRNA screening

The samples were divided into tumour and normal groups. The TMM algorithm in the R (Version 4.1.1) software package edgeR (Version 3.36.0) [13] was used to standardise the raw count and transform it into counts per million, which was used for subsequent analysis. The significance of differences in gene expression was calculated using an unpaired t-test and corrected by applying the Benjamini–Hochberg (BH) procedure. Threshold | logFC | > 1 and p-value < 0.05 were selected as significant differences in miRNAs and genes expression. The IRGs were collected from the IMMPort database [14] (, and 1793 different IRGs associated with human cancers were screened out (Additional file 6: Table S1). These genes were intersected with 4747 differentially expressed genes to obtain differentially expressed IRGs in CRC.

Functional enrichment analyses of IRGs

To explore the potential biological functions and pathways of 409 differentially expressed IRGs, the R software package clusterProfiler [15] was applied to conduct the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway [16,17,18] and Gene Ontology (GO) [19] enrichment analysis for IRGs. GO has three ontologies: molecular function (MF), cellular component (CC), and biological process (BP). The results with a p-value < 0.05 after BH correction were selected as the most significant enrichment results.

Survival-associated IRG screening

Clinical survival information and gene expression data from patients with CRC were extracted from TCGA. We used the survival R package (version 2.41-3, to analyse the impact of differentially expressed IRGs on patient survival and prognosis. Subsequently, we plotted the Kaplan–Meier (K–M) curve to compare overall survival for high- and low-risk expressions, and survival-associated IRGs were identified using a log-rank test (p < 0.05).

Construction of transcription factor (TF)-IRG and miRNA-IRG regulatory network

The over-representation analysis enrichment method was applied to predict TF-target enrichment of differentially expressed genes in protein–protein interaction network, using WebGestalt GAST [20] ( The species selected was hsapiens, the enrichment parameter (the minimum number of enrichment genes) was set at 2, and the results of the Top 10 were displayed. The TF-target gene interaction relations were obtained using Cytoscape [21] (version 3.6.0, to draw TF-IRG regulatory network.

We used the miRWalk [22] tool to predict regulatory miRNAs of differentially expressed IRGs. The miRNAs obtained by our previous differential analysis were screened out from these miRNAs to construct the relationship between differentially expressed miRNAs and IRGs. Finally, the miRNA-IRG regulatory network was mapped using Cytoscape.

Establishment of drug–gene interaction network

According to the drug prediction database DGIdb ( [23], drug–gene interactions of key differential genes regulated by miRNA and TFs were further predicted by the filtering parameter ‘FDA-approved’. We then constructed the drug–gene interaction network based on the prediction results, using Cytoscape software.

Prognostic characteristic gene screening and model construction

We screened the characteristic genes of CRC using LASSO regression analysis and integrated them with 47 survival-related IRGs to obtain the prognostic characteristic genes of CRC. The lambda value of the LASSO filter was set to 0.004 by iterative calculation. To confirm the predictive capacity of these IRGs, two thirds of the samples (including 219 CRC samples) in the TCGA dataset were randomly selected using the R language for model construction. The model was validated using one third of the samples (including 110 CRC samples).

Immune evaluation and mutation analysis of prognostic characteristic genes

We used TIMER tools ( [24] to assess the immune characteristics of four prognostic characteristic genes (GRP, TNFRSF19, FGF2, and SCG2) in order to determine their relevance to immune cells. Simultaneously, the mutation data of four prognostic characteristic genes were downloaded from TCGA genomic data, and the extracted mutation signatures were visualised using R package Maftools (version 2.10.0) [25] .

Validation of four prognostic characteristic genes from GEO database

To verify the differential expression levels of these four genes (FGF2, POMC, SCG2, and TNFRSF19), we first selected 17 samples with paired paracancer and cancer tissues. The box diagram of the expression of the four genes between the cancer tissue and the paired paracancer tissue samples was then drawn. Paired T test was used to calculate the significance. To verify that these four genes are indeed significantly correlated with prognosis, 550 samples with survival time greater than 30 days were selected first. K–M curve was used to evaluate the association between different gene expression levels and survival prognosis. Expression level higher than or equal to cutoff value is high sample group, expression level lower than cutoff value is low sample group. The cutoff value is judged by the optimal critical value according to the expression value, survival time, and survival state of each gene using R package SurvMiner (Version 0.4.3). To verify the significant association between the four genes and immune cells, we used the Timer algorithm and the Immunedeconv package (version 2.0.0) based on R language [26]. The infiltration levels of macrophages, neutrophils, dendritic cells (DCs), CD8+ T cells, CD4+ T cells and B cells were calculated. Furthermore, spearman correlation and significant P values between the expression level of 4 genes and the level of cell invasion were calculated by corresponding relationship of cancer tissue samples.


Confirmation of differentially expressed IRGs and miRNAs

A total of 4747 differentially expressed genes, including 2490 up-regulated and 2257 down-regulated genes in CRC, were collected by the above screening method. Meanwhile, 426 differential miRNAs were obtained, of which 193 miRNAs were up-regulated and 233 miRNAs were down-regulated in CRC. In the volcano diagram (Fig. 1A), there is a significant difference between the experimental group and the control group (p < 0.05). Then, after intersection of 1793 IRGs downloaded from IMMPort with 4,747 differentially expressed genes, 409 differentially expressed IRGs in CRC were obtained (Fig. 1B).

Fig. 1
figure 1

Identification of differentially expressed IRGs and miRNAs. A The volcano diagram shows up-regulated genes in red, down-regulated genes in green, and no differentially expressed genes in black. The mRNA volcano on the left and miRNA volcano on the right. B Venn diagram, intersection of 4747 differentially expressed genes and 1793 IRGs

Enrichment results of genes

The results showed that these 409 differentially expressed IRGs were significantly enriched in 102 KEGG pathways. As shown in Fig. 2A, the top five enriched pathways were cytokine–cytokine receptor interaction, viral protein interaction with cytokine and cytokine receptor, chemokine signalling pathway, natural killer cell-mediated cytotoxicity, and neuroactive ligand-receptor interaction. Moreover, the GO enrichment analysis showed that ‘cell chemotaxis’, ‘external side of plasma membrane’, and ‘receptor ligand activity’ were the most enriched terms in the BP, CC, and MF, respectively (Fig. 2B–D).

Fig. 2
figure 2

Enrichment results of differentially expressed IRGs. A Results of the KEGG pathway enrichment analysis. B Results of the GO enrichment analysis in the BP. In the Go-BP bubble diagram, the more red the color is, the smaller the P value is and the more significant the Go-BP is. The larger the bubble, the more genes it contains. C Results of the GO enrichment analysis in the CC. D Results of the GO enrichment analysis in the MF

Validation of 47 survival-associated IRGs

Through survival analysis, we obtained 47 IRGs that were significantly associated with survival, of which 18 were positively correlated and 29 were negatively correlated (Table 1). The K–M survival curve also confirmed the survival difference between the high (n = 165) and low (n = 164) expressing populations. As shown in Fig. 3A, the median survival time of the NRG1 high expression group was significantly longer than that of the low expression group (p < 0.05). However, the PGR high expression group showed reduced median survival time (Fig. 3B; p < 0.05).

Table 1 Validation of 47 survival associated IRGs in CRC
Fig. 3
figure 3

Validation of IRGs associated with survival. A Expression level of NRG1 is positively correlated with overall survival. B Expression level of PGR is negatively correlated with survival time. The horizontal axis is survival time, the vertical axis is overall survival, red represents the high gene expression group, black represents the low gene expression group

Survival analysis adjusting for age and tumor stage

We conducted survival analysis adjusting for age and tumor stage at diagnosis via K-M survival curve. First, our results showed significant differences between patients over 60 years of age in the high-low risk group. Although the results were not significant in patients under 60 years of age, the prognosis of the high-risk group was worse than that of the low-risk group (Additional file 4: Fig. S4A, B). Next, there was a significant difference between the high and low risk groups in stage III-IV patients. No significant results were seen in stage I-II patients, but the prognosis of the high-risk group was worse than that of the low-risk group (Additional file 4: Fig. S4C, D).

TF-IRG and miRNA-IRG regulatory networks in CRC

According to the prediction results of the over-representation analysis enrichment method, we obtained 59 pairs of TF-IRG interactions, including 9 TFs (NFAT, COUP, STAT4, TEF1, P53, PPAR, TATA, FREAC2, PU1) and 24 IRGs, of which 7 IRGs (TNFRSF19, TGFB2, GREM1, SPP1, PGF, INHBB, and GRP) were upregulated and 17 IRGs (SEMA6D, BMP5, TPM2, SCG2, NRG1, FABP2, ANGPTL1, POMC, UCN3, COLEC12, RBP2, PTH1R, CCL15, AGTR1, ACVRL1, NTS, and CCL28) were downregulated in CRC. Based on the above results, a complex TF-IRG network diagram for CRC was constructed using Cytoscape (Fig. 4).

Fig. 4
figure 4

TF-IRG regulatory network in CRC. The circle represents IRG, red is up-regulated gene, green is down-regulated gene; the blue diamond represents TF

After integrating the 426 differential miRNAs previously obtained and the targeted miRNAs of the predicted IRGs using the miRWalk tool, we identified 43 miRNAs, 13 IRGs, and 48 miRNA-IRG relationship pairs, as shown in Fig. 5.

Fig. 5
figure 5

MiRNA-IRG regulatory network in CRC. The triangle represents miRNA; Red circles represent up-regulated genes, green circles represent down-regulated genes

Drug–gene interaction network in CRC

Based on the drug–gene interaction information of TF- and miRNA-regulated IRGs in the DGIdb database, we obtained 214 relationship pairs between small drug molecules and IRGs, including 18 IRGs (MAPT, NST, PGR, NRG1, FGF2, TLR3, PTH1R, AGTR1, F2RL1, MTNR1A, VIPR1, BMP5, GRP, BIRC5, SPP1, CD1A, PGF, and MC1R) and 195 drugs. The drug–gene interaction network diagram was plotted using Cytoscape (Fig. 6).

Fig. 6
figure 6

Drug-IRG interaction network in CRC. The yellow hexagon represents small drug molecules; the red circles represent up-regulated genes and the green circles represent down-regulated genes

Screening and modelling of disease characteristic genes

By integrating gene node information in multiple networks, 42 CRC characteristic genes were screened using the LASSO method (Fig. 7A), and four prognostic characteristic genes (POMC, TNFRSF19, FGF2, and SCG2) were obtained by further intersection with 47 survival-related IRGs. To further confirm the predictive effect of these IRGs, 329 CRC samples from TCGA were used to construct a gene model. As shown in Fig. 7B, patients in the low-risk group had a better survival prognosis than those in the high-risk group, which was consistent with the model validation results in Fig. 7C.

Fig. 7
figure 7

Screening and modeling of prognostic characteristic IRGs in CRC. A The characteristic genes were screened by LASSO method. B IRGs model was constructed in TCGA dataset. C Validation diagram of gene model

Immunocorrelation and mutation analysis of prognostic characteristic genes

Our results showed that four prognostic characteristic genes (FGF2, POMC, SCG2, and TNFRSF19) were significantly related to a variety of immune cells. As shown in Fig. 8, the expression of POMC, TNFRSF19, FGF2, and SCG2 was significantly associated with macrophages, neutrophils, DCs, CD8+ T cells, CD4+ T cells and B cells (p < 0.05). Mutation analysis was also performed for the four prognostic characteristic genes, but Figure 9A only shows the summary gene mutation information for TNFRSF19 and SCG2, because there was no gene mutation information for POMC and FGF2 available in TCGA. Due to the small number of genes and mutation sites, the waterfall diagram of the mutation analysis was not obvious (Fig. 9B). Figure 9C displays the overall distribution of six different mutational transformations (C > T, T > C, C > A, C > G, T > G, and T > A).

Fig. 8
figure 8

Immunocorrelation of prognostic characteristic IRGs. A Immune cell correlation of FGF2. B Immune cell correlation of POMC. C Immune cell correlation of SCG2. D Immune cell correlation of TNFRSF19

Fig. 9
figure 9

Mutation analysis of prognostic characteristic IRGs. A Summary statistical of TNFRSF19 and SCG2 mutation information including “Variant Classification”, “Variant Type”, “SNV Class”, “Variants per Sample”, “Variant Classification Summary” and “Top 10 mutated genes”. B Waterfall diagram: mutation statistics in each sample. C The overall distribution of six different mutational transformations

Validation results from the GEO database

As shown in Additional file 1: Fig. S1, it can be found that the expression levels of FGF2 and SCG2 in CRC are significantly down-regulated, while the expression levels of POMC and TNFRSF19 are significantly up-regulated, which is consistent with the previous difference results. In addition, it can be seen from Additional file 2: Fig. S2 that all four genes showed worse prognosis after high expression. Except TNFRSF19, the other three genes were significantly correlated with prognosis (p < 0.05). Further correlation heat maps showed significant correlations between genes (FGF2 and SCG2) and all six immune cells. POMC was significantly correlated with other immune cells except neutrophils. There was also a significant correlation between TNFRSF19 and macrophages and B cells (Additional file 3: Fig. S3). These conclusions are basically consistent with the previous analysis results.


CRC is currently the second leading cause of cancer-related death, with malignant progression and metastasis leading to high mortality in advanced CRC [1]. Immune components in the tumour microenvironment have recently been reported to influence tumour progression in various cancers, including CRC [27]. As some immune cells are further polarised, the adaptive immune response is reversed, ultimately accelerating cancer cell proliferation, tumour angiogenesis, progression, and metastasis [28]. Therefore, the regulation of the tumour immune microenvironment has become an attractive clinical strategy for cancer treatment. With the launch of the first cancer immunotherapy (recombinant cytokine interferon-α for hairy cell leukaemia) in 1986, more than a dozen immunotherapies have been approved for a variety of cancers, including melanoma, advanced stomach cancer, bladder cancer, hepatocellular carcinoma, prostate cancer, kidney cancer, and non-small cell lung cancer [29]. Unlike chemotherapy, which kills cancer cells directly, cancer immunotherapies attack tumour cells by activating the host's immune system with fewer off-target effects [30]. However, the role of IRGs as important immune signatures in CRC has not yet been fully explored. In this study, we acquired 409 differentially expressed IRGs in CRC from TCGA and IMMPort using the above screening methods. Furthermore, KEGG enrichment analyses indicated that these differentially expressed IRGs were significantly associated with 102 cancer signalling pathways. In patients undergoing colorectal cancer surgery, IRGs related to the enrichment pathway for natural killer cell-mediated cytotoxicity was significantly reduced after primary tumour resection [31]. This also confirms the value of these IRGs in the treatment of CRC. In addition, GO enrichment analysis suggested that these IRGs possess multiple molecular functions and engagement in various biological processes such as cell chemotaxis and receptor ligand activity, which are involved in tumour development and metastasis [32, 33].

Based on the prediction and interaction results, we obtained 59 TF-IRG and 48 miRNA-IRG interaction networks in CRC. TFs such as NFAT have been experimentally confirmed to be involved in the development and progression of CRC [34]. Recent studies have also suggested that the expression of NFATc1 is closely related to the clinical stage and metastasis of CRC, and the application of Ca2+–calcineurin–NFAT signalling inhibitors can inhibit CRC metastasis in mouse models [35]. Moreover, another TF that we obtained, P53, not only controls the expression of anticancer genes through transcriptional activity, but also plays a tandem role with various signalling pathways in CRC [36]. The 43 miRNAs obtained also play a variety of roles in cancer genesis, progression, metastasis, and recurrence. For example, high miR-181c expression was significantly associated with recurrence in stage II CRC patients [37]. Furthermore, Hernandez demonstrated that the overexpression of miRNA-26a increased the proliferation and migration rates of CRC cells in vitro [38]. Finally, we constructed 214 drug–IRG regulatory networks based on the drug–gene interaction results of TF- and miRNA-regulated IRGs in CRC. These results provide a strong basis for precision immunotherapy in CRC patients.

The latest global statistics show that the five-year relative survival rate of CRC reached 64% in the United States from 2009 to 2015, was nearly 57% in China from 2012 to 2015, and was less than 50% in many Eastern and Southern European countries [39]. In particular, metastatic CRC has a five-year survival rate of only 14% in Europe, despite advances in treatment [40]. Therefore, early diagnosis and treatment of CRC is highly effective in order to significantly improve the survival rate of patients. In this study, we identified four prognostic genes for CRC (POMC, TNFRSF19, FGF2, and SCG2) by integrating 47 survival-related IRGs and 42 CRC characteristic genes. We believe that these findings may lead to more early diagnostic biomarkers for CRC and improvement of the five-year survival rate of patients. Furthermore, the expression of POMC, TNFRSF19, FGF2, and SCG2 was significantly associated with immune cells, such as macrophages, neutrophils, DCs, CD8+ T cells, CD4+ T cells and B cells. Immune cells in the tumour microenvironment can antagonise or promote tumours. It has been demonstrated that a high proportion of infiltrating dendritic, CD8+ T, and CD4+ T cells leads to better clinical outcomes in CRC patients [41]. However, recent studies suggest that macrophages increase the migration, invasion, and metastatic ability of tumours, reflecting their tumour-promoting effect in CRC [42]. Prognostic genes may therefore serve as targeted entry points for CRC immunotherapy.

We used the information from the group of Li [43] to discuss related analysis of MSI via R software. A total of 342 samples who have MSI information and related gene expression information after matching our information and the data in Li’s database. Then, we conducted statistical analysis via using limma package to compare MSI and MSS. According to the threshold FDR<0.05 and | logFC | > 1, a total of 1294 differentially expressed genes were obtained. Next, we got 160 differentially IRGs after taking the intersection between the above 1294 differentially expressed genes and the IRGs in IMMPort database ( However, no statistical difference of immune gene expressions was observed between MSI and MSS groups. In order to confirm the conclusion, we conducted an analysis about the levels of infiltration of 22 immune cells in each sample based on gene expression matrix via CIBERSORT [44]. Except for T cell Gamma Delta and Neutrophi, which showed a significant difference in the level of cell infiltration, the other cells showed no significant difference in infiltration between the two groups, which once again proved that there was little difference in immunity between MSI and MSS groups in the samples of this study (Additional file 5: Fig. S5).


In this study, bioinformatics analysis revealed 59 TF-IRG and 48 miRNA-IRG regulatory networks in CRC, which provides theoretical basis for further improving the biological mechanism of CRC occurrence, development and metastasis. We also identified several valid characteristic survival-related IRGs (POMC, TNFRSF19, FGF2, and SCG2) that could effectively assess the prognosis of patients with CRC. These potential immune biomarkers could be used to develop precise and effective personalised immunotherapy strategies for CRC patients.

Availability of data and materials

The datasets used in this study are available from the TCGA database (, GEO database (, the IMMPort database (, and the DGIdb database (



Colorectal cancer


Immune-related gene


The cancer genome atlas


Immune checkpoint inhibitors


Immunology database and analysis portal




Kyoto encyclopedia of genes and genomes


Gene expression omnibus


Gene ontology


Molecular function


Cellular component


Biological process




Transcription factor


  1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, Bray F. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 Countries. CA Cancer J Clin. 2021;71(3):209–49.

    Article  Google Scholar 

  2. Ganesh K, Stadler ZK, Cercek A, Mendelsohn RB, Shia J, Segal NH, Diaz LA Jr. Immunotherapy in colorectal cancer: rationale, challenges and potential. Nat Rev Gastroenterol Hepatol. 2019;16(6):361–75.

    Article  Google Scholar 

  3. Borst J, Ahrends T, Babala N, Melief CJM, Kastenmuller W. CD4(+) T cell help in cancer immunology and immunotherapy. Nat Rev Immunol. 2018;18(10):635–47.

    CAS  Article  Google Scholar 

  4. Guedan S, Ruella M, June CH. Emerging cellular therapies for cancer. Annu Rev Immunol. 2019;37:145–71.

    CAS  Article  Google Scholar 

  5. Liu C, Liu R, Wang B, Lian J, Yao Y, Sun H, Zhang C, Fang L, Guan X, Shi J, et al. Blocking IL-17A enhances tumor response to anti-PD-1 immunotherapy in microsatellite stable colorectal cancer. J Immuno Ther Cancer. 2021;9(1):e001895.

    Article  Google Scholar 

  6. Lin P, Guo YN, Shi L, Li XJ, Yang H, He Y, Li Q, Dang YW, Wei KL, Chen G. Development of a prognostic index based on an immunogenomic landscape analysis of papillary thyroid cancer. Aging. 2019;11(2):480–500.

    CAS  Article  Google Scholar 

  7. Sun S, Guo W, Wang Z, Wang X, Zhang G, Zhang H, Li R, Gao Y, Qiu B, Tan F, et al. Development and validation of an immune-related prognostic signature in lung adenocarcinoma. Cancer Med. 2020;9(16):5960–75.

    CAS  Article  Google Scholar 

  8. Chen W, Ou M, Tang D, Dai Y, Du W. Identification and validation of immune-related gene prognostic signature for hepatocellular carcinoma. J Immunol Res. 2020;2020:5494858.

    PubMed  PubMed Central  Google Scholar 

  9. Huang Y, Chen L, Tang Z, Min Y, Yu W, Yang G, Zhang L. A novel immune and stroma related prognostic marker for invasive breast cancer in tumor microenvironment: a TCGA based study. Front Endocrinol. 2021;12: 774244.

    Article  Google Scholar 

  10. Zhang L, Li J, Zhang M, Wang L, Yang T, Shao Q, Liang X, Ma M, Zhang N, Jing M, et al. Identification of a six-gene prognostic signature characterized by tumor microenvironment immune profiles in clear cell renal cell carcinoma. Front Genet. 2021;12: 722421.

    CAS  Article  Google Scholar 

  11. Goldman M, Craft B, Swatloski T, Cline M, Morozova O, Diekhans M, Haussler D, Zhu J. The UCSC cancer genomics browser: update 2015. Nucleic Acids Res. 2015;43(D1):D812–7.

    CAS  Article  PubMed  Google Scholar 

  12. Barrett T, Troup DB, Wilhite SE, Ledoux P, Rudnev D, Evangelista C, Kim IF, Soboleva A, Tomashevsky M, Edgar R. NCBI GEO: mining tens of millions of expression profiles--database and tools update. Nucleic Acids Res. 2007;35(Database):D760–5.

    CAS  Article  PubMed  Google Scholar 

  13. Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–40.

    CAS  Article  Google Scholar 

  14. Bhattacharya S, Dunn P, Thomas CG, Smith B, Schaefer H, Chen J, Hu Z, Zalocusky KA, Shankar RD, Shen-Orr SS, et al. ImmPort, toward repurposing of open access immunological assay data for translational and clinical research. Sci Data. 2018;5: 180015.

    CAS  Article  Google Scholar 

  15. Tianzhi W, Erqiang H, Shuangbin X, Chen M, Guo P, Dai Z, Feng T, Zhou L, et al. clusterProfiler 4.0: a universal enrichment tool for interpreting omics data. Innov. 2021;2(3):100141.

    CAS  Article  Google Scholar 

  16. Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28(1):27–30.

    CAS  Article  Google Scholar 

  17. Kanehisa M. Toward understanding the origin and evolution of cellular organisms. Protein Sci. 2019;28(11):1947–51.

    CAS  Article  Google Scholar 

  18. Kanehisa M, Furumichi M, Sato Y, Ishiguro-Watanabe M, Tanabe M. KEGG: integrating viruses and cellular organisms. Nucleic Acids Res. 2021;49(D1):D545–51.

    CAS  Article  Google Scholar 

  19. Gene Ontology C. Gene ontology consortium: going forward. Nucleic Acids Res. 2015;43(D1):D1049–56.

    CAS  Article  Google Scholar 

  20. Liao Y, Wang J, Jaehnig EJ, Shi Z, Zhang B. WebGestalt 2019: gene set analysis toolkit with revamped UIs and APIs. Nucleic Acids Res. 2019;47(W1):W199–205.

    CAS  Article  Google Scholar 

  21. Otasek D, Morris JH, Boucas J, Pico AR, Demchak B. Cytoscape Automation: empowering workflow-based network analysis. Genome Biol. 2019;20(1):185.

    Article  Google Scholar 

  22. Sticht C, De La Torre C, Parveen A, Gretz N. miRWalk: An online resource for prediction of microRNA binding sites. PLoS ONE. 2018;13(10): e0206239.

    Article  Google Scholar 

  23. Cotto KC, Wagner AH, Feng YY, Kiwala S, Coffman AC, Spies G, Wollam A, Spies NC, Griffith OL, Griffith M. DGIdb 3.0: a redesign and expansion of the drug–gene interaction database. Nucleic Acids Res. 2018;46(D1):D1068–73.

    CAS  Article  PubMed  Google Scholar 

  24. Li T, Fan J, Wang B, Traugh N, Chen Q, Liu JS, Li B, Liu XS. TIMER: A web server for comprehensive analysis of tumor-infiltrating immune cells. Cancer Res. 2017;77(21):e108–10.

    CAS  Article  Google Scholar 

  25. Mayakonda A, Lin DC, Assenov Y, Plass C, Koeffler HP. Maftools: efficient and comprehensive analysis of somatic variants in cancer. Genome Res. 2018;28(11):1747–56.

    CAS  Article  Google Scholar 

  26. Sturm G, Finotello F, List M. Immunedeconv: An R package for unified access to computational methods for estimating immune cell fractions from Bulk RNA-sequencing data. Methods Mol Biol. 2020;2120:223–32.

    CAS  Article  Google Scholar 

  27. Sharma P, Hu-Lieskovan S, Wargo JA, Ribas A. Primary, adaptive, and acquired resistance to cancer immunotherapy. Cell. 2017;168(4):707–23.

    CAS  Article  Google Scholar 

  28. Zhang Y, Song J, Zhao Z, Yang M, Chen M, Liu C, Ji J, Zhu D. Single-cell transcriptome analysis reveals tumor immune microenvironment heterogenicity and granulocytes enrichment in colorectal cancer liver metastases. Cancer Lett. 2020;470:84–94.

    CAS  Article  Google Scholar 

  29. Riley RS, June CH, Langer R, Mitchell MJ. Delivery technologies for cancer immunotherapy. Nat Rev Drug Discov. 2019;18(3):175–96.

    CAS  Article  Google Scholar 

  30. Bergman PJ. Cancer immunotherapies. Vet Clin North Am Small Anim Pract. 2019;49(5):881–902.

    Article  Google Scholar 

  31. Niavarani SR, Lawson C, Bakos O, Boudaud M, Batenchuk C, Rouleau S, Tai LH. Lipid accumulation impairs natural killer cell cytotoxicity and tumor control in the postoperative period. BMC Cancer. 2019;19(1):823.

    Article  Google Scholar 

  32. Karin N, Razon H. Chemokines beyond chemo-attraction: CXCL10 and its significant role in cancer and autoimmunity. Cytokine. 2018;109:24–8.

    CAS  Article  Google Scholar 

  33. Du Z, Lovly CM. Mechanisms of receptor tyrosine kinase activation in cancer. Mol Cancer. 2018;17(1):58.

    Article  Google Scholar 

  34. Gang W, Yu-Zhu W, Yang Y, Feng S, Xing-Li F, Heng Z. The critical role of calcineurin/NFAT (C/N) pathways and effective antitumor prospect for colorectal cancers. J Cell Biochem. 2019;120(12):19254–73.

    Article  Google Scholar 

  35. Shen T, Yue C, Wang X, Wang Z, Wu Y, Zhao C, Chang P, Sun X, Wang W. NFATc1 promotes epithelial-mesenchymal transition and facilitates colorectal cancer metastasis by targeting SNAI1. Exp Cell Res. 2021;408(1): 112854.

    CAS  Article  Google Scholar 

  36. Cho YH, Ro EJ, Yoon JS, Mizutani T, Kang DW, Park JC, Il Kim T, Clevers H, Choi KY. 5-FU promotes stemness of colorectal cancer via p53-mediated WNT/beta-catenin pathway activation. Nat Commun. 2020;11(1):5321.

    CAS  Article  Google Scholar 

  37. Yamazaki N, Koga Y, Taniguchi H, Kojima M, Kanemitsu Y, Saito N, Matsumura Y. High expression of miR-181c as a predictive marker of recurrence in stage II colorectal cancer. Oncotarget. 2017;8(4):6970–83.

    Article  Google Scholar 

  38. Coronel-Hernandez J, Lopez-Urrutia E, Contreras-Romero C, Delgado-Waldo I, Figueroa-Gonzalez G, Campos-Parra AD, Salgado-Garcia R, Martinez-Gutierrez A, Rodriguez-Morales M, Jacobo-Herrera N, et al. Cell migration and proliferation are regulated by miR-26a in colorectal cancer via the PTEN-AKT axis. Cancer Cell Int. 2019;19:80.

    Article  Google Scholar 

  39. Li N, Lu B, Luo C, Cai J, Lu M, Zhang Y, Chen H, Dai M. Incidence, mortality, survival, risk factor and screening of colorectal cancer: a comparison among China, Europe, and northern America. Cancer Lett. 2021;522:255–68.

    CAS  Article  Google Scholar 

  40. Mauri G, Sartore-Bianchi A, Russo AG, Marsoni S, Bardelli A, Siena S. Early-onset colorectal cancer in young individuals. Mol Oncol. 2019;13(2):109–31.

    Article  Google Scholar 

  41. Picard E, Verschoor CP, Ma GW, Pawelec G. Relationships between immune landscapes, genetic subtypes and responses to immunotherapy in colorectal cancer. Front Immunol. 2020;11:369.

    CAS  Article  Google Scholar 

  42. Wei C, Yang C, Wang S, Shi D, Zhang C, Lin X, Liu Q, Dou R, Xiong B. Crosstalk between cancer cells and tumor associated macrophages is required for mesenchymal circulating tumor cell-mediated colorectal cancer metastasis. Mol Cancer. 2019;18(1):64.

    Article  Google Scholar 

  43. Li L, Feng Q, Wang X. PreMSIm: an R package for predicting microsatellite instability from the expression profiling of a gene panel in cancer. Comput Struct Biotechnol J. 2020;18:668–75.

    CAS  Article  Google Scholar 

  44. Chen B, Khodadoust MS, Liu CL, Newman AM, Alizadeh AA. Profiling tumor infiltrating immune cells with CIBERSORT. Methods Mol Biol. 2018;1711:243–59.

    CAS  Article  Google Scholar 

Download references


Not applicable.


This work was supported by: Wuxi Traditional Chinese Medicine Hospital Inheritance Studio construction project [2020 No.5].

Author information

Authors and Affiliations



WSW, LG are responsible for the project design. CL, JF collect data from the TCGA, GEO, IMMPort, and DGIdb database. WSW, CL completed data analysis and manuscript writing. WSW, LG revised the manuscript. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Shuwei Wang or Gan Li.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1. Fig. S1

: The box diagram of expression of prognostic characteristic IRGs. From A to D, the expression box diagram of FGF2, SCG2, POMC, and TNFRSF19 is shown in the figure. Green is the paracancer tissue, yellow is the cancer tissue, and the line in the middle indicates that they belong to the same sample.

Additional file 2. Fig. S2

: Correlation between IRGs and prognosis from GEO database. The survival curves of FGF2, SCG2, POMC, and TNFRSF19 were shown from A to D. In the figure, red represents high expression group and black represents low expression group.

Additional file 3. Fig. S3

: Validation of immunocorrelation of IRGs from GEO database. From left to right are heat maps of correlations between FGF2, POMC, SCG2, TNFRSF19 and immune cells. The top left corner of each small square in the figure represents significance, and * represents p < 0.05, ** represents p < 0.01. The lower right corner shows correlation, green to red shows significance from negative to positive, and the deeper the correlation coefficient is, the greater the absolute value.

Additional file 4. Fig. S4

: The K-M survival curve of survival analysis adjusting for age and tumor stage. (A) K-M survival curve of patients under 60 years in the high-low risk group. (B) K-M survival curve of patients over 60 years in the high-low risk group. (C) K-M survival curve of patients in stage I-II. (D) K-M survival curve of patients in stage III-IV.

Additional file 5. Fig. S5

: The levels of immune cells infiltration in MSI and MSS groups. T cell Gamma Delta and Neutrophi showed a significant difference in the level of cell infiltration (p < 0.05), while the other cells showed no significant difference in infiltration between the two groups (p > 0.05).

Additional file 6. Table S6

: Details of 1793 different IRGs collected from the IMMPort database.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Wang, S., Cheng, L., Jing, F. et al. Screening and identification of immune-related genes for immunotherapy and prognostic assessment in colorectal cancer patients. BMC Med Genomics 15, 177 (2022).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Colorectal cancer
  • Immune-related gene
  • Prognostic value
  • Immunotherapy