- Open Access
Screening and identification of immune-related genes for immunotherapy and prognostic assessment in colorectal cancer patients
BMC Medical Genomics volume 15, Article number: 177 (2022)
Increasing evidence indicates that the immune microenvironment plays a key role in the genesis and progression of colorectal cancer (CRC). This study aimed to establish an immune-related gene (IRG) signature and determine its clinical prognostic value in patients with CRC.
The RNA sequencing and associated clinical data of CRC were downloaded from The Cancer Genome Atlas (TCGA) database. We then screened for differentially expressed IRGs by intersecting with IRGs obtained from the Immunology Database and Analysis Portal. Functional enrichment analyses were carried out to determine the potential biological functions and pathways of the IRGs. We also explored the specific molecular mechanisms of the IRGs by constructing regulatory networks. Prognostic IRGs were obtained by LASSO regression analysis, and subsequently, gene models were constructed in the TCGA dataset to confirm the predictive capacity of these IRGs. Finally, we used the TIMER tool to assess the immune properties of prognostic IRGs and correlate them with immune cells.
We identified 409 differentially expressed IRGs in patients with CRC. Kyoto Encyclopaedia of Genes and Genomes and Gene Ontology enrichment analyses suggested that these differentially expressed IRGs were significantly related to 102 cancer signalling pathways and various biological functions. Based on the prediction and interaction results, we obtained 59 TF–IRG, 48 miRNA–IRG, and 214 drug–IRG interaction networks for CRC. Four prognostic genes (POMC, TNFRSF19, FGF2, and SCG2) were developed by integrating 47 survival-related IRGs and 42 characteristic CRC genes. The results of gene model showed that patients in the low risk group had better survival outcomes compared to those in the high risk group. The expression of POMC, TNFRSF19, FGF2, and SCG2 was significantly correlated with immune cells.
This study identified some valid IRGs, and these findings can provide strong evidence for precision immunotherapy in patients with CRC.
Colorectal cancer (CRC) was the third most commonly diagnosed cancer and the second leading cause of cancer death in 2020, with approximately 1.9 million new cases and 935,000 deaths worldwide, representing approximately one in ten cancer cases and deaths . Although the five-year survival rate for CRC patients has improved with early screening in developed countries, the outcome for patients with advanced CRC remains unsatisfactory, with a median five-year survival rate of only 12.5% in the USA . Therefore, it is necessary to identify specific biomarkers for early diagnosis and potential therapeutic targets in CRC.
Immunotherapy is gradually becoming the standard treatment for cancer and is as important as surgery, radiotherapy, and chemotherapy. Cancer immunotherapy is designed to promote the immune response of tumour-specific T cells . When fully reprogrammed, T cells are considered the most powerful anti-cancer immune cells. Immunotherapy has not only produced unprecedented clinical results in patients with refractory tumours but has also brought long-term clinical remission to patients with diseases that were historically considered incurable . In recent years, the advent of immune checkpoint inhibitors (ICIs), such as anti-PD-1, has opened up a new landscape for cancer immunotherapy. Nevertheless, the use of ICIs in CRC is currently limited to patients with high microsatellite instability and is only 5–10% effective in CRC patients with microsatellite stability (approximately 90%) . Hence, it is necessary to explore reliable immune-related genes (IRGs) as important immune signatures to improve efficacy and predict prognosis in patients with CRC.
With the presentation of large-scale publicly available gene expression databases, researchers have been able to quickly and accurately identify potential biomarkers for tumour surveillance . The Cancer Genome Atlas (TCGA) is a commonly used database that contains a large amount of transcriptome data and can provide many tumour samples. Multiple immune-related prognostic signatures for lung adenocarcinoma , hepatocellular carcinoma , breast cancer , and clear cell renal cell carcinoma  were established from TCGA.
Our study integrated differentially expressed genes obtained from TCGA with IRGs collected from the Immunology Database and Analysis Portal (IMMPort) and conducted an in-depth mining analysis of CRC data. We then analysed and processed the IRGs further by using functional enrichment analyses and regulatory network construction. In addition, we discovered new immune biomarkers associated with CRC prognosis applying LASSO regression analysis. We hope that these findings will lead to accurate prognostic assessment and effective immunotherapy strategies for patients with CRC.
Original data acquisition
The RNA-sequencing and miRNA data were downloaded from TCGA using the UCSC Xena browser (https://xenabrowser.net/datapages/) . The corresponding clinical data for CRC included 353 samples (342 tumour samples and 11 normal samples). The counts per million values were obtained by transforming the original data.
We downloaded the CRC data set numbered GSE39582 from the National Center for Biotechnology Information GEO (Gene Expression Omnibus) (https://www.ncbi.nlm.nih.gov/) database . The data set was processed by the original author and standardized probe expression matrix was downloaded. Meanwhile, the probe annotation information of corresponding platform was downloaded. Convert the probe to gene symbol and eliminate the probe that is not compared to gene symbol. For multiple probes mapped to the same gene symbol, the average value of probes was taken as the expression level of the gene. Then the expression values of four genes FGF2, POMC, SCG2, and TNFRSF19 were selected for subsequent analysis.
Differential immune-related genes and miRNA screening
The samples were divided into tumour and normal groups. The TMM algorithm in the R (Version 4.1.1) software package edgeR (Version 3.36.0)  was used to standardise the raw count and transform it into counts per million, which was used for subsequent analysis. The significance of differences in gene expression was calculated using an unpaired t-test and corrected by applying the Benjamini–Hochberg (BH) procedure. Threshold | logFC | > 1 and p-value < 0.05 were selected as significant differences in miRNAs and genes expression. The IRGs were collected from the IMMPort database  (https://www.immport.org/shared/home), and 1793 different IRGs associated with human cancers were screened out (Additional file 6: Table S1). These genes were intersected with 4747 differentially expressed genes to obtain differentially expressed IRGs in CRC.
Functional enrichment analyses of IRGs
To explore the potential biological functions and pathways of 409 differentially expressed IRGs, the R software package clusterProfiler  was applied to conduct the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway [16,17,18] and Gene Ontology (GO)  enrichment analysis for IRGs. GO has three ontologies: molecular function (MF), cellular component (CC), and biological process (BP). The results with a p-value < 0.05 after BH correction were selected as the most significant enrichment results.
Survival-associated IRG screening
Clinical survival information and gene expression data from patients with CRC were extracted from TCGA. We used the survival R package (version 2.41-3, https://CRAN.R-project.org/view=Survival) to analyse the impact of differentially expressed IRGs on patient survival and prognosis. Subsequently, we plotted the Kaplan–Meier (K–M) curve to compare overall survival for high- and low-risk expressions, and survival-associated IRGs were identified using a log-rank test (p < 0.05).
Construction of transcription factor (TF)-IRG and miRNA-IRG regulatory network
The over-representation analysis enrichment method was applied to predict TF-target enrichment of differentially expressed genes in protein–protein interaction network, using WebGestalt GAST  (http://www.webgestalt.org/option.php). The species selected was hsapiens, the enrichment parameter (the minimum number of enrichment genes) was set at 2, and the results of the Top 10 were displayed. The TF-target gene interaction relations were obtained using Cytoscape  (version 3.6.0, http://www.cytoscape.org/) to draw TF-IRG regulatory network.
We used the miRWalk  tool to predict regulatory miRNAs of differentially expressed IRGs. The miRNAs obtained by our previous differential analysis were screened out from these miRNAs to construct the relationship between differentially expressed miRNAs and IRGs. Finally, the miRNA-IRG regulatory network was mapped using Cytoscape.
Establishment of drug–gene interaction network
According to the drug prediction database DGIdb (http://www.dgidb.org/) , drug–gene interactions of key differential genes regulated by miRNA and TFs were further predicted by the filtering parameter ‘FDA-approved’. We then constructed the drug–gene interaction network based on the prediction results, using Cytoscape software.
Prognostic characteristic gene screening and model construction
We screened the characteristic genes of CRC using LASSO regression analysis and integrated them with 47 survival-related IRGs to obtain the prognostic characteristic genes of CRC. The lambda value of the LASSO filter was set to 0.004 by iterative calculation. To confirm the predictive capacity of these IRGs, two thirds of the samples (including 219 CRC samples) in the TCGA dataset were randomly selected using the R language for model construction. The model was validated using one third of the samples (including 110 CRC samples).
Immune evaluation and mutation analysis of prognostic characteristic genes
We used TIMER tools (https://cistrome.shinyapps.io/timer/)  to assess the immune characteristics of four prognostic characteristic genes (GRP, TNFRSF19, FGF2, and SCG2) in order to determine their relevance to immune cells. Simultaneously, the mutation data of four prognostic characteristic genes were downloaded from TCGA genomic data, and the extracted mutation signatures were visualised using R package Maftools (version 2.10.0)  .
Validation of four prognostic characteristic genes from GEO database
To verify the differential expression levels of these four genes (FGF2, POMC, SCG2, and TNFRSF19), we first selected 17 samples with paired paracancer and cancer tissues. The box diagram of the expression of the four genes between the cancer tissue and the paired paracancer tissue samples was then drawn. Paired T test was used to calculate the significance. To verify that these four genes are indeed significantly correlated with prognosis, 550 samples with survival time greater than 30 days were selected first. K–M curve was used to evaluate the association between different gene expression levels and survival prognosis. Expression level higher than or equal to cutoff value is high sample group, expression level lower than cutoff value is low sample group. The cutoff value is judged by the optimal critical value according to the expression value, survival time, and survival state of each gene using R package SurvMiner (Version 0.4.3). To verify the significant association between the four genes and immune cells, we used the Timer algorithm and the Immunedeconv package (version 2.0.0) based on R language . The infiltration levels of macrophages, neutrophils, dendritic cells (DCs), CD8+ T cells, CD4+ T cells and B cells were calculated. Furthermore, spearman correlation and significant P values between the expression level of 4 genes and the level of cell invasion were calculated by corresponding relationship of cancer tissue samples.
Confirmation of differentially expressed IRGs and miRNAs
A total of 4747 differentially expressed genes, including 2490 up-regulated and 2257 down-regulated genes in CRC, were collected by the above screening method. Meanwhile, 426 differential miRNAs were obtained, of which 193 miRNAs were up-regulated and 233 miRNAs were down-regulated in CRC. In the volcano diagram (Fig. 1A), there is a significant difference between the experimental group and the control group (p < 0.05). Then, after intersection of 1793 IRGs downloaded from IMMPort with 4,747 differentially expressed genes, 409 differentially expressed IRGs in CRC were obtained (Fig. 1B).
Enrichment results of genes
The results showed that these 409 differentially expressed IRGs were significantly enriched in 102 KEGG pathways. As shown in Fig. 2A, the top five enriched pathways were cytokine–cytokine receptor interaction, viral protein interaction with cytokine and cytokine receptor, chemokine signalling pathway, natural killer cell-mediated cytotoxicity, and neuroactive ligand-receptor interaction. Moreover, the GO enrichment analysis showed that ‘cell chemotaxis’, ‘external side of plasma membrane’, and ‘receptor ligand activity’ were the most enriched terms in the BP, CC, and MF, respectively (Fig. 2B–D).
Validation of 47 survival-associated IRGs
Through survival analysis, we obtained 47 IRGs that were significantly associated with survival, of which 18 were positively correlated and 29 were negatively correlated (Table 1). The K–M survival curve also confirmed the survival difference between the high (n = 165) and low (n = 164) expressing populations. As shown in Fig. 3A, the median survival time of the NRG1 high expression group was significantly longer than that of the low expression group (p < 0.05). However, the PGR high expression group showed reduced median survival time (Fig. 3B; p < 0.05).
Survival analysis adjusting for age and tumor stage
We conducted survival analysis adjusting for age and tumor stage at diagnosis via K-M survival curve. First, our results showed significant differences between patients over 60 years of age in the high-low risk group. Although the results were not significant in patients under 60 years of age, the prognosis of the high-risk group was worse than that of the low-risk group (Additional file 4: Fig. S4A, B). Next, there was a significant difference between the high and low risk groups in stage III-IV patients. No significant results were seen in stage I-II patients, but the prognosis of the high-risk group was worse than that of the low-risk group (Additional file 4: Fig. S4C, D).
TF-IRG and miRNA-IRG regulatory networks in CRC
According to the prediction results of the over-representation analysis enrichment method, we obtained 59 pairs of TF-IRG interactions, including 9 TFs (NFAT, COUP, STAT4, TEF1, P53, PPAR, TATA, FREAC2, PU1) and 24 IRGs, of which 7 IRGs (TNFRSF19, TGFB2, GREM1, SPP1, PGF, INHBB, and GRP) were upregulated and 17 IRGs (SEMA6D, BMP5, TPM2, SCG2, NRG1, FABP2, ANGPTL1, POMC, UCN3, COLEC12, RBP2, PTH1R, CCL15, AGTR1, ACVRL1, NTS, and CCL28) were downregulated in CRC. Based on the above results, a complex TF-IRG network diagram for CRC was constructed using Cytoscape (Fig. 4).
After integrating the 426 differential miRNAs previously obtained and the targeted miRNAs of the predicted IRGs using the miRWalk tool, we identified 43 miRNAs, 13 IRGs, and 48 miRNA-IRG relationship pairs, as shown in Fig. 5.
Drug–gene interaction network in CRC
Based on the drug–gene interaction information of TF- and miRNA-regulated IRGs in the DGIdb database, we obtained 214 relationship pairs between small drug molecules and IRGs, including 18 IRGs (MAPT, NST, PGR, NRG1, FGF2, TLR3, PTH1R, AGTR1, F2RL1, MTNR1A, VIPR1, BMP5, GRP, BIRC5, SPP1, CD1A, PGF, and MC1R) and 195 drugs. The drug–gene interaction network diagram was plotted using Cytoscape (Fig. 6).
Screening and modelling of disease characteristic genes
By integrating gene node information in multiple networks, 42 CRC characteristic genes were screened using the LASSO method (Fig. 7A), and four prognostic characteristic genes (POMC, TNFRSF19, FGF2, and SCG2) were obtained by further intersection with 47 survival-related IRGs. To further confirm the predictive effect of these IRGs, 329 CRC samples from TCGA were used to construct a gene model. As shown in Fig. 7B, patients in the low-risk group had a better survival prognosis than those in the high-risk group, which was consistent with the model validation results in Fig. 7C.
Immunocorrelation and mutation analysis of prognostic characteristic genes
Our results showed that four prognostic characteristic genes (FGF2, POMC, SCG2, and TNFRSF19) were significantly related to a variety of immune cells. As shown in Fig. 8, the expression of POMC, TNFRSF19, FGF2, and SCG2 was significantly associated with macrophages, neutrophils, DCs, CD8+ T cells, CD4+ T cells and B cells (p < 0.05). Mutation analysis was also performed for the four prognostic characteristic genes, but Figure 9A only shows the summary gene mutation information for TNFRSF19 and SCG2, because there was no gene mutation information for POMC and FGF2 available in TCGA. Due to the small number of genes and mutation sites, the waterfall diagram of the mutation analysis was not obvious (Fig. 9B). Figure 9C displays the overall distribution of six different mutational transformations (C > T, T > C, C > A, C > G, T > G, and T > A).
Validation results from the GEO database
As shown in Additional file 1: Fig. S1, it can be found that the expression levels of FGF2 and SCG2 in CRC are significantly down-regulated, while the expression levels of POMC and TNFRSF19 are significantly up-regulated, which is consistent with the previous difference results. In addition, it can be seen from Additional file 2: Fig. S2 that all four genes showed worse prognosis after high expression. Except TNFRSF19, the other three genes were significantly correlated with prognosis (p < 0.05). Further correlation heat maps showed significant correlations between genes (FGF2 and SCG2) and all six immune cells. POMC was significantly correlated with other immune cells except neutrophils. There was also a significant correlation between TNFRSF19 and macrophages and B cells (Additional file 3: Fig. S3). These conclusions are basically consistent with the previous analysis results.
CRC is currently the second leading cause of cancer-related death, with malignant progression and metastasis leading to high mortality in advanced CRC . Immune components in the tumour microenvironment have recently been reported to influence tumour progression in various cancers, including CRC . As some immune cells are further polarised, the adaptive immune response is reversed, ultimately accelerating cancer cell proliferation, tumour angiogenesis, progression, and metastasis . Therefore, the regulation of the tumour immune microenvironment has become an attractive clinical strategy for cancer treatment. With the launch of the first cancer immunotherapy (recombinant cytokine interferon-α for hairy cell leukaemia) in 1986, more than a dozen immunotherapies have been approved for a variety of cancers, including melanoma, advanced stomach cancer, bladder cancer, hepatocellular carcinoma, prostate cancer, kidney cancer, and non-small cell lung cancer . Unlike chemotherapy, which kills cancer cells directly, cancer immunotherapies attack tumour cells by activating the host's immune system with fewer off-target effects . However, the role of IRGs as important immune signatures in CRC has not yet been fully explored. In this study, we acquired 409 differentially expressed IRGs in CRC from TCGA and IMMPort using the above screening methods. Furthermore, KEGG enrichment analyses indicated that these differentially expressed IRGs were significantly associated with 102 cancer signalling pathways. In patients undergoing colorectal cancer surgery, IRGs related to the enrichment pathway for natural killer cell-mediated cytotoxicity was significantly reduced after primary tumour resection . This also confirms the value of these IRGs in the treatment of CRC. In addition, GO enrichment analysis suggested that these IRGs possess multiple molecular functions and engagement in various biological processes such as cell chemotaxis and receptor ligand activity, which are involved in tumour development and metastasis [32, 33].
Based on the prediction and interaction results, we obtained 59 TF-IRG and 48 miRNA-IRG interaction networks in CRC. TFs such as NFAT have been experimentally confirmed to be involved in the development and progression of CRC . Recent studies have also suggested that the expression of NFATc1 is closely related to the clinical stage and metastasis of CRC, and the application of Ca2+–calcineurin–NFAT signalling inhibitors can inhibit CRC metastasis in mouse models . Moreover, another TF that we obtained, P53, not only controls the expression of anticancer genes through transcriptional activity, but also plays a tandem role with various signalling pathways in CRC . The 43 miRNAs obtained also play a variety of roles in cancer genesis, progression, metastasis, and recurrence. For example, high miR-181c expression was significantly associated with recurrence in stage II CRC patients . Furthermore, Hernandez demonstrated that the overexpression of miRNA-26a increased the proliferation and migration rates of CRC cells in vitro . Finally, we constructed 214 drug–IRG regulatory networks based on the drug–gene interaction results of TF- and miRNA-regulated IRGs in CRC. These results provide a strong basis for precision immunotherapy in CRC patients.
The latest global statistics show that the five-year relative survival rate of CRC reached 64% in the United States from 2009 to 2015, was nearly 57% in China from 2012 to 2015, and was less than 50% in many Eastern and Southern European countries . In particular, metastatic CRC has a five-year survival rate of only 14% in Europe, despite advances in treatment . Therefore, early diagnosis and treatment of CRC is highly effective in order to significantly improve the survival rate of patients. In this study, we identified four prognostic genes for CRC (POMC, TNFRSF19, FGF2, and SCG2) by integrating 47 survival-related IRGs and 42 CRC characteristic genes. We believe that these findings may lead to more early diagnostic biomarkers for CRC and improvement of the five-year survival rate of patients. Furthermore, the expression of POMC, TNFRSF19, FGF2, and SCG2 was significantly associated with immune cells, such as macrophages, neutrophils, DCs, CD8+ T cells, CD4+ T cells and B cells. Immune cells in the tumour microenvironment can antagonise or promote tumours. It has been demonstrated that a high proportion of infiltrating dendritic, CD8+ T, and CD4+ T cells leads to better clinical outcomes in CRC patients . However, recent studies suggest that macrophages increase the migration, invasion, and metastatic ability of tumours, reflecting their tumour-promoting effect in CRC . Prognostic genes may therefore serve as targeted entry points for CRC immunotherapy.
We used the information from the group of Li  to discuss related analysis of MSI via R software. A total of 342 samples who have MSI information and related gene expression information after matching our information and the data in Li’s database. Then, we conducted statistical analysis via using limma package to compare MSI and MSS. According to the threshold FDR<0.05 and | logFC | > 1, a total of 1294 differentially expressed genes were obtained. Next, we got 160 differentially IRGs after taking the intersection between the above 1294 differentially expressed genes and the IRGs in IMMPort database (https://www.immport.org/shared/home). However, no statistical difference of immune gene expressions was observed between MSI and MSS groups. In order to confirm the conclusion, we conducted an analysis about the levels of infiltration of 22 immune cells in each sample based on gene expression matrix via CIBERSORT . Except for T cell Gamma Delta and Neutrophi, which showed a significant difference in the level of cell infiltration, the other cells showed no significant difference in infiltration between the two groups, which once again proved that there was little difference in immunity between MSI and MSS groups in the samples of this study (Additional file 5: Fig. S5).
In this study, bioinformatics analysis revealed 59 TF-IRG and 48 miRNA-IRG regulatory networks in CRC, which provides theoretical basis for further improving the biological mechanism of CRC occurrence, development and metastasis. We also identified several valid characteristic survival-related IRGs (POMC, TNFRSF19, FGF2, and SCG2) that could effectively assess the prognosis of patients with CRC. These potential immune biomarkers could be used to develop precise and effective personalised immunotherapy strategies for CRC patients.
Availability of data and materials
The datasets used in this study are available from the TCGA database (https://xenabrowser.net/datapages/), GEO database (https://www.ncbi.nlm.nih.gov/), the IMMPort database (https://www.immport.org/shared/home), and the DGIdb database (http://www.dgidb.org/).
The cancer genome atlas
Immune checkpoint inhibitors
Immunology database and analysis portal
Kyoto encyclopedia of genes and genomes
Gene expression omnibus
Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, Bray F. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 Countries. CA Cancer J Clin. 2021;71(3):209–49.
Ganesh K, Stadler ZK, Cercek A, Mendelsohn RB, Shia J, Segal NH, Diaz LA Jr. Immunotherapy in colorectal cancer: rationale, challenges and potential. Nat Rev Gastroenterol Hepatol. 2019;16(6):361–75.
Borst J, Ahrends T, Babala N, Melief CJM, Kastenmuller W. CD4(+) T cell help in cancer immunology and immunotherapy. Nat Rev Immunol. 2018;18(10):635–47.
Guedan S, Ruella M, June CH. Emerging cellular therapies for cancer. Annu Rev Immunol. 2019;37:145–71.
Liu C, Liu R, Wang B, Lian J, Yao Y, Sun H, Zhang C, Fang L, Guan X, Shi J, et al. Blocking IL-17A enhances tumor response to anti-PD-1 immunotherapy in microsatellite stable colorectal cancer. J Immuno Ther Cancer. 2021;9(1):e001895. https://doi.org/10.1136/jitc-2020-001895.
Lin P, Guo YN, Shi L, Li XJ, Yang H, He Y, Li Q, Dang YW, Wei KL, Chen G. Development of a prognostic index based on an immunogenomic landscape analysis of papillary thyroid cancer. Aging. 2019;11(2):480–500.
Sun S, Guo W, Wang Z, Wang X, Zhang G, Zhang H, Li R, Gao Y, Qiu B, Tan F, et al. Development and validation of an immune-related prognostic signature in lung adenocarcinoma. Cancer Med. 2020;9(16):5960–75.
Chen W, Ou M, Tang D, Dai Y, Du W. Identification and validation of immune-related gene prognostic signature for hepatocellular carcinoma. J Immunol Res. 2020;2020:5494858.
Huang Y, Chen L, Tang Z, Min Y, Yu W, Yang G, Zhang L. A novel immune and stroma related prognostic marker for invasive breast cancer in tumor microenvironment: a TCGA based study. Front Endocrinol. 2021;12: 774244.
Zhang L, Li J, Zhang M, Wang L, Yang T, Shao Q, Liang X, Ma M, Zhang N, Jing M, et al. Identification of a six-gene prognostic signature characterized by tumor microenvironment immune profiles in clear cell renal cell carcinoma. Front Genet. 2021;12: 722421.
Goldman M, Craft B, Swatloski T, Cline M, Morozova O, Diekhans M, Haussler D, Zhu J. The UCSC cancer genomics browser: update 2015. Nucleic Acids Res. 2015;43(D1):D812–7. https://doi.org/10.1093/nar/gku1073.
Barrett T, Troup DB, Wilhite SE, Ledoux P, Rudnev D, Evangelista C, Kim IF, Soboleva A, Tomashevsky M, Edgar R. NCBI GEO: mining tens of millions of expression profiles--database and tools update. Nucleic Acids Res. 2007;35(Database):D760–5. https://doi.org/10.1093/nar/gkl887.
Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–40.
Bhattacharya S, Dunn P, Thomas CG, Smith B, Schaefer H, Chen J, Hu Z, Zalocusky KA, Shankar RD, Shen-Orr SS, et al. ImmPort, toward repurposing of open access immunological assay data for translational and clinical research. Sci Data. 2018;5: 180015.
Tianzhi W, Erqiang H, Shuangbin X, Chen M, Guo P, Dai Z, Feng T, Zhou L, et al. clusterProfiler 4.0: a universal enrichment tool for interpreting omics data. Innov. 2021;2(3):100141. https://doi.org/10.1016/j.xinn.2021.100141.
Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28(1):27–30.
Kanehisa M. Toward understanding the origin and evolution of cellular organisms. Protein Sci. 2019;28(11):1947–51.
Kanehisa M, Furumichi M, Sato Y, Ishiguro-Watanabe M, Tanabe M. KEGG: integrating viruses and cellular organisms. Nucleic Acids Res. 2021;49(D1):D545–51.
Gene Ontology C. Gene ontology consortium: going forward. Nucleic Acids Res. 2015;43(D1):D1049–56. https://doi.org/10.1093/nar/gku1179.
Liao Y, Wang J, Jaehnig EJ, Shi Z, Zhang B. WebGestalt 2019: gene set analysis toolkit with revamped UIs and APIs. Nucleic Acids Res. 2019;47(W1):W199–205.
Otasek D, Morris JH, Boucas J, Pico AR, Demchak B. Cytoscape Automation: empowering workflow-based network analysis. Genome Biol. 2019;20(1):185.
Sticht C, De La Torre C, Parveen A, Gretz N. miRWalk: An online resource for prediction of microRNA binding sites. PLoS ONE. 2018;13(10): e0206239.
Cotto KC, Wagner AH, Feng YY, Kiwala S, Coffman AC, Spies G, Wollam A, Spies NC, Griffith OL, Griffith M. DGIdb 3.0: a redesign and expansion of the drug–gene interaction database. Nucleic Acids Res. 2018;46(D1):D1068–73. https://doi.org/10.1093/nar/gkx1143.
Li T, Fan J, Wang B, Traugh N, Chen Q, Liu JS, Li B, Liu XS. TIMER: A web server for comprehensive analysis of tumor-infiltrating immune cells. Cancer Res. 2017;77(21):e108–10.
Mayakonda A, Lin DC, Assenov Y, Plass C, Koeffler HP. Maftools: efficient and comprehensive analysis of somatic variants in cancer. Genome Res. 2018;28(11):1747–56.
Sturm G, Finotello F, List M. Immunedeconv: An R package for unified access to computational methods for estimating immune cell fractions from Bulk RNA-sequencing data. Methods Mol Biol. 2020;2120:223–32.
Sharma P, Hu-Lieskovan S, Wargo JA, Ribas A. Primary, adaptive, and acquired resistance to cancer immunotherapy. Cell. 2017;168(4):707–23.
Zhang Y, Song J, Zhao Z, Yang M, Chen M, Liu C, Ji J, Zhu D. Single-cell transcriptome analysis reveals tumor immune microenvironment heterogenicity and granulocytes enrichment in colorectal cancer liver metastases. Cancer Lett. 2020;470:84–94.
Riley RS, June CH, Langer R, Mitchell MJ. Delivery technologies for cancer immunotherapy. Nat Rev Drug Discov. 2019;18(3):175–96.
Bergman PJ. Cancer immunotherapies. Vet Clin North Am Small Anim Pract. 2019;49(5):881–902.
Niavarani SR, Lawson C, Bakos O, Boudaud M, Batenchuk C, Rouleau S, Tai LH. Lipid accumulation impairs natural killer cell cytotoxicity and tumor control in the postoperative period. BMC Cancer. 2019;19(1):823.
Karin N, Razon H. Chemokines beyond chemo-attraction: CXCL10 and its significant role in cancer and autoimmunity. Cytokine. 2018;109:24–8.
Du Z, Lovly CM. Mechanisms of receptor tyrosine kinase activation in cancer. Mol Cancer. 2018;17(1):58.
Gang W, Yu-Zhu W, Yang Y, Feng S, Xing-Li F, Heng Z. The critical role of calcineurin/NFAT (C/N) pathways and effective antitumor prospect for colorectal cancers. J Cell Biochem. 2019;120(12):19254–73.
Shen T, Yue C, Wang X, Wang Z, Wu Y, Zhao C, Chang P, Sun X, Wang W. NFATc1 promotes epithelial-mesenchymal transition and facilitates colorectal cancer metastasis by targeting SNAI1. Exp Cell Res. 2021;408(1): 112854.
Cho YH, Ro EJ, Yoon JS, Mizutani T, Kang DW, Park JC, Il Kim T, Clevers H, Choi KY. 5-FU promotes stemness of colorectal cancer via p53-mediated WNT/beta-catenin pathway activation. Nat Commun. 2020;11(1):5321.
Yamazaki N, Koga Y, Taniguchi H, Kojima M, Kanemitsu Y, Saito N, Matsumura Y. High expression of miR-181c as a predictive marker of recurrence in stage II colorectal cancer. Oncotarget. 2017;8(4):6970–83.
Coronel-Hernandez J, Lopez-Urrutia E, Contreras-Romero C, Delgado-Waldo I, Figueroa-Gonzalez G, Campos-Parra AD, Salgado-Garcia R, Martinez-Gutierrez A, Rodriguez-Morales M, Jacobo-Herrera N, et al. Cell migration and proliferation are regulated by miR-26a in colorectal cancer via the PTEN-AKT axis. Cancer Cell Int. 2019;19:80.
Li N, Lu B, Luo C, Cai J, Lu M, Zhang Y, Chen H, Dai M. Incidence, mortality, survival, risk factor and screening of colorectal cancer: a comparison among China, Europe, and northern America. Cancer Lett. 2021;522:255–68.
Mauri G, Sartore-Bianchi A, Russo AG, Marsoni S, Bardelli A, Siena S. Early-onset colorectal cancer in young individuals. Mol Oncol. 2019;13(2):109–31.
Picard E, Verschoor CP, Ma GW, Pawelec G. Relationships between immune landscapes, genetic subtypes and responses to immunotherapy in colorectal cancer. Front Immunol. 2020;11:369.
Wei C, Yang C, Wang S, Shi D, Zhang C, Lin X, Liu Q, Dou R, Xiong B. Crosstalk between cancer cells and tumor associated macrophages is required for mesenchymal circulating tumor cell-mediated colorectal cancer metastasis. Mol Cancer. 2019;18(1):64.
Li L, Feng Q, Wang X. PreMSIm: an R package for predicting microsatellite instability from the expression profiling of a gene panel in cancer. Comput Struct Biotechnol J. 2020;18:668–75.
Chen B, Khodadoust MS, Liu CL, Newman AM, Alizadeh AA. Profiling tumor infiltrating immune cells with CIBERSORT. Methods Mol Biol. 2018;1711:243–59.
This work was supported by: Wuxi Traditional Chinese Medicine Hospital Inheritance Studio construction project [2020 No.5].
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional file 1. Fig. S1
: The box diagram of expression of prognostic characteristic IRGs. From A to D, the expression box diagram of FGF2, SCG2, POMC, and TNFRSF19 is shown in the figure. Green is the paracancer tissue, yellow is the cancer tissue, and the line in the middle indicates that they belong to the same sample.
Additional file 2. Fig. S2
: Correlation between IRGs and prognosis from GEO database. The survival curves of FGF2, SCG2, POMC, and TNFRSF19 were shown from A to D. In the figure, red represents high expression group and black represents low expression group.
Additional file 3. Fig. S3
: Validation of immunocorrelation of IRGs from GEO database. From left to right are heat maps of correlations between FGF2, POMC, SCG2, TNFRSF19 and immune cells. The top left corner of each small square in the figure represents significance, and * represents p < 0.05, ** represents p < 0.01. The lower right corner shows correlation, green to red shows significance from negative to positive, and the deeper the correlation coefficient is, the greater the absolute value.
Additional file 4. Fig. S4
: The K-M survival curve of survival analysis adjusting for age and tumor stage. (A) K-M survival curve of patients under 60 years in the high-low risk group. (B) K-M survival curve of patients over 60 years in the high-low risk group. (C) K-M survival curve of patients in stage I-II. (D) K-M survival curve of patients in stage III-IV.
Additional file 5. Fig. S5
: The levels of immune cells infiltration in MSI and MSS groups. T cell Gamma Delta and Neutrophi showed a significant difference in the level of cell infiltration (p < 0.05), while the other cells showed no significant difference in infiltration between the two groups (p > 0.05).
Additional file 6. Table S6
: Details of 1793 different IRGs collected from the IMMPort database.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Wang, S., Cheng, L., Jing, F. et al. Screening and identification of immune-related genes for immunotherapy and prognostic assessment in colorectal cancer patients. BMC Med Genomics 15, 177 (2022). https://doi.org/10.1186/s12920-022-01329-2
- Colorectal cancer
- Immune-related gene
- Prognostic value