- Research article
- Open Access
Identification of macrophage related gene in colorectal cancer patients and their functional roles
BMC Medical Genomics volume 14, Article number: 159 (2021)
Recent scientific research has enabled the identification of macrophages related-genes (MaRG), which play a key role in the control of the immune microenvironment in many human cancers. However, the functional role of MaRGs in human tumors is ill-defined. Herein, we aimed at bioinformatically exploring the molecular signatures of MaRGs in colorectal cancer.
A list of MaRGs was generated and their differential expression was analyzed across multiple datasets downloaded from the publicly available functional genomics database Gene Expression Omnibus. The weighted gene co-expression network analysis (WGCNA) was also applied to identify the partner genes of these MaRGs in colorectal cancer.
After integration of the results from analyses of different datasets, we found that 29 differentially expressed MaRGs (DE-MaRGs) could be considered as CRC-related genes as obtained from the WGCNA analysis. These genes were functionally involved in positive regulation of DNA biosynthetic process and glutathione metabolism. Protein–protein interaction network analysis indicated that PDIA6, PSMA1, PRC1, RRM2, HSP90AB1, CDK4, MCM7, RFC4, and CCT5 were the hub MaRGs. The LASSO approach was used for validating the 29 MaRGs in TCGA-COAD and TCGA-READ data and the results showed that ten among the 29 genes could be considered as MaRGs significantly involved in CRC. The maftools analysis showed that MaRGs were mutated at varying degrees. The nomogram analysis indicated the correlation of these MaRGs with diverse clinical features of CRC patients.
Conclusively, the present disclosed a signature of MaRGs as potential key regulators involved in CRC pathogenesis and progression. These findings contribute not only to the understanding of the molecular mechanism of CRC pathogenesis but also to the development of adequate immunotherapies for CRC patients.
Colorectal cancer is one of the deadliest tumors in the world . The diagnosis of this tumor is often late due to the lack of appropriate screening methods. Also, the lack of global knowledge on the pathogenesis of colorectal cancer limits its effective management, which complicates the treatment options, leaving only surgical intervention, chemotherapy and radiotherapy as essential choices, all of which present a certain degree of side effects . In addition, given the difference in the response of each patient to treatments and the differences in clinical presentation specific to each, a comprehensive study of the mechanisms involved in the pathogenesis and pathophysiology of colorectal cancer is strongly encouraged . This is possible by elucidating the cellular and molecular machinery involved in the pathological process . The immune response of patients plays a major role in the initiation and progression of human cancers . Previous studies and numerous reviews have shown that weakened immune system is often accompanied by exacerbation of tumor progression [6,7,8]. In fact, immune cells have the ability to inhibit tumor growth and progression through a multitude of mechanisms combining the recognition and rejection of cancer cells. Other studies have shown that immune cells infiltrating tumors are of critical relevance in the tumor microenvironment (TME) [9,10,11]. Specifically, scientific research results suggest that cancer cells secrete inflammatory molecules such as cytokines and chemokines, the attraction of which promotes the infiltration of immune cells . Based on this aspect, the researchers embarked on the development of drugs with immunotherapeutic properties. A recent example of such immunotherapies is the development of blockers of the interaction between PD-1 and PD-L1 . However, it should be noted that cancer cells are capable of developing an ability to escape immune system by modulating the metabolism of immune cells such as T cells, macrophages and neutrophils . The escape takes place through the regulation of a number of genes and proteins associated with these immune cells. Therefore, elucidating the molecular mechanisms involved in immune processes is fundamental not only for understanding cancer progression, but also for facilitating drug development and the implementation of appropriate therapeutic strategies.
Macrophages are immune cells that play an important role in antigenic degradation and the presentation of antigens . Macrophages constitute an important class of immune cells in the cancerous microenvironment and their frequency is often associated with unfavorable patient survival . Macrophages are involved in malignant processes such as cell invasion, angiogenesis and metastasis . In colorectal cancer, macrophages play a primary role in liver metastasis . Macrophages associated with tumors represent a regulatory bridge between cancer and the immune system of patients. Studies have shown that macrophages and genes that stimulate macrophages worsen the prognosis and condition of patients [19, 20]. Other studies suggest that macrophages are involved in the killing of immune cells and influence the effectiveness of different treatment strategies for cancers [21, 22]. Some studies, on the other hand, indicated that macrophages have the ability to kill cancer cells [23, 24]; therefore, it is evident that macrophages have both a pro- and anti-cancer properties. Previous studies have shown that CD68+-type macrophages associated with tumors have the potential to become a prognostic indicator for colorectal cancer [25, 26]. The multifunctional nature of macrophages in cancer pathogenesis and development could be attributed to the diverse regulatory roles of macrophage-related genes (MaRGs). Representative MaRGs include IL-6 (Interleukin 6), IL-8 (Interleukin 8), CD80 (Cluster of differentiation 80), and PIM1 (Pim-1 Proto-Oncogene, Serine/Threonine Kinase) . IL-6 was previously proposed as a preoperative serum marker for predicting colon cancer prognosis ; it is also involved in promoting the stemness of colon cancer by provoking inflammation . IL-8 was found significantly associated with the tumor size, stage and liver metastasis of colon cancer , which might be ascribed to its antigenic property that promotes colon cancer metastasis and supports tumor growth . CD80 is recognized as an important co-stimulatory molecule responsible for eradication of tumor cells; in colon cancer, CD80 coordinates the immune surveillance for precancerous lesions . PMI1 is a well-established oncogene whose overexpression in colon cancer could counteract the deprivation of glucose by triggering a compensatory Warburg effect, conferring colon cancer with survival advantages under metabolic stress . The above findings suggest a sophisticated interplay between MaRGs and colon cancer that determine the progression and prognostic outcomes of colon cancer; however, there are still considerable MaRGs and the corresponding mechanisms still remain to be uncovered in the context of colon cancer. In addition, research on the interactions between macrophages and cancer cells that would allow the discovery of new genetic signatures of macrophages as prognostic and therapeutic markers is lacking with regard to CRC. In this regard, further investigation on multifarious facets of MaRGs is required.
Bioinformatics is a discipline involving the use of computational tools for the in-silico analysis of biological data. The advances in genomics and transcriptomics have accelerated the development of this field which has allowed the extraction of valuable biological information from experimentally-derived data or publicly available data. This technique has been applied for retrieving significant data for various diseases including diabetes, neurodegenerative diseases, cardiovascular diseases and cancers . However, bioinformatical analysis of the implications of MaRGs in colorectal cancer has not been reported so far.
Thus, in the present study, we set out to analyze the importance of MaRGs in colorectal cancer using bioinformatics tools based on publicly available data.
GeneCard (https://www.genecards.org/) is a comprehensive human gene database that contains all annotated and predicted human genetic information. We downloaded 119,923 macrophage-related genes (MaRGs) based on the keyword "macrophage" from GeneCard. The corresponding information of MaRGs is available in Additional file 6: Table S1. We screened all microarray datasets related to CRC in the Gene Expression Omnibus database (GEO, https://www.ncbi.nlm.nih.gov/geo/). The keywords “colorectal cancer” and “gene expression profiling” were used to query the datasets from the GEO database. The datasets conform to the following criteria were reserved: (I) the organism was “Homo sapiens”; (II) the experiment types were “Expression profiling by array”; (III) the dataset contained both tumor and control samples; (IV) the annotation of the gene probes were completed; (V) the number of normal or control samples was larger than 5. Finally, we selected five microarray datasets including 135 normal samples and 167 tumor samples, and the detailed information can be accessed in Table 1.
In addition, according to the keywords "TCGA-COAD" and "TCGA-READ", we downloaded two datasets from The Cancer Genome Atlas Program (TCGA, https://www.cancer.gov/about-nci/organization/ccg/research/structural-genomics/tcga). The TCGA-COAD dataset contains 20 normal samples and 436 tumor samples and TCGA-READ dataset contains 6 normal samples and 161 tumor samples, which were used to validate the efficacy of genes.
Identification of differentially expressed MaRGs (DE-MaRGs)
When a gene symbol was mapped to multiple probes, the average expression of this gene was preserved whereas genes containing missing values (or zero values) were removed. The R “limma” package  was used for data preprocessing and differential expression analysis. Quantile method for data normalization and logarithmic transformation for data scaling was performed in R. We screened out differentially expressed genes (DEGs) from each dataset with a threshold of |logFC|> 0.263 and P-value < 0.05, and obtained common DEGs that were up-regulated and down-regulated in the 5 datasets through merging. Then weighted gene co-expression network analysis (WGCNA) was performed to analyze the relationship between gene co-expression modules and clinical traits in each dataset. The correlation between CRC and modules was calculated by the Pearson Correlation Coefficient (PCC) between eigengenes per module and CRC status. The two modules with the highest correlation (positive correlation and negative correlation) with CRC status of each dataset were retained. CRC-related genes (CrRGs) were obtained by the intersection of the genes of modules most significantly associated with CRC after WGCNA analysis of the five datasets. Finally, we combined the MaRGs, DEGs, and CrRGs, and obtained DE-MaRGs that may be potential biomarkers in CRC. The R “WGCNA” package  was used to perform WGCNA in this study.
Functional enrichment analysis of DE-MaRGs
To reveal the biological functions of the DE-MaRGs, we performed the Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis via the R “clusterProfiler” package . The terms with adjusted P-value 0.05 were considered significant and the top 20 terms were visualized via R “ggplot2” package .
Protein–protein interaction (PPI) network construction and identification of hub DE-MaRGs
The Search Tool for The Retrieval of Interacting Genes/Proteins (STRING, https://string-db.org/cgi/) is a database of known and predicted protein–protein interactions. Herein, we uploaded DE-MaRGs to the STRING database and selected the organism "Homo sapiens". TSV file was downloaded from the STRING database and used as network input for visualization of the PPI network in the Cytoscape software 3.7.2. We identified the hub DE-MaRGs in the PPI network via the Cytoscape plugin MCODE with parameters as degree cut-off ≥ 2, node score cut-off ≥ 0.2, k-core ≥ 2, and max depth = 100. Then, we analyzed the types of mutations and the mutation rate that may exist in the key DE-MaRGs using the R Maftools package based on TCGA- COAD and TCGA- READ mutation data information. In this study, COAD and READ mutation information was downloaded from the TCGA database with the key word “TCGA-COAD” or “TCGA- READ”, “Somatic Variant Aggregation and Masking”, and “Simple nucleotide variation”.
Identification of the key DE-MaRGs
Based on the TCGA-COAD and TCGA-READ datasets, we used the least absolute shrinkage and selection operator (LASSO) to obtain the key DE-MaRGs. The R “glmnet” package  was used to implement LASSO analysis. Since the normal samples were too small compared to tumor samples, it may not be statistically valid to use two full TCGA datasets for validation. Herein, we performed LASSO analysis repeatedly on a random subset of tumor samples and all of the normal samples (with tumor-normal ratio of 3:1) and then the overlapped results were used as valid results. The time of LASSO analyses was set as 1000. DE-MaRGs were counted every time they were identified by LASSO analysis, and the eight DE-MaRGs with the highest cumulative number were selected as candidate genes. The key DE-MaRGs were obtained by the intersection of the candidate genes identified from the TCGA-COAD and TCGA-READ dataset.
To investigate the prognosis effects of the key DE-MaRGs, we performed survival analysis. The normal samples in the TCGA-COAD and TCGA-READ datasets were excluded, and the relationship between the key prognosis DE-MaRGs and the survival time of CRC patients was analyzed using the R "survival" package (https://cran.r-project.org/web/packages/survival/index.html). Kaplan–Meier Curve analysis was used to obtain overall survival (OS) and disease-free survival (RFS) of CRC patients. Based on the different tumor stages of CRC, we obtained the relationship between the expression of the key DE-MaRGs and the tumor stage. In order to verify the mutation information of key DE-MaRGs, we constructed a CRC mutation prediction model. The mutation data were downloaded as described above and using the R "Maftools" package (https://cran.r-project.org/web/packages/maptools/index.html), we analyzed the types of mutations and the mutation rate of the key DE-MaRGs based on the mutation data.
Evaluation of clinical independence and construction of the nomogram
We deleted CRC samples with missing clinical information, including survival status, time, tumor stage, age, sex, and weight, from the TCGA-COAD and TCGA-READ datasets. R "rms" package was used to construct univariate and multivariate Logistic Regression analyses for CRC clinical information. Then we constructed the Cox risk model using the R "survival" package and R "rms" package (https://cran.r-project.org/web/packages/rms/index.html). Finally, we used the R "rms" package to integrate clinical information for nomogram construction. Ten key DE-MaRGs model was constructed based on TCGA-COAD and TCGA-READ datasets according to gene expression profiles. Moreover, ROC curves were used to estimate the prediction capabilities of the key DE-MaRGs and were implemented by R “ROCR” package.
Identification and functional enrichment of the DE-MaRGs in five CRC datasets
To identify the DE-MaRGs, we performed differential expression analysis based on five CRC-related datasets from the GEO database. The volcano plot for DEGs in the five datasets and heatmap of the top 10 up-regulated DEGs and down-regulated DEGs in the five datasets were shown in Fig. 1 and Additional file 1: Figure S1, respectively. After integrating the five CRC datasets, we obtained a total of 29 up-regulated CRC DEGs and two down-regulated CRC DEGs. Since the GSE156355 dataset did not conform to a scale-free distribution, we only analyzed the remaining four datasets separately with the WGCNA approach (Additional files 2, 3, 4, 5: Figs. S2–S5). The module-traits relationship heatmap of the four datasets was shown in Fig. 2a–d. A total of 8 modules related to CRC were screened, and 25,470 of CRC-related genes (CrRGs) were obtained after merging these modules (Fig. 2e). According to the keyword "macrophage", we downloaded a list of 119,923 MaRGs in GeneCard. After the intersection of MaRGs, DEGs, and CrRGs, 29 DE-MaRGs were obtained (Fig. 3a), and the heatmap of the DE-MaRGs in the five datasets based on log2FC was shown in Fig. 3b.
GO and KEGG analyses were used to reveal the biological functions of the 29 DE-MaRGs. We found that the 29 DE-MaRGs were mainly enriched in positive regulation of DNA biosynthetic process, telomere maintenance, cellular modified amino acid metabolic process, chaperone complex, chaperon-containing T-complex, methylome. Moreover, glutathione metabolism and DNA replication were considered as the significant pathway of 29 DE-MaRGs via KEGG pathway analysis. The bubble charts of GO and KEGG are shown in Fig. 3c–f.
The PPI network of 29 DE-MaRGs and hub gene mutation analysis
Through the STRING website, we constructed a PPI network composed of 29 DE-MaRGs-encoded proteins. After removing the proteins that were not connected to the network, the final PPI network contained 27 nodes and 130 edges (Fig. 4a). The top 18 hub proteins with the highest degrees were RFC4, PRC1, SNRPB, ACLY, ATIC, CCT7, CCT5, SMS, HSP90AB1, PSMA1, RRM2, MCM7, WDR77, CDK4, NUP155, NONO, HMGB3, and CDC25B. More detailed information can be found in the Additional file 7: Table S2. The mutation analysis results (Fig. 4b) showed that ACLY, ATIC, MCM7, and CDC25B were the genes with the highest mutation frequency in TCGA-COAD mutation data. Moreover, NUP155 RFC4 presented multiple mutations in one sample. Similarly, we also found that the mutation frequency of PRC1 was highest in the TCGA-READ mutation data. However, there was no tendency for ACLY, CCT5, SMS, RRM2, WDR77, and CDC25B to mutate in the TCGA-READ mutation data (Fig. 4c).
Identification of the key DE-MaRGs
LASSO analysis was used to screen out the key DE-MaRGs from the 29 DE-MaRGs using R "glmnet" package (https://cran.r-project.org/web/packages/glmnet/index.html). With a 1000-times resampling and training set: testing set ratio of 3:1, the LASSO validation selected 8 candidate genes (SUPT16H, ENC1, PSMA1, HSP90AB1, PRC1, WDR77, AATF, and NUP155) from the TCGA-COAD dataset, and another 8 candidate genes (SUPT16H, ENC1, PSMA1, ATIC, PRC1, WDR77, NUP155, and NIT2) from TCGA-READ dataset. After deduplication of the two sets of candidate genes, ten key DE-MaRGs (SUPT16H, ENC1, PSMA1, ATIC, PRC1, WDR77, NUP155, NIT2, HSP90AB1, and AATF) were finally obtained. The information of the total count of each key DE-MaRG (number of times it had a non-zero coefficient) in 1000-times LASSO selection was provided in Additional file 8: Table S3.
The mutation information of ten key DE-MaRGs was predicted by the CRC simple nucleotide variation data downloaded from the TCGA database. Figure 5a showed that ENC1, SUPT16H, and ATIC were the top mutant genes in the TCGA-COAD mutation dataset. The mutation types of these genes were mainly missense mutations. In the TCGA-READ mutation dataset, the mutation types of DE-MaRGs were more diverse. Figure 5b showed that the mutation type of PSMA1 was splice site, and SUPT16H, PRC1, and NUP155 were the top-ranked mutant genes. In addition, SUPT16H was considered to have multiple mutations in the same sample, and its mutation frequency reached 2%. Except for NIT2 (P-value < 0.05) and ATIC (P-value < 0.05) (Fig. 5c, d), no difference was found between the results from the survival status, survival time, and KM curve analysis of the high expression status subgroup and low expression status group in key DEG-MaRGs. ROC curve analysis (Fig. 5e, f) revealed that the area under the ROC curve (AUC) of the ten key DE-MaRGs models were close to 0.5.
Nomogram building and validation
According to the patient's clinical information, we constructed a comprehensive prognostic array map based on the entire TCGA data set to assess the probability of survival of CRC patients within 3 and 5 years. Six clinical features, including survival status, time, tumor stage, age, sex, and weight, were included in the nomogram analysis (Fig. 6a, 7a). In the TCGA cohort, in terms of 3-year and 5-year survival rates, the calibration chart showed good agreement between nomogram predictions and actual observations (Fig. 6b, c, 7b, c).
Macrophage is a type of innate immune cell that plays an important role in host defense and inflammation. They are highly plastic and can be polarized into subtypes with different functions in different pathological environments. Tumor-associated macrophages (TAMs) are abundant in the TME, and they play an important role in promoting the growth of various tumors . It is worth noting that different types of TAM have different effects on the TME of CRC. For example, some macrophages can promote tumor formation, while others inhibit tumor formation. High macrophage infiltration is believed to be related to the prognosis of tumors . Although macrophage is one of the most common cells in the microenvironment of colorectal cancer, their prognostic role in tumors is not fully understood . The differentiation and activation of macrophages require regulation of gene expression, which is subject to the interaction of many factors, including transcription factors and epigenetic modifications . For example, there are differences of the human c-fes gene and murinespi-1 (PU.1) gene in constitutive and inducible gene expression in macrophages . In solid tumors, many macrophages and other immune cells constitute the TME . The TME changes the malignancy of tumors. Studies have shown that macrophages stimulate tumor cell migration, invasion, vascular invasion and strengthen blood vessels to promote the development of tumors in the direction of malignant tumors [46, 47]. Generally, the expression level of genes is specific in TAMs and tumor cells. However, the expression of some genes is consistent in tumor cells and TAMs. Myeloma cells can secrete vascular endothelial growth factor A (VEGFA) to stimulate angiogenesis . A recent study reported that M2 macrophages and RPMI 8226 cells can synergistically promote the proliferation, migration and tube formation of human umbilical vein endothelial cells (HUVEC), and the consumption of VEGFA in both cell types can inhibit the tube formation ability of HUVEC . In addition, representative molecular markers for macrophage such IL-6, IL-8, CD80, and PIM1 were closely associated with CRC [28,29,30,31,32,33]. Therefore, we speculate that there are other unidentified macrophage-related genes that bear the potential to regulate the occurrence and metastasis of CRC. In this study, we integrated five CRC transcriptome datasets, including 302 samples, and screened the mMaRGs in CRC. Through differential expression analysis, we obtained 31 DEGs from the five datasets, 29 of which were up-regulated in the five datasets while two of them were down-regulated. We identified 25,470 CrRGs from the four datasets (GSE23194, GSE32323, GSE37182, and GSE103512) via WGCNA analysis. Then 119,923 MaRGs were obtained from GeneCard with the search keyword “macrophage”. Finally, we obtained 29 DE-MaRGs via merging DEGs, CrRGs and MaRGs.
At present, the role of macrophages in the pathogenesis of CRC is still unclear. At the molecular level, the activation of oncogenes and the inactivation of tumor suppressor genes are related to the occurrence of CRC . At the cellular level, macrophages in the TME may adopt different polarization states, thereby affecting the occurrence of CRC. The regulation and expression of genes promote the differentiation and activation of macrophages, which is affected by the interaction of many factors, including transcription factors and epigenetic modifications . Therefore, it is very meaningful to study the potential biological functions and pathways of the genes associated with macrophages in CRC. We found that the main biological pathways of the 29 DE-MaRGs enrichment were DNA biosynthetic process, telomere maintenance, cellular modified amino acid metabolic process, chaperone complex. Telomere maintenance is an important sign of cancer. Continuous classification of cells can cause telomere shortening, but tumor cells can use telomere maintenance mechanisms to avoid this phenomenon. The presence of TAMs will shorten the survival time of patients. A previous study pointed out that most tumors with uncertain mechanisms of telomere maintenance have a large number of TAMs , which is consistent with our finding that these DE-MaRGs were enriched in the telomere maintenance. Activated macrophages can be divided into M1 macrophages and M2 macrophages. Our finding demonstrated that the cellular modified amino acid metabolic process was a significant pathway for DE-MaRGs, which can explain the M2 polarization defect and the enhancement of M1 polarization caused by Lamtor1 deficiency, amino acid starvation, and mTOR inhibition . Intracellular macrophage migration inhibitory factor (MIF) usually becomes stable in human cancer cells. MIF can promote tumor cell survival. We found that DE-MaRGs were enriched in the chaperone complex, which is consistent with a previous study reporting that tumor-activated HSP90 chaperone complex can protect MIF from degradation . It is worth noting that molecular chaperone proteins are pleiotropic signals of many kinds of cells. Henderson and colleagues  put forward a hypothesis by comparing the literature, that is, animal molecular chaperones can induce a variety of macrophage activation states. We found that glutathione metabolism was an important pathway for DE-MaRGs. The oxidative state of cells is one of the key factors that mediate apoptosis, and glutathione plays a vital role in mediating cell apoptosis through NO* and reactive oxygen species (ROS). Thus, our findings can explain why glutathione levels determine apoptosis in macrophages . In another study, glutathione was considered as a significant protective component against NO cytotoxicity on macrophages . In conclusion, our findings can provide new ideas on the functions and pathways of the genes related to the macrophages and CRC.
To better understand the pathogenesis mechanisms of CRC and the related function of macrophages, we studied the interaction of proteins encoded by DE-MaRGs. Here, we used PPI analysis to construct a network of 22 nodes and 48 edges. After Cytoscape MCODE analysis, we finally obtained nine hub DE-MaRGs, including PDIA6, PSMA1, PRC1, RRM2, HSP90AB1, CDK4, MCM7, RFC4, and CCT5. The mutation results showed that MCM7, HSP90AB1, and RFC4 were the genes with the highest mutation frequency. A previous study demonstrated that RFC4 is overexpressed in CRC and is associated with tumor progression and poor survival results , which is consistent with the results found in this study that RFC4 was jointly up-regulated in the five datasets. RFC is involved in DNA replication as a clamp loader and is regulated in a series of cancers. According to the results of Cytoscape MCODE analysis, we found that PDIA6 may play an important role in macrophages. A study reported that oxysterol loaded in THP-1 macrophages caused a decrease in the abundance of proteins related to cell death or cell life, including PDIA6 , which is consistent with the findings of the present study. At present, there is no sufficient evidence that PRC1 is directly related to macrophages, but a previous study reported that PRC1 had a regulatory role in immune evasion and angiogenesis . Recent studies have shown that overexpression of PRC1 may promote the formation of various tumors, including ovarian cancer  and colorectal cancer . The expression of RRM2 and p53R2 is related to the malignancy and progression of several types of tumors. Overexpression of RRM2 was thought to be useful for predicting metastasis and disease prognosis . Similarly, the ribonucleotide reductase subunit RRM2B was considered to be associated with advanced stage III-IV tumors that have better survival than early stage I-II tumors, and its expression was associated with better survival prognosis in CRC patients . However, the mechanism of how macrophages regulate the occurrence and metastasis of CRC through RRM2B is still unclear. We suggested that CDK4 may be indirectly involved in the pathway through which macrophages promote the formation of CRC. CDK4 is a type of cyclin-dependent kinase, its inhibitor gene p16 (INK46a) can inhibit rheumatoid arthritis in synovial tissue, in which macrophages are the main source of inflammatory cytokines . CDK4 is the basic driving factor of the cell cycle and is essential in the initiation and development of various malignant tumors. A previous study reported that selective CDK4 inhibitors can induce tumor cell cycle arrest and promote anti-tumor immunity. Therefore, macrophages may participate in the pathogenesis and development of CRC by indirectly regulating the expression of CDK4 . Although we have discussed some genes related to macrophages and CRC, there are still many genes that have not yet been reported. In the future, more in vitro and in vivo experiments are needed to verify the role of these genes in macrophages in the occurrence of CRC.
In order to further study the prognostic role of the 29 DE-MaRGs in CRC, we used LASSO analysis to select 10 key DE-MaRGs from the 29 DE-MaRGs. Among the key DE-MaRGs, we found that the expression levels of NIT2 (P-value < 0.05) and ATIC (P-value < 0.05) were related to the prognosis of CRC via survival analysis. A previous study indicated that the down-regulation of NIT2 inhibited the proliferation of colon cancer cells through the caspase-3 and PARP pathways, and induced cell cycle arrest . Another research team gave a similar finding that NIT1 inhibited the growth of CRC through the positive feedback formed by NIT1 and the activation of the TGFβ-Smad signaling pathway . Therefore, we anticipate that a low level of NIT2 may be associated with a better CRC prognosis. In addition, our study found that the expression level of NIT2 was up-regulated in the five CRC datasets, indicating that the high expression level of NIT2 may be related to the occurrence of CRC. As far as we know, we found for the first time that low level of ATIC was associated with better CRC prognosis, which is consistent with the increase of ATIC in the five CRC datasets. The mechanism of how ATIC is involved in CRC and the correlation between ATIC and macrophages deserves more in-depth research.
Although we have discovered some key DE-MaRGs and discussed how these genes participate in the formation and development of CRC, there are still some genes that have not been reported, and more in vivo and in vitro experiments are needed to verify their functions. We have discovered several possible pathways in which DE-MaRGs participate; this can provide new ideas for understanding the pathways via which macrophages may participate in CRC. However, the present study was only based on the computational analysis of biological information, and more studies are needed to verify our findings in the future. Moreover, we evaluated the predictive effects of ten key DE-MaRGs models and constructed the nomogram of CRC using the TCGA dataset. Due to the limited number of samples of the dataset we used, these CRC prognostic models may have more room for improvement in the future. There are few published macrophage microarray data, thus we used CRC datasets to explore the role of MaRGs in CRC pathogenesis, which can explain the specific role of these genes in CRC and indirectly give insights on the role of macrophages in CRC. In the future, we will collect macrophages associated with CRC for single-cell sequencing to get insight into the specific molecular mechanism of macrophages in the occurrence and development of CRC.
We obtained 29 DE-MaRGs associated with macrophages and CRC from five CRC datasets through a comprehensive analysis method. These genes may directly or indirectly participate in the occurrence and development of CRC through telomere maintenance, cellular modified amino acid metabolic process, chaperone complex, etc. Among these genes, NIT2 and ATIC were considered to be related to the prognosis of CRC (P-value < 0.05). In the future, we will conduct in vivo and in vitro experiments to verify the role of these genes in macrophages and CRC. Our research provides a new direction for understanding the biological functions of the genes related to both macrophages and CRC, and provide more diverse options for the prognosis of CRC.
Availability of data and materials
The datasets GSE156355, GSE23194, GSE32323, GSE37182 and GSE103512 analyzed during the current study are available in the Gene Expression Omnibus repository. Persistent web links to these datasets are as follow: GSE156355: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE156355; GSE23194: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE23194; GSE32323: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE32323; GSE37182: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE37182; GSE103512: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE103512.
Gene Expression Omnibus
Weighted gene co-expression network analysis
CRC related genes
Differentially expressed genes
Kyoto Encyclopedia of Genes and Genes
Least absolute shringkage and selection operator
Rawla P, Sunkara T, Barsouk A. Epidemiology of colorectal cancer: incidence, mortality, survival, and risk factors. Przeglad gastroenterologiczny. 2019;14(2):89–103.
Dekker E, Tanis PJ, Vleugels JLA, Kasi PM, Wallace MB. Colorectal cancer. Lancet. 2019;394(10207):1467–80.
Cappell MS. The pathophysiology, clinical presentation, and diagnosis of colon cancer and adenomatous polyps. Med Clin N Am. 2005;89(1):1–42, vii.
Ziapour P, Ataee R, Shadifar M, Vaillancourt C, Ahmadi A, Jafari-Sabet M, et al. New intracellular and molecular aspects in pathophysiology of colorectal cancer. Gastroenterol Hepatol Bed Bench. 2011;4(2):43–52.
Gonzalez H, Hagerling C, Werb Z. Roles of the immune system in cancer: from tumor initiation to metastatic progression. Genes Dev. 2018;32(19–20):1267–84.
Pandya PH, Murray ME, Pollok KE, Renbarger JL. The immune system in cancer pathogenesis: potential therapeutic approaches. J Immunol Res. 2016;2016:4273943.
Mallet DG, De Pillis LG. A cellular automata model of tumor-immune system interactions. J Theor Biol. 2006;239(3):334–50.
Waldner M, Schimanski CC, Neurath MF. Colon cancer and the immune system: the role of tumor invading T cells. World J Gastroenterol. 2006;12(45):7233–8.
Galli F, Aguilera JV, Palermo B, Markovic SN, Nisticò P, Signore A. Relevance of immune cell and tumor microenvironment imaging in the new era of immunotherapy. J Exp Clin Cancer Res. 2020;39(1):89.
Gajewski TF, Schreiber H, Fu YX. Innate and adaptive immune cells in the tumor microenvironment. Nat Immunol. 2013;14(10):1014–22.
Lei X, Lei Y, Li JK, Du WX, Li RG, Yang J, et al. Immune cells within the tumor microenvironment: biological functions and roles in cancer immunotherapy. Cancer Lett. 2020;470:126–33.
Grivennikov SI, Greten FR, Karin M. Immunity, inflammation, and cancer. Cell. 2010;140(6):883–99.
Nowicki TS, Hu-Lieskovan S, Ribas A. Mechanisms of Resistance to PD-1 and PD-L1 Blockade. Cancer J (Sudbury, Mass). 2018;24(1):47–53.
Gun SY, Lee SWL, Sieow JL, Wong SC. Targeting immune cells for cancer therapy. Redox Biol. 2019;25:101174.
Mantegazza AR, Magalhaes JG, Amigorena S, Marks MS. Presentation of phagocytosed antigens by MHC class I and II. Traffic (Copenhagen, Denmark). 2013;14(2):135–52.
Nielsen SR, Schmid MC. Macrophages as Key Drivers of Cancer Progression and Metastasis. Mediat Inflamm. 2017;2017:9624760.
Dandekar RC, Kingaonkar AV, Dhabekar GS. Role of macrophages in malignancy. Ann Maxillofac Surg. 2011;1(2):150–4.
Cortese N, Soldani C, Franceschini B, Barbagallo M, Marchesi F, Torzilli G, et al. Macrophages in Colorectal Cancer Liver Metastases. Cancers. 2019;11(5):633.
Poh AR, Ernst M. Targeting Macrophages in Cancer: From Bench to Bedside. Front Oncol. 2018;8:49.
Larionova I, Kazakova E, Patysheva M, Kzhyshkowska J. Transcriptional, epigenetic and metabolic programming of tumor-associated macrophages. Cancers. 2020;12(6):1411.
Hirayama D, Iida T, Nakase H. The phagocytic function of macrophage-enforcing innate immunity and tissue homeostasis. Int J Mol Sci. 2017;19(1):92.
Klöditz K, Fadeel B. Three cell deaths and a funeral: macrophage clearance of cells undergoing distinct modes of cell death. Cell Death Discov. 2019;5:65.
Guerriero JL. Macrophages: the road less traveled, changing anticancer therapy. Trends Mol Med. 2018;24(5):472–89.
Mills CD, Lenz LL, Harris RA. A breakthrough: macrophage-directed cancer immunotherapy. Can Res. 2016;76(3):513–6.
Feng Q, Chang W, Mao Y, He G, Zheng P, Tang W, et al. Tumor-associated macrophages as prognostic and predictive biomarkers for postoperative adjuvant chemotherapy in patients with stage II colon cancer. Clin Cancer Res. 2019;25(13):3896–907.
Pinto ML, Rios E, Durães C, Ribeiro R, Machado JC, Mantovani A, et al. The two faces of tumor-associated macrophages and their clinical significance in colorectal cancer. Front Immunol. 2019;10:1875.
Zhuang G, Zeng Y, Tang Q, He Q, Luo G. Identifying MI macrophage-related genes through a co-expression network to construct a four-gene risk-scoring model for predicting thyroid cancer prognosis. Front Genet. 2020;11:591079.
Shiga K, Hara M, Nagasaki T, Sato T, Takahashi H, Sato M, et al. Preoperative serum interleukin-6 is a potential prognostic factor for colorectal cancer, including stage II patients. Gastroenterol Res Pract. 2016;2016:9701574.
Wang T, Song P, Zhong T, Wang X, Xiang X, Liu Q, et al. The inflammatory cytokine IL-6 induces FRA1 deacetylation promoting colorectal cancer stem-like properties. Oncogene. 2019;38(25):4932–47.
Terada H, Urano T, Konno H. Association of interleukin-8 and plasminogen activator system in the progression of colorectal cancer. Eur Surg Res. 2005;37(3):166–72.
Heidemann J, Ogawa H, Dwinell MB, Rafiee P, Maaser C, Gockel HR, et al. Angiogenic effects of interleukin 8 (CXCL8) in human intestinal microvascular endothelial cells are mediated by CXCR2. J Biol Chem. 2003;278(10):8508–15.
Marchiori C, Scarpa M, Kotsafti A, Morgan S, Fassan M, Guzzardo V, et al. Epithelial CD80 promotes immune surveillance of colonic preneoplastic lesions and its expression is increased by oxidative stress through STAT3 in colon cancer cells. J Exp Clin Cancer Res. 2019;38(1):190.
Zhang M, Liu T, Sun H, Weng W, Zhang Q, Liu C, et al. Pim1 supports human colorectal cancer growth during glucose deprivation by enhancing the Warburg effect. Cancer Sci. 2018;109(5):1468–79.
Strianese O, Rizzo F, Ciccarelli M, Galasso G, D’Agostino Y, Salvati A, et al. Precision and personalized medicine: how genomic approach improves the management of cardiovascular and neurodegenerative disease. Genes. 2020;11(7):747.
Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucl Acids Res. 2015;43(7):e47-e.
Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9(1):559.
Yu G, Wang L-G, Han Y, He Q-Y. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012;16(5):284–7.
Wickham H. ggplot2: elegant graphics for data analysis. New York: Springer; 2009.
Friedman J, Hastie T, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. J Stat Softw. 2010;33(1):1.
Zhong X, Chen B, Yang Z. The role of tumor-associated macrophages in colorectal carcinoma progression. Cell Physiol Biochem. 2018;45(1):356–65.
Forssell J, Oberg A, Henriksson ML, Stenling R, Jung A, Palmqvist R. High macrophage infiltration along the tumor front correlates with improved survival in colon cancer. Clin Cancer Res. 2007;13(5):1472–9.
Väyrynen JP, Haruki K, Lau MC, Väyrynen SA, Zhong R, Dias Costa A, et al. The prognostic role of macrophage polarization in the colorectal cancer microenvironment. Cancer Immunol Res. 2020;9:8–19.
Chen S, Yang J, Wei Y, Wei X. Epigenetic regulation of macrophages: from homeostasis maintenance to host defense. Cell Mol Immunol. 2020;17(1):36–49.
Greaves DR, Gordon S. Macrophage-specific gene expression: current paradigms and future challenges. Int J Hematol. 2002;76(1):6–15.
Vitale I, Manic G, Coussens LM, Kroemer G, Galluzzi L. Macrophages and metabolism in the tumor microenvironment. Cell Metab. 2019;30(1):36–50.
Condeelis J, Pollard JW. Macrophages: obligate partners for tumor cell migration, invasion, and metastasis. Cell. 2006;124(2):263–6.
Wenes M, Shang M, Di Matteo M, Goveia J, Martín-Pérez R, Serneels J, et al. Macrophage metabolism controls tumor blood vessel morphogenesis and metastasis. Cell Metab. 2016;24(5):701–15.
Kumar S, Witzig TE, Timm M, Haug J, Wellik L, Fonseca R, et al. Expression of VEGF and its receptors by myeloma cells. Leukemia. 2003;17(10):2025–31.
Sun M, Qiu S, Xiao Q, Wang T, Tian X, Chen C, et al. Synergistic effects of multiple myeloma cells and tumor-associated macrophages on vascular endothelial cells in vitro. Med Oncol (Northwood, Lond, Engl). 2020;37(11):99.
Forrester K, Almoguera C, Han K, Grizzle WE, Perucho M. Detection of high incidence of K-ras oncogenes during human colon tumorigenesis. Nature. 1987;327(6120):298–303.
Hung NA, Eiholzer RA, Kirs S, Zhou J, Ward-Hartstonge K, Wiles AK, et al. Telomere profiles and tumor-associated macrophages with different immune signatures affect prognosis in glioblastoma. Mod Pathol. 2016;29(3):212–26.
Kimura T, Nada S, Takegahara N, Okuno T, Nojima S, Kang S, et al. Polarization of M2 macrophages requires Lamtor1 that integrates cytokine and amino-acid signals. Nat Commun. 2016;7:13130.
Schulz R, Marchenko ND, Holembowski L, Fingerle-Rowson G, Pesic M, Zender L, et al. Inhibiting the HSP90 chaperone destabilizes macrophage migration inhibitory factor and thereby inhibits breast tumor progression. J Exp Med. 2012;209(2):275–89.
Henderson B, Henderson S. Unfolding the relationship between secreted molecular chaperones and macrophage activation states. Cell Stress Chaperones. 2009;14(4):329–41.
Boggs SE, McCormick TS, Lapetina EG. Glutathione levels determine apoptosis in macrophages. Biochem Biophys Res Commun. 1998;247(2):229–33.
Romão PR, Fonseca SG, Hothersall JS, Noronha-Dutra AA, Ferreira SH, Cunha FQ. Glutathione protects macrophages and Leishmania major against nitric oxide-mediated cytotoxicity. Parasitology. 1999;118(Pt 6):559–66.
Xiang J, Fang L, Luo Y, Yang Z, Liao Y, Cui J, et al. Levels of human replication factor C4, a clamp loader, correlate with tumor progression and predict the prognosis for colorectal cancer. J Transl Med. 2014;12:320.
Ward LJ, Ljunggren SA, Karlsson H, Li W, Yuan XM. Exposure to atheroma-relevant 7-oxysterols causes proteomic alterations in cell death, cellular longevity, and lipid metabolism in THP-1 macrophages. PLoS ONE. 2017;12(3):e0174475.
Su W, Han HH, Wang Y, Zhang B, Zhou B, Cheng Y, et al. The polycomb repressor complex 1 drives double-negative prostate cancer metastasis by coordinating stemness and immune suppression. Cancer Cell. 2019;36(2):139-55.e10.
Bu H, Li Y, Jin C, Yu H, Wang X, Chen J, et al. Overexpression of PRC1 indicates a poor prognosis in ovarian cancer. Int J Oncol. 2020;56(3):685–96.
Benoit YD, Witherspoon MS, Laursen KB, Guezguez A, Beauséjour M, Beaulieu JF, et al. Pharmacological inhibition of polycomb repressive complex-2 activity induces apoptosis in human colon cancer stem cells. Exp Cell Res. 2013;319(10):1463–70.
Chang CC, Lin CC, Wang CH, Huang CC, Ke TW, Wei PL, et al. miR-211 regulates the expression of RRM2 in tumoral metastasis and recurrence in colorectal cancer patients with a k-ras gene mutation. Oncol Lett. 2018;15(5):8107–17.
Liu X, Lai L, Wang X, Xue L, Leora S, Wu J, et al. Ribonucleotide reductase small subunit M2B prognoses better survival in colorectal cancer. Can Res. 2011;71(9):3202–13.
Murakami Y, Mizoguchi F, Saito T, Miyasaka N, Kohsaka H. p16(INK4a) exerts an anti-inflammatory effect through accelerated IRAK1 degradation in macrophages. J Immunol (Baltimore, Md: 1950). 2012;189(10):5066–72.
Goel S, DeCristo MJ, Watt AC, BrinJones H, Sceneay J, Li BB, et al. CDK4/6 inhibition triggers anti-tumour immunity. Nature. 2017;548(7668):471–5.
Zheng B, Chai R, Yu X. Downregulation of NIT2 inhibits colon cancer cell proliferation and induces cell cycle arrest through the caspase-3 and PARP pathways. Int J Mol Med. 2015;35(5):1317–22.
Lin C, Zhang J, Lu Y, Li X, Zhang W, Zhang W, et al. NIT1 suppresses tumour proliferation by activating the TGFβ1-Smad2/3 signalling pathway in colorectal cancer. Cell Death Dis. 2018;9(3):263.
This work was supported by the University Nursing Program for Young Scholars with Creative Talents in Heilongjiang Province (UNPYSCT-2018140). The funders have no roles in the design of the study, collection, analysis and interpretation of the data or writing of the manuscript.
Ethics approval and consent to participate
Consent for publication.
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional file 1:
Fig. S1. Heatmap of the top ten DEGs of up-regulated cluster and down-regulated cluster in five datasets. (A) Heatmap of DEGs identified in GSE23194. (B) Heatmap of DEGs identified in GSE32323. (C) Heatmap of DEGs identified in GSE37182. (D) Heatmap of DEGs identified in GSE103512. (E) Heatmap of DEGs identified in GSE156355
Additional file 2:
Fig. S2. WGCNA analysis of GSE23194. (A) Sample dendrogram and clinical trait heatmap. (B) The identification of βvalue for the optimal scale-free topology network. (C) Module identification. The dendrogram indicates the gene clustering according to TOM dissimilarity
Additional file 3:
Fig. S3. WGCNA analysis of GSE32323. (A) Sample dendrogram and clinical trait heatmap. (B) The identification of βvalue for the optimal scale-free topology network. (C) Module identification. The dendrogram indicates the gene clustering according to TOM dissimilarity
Additional file 4:
Fig. S4. WGCNA analysis of GSE37182. (A) Sample dendrogram and clinical trait heatmap. (B) The identification of βvalue for the optimal scale-free topology network. (C) Module identification. The dendrogram indicates the gene clustering according to TOM dissimilarity
Additional file 5:
Fig. S5. WGCNA analysis of GSE103512. (A) Sample dendrogram and clinical trait heatmap. (B) The identification of βvalue for the optimal scale-free topology network. (C) Module identification. The dendrogram indicates the gene clustering according to TOM dissimilarity
Additional file 6:
Table S1. The detailed information of MaRGs obtained from GeneCard with the keyword "macrophage"
Additional file 7:
Table S2. The detailed information about the PPI analysis of 29 DE-MaRGs
Additional file 8:
Table S3. The detailed information of the LASSO analysis for identification of key DE-MaRGs
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Chen, Y., Zhang, C., Zou, X. et al. Identification of macrophage related gene in colorectal cancer patients and their functional roles. BMC Med Genomics 14, 159 (2021). https://doi.org/10.1186/s12920-021-01010-0
- Colorectal cancer
- Macrophage-related genes