Skip to main content

Co-expression analysis to identify key modules and hub genes associated with COVID-19 in platelets


Corona virus disease 2019 (COVID-19) increases the risk of cardiovascular occlusive/thrombotic events and is linked to poor outcomes. The underlying pathophysiological processes are complex, and remain poorly understood. To this end, platelets play important roles in regulating the cardiovascular system, including via contributions to coagulation and inflammation. There is ample evidence that circulating platelets are activated in COVID-19 patients, which is a primary driver of the observed thrombotic outcome. However, the comprehensive molecular basis of platelet activation in COVID-19 disease remains elusive, which warrants more investigation. Hence, we employed gene co-expression network analysis combined with pathways enrichment analysis to further investigate the aforementioned issues. Our study revealed three important gene clusters/modules that were closely related to COVID-19. These cluster of genes successfully identify COVID-19 cases, relative to healthy in a separate validation data set using machine learning, thereby validating our findings. Furthermore, enrichment analysis showed that these three modules were mostly related to platelet metabolism, protein translation, mitochondrial activity, and oxidative phosphorylation, as well as regulation of megakaryocyte differentiation, and apoptosis, suggesting a hyperactivation status of platelets in COVID-19. We identified the three hub genes from each of three key modules according to their intramodular connectivity value ranking, namely: COPE, CDC37, CAPNS1, AURKAIP1, LAMTOR2, GABARAP MT-ND1, MT-ND5, and MTRNR2L12. Collectively, our results offer a new and interesting insight into platelet involvement in COVID-19 disease at the molecular level, which might aid in defining new targets for treatment of COVID-19–induced thrombosis.

Peer Review reports


The coronavirus SARS-CoV-2 is a highly contagious infection that causes a severe respiratory disease known as COVID-19. This disease that has reached a pandemic level, is impacting tens of millions of people worldwide. In the United States, there are around 78 million reported cases, over 4 million hospital admissions, and 900 thousand deaths as of February 2022 [1]. It is now known that COVID-19-induced thrombosis increases the incidence of cardiovascular occlusive events in infected patients, a fact that has been reported in several studies [2,3,4], Indeed, abnormal hemostasis responses were observed in COVID-19 hospitalized patients, which was linked to poor prognosis [2, 5, 6] In addition, studies have shown that COVID-19 leads to increase in platelet activation through alterations of platelet transcriptome and proteome [7, 8]. In this connection, it is now well established that platelets play roles beyond vascular hemostasis, including innate immunity and tumor metastasis [9]. Moreover, platelets were shown to be activated in the septic state, and antiplatelet therapy has been used as a strategy to prevent organ damage in sepsis [10]. To this end, evidence has indicated that viral infections are associated with coagulation disorders, and thrombotic cardiovascular events [11, 12], which is consistent with the thrombotic phenotype seen in COVID-19 patients/SARS-CoV-2 viral infection. While there has been some progress, our understanding of the pathways that govern platelet participation in COVID-19–induced thrombosis remains limited, but clearly warrants investigation.

To obtain a comprehensive insight into the pathogenesis of specific disease states, several computational and research methods have been developed [13]. Some of these approaches were employed to examine the potential gene networks, which are very instrumental to guide understanding of diseases and their mechanistic pathways. Notably, co-expression analysis is one such approach, which clusters genes into coexpressed groups known as modules. These genes that belong to the same module are thought to share functional properties [14]. This approach relies on using graph theory concepts that allow researchers to understand in a systematic way the relations between the genes of a module and the phenotype based on the module eigingene [14]. In fact, co-expression using weighted correlation network analysis (WGCNA) has been used for analyzing a number of biological processes, including cancer [15, 16] and cognitive and mental disorders [17, 18]. In short, gene networks provide the utility to move beyond individual-gene comparisons and comprehensively identify biologically meaningful relationships between gene products and phenotypes.

At the same time, machine learning and artificial intelligence are getting extensively used in biology [19], especially for feature selection. “Feature selection” is used to select the minimum number of features to predict the biological phenomenon or correctly classify the biological samples. This approach facilitates understanding of the underlying disease mechanisms and other factors that reasonably could have affected the disease status. One particular approach for results validation is to build a classifier using the information derived from the identified set of biomarkers (e.g., gene expression) and test the performance of that classifier on totally different data set to examine its ability to classify two status (e.g. disease vs healthy). successful classifier gives strong evidence supporting the biomarkers validity [20, 21].

Previous studies on the mechanisms of thrombosis in COVID-19 disease have primarily concentrated on specific pathophysiological functions, with relatively fewer studies identifying comprehensive regulatory networks. Therefore, in the present study, WGCNA was used to determine gene networks associated with COVID-19 disease in platelets. PRJNA634489 data set- which contained a total of 15 samples from COVID-19 patients and health controls [7] was used in the present study. Three modules with the highest level of significance in correlation with COVID-19 disease were identified. Of note, the three aforementioned modules were validated as a predictor of COVID-19 phenotype using another set, and the three genes with the highest intramodular connectivity were selected as the hub genes in the respective modules for COVID-19. Gene enrichment analysis was also conducted to determine enrichments in the key modules. The results of this study may provide novel information/insights into the underlying mechanisms of COVID-19 disease and may assist in the identification of potential biomarkers for diagnosis and/or targets for treatment.


Data preprocessing and differentially expressed genes screening

RNAseq data is publicly available and were downloaded from BioProject accession #PRJNA634489 [7]. Data comprised of ten COVID-19 patients in addition to age- and sex-matched five healthy controls. Of note, while the original paper included a total 58 subjects divided as 41 COVID-19 patients and 17 healthy controls, samples from only 15 subjects were sequenced, and hence used in our analysis. The Kallisto program was employed for pseudoalignment of reads and quantification to obtain the counts and the transcript per million (TPM) [22]. Log2CPM (log transformed counts per million) was used for the differential expression analysis by employing Voom normalization [23] and Limma R package [24] TPM normalized and filtered to exclude low variance transcripts (\(\le\) 0.001) [25] was used for the weighted gene co-expression network analysis. All methods were performed in accordance with the relevant guidelines and regulations.

RNA seq data for validation was downloaded from the publicly NCBI SRA repository under accession: #PRJNA736410, analyzed and normalized by following the same steps as first data set.

Weighted gene coexpression network analysis

The weighted co-expression network was produced using R package “WCGNA” [14] as per the flowchart in Fig. 1. To weight highly correlated genes, the soft thresholding power (\(\beta\)) was set at 12, and the minimal module size was set at 30. To define clusters of genes in the data set, the adjacency matrix was used to calculate the topological overlap matrix (TOM), which shows the degree of overlap in shared neighbors between pairs of genes in the network. The resulting gene network was visualized as a heatmap.

Fig. 1
figure 1

Flowchart illustrating the major step of the study

Screening for key modules and hub genes

Correlation between module eigengenes and the COVID-19 status was calculated to identify key modules that have significant correlation. The correlation values were displayed within a heatmap. The modules that correlated with COVID-19 most significantly were considered as the key modules. Gene significance (GS) was defined as the correlation between gene expression and the COVID-19 status. Module membership (MM) was defined as the correlation between gene expression and each module’s eigengene, and intramodular connectivity (, which measures how connected a given gene with respect to the genes of a particular module, was also calculated using WGCNA. Subsequently, the correlation between GS and MM as well as GS and were examined to verify module-COVID-19 status associations. The correlation analyses in this study were performed using the Pearson correlation as described in the “WGCNA” package [14]. All module genes were ranked according to their intramodular connectivity, and only the top three genes were selected as hub genes.

Validation of key modules using machine learning

To validate the results of the above mentioned analysis, multiple classification models (Lasso, Naiive Bayes, Random forest, SVM and XGBboost) were trained using the key modules of the original data set. Those models were employed to classify the samples of a second data set [26] of platelets gene expression in COVID-19 patients and healthy subjects. The second data was totally isolated from the training process.

Functional enrichment analysis of key modules

The genes in each key modules were extracted from the network and enrichment analysis was performed to further explore the functions of the respective modules. Targetmine [27] which is a web-based integrative data analysis platform for target prioritisation and broad-based biological knowledge discovery- was used to perform Gene Ontology (GO) and Reactome pathway enrichment analysis. In this analysis, a benjamini hochberg adjusted P-value of 0.05 was set as the significance threshold to identify the most significant functional pathways/GO terms. Only top results of enriched terms are reported.

Statistical and visualization tools

We used R statistical programming language [28] version 4.1.0, with the following packages: “WGCNA” [14] for coexpression analysis; “Scikit-learn” [29] for machine learning building and evaluation; “circlize” [30] for chord diagram building; “ggplot2” [31] and “seaborn” [32] for visualization ; “Igraph” for network analysis [33] and “ggraph” [34] for network visualization.


Construction of co-expression network

The transcript per million (TPM) gene expression data set were filtered based on variance, and 7119 genes in the 15 samples of ten COVID-19 patients and five healthy controls were used to construct the co-expression network. The results of cluster analysis of the samples are demonstrated in (Fig. 2A). To construct the network, a soft-threshold of 12 was used to obtain the approximate scale-free topology (Additional file 1: Fig. S1). Genes across the 15 samples were hierarchically clustered based on topological overlap (Fig. 2C, D). We identified 16 modules in which genes are coexpressed, random colors were assigned to the modules to distinguish between them. The size (number of genes/module) of each module is presented in (Fig. 2B). To demonstrate how these modules were relatively distinctive, we plotted the network heatmap of 400 randomly selected genes based on topological overlap matrix dissimilarity and their cluster dendrogram (Fig. 3A) indicating relative independence among these clusters.

Fig. 2
figure 2

Construction of weighted co-expression network. A Sample dendrogram and trait heatmap. B Module size. C and D Cluster dendrogram block 1 and block 2. Each color represents one specific co-expression module, and branches above represent genes. Plots produced by WGCNA R package Version 1.70-3)

Fig. 3
figure 3

Co-expression module analysis. A Interaction of co-expression genes based on TOM dissimilarity and the cluster dendrogram of 1000 randomly selected genes. The colors of the axes represent respective modules. The intensity of the yellow inside the heatmap represents the degree of overlap, with a darker yellow representing an increased overlap. B Eigengene adjacency heatmap of different gene co-expression modules. C An eigengene dendrogram identified groups of correlated modules. D Heatmap of the correlation between COVID-19 status and module eigengenes. Column corresponds to a clinical trait, and each row corresponds to a module. Each row contains the correlation coefficients which correspond to the cell color; green represents negative correlation and red represents positive correlation. The P-values are stated in the brackets

Correlation between modules and COVID-19 disease status

To examine the relation of COVID-19 status with the emerged modules, we built the eigengene adjacency matrix by calculating the correlation of the eigengenes matrix after inserting COVID-19 status to the matrix. The heatmap (Fig. 3B) showed the modules’ relationship and the correlation between the modules namely black, cyan, yellow, blue, and magenta and COVID-19 status.

Identification of key modules in relationship to COVID-19 disease status

To further determine the closest modules to COVID-19 status, we re-clustered the eigengenes using single linkage method with absolute correlation as a distance function; the single linkage clustering algorithm looks for closest pair of modules to form a cluster, then cluster them with the next nearest module progressively until one cluster is formed [35]. As demonstrated in Fig. 3C, the closest three modules to COVID-19 status are magenta, yellow and black. Three essential measurements can help confirm the importance of the module to a specific trait, 1) Module membership (MM), which increases for a particular gene, when the module eigengene accurately represents this gene, 2) gene significance (GS) is measured by calculating the correlation of gene expression with the specific trait and 3) intramodular connectivity ( for a gene within the module, reflecting the centrality of the gene to the module expression network. Based on WGCNA, if a gene is higher with GS, MM, and, it is more meaningful to the clinical trait of interest [36, 37].

Explicitly, the higher the correlation between gene significance of genes in a module and their module membership, the higher its importance. Similarly, when the gene centrality in the network increases in parallel with gene significance, that also is strong evidence that key modules are essential in that trait. The correlations between gene significance and module membership as well as between gene significance and intramodular connectivity show that yellow, black, and magenta modules have the highest correlation values with a substantial difference to the next nearest module (Blue R = 0.61) (Additional file 1: Fig. S2). For those reasons, we selected yellow, black, and magenta modules for further investigation and will refer to them using the term key modules.

Key modules show high correlation to COVID-19 disease status

The module-trait relationship was determined by correlating module eigengenes with COVID-19 disease status to identify significant correlation. The yellow and the black modules exhibited the highest positive correlation (R=0.91; p-value=\(3 \times 10^{-6}\), and R=0.86; p-value= \(3 \times 10^{-5}\), respectively; Fig. 3D). On the other hand, the magenta module (R=-0.96; p-value=\(1 \times 10^{-8}\)) exhibited the highest negative correlation (Fig. 3D). Therefore, these three modules were identified as key modules for COVID-19 disease and its impact on platelets. The significant correlations between the different GS, MM, and for COVID-19 are illustrated in (Fig. 4A, B). We also showed the GS, MM, and of the green module that showed the low correlation to COVID-19 disease status (Fig. 4A, B).

Fig. 4
figure 4

Module features of GS, MM and A Module Membership (MM) and Gene significance (GS) are significantly correlated in the key modules (magenta, black, and yellow). Each point represented an individual gene within each module, which was plotted by GS on the y-axis and MM on the x-axis. The regression line, correlation value, and p-value were shown for each plot. B Correlation of the (x-axis) and the GS (y-axis). Green module is not a key module and was added here for sake of comparison

In summery, although all samples were used to identify the co-expression modules, the top modules were selected based on meeting the following criteria: 1) high correlation between module eigengene and COVID-19 status, 2) close clustering with COVID-19 status using single linkage with absolute correlation distance, 3) high correlation between genes significance and module membership, and 4) high correlation between gene significance and intramodular connectivity. Together those measures confirm the importance of the key modules in COVID-19 status

Key modules’ genes can differentiate COVID-19 from normal subjects

The classification models trained using data from key modules genes showed high performance in terms of high balanced accuracy, sensitivity, specificity, Matthews correlation coefficient, as well as, area under the receiver operating characteristic curve (AUC) (Fig. 5), suggesting that the genes of these three modules are important in the pathology of COVID-19 disease. Furthermore, the accurate classification of the external validation set samples suggests that these results can be generalized and not limited to the analyzed data set.

Fig. 5
figure 5

Machine learning performance in classification of validation set samples. This bar plot shows the performance of 5 modules (Lasso, Naiive Bayes, Random Forest, SVM and XGboost). All the models have good performance and could classify the samples with high specificity and sensitivity

Gene hub detection and visualization of module networks

Genes in the selected key modules were ranked according to the intramodular connectivity and the top 20 genes of each key modules were used to visualize the network of each specific module (Fig. 6). Subsequently, the top three genes of the yellow, black, and magenta modules were labeled as the hub genes in their modules that are important for COVID-19 disease. Thus, the protein coding genes COPE, CDC37 and CAPNS1 were selected as the hub genes in the yellow module, whereas AURKAIP1, LAMTOR2, and GABARAP protein coding genes were selected as the hub genes in the black module. Regarding the magenta module, MT-ND1, MT-ND5, and MTRNR2L12 were selected as hub genes. All of these hub genes exhibited a high intramodular connectivity, which established their network centrality and potentially vital roles in the COVID-19 disease. We also observed that not all of hub genes show differential gene expression (Table 1). A full list of genes and their modules can be found in the supplementary tables (Additional files 2, 3, 4, 5, 6).

Fig. 6
figure 6

Interaction of gene co-expression patterns in the key identified module and hub gene abundance. The module was visualized using R package “ggraph” software. The node size corresponds to the level. and the thickness of the link represents the strength of correlation between genes. For sake of visualiztion clarity, edges of weight less than 0.6 were not drawn

Table 1 Differential expression of hub genes identified in the key modules

Enrichment analysis of key modules

Gene ontology (GO) pathway enrichment analyses were performed on the yellow, black, and magenta modules using Targetmine platform, and the top relevant terms of each category are presented in (Fig. 7A). The pathway enrichment results demonstrated that the genes in both yellow and black modules were primarily enriched in pathways associated with metabolic process, protein translation, energy substance metabolism, mitochondrial activity, and oxidative phosphorylation. Genes in the magenta module were enriched in several pathways that are primarily associated with regulation of megakaryocyte differentiation and apoptosis, including the regulation of the execution phase of apoptosis. Reactome showed enriched pathways of metabolism, platelet degranulation, and response to elevated platelet cytosolic Ca2+ in the yellow module. The black module shows enrichment of respiratory electron transport, ATP synthesis by chemiosmotic coupling, heat production by uncoupling proteins, citric acid (TCA) cycle, and respiratory electron transport just to name a few (Fig. 7B) (More detailed results are shown in Additional file 1: Fig. S3 and cross check of hubgenes with Disgenet database is shown in Additional file 6: Table: S5 [38]).

Fig. 7
figure 7

Gene ontology (GO) terms and Reactome enrichment analysis of the key modules. Chord plots depicting the enriched top significant GO terms of biological process (A), and top significant Reactome pathways (B) in the key modules. Thickness of the connection is corresponding to the number of involved genes. Figures are produced using circlize R package Version 0.4.13)


The underlying pathophysiological mechanisms of thrombosis in COVID-19 are extremely complicated [39], and hence clearly require more examination. Inspecting gene co-expression patterns is proven to be an effective method to analyze and uncover complicated genetic networks. To address the aforementioned issues, in the present study, gene co-expression analysis was performed on platelet RNAseq data set containing gene expression data from ten COVID-19 patients and five healthy controls. There were three modules that were identified as the key modules in COVID-19, with the highest level of significant association. The top three genes of each key module with the highest intramodular connectivity were identified as hub genes for COVID-19 in platelets. The results of the enrichment analysis suggest that the key modules and the pathological processes underlying the disease are associated with energy metabolism, mitochondrial processes, and apoptosis. Furthermore, we also saw enrichment of platelet secretion and activation pathways. These results provide- at least in part- an insight into the comprehensive platelet regulatory network in COVID-19, which should improve the current understanding of the mechanisms underlying immunothrombosis in COVID-19 patients. Ultimately, these findings might help in finding appropriate therapeutic targets.

The present study used the data in BioProject accession #PRJNA634489 [7] to perform the co-expression analysis using WGCNA. The data used in this study, which was generated by Manne et al. [7], revealed that COVID-19 disease leads to changes in platelet transcriptional profiles in comparison to control. Manne et al. showed that platelet differential gene expression in COVID-19 is associated with enrichment of protein ubiquitination, antigen presentation, and mitochondrial dysfunction. The major differences in the genes or modules obtained in the present study, compared with the results from other studies including the one by Manne et. al. [7] is that the present study used a more comprehensive method by employing WGCNA. Using this method, we were able to identify/pull-out co-expression modules of genes, namely the yellow and black modules, which represent important regulatory modules of platelet function in COVID-19. In addition, we were able to identify the magenta module, which represents genes that are negatively correlated with the COVID-19 disease state and show enrichment in megakaryocyte differentiation and apoptotic pathways. This systematic and in-depth analysis should complement results obtained from conventional DEGs analysis, and therefore allow for a better understanding of the pathophysiological mechanisms of COVID-19 disease.

Notably, the co-expression analysis revealed a total cluster of 16 modules with the yellow and black modules exhibiting the strongest positive correlation and the magenta exhibiting the strongest negative correlation to COVID-19 disease. These three modules were selected as key modules and their genes deemed important for the COVID-19 disease state. This result was validated when the genes of these three key modules/clusters were used to accurately classify the subjects from another recently published platelet data set (Barrett et. al 2021) to either COVID-19 or healthy using machine learning classifiers. The high accuracy of this classification underscores the importance of these platelet gene clusters in the pathogenesis of COVID-19 disease.

Enrichment analysis indicated that the genes in the yellow and black modules were primarily associated with platelet metabolism, energy, and oxidative phosphorylation. Furthermore, the analysis of the yellow module showed enrichment of a host of platelet functional responses/activities, such as platelet degranulation/secretion and increased platelet response to Ca2+. Indeed, other studies showed the COVID-19 disease to be associated with platelet activation and increased platelet alpha granule secretion, which are critical in the development of thrombosis seen in those patients [7, 8]. It is noteworthy that the platelet alpha granule secretion response is not only important for thrombus formation, but also in inflammation by releasing receptors that facilitate adhesion of platelets with other vascular cells as well as releasing a wide range of inflammatory chemokines [40].

The yellow and black modules show strong enrichment in platelet metabolic processes, which is in agreement with the increase in platelet activation. To this end, previous data have shown that platelet transition from inactive to active state requires alteration in ATP availability [41], and furthermore, substrate metabolism (e.g. glucose) was shown to be essential for platelet activation [42], and thrombosis [43]. This seems to suggest that altered platelet metabolism may play a critical role in the pathophysiology of thrombosis in COVID-19 patients. It is important to note that reports have suggested that a state of hypermetabolic demand is one of COVID-19 disease features, especially when sepsis develops [44]. Like other viruses that can impact cellular metabolism in human cells and utilize them to their advantage, SARS-CoV-2 virus appears to have the ability to localize proteins to mitochondria and hijack the host’s mitochondrial function [45]. This mechanism might explain the enrichment of platelet mitochondrial processes we observed in the yellow and black modules. This finding is in fact supported by a recent study that reported that SARS-CoV-2 impacts mitochondria in platelets, which affects their involvement in the pathophysiology of thrombosis in COVID-19 patients [46]. The enrichment of protein translation in the yellow and black modules suggests an alteration in protein synthesis and possible hijacking of the translation machinery of platelet by the virus. In line with this observation, one study suggested that the cells infected with SARS-CoV-2 might exhibit a faster protein synthesis rate, which implies a higher translation rate [47]. This notion requires further investigation to determine the exact mechanism underlying enhancement of translation in platelets of COVID-19 patient.

One particular characteristic of platelet apoptotic processes is phosphatidylserine (PS) exposure, which is essential for the generation of thrombin [48]. PS exposure is found to be downregulated in activated platelets from COVID-19 patients due to mitochondrial dysfunction [46]. This observation is supported by the negative regulation of apoptotic processes in platelet enrichment in the negatively correlated magenta module. On the contrary, another report showed that COVID-19 increases PS externalization, which is linked to thrombosis [49]. The impact of platelets mitochondrial damage on hemostasis seems to depend on its severity. Thus, it leads to bleeding by progressing toward apoptosis if it is severe; or toward platelet activation pathways and development of thrombosis risk in case of mild damage [50]. Based on this reasoning, COVID-19 disease-caused mitochondrial damage in platelets is probably mild; and hence the thrombotic phenotype still prevails in these patients. Based on these considerations, more investigation is needed to confirm these observations and to understand the underlying mechanisms.

Additionally, we identified hub genes in each of the key modules. For example, in the yellow module the COPE, CDC37, and CAPNS1, which are protein coding genes involved in vesicle-mediated transport, positive regulation of cellular processes, and regulation of interferons. Furthermore, some of these protein coding genes have also been investigated in platelets and shown to regulate important aspects of their function [51,52,53,54], Interestingly, although our co-expression analysis showed that CAPNS1 is an important hub gene in the yellow module, this gene was not differentially expressed in our differential gene expression analysis. Furthermore, CAPNS1 was found to play a significant role in regulating platelet activity and thrombosis under hypoxia [53], a condition commonly seen in severe COVID-19 patients [55]. This observation might indicate that some of the important genes in establishing thrombotic phenotype in COVID-19 may not necessarily be differentially expressed.

The hub genes of the black module, AURKAIP1, LAMTOR2, and GABARAP are linked to regulation of mitochondrial activity, regulation of signaling processes, and protein targeting. Data on the role of these genes in platelets is limited, thus, further investigation is warranted. It is noteworthy that LAMTOR2 is a known regulator of the MAPK/ERK and mTOR signaling pathways [56, 57], both of which were shown to be important in regulating platelet function [58, 59]. Moreover, the p14/LAMTOR2 deficiency- which is associated with one of the primary immunodeficiency diseases that also include “Hermansky–Pudlak syndrome type 2”- has been linked to platelet defects [60]. However, more needs to be done to examine the exact role of LAMTOR2 in platelets of COVID-19 patients.

In the magenta module, MT-ND1 [61], MT-ND5 [62], and MTRNR2L12 protein coding genes are related to NADH dehydrogenase activity and apoptotic processes. According to our analysis, all hub genes in the magenta module are differentially expressed and downregulated in COVID-19 patients in comparison to healthy controls. Down regulation of MT-ND1 and MT-ND5 protein coding genes might, at least in part, explain the mitochondrial dysfunction seen in platelets of COVID-19 patients. With respect to MTRNR2L12, it was observed that it is one of the differentially expressed genes in bronchoalveolar lavage fluid samples from patients with severe COVID-19 in comparison to control [63]. MTRNR2L12 is a paralog of the protein coding gene MTRNR2L8, and both are expressed in platelets [64]. It is of note that MTRNR2L12 was shown to be among the top 10 RNA with differential splice junctions in platelets of patients of multiple sclerosis [65].

In addition to the identified hub genes, a number of other canonical platelet genes in the yellow and black modules were also associated with platelet function. For example, SLEB and ITGA2B protein coding genes were present in the yellow module with high intramodular connectivity (ranked in the top 50) and both proteins are critical for platelet function [66]. Moreover, another canonical platelet gene that was also identified in the yellow module, namely ITGB3 was ranked 132 with regard to its intramodular connectivity, which is considered high in the yellow module of 681 genes. Furthermore, we also noticed that the protein coding gene IFITM3 shows high module membership (black module). The protein encoded by this gene is an interferon-induced membrane protein that was shown to be important in immunity against influenza A H1N1 virus, West Nile virus, and dengue virus [67,68,69]. Most recently, IFITM3 was also found to be upregulated protein in COVID-19 disease [7, 70], which importantly was also demonstrated/confirmed by Western blot [7].

The present study has certain limitations that should be noted. Firstly, the analysis focused on only one data set, due to limited access to platelet gene expression data that were collected from COVID-19 patients. Therefore, additional data sets should be analyzed, if available, to validate our findings and/or obtain more representative results. Also, the number of samples was 15, which may be associated with some noise, albeit it is the minimum number of samples recommended for co-expression analysis by WGCNA. Finally, any limitations in the original study, from which the data was obtained will also be reflected in the results of this study.

In conclusion, our co-expression analysis of a platelet RNAseq data set from COVID-19 patients and healthy controls revealed 16 modules, amongst which the yellow, black, and magenta were identified as the most critical in COVID-19 disease and validated using machine learning. Additionally, nine hub genes were determined to potentially serve key roles in the pathophysiological mechanisms of COVID-19 in the context of platelet biology. The positively associated yellow and black modules were identified to be involved in platelet degranulation, energy metabolism, and mitochondria. The negatively associated magenta module was associated with interactive pathways of apoptosis. These data should help expand our understanding of the underlying mechanisms of thrombosis in COVID-19 disease and help promote and guide future experimental studies to investigate the roles of the protein coding genes in the pathophysiology of this disease. Additionally, these genes may serve as novel therapeutic targets for treating patients.

Availability of data and materials

The data sets analysed during the current study are publicly available in the NCBI BioProject repositories, (PRJNA634489, PRJNA736410).



Whole genome correlation network analysis


Severe acute respiratory syndrome corona virus 2


Corona virus disease 2019


Transcript per million


Log transformed counts per million


Topological overlap matrix

Intramodular connectivity


Gene ontology


Module membership


Gene significance


Adenosine triphosphate


Citric acid cycle




  1. CDC. COVID Data Tracker Weekly Review. Visited 2022-02-18). 2022. Accessed 18 Feb 2022.

  2. Tang N, Li D, Wang X, Sun Z. Abnormal coagulation parameters are associated with poor prognosis in patients with novel coronavirus pneumonia. J Thromb Haemostasis. 2020;18(4):844–7.

    CAS  Article  Google Scholar 

  3. ...Huang C, Wang Y, Li X, Ren L, Zhao J, Hu Y, Zhang L, Fan G, Xu J, Gu X, Cheng Z, Yu T, Xia J, Wei Y, Wu W, Xie X, Yin W, Li H, Liu M, Xiao Y, Gao H, Guo L, Xie J, Wang G, Jiang R, Gao Z, Jin Q, Wang J, Cao B. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. The Lancet. 2020;395(10223):497–506.

    CAS  Article  Google Scholar 

  4. ...Guan W-j, Ni Z-y, Hu Y, Liang W-h, Ou C-q, He J-x, Liu L, Shan H, Lei C-l, Hui DSC, Du B, Li L-j, Zeng G, Yuen K-Y, Chen R-c, Tang C-l, Wang T, Chen P-y, Xiang J, Li S-y, Wang J-l, Liang Z-j, Peng Y-x, Wei L, Liu Y, Hu Y-h, Peng P, Wang J-m, Liu J-y, Chen Z, Li G, Zheng Z-j, Qiu S-q, Luo J, Ye C-j, Zhu S-y, Zhong N-s. Clinical characteristics of coronavirus disease 2019 in China. N Engl J Med. 2020;382(18):1708–20.

    CAS  Article  PubMed  Google Scholar 

  5. Zhou F, Yu T, Du R, Fan G, Liu Y, Liu Z, Xiang J, Wang Y, Song B, Gu X, Guan L, Wei Y, Li H, Wu X, Xu J, Tu S, Zhang Y, Chen H, Cao B. Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study. The Lancet. 2020;395(10229):1054–62.

    CAS  Article  Google Scholar 

  6. Bowles L, Platton S, Yartey N, Dave M, Lee K, Hart DP, MacDonald V, Green L, Sivapalaratnam S, Pasi KJ, MacCallum P. Lupus anticoagulant and abnormal coagulation tests in patients with Covid-19. N Engl J Med. 2020;383(3):288–90.

    CAS  Article  PubMed  Google Scholar 

  7. Manne BK, Denorme F, Middleton EA, Portier I, Rowley JW, Stubben C, Petrey AC, Tolley ND, Guo L, Cody M, Weyrich AS, Yost CC, Rondina MT, Campbell RA. Platelet gene expression and function in patients with COVID-19. Blood. 2020;136(11):1317–29.

    CAS  Article  PubMed  Google Scholar 

  8. Zaid Y, Puhm F, Allaeys I, Naya A, Oudghiri M, Khalki L, Limami Y, Zaid N, Sadki K, Haj RBE, Mahir W, Belayachi L, Belefquih B, Benouda A, Cheikh A, Langlois M-A, Cherrah Y, Flamand L, Guessous F, Boilard E. Platelets can associate with SARS-CoV-2 RNA and are hyperactivated in COVID-19. Circ Res. 2020;127(11):1404–18.

    CAS  Article  PubMed Central  Google Scholar 

  9. Smyth SS, McEver RP, Weyrich AS, Morrell CN, Hoffman MR, Arepally GM, French PA, Dauerman HL, Becker RC, Arepally GM, Becker RC, Bhatt DL, Cho J, Dauerman HL, Gretler DD, Hoffman MR, Horrow J, Kleiman NS, Kocharian R, Lincoff AM, Maya J, McEver RP, Morrell CN, Prats J, Rusconi CP, Smyth SS, Strony J, Sun H, Veltri EP, Weyrich AS, Wiviott SD, Wood JP. Platelet functions beyond hemostasis. J Thromb Haemost. 2009;7(11):1759–66.

    CAS  Article  PubMed  Google Scholar 

  10. Bashour TT, Myler RK, Andreae GE, Stertzer SH, Clark DA, Ryan CJ. Current concepts in unstable myocardial ischemia. Am Heart J. 1988;115(4):850–61.

    CAS  Article  PubMed  Google Scholar 

  11. Goeijenbier M, van Wissen M, van de Weg C, Jong E, Gerdes VEA, Meijers JCM, Brandjes DPM, van Gorp ECM. Review: viral infections and mechanisms of thrombosis and bleeding. J Med Virol. 2012;84(10):1680–96.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  12. Kwong JC, Schwartz KL, Campitelli MA, Chung H, Crowcroft NS, Karnauchow T, Katz K, Ko DT, McGeer AJ, McNally D, Richardson DC, Rosella LC, Simor A, Smieja M, Zahariadis G, Gubbay JB. Acute myocardial infarction after laboratory-confirmed influenza infection. N Engl J Med. 2018;378(4):345–53.

    Article  PubMed  Google Scholar 

  13. Cai Y, González JV, Liu Z, Huang T. Computational systems biology methods in molecular biology, chemistry biology, molecular biomedicine, and biopharmacy. BioMed Res Int. 2014;2014:1–2.

    CAS  Article  Google Scholar 

  14. Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinfor. 2008;9(1):66.

    CAS  Article  Google Scholar 

  15. Tian Z, He W, Tang J, Liao X, Yang Q, Wu Y, Wu G. Identification of important modules and biomarkers in breast cancer based on WGCNA. OncoTargets Therapy. 2020;13:6805–17.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  16. Bai K-H, He S-Y, Shu L-L, Wang W-D, Lin S-Y, Zhang Q-Y, Li L, Cheng L, Dai Y-J. Identification of cancer stem cell characteristics in liver hepatocellular carcinoma by WGCNA analysis of transcriptome stemness index. Cancer Med. 2020;9(12):4290–8.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  17. Liang J-W, Fang Z-Y, Huang Y, Liuyang Z-Y, Zhang X-L, Wang J-L, Wei H, Wang J-Z, Wang X-C, Zeng J, et al. Application of weighted gene co-expression network analysis to explore the key genes in Alzheimer’s disease. J Alzheimer’s Dis. 2018;65(4):1353–64.

    CAS  Article  Google Scholar 

  18. Zeng D, He S, Ma C, Wen Y, Song W, Xu Q, Zhao N, Wang Q, Yu Y, Shen Y, Huang J, Li H. Network-based approach to identify molecular signatures in the brains of depressed suicides. Psychiatry Res. 2020;294:113513.

    CAS  Article  PubMed  Google Scholar 

  19. Greener JG, Kandathil SM, Moffat L, Jones DT. A guide to machine learning for biologists. Nat Rev Mol Cell Biol. 2021;23(1):40–55.

    CAS  Article  PubMed  Google Scholar 

  20. Saeys Y, Inza I, Larranaga P. A review of feature selection techniques in bioinformatics. Bioinformatics. 2007;23(19):2507–17.

    CAS  Article  PubMed  Google Scholar 

  21. Xiao J, Wang R, Cai X, Ye Z. Coupling of co-expression network analysis and machine learning validation unearthed potential key genes involved in rheumatoid arthritis. Front Genet. 2021;66:12.

    CAS  Article  Google Scholar 

  22. Bray NL, Pimentel H, Melsted P, Pachter L. Near-optimal probabilistic RNA-seq quantification. Nat Biotechnol. 2016;34(5):525–7.

    CAS  Article  PubMed  Google Scholar 

  23. Law CW, Chen Y, Shi W, Smyth GK. voom: precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol. 2014;15(2):29.

    CAS  Article  Google Scholar 

  24. Smyth GK. Linear models and empirical Bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol. 2004;3(1):1–25.

    Article  Google Scholar 

  25. Kadarmideen HN, Watson-haigh NS. Building gene co-expression networks using transcriptomics data for systems biology investigations: comparison of methods using microarray data. Bioinformation. 2012;8(18):855–61.

    Article  PubMed  PubMed Central  Google Scholar 

  26. ...Barrett TJ, Bilaloglu S, Cornwell M, Burgess HM, Virginio VW, Drenkova K, Ibrahim H, Yuriditsky E, Aphinyanaphongs Y, Lifshitz M, Liang FX, Alejo J, Smith G, Pittaluga S, Rapkiewicz AV, Wang J, Iancu-Rubin C, Mohr I, Ruggles K, Stapleford KA, Hochman J, Berger JS. Platelets contribute to disease severity in COVID-19. J Thromb Haemostasis. 2021;19(12):3139–53.

    CAS  Article  Google Scholar 

  27. Chen Y-A, Tripathi LP, Fujiwara T, Kameyama T, Itoh MN, Mizuguchi K. The TargetMine data warehouse: enhancement and updates. Front Genet. 2019.

  28. R Core Team. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria; 2021.

  29. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E. Scikit-learn: machine learning in Python. J Mach Learn Res. 2011;12:2825–30.

    Google Scholar 

  30. Gu Z, Gu L, Eils R, Schlesner M, Brors B. circlize implements and enhances circular visualization in R. Bioinformatics. 2014;30(19):2811–2.

    CAS  Article  PubMed  Google Scholar 

  31. Wickham H. Ggplot2: elegant graphics for data analysis. Springer; 2016.

  32. Waskom ML. seaborn: statistical data visualization. J Open Source Softw. 2021;6(60):3021.

    Article  Google Scholar 

  33. Csardi G, Nepusz T. The igraph software package for complex network research. InterJournal Complex Systems. 2006;1695.

  34. Pedersen TL. Ggraph: an implementation of grammar of graphics for graphs and networks. 2021. R package version 2.0.5.

  35. Bu J, Liu W, Pan Z, Ling K. Comparative study of hydrochemical classification based on different hierarchical cluster analysis methods. Int J Environ Res Public Health. 2020;17(24):9515.

    CAS  Article  PubMed Central  Google Scholar 

  36. Langfelder P, Mischel PS, Horvath S. When is hub gene selection better than standard meta-analysis? PLoS ONE. 2013;8(4):61505.

    CAS  Article  Google Scholar 

  37. Lou Q, Chen L, Mei H, Xu K, Wei H, Feng F, Li T, Pang X, Shi C, Luo L, Zhong Y. Root transcriptomic analysis revealing the importance of energy metabolism to the development of deep roots in rice (Oryza sativa L.). Front Plant Sci. 2017.

  38. Piñero J, Ramírez-Anguita JM, Saüch-Pitarch J, Ronzano F, Centeno E, Sanz F, Furlong LI. The DisGeNET knowledge platform for disease genomics: 2019 update. Nucleic Acids Res. 2019.

    Article  PubMed Central  Google Scholar 

  39. Loo J, Spittle DA, Newnham M. COVID-19, immunothrombosis and venous thromboembolism: biological mechanisms. Thorax. 2021;76(4):412–20.

    Article  PubMed  Google Scholar 

  40. May AE, Seizer P, Gawaz M. Platelets: inflammatory firebugs of vascular walls. Arteriosc Thromb Vasc Biol. 2008;28(3):66.

    Article  Google Scholar 

  41. Aibibula M, Naseem KM, Sturmey RG. Glucose metabolism and metabolic flexibility in blood platelets. J Thromb Haemostasis. 2018;16(11):2300–14.

    CAS  Article  Google Scholar 

  42. Fidler TP, Campbell RA, Funari T, Dunne N, Angeles EB, Middleton EA, Chaudhuri D, Weyrich AS, Abel ED. Deletion of GLUT1 and GLUT3 reveals multiple roles for glucose metabolism in platelet and megakaryocyte function. Cell Rep. 2017;20(4):881–94.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  43. Fidler TP, Marti A, Gerth K, Middleton EA, Campbell RA, Rondina MT, Weyrich AS, Abel ED. Glucose metabolism is required for platelet hyperactivation in a murine model of type 1 diabetes. Diabetes. 2019;68(5):932–8.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  44. Shenoy S. Coronavirus (covid-19) sepsis: revisiting mitochondrial dysfunction in pathogenesis, aging, inflammation, and mortality. Inflamm Res. 2020;69(11):1077–85.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  45. Ajaz S, McPhail MJ, Singh KK, Mujib S, Trovato FM, Napoli S, Agarwal K. Mitochondrial metabolic manipulation by SARS-CoV-2 in peripheral blood mononuclear cells of patients with COVID-19. Am J Physiol Cell Physiol. 2021;320(1):57–65.

    CAS  Article  Google Scholar 

  46. Denorme F, Manne BK, Portier I, Petrey AC, Middleton EA, Kile BT, Rondina MT, Campbell RA. COVID-19 patients exhibit reduced procoagulant platelet responses. J Thromb Haemostasis. 2020;18(11):3067–73.

    CAS  Article  Google Scholar 

  47. Dasari CM, Bhukya R. Comparative analysis of protein synthesis rate in COVID-19 with other human coronaviruses. Infect Genet Evol. 2020;85:104432.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  48. Schoenwaelder SM, Yuan Y, Josefsson EC, White MJ, Yao Y, Mason KD, O’Reilly LA, Henley KJ, Ono A, Hsiao S, Willcox A, Roberts AW, Huang DCS, Salem HH, Kile BT, Jackson SP. Two distinct pathways regulate platelet phosphatidylserine exposure and procoagulant function. Blood. 2009;114(3):663–6.

    CAS  Article  PubMed  Google Scholar 

  49. Althaus K, Marini I, Zlamal J, Pelzl L, Singh A, Häberle H, Mehrländer M, Hammer S, Schulze H, Bitzer M, Malek N, Rath D, Bösmüller H, Nieswandt B, Gawaz M, Bakchoul T, Rosenberger P. Antibody-induced procoagulant platelets in severe COVID-19 infection. Blood. 2021;137(8):1061–71.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  50. Valentino ML, Barboni P, Ghelli A, Bucchi L, Rengo C, Achilli A, Torroni A, Lugaresi A, Lodi R, Barbiroli B, Dotti M, Federico A, Baruzzi A, Carelli V. The ND1 gene of complex i is a mutational hot spot for Leber’s hereditary optic neuropathy. Ann Neurol. 2004;56(5):631–41.

    CAS  Article  PubMed  Google Scholar 

  51. Burkhart JM, Vaudel M, Gambaryan S, Radau S, Walter U, Martens L, Geiger J, Sickmann A, Zahedi RP. The first comprehensive and quantitative analysis of human platelet protein composition allows the comparative analysis of structural and functional pathways. Blood. 2012;120(15):73–82.

    CAS  Article  Google Scholar 

  52. Chen G, Cao P, Goeddel DV. TNF-induced recruitment and activation of the IKK complex require cdc37 and hsp90. Mol Cell. 2002;9(2):401–10.

    CAS  Article  PubMed  Google Scholar 

  53. Pawlinski R. Inhibit the calpain to climb the mountain. Blood. 2014;123(8):1123–4.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  54. Azam M, Andrabi SS, Sahr KE, Kamath L, Kuliopulos A, Chishti AH. Disruption of the mouse μ-calpain gene reveals an essential role in platelet function. Mol Cell Biol. 2001;21(6):2213–20.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  55. Nitsure M, Sarangi B, Shankar GH, Reddy VS, Walimbe A, Sharma V. Mechanisms of hypoxia in COVID-19 patients: a pathophysiologic reflection. Indian J Crit Care Med. 2020;24(10):967–70.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  56. Thauerer B, Voegele P, Hermann-Kleiter N, Thuille N, de Araujo MEG, Offterdinger M, Baier G, Huber LA, Baier-Bitterlich G. LAMTOR2-mediated modulation of NGF/MAPK activation kinetics during differentiation of PC12 cells. PLoS ONE. 2014;9(4):95863.

    CAS  Article  Google Scholar 

  57. Sparber F, Scheffler JM, Amberg N, Tripp CH, Heib V, Hermann M, Zahner SP, Clausen BE, Reizis B, Huber LA, Stoitzner P, Romani N. The late endosomal adaptor molecule p14 (LAMTOR2) represents a novel regulator of langerhans cell homeostasis. Blood. 2014;123(2):217–27.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  58. Flevaris P, Li Z, Zhang G, Zheng Y, Liu J, Du X. Two distinct roles of mitogen-activated protein kinases in platelets and a novel rac1-MAPK-dependent integrin outside-in retractile signaling pathway. Blood. 2009;113(4):893–901.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  59. Aslan JE, McCarty OJT. Regulation of the mTOR-rac1 axis in platelet function. Small GTPases. 2012;3(1):67–70.

    Article  PubMed  PubMed Central  Google Scholar 

  60. Zamani R, Shahkarami S, Rezaei N. Primary immunodeficiency associated with hypopigmentation: a differential diagnosis approach. Allergologia et Immunopathologia. 2021;49(2):178–90.

    Article  PubMed  Google Scholar 

  61. Lim SC, Hroudová J, Bergen NJV, Sanchez MIGL, Trounce IA, McKenzie M. Loss of mitochondrial DNA-encoded protein ND1 results in disruption of complex I biogenesis during early stages of assembly. FASEB J. 2016;30(6):2236–48.

    CAS  Article  PubMed  Google Scholar 

  62. van der Slikke EC, Star BS, van Meurs M, Henning RH, Moser J, Bouma HR. Sepsis is associated with mitochondrial DNA damage and a reduced mitochondrial mass in the kidney of patients with sepsis-AKI. Crit Care. 2021.

    Article  PubMed  PubMed Central  Google Scholar 

  63. Chow RD, Majety M, Chen S. The aging transcriptome and cellular landscape of the human lung in relation to SARS-CoV-2. Nat Commun. 2021;12(1):66.

    CAS  Article  Google Scholar 

  64. Eicher JD, Wakabayashi Y, Vitseva O, Esa N, Yang Y, Zhu J, Freedman JE, McManus DD, Johnson AD. Characterization of the platelet transcriptome by RNA sequencing in patients with acute myocardial infarction. Platelets. 2015;27(3):230–9.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  65. Sol N, Leurs CE, Veld SGI, Strijbis EM, Vancura A, Schweiger MW, Teunissen CE, Mateen FJ, Tannous BA, Best MG, Wurdinger T, Killestein J. Blood platelet RNA enables the detection of multiple sclerosis. Multiple Sclerosis J Exp Transl Clin. 2020;6(3):205521732094678.

    Article  Google Scholar 

  66. Rondina MT, Voora D, Simon LM, Schwertz H, Harper JF, Lee O, Bhatlekar SC, Li Q, Eustes AS, Montenont E, Campbell RA, Tolley ND, Kosaka Y, Weyrich AS, Bray PF, Rowley JW. Longitudinal RNA-seq analysis of the repeatability of gene expression and splicing in human platelets identifies a platelet SELP splice QTL. Circ Res. 2020;126(4):501–16.

    CAS  Article  PubMed  Google Scholar 

  67. Everitt AR, Clare S, Pertel T, John SP, Wash RS, Smith SE, Chin CR, Feeley EM, Sims JS, Adams DJ, Wise HM, Kane L, Goulding D, Digard P, Anttila V, Baillie JK, Walsh TS, Hume DA, Palotie A, Xue Y, Colonna V, Tyler-Smith C, Dunning J, Gordon SB, Smyth RL, Openshaw PJ, Dougan G, Brass AL, PK. IFITM3 restricts the morbidity and mortality associated with influenza. Nature. 2012;484(7395):519–23.

  68. Zhu X, He Z, Yuan J, Wen W, Huang X, Hu Y, Lin C, Pan J, Li R, Deng H, Liao S, Zhou R, Wu J, Li J, Li M. IFITM3-containing exosome as a novel mediator for anti-viral response in dengue virus infection. Cell Microbiol. 2014;17(1):105–18.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  69. Gorman MJ, Poddar S, Farzan M, Diamond MS. The interferon-stimulated gene ifitm3 restricts west nile virus infection and pathogenesis. J Virol. 2016;90(18):8212–25.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  70. Hachim MY, Heialy SA, Hachim IY, Halwani R, Senok AC, Maghazachi AA, Hamid Q. Interferon-induced transmembrane protein (IFITM3) is upregulated explicitly in SARS-CoV-2 infected lung epithelial cells. Front Immunol. 2020;66:11.

    CAS  Article  Google Scholar 

  71. Shimada BK, Boyman L, Zhu J, Yang Y, Huang W, Kane MA, Yadava N, Polster BM, Zou L, Lederer WJ, Chao W. Molecular remodeling of cardiac mitochondria in mice with sepsis-induced cardiomyopathy. 2021.

  72. Maity S, Saha A. Therapeutic potential of exploiting autophagy cascade against coronavirus infection. Front Microbiol. 2021;66:12.

    Article  Google Scholar 

Download references


The authors acknowledge Matthew T. Rondina and Robert A. Campbell from the department of internal medicine, University of Utah, Salt Lake City, Utah, USA, for their contribution and suggestions that improved the work substantially. The statements made here are solely the responsibility of the authors.


No external funding was utilized for this work.

Author information




AA, AM: conceptualization; AA, AM: analyzed results and made the figures; AA: drafted the manuscript; AA, AM, KM, FA, FK: edited the manuscript and approved the submission. AA and AM contributed equally to this manuscript. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Ahmed B. Alarabi or Fadi T. Khasawneh.

Ethics declarations

Ethics approval and consent to participate

(Not applicable) The data used are publicly available in NCBI BioProject repository, therefore no administrative permission is required to access the raw data. as well as, ethical approval and consent to participate are not applicable. The data is stored in NCBI Bioproject SRA (Sequence Read Archive), therefore, it is controlled by NIH GDS (National Institute of Health Genomic Data Sharing) policy which mandates that data should be anonymized prior to upload to the SRA repository. The original data were produced by experiments carried out in accordance with the Declaration of Helsinki according to the original published papers [7, 26].

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1. Figs. S1–S3


Additional file 2. Table S1

: WGCNA eigengenes.

Additional file 3. Table S2

: WGCNA modules.

Additional file 4. Table S3

: Gene significance.

Additional file 5. Table S4

: Intramodular connectivity.

Additional file 6. Table S5

: Hubgenes cross check with Disgenet database.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Alarabi, A.B., Mohsen, A., Mizuguchi, K. et al. Co-expression analysis to identify key modules and hub genes associated with COVID-19 in platelets. BMC Med Genomics 15, 83 (2022).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Platelets
  • COVID-19