- Research
- Open access
- Published:
Characterization of genome-wide association study data reveals spatiotemporal heterogeneity of mental disorders
BMC Medical Genomics volume 13, Article number: 192 (2020)
Abstract
Background
Psychiatric disorders such as schizophrenia (SCZ), bipolar disorder (BIP), major depressive disorder (MDD), attention deficit-hyperactivity disorder (ADHD), and autism spectrum disorder (ASD) are often related to brain development. Both shared and unique biological and neurodevelopmental processes have been reported to be involved in these disorders.
Methods
In this work, we developed an integrative analysis framework to seek for the sensitive spatiotemporal point during brain development underlying each disorder. Specifically, we first identified spatiotemporal gene co-expression modules for four brain regions three developmental stages (prenatal, birth to 11 years old, and older than 13 years), totaling 12 spatiotemporal sites. By integrating GWAS summary statistics and the spatiotemporal co-expression modules, we characterized the risk genes and their co-expression partners for five disorders.
Results
We found that SCZ and BIP, ASD and ADHD tend to cluster with each other and keep a distance from other psychiatric disorders. At the gene level, we identified several genes that were shared among the most significant modules, such as CTNNB1 and LNX1, and a hub gene, ATF2, in multiple modules. Moreover, we pinpointed two spatiotemporal points in the prenatal stage with active expression activities and highlighted one postnatal point for BIP. Further functional analysis of the disorder-related module highlighted the apoptotic signaling pathway for ASD and the immune-related and cell-cell adhesion function for SCZ, respectively.
Conclusion
Our study demonstrated the dynamic changes of disorder-related genes at the network level, shedding light on the spatiotemporal regulation during brain development.
Background
Mental disorders are leading causes of disability and comprise a substantial financial burden on the economy. It is estimated that one out of every four American adults suffers from a mental disorder in any given year [1]. Epidemiological evidence has revealed that the experiences in the prenatal and early childhood periods are related to later wellness [2]. This period is characterized by rapid and highly dynamic processes unfolding in space and time, which will have a lasting impact on health, learning, and behavior throughout one’s whole life [3]. Common psychiatric disorders, such as schizophrenia (SCZ), bipolar disorder (BIP), major depressive disorder (MDD), attention deficit hyperactivity disorder (ADHD), and autism spectrum disorder (ASD) have been proved to have high inheritability by twins or family studies [4,5,6,7]. Many genome-wide association studies (GWAS) have been conducted for mental disorders to reveal the common genetic risk loci in population [5, 7,8,9]. These GWA studies discovered hundreds of loci significantly associated with these disorders. However, interpretation and fine mapping of GWAS loci remain a major challenge in the post-GWAS era.
Previous studies have shown many spatiotemporal features of these five mental disorders. For instance, the major brain regions affected in SCZ included the prefrontal cortex, the basal ganglia, and the limbic system [10,11,12]. SCZ related genes tended to be highly expressed during prenatal development [13]. BIP was found to be related to the amygdala, hippocampus, and prefrontal cortex region [14, 15]. Both SCZ and BIP patients have gray matter reductions in paralimbic regions (anterior cingulate and insula), the function of which is emotional processing [15]. MDD patients have been observed to have significantly lower hippocampal volumes comparing the brain to the normal controls’ hippocampal volumes [1]. ADHD has a prevalence of 5.3% in childhood (younger than 18 years old) [16]. Two-thirds of patients with an ADHD diagnosis in childhood will continue to have impairing symptoms throughout their lives [17]. Subcortical structure volume especially the size of the amygdala, smaller volumes of caudate, cerebellum, and frontal and temporal gray matter have been associated with greater symptom severity [18,19,20]. Lastly, brain volume overgrowth was linked to ASD [21]. Patterns of gene expression distinguishing frontal and temporal cortex could be observed in the brains of autism patients [22].
With these lines of prior knowledge, we expect to bridge the molecular evidence to the features of each disorder. In the previous study conducted by Psychiatric Genomic Consortium (PGC) Cross-Disorder Group [7], the authors have identified four genetic variants shared in the five mental disorders. Inspired by this work, we aimed systematically characterize the spatiotemporal expression features of disorder-associated genes for five mental disorders utilizing the BrainSpan (Atlas of the developing human brain) expression data with the temporal and spatial transcriptome dynamic changes for more than 16 developing brain tissues aging from 4 post-conceptual weeks (pcw) (prenatal) to 60+ year old [23]. We aimed to pinpoint the shared and unique genetic factors of these five common psychiatric disorders in specific spatiotemporal points critical to brain development.
Methods
GWAS summary results for five psychiatric disorders
GWAS summary statistics were downloaded from Psychiatric Genomic Consortium (PGC) Cross-Disorder Group for each of the five disorders [7]. All patients were of European ancestry and were diagnosed as each primary disorder of interest according to the criteria from the DSM third edition revised or fourth edition. Specifically, there are 4788 trio cases, 4788 trio pseudocontrols, 161 cases, 526 controls for autism spectrum disorder (ASD); 1947 trio cases, 1947 trio pseudocontrols, 840 cases, and 688 controls in attention deficit-hyperactivity disorder (ADHD); 6990 cases and 4820 controls in bipolar disorder (BIP); 9227 cases and 7383 controls in major depressive disorder (MDD); and 9379 case and 7736 controls in schizophrenia (SCZ). All individuals are of European ancestry and are diagnosed with corresponding criteria. There are 1.2 million SNPs in total after imputation on CEU + TSI Hapmap Phase 3 reference and only those SNPs with imputation quality (INFO > 0.4) were used for further analysis.
Gene-based p-values from VEGAS
We used liftOver to convert the GWAS SNPs from hg18 to hg19 [24]. The updated list of SNPs was used to calculate gene-based p-values using Versatile Gene-based Association Study (VEGAS) (version 2) [25]. VEGAS considers multiple SNPs mapped to a gene and calculates an empirical p-value to estimate the association after correcting for linkage disequilibrium (LD) structures. For each gene, we considered the SNPs mapped to the gene body or its 50 kb flanking region. We used the European population from the 1000 Genomes Project as the reference panel to estimate LD.
Brain expression data
Spatiotemporal gene expression data were downloaded from the BrainSpan Atlas [23]. Following previous works (Table S5) [26], we split the samples into 12 categories based on their distinctive spatial and temporal features, ranging in four brain regions and three developmental periods. The regions are frontal cortex (FC), sensory motor regions (SM), sub-cortical regions (SC), and temporal-parietal cortex (TP). The stages are stage 1 (prenatal), stage 2 (after birth to 11 years old), and stage 3 (older than 13 years). We considered a gene that was expressed if its RPKM (Reads Per Kilobase of transcript per Million mapped reads) value was greater than one in at least one sample at each spatiotemporal point.
PPI and CoPPI networks
We built the reference human protein-protein interaction (PPI) network by combining data from the Human Protein Reference Database [27] and the STRING database [28]. After removing self-interactions and isolated nodes, the final PPI network included 10,314 nodes (i.e., proteins) and 51,637 edges (i.e., interactions). A CoPPI is defined as an edge-weighted PPI network, in which each edge is weighted by the co-expression of the two nodes using the expression data generated in each specific spatiotemporal site. The absolute value of the Pearson Correlation Coefficient (PCC) was used to measure the co-expression level between a pair of nodes. We removed those edges involving unexpressed nodes from the network.
Determination of co-expression modules
We modified the Dense Module Search (DMS) algorithm developed in our previous works [29,30,31]. Briefly, we defined a module score as the average edge weight, i.e., \(Em=\frac{\sum {e}_{PCC}}{\#\text{edges}}\), where ePCC indicated the absolute PCC value for each edge in the module. We started with edges whose ePCC was ≥0.5 and expanded the module by always including the best edge connected to the current module, until no surrounding edge could improve the module score. With such a design, all the resultant modules had a module score > 0.5 and all their component edges had ePCC ≥ 0.5.
Determination of disorder-specific co-expression modules
Gene-based p-values from VEGAS were mapped to their respective genes in each significant co-expression module per spatiotemporal point. A module Z-score was calculated for each co-expression module for each disorder by \(Zm=\frac{\sum Gw}{\sqrt{\# genes}}\), where Gw = Φ−1(1 − pg) is the gene-based score computed from the probit function of the VEGAS p-values [32]. The modules with a larger z-score indicate there are more genetic implications from the disorder in these modules. These module scores were then normalized by \(Zn=\frac{Zm- mean(Zm)}{sd(Zm)}\). Zn was used for the following analysis.
Functional enrichment analysis
We used R package DAVID to conduct functional enrichment analysis for gene ontology biological process [33]. Briefly, we conducted a Bonferroni correction to adjust the multiple-testing for the 1658 gene ontology biological process terms, five mental disorders, and 12 spatiotemporal points. Thus, the significant raw p-value threshold is (0.05/1658/5/12 ~ 5.0 × 10− 7). We further performed the same DAVID functional enrichment analysis for non-MHC modules genes. The MHC genes are defined as the 548 genes within the 8 M MHC high linkage disequilibrium (LD) region (chr6: 25500000–33,500,000) on hg19 reference. All the codes were performed on R version 3.5.2.
Results
Overview of design and results
The outline of our work was illustrated in Fig. 1. Starting with a curated reference human PPI network, we overlaid gene co-expression relationships to each PPI pair for each temporal and spatial point, resulting in 12 CoPPI networks. We calculated PCC to measure the co-expression relationships among genes. The detailed classification and sample sizes of each spatiotemporal sites are presented in Table 1. We then constructed co-expression modules using our dense module searching (DMS) algorithm [29,30,31]. To identify highly co-expressed modules, we required all edges in a module to have an absolute PCC > 0.5 in the corresponding spatiotemporal site. We then overlaid gene-based z-scores calculated from GWAS summary statistics onto each spatiotemporal points and ranked modules according to the combined effect of co-expression and genetic associations. Notably, co-expression modules were identified for each spatiotemporal point, regardless of disorder data, and were comparable among disorders, whereas co-expression modules varied among spatiotemporal points.
Spatiotemporal co-expression modules in human brain development
The modules had an average of 4.22 nodes (range: 4–8) and 3.73 edges (range: 3–10). A total of 20,043 modules were identified for the 12 points. The SM region in the fetus had the most number (1992) of co-expression modules and the TP region in stage 3 had the least number (1416) of co-expression modules (Table 1). By comparing the module numbers in each brain region across different stages, we observed that ST1 had the highest number than the other two stages across different brain regions, indicating that ST1 was likely the most active stage in brain transcriptional activity.
Identification of disorder-specific spatiotemporal modules
We next overlaid the gene-based z-scores (transformed from VEGAS p-values) onto the co-expression modules and ranked modules for each spatiotemporal site and each disorder. In this way, the module structure remained the same at each spatiotemporal point for all five disorders, whereas the modules were re-ordered according to their disorder associations. We explored the disorder correlations using the co-expression modules at each spatiotemporal point. Because the module list remained the same but only the module scores differed in each disorder, this analysis assessed the disorder correlation at the module level. As shown in Fig. 2, in 8 out of 12 points, SCZ and BIP formed a unique cluster distinct from the other three disorders, and in 10 out of 12 (83%) points, the two traits were clustered together. This remained true whether we used all modules or parts of the modules (e.g., the most 25% or the most 50% variable modules across all spatiotemporal sites) for the clustering analysis. This is consistent with previous studies that SCZ and BIP shared common polygenic variations [4, 14, 34,35,36]. More interestingly, we also observed ASD and ADHD formed in the same cluster away from the other three disorders in 7 out of 12 points and clustered together in 11 out of 12 (92%) points, indicating ASD and ADHD share more genetic background than the other three adult-onset disorders.
Next, we checked the mean module z-score to explore the spatiotemporal points that have relatively high disorder effects from the GWAS signal (Fig. 3). We could observe ADHD ST2-SM have a higher mean z-score than the other two stages. ST2 in ASD shows a higher mean z-score than the other two stages in FC, SC, and TP, indicating the GWAS risk genes of ASD have a strong disturbance in ST2 (after birth to 11 years old). For BIP, ST3 (older than 13 years) was found to have higher z-scores in FC. ST3-FC has the highest z-score in MDD. ST1 (prenatal) in SCZ demonstrates the highest mean z-score in three brain developmental stages in SC, SM, and TP, suggesting SCZ GWAS risk genes have a strong effect in this stage across these three brain regions. Consistent findings could be observed by comparing the relative proportion of modules with Zm > 1.96 to the total modules within 12 spatiotemporal points for each disorder (Additional file 1). We further normalized these module scores using standard normalization (see Methods). After normalization, we identified the number of modules in each disorder across all points with a module score (Zn) > 1.96 as significant modules and found that each disorder contained the significant modules ranging from 27 (96 unique genes in the modules) in ADHD (stage: ST2-SC) to 64 (196 genes) in ASD (ST1-SM) across all spatiotemporal points (Table 2).
Weak overlap of modules and genes across the five disorders
We identified the overlap between the significantly identified modules across all points for the five disorders (Additional file 2A). We found that no modules overlapped between all five disorders at any spatiotemporal point. Only one module “GRB2, LNX1, MAPK9, MUSTN1” was found to be shared by four disorders (ASD, ADHD, SCZ, and BIP) in both ST3-SM and ST3-TP (Additional file 3, Table 3) [7]. Even though we observed a weak overlap in the specific modules across disorders, the genes contained in each module may overlap among the disorders. Therefore, we extracted all genes from all significant modules and determined their overlap in all points across all disorders (Table 4 & Additional file 2B). The most (196) and least (96) unique genes were extracted from ASD (ST1-SM) and ADHD (ST2-SC), respectively. We only observed two instances of a gene that was shared in all disorders. The first was for the ST2-SC (CTNNB1) and the second was for ST3-TP (LNX1), indicating these two genes might be involved in the pathogenesis of five mental disorders during these two spatiotemporal points. CTNNB1 (Cadherin-Associated Protein Beta 1) has been proved to be related to abnormal brain development [37,38,39,40]. LNX1 (Ligand of numb-protein X 1) is an E3 ubiquitin ligase for proteasomal degradation for NUMB protein, which is a key regulator of neurogenesis and neuronal differentiation [41]. Knockout LNX1 and LNX2 mice exhibited decreased anxiety-related behavior, though the mechanisms remained unknown [42]. The raw p-values of these two genes across five mental disorders were insignificant (p-value > 0.0001, Table 3), indicating that the risk genes in each disorder might have their mechanism influencing the co-expression of these two vital brain-development genes during certain spatiotemporal points. In sum, the results that most of the genes found in each disorder stayed unique to that disorder suggested unique genetic signatures for each disorder rather than shared.
Functional annotation of significant modules
To identify the biological roles of the genes in the significant modules, we performed functional enrichment analysis using DAVID (See Methods). We combined the genes from all significant modules for each point to find enriched pathways for each disorder illustrated in Fig. 4. In the enrichment study of GO terms for ADHD, we discovered genes were more likely to be enriched in proteasome-mediated ubiquitin-dependent protein catabolic process (GO:0043161) in the ST3-SM (praw = 4.0 × 10− 10) and ST2-TP (praw = 3.3 × 10− 7) regions. Regulation of transcription related functions were also found in ST1-SC (positive regulation of transcription, DNA-templated GO:0045893 praw = 1.0 × 10− 9). (Fig. 4a). For ASD, we discovered 3 significant terms in the ST1-SM (GO:0043066 negative regulation of apoptotic process praw = 1.7 × 10− 7, GO:0008284 positive regulation of cell proliferation praw = 2.5 × 10− 7) and ST2-SC (GO:0097193 intrinsic apoptotic signaling pathway praw = 2.6 × 10− 7) in early brain development, which were related to apoptotic and cell proliferation [21] (Fig. 4b). For BIP, the top significant terms in ST2-FC and ST2-SC were negative regulation of transcription from RNA polymerase II promoter (GO:0000122 praw = 1.0 × 10− 7) and Wnt signaling pathway, planar cell polarity pathway (GO:0060071 praw = 1.0 × 10− 6), respectively (Fig. 4c). MDD module genes were found enriched in ST3-SC and ST1-TP for immune-related pathways (viral process GO:0016032 praw = 1.0 × 10− 8, stimulatory C-type lectin receptor signaling pathway GO:0002223 praw = 2.5 × 10− 12, and antigen processing and presentation of exogenous peptide antigen via MHC class I, TAP-dependent GO:0002479 praw = 1.8 × 10− 9), suggesting the immune disturbance in brain TP and SC region could be the underlying etiology of MDD (Fig. 4d). Last but not the least, for SCZ, diverse Bonferroni-correction significant terms were found, e.g., the positive/negative ubiquitin-protein ligase activity in regulation of mitotic cell cycle in ST2-FC (GO:0051437 praw = 1.0 × 10− 8 and GO:0051436 praw = 1.0 × 10− 7); antigen processing and presentation of exogenous peptide antigen via MHC class I, TAP-dependent (GO:0002479) for ST1-SM (praw = 4.0 × 10− 7) and ST2-TP (praw = 1.7 × 10− 7), respectively; viral process (GO:0016032 praw = 3.2 × 10− 7) in ST1-FC, and cell-cell adhesion (GO:0007155 praw = 1.6 × 10− 7) in ST1-SC (Fig. 4e).
Discussion
The determination of the biological basis for psychiatric disorders is important in terms of patient intervention and the potential basis for treatment options. In this study, we used a network approach to identify genes and their biological mechanisms underlying five psychiatric disorders: ADHD, ASD, BIP, MDD, and SCZ. By taking advantage of the comprehensive BrainSpan data with temporal and spatial gene expression profiles, we identified significant co-expression modules and interrogated their potential functions. Specifically, we pinpointed several spatiotemporal points that genetic disturbance of gene interaction networks might increase the risk of the onset of each psychiatric disorder during brain development. Our observations also suggested that the majority of genetic predisposition to these disorders was unique to each disorder, although shared genes were identified as well.
We found that SCZ and BIP were closely clustered in 10 out of the 12 investigated spatiotemporal sites, while ASD appeared to be distantly related to the other four disorders. We identified that neurodevelopmental ST1 has the most co-expression modules than other stages across four main brain regions, indicating the prenatal stage is the ‘busiest’ stage during brain development. Surprisingly, we only observed one nominal significant module composed of four genes “GRB2, LNX1, MAPK9, MUSTN1” shared by four disorders (ASD, ADHD, SCZ, and BIP) in both ST3-SM and ST3-TP. The gene MUSTN1 has the most significant gene-level p-value (Table 3), which is nearby the genome-wide significant signal (rs2535629) from the previous meta-analysis of these five disorders [7]. A limited number of modules were shared in multiple disorders in each site, implying a much more complicated relationship among these disorders at the pathway/network level. At the gene level, we identified several genes that were shared among the most significant modules. Example genes included CTNNB1, a Wnt signaling gene; LNX1, an E3 ubiquitin-protein ligase; and a transcriptional factor ATF2. Genes with both strong associations and moderate/weak associations were found to interact with each other and form modules that led to the development of disorders.
CTNNB1 is a fundamental component of the canonical Wnt signaling pathways and controls cell growth and cell adhesion [43, 44]. Dysregulation of CTNNB1 leads to abnormal brain development and defective dendritic morphogenesis [37,38,39,40]. Mutation in CTNNB1 could to neurodevelopmental disorder [45]. In our results, CTNNB1 was found in significant co-expression modules in all five disorders at ST2 in the SC region of the brain. Raw p-values of CTNNB1 were not significant in VEGAS results (ADHD:0.029; ASD:0.046; BIP:0.0081, MDD:0.00081; SCZ: 0.11). CTNNB1 was also found to be the hub node in several spatiotemporal points of ADHD, ASD, and BIP (Table 4 and Fig. 5), suggesting that it might play important roles in these spatiotemporal points of development. More interestingly, CTNNB1 was found to be coexpressed with HDAC4 and CACNA1C in the top modules in BIP ST2-FC spatiotemporal point (Fig. 5c). While the gene CACNA1C is also the genome-wide significant loci (rs1024582) shared among these five major psychiatric disorders in previous PGC cross-disorder work [7]. The gene LNX1 was found in the top modules in the TP region ST3 (Table 4). LNX1 was an insignificant gene based on the GWAS results (ADHD:0.0063; ASD:0.012; BIP:0.0011, MDD:0.0045; SCZ: 0.0082) (Table 3). LNX1 was found to be involved in regulating the protein NUMB, which determines cell fates during development. Also, LNX1 was found to have interactions with presynaptic proteins ERC1, ERC2, and LIPRIN-αs (PPFIA1, PPFIA3), as well as the F-BAR domain proteins FCHSD2 (nervous wreck homolog) and SRGAP2 [42]. ATF2 was found to be the hub node in 15 out of 60 disorder spatiotemporal points in the top 10 significant modules (Tables 3 and 4). This gene was a transcriptional activator that regulates the transcription of various genes involved in anti-apoptosis, cell growth, and DNA damage response. According to the gene expression during development in SZGR2 database [46], ATF2 has higher expression before born than after born in the brain region, suggesting this gene was involved in regulating the fetus stage of brain development. None of the three genes (LNX1, CTNNB1, and ATF2) was significantly based on the GWAS results of the five disorders (Table 3). They were discovered by our approach mainly because these genes interact with other genes and jointly formed significant modules.
As shown in Table 2, all five mental disorders but BIP were found to have the largest amount of module Zn in Stage 1 (prenatal stage). Interestingly, recent studies also revealed that psychiatric disorders relevant genes tend to be highly expressed in prenatal than postnatal stages [47]. Consistent with the number of significant co-expression modules in Table 1, we found that ST1-FC and ST1-SM tend to have the largest numbers of disorder-related modules, suggesting these two spatiotemporal points are the most curial stages and brain regions underlying these five mental disorders. Overlapping with our functional enrichment analysis result (Fig. 4), we highlighted negative regulation of the apoptotic process (GO:0043066), positive regulation of cell proliferation (GO:0008284) and positive regulation of transcription, DNA-templated (GO:0045893) in ST1-SM for ASD. These findings were aligned with the programmed cell death during neural development, suggesting spatiotemporal, quantitative errors raised by internal or external stimuli would lead to an abnormal number of neurons and pathological neural connections [21, 48, 49]. The viral process (GO:0016032) in ST1-FC and antigen process and presentation of exogenous peptide antigen via MHC class I, TAP-dependent (GO:0002479) in ST1-SM for SCZ. Recently, SCZ has been correlated to the dysregulation in prenatal brain development and immune response function [50,51,52].
Although we failed to identify any significantly enriched functions for ADHD, BIP, MDD in these two spatiotemporal points (ST1-FC and ST1-SM), we still observed several significant terms in other spatiotemporal points strongly supported by many known observations and studies. ADHD is featured with volume changes of subcortical structure, especially the size of amygdala, smaller volumes of caudate, cerebellum, and frontal and temporal gray matter. We identified the ubiquitin process ADHD is related to in ST3-SM and ST2-TP. Previous studies have shown BIP was associated with the abnormalities in the SC and FC regions [14, 15]. We identified two top terms, negative regulation of transcription from RNA polymerase II promoter and Wnt signaling pathway in ST2-FC and ST2-SC, respectively (Fig. 4c). MDD has high comorbidity (20–55%) with mesial temporal lobe epilepsy (MTLE) [53], which is associated with TP and SC regions [26]. Strikingly, multiple immune-related terms are highly enriched in these two regions among three stages (Fig. 4d), indicating that immune disruption in these regions during brain developments might lead to MDD and its co-occurring disorders.
Due to the complex LD structure in the major histocompatibility complex (MHC) region, we also conducted a supplementary analysis for those top modules excluding those MHC genes (Methods, Additional file 4). Briefly, we found most immune-related functions are not in the top three GOBP terms, except for antigen processing and presentation functions in BIP ST2-TP point, indicating BIP might be related to an immune-associated mechanism outside the MHC region. Interestingly, positive regulation of neuron death (GO:1901216 praw = 6.0 × 10− 6) was highlighted in the key point ST2-FC for BIP.
Lastly, some of the psychiatric disorders could be differentiated by their symptom patterns and course of illness, e.g. SCZ, BIP, and MDD. However, due to the stage and degree of disorder and shared underlying genetic risk factors, it is difficult to define a clear boundary for phenotyping the psychiatric disorders, such as ASD and ADHD [7], which also leads to the different statistical powers for different psychiatric GWAS and eventually hinders our comparison across disorders. Thus, we designed to explore the top 10 genetically impacted modules for each disorder in each spatiotemporal point. However, we also provided lists of genes for each disorder in each of the 12 spatiotemporal points with significant module z-score after multiple-test correction (Additional file 1).
Conclusion
In this work, we developed a network-based module approach to investigate the cumulative impact of disorder-associated genes in different brain developmental stages across different brain regions. We pinpointed two known genetic risk factors (rs2535629 and rs1024582) in our spatiotemporal co-expression network and highlighted several hub genes, e.g., CTNNB1 and LNX1, which likely played crucial regulatory roles in these disorders. Our results recapitalized the dynamic correlations among the five mental disorders and highlighted brain regions and developmental stages underlying disorder co-expressed modules and genes. For instance, the genes from ASD and SCZ modules are significantly enriched in the apoptotic signaling pathway in ST1-SM;immune-related and cell-cell adhesion function for SCZ are enriched in ST1-FC/SM and ST1-SC, respectively. Overall, our investigation of the developmental brain provides new understandings underlying the etiology of these five mental disorders.
Availability of data and materials
The Psychiatric Genomics Consortium Cross-Disorder Group data is available through the request from their website (https://www.med.unc.edu/pgc/pgc-workgroups/cross-disorder-group/). The BrainSpan Atlas expression profiles are available from their website (https://www.brainspan.org/static/download.html). All the datasets used and/or analyzed during the current study are available from the resources described in the Methods part.
Abbreviations
- ADHD:
-
Attention deficit hyperactivity disorder
- ASD:
-
Autism spectrum disorder
- BIP:
-
Bipolar disorder
- MDD:
-
Major depressive disorder
- SCZ:
-
Schizophrenia
- FC:
-
Frontal cortex region
- SM:
-
Sensory motor region
- SC:
-
Sub-cortical region
- TP:
-
Temporal-parietal cortex region
- ST1:
-
Stage 1
- ST2:
-
Stage 2
- ST3:
-
Stage 3
- GWAS:
-
Genome-wide association studies
- PGC:
-
Psychiatric Genomic Consortium
- pcw:
-
Post-conceptual weeks
- VEGAS:
-
Versatile Gene-based Association Study
- LD:
-
Linkage disequilibrium
- PPI:
-
Protein-protein interaction
- DMS:
-
Dense module search
References
Murray CJ, Vos T, Lozano R, Naghavi M, Flaxman AD, Michaud C, Ezzati M, Shibuya K, Salomon JA, Abdalla S, et al. Disability-adjusted life years (DALYs) for 291 diseases and injuries in 21 regions, 1990-2010: a systematic analysis for the global burden of disease study 2010. Lancet. 2012;380:2197–223.
Mustard JF. Brain development, child development - adult health and well-being and paediatrics. Paediatr Child Health. 1999;4:519–20.
Spenrath MA, Clarke ME, Kutcher S. The science of brain and biological development: implications for mental health research, practice and policy. J Can Acad Child Adolesc Psychiatry. 2011;20:298–304.
Lichtenstein P, Yip BH, Bjork C, Pawitan Y, Cannon TD, Sullivan PF, Hultman CM. Common genetic determinants of schizophrenia and bipolar disorder in Swedish families: a population-based study. Lancet. 2009;373:234–9.
Muhleisen TW, Leber M, Schulze TG, Strohmaier J, Degenhardt F, Treutlein J, Mattheisen M, Forstner AJ, Schumacher J, Breuer R, et al. Genome-wide association study reveals two new risk loci for bipolar disorder. Nat Commun. 2014;5:3339.
Nothen MM, Nieratschker V, Cichon S, Rietschel M. New findings in the genetics of major psychoses. Dialogues Clin Neurosci. 2010;12:85–93.
Cross-Disorder Group of the Psychiatric Genomics C. Identification of risk loci with shared effects on five major psychiatric disorders: a genome-wide analysis. Lancet. 2013;381:1371–9.
Schizophrenia Psychiatric Genome-Wide Association Study C. Genome-wide association study identifies five new schizophrenia loci. Nat Genet. 2011;43:969–76.
Major Depressive Disorder Working Group of the Psychiatric GC, Ripke S, Wray NR, Lewis CM, Hamilton SP, Weissman MM, Breen G, Byrne EM, Blackwood DH, Boomsma DI, et al. A mega-analysis of genome-wide association studies for major depressive disorder. Mol Psychiatry. 2013;18:497–511.
Yoon JH, Minzenberg MJ, Raouf S, D'Esposito M, Carter CS. Impaired prefrontal-basal ganglia functional connectivity and substantia nigra hyperactivity in schizophrenia. Biol Psychiatry. 2013;74:122–9.
Vai B, Sferrazza Papa G, Poletti S, Radaelli D, Donnici E, Bollettini I, Falini A, Cavallaro R, Smeraldi E, Benedetti F. Abnormal cortico-limbic connectivity during emotional processing correlates with symptom severity in schizophrenia. Eur Psychiatry. 2015;30:590–7.
Schmaal L, Veltman DJ, van Erp TG, Samann PG, Frodl T, Jahanshad N, Loehrer E, Tiemeier H, Hofman A, Niessen WJ, et al. Subcortical brain alterations in major depressive disorder: findings from the ENIGMA major depressive disorder working group. Mol Psychiatry. 2016;21:806–12.
Gilman SR, Chang J, Xu B, Bawa TS, Gogos JA, Karayiorgou M, Vitkup D. Diverse types of genetic variation converge on functional gene networks involved in schizophrenia. Nat Neurosci. 2012;15:1723–8.
Chai XJ, Whitfield-Gabrieli S, Shinn AK, Gabrieli JD, Nieto Castanon A, McCarthy JM, Cohen BM, Ongur D. Abnormal medial prefrontal cortex resting-state connectivity in bipolar disorder and schizophrenia. Neuropsychopharmacology. 2011;36:2009–17.
Ellison-Wright I, Bullmore E. Anatomy of bipolar disorder and schizophrenia: a meta-analysis. Schizophr Res. 2010;117:1–12.
Polanczyk G, de Lima MS, Horta BL, Biederman J, Rohde LA. The worldwide prevalence of ADHD: a systematic review and metaregression analysis. Am J Psychiatry. 2007;164:942–8.
Faraone SV, Biederman J, Mick E. The age-dependent decline of attention deficit hyperactivity disorder: a meta-analysis of follow-up studies. Psychol Med. 2006;36:159–65.
Castellanos FX, Lee PP, Sharp W, Jeffries NO, Greenstein DK, Clasen LS, Blumenthal JD, James RS, Ebens CL, Walter JM, et al. Developmental trajectories of brain volume abnormalities in children and adolescents with attention-deficit/hyperactivity disorder. JAMA. 2002;288:1740–8.
Qiu A, Crocetti D, Adler M, Mahone EM, Denckla MB, Miller MI, Mostofsky SH. Basal ganglia volume and shape in children with attention deficit hyperactivity disorder. Am J Psychiatry. 2009;166:74–82.
Hoogman M, Bralten J, Hibar DP, Mennes M, Zwiers MP, Schweren LSJ, van Hulzen KJE, Medland SE, Shumskaya E, Jahanshad N, et al. Subcortical brain volume differences in participants with attention deficit hyperactivity disorder in children and adults: a cross-sectional mega-analysis. Lancet Psychiatry. 2017;4:310–9.
Hazlett HC, Gu H, Munsell BC, Kim SH, Styner M, Wolff JJ, Elison JT, Swanson MR, Zhu H, Botteron KN, et al. Early brain development in infants at high risk for autism spectrum disorder. Nature. 2017;542:348–51.
Voineagu I, Wang X, Johnston P, Lowe JK, Tian Y, Horvath S, Mill J, Cantor RM, Blencowe BJ, Geschwind DH. Transcriptomic analysis of autistic brain reveals convergent molecular pathology. Nature. 2011;474:380–4.
Miller JA, Ding SL, Sunkin SM, Smith KA, Ng L, Szafer A, Ebbert A, Riley ZL, Royall JJ, Aiona K, et al. Transcriptional landscape of the prenatal human brain. Nature. 2014;508:199–206.
Hinrichs AS, Karolchik D, Baertsch R, Barber GP, Bejerano G, Clawson H, Diekhans M, Furey TS, Harte RA, Hsu F, et al. The UCSC genome browser database: update 2006. Nucleic Acids Res. 2006;34:D590–8.
Mishra A, Macgregor S. VEGAS2: software for more flexible gene-based testing. Twin Res Hum Genet. 2015;18:86–91.
Gulsuner S, Walsh T, Watts AC, Lee MK, Thornton AM, Casadei S, Rippey C, Shahin H, Consortium on the Genetics of S, Group PS, et al. Spatial and temporal mapping of de novo mutations in schizophrenia to a fetal prefrontal cortical network. Cell. 2013;154:518–29.
Keshava Prasad TS, Goel R, Kandasamy K, Keerthikumar S, Kumar S, Mathivanan S, Telikicherla D, Raju R, Shafreen B, Venugopal A, et al. Human protein reference database--2009 update. Nucleic Acids Res. 2009;37:D767–72.
Szklarczyk D, Franceschini A, Kuhn M, Simonovic M, Roth A, Minguez P, Doerks T, Stark M, Muller J, Bork P, et al. The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored. Nucleic Acids Res. 2011;39:D561–8.
Jia P, Zheng S, Long J, Zheng W, Zhao Z. dmGWAS: dense module searching for genome-wide association studies in protein-protein interaction networks. Bioinformatics. 2011;27:95–102.
Wang Q, Yu H, Zhao Z, Jia P. EW_dmGWAS: edge-weighted dense module search for genome-wide association studies and gene expression profiles. Bioinformatics. 2015;31:2591–4.
Manuel AM, Dai Y, Freeman LA, Jia P, Zhao Z. Dense module searching for gene networks associated with multiple sclerosis. BMC Med Genet. 2020;13:48.
de Leeuw CA, Mooij JM, Heskes T, Posthuma D. MAGMA: generalized gene-set analysis of GWAS data. PLoS Comput Biol. 2015;11:e1004219.
Fresno C, Fernandez EA. RDAVIDWebService: a versatile R interface to DAVID. Bioinformatics. 2013;29:2810–1.
International Schizophrenia C, Purcell SM, Wray NR, Stone JL, Visscher PM, O'Donovan MC, Sullivan PF, Sklar P. Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature. 2009;460:748–52.
Pei G, Sun H, Dai Y, Liu X, Zhao Z, Jia P. Investigation of multi-trait associations using pathway-based analysis of GWAS summary statistics. BMC Genomics. 2019;20:79.
Jia P, Dai Y, Hu R, Pei G, Manuel AM, Zhao Z. TSEA-DB: a trait-tissue association map for human complex traits and diseases. Nucleic Acids Res. 2020;48:D1022–30.
Chenn A, Walsh CA. Regulation of cerebral cortical size by control of cell cycle exit in neural precursors. Science. 2002;297:365–9.
Yu X, Malenka RC. Beta-catenin is critical for dendritic morphogenesis. Nat Neurosci. 2003;6:1169–77.
Brault V, Moore R, Kutsch S, Ishibashi M, Rowitch DH, McMahon AP, Sommer L, Boussadia O, Kemler R. Inactivation of the beta-catenin gene by Wnt1-Cre-mediated deletion results in dramatic brain malformation and failure of craniofacial development. Development. 2001;128:1253–64.
Tan ZJ, Peng Y, Song HL, Zheng JJ, Yu X. N-cadherin-dependent neuron-neuron interaction is required for the maintenance of activity-induced dendrite growth. Proc Natl Acad Sci U S A. 2010;107:9873–8.
Dho SE, French MB, Woods SA, McGlade CJ. Characterization of four mammalian numb protein isoforms. Identification of cytoplasmic and membrane-associated variants of the phosphotyrosine binding domain. J Biol Chem. 1999;274:33097–104.
Lenihan JA, Saha O, Heimer-McGinn V, Cryan JF, Feng G, Young PW. Decreased anxiety-related behaviour but apparently unperturbed NUMB function in ligand of NUMB protein-X (LNX) 1/2 double knockout mice. Mol Neurobiol. 2017;54:8090–109.
Dong F, Jiang J, McSweeney C, Zou D, Liu L, Mao Y. Deletion of CTNNB1 in inhibitory circuitry contributes to autism-associated behavioral defects. Hum Mol Genet. 2016;25:2738–51.
Brembeck FH, Rosario M, Birchmeier W. Balancing cell adhesion and Wnt signaling, the key role of beta-catenin. Curr Opin Genet Dev. 2006;16:51–9.
Stessman HA, Turner TN, Eichler EE. Molecular subtyping and improved treatment of neurodevelopmental disease. Genome Med. 2016;8:22.
Jia P, Han G, Zhao J, Lu P, Zhao Z. SZGR 2.0: a one-stop shop of schizophrenia candidate genes. Nucleic Acids Res. 2017;45:D915–24.
Sey NYA, Hu B, Mah W, Fauni H, McAfee JC, Rajarajan P, Brennand KJ, Akbarian S, Won H. A computational tool (H-MAGMA) for improved prediction of brain-disorder risk genes by incorporating brain chromatin interaction profiles. Nat Neurosci. 2020;23:583–93.
Wei H, Alberts I, Li X. The apoptotic perspective of autism. Int J Dev Neurosci. 2014;36:13–8.
Marchetto MC, Belinson H, Tian Y, Freitas BC, Fu C, Vadodaria K, Beltrao-Braga P, Trujillo CA, Mendes APD, Padmanabhan K, et al. Altered proliferation and networks in neural cells derived from idiopathic autistic individuals. Mol Psychiatry. 2017;22:820–35.
Volk DW, Lewis DA. Early developmental disturbances of cortical inhibitory neurons: contribution to cognitive deficits in schizophrenia. Schizophr Bull. 2014;40:952–7.
Volk DW, Chitrapu A, Edelson JR, Lewis DA. Chemokine receptors and cortical interneuron dysfunction in schizophrenia. Schizophr Res. 2015;167:12–7.
Jia P, Chen X, Fanous AH, Zhao Z. Convergent roles of de novo mutations and common variants in schizophrenia in tissue-specific and spatiotemporal co-expression network. Transl Psychiatry. 2018;8:105.
Nogueira MH, Pimentel da Silva LR, Vasques Moreira JC, de Rezende TJR, Zanao TA, de Campos BM, Yasuda CL, Cendes F. Major depressive disorder associated with reduced cortical thickness in women with temporal lobe epilepsy. Front Neurol. 2019;10:1398.
Acknowledgments
We thank Dr. Xueying Zhang for valuable discussion. We thank all members of the Bioinformatics and Systems Medicine Laboratory for their valuable help.
About this supplement
This article has been published as part of BMC Medical Genomics Volume 13 Supplement 11 2020: Data-driven analytics in biomedical genomics. The full contents of the supplement are available at https://bmcmedgenomics.biomedcentral.com/articles/supplements/volume-13-supplement-11.
Funding
Publication was funded by The Chair Professorship for Precision Medicine Funds to Z.Z. from the University of Texas Health Science Center at Houston.
This work was partially supported by the National Institutes of Health grant (R01LM012806 and R03DE027711). We thank the technical support from the Cancer Genomics Core funded by the Cancer Prevention and Research Institute of Texas (CPRIT RP170668 and RP180734). The funder had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Author information
Authors and Affiliations
Contributions
PJ and ZZ conceived the study. YD, PJ, and TO performed data analysis. YD, TO, and GP prepared the figures and tables. YD, TO, PJ, and ZZ wrote the manuscript. All authors read and approved the final manuscript.
Corresponding authors
Ethics declarations
Ethics approval and consent to participate
Not Applicable.
Consent for publication
Not Applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Additional file 1.
Bubble diagram for the proportion of the modules Zm > 1.96 in 12 spatiotemporal points. (A)(B)(C)(D)(E). Bubble diagram describing the proportion of the modules with Zm > 1.96 to the total modules within 12 spatiotemporal points for every 5 mental disorders, respectively. The bubble size represents the relative proportion of 12 spatiotemporal points of each disorder.
Additional file 2.
Venn diagram for genes from modules. (A) Venn diagram describing the overlapping modules of 12 spatiotemporal points (B) Venn diagram describing the overlapping genes from modules of 12 spatiotemporal points. Plots were generated by the online tool http://bioinformatics.psb.ugent.be/webtools/Venn/.
Additional file 3.
Gene lists merged from the statistically significant modules. Each sheet contains the gene list corresponding to the five psychiatric diseases. (A) ADHD: attention deficit hyperactivity disorder; (B) ASD: autism spectrum disorder; (C) BIP: bipolar disorder; (D) MDD: major depressive disorder; (E) SCZ: schizophrenia; FC: frontal cortex region, SM: sensory motor region, SC: sub-cortical region, TP: temporal-parietal cortex region, ST1: stage 1 (prenatal), ST2: stage 2 (after birth to 11 years old), ST3: stage 3 (older than 13 years)
Additional file 4.
GO enrichment for genes (non MHC genes) in the top 10 significant modules. Gene ontology term enrichment (biological process) analysis for five mental disorders in 12 spatiotemporal points. The top three GO terms were listed on the right for each spatiotemporal point in the order of “black”, “red”, and “blue”. Green dash indicated –log10 (p-value) after Bonferroni correction of all BP terms (2740). (A) ADHD: attention deficit hyperactivity disorder; (B) ASD: autism spectrum disorder; (C) BIP: bipolar disorder; (D) MDD: major depressive disorder; (E) SCZ: schizophrenia; FC: frontal cortex region, SM: sensory motor region, SC: sub-cortical region, TP: temporal-parietal cortex region, ST1: stage 1 (prenatal), ST2: stage 2 (after birth to 11 years old), ST3: stage 3 (older than 13 years)
Additional file 5.
Edges weights for the top 10 modules for five psychiatric diseases in 12 spatiotemporal points. The first two columns are gene symbols, the third column is the Pearson Correlation Coefficient is the edge weight, and the last column is the corresponding Disorder spatiotemporal point. (A) ADHD: attention deficit hyperactivity disorder; (B) ASD: autism spectrum disorder; (C) BIP: bipolar disorder; (D) MDD: major depressive disorder; (E) SCZ: schizophrenia; FC: frontal cortex region, SM: sensory motor region, SC: sub-cortical region, TP: temporal-parietal cortex region, ST1: stage 1 (prenatal), ST2: stage 2 (after birth to 11 years old), ST3: stage 3 (older than 13 years)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Dai, ., O’Brien, T.D., Pei, G. et al. Characterization of genome-wide association study data reveals spatiotemporal heterogeneity of mental disorders. BMC Med Genomics 13 (Suppl 11), 192 (2020). https://doi.org/10.1186/s12920-020-00832-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12920-020-00832-8