PD_NGSAtlas: a reference database combining next-generation sequencing epigenomic and transcriptomic data for psychiatric disorders
BMC Medical Genomics volume 7, Article number: 71 (2014)
Psychiatric disorders such as schizophrenia (SZ) and bipolar disorder (BP) are projected to lead the global disease burden within the next decade. Several lines of evidence suggest that epigenetic- or genetic-mediated dysfunction is frequently present in these disorders. To date, the inheritance patterns have been complicated by the problem of integrating epigenomic and transcriptomic factors that have yet to be elucidated. Therefore, there is a need to build a comprehensive database for storing epigenomic and transcriptomic data relating to psychiatric disorders.
We have developed the PD_NGSAtlas, which focuses on the efficient storage of epigenomic and transcriptomic data based on next-generation sequencing and on the quantitative analyses of epigenetic and transcriptional alterations involved in psychiatric disorders. The current release of the PD_NGSAtlas contains 43 DNA methylation profiles and 37 transcription profiles detected by MeDIP-Seq and RNA-Seq, respectively, in two distinct brain regions and peripheral blood of SZ, BP and non-psychiatric controls. In addition to these data that were generated in-house, we have included, and will continue to include, published DNA methylation and gene expression data from other research groups, with a focus on psychiatric disorders. A flexible query engine has been developed for the acquisition of methylation profiles and transcription profiles for specific genes or genomic regions of interest in the selected samples. Furthermore, the PD_NGSAtlas offers online tools for identifying aberrantly methylated and expressed events involved in psychiatric disorders. A genome browser has been developed to provide integrative and detailed views of multidimensional data in a given genomic context, which can help researchers understand molecular mechanisms from epigenetic and transcriptional perspectives. Moreover, users can download the methylation and transcription data for further analyses.
The PD_NGSAtlas aims to provide storage of epigenomic and transcriptomic data as well as quantitative analyses of epigenetic and transcriptional alterations involved in psychiatric disorders. The PD_NGSAtlas will be a valuable data resource and will enable researchers to investigate the pathophysiology and aetiology of disease in detail. The database is available at http://bioinfo.hrbmu.edu.cn/pd_ngsatlas/.
Schizophrenia (SZ) and bipolar disorder (BP) are common and highly heritable psychiatric disorders that affect approximately 4% of the world’s population and result in considerable personal and societal burdens. Over the past decades, it has been widely accepted that both genetic and environmental risk factors lead to the occurrence and development of these disorders. Moreover, a large number of genetic association and linkage studies have been performed to explore the pathogenesis of SZ and BP. However, the results do not replicate well, and they identify risk alleles with small effects, indicating that non-genetic factors may also result in disease. Recent studies have highlighted a role for epigenetic processes in mediating susceptibility, and have provided new insight into disease pathogenesis.
DNA methylation, which consists of the addition of a methyl group to the 5’-position of cytosine in CpG dinucleotides, is an important epigenetic modification involved in the regulation of transcription. DNA methylation has been shown to interfere with transcription by directly inhibiting the binding of transcription factors or enhancer-blocking elements, or by recruiting methyl-CpG binding proteins (MBPs) to affect chromatin structure. DNA methylation plays a crucial role in genomic imprinting, X chromosome inactivation and regulating tissue-specific gene expression. Accumulating evidence indicates that abnormal DNA methylation at particular locations may affect neuronal activity, brain growth and development, learning and memory, and cognitive performance, and is associated with the pathophysiology of psychiatric disorders. Initial studies focused on DNA methylation alterations in some candidate genes. Using cultured rat neurons, Chen et al. and Martinowich et al. showed the importance of DNA methylation in the regulation of brain-derived neurotrophic factor (BDNF), which is essential for neuronal survival, development and synaptic plasticity. Subsequently, the first genome-wide DNA methylation landscape, profiled by Mill et al., aimed to investigate DNA methylation changes associated with SZ and BP using CpG-island microarrays of approximately 12,000 GC-rich regions in the prefrontal cortex and in the germline. They found evidence for psychosis-associated DNA methylation differences in numerous loci involved in glutamatergic and GABAergic neurotransmission, brain development, and other processes functionally linked to disease aetiology. Because monozygotic (MZ) twins share common genetic information and can be used as an ideal model for investigating the contribution of epigenetic factors to disease aetiology, Dempster et al. performed a genome-wide analysis of DNA methylation in blood samples from MZ twin pairs discordant for major psychoses using microarrays, and they demonstrated disease-associated DNA methylation differences between twins.
Although epigenetic studies promote our understanding of psychiatric disorders, there have been few studies of methylation and gene expression on a genome-wide scale. Initial studies focused on DNA methylation alterations in candidate genes, including RELN, SOX10 and GAD67. DNA methylation of the reelin promoter was suggested to be involved in downregulating the gene in SZ, and the DNA methylation status of SOX10 inversely correlated with expression levels of SOX10 and other oligodendrocyte genes. Rapid advances in the development of next-generation sequencing (NGS) technology facilitate correspondingly dramatic advances in elucidating how epigenetic processes mediate gene expression and make it possible to integrate epigenomic and transcriptomic data to uncover the aetiology and pathophysiology of psychiatric disorders. Recently, we performed genome-wide methylation and expression analyses in two brain regions and in peripheral blood samples. Our results support the important roles of DNA methylation in SZ and BP and highlight the complex relationships between DNA methylation and gene expression in these disorders. In addition, the results indicate that the differentially expressed genes with aberrant methylation patterns that we identified may represent novel candidates for the aetiology and pathology of neuropsychiatric disorders.
To our knowledge, although a handful of DNA methylation databases have been compiled, they either contain limited methylation data or differ in biological scope. Among these methylation databases, NGSmethDB and MethBank were constructed to store genome-wide methylomes. However, MethBank only supports the storage, browsing and visualisation of whole-genome DNA methylation data in two well-studied species, D. rerio and M. musculus. In addition, NGSmethDB provides data sets for cell lines and for fresh and pathological tissues, but not for specific diseases. Several methylation databases centred on human diseases have also been compiled, including DiseaseMeth and the Cancer methylome system (CMS). DiseaseMeth is a web-based resource focused on the aberrant methylomes of human diseases. However, most of the datasets are microarray-based. CMS is a web-based database application that provides comprehensive and genome-wide epigenetic portraits of human breast cancer and endometrial cancer. However, there are few specialised and comprehensive databases of psychiatric disorders that focus on the storage of epigenomic data based on next-generation sequencing. MethylomeDB is the only database that presents methylation profiles of carefully selected non-psychiatric control, schizophrenia, and depression samples. However, the gene expression levels in these samples have not been profiled, and the database has not been updated for a long time. Thus, a reference database combining epigenomic and transcriptomic datasets is urgently needed for the combined analyses of the potential pathogenesis mechanisms of psychiatric disorders.
In this study, we developed the PD_NGSAtlas, which aims to store next-generation sequencing epigenomic and transcriptomic data captured from the same individuals and to perform quantitative analyses of epigenetic and transcriptional alterations involved in psychiatric disorders. The current version of the PD_NGSAtlas provides internal genome-wide DNA methylation and transcription profiles from two generally inaccessible brain regions and from accessible peripheral blood of SZ, BP and non-psychiatric disorder controls. The PD_NGSAtlas supports the search of methylation and transcription profiles for specific genes or genomic regions of selected samples, which should enable a broad range of researchers to explore the molecular mechanisms of psychiatric disorders (Additional file 1: Figure S1). All retrieved results can be downloaded freely for further analysis. Furthermore, the PD_NGSAtlas offers online tools for identifying aberrantly methylated and expressed genes involved in psychiatric disorders. The database also features a genome browser, which can be used to browse multidimensional data in a given genomic context. In summary, the PD_NGSAtlas is a user-friendly, web-based, ‘one-stop’ service for basic data retrieval, analyses, visualisation and downloading, which will help provide new insights into the aetiology of psychiatric disorders.
Construction and content
All of the subjects were diagnosed by consensus for either BP or SZ according to DSM-IV-TR criteria, and the control samples had no history of an Axis I disorder. Diverse clinical characteristics were also collected, including disease status, disease types, age, age of onset, sex and twin status (Additional file 2: Table S1). All the subjects in this study were free of confounding neuropathology. DNA and RNA samples were obtained from peripheral blood or from two distinct brain regions. DNA and RNA samples of peripheral blood were obtained from the Department of Psychiatry and Center of Excellence – Neurosciences, Texas Tech University Health Science Center (TTUHSC), whereas the post-mortem brain tissues were collected from the Southwest Brain Bank (SWBB), Department of Psychiatry, UTHSCSA, TX, USA. Written, informed consent was obtained from all the participants. All of the brain samples were from freshly frozen specimens that were stored in −80°C freezers. Brodmann area 9 (BA9) and BA24 from the same hemisphere were both used based on the criteria described by Rajkowska and Goldman-Rakic.
For all the samples stored in the PD_NGSAtlas, a tooltip was added that appears when hovering over a potential sample selection and lists its full parameters. Moreover, users can click on the sample item in the ‘Tools’ section to see its detailed clinical information that helps to better explore the nature of disease.
The current release of the PD_NGSAtlas contains 43 DNA methylation profiles detected using MeDIP-Seq by our laboratory. The extracted genomic DNA samples were fragmented into 100–500 bp pieces by sonication. DNA ends were repaired to leave a 3’-dA overhang, and adapters were ligated to the DNA fragment ends. The double-stranded DNA was denatured, and the DNA fragments were immunoprecipitated using a 5-mC antibody. Real-time PCR was used to validate the immunoprecipitation quality. DNA fragments of the proper size (usually 200–300 bp, including the adapter sequence) were selected after PCR amplification. Finally, the resultant libraries were sequenced as paired-end 50 bp reads using the genome-wide massively parallel sequencing platform Illumina HiSeq 2000.
RNA-Seq was performed to profile gene expression in 37 samples, including 14 SZ, 12 BP and 11 control samples. Oligo(dT) beads were used to isolate poly(A) mRNA from the total RNA from these samples. Fragmentation buffer was added and the resulting 200–300 bp fragments were used as templates for random hexamer-primed synthesis of first-strand cDNAs. Second-strand cDNA was synthesised using buffer, dNTPs, RNase H and DNA polymerase I. Fragments were purified using a QIAquick PCR extraction kit and eluted with EB buffer for end repair and poly(A) addition. Based on the results of agarose gel electrophoresis, fragments were connected with sequencing adapters; PCR was performed by selecting suitable fragments as templates. The library was sequenced as paired-end 90 bp reads using an Illumina HiSeq 2000.
Genomic features annotation
The genomic coordinates for the human genomic features investigated were downloaded from the UCSC table browser. RefSeq gene promoters were defined as ±2 kb of sequence flanking the transcription start sites. Table CpGislandext (UCSC) was used for the set of CpG islands (CGIs). We excluded CGIs with ‘random’ chromosome locations. Following Andrew et al., the CpG island shores were defined as the 2 kb regions near the CGIs. In addition, some histone modifications and open chromatin datasets were obtained from the ENCODE project (Table 1). All the coordinates of the epigenomic and transcriptomic datasets and genomic features have been remapped from NCBI36/hg18 to GRCh37/hg19 using the UCSC liftOver tool.
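The interval definitions above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: coordinates are treated as 0-based, half-open intervals, and the shores are taken to be the 2 kb immediately flanking each island on both sides, which the text does not spell out. The function names are hypothetical and are not part of the PD_NGSAtlas code base.

```python
from typing import List, Tuple

# (chromosome, start, end); 0-based, half-open coordinates are an assumption.
Interval = Tuple[str, int, int]


def promoter(chrom: str, tss: int, flank: int = 2000) -> Interval:
    """Return the window of +/- `flank` bp around a transcription start site."""
    return (chrom, max(0, tss - flank), tss + flank)


def cgi_shores(chrom: str, start: int, end: int, width: int = 2000) -> List[Interval]:
    """Return the upstream and downstream shores of one CpG island.

    Assumes a shore is the `width` bp directly adjacent to the island.
    """
    upstream = (chrom, max(0, start - width), start)
    downstream = (chrom, end, end + width)
    return [upstream, downstream]


def keep_cgi(chrom: str) -> bool:
    """Drop islands located on 'random' chromosome scaffolds (e.g. chr1_random)."""
    return "random" not in chrom
```

For example, `promoter("chr1", 5000)` yields `("chr1", 3000, 7000)`, and islands on a scaffold such as `chr1_random` are filtered out by `keep_cgi`.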
Genome-wide DNA methylation and transcription profiles
From the raw fastq files, Illumina quality scores were converted into Sanger Phred quality scores using MAQ. Quality control was performed on the raw sequence data using FastQC. Additional file 3: Figure S2 highlights the quality of our sequencing datasets. Reads from MeDIP-Seq and RNA-Seq were mapped using the SOAP2 program. The uniquely mapped reads were retained for further analysis. The genome methylation peaks were further identified by MACS, and the threshold of the p-value was set to 1.0e-5. In addition, gene expression levels were measured using RPKM. Finally, all the DNA methylation profiles cover 6,634,043 methylation peaks, and the transcription profiles involve 19,186 expressed genes.
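As a reminder of what the RPKM measure computes, the following minimal Python sketch implements the standard definition (reads per kilobase of transcript per million mapped reads). The function name and the choice of inputs are illustrative assumptions; the exact gene-length model and read-counting rules used for the database are not described here.

```python
def rpkm(gene_reads: int, gene_length_bp: int, total_mapped_reads: int) -> float:
    """Reads per kilobase of transcript per million mapped reads.

    RPKM = 1e9 * C / (N * L), where C is the number of reads mapped to the
    gene, N the total number of mapped reads in the sample and L the gene
    (exon model) length in base pairs.
    """
    if gene_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("gene length and total mapped reads must be positive")
    return 1e9 * gene_reads / (gene_length_bp * total_mapped_reads)
```

With 500 reads on a 2,000 bp gene in a library of 10 million mapped reads, `rpkm(500, 2000, 10_000_000)` gives 25.0.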
The PD_NGSAtlas provides a user-friendly interface for the acquisition of methylation profiles and transcription profiles for specific genes or genomic regions of selected samples. A comprehensive search interface is provided (Figure 1a). For transcription data, users can search gene expression levels by entering a gene symbol (optional) and selecting several samples of interest (Figure 1a). The search results are displayed as an overview table that summarises the gene expression levels across selected samples (Figure 1b). This table can show the gene expression pattern across selected samples and can link to the ‘Visualize’ section in which users can view gene expression profiles under a given genomic context through a tailored genome browser (Figure 1c). Similarly, users can obtain DNA methylation profiles of a given gene symbol or chromosome region across selected samples (Figure 2). Furthermore, these DNA methylation profiles can be visualised through a customised genome browser. All of the above query results can be downloaded freely. These valuable data resources should facilitate research on psychiatric disorders.
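A region query of this kind can be illustrated with a short Python sketch that parses a coordinate string and selects the methylation peaks overlapping it. The `chr:start-end` string format, the closed-interval convention and all names below are assumptions made for illustration; they do not describe the query engine actually used by the database.

```python
import re
from typing import List, NamedTuple


class Region(NamedTuple):
    chrom: str
    start: int
    end: int


# Accepts strings such as "chr19:57,862,000-57,870,000" (commas optional).
_REGION_RE = re.compile(r"^(chr[0-9A-Za-z_]+):([\d,]+)-([\d,]+)$")


def parse_region(text: str) -> Region:
    """Parse a 'chr:start-end' string into a Region, rejecting malformed input."""
    match = _REGION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not a region string: {text!r}")
    chrom, start, end = match.groups()
    start_i = int(start.replace(",", ""))
    end_i = int(end.replace(",", ""))
    if start_i > end_i:
        raise ValueError("region start exceeds region end")
    return Region(chrom, start_i, end_i)


def overlapping(peaks: List[Region], query: Region) -> List[Region]:
    """Return the peaks sharing at least one base with the query (closed intervals)."""
    return [
        p for p in peaks
        if p.chrom == query.chrom and p.start <= query.end and p.end >= query.start
    ]
```

Called with a list of peak intervals and a parsed query, `overlapping` returns only the peaks on the same chromosome that intersect the requested window.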
Identification of aberrantly methylated and/or expressed events in psychiatric disorders
In the PD_NGSAtlas database, to view global gene expression profiles, online tools can calculate the overall distribution of gene expression and present it graphically as a flex area chart (Figure 3a). The tool is useful for determining whether data values are median-centred across samples and thus suitable for cross-comparison. Similarly, users can type in a specific gene symbol and view its expression distribution across all samples in which its expression changes (Figure 3a). Typically, users can compare samples that belong to different experimental variable subsets. For transcription data, a tool was developed for users to identify genes that display marked differences in expression levels between two sets of samples. In the current version of the database, a two-tailed t-test and several other widely used methods (including edgeR and DEGseq) were provided to identify the differentially expressed genes (DEGs). The t-test is the most commonly used method to identify DEGs. With the development of high-throughput sequencing, several R packages were developed to identify DEGs from RNA-Seq data. DEGseq integrated three existing methods and introduced two novel methods based on MA-plots to detect and visualise gene expression differences, whereas edgeR used empirical Bayes methods to moderate the degree of overdispersion across transcripts, improving the reliability of inference. In addition, the limma method can be used to identify DEGs while accounting for age and sex. All the p-values obtained by these methods were adjusted for multiple testing. The DEG results are shown in a volcano plot and an M-A plot, and a heatmap is provided to show the expression of the top 50 DEGs (Figure 3a). For DNA methylation data, aberrantly methylated peaks were detected between two samples. For each peak, the number of reads for each sample was calculated, and the significance was assessed using chi-squared tests. Then, the resultant regions with an FDR less than 5% and more than a two-fold difference in read numbers were considered to be differentially methylated regions (DMRs). In the PD_NGSAtlas, a query interface was designed to enable a comparison between disease samples and controls, which users can employ to obtain DMRs (Figure 3b). We propose that the combination of aberrantly methylated regions and expressed genes can be used to elucidate the molecular mechanisms underlying psychiatric disorders.
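The DMR criteria described above (chi-squared test per peak, FDR below 5%, more than a two-fold difference in reads) can be sketched as follows. This is a hedged illustration rather than the database's implementation: it assumes a 2×2 table of reads inside versus outside the peak for the two samples, a Benjamini-Hochberg adjustment for the FDR, and a library-size-normalised fold difference with a pseudocount of one. All function names are hypothetical.

```python
import math
from typing import List, Sequence, Tuple


def chi2_pvalue_2x2(a: int, b: int, c: int, d: int) -> float:
    """P-value of a Pearson chi-squared test (1 d.f.) on the table [[a, b], [c, d]]."""
    n = a + b + c + d
    row1, row2, col1, col2 = a + b, c + d, a + c, b + d
    if min(row1, row2, col1, col2) == 0:
        return 1.0
    stat = n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)
    # Survival function of a chi-squared variable with 1 d.f.
    return math.erfc(math.sqrt(stat / 2.0))


def benjamini_hochberg(pvalues: Sequence[float]) -> List[float]:
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    for rank in range(m, 0, -1):  # walk from the largest p-value down
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


def call_dmrs(
    peak_counts: Sequence[Tuple[int, int]],
    total_a: int,
    total_b: int,
    fdr: float = 0.05,
    min_fold: float = 2.0,
) -> List[int]:
    """Return indices of peaks passing both the FDR and the fold-difference filter.

    peak_counts holds (reads in sample A, reads in sample B) for each peak;
    total_a and total_b are the mapped-read totals of the two samples.
    """
    pvals = [chi2_pvalue_2x2(a, total_a - a, b, total_b - b) for a, b in peak_counts]
    qvals = benjamini_hochberg(pvals)
    dmrs = []
    for i, ((a, b), q) in enumerate(zip(peak_counts, qvals)):
        # Assumption: normalise by library size; the pseudocount avoids division by zero.
        rate_a = (a + 1) / total_a
        rate_b = (b + 1) / total_b
        fold = max(rate_a / rate_b, rate_b / rate_a)
        if q < fdr and fold > min_fold:
            dmrs.append(i)
    return dmrs
```

Only peaks that pass both filters are reported, so a peak with a large but statistically unsupported fold difference, or a significant but small difference, is excluded.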
Visualizing the methylation and transcription profiles of interesting genes and regions
To capture meaningful information from epigenetic and transcriptomic data, a genome browser based on JBrowse was provided, which allows users to compare multilevel genomic, epigenetic and transcriptomic data visually to discover functional relationships (Figure 4). Here, both methylation and transcription data can be visualised in the same view in bigWig format, which can help users to find the functional relationships between the two types of data. Furthermore, users can view other genome information including gene structure, CpG islands, repeat elements and several genomic regulation features against a human reference genome (hg19). These data can intuitively reflect epigenetic and transcriptomic changes between different samples, which would be useful for the study of the molecular mechanisms of psychiatric disorders. The genome browser offers several easy-to-use tools, including the ability to navigate directly to a region of interest by typing in the region coordinates, to zoom in or out or drag a region, to view the annotation details by double-clicking on the annotation track, and to configure genomic annotation by clicking on the track name. Importantly, users can upload their own data to be visualised. The users’ data reside on a local computer without the need to transfer any data to the server. As shown in Figures 1–2, a visual interface can be accessed through the links in the query results.
Relational database and web interface
The web interface was developed in Java using the Servlet framework. The PD_NGSAtlas website is deployed on a Tomcat 6.0.33 web server and runs under CentOS 5.5. It is supported by a MySQL database of DNA methylation and transcription data. jQuery was used to generate, render and manipulate the gene expression distribution views. The module for the identification of differentially expressed genes (DEGs) is implemented in R and Perl scripts. In the ‘Visualize’ module, JBrowse (release 1.11.5), an open source genome browser, can be used to navigate multiple omics data and diverse genome information over the web. Moreover, the PD_NGSAtlas has been fully tested in Google Chrome (version 17 and later), Apple Safari (version 5 and later) and Mozilla Firefox (version 10 and later).
It is worth noting that the integration of epigenetic and transcriptomic data is intended to enhance the analysis of the aetiology of psychiatric disorders at the gene level. Taking the gene ZNF304 as an example, in the BA9 region, ZNF304 is specifically upregulated in patients with SZ compared with controls (Figure 4, t-test, p<10e-3). Furthermore, we found that the promoter of ZNF304 is hypomethylated in SZ samples compared with controls (Figure 4). In addition to ZNF304, we found that the expression of gene ZNF483 is higher in SZ samples than in the controls, and the promoter of ZNF483 is hypomethylated in SZ samples from the BA24 region of the brain. This is consistent with previous research implicating ZNF483 in SZ. These results suggest that the combination of epigenetics and transcriptome studies may provide new insights into the cause of psychiatric disorders.
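The kind of integration illustrated by these examples, promoter hypomethylation accompanied by increased expression, can be expressed as a simple filter over two per-gene tables. The Python sketch below is hypothetical: the input dictionaries, their sign conventions (case minus control) and the function name are assumptions for illustration and do not reproduce the analysis behind Figure 4.

```python
from typing import Dict, List


def hypomethylated_and_upregulated(
    promoter_methylation_change: Dict[str, float],
    expression_log2_fold_change: Dict[str, float],
) -> List[str]:
    """Genes whose promoter loses methylation while expression rises in cases.

    Both inputs map gene symbol -> (case minus control) change; only genes
    present in both tables are considered.
    """
    shared = promoter_methylation_change.keys() & expression_log2_fold_change.keys()
    return sorted(
        gene
        for gene in shared
        if promoter_methylation_change[gene] < 0
        and expression_log2_fold_change[gene] > 0
    )
```

In practice such a filter would be applied only to genes that already pass the DMR and DEG significance thresholds, so that the direction of change is interpreted for statistically supported events.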
The current version of the PD_NGSAtlas is the first release of our database, and it contains next-generation sequencing DNA methylation and gene expression profiles of datasets obtained from human brain and blood samples. Psychiatric disorders are diseases of the central nervous system, and therefore, studies of patient-derived living brain cells may provide the most pertinent information. Post-mortem brains have been extensively used in recent studies; however, obtaining a sufficient number of brains in ideal condition is difficult. Thus, it is more feasible to obtain peripheral samples that can act as potential biomarkers of SZ and BP. Psychiatric disorders, including SZ and BP, have genetic components, and CNS alterations might be reflected in peripheral tissues. Indeed, previous microarray analyses have found numerous classes of genes that are expressed both in blood and in the prefrontal cortex, including approximately half of the so-called SZ susceptibility genes. A previous study comparing the methylation status of pre-mortem blood and post-mortem brain tissue showed that significant variation in the methylation profiles of brain tissue was reflected in blood. Additionally, recent studies have shown that DMRs associated with both chronic pain and ageing are similar in brain and blood tissue. Although the number of blood samples in our current database is limited, peripheral samples for the development of biomarkers and individualised therapies may prove to be potent and complementary tools for use in psychiatric research.
Given the importance of the data as a resource for the community focused on psychiatric research, we have made the PD_NGSAtlas publicly available. To build a DNA methylation and gene expression database focusing on human psychiatric diseases, continued efforts will be made to update the PD_NGSAtlas data and improve the genomic viewer and database functionality. In our current study, we also included some sequencing-based DNA methylation and gene expression profiles related to SZ and BP collected from public databases. We will also encourage research scientists to submit their next-generation sequencing data directly to the PD_NGSAtlas to make this database more comprehensive. Datasets submitted in the future will be manually reviewed and then integrated into this database. In addition, some interfaces are already provided in our current database, so it will be easy to integrate these datasets into the database in the future.
In this study, we proposed the PD_NGSAtlas for the visualisation and analysis of methylation and expression datasets for psychiatric disorders; however, some limitations to the current system need to be addressed in the future. Although a number of datasets were collected and processed into our database, the numbers of samples are still limited. We expect to acquire more samples to make the database more comprehensive in the future. In addition, some statistical methods were incorporated into the database to identify the DEGs. These methods should be used with caution. The user should select the method that is most suitable for a given dataset. For example, the edgeR and DEGseq methods were specifically incorporated for gene expression profiles based on raw read counts. Moreover, it is notable that newly submitted datasets are currently transferred to our database mainly by email. In response to the rapid increase in the amount of sequencing data produced by next-generation sequencing technologies, we expect to incorporate more effective methods to enhance the efficiency of this process.
In this work, we present the PD_NGSAtlas, a specific database for psychiatric disorders, which offers a comprehensive reference resource combining epigenetic and transcriptomic data based on next generation sequencing, and quantitative analysis of epigenetic and transcriptional alterations involved in psychiatric disorders. The PD_NGSAtlas aims to provide reference resources to assist researchers to understand the epigenetic and transcriptional effects involved in the aetiology and pathophysiological mechanisms of psychiatric disorders.
Availability and requirements
PD_NGSAtlas is freely available at http://bioinfo.hrbmu.edu.cn/pd_ngsatlas/. The web interface has been tested in the following web browsers: Google Chrome (version 17 and later), Apple Safari (version 5 and later) and Mozilla’s Firefox (version 10 and later). The “Help” page of the PD_NGSAtlas Web interface includes a step-by-step description of all PD_NGSAtlas features.
MBPs: Methyl-CpG binding proteins
BDNF: Brain-derived neurotrophic factor
TTUHSC: Texas Tech University Health Science Center
SWBB: Southwest Brain Bank
BA9: Brodmann area 9
BA24: Brodmann area 24
Patel V, Prince M: Global mental health: a new global health field comes of age. JAMA. 2010, 303 (19): 1976-1977. 10.1001/jama.2010.616.
Burmeister M, McInnis MG, Zollner S: Psychiatric genetics: progress amid controversy. Nat Rev Genet. 2008, 9 (7): 527-540. 10.1038/nrg2381.
Karlsgodt KH, Sun D, Jimenez AM, Lutkenhoff ES, Willhite R, van Erp TG, Cannon TD: Developmental disruptions in neural connectivity in the pathophysiology of schizophrenia. Dev Psychopathol. 2008, 20 (4): 1297-1327. 10.1017/S095457940800062X.
Lewis DA, Levitt P: Schizophrenia as a disorder of neurodevelopment. Annu Rev Neurosci. 2002, 25: 409-432. 10.1146/annurev.neuro.25.112701.142754.
Craddock N, O'Donovan MC, Owen MJ: The genetics of schizophrenia and bipolar disorder: dissecting psychosis. J Med Genet. 2005, 42 (3): 193-204. 10.1136/jmg.2005.030718.
Kushima I, Aleksic B, Ito Y, Nakamura Y, Nakamura K, Mori N, Kikuchi M, Inada T, Kunugi H, Nanko S, Kato T, Yoshikawa T, Ujike H, Suzuki M, Iwata N, Ozaki N: Association study of ubiquitin-specific peptidase 46 (USP46) with bipolar disorder and schizophrenia in a Japanese population. J Hum Genet. 2010, 55 (3): 133-136. 10.1038/jhg.2009.139.
Pidsley R, Mill J: Epigenetic studies of psychosis: current findings, methodological approaches, and implications for postmortem research. Biol Psychiatry. 2011, 69 (2): 146-156. 10.1016/j.biopsych.2010.03.029.
Bird A: DNA methylation patterns and epigenetic memory. Genes Dev. 2002, 16 (1): 6-21. 10.1101/gad.947102.
Nan X, Ng HH, Johnson CA, Laherty CD, Turner BM, Eisenman RN, Bird A: Transcriptional repression by the methyl-CpG-binding protein MeCP2 involves a histone deacetylase complex. Nature. 1998, 393 (6683): 386-389. 10.1038/30764.
Suzuki MM, Bird A: DNA methylation landscapes: provocative insights from epigenomics. Nat Rev Genet. 2008, 9 (6): 465-476. 10.1038/nrg2341.
Li E, Bestor TH, Jaenisch R: Targeted mutation of the DNA methyltransferase gene results in embryonic lethality. Cell. 1992, 69 (6): 915-926. 10.1016/0092-8674(92)90611-F.
Guo JU, Ma DK, Mo H, Ball MP, Jang MH, Bonaguidi MA, Balazer JA, Eaves HL, Xie B, Ford E, Zhang K, Ming GL, Gao Y, Song H: Neuronal activity modifies the DNA methylation landscape in the adult brain. Nat Neurosci. 2011, 14 (10): 1345-1351. 10.1038/nn.2900.
Pidsley R, Dempster EL, Mill J: Brain weight in males is correlated with DNA methylation at IGF2. Mol Psychiatry. 2010, 15 (9): 880-881. 10.1038/mp.2009.138.
Lubin FD, Roth TL, Sweatt JD: Epigenetic regulation of BDNF gene transcription in the consolidation of fear memory. J Neurosci. 2008, 28 (42): 10576-10586. 10.1523/JNEUROSCI.1786-08.2008.
Isles AR, Davies W, Wilkinson LS: Genomic imprinting and the social brain. Philos Trans R Soc Lond B Biol Sci. 2006, 361 (1476): 2229-2237. 10.1098/rstb.2006.1942.
Connor CM, Akbarian S: DNA methylation changes in schizophrenia and bipolar disorder. Epigenetics. 2008, 3 (2): 55-58. 10.4161/epi.3.2.5938.
Chen WG, Chang Q, Lin Y, Meissner A, West AE, Griffith EC, Jaenisch R, Greenberg ME: Derepression of BDNF transcription involves calcium-dependent phosphorylation of MeCP2. Science. 2003, 302 (5646): 885-889. 10.1126/science.1086446.
Martinowich K, Hattori D, Wu H, Fouse S, He F, Hu Y, Fan G, Sun YE: DNA methylation-related chromatin remodeling in activity-dependent BDNF gene regulation. Science. 2003, 302 (5646): 890-893. 10.1126/science.1090842.
Mill J, Tang T, Kaminsky Z, Khare T, Yazdanpanah S, Bouchard L, Jia P, Assadzadeh A, Flanagan J, Schumacher A, Wang SC, Petronis A: Epigenomic profiling reveals DNA-methylation changes associated with major psychosis. Am J Hum Genet. 2008, 82 (3): 696-711. 10.1016/j.ajhg.2008.01.008.
Grayson DR, Jia X, Chen Y, Sharma RP, Mitchell CP, Guidotti A, Costa E: Reelin promoter hypermethylation in schizophrenia. Proc Natl Acad Sci U S A. 2005, 102 (26): 9341-9346. 10.1073/pnas.0503736102.
Iwamoto K, Bundo M, Yamada K, Takao H, Iwayama-Shigeno Y, Yoshikawa T, Kato T: DNA methylation status of SOX10 correlates with its downregulation and oligodendrocyte dysfunction in schizophrenia. J Neurosci. 2005, 25 (22): 5376-5381. 10.1523/JNEUROSCI.0766-05.2005.
Xiao Y, Camarillo C, Ping Y, Arana TB, Zhao H, Thompson PM, Xu C, Su BB, Fan H, Ordonez J, Wang L, Mao C, Zhang Y, Cruz D, Escamilla MA, Li X: The DNA methylome and transcriptome of different brain regions in schizophrenia and bipolar disorder. PLoS One. 2014, 9 (4): e95875. 10.1371/journal.pone.0095875.
Li Y, Camarillo C, Xu J, Arana TB, Xiao Y, Zhao Z, Chen H, Ramirez M, Zavala J, Escamilla MA, Armas R, Mendoza R, Ontiveros A, Nicolini H, Jerez A, Rubin LP, Li X, Xu C: Genome-wide methylome analyses reveal novel epigenetic regulation patterns in schizophrenia and bipolar disorder. Biomed Res Int. 2014, http://www.hindawi.com/journals/bmri/aa/201587/.
Hackenberg M, Barturen G, Oliver JL: NGSmethDB: a database for next-generation sequencing single-cytosine-resolution DNA methylation data. Nucleic Acids Res. 2011, 39 (Database issue): D75-79. 10.1093/nar/gkq942.
Zou D, Sun S, Li R, Liu J, Zhang J, Zhang Z: MethBank: a database integrating next-generation sequencing single-base-resolution DNA methylation programming data. Nucleic Acids Res. 2014, 10.1093/nar/gku920.
Lv J, Liu H, Su J, Wu X, Liu H, Li B, Xiao X, Wang F, Wu Q, Zhang Y: DiseaseMeth: a human disease methylation database. Nucleic Acids Res. 2012, 40 (Database issue): D1030-1035. 10.1093/nar/gkr1169.
Gu F, Doderer MS, Huang YW, Roa JC, Goodfellow PJ, Kizer EL, Huang TH, Chen Y: CMS: a web-based system for visualization and analysis of genome-wide methylation data of human cancers. PLoS One. 2013, 8 (4): e60980. 10.1371/journal.pone.0060980.
Xin Y, Chanrion B, O'Donnell AH, Milekic M, Costa R, Ge Y, Haghighi FG: MethylomeDB: a database of DNA methylation profiles of the brain. Nucleic Acids Res. 2012, 40 (Database issue): D1245-1249. 10.1093/nar/gkr1193.
Rajkowska G, Goldman-Rakic PS: Cytoarchitectonic definition of prefrontal areas in the normal human cortex: II. Variability in locations of areas 9 and 46 and relationship to the Talairach Coordinate System. Cerebral cortex. 1995, 5 (4): 323-337. 10.1093/cercor/5.4.323.
Karolchik D, Hinrichs AS, Furey TS, Roskin KM, Sugnet CW, Haussler D, Kent WJ: The UCSC Table Browser data retrieval tool. Nucleic Acids Res. 2004, 32 (Database issue): D493-496. 10.1093/nar/gkh103.
Bernstein BE, Birney E, Dunham I, Green ED, Gunter C, Snyder M: An integrated encyclopedia of DNA elements in the human genome. Nature. 2012, 489 (7414): 57-74. 10.1038/nature11247.
Li R, Yu C, Li Y, Lam TW, Yiu SM, Kristiansen K, Wang J: SOAP2: an improved ultrafast tool for short read alignment. Bioinformatics. 2009, 25 (15): 1966-1967. 10.1093/bioinformatics/btp336.
Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, Nusbaum C, Myers RM, Brown M, Li W, Liu XS: Model-based analysis of ChIP-Seq (MACS). Genome Biol. 2008, 9 (9): R137-10.1186/gb-2008-9-9-r137.
Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B: Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008, 5 (7): 621-628. 10.1038/nmeth.1226.
Robinson MD, McCarthy DJ, Smyth GK: edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010, 26 (1): 139-140. 10.1093/bioinformatics/btp616.
Wang L, Feng Z, Wang X, Wang X, Zhang X: DEGseq: an R package for identifying differentially expressed genes from RNA-seq data. Bioinformatics. 2010, 26 (1): 136-138. 10.1093/bioinformatics/btp612.
Benjamini Y, Hochberg Y: Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Roy Stat Soc. 1995, 57 (1): 289-300.
Skinner ME, Uzilov AV, Stein LD, Mungall CJ, Holmes IH: JBrowse: a next-generation genome browser. Genome Res. 2009, 19 (9): 1630-1638. 10.1101/gr.094607.109.
Sainz J, Mata I, Barrera J, Perez-Iglesias R, Varela I, Arranz MJ, Rodriguez MC, Crespo-Facorro B: Inflammatory and immune response genes have significantly altered expression in schizophrenia. Mol Psychiatry. 2013, 18 (10): 1056-1057. 10.1038/mp.2012.165.
Schultz CC, Nenadic I, Riley B, Vladimirov VI, Wagner G, Koch K, Schachtzabel C, Muhleisen TW, Basmanav B, Nothen MM, Deufel T, Kiehntopf M, Rietschel M, Reichenbach JR, Cichon S, Schlosser RG, Sauer H: ZNF804A and cortical structure in schizophrenia: in vivo and postmortem studies. Schizophr Bull. 2014, 40 (3): 532-541. 10.1093/schbul/sbt123.
Hayashi-Takagi A, Vawter MP, Iwamoto K: Peripheral biomarkers revisited: integrative profiling of peripheral samples for psychiatric research. Biol Psychiatry. 2014, 75 (12): 920-928. 10.1016/j.biopsych.2013.09.035.
Sullivan PF, Daly MJ, O'Donovan M: Genetic architectures of psychiatric disorders: the emerging picture and its implications. Nat Rev Genet. 2012, 13 (8): 537-551. 10.1038/nrg3240.
Sullivan PF, Fan C, Perou CM: Evaluating the comparability of gene expression in blood and brain. Am J Med Genet B Neuropsychiatr Genet. 2006, 141B (3): 261-268. 10.1002/ajmg.b.30272.
Rollins B, Martin MV, Morgan L, Vawter MP: Analysis of whole genome biomarker expression in blood and brain. Am J Med Genet B Neuropsychiatr Genet. 2010, 153B (4): 919-936.
Davies MN, Volta M, Pidsley R, Lunnon K, Dixit A, Lovestone S, Coarfa C, Harris RA, Milosavljevic A, Troakes C, Al-Sarraj S, Dobson R, Schalkwyk LC, Mill J: Functional annotation of the human brain methylome identifies tissue-specific epigenetic variation across brain and blood. Genome Biol. 2012, 13 (6): R43-10.1186/gb-2012-13-6-r43.
Bell JT, Tsai PC, Yang TP, Pidsley R, Nisbet J, Glass D, Mangino M, Zhai G, Zhang F, Valdes A, Shin SY, Dempster EL, Murray RM, Grundberg E, Hedman AK, Nica A, Small KS, Dermitzakis ET, McCarthy MI, Mill J, Spector TD, Deloukas P: Epigenome-wide scans identify differentially methylated regions for age and age-related phenotypes in a healthy ageing population. PLoS genetics. 2012, 8 (4): e1002629-10.1371/journal.pgen.1002629.
Davies MN, Krause L, Bell JT, Gao F, Ward KJ, Wu H, Lu H, Liu Y, Tsai PC, Collier DA, Murphy T, Dempster E, Mill J, Battle A, Mostafavi S, Zhu X, Henders A, Byrne E, Wray NR, Martin NG, Spector TD, Wang J: Hypermethylation in the ZBTB20 gene is associated with major depressive disorder. Genome Biol. 2014, 15 (4): R56-10.1186/gb-2014-15-4-r56.
This work was supported by the National High Technology Research and Development Program of China [863 Program, Grant No. 2014AA021102], the National Program on Key Basic Research Project [973 Program, Grant No. 2014CB910504], the National Natural Science Foundation of China [Grant Nos. 91129710, 61170154 and 61203264], the China Postdoctoral Science Foundation [Grant Nos. 2012M520764 and 2014T70364], the WeihanYu Youth Science Fund Project of Harbin Medical University, and the Innovation Research Fund for Graduate Students of Harbin Medical University [Grant No. YJSCX2014-22HYD].
The authors thank the Department of Psychiatry and Center of Excellence-Neurosciences, Texas Tech University Health Science Center (TTUHSC) for blood samples, and the Southwest Brain Bank, Department of Psychiatry, UTHSCSA, TX, USA for post-mortem brain samples.
The authors declare that they have no competing interests.
XL, JX and CX conceived of the project. ZZ, YL, HC, JL, PT, JC and ZW participated in the collection and analysis of all data sources. ZZ designed and implemented the database. ZZ, JX, CX and XL wrote the manuscript. All authors have read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Figure S1. Overview of the PD_NGSAtlas. (a) The search page of the database. (b) The detail pages for searching gene expression, methylation peaks and DMRs. (c) The expression of a specific gene, shown on the search result page. Users can also view the distribution of the gene expression across samples by clicking the bar button. (d) The DNA methylation of a specific gene across samples. (e) The DMRs identified across the samples selected by users. (f) The visualization of DNA methylation and gene expression. (DOC 478 KB)
Additional file 3: Figure S2. Median Phred score vs. base position. The quality scores of the reads were satisfactory; most of the called bases had a Phred score ≥ 30. (DOC 170 KB)
About this article
Cite this article
Zhao, Z., Li, Y., Chen, H. et al. PD_NGSAtlas: a reference database combining next-generation sequencing epigenomic and transcriptomic data for psychiatric disorders. BMC Med Genomics 7, 71 (2014). https://doi.org/10.1186/s12920-014-0071-z
- Bipolar disorder
- Next-generation sequencing
- Epigenomic and transcriptomic data