The current trend of genomics research for human diseases
© Zhang et al.; licensee BioMed Central Ltd. 2013
Published: 23 January 2013
This is an introduction to the supplement to BMC Medical Genomics that includes16 papers selected from the 2011 World Congress in Computer Science, Computer Engineering, Applied Computing as well as other sources with a focus on genomics studies with a focus on human diseases.
This supplement is dedicated to the World Congress in Computer Science, Computer Engineering, and Applied Computing on July 18-21 in 2011 in Las Vegas, Nevada. The congress drew a wide attention from computer scientists, applied mathematicians and statisticians, bioinformaticians, and researchers in many other fields. As one of the fastest developing fields, genomics became a prominent area that drew attentions from most attendants. Noting that a substantial part of genomics work was dedicated to human disease studies, we decided to launch a supplement to BMC Medical Genomics with manuscripts selected from this congress as well as invited submissions. More than 30 manuscripts were submitted to this supplement. Each manuscript was reviewed by at least two reviewers in addition to the review by the editorial board. We hope that the selected 15 manuscripts represent the current success and challenges of medical genomics research and indicate the future direction.
The genetic heterogeneity and the complicated molecular mechanism underlying tumor progression have made it highly difficult to develop sustainable approaches for cancer therapy. The explosion of knowledge about global gene alterations and genomic variations has changed the traditional approaches of cancer research and presented a promising foundation for diagnosis and treatment of cancer. Much effort of this supplement is devoted to cancer research with various types of genomics data in terms of early diagnosis, biomarker identification, cancer classification and therapy. Wang et al.  proposed a Chi-square statistic based Top Score Genes (Chi-TSG) method for tumor classification using global gene expression data. They tested multiple cancer datasets and found their algorithm had better performance in comparison to other widely-used classifiers, such as Top Scoring Pair, Support Vector Machines and Prediction Analysis of Microarrays. A manuscript by Zhang and Chen  also explored cancer classification by developing multiple distance metrics for proteomics data. Their data showed correct identification of breast cancer subtypes and molecular pathways. A predictive model, Support Vector Machine based on Recursive Feature Elimination and Cross Validation (SVM-RFE-CV), was proposed by Zhang, Deng and Drabier  for breast cancer early detection using microarray data collected from peripheral blood. Their method identified a small group of biomarker genes for early diagnosis, which is potentially more convenient and accurate than the traditional diagnosis using tissue biopsies and mammography. Cancer detection and prediction was developed in non-small cell lung carcinoma (NSCLC) by Tran . His method based on generalized Lorenz curves and Gini ratios identified a set of 9 genes from the NSCLC gene expression profiles and these biomarker genes can be used for cancer prediction. Epigenetic variation can be used for cancer early detection and classification as well. Chen et al.  proposed a nonparametric method for detecting differentially methylated loci and suggested its application in cancer diagnosis and classification.
Gene pathway and networking is another major focus of this special issue. Garrett et al.  investigated the molecular changes for human proximal tubule cells in the short and long term exposure of cadmium. The short and long term exposure in cells represented the acute and chronic toxic response of human kidney. Varying exposure time and cadmium concentration enabled them to conduct gene pathway analysis and construct gene networks model for toxic response by heavy metal. Pathway analysis was carried out by Liu et al.  to investigate potential common molecular mechanism underlying two complex diseases, Schizophrenia and type 2 diabetes mellitus. The protein-protein network revealed high connectivity hub proteins that were involved in a few important signal pathways for both diseases. Gene order is an emerging way for gene clustering and molecular pattern discovery. By combining transcriptome data with lipidomics data, Zhao et al.  identified activated pathways during type 2 diabetes development. Hu et al.  compared a few distance metrics using genetic algorithm and Ant Colony Optimization for computing gene order and found that the optimized method worked well for Alzheimer's disease microarray data.
The manuscripts of this supplement cover various data types and broad topics. ChIP-Seq of RNA Pol II was used to identify the bidirectional promoter by detecting bi-peak shapes using computational approach . Qureshi and Sacan  proposed a weighted normalization method for microRNA RT-PCR data. Juan et al.  investigated the interaction between microRNAs and long intergenic noncoding RNAs microRNAs. Such interactions were implied in breast cancer development. Wang et al.  developed a single gene sequencing technique for genotyping hepatitis B virus and detecting gene mutation for drug resistance. Zheng et al.  proposed a support vector machine-based machine learning method to predict DNA methylation status, which may have broad applications in disease studies. Teng, Yang and Wang  developed a machine learning method to predict the human tissue-specific genes from microarray data and UniProt database. Their method may be useful for understanding the molecular mechanisms underlying tissue-specific diseases. Lastly, we would like to introduce a database for disease associated mutations. The database, HDAM, is constructed by Jia et al.  using next generation sequencing data. HDAM currently assembled 1,114 mutations accounting for 669 genes and 125 human diseases.
In short, this special issue brings forth many exciting scientific discoveries and methodology advances that present the cutting edge development of genomics research in human diseases. We hope that it would provide guidance for the questions and attempts in current genomics research and stimulate ideas for future studies.
This article has been published as part of BMC Medical Genomics Volume 6 Supplement 1, 2013: Proceedings of the 2011 International Conference on Bioinformatics and Computational Biology (BIOCOMP'11). The full contents of the supplement are available online at http://www.biomedcentral.com/bmcmedgenomics/supplements/6/S1. Publication of this supplement has been supported by the International Society of Intelligent Biological Medicine.
- Wang H, Zhang H, Dai Z, Chen M-S, Yuan Z: TSG: a new algorithm for binary and multi-class cancer classification and informative genes selection. BMC Medical Genomics. 2012Google Scholar
- Zhang F, Chen J: Analysis of subtype associated pathway changes from plasma proteome in breast cancer using distance measurement. BMC Medical Genomics. 2012Google Scholar
- Zhang F, Deng Y, Kaufman H, Drabier R: Recursive SVM biomarker selection for early detection of breast cancer in peripheral blood. BMC Medical Genomics. 2012Google Scholar
- Tran Q-N: A novel method for finding non-small call lung cancer diagnosis biomarkers. BMC Medical Genomics. 2012Google Scholar
- Chen Z, Huang H, Liu J, Ng HKT, Nadarajah S, et al: Detecting differentially methylated loci for Illumina array methylation data based on human ovarian cancer data. BMC Medical Genomics. 2012Google Scholar
- Garrett SH, Clarke K, Sens DA, Deng Y, Somji S, et al: Short and long term gene expression variation and networking in human proximal tubule cells when exposed to cadmium. BMC Medical Genomics. 2012Google Scholar
- Liu Y, Li Z, Zhang M, Deng Y, Yi Z, Shi T: Exploring the pathogenetic association between Schizophrenia and type 2 diabetes mellitus diseases based on pathway analysis. BMC Medical Genomics. 2012Google Scholar
- Zhao C, Mao J, Ai J, Ming S, Shi T, et al: Integrated lipidomics and transcriptomic analysis of peripheral blood reveals significantly enriched pathways in type 2 diabetes mellitus. BMC Medical Genomics. 2012Google Scholar
- Hu B, Jiang G, Pang C, Wang S, Liu Q, et al: Assessment of gene order computing methods for Alzheimer's disease. BMC Medical Genomics. 2012Google Scholar
- Wang G, Bai Q, Zhao Y, Juan L, Teng M, et al: RNA polymerase II binding patterns reveal the regulatory region of bidirectional promoter in cervical cancer cell. BMC Medical Genomics. 2012Google Scholar
- Quershi R, Sacan A: A novel method for the normalization of microRNA RT-PCR data. BMC Medical Genomics. 2012Google Scholar
- Juan L, Wang G, Wang Y, Liu Y: Potential roles of microRNAs in regulating long intergenic noncoding RNAs. BMC Medical Genomics. 2012Google Scholar
- Wang F, Lu L, Yu C, Lv Z, Luo X, et al: Development of a novel DNA sequencing method not only for hepatitis B virus genotyping but also for drug resistant mutation detection. BMC Medical Genomics. 2012Google Scholar
- Zheng H, Jiang S-W, Wu H, et al: CpGIMethPred: computational model for predicting methylation status of CpG islands in human genome. BMC Medical Genomics. 2012Google Scholar
- Teng S, Yang JY, Wang L: Genome-wide prediction and analysis of human tissue-selective genes using microarray expression data. BMC Medical Genomics. 2012Google Scholar
- Jia M, Liu Y, Shen Z, Zhao C, Zhang M, et al: HDAM: a resource of human disease associated mutations from next generation sequencing studies. BMC Medical Genomics. 2012Google Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.