TY - JOUR AU - Osheroff, J. A. PY - 2007 DA - 2007// TI - A roadmap for national action on clinical decision support JO - J Am Med Inform Assoc VL - 14 UR - https://doi.org/10.1197/jamia.M2334 DO - 10.1197/jamia.M2334 ID - Osheroff2007 ER - TY - STD TI - Xu B, et al. Distributed gene clinical decision support system based on cloud computing. In: Bioinformatics and Biomedicine (BIBM), 2017 IEEE International Conference on: IEEE; 2017. ID - ref2 ER - TY - JOUR AU - Muir, P. PY - 2016 DA - 2016// TI - The real cost of sequencing: scaling computation to keep pace with data generation JO - Genome Biol VL - 17 UR - https://doi.org/10.1186/s13059-015-0866-z DO - 10.1186/s13059-015-0866-z ID - Muir2016 ER - TY - JOUR AU - Auwera, G. A. PY - 2013 DA - 2013// TI - From FastQ data to high-confidence variant calls: the genome analysis toolkit best practices pipeline JO - Curr Protoc Bioinformatics VL - 43 ID - Auwera2013 ER - TY - JOUR AU - Li, H. AU - Durbin, R. PY - 2009 DA - 2009// TI - Fast and accurate short read alignment with burrows–wheeler transform JO - Bioinformatics VL - 25 UR - https://doi.org/10.1093/bioinformatics/btp324 DO - 10.1093/bioinformatics/btp324 ID - Li2009 ER - TY - JOUR AU - McKenna, A. PY - 2010 DA - 2010// TI - The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data JO - Genome Res VL - 20 UR - https://doi.org/10.1101/gr.107524.110 DO - 10.1101/gr.107524.110 ID - McKenna2010 ER - TY - STD TI - Nothaft FA, et al. Rethinking data-intensive science using scalable analytics systems. In: Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data: ACM; 2015. ID - ref7 ER - TY - JOUR AU - Wang, C. PY - 2015 DA - 2015// TI - Heterogeneous cloud framework for big data genome sequencing JO - IEEE/ACM Trans Comput Biol Bioinform (TCBB) VL - 12 UR - https://doi.org/10.1109/TCBB.2014.2351800 DO - 10.1109/TCBB.2014.2351800 ID - Wang2015 ER - TY - BOOK AU - Li, H. PY - 2013 DA - 2013// TI - Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv preprint arXiv:1303.3997 ID - Li2013 ER - TY - BOOK AU - Nothaft, F. PY - 2015 DA - 2015// TI - Scalable genome resequencing with ADAM and avocado ID - Nothaft2015 ER - TY - BOOK AU - Massie, M. PY - 2013 DA - 2013// TI - Adam: Genomics formats and processing patterns for cloud scale computing. EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2013-207 ID - Massie2013 ER - TY - BOOK AU - Zaharia, M. PY - 2011 DA - 2011// TI - Faster and more accurate sequence alignment with SNAP. arXiv preprint arXiv:1111.5572 ID - Zaharia2011 ER - TY - BOOK AU - Chen, Y. -. T. PY - 2015 DA - 2015// TI - CS-BWAMEM: a fast and scalable read aligner at the cloud scale for whole genome sequencing. High Throughput Sequencing Algorithms and Applications (HITSEQ) ID - Chen2015 ER - TY - BOOK AU - Chen, Y. -. T. PY - 2016 DA - 2016// TI - Memory system optimizations for customized computing--from single-Chip to datacenter ID - Chen2016 ER - TY - STD TI - Spark meets Genomics: Helping Fight the Big C with the Big D. Available from: https://spark-summit.org/2014/david-patterson/. UR - https://spark-summit.org/2014/david-patterson/ ID - ref15 ER - TY - STD TI - Parsian M. Data Algorithms: Recipes for Scaling Up with Hadoop and Spark: O'Reilly Media, Inc.; 2015. ID - ref16 ER - TY - BOOK AU - Xu, B. PY - 2017 DA - 2017// TI - DSA: Scalable Distributed Sequence Alignment System Using SIMD Instructions. arXiv preprint arXiv:1701.01575 ID - Xu2017 ER - TY - BOOK AU - Xu, B. PY - 2017 DA - 2017// TI - Efficient Distributed Smith-Waterman Algorithm Based on Apache Spark. 2017 IEEE 10th International Conference on Cloud Computing ID - Xu2017 ER - TY - JOUR AU - Li, H. AU - Durbin, R. PY - 2010 DA - 2010// TI - Fast and accurate long-read alignment with burrows–wheeler transform JO - Bioinformatics VL - 26 UR - https://doi.org/10.1093/bioinformatics/btp698 DO - 10.1093/bioinformatics/btp698 ID - Li2010 ER - TY - JOUR AU - Abuín, J. M. PY - 2016 DA - 2016// TI - SparkBWA: speeding up the alignment of high-throughput DNA sequencing data JO - PLoS One VL - 11 UR - https://doi.org/10.1371/journal.pone.0155461 DO - 10.1371/journal.pone.0155461 ID - Abuín2016 ER - TY - STD TI - White T. Hadoop: The definitive guide: O'Reilly Media, Inc.; 2012. ID - ref21 ER - TY - STD TI - Shvachko K, et al. The hadoop distributed file system. In: 2010 IEEE 26th symposium on mass storage systems and technologies (MSST): IEEE; 2010. ID - ref22 ER - TY - JOUR AU - Zaharia, M. PY - 2010 DA - 2010// TI - Spark: cluster computing with working sets JO - HotCloud VL - 10 ID - Zaharia2010 ER - TY - STD TI - Li H, et al. Tachyon: reliable, memory speed storage for cluster computing frameworks. In: Proceedings of the ACM symposium on cloud computing: ACM; 2014. ID - ref24 ER - TY - STD TI - Wang C, et al. GenServ: genome sequencing services on scalable energy efficient accelerators. In: Web Services (ICWS), 2017 IEEE International Conference on: IEEE; 2017. ID - ref25 ER - TY - STD TI - Wang C, et al. Big data genome sequencing on zynq based clusters. In: Proceedings of the 2014 ACM/SIGDA international symposium on field-programmable gate arrays: ACM; 2014. ID - ref26 ER - TY - STD TI - Wang C, et al. Genome sequencing using mapreduce on FPGA with multiple hardware accelerators. In: Proceedings of the ACM/SIGDA international symposium on field programmable gate arrays: ACM; 2013. ID - ref27 ER - TY - STD TI - Zaharia M, et al. Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing. In: Proceedings of the 9th USENIX conference on networked systems design and implementation: USENIX Association; 2012. ID - ref28 ER - TY - BOOK AU - Rodrigues Pereira, R. PY - 2016 DA - 2016// TI - Identifying potential cis-regulatory variants associated with allele-specific expression ID - Rodrigues Pereira2016 ER - TY - JOUR AU - Amberger, J. S. PY - 2015 DA - 2015// TI - OMIM. Org: online Mendelian inheritance in man (OMIM®), an online catalog of human genes and genetic disorders JO - Nucleic Acids Res VL - 43 UR - https://doi.org/10.1093/nar/gku1205 DO - 10.1093/nar/gku1205 ID - Amberger2015 ER - TY - STD TI - Parquet, A. Apache Parquet. http://parquet.incubator.apache.org; Available from: http://parquet.apache.org/. UR - http://parquet.apache.org/ ID - ref31 ER - TY - JOUR AU - Dean, J. AU - Ghemawat, S. PY - 2008 DA - 2008// TI - MapReduce: simplified data processing on large clusters JO - Commun ACM VL - 51 UR - https://doi.org/10.1145/1327452.1327492 DO - 10.1145/1327452.1327492 ID - Dean2008 ER - TY - JOUR AU - Li, H. PY - 2009 DA - 2009// TI - The sequence alignment/map format and SAMtools JO - Bioinformatics VL - 25 UR - https://doi.org/10.1093/bioinformatics/btp352 DO - 10.1093/bioinformatics/btp352 ID - Li2009 ER - TY - STD TI - Forer L, et al. Cloudflow-a framework for mapreduce pipeline development in biomedical research. In: Information and Communication Technology, Electronics and Microelectronics (MIPRO), 2015 38th International Convention on: IEEE; 2015. ID - ref34 ER - TY - STD TI - Li H. wgsim-Read simulator for next generation sequencing. http://github.com/lh3/wgsim; 2013. UR - http://github.com/lh3/wgsim ID - ref35 ER -