TY  - JOUR
AU  - Shi, Jingyi
AU  - Zheng, Mingna
AU  - Yao, Lixia
AU  - Ge, Yaorong
PY  - 2018
DA  - 2018/11/20
TI  - Developing a healthcare dataset information resource (DIR) based on Semantic Web
JO  - BMC Medical Genomics
SP  - 102
VL  - 11
IS  - 5
AB  - The right dataset is essential to obtain the right insights in data science; therefore, it is important for data scientists to have a good understanding of the availability of relevant datasets as well as the content, structure, and existing analyses of these datasets. While a number of efforts are underway to integrate the large amount and variety of datasets, the lack of an information resource that focuses on specific needs of target users of datasets has existed as a problem for years. To address this gap, we have developed a Dataset Information Resource (DIR), using a user-oriented approach, which gathers relevant dataset knowledge for specific user types. In the present version, we specifically address the challenges of entry-level data scientists in learning to identify, understand, and analyze major datasets in healthcare. We emphasize that the DIR does not contain actual data from the datasets but aims to provide comprehensive knowledge about the datasets and their analyses.
SN  - 1755-8794
UR  - https://doi.org/10.1186/s12920-018-0411-5
DO  - 10.1186/s12920-018-0411-5
ID  - Shi2018
ER  -