- Open Access
CRlncRNA: a manually curated database of cancer-related long non-coding RNAs with experimental proof of functions on clinicopathological and molecular features
BMC Medical Genomicsvolume 11, Article number: 114 (2018)
Recent studies demonstrated that long non-coding RNAs (lncRNAs) could be intricately implicated in cancer-related molecular networks, and related to cancer occurrence, development and prognosis. However, clinicopathological and molecular features for these cancer-related lncRNAs, which are very important in bridging lncRNA basic research with clinical research, fail to well settle to integration.
After manually reviewing more than 2500 published literature, we collected the cancer-related lncRNAs with the experimental proof of functions. By integrating from literature and public databases, we constructed CRlncRNA, a database of cancer-related lncRNAs. The current version of CRlncRNA embodied 355 entries of cancer-related lncRNAs, covering 1072 cancer-lncRNA associations regarding to 76 types of cancer, and 1238 interactions with different RNAs and proteins. We further annotated clinicopathological features of these lncRNAs, such as the clinical stages and the cancer hallmarks. We also provided tools for data browsing, searching and download, as well as online BLAST, genome browser and gene network visualization service.
CRlncRNA is a manually curated database for retrieving clinicopathological and molecular features of cancer-related lncRNAs supported by highly reliable evidences. CRlncRNA aims to provide a bridge from lncRNA basic research to clinical research. The lncRNA dataset collected by CRlncRNA can be used as a golden standard dataset for the prospective experimental and in-silico studies of cancer-related lncRNAs. CRlncRNA is freely available for all users at http://crlnc.xtbg.ac.cn.
Cancer is a collection of diseases characterized by abnormal cell growth with the potential to invade adjacent tissues and spread to distant sites. Cancer formation is a complicated process that involves some common traits (cancer hallmarks), such as self-sufficiency in growth signals, insensitivity to anti-growth signals, evading apoptosis, limitless replicative potential, sustained angiogenesis, and tissue invasion and metastasis [1,2,3]. The discoveries of cancer driver protein-coding genes and their molecular mechanisms produced countless breakthroughs over the past years . For example, many different types of cancer show a high incidence of TP53 mutations, leading to the expression of mutant p53 proteins . While the genetic causes of cancer have been intensively studied, it is becoming clear that a large proportion of cancer susceptibility cannot be attributed to variation in protein-coding sequences . On the other hand, a kind of length more than 200 nt non-coding RNA (long non-coding RNA, lncRNA) is found to be intricately implicated in cancer occurrence and development, and might work as effective therapeutic target and diagnostic marker for cancers with diverse types and phases [7,8,9].
Growing evidence reveals that lncRNAs play vital roles in a variety of biological processes such as cell differentiation [10, 11], apoptosis , autophagy , metabolism  and neoplasia , with the extremely diverse and complex mechanisms. LncRNAs can epigenetically modulate gene expression via recruiting chromatin modification factors [16,17,18,19]. In addition, lncRNAs can regulate mRNA translation and/or stability at post-transcription level by base-pairing with them [20, 21]. Besides, some lncRNAs can also function as miRNA sponges by titrating miRNAs away from their mRNA targets [22, 23]. Recent studies have shown that lncRNA disorders play an important role in the development and progression of cancer. For instance, ANRIL that has been reported to be dysregulated in several human cancers, is believed to facilitate the proliferation of cancer cells and repress apoptosis [24,25,26]. Another lncRNA, lncRNA-activated by TGF-beta (lncRNA-ATB), is an oncogene, which can promote the invasion-metastasis cascade in hepatocellular carcinoma . LncRNA H19, an oncogene in diverse cancers, can promote cell proliferation by accelerating cell-cycle progression, and also function as miRNA sponge to antagonize the latter functions and lead to the de-repression of miRNA endogenous target [28,29,30,31,32,33].
Given a large number of cancer-related lncRNAs being discovered, some cancer-related lncRNA databases have been developed in recent years. For example, LncRNADisease  collects lncRNA-disease associations of both the experimentally reported and the computationally predicted. Lnc2Cancer provides more than 1000 manually curated associations between lncRNAs and human cancers . TANRIC presents a resource of lncRNAs with clinical and other molecular data, both within and across tumor types . Lnc2Catlas is an atlas of lncRNAs compiled with quantitative associations between lncRNAs and cancers using different computational methods . LnCaNet serves as a comprehensive co-expression data resource of the interactions between lncRNA and non-neighboring cancer genes . Although these databases are of immense help in the study of cancer-related lncRNAs, a database containing both clinicopathological and molecular features of cancer-related lncRNAs supported by highly reliable evidences, is of great importance to infuse lncRNA basic research into clinical research.
Here, we developed a new cancer-related lncRNA database, CRlncRNA, for two main objectives. Firstly, we anticipate providing a golden standard dataset for the follow-up experimental and in-silico studies of cancer-related lncRNAs. Other than the proceeding cancer-related lncRNA databases that collected most of data primarily from computational prediction or high-throughput experiments (e.g., differential expression data by sequencing or microarray), all entries in CRlncRNA were manually assembled from the published literature and supported by low-throughput functional experiments, which tend to provide more in-depth experimental evidence other than differential expression profiling. The second goal of this work is to mediate the transition from lncRNA basic research to clinical research. For each lncRNA in CRlncRNA, we gathered not only the information about its genomics location, epigenetic modification, expression profile and molecular interaction, but also the certificated clinicopathological features, such as correlative clinical stages and cancer hallmarks. Moreover, for better handy and effective use of CRlncRNA, we provided the tools for data browsing, searching and download, as well as the service of online BLAST, genome browser and gene network visualization. With this user-friendly interface as well as the highly valuable molecular and clinicopathological data, we believe CRlncRNA could serve as a productive tool for researchers in the field of lncRNA and cancer.
The data source for CRlncRNA
An overview of CRlncRNA framework was shown in Fig. 1. First, we used ‘lncRNA’, ‘lincRNA’, ‘long noncoding RNA’ and ‘cancer’ as keywords to search PubMed database . Then, the abstract and the full text of selected articles were manually screened to extract cancer-related lncRNAs and their detailed information of annotation, such as cancer type, binding factors, related cancer hallmarks. In addition, a summary description of each lncRNA was produced. In order to provide a comprehensive description, we also integrated information from lncRNAdb , lnc2Cancer  and lncRNADisease . Only the lncRNAs that satisfy some specific criteria were adopted so as to ensure the high reliability of our dataset. That is, the selected lncRNAs were either differentially expressed in cancer (as verified by qRT-PCR), co-occurred with a significant pertinence to clinicopathological parameters (e.g., tumor differentiation, clinical stage, survival time); or else, were proven by functional experiments (e.g., colony formation assay, matrigel invasiveness assay, xenograft mouse model, and metastasis nude mouse model) to participate in cancer development.
To present more informative data, we first manually searched lncRNA IDs collected from the published literature through a selection of databases, such as HGNC, UCSC, NCBI, ENSEMBL, GENCODE, to get a corrected ID correspondence table. Next, we integrated genome-wide information from different general databases. For example, the histone modification data for H3K4me3 and H3K27ac markers were downloaded from ENCODE . The phastCons conservation score, repeat elements and SNP information were downloaded from UCSC table browser . LncRNA gene expression profiles were derived from Illumina human body map (https://www.ebi.ac.uk/gxa/experiments/E-MTAB-513/Results) and TANRIC database . In addition, we manually collected the interactions between lncRNAs and proteins from literature, including direct and indirect, upstream and downstream interactions. In order to more effectively utilize CRlncRNA, we provided a variety of tools for data browsing, searching and download, as well as the services of online BLAST, genome browser and gene network visualization.
Database architecture and implementation details
We developed a series of python scripts to import data in previous step to SQLite database step-by-step. In addition, we built a user-friendly web interface with Bootstrap (version 3.3.7) and jQuery (version 2.1.0) for users to query and visualize the collected data, as well as online servers. The principal functions of CRlncRNA include search, browse and network analysis. The network visualization was conducted by modifying the code from http://www.regulatorynetworks.org. The BLAST server was constructed by django-blastplus (version 0.4.0) and NCBI BLAST+ (version 2.3.0) . We offered an interactive web-based platform for visualizing human genome datasets by use of Dalliance . The web server of CRlncRNA runs on a dedicated Linux machine with the Nginx (version 1.9.9) and uWSGI (version 2.0.14). The server itself is a 4 Intel(R) Xeon(R) CPU E5–2640 v3 @ 2.60GHz with 8 Gigabytes of RAM. The application architecture consisted of several python web application according to the Django framework (version 1.9). CRlncRNA is supported by main standard-compliant web browsers such as Firefox, Google Chrome, Internet Explorer and Safari.
Browsing and searching the database
Users can browse all cancer-related lncRNAs directly on the ‘Browse’ page. There are two ways for browsing in CRlncRNA: (1) the way of ‘By alphabet’ will present lncRNAs in alphabetical order (Fig. 2a); while (2) ‘By tissue type’ will arrange lncRNAs according to their expressed tissues (Fig. 2b). Correspondingly, in the ‘Home’ page, there also provides a quick entry for browsing through the hyperlinks in ‘By alphabet’ and ‘By tissue type’.
CRlncRNA provide two kinds of keyword search services. One is the ‘Quick Search’ (on the top menu, Fig. 2c), in point to keywords including lncRNA name, tissue type or cancer type. The ‘Advanced Search’ (in the ‘Search’ page, Fig. 2d) performs a combined search. For example, you can find lncRNAs which were expressed in brain and have a relationship with glioma by inputting ‘brain’ in ‘tissue’ input box and ‘glioma’ in ‘cancer’ input box.
In addition, CRlncRNA also has web-based BLAST server for sequence similarity search (Fig. 2f), to assist users in identifying and annotating novel lncRNA transcripts.
Network visualization service
Apart from the cancer-related lncRNAs, CRlncRNA also collected 1238 interactions between cancer-related lncRNAs and various factors (including RNAs and proteins), all of which are supported by experimental evidences. Users can visualize this complicated network using the network visualization service in ‘Network’ page (Fig. 2e). They can alternatively retrieve different types of interactions by the option of ‘Association type’ drop-down menu; for example, ‘all’ for all interactions, ‘binding’ for direct interactions, while, ‘association’ for indirect associations. The ‘cancer hallmarks’ button is used for selectively exhibiting the specific cancer-hallmark-related sub-network (the default value is proliferation sub-network). There are 7 cancer hallmarks to be chosen: proliferation, migration, metastasis, invasion, apoptosis, prognosis and EMT (epithelial-mesenchymal transition). The ‘genes’ button could assign one or more genes interested to display their sub-network. In addition, the built-in search function in the network visualization service enables the rapid position of a particular gene in the network.
The detail information page
For each lncRNA, CRlncRNA intended to collect the comprehensive information as much as possible. The detailed information page for a specific lncRNA can be accessed by clicking on the lncRNA name. As shown in Fig. 3, the collected information includes the following parts: (1) gene basic information, such as lncRNA symbol, location, the relevant tissues and cancer types (Fig. 3a); (2) the description of lncRNA, which contains a paragraph of detailed description on the lncRNA’s function and its clinicopathological and molecular features, as summarized from literature (Fig. 3b); (3) the cancer-related information, such as cancer type/pathway/hallmark, which is organized in tabular form and can be sorted and searched (Fig. 3c); (4) the lncRNA expression profiles in 16 normal tissues from Illumina human body map (Fig. 3d); (5) the lncRNA expression profiles in 14 cancer and paracancerous tissue pairs from TANRIC database (Fig. 3e); (6) the genome browser, which includes different annotation tracks (gene structure, epigenetics data, revolutionary conversion, etc.) (Fig. 3f); (7) the FASTA format sequence (Fig. 3g).
Statistics of the database content
In the current version of CRlncRNA, we collected 355 entries of cancer-related lncRNAs, covering 1072 cancer-lncRNA associations with regard to 76 types of cancer, and 1238 interactions relevant to lncRNAs and different factors (RNAs and proteins). Compared with other cancer-related lncRNA databases (Table 1), CRlncRNA worked on providing a golden standard dataset for the later experimental and in-silico studies of cancer-related lncRNAs, thereby, adopted the most rigid standards for data acquisition. Moreover, CRlncRNA provided more full-scale annotations compared with others, particularly in the aspect of clinicopathological information, which is valuable for better translation from the basic research of cancer-related lncRNA to clinical research.
Distribution of different lncRNA subtypes
According to the gene/transcript biotype classification system in GENCODE & Ensembl (https://www.gencodegenes.org/pages/biotypes.html), we counted the distribution of different lncRNA subtypes based on their genomic location (Fig. 4a). Similar to those collected in GENCODE, the vast majority of cancer-related lncRNAs could be classified as ‘lincRNA’ and ‘antisense’, while the lncRNAs of ‘3prime_overlapping’ are less than 1%. On the other hand, the percentage of ‘sense_overlapping’ cancer-related lncRNA is obviously higher than in GENCODE (18.3% vs. 1.19%), while the percentage of ‘sense_intronic’ cancer-related lncRNA is obviously lower than in GENCODE (1.37% vs. 5.77%).
LncRNAs related to different cancer hallmarks
We added up the number of cancer-related lncRNA associated with different cancer hallmarks (Fig. 4b). In total, there are 116, 127, 43, 42, 94, 196 and 162 lncRNAs related to ‘invasion’, ‘migration’, ‘metastasis’, ‘EMT’, ‘apoptosis’, ‘proliferation’ and ‘prognosis’, respectively. The most noteworthy is there are 11 lncRNAs (SPRY4-IT1, GAS5, LINC01133, HOTAIR, TUG1, ROR, MALAT1, H19, BANCR, NEAT1, HOTTIP) associated with all of the seven cancer hallmarks. In addition, the Venn diagram demonstrated that, despite the fact that there is an obvious overlapping between different hallmarks, a lot of unique lncRNAs emerged in hallmarks ‘prognosis’ (46 lncRNAs) and ‘proliferation’ (28 lncRNAs).
Cancer-related lncRNAs in the top ten enriched cancers
We also counted the number of the reported cancer-related lncRNAs in the top ten enriched cancers (Fig. 4c). Compared with cancer-related lncRNAs, the studies on cancer-related proteins were far more comprehensive. Correspondingly, for some cancer types, such as gastric cancer, breast cancer, and colorectal cancer, it is not unexpected that the number of the reported cancer-related proteins is greater than that of lncRNAs in the same cancer. But in the cancer types of acute myelocytic leukemia, T acute lymphoblastic leukemia, chronic lymphocytic leukemia, glioma and melanoma, we were surprised to see that the amount of the known cancer-related lncRNAs had much exceeded that of proteins, especially in three types of leukemia. For example, 80 lncRNAs are associated with acute myelocytic leukemia, while only 8 proteins relevant to it.
Hub lncRNAs in the cancer-related lncRNA network
It is apparent that there was a complicated interaction network between cancer-related lncRNAs and various factors (RNAs and proteins). Hence, we systematically summarized the top ten hub lncRNAs and hub factors. As shown in Fig. 4d, the list of top ten hub factors contained many vital proteins (e.g., cadherin, vimentin and EZH2) associated with cellular signaling, cell migration and invasion, and epigenetic regulation, and other star genes in the cancer studies (like p53, p21). While in the list of top ten hub lncRNAs, HOTAIR, MALAT1 and H19 are ranking the first three, wherein HOTAIR is the biggest hub node in the network and linked with 109 factors (including P53, P21 and P16). HOTAIR is a relatively well-studied oncogene, its expression level in cancer is an efficient predictor of metastasis and survival [9, 45, 46].
More and more studies on cancer-related lncRNAs have identified the crucial roles of lncRNAs in cancer processes [4, 47]. Thereby, it is very important to summarize and integrate the information of these cancer-related lncRNAs for conducting the subsequent clinical studies. However, it is worth to think how to achieve this goal. Here, we confined the cancer types strictly within those malignant tumors with the exclusion of benign tumors that are usually localized and do not spread to other parts of the body. For example, pituitary adenoma and neurofibromatosis type 1, which had been included in Lnc2Cancer database , are non-cancerous and benign tumors. The second question is how to define cancer-related lncRNA. Due to the excessive false positive, the data of cancer-related lncRNAs, if only with evidences from prediction or high-throughput experiments, are likely insufficient for the researches afterward; particularly when utilizing these data for developing machine learning approaches, it’s not instrumental. Considering that, we persist in two criteria for constructing CRlncRNA: (1) Only targeting cancer -- the malignant tumor; (2) Strict inclusion standard, only including lncRNAs that have low-throughput functional experiments evidence. We hope that our CRlncRNA database could provide a golden standard dataset for future cancer-related lncRNA studies.
In CRlncRNA, we collected many molecular and clinicopathological data about cancer-related lncRNAs, which may provide references and insights for investigating lncRNA’s roles in cancer. During the preliminary statistical analysis, we found that the percentage of ‘sense_overlapping’ cancer-related lncRNAs is remarkably above the average of all lncRNAs (nearly more than 15 times), which would potentially facilitate the studies of cancer-related lncRNAs on the identification of molecular mechanisms and the development of novel prediction algorithm. In addition, there are 11 lncRNAs related to all the seven cancer hallmarks when we tested the relevance between cancer-related lncRNAs and cancer hallmarks. These lncRNAs may play important roles in cancer occurrence and development, and may be the most valuable targets for cancer therapy. A much more interesting result came from the quantitative comparison between reported cancer-related lncRNAs and proteins in different cancer types. Despite just the beginning of studies of cancer-related lncRNAs, the amount of lncRNAs in some cancer types (like acute myelocytic leukemia, T acute lymphoblastic leukemia, chronic lymphocytic leukemia) is much more than that of proteins, implying lncRNAs can be potentially as novel biomarkers for blood cancer diagnosis and therapy.
In order to better promote the cancer-related lncRNA studies, CRlncRNA provided a variety of web-based tools. For example, gene network visualization service can help to survey lncRNA-involved interaction network, genome browser to search and visualize for lncRNA’s genomic neighborhood. In the future, we will add more online tools in CRlncRNA. For example, we are exploiting the CRlncRNA data to develop an algorithm for predicting cancer-related lncRNAs, which will provide the online service before long. In addition, other than periodically retrieving the freshly-published literature, we plan to integrate more clinicopathological information in CRlncRNA in the aspect of data collection, such as survival, radiation sensitivity and drug resistance. The study of cancer-related lncRNA had been a multi-discipline crossed hot field, which would be bound to affect our conception of cancer, from its causative origins to the design and prescription of treatments, profoundly. Meanwhile, this field is still in its infancy, and we are far from comprehending lncRNAs and incorporating it into the clinical application. We believe that a rigid standard for cancer-related lncRNA database, with multiple data sources and more online services, will offer an important platform for further study and more pronounced understanding of lncRNA functions and mechanisms both in physiological and pathological condition.
In this work, we presented CRlncRNA, a manually curated database for the cancer-related lncRNAs, which offers not only the experimental validated molecular mechanisms and the related clinicopathological features of collected items, but also many useful tools to perform data-mining. The aim of CRlncRNA is to provide a more orientated curated database that only targets to cancer -- the malignant tumor. In addition, for the sake of the identification of cancer-related lncRNAs in future, CRlncRNA could provide golden standard datasets which are highly credible and well-informed for the development of tools and methodology.
Long intergenic non-coding RNA
Long non-coding RNA
Hanahan D, Weinberg RA. The hallmarks of cancer. Cell. 2000;100(1):57–70.
Hanahan D, Weinberg RA. Hallmarks of cancer: the next generation. Cell. 2011;144(5):646–74.
Fouad YA, Aanei C. Revisiting the hallmarks of cancer. Am J Cancer Res. 2017;7(5):1016–36.
Schmitt AM, Chang HY. Long noncoding RNAs in Cancer pathways. Cancer Cell. 2016;29(4):452–63.
Muller PA, Vousden KH. Mutant p53 in cancer: new functions and therapeutic opportunities. Cancer Cell. 2014;25(3):304–17.
Cheetham SW, Gruhl F, Mattick JS, Dinger ME. Long noncoding RNAs and the genetics of cancer. Br J Cancer. 2013;108(12):2419–25.
Cabili MN, Trapnell C, Goff L, Koziol M, Tazon-Vega B, Regev A, Rinn JL. Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses. Genes Dev. 2011;25(18):1915–27.
Akers JC, Gonda D, Kim R, Carter BS, Chen CC. Biogenesis of extracellular vesicles (EV): exosomes, microvesicles, retrovirus-like vesicles, and apoptotic bodies. J Neuro-Oncol. 2013;113(1):1–11.
Bolha L, Ravnik-Glavac M, Glavac D. Long noncoding RNAs as biomarkers in Cancer. Dis Markers. 2017;2017:7243968.
Mathieu EL, Belhocine M, Dao LT, Puthier D, Spicuglia S. Functions of lncRNA in development and diseases. Med Sci (Paris). 2014;30(8–9):790–6.
Fatica A, Bozzoni I. Long non-coding RNAs: new players in cell differentiation and development. Nat Rev Genet. 2014;15(1):7–21.
Rossi MN, Antonangeli F. LncRNAs: new players in apoptosis control. Int J Cell Biol. 2014;2014:473857.
Xiong H, Ni Z, He J, Jiang S, Li X, He J, Gong W, Zheng L, Chen S, Li B, et al. LncRNA HULC triggers autophagy via stabilizing Sirt1 and attenuates the chemosensitivity of HCC cells. Oncogene. 2017;36(25):3528–40.
Zhao XY, Lin JD. Long noncoding RNAs: a new regulatory code in metabolic control. Trends Biochem Sci. 2015;40(10):586–96.
Huarte M. The emerging role of lncRNAs in cancer. Nat Med. 2015;21(11):1253–61.
Bergmann JH, Spector DL. Long non-coding RNAs: modulators of nuclear structure and function. Curr Opin Cell Biol. 2014;26:10–8.
Spitale RC, Tsai MC, Chang HY. RNA templating the epigenome: long noncoding RNAs as molecular scaffolds. Epigenetics-Us. 2011;6(5):539–43.
Davidovich C, Cech TR. The recruitment of chromatin modifiers by long noncoding RNAs: lessons from PRC2. RNA. 2015;21(12):2007–22.
Zhao J, Sun BK, Erwin JA, Song JJ, Lee JT. Polycomb proteins targeted by a short repeat RNA to the mouse X chromosome. Science. 2008;322(5902):750–6.
Gong C, Maquat LE. lncRNAs transactivate STAU1-mediated mRNA decay by duplexing with 3’ UTRs via Alu elements. Nature. 2011;470(7333):284–8.
Abdelmohsen K, Panda AC, Kang MJ, Guo R, Kim J, Grammatikakis I, Yoon JH, Dudekula DB, Noh JH, Yang X, et al. 7SL RNA represses p53 translation by competing with HuR. Nucleic Acids Res. 2014;42(15):10099–111.
Salmena L, Poliseno L, Tay Y, Kats L, Pandolfi PP. A ceRNA hypothesis: the Rosetta stone of a hidden RNA language? Cell. 2011;146(3):353–8.
Tay Y, Rinn J, Pandolfi PP. The multilayered complexity of ceRNA crosstalk and competition. Nature. 2014;505(7483):344–52.
Zhang EB, Kong R, Yin DD, You LH, Sun M, Han L, Xu TP, Xia R, Yang JS, De W, et al. Long noncoding RNA ANRIL indicates a poor prognosis of gastric cancer and promotes tumor growth by epigenetically silencing of miR-99a/miR-449a. Oncotarget. 2014;5(8):2276–92.
Zhao JJ, Hao S, Wang LL, Hu CY, Zhang S, Guo LJ, Zhang G, Gao B, Jiang Y, Tian WG, et al. Long non-coding RNA ANRIL promotes the invasion and metastasis of thyroid cancer cells through TGF-beta/Smad signaling pathway. Oncotarget. 2016;7(36):57903–18.
Zhu HX, Li XC, Song YR, Zhang P, Xiao YJ, Xing YF. Long non-coding RNA ANRIL is up-regulated in bladder cancer and regulates bladder cancer cell proliferation and apoptosis through the intrinsic pathway. Biochem Bioph Res Co. 2015;467(2):223–8.
Yuan JH, Yang F, Wang F, Ma JZ, Guo YJ, Tao QF, Liu F, Pan W, Wang TT, Zhou CC, et al. A long noncoding RNA activated by TGF-beta promotes the invasion-metastasis cascade in hepatocellular carcinoma. Cancer Cell. 2014;25(5):666–81.
Tsang WP, Ng EK, Ng SS, Jin H, Yu J, Sung JJ, Kwok TT. Oncofetal H19-derived miR-675 regulates tumor suppressor RB in human colorectal cancer. Carcinogenesis. 2010;31(3):350–8.
Luo M, Li Z, Wang W, Zeng Y, Liu Z, Qiu J. Long non-coding RNA H19 increases bladder cancer metastasis by associating with EZH2 and inhibiting E-cadherin expression. Cancer Lett. 2013;333(2):213–21.
Vennin C, Spruyt N, Dahmani F, Julien S, Bertucci F, Finetti P, Chassat T, Bourette RP, Le Bourhis X, Adriaenssens E. H19 non coding RNA-derived miR-675 enhances tumorigenesis and metastasis of breast cancer cells by downregulating c-Cbl and Cbl-b. Oncotarget. 2015;6(30):29209–23.
Han D, Gao X, Wang M, Qiao Y, Xu Y, Yang J, Dong N, He J, Sun Q, Lv G, et al. Long noncoding RNA H19 indicates a poor prognosis of colorectal cancer and promotes tumor growth by recruiting and binding to eIF4A3. Oncotarget. 2016;7(16):22159–73.
Liu C, Chen Z, Fang J, Xu A, Zhang W, Wang Z. H19-derived miR-675 contributes to bladder cancer cell proliferation by regulating p53 activation. Tumour Biol. 2016;37(1):263–70.
Liang WC, Fu WM, Wong CW, Wang Y, Wang WM, Hu GX, Zhang L, Xiao LJ, Wan DCC, Zhang JF, et al. The lncRNA H19 promotes epithelial to mesenchymal transition by functioning as miRNA sponges in colorectal cancer. Oncotarget. 2015;6(26):22513–25.
Chen G, Wang Z, Wang D, Qiu C, Liu M, Chen X, Zhang Q, Yan G, Cui Q. LncRNADisease: a database for long-non-coding RNA-associated diseases. Nucleic Acids Res. 2013;41(Database issue):D983–6.
Ning S, Zhang J, Wang P, Zhi H, Wang J, Liu Y, Gao Y, Guo M, Yue M, Wang L, et al. Lnc2Cancer: a manually curated database of experimentally supported lncRNAs associated with various human cancers. Nucleic Acids Res. 2016;44(D1):D980–5.
Li J, Han L, Roebuck P, Diao L, Liu L, Yuan Y, Weinstein JN, Liang H. TANRIC: An interactive open platform to explore the function of lncRNAs in Cancer. Cancer Res. 2015;75(18):3728–37.
Ren C, An G, Zhao C, Ouyang Z, Bo X, Shu W. Lnc2Catlas: an atlas of long noncoding RNAs associated with risk of cancers. Sci Rep. 2018;8(1):1909.
Liu Y, Zhao M. lnCaNet: pan-cancer co-expression network for human lncRNA and cancer genes. Bioinformatics. 2016;32(10):1595–7.
Coordinators NR. Database resources of the National Center for biotechnology information. Nucleic Acids Res. 2016;44(D1):D7–19.
Quek XC, Thomson DW, Maag JL, Bartonicek N, Signal B, Clark MB, Gloss BS, Dinger ME. lncRNAdb v2.0: expanding the reference database for functional long noncoding RNAs. Nucleic Acids Res. 2015;43(Database issue):D168–73.
Consortium EP. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489(7414):57–74.
Speir ML, Zweig AS, Rosenbloom KR, Raney BJ, Paten B, Nejad P, Lee BT, Learned K, Karolchik D, Hinrichs AS, et al. The UCSC genome browser database: 2016 update. Nucleic Acids Res. 2016;44(D1):D717–25.
Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. BLAST+: architecture and applications. BMC Bioinformatics. 2009;10:421.
Down TA, Piipari M, Hubbard TJ. Dalliance: interactive genome viewing on the web. Bioinformatics. 2011;27(6):889–90.
Hajjari M, Salavaty A. HOTAIR: an oncogenic long non-coding RNA in different cancers. Cancer Biol Med. 2015;12(1):1–9.
Gupta RA, Shah N, Wang KC, Kim J, Horlings HM, Wong DJ, Tsai MC, Hung T, Argani P, Rinn JL, et al. Long non-coding RNA HOTAIR reprograms chromatin state to promote cancer metastasis. Nature. 2010;464(7291):1071–6.
Prensner JR, Chinnaiyan AM. The emergence of lncRNAs in cancer biology. Cancer Discov. 2011;1(5):391–407.
Data analysis supported by HPC Platform, The Public Technology Service Center of Xishuangbanna Tropical Botanical Garden (XTBG), CAS, China.
This work was supported by the National Natural Science Foundation of China (No. 31471220, 91440113), Start-up Fund from Xishuangbanna Tropical Botanical Garden, ‘Top Talents Program in Science and Technology’ from Yunnan Province. Publication of this article was sponsored by the National Natural Science Foundation of China (No. 31371220).
Availability of data and materials
All data generated or analyzed during this study are included in this published article.
About this supplement
This article has been published as part of BMC Medical Genomics Volume 11 Supplement 6, 2018: Proceedings of the 29th International Conference on Genome Informatics (GIW 2018): medical genomics. The full contents of the supplement are available online at https://bmcmedgenomics.biomedcentral.com/articles/supplements/volume-11-supplement-6.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.