Skip to main content
Figure 1 | BMC Medical Genomics

Figure 1

From: Integrating human omics data to prioritize candidate genes

Figure 1

Scheme of BRIDGE. (A) A multiple linear regression model is proposed to explain the phenotypic similarity between two diseases using functional similarities between the two sets of genes associated with the diseases. The regression function is S d d ′ = α d + ∑ i = 1 5 β d i ∑ g ∈ G d ∑ g ′ ∈ G d ′ S g g ′ i , where S i dd’ is the phenotypic similarity between two diseases d and d’, S i gg’ the functional similarity between two genes g and g’ derived from the i-th data source, G(d) and G(d’) genes associated with diseases d and d’, respectively. We consider five genomic data sources (PPI, GS, GE, KEGG, and GO) in our model. (B) Given a query disease g and a candidate gene d, we assume the candidate gene is the only one associated with the disease, i.e. G(d) = {g}, and we calculate the coefficient of determination (R2) of the fitted model as a score to measure the strength of association between the disease and the gene. (C) Repeating (B) for every candidate gene, we obtain a score for each candidate. We then rank the candidate genes in non-increasing order according to their scores to obtain a ranking list.

Back to article page