Skip to main content


Fig. 7 | BMC Medical Genomics

Fig. 7

From: Classifying cancer genome aberrations by their mutually exclusive effects on transcription

Fig. 7

Random Forest predictions of gene-mutation status across 25 cancer types. We identified genes that had been mutated in at least 50 tumor samples and that contained no mutation in other genes that had been mutated in at least 50 samples. Then using cross validation, we evaluated how well the Random Forest classification algorithm could identify which gene was mutated in a given sample. The algorithm produced a probabilistic prediction for each gene (class), and we evaluated these predictions using the area under the receiver operating characteristic curve (AUROC). The x-axis labels indicate AUROC values for each gene. Relatively high AUROC values indicate that gene-expression levels are highly predictive of gene-mutation status and thus suggest that mutations in these genes exert a characteristic effect on gene-expression levels

Back to article page