Skip to main content

Table 2 Examples of gene pairs correlation changes after batch correction. We illustrate two correlation examples (i) between DYSF and SYNPO2L, where the correlation increases significantly and (ii) between DYSF and SYNPO, where the correlation decreases. Both Spearman and Pearson’s correlations are available to indicate that batch effects are prevalent in both parametric and non-parametric statistics. We see considerable changes on their combined correlation coefficients, which is due to the correction of the variation between studies having been done in different labs by different people. In the case of DYSF - SYNPO2L, originally there seems to be no correlation on the combined samples, despite that a strong positive correlation is observed in each individual series. This bias is removed after batch correction with ComBat, resulting in a positive correlation. The example of the DYSFSYNPO pair shows an initial strong positive correlation before batch correction, while the individual series have mixed positive and negative correlations. Following batch correction this value is reduced

From: MyoMiner: explore gene co-expression in normal and pathological muscle

 

DYSF - SYNPO2L

DYSF - SYNPO

 

Spearman ρ

Pearson r

Spearman ρ

Pearson r

Uncorrected

−0.05

0.02

0.62

0.53

Batch corrected

0.62

0.65

0.36

0.42

GSE47881

0.6

0.67

0.31

0.39

GSE48278

0.3

0.31

−0.4

−0.08

GSE28422

0.67

0.79

0.64

0.71

Average of the 3 series

0.54

0.62

0.21

0.38