
Table 4. Comparative performance on continental-level ancestry classification

From: Human ancestry indentification under resource constraints -- what can one chromosome tell us about human biogeographical ancestry?

| Basic Method | Data Size | Datasets Used | Classification Rate (%) |
| --- | --- | --- | --- |
| STRUCTURE [36] | 664 | Multiple datasets | 96.1 |
| SNPforID [4] | 2689 | 1000 Genomes, HGDP, NIST | 98.8 |
| STRUCTURE [37] | 6410 | Multiple datasets | 81.4 |
| Random match probability [5] | 451 | Own collection | 77.0 (+ 21.6 thresholded out) |
| Proposed | 2504 | 1000 Genomes Phase 3 | 99.19 (614 SNPs) |
| Proposed | 2504 | 1000 Genomes Phase 3 | 96.75 (206 SNPs) |