Skip to main content

Table 4 Comparative Performance on continental-level Ancestry classification

From: Human ancestry indentification under resource constraints -- what can one chromosome tell us about human biogeographical ancestry?

Basic Method Data Size Datasets Used Classification Rate (%)
STRUCTURE [36] 664 Mutiple datasets 96.1
SNPforID [4] 2689 1000 Genome, HGDP, NIST 98.8
STRUCTURE [37] 6410 Mutiple datasets 81.4
Random match probability [5] 451 Own collection 77.0 (+ 21.6 thresholded out)
Proposed 2504 1000 Genome Phase 3 99.19 (614 SNPs)
Proposed 2504 1000 Genome Phase 3 96.75 (206 SNPs)