From: Predicting host tropism of influenza A virus proteins using random forest
Dataset | Training dataset | Testing dataset | ||||
---|---|---|---|---|---|---|
 | Positive samples | Negative samples | Total samples | Positive samples | Negative samples | Total samples |
HA | 5449 | 5261 | 10710 | 1344 | 1357 | 2701 |
M1 | 547 | 908 | 1455 | 135 | 219 | 354 |
M2 | 644 | 1038 | 1682 | 178 | 268 | 446 |
NA | 3945 | 4315 | 8260 | 963 | 1051 | 2014 |
NP | 1148 | 2140 | 3288 | 282 | 537 | 819 |
NS1 | 1706 | 2940 | 4646 | 418 | 748 | 1166 |
NS2 | 475 | 1157 | 1632 | 133 | 246 | 379 |
PA | 2135 | 4067 | 6202 | 573 | 997 | 1570 |
PB1 | 1995 | 3189 | 5184 | 504 | 797 | 1301 |
PB1-F2 | 722 | 2206 | 2928 | 167 | 588 | 755 |
PB2 | 2157 | 3327 | 5484 | 565 | 860 | 1425 |
Combined | 3272 | 3923 | 7195 | 799 | 989 | 1788 |