Skip to main content

Table 1 Total number of positive and negative samples for protein datasets and combined dataset.

From: Predicting host tropism of influenza A virus proteins using random forest

Dataset

Training dataset

Testing dataset

 

Positive samples

Negative samples

Total samples

Positive samples

Negative samples

Total samples

HA

5449

5261

10710

1344

1357

2701

M1

547

908

1455

135

219

354

M2

644

1038

1682

178

268

446

NA

3945

4315

8260

963

1051

2014

NP

1148

2140

3288

282

537

819

NS1

1706

2940

4646

418

748

1166

NS2

475

1157

1632

133

246

379

PA

2135

4067

6202

573

997

1570

PB1

1995

3189

5184

504

797

1301

PB1-F2

722

2206

2928

167

588

755

PB2

2157

3327

5484

565

860

1425

Combined

3272

3923

7195

799

989

1788