Molecular epidemiology of SARS-CoV-2 isolated from COVID-19 family clusters

Gunadi; Wibawa, Hendra; Hakim, Mohamad Saifudin; Marcellus; Trisnawati, Ika; Khair, Riat El; Triasih, Rina; Irene; Afiahayati; Iskandar, Kristy; Siswanto; Anggorowati, Nungki; Daniwijaya, Edwin Widyanto; Supriyati, Endah; Nugrahaningsih, Dwi Aris Agung; Budiono, Eko; Retnowulan, Heni; Puspadewi, Yunika; Puspitawati, Ira; Sianipar, Osman; Afandy, Dwiki; Simanjaya, Susan; Widitjiarso, William; Puspitarani, Dyah Ayu; Fahri, Fadil; Riawan, Untung; Fauzi, Aditya Rifqi; Kalim, Alvin Santoso; Ananda, Nur Rahmi; Setyati, Amalia; Setyowireni, Dwikisworo; Laksanawati, Ida Safitri; Arguni, Eggi; Nuryastuti, Titik; Wibawa, Tri

doi:10.1186/s12920-021-00990-3

Research
Open access
Published: 01 June 2021

Molecular epidemiology of SARS-CoV-2 isolated from COVID-19 family clusters

Gunadi¹^na1,
Hendra Wibawa²^na1,
Mohamad Saifudin Hakim³,
Marcellus⁴,
Ika Trisnawati⁵,
Riat El Khair⁶,
Rina Triasih⁷,
Irene⁸,
Afiahayati⁹,
Kristy Iskandar¹⁰,
Siswanto¹¹,
Nungki Anggorowati¹²,
Edwin Widyanto Daniwijaya¹³,
Endah Supriyati¹⁴,
Dwi Aris Agung Nugrahaningsih¹⁵,
Eko Budiono⁵,
Heni Retnowulan⁵,
Yunika Puspadewi⁶,
Ira Puspitawati⁶,
Osman Sianipar⁶,
Dwiki Afandy⁴,
Susan Simanjaya⁴,
William Widitjiarso⁴,
Dyah Ayu Puspitarani⁴,
Fadil Fahri⁴,
Untung Riawan⁴,
Aditya Rifqi Fauzi⁴,
Alvin Santoso Kalim⁴,
Nur Rahmi Ananda⁵,
Amalia Setyati⁷,
Dwikisworo Setyowireni⁷,
Ida Safitri Laksanawati⁷,
Eggi Arguni⁷,
Titik Nuryastuti³,
Tri Wibawa³ on behalf of
the Yogyakarta-Central Java COVID-19 study group

BMC Medical Genomics volume 14, Article number: 144 (2021) Cite this article

3980 Accesses
11 Citations
15 Altmetric
Metrics details

Abstract

Background

Transmission within families and multiple spike protein mutations have been associated with the rapid transmission of SARS-CoV-2. We aimed to: (1) describe full genome characterization of SARS-CoV-2 and correlate the sequences with epidemiological data within family clusters, and (2) conduct phylogenetic analysis of all samples from Yogyakarta and Central Java, Indonesia and other countries.

Methods

The study involved 17 patients with COVID-19, including two family clusters. We determined the full-genome sequences of SARS-CoV-2 using the Illumina MiSeq next-generation sequencer. Phylogenetic analysis was performed using a dataset of 142 full-genomes of SARS-CoV-2 from different regions.

Results

Ninety-four SNPs were detected throughout the open reading frame (ORF) of SARS-CoV-2 samples with 58% (54/94) of the nucleic acid changes resulting in amino acid mutations. About 94% (16/17) of the virus samples showed D614G on spike protein and 56% of these (9/16) showed other various amino acid mutations on this protein, including L5F, V83L, V213A, W258R, Q677H, and N811I. The virus samples from family cluster-1 (n = 3) belong to the same clade GH, in which two were collected from deceased patients, and the other from the survived patient. All samples from this family cluster revealed a combination of spike protein mutations of D614G and V213A. Virus samples from family cluster-2 (n = 3) also belonged to the clade GH and showed other spike protein mutations of L5F alongside the D614G mutation.

Conclusions

Our study is the first comprehensive report associating the full-genome sequences of SARS-CoV-2 with the epidemiological data within family clusters. Phylogenetic analysis revealed that the three viruses from family cluster-1 formed a monophyletic group, whereas viruses from family cluster-2 formed a polyphyletic group indicating there is the possibility of different sources of infection. This study highlights how the same spike protein mutations among members of the same family might show different disease outcomes.

Peer Review reports

Introduction

Many countries are still struggling to control the COVID-19 pandemic, including Indonesia [1, 2]. On April 15, 2021, Indonesia recorded 1,583,182 confirmed COVID-19 cases with 42,906 deaths and infection rate of approximately 6000 cases/day [3].

One of the most important factors affecting the rapid spreading of COVID-19 is transmission within families [4, 5]. Genomic epidemiology has been suggested to be important to fill the gaps in identifying the SARS-CoV-2 infection sources [6]. However, to our best knowledge, no reports have described the genomic epidemiology within family clusters [6,7,8]. Moreover, multiple spike protein mutations have been associated with a higher transmissibility of SARS-CoV-2 [9]. In this study, we aimed to: (1) perform full genome characterization of SARS-CoV-2 and correlate the sequences with the epidemiological data within family clusters in Indonesia, and (2) conduct phylogenetic analysis of all samples from Yogyakarta and Central Java, Indonesia, involving the family clusters, and virus data from other regions in Indonesia.

Methods

SARS-CoV-2 samples

We collected all virus samples of confirmed COVID-19 patients from Yogyakarta and Central Java provinces from June to November 2020. All nasopharyngeal samples were collected in viral transport media (DNA/RNA Shield™ Collection Tube with Swab, Zymo Research, CA, United States) and transported to four COVID-19 diagnostic laboratories in Yogyakarta province: (1) Molecular Diagnostic Laboratory, Integrated Laboratory Unit, Dr. Sardjito Hospital; (2) Department of Microbiology and Laboratorium Diagnostik Yayasan Tahija World Mosquito Program, Faculty of Medicine, Public Health and Nursing, Universitas Gadjah Mada; (3) Balai Besar Teknik Kesehatan Lingkungan dan Pengendalian Penyakit (BBTKLPP), Yogyakarta; and (4) Disease Investigation Center, Wates, Yogyakarta. SARS-CoV-2 was detected by Real-Q 2019-nCoV Detection Kit (BioSewoom, Seoul, South Korea) with LightCycler® 480 Instrument II (Roche Diagnostics, Mannheim, Germany).

Full-genome sequencing

First, we performed RNA extraction of 19 nasopharyngeal swab samples by a QiAMP Viral RNA mini kit (Qiagen, Hilden, Germany), synthesized the double-stranded cDNA by Maxima H Minus Double-Stranded cDNA Synthesis (Thermo Fisher Scientific, MA, United States), and purified the cDNA using a GeneJET PCR Purification Kit (Thermo Fisher Scientific, MA, United States). For library preparations, we utilized the Nextera DNA Flex for Enrichment using Respiratory Virus Oligos Panel, whereas for full-genome sequencing, we used next generation sequencing (NGS) applied in the Illumina MiSeq instrument (Illumina, San Diego, CA, United States) with Illumina MiSeq reagents v3 150 cycles (2 × 75 cycles). We excluded two samples for further bioinformatics analysis because of low coverages. Our sample genomes were assembled by mapping to the reference genome from Wuhan, China (hCoV-19/Wuhan/Hu-1/2019, GenBank accession number: NC_045512.2) using Burrow-Wheeler Aligner (BWA) algorithm embedded in UGENE v. 1.30 [10]. Identification of single nucleotide polymorphisms (SNPs) was performed using the number of high confidence base calls (consensus sequence variations of the assembly) that disagree with the reference bases for the genome position of interest, then all SNPs were exported to a vcf. file and visualized in MS Excel. The following accession IDs for the 17 samples are: EPI_ISL_516800, EPI_ISL_516806, EPI_ISL_516829, EPI_ISL_525492, EPI_ISL_576383, EPI_ISL_632936, EPI_ISL_610161, EPI_ISL_610162, EPI_ISL_576145, EPI_ISL_632937, EPI_ISL_575331, EPI_ISL_576113, EPI_ISL_576114, EPI_ISL_576115, EPI_ISL_576116, EPI_ISL_576128, and EPI_ISL_576130 [11]. The first four IDs have been reported in our previous study [12].

Phylogenetic analysis

We used the reference genome of hCoV-19/Wuhan/Hu-1/2019 (NC_045512.2) for annotation of our sequences. A dataset of 142 available SARS-CoV-2 genomes (89 sequences from Indonesia and 53 from other countries) was retrieved from GISAID to conduct a phylogenetic analysis (Acknowledgment Table is provided in Additional file 2: Table S2). We only used the full-genome sequences of several strains representing SARS-CoV-2 clades from some countries that had complete genome data and no long stretches of ‘NNNN’ for the phylogenetic analysis. The MAFFT program server was utilized for multiple nucleotide sequence alignment (https://mafft.cbrc.jp/alignment/server/). A phylogenetic tree was constructed from 29.409 nt length of the open reading frame (ORF) of 142 SARS-CoV-2 virus sequences using Neighbor Joining statistical method with 2000 bootstrap replications. The evolutionary distances were computed using the Kimura 2-parameter method and the rate variation among sites was modelled using a gamma distribution with estimated shape parameter (α) for the dataset. The estimation of α gamma distribution was calculated in DAMBE version 7 [13], whereas all the other analyses were performed in MEGA version 10 (MEGA X) [14].

COVID-19 severity classifications

COVID-19 severity was determined based on the WHO classifications: (1) mild, without evidence of hypoxia or pneumonia; (2) moderate, pneumonia but not severe; (3) severe, pneumonia plus one of the following signs: respiratory rate > 30 breaths/minute (or based on age for children), severe respiratory distress, or SpO₂ < 90% in room air; and (4) critical, Acute Respiratory Distress Syndrome (ARDS), sepsis, or septic shock, or other complications [12, 15].

Our study was approved by the Medical and Health Research Ethics Committee of the Faculty of Medicine, Public Health and Nursing, Universitas Gadjah Mada/Dr. Sardjito Hospital (KE/FK/0563/EC/2020). All participants or guardians signed a written informed consent for participating in this study.

Results

Phylogenetic analysis

Phylogenetic analysis revealed that thirteen virus samples were situated within clade GH (GISAID classification), while two viruses were grouped with other viruses which belonged to clade GR, and one virus each that belonged to clade O and clade L (Fig. 1). Three viruses from family cluster case-1 (YO-UGM-10001|EPI_ISL_576113, YO-UGM-10002|EPI_ISL_576114, and YO-UGM-10003|EPI_ISL_576115) formed a single group within clade GH, whereas viruses from family cluster-2 (YO-UGM-1004|EPI_ISL_576116, YO-UGM-1005|EPI_ISL_576128, and YO-UGM-1006|EPI_ISL_576130,) were separated into two different nodes within clade GH (Fig. 1, top-right).

Molecular analysis

Ninety-four SNPs were detected throughout the ORP of the SARS-CoV-2 virus samples with 60% (54/94) of the nucleic acid changes resulting in amino acid substitutions (missense mutations) (Table 1, detailed in Additional file 1: Table S1). The types of nucleic acid base changes were more often detected as transitions (70%) compared to transversions (30%). Higher entropy values were observed more from nucleic acids that carried more frequent base changes; however, nucleic acid changes that caused missense mutation could have lower entropy values than those that resulted in synonymous mutation.

Table 1 Nucleic acid and amino acid mutations observed in seventeen SARS-CoV-2 virus genomes collected from Yogyakarta and Central Java provinces between June and September 2020

Full size table

The majority of the virus samples (16/17) possessed D614G substitution on spike protein and 56% of these (9/16) showed other amino acid substitutions on this protein, including L5F, V83L, V213A, W258R, Q677H, and N811I. Second amino acid mutations that were frequently detected were P232L substitution on NSP12 (RdRp) protein (15x), followed by Q57H substitution on NS3 (14x) and P822L substitution on NSP3 protein (13x). Furthermore, various amino acid mutations were also found in the other proteins of virus samples, including on NSP2 (A205V, V247A, T256I, Q321K), NSP3 (P679S, T1022I, A1179V, T1198K, F1354C, P1665L), NSP4 (A231V), NSP5 (K12R, M49I, P184S), NSP6 (L37F), NSP8 (A21T), NSP9 (L42F), NSP12/RdRp (A97V, P227L, T248I, A656S, H892Y, M906V), NSP13 (T127I, T153I, V169F, M576I, P203L), NSP15 (H337Y), NSP16 (Y222C), NS3 (A54V, A99S, T151I, D222Y), NS7a (H73Y), and N (P13L, A119S, Q160R, S193I, R195S, P199S, R203K, G204R, M234I).

COVID-19’s severity and spike protein mutations of COVID-19 samples

Based on the case definition of COVID-19 severity developed for this study, 3 of 17 virus samples (17.6%) were collected each from asymptomatic cases (people) and critical cases, 5 virus samples (29.4%) from mild cases, and 6 virus samples (35.3%) from moderate cases (Table 2). Two of the patients with critical stages eventually died. A range of Ct values was found amongst different stages of severity, nevertheless all the virus samples with D614G mutations, except one (YO-UGM-10004/2020|EPI_ISL_576116), showed lower Ct values (clade GH, GR, and O, Ct range 16.9–24.7) than those with no mutation in this position (clade L, Ct 27.9). Dual mutations of V213A and D614G on spike protein were detected in four patients, and two of these eventually died after a period of hospitalization.

Table 2 Severity and genetic data associated with SARS-CoV-2 viruses collected from seventeen COVID-19 patients in Yogyakarta and Central Java provinces, Indonesia from June–September 2020

Full size table

Disease outcomes of COVID-19’s family clusters

The epidemiological and clinical data of COVID-19’s family clusters, including clinical symptoms, date of first symptoms appeared, diagnostic results, abnormal findings, comorbidity background are provided by timeline and tabulation in Fig. 2 and Table 3, respectively.

Table 3 Characteristics of patients with COVID-19 from family cluster cases in Yogyakarta and Central Java

Full size table

In family cluster-1, all three patients showed critical COVID-19 and eventually, two died (YO-UGM-10001|EPI_ISL_576113 and YO-UGM-10003|EPI_ISL_576115) and one survived (YO-UGM-10002|EPI_ISL_576114). The disease began from patient-1.1, a 28-year-old male, who had a history of traveling from the local COVID-19 transmission area. He complained of fever, sore throat, cough and malaise on August 8th, 2020, and was tested for PCR three days afterward with the result of COVID-19 positive. His father (patient-1.2, 58-yo), who was living in the same house, showed fever, dyspnea and diarrhea on August 13th, then was followed by his grandfather (patient-1.3, 88-yo) who showed fever and dyspnea on August 18th. The PCR tests for both patients showed positive for COVID-19. All patients developed severe disease outcomes including bilateral pneumonia, cardiomegaly and ARDS. Several comorbidities were recorded from patient-1.1 (obesity), patient-1.2 (diabetes mellitus, and obesity), and patient-1.3 (type 2 diabetes mellitus, geriatric syndrome, history of infarction stroke). Patient-1.1 was uneventfully discharged from the hospital on day 29 of hospitalization, but sadly, patient-1.2 and patient-1.3 passed away in the hospital after 7 and 12 days of hospitalization, respectively.

Family cluster-2 involved three patients which were comprised of a son, 8-yo (patient-2.1), father, 35-yo (patient-2.2) and mother, 33-yo, with the following virus samples: YO-UGM-10006 (EPI_ISL_576130), YO-UGM-10004/2020 (EPI_ISL_576114), and YO-UGM-10005/2020 (EPI_ISL_576128), respectively. Prior to the index case, this family travelled from West Nusa Tenggara to Yogyakarta on August 2nd, 2020, in order to obtain medical treatment for patient-2.1 who had an autoimmune disorder in a hospital in Yogyakarta. Patient-2.1 had firstly exhibited symptoms of fever, cough, and runny nose on August 11th, 2020 and he was diagnosed COVID-19 positive on August 18th. His parents showed clinical signs of cold, headache and dry cough (patient-2.2) and cough, sore throat, and runny nose (patient-2.3) at the same day on August 20th and the PCR results of both patients were positive on August 21st. Patient-2.1 developed moderate severity with bilateral paracardial infiltrate, whereas patient-2.2 and patient-2.3 developed mild disease without any abnormalities in their chest X-rays and other laboratory findings, except eosinophilia (6.4%), increased levels in pH of the blood (7.45), PaO₂ (83.2), PaCO₂ (39.2) with SaO₂ 96.8% and PaO₂/FiO₂ value was 416 from the arterial blood glass analysis of patient-2.2. All three patients uneventfully recovered and were discharged from the hospital on September 2nd, 2020.

Molecular characterizations of virus samples collected from family clusters

Phylogenetic analysis revealed that the three viruses from family cluster-1 were grouped together from a single node. A matrix of nucleic acid difference showed that YO-UGM-10001|EPI_ISL_576113 and YO-UGM-10003|EPI_ISL_576115 were identical on their ORF (nucleic acid and protein levels) and both virus strains had differences of 2 nucleic acids and 1 amino acid in the NSP2 protein which correspond with V247A substitution in YO-UGM-10001|EPI_ISL_576113 and YO-UGM-10003|EPI_ISL_576115 and T256I substitution in YO-UGM-10002|EPI_ISL_576114, respectively (Table 4). Other unique mutations in the other viral proteins were detected in these three virus strains which were not shown in the other study viruses, including V213A (Spike), K12R (NSP5), T248I (NSP12/RdRp), A119S and S193I (N). The virus samples from family cluster-2 were separated in different nodes in the phylogenetic tree (Fig. 3). The tree and the matrix sequence showed that YO-UGM-10005/2020|EPI_ISL_576128 and YO-UGM-10006|EPI_ISL_576130 were genetically identical. Both virus strains had 15 nucleic acid differences compared to YO-UGM-10004|EPI_ISL_576114 which resulted in amino acid variations detected in several viral proteins (Table 4).

Table 4 Amino acid mutations detected in SARS-CoV-2 viruses collected from two family cluster cases in Yogyakarta and Central Java provinces

Full size table

Discussion

Our study provides evidence of SARS-CoV-2 transmission within families, in which the same mutation of the spike protein in each family cluster was identified. It is important to understand the transmission routes of SARS-CoV-2 to prevent and control its spreading [4]. Families have been reported as the most dominant infection cluster of COVID-19 [16]. Family clusters have a higher risk of cross-infection because of frequent and close contact among each family member [4]. Our study also documented that although all family members showed the same multiple S protein mutations, however, they revealed different outcomes. While multiple S protein mutations, i.e. B.1.1.7 variant, have been associated with the severity of COVID-19 [17, 18], this is not the case for our patients. Our samples did not consist of B.1.1.7 variant. In addition, several prognostic factors have been associated with increased risk of severity and mortality of COVID-19, including increasing age, obesity, and comorbidities such as hypertension, diabetes and cerebrovascular disease [15]. Our patients who eventually died (YO-UGM-10001|EPI_ISL_576113 and YO-UGM-10003|EPI_ISL_576115) have more prognostic factors than the patient who survived (YO-UGM-10002|EPI_ISL_576114) (Table 3). Besides the SARS-CoV-2 variants and prognostic factors, a recent GWAS identified rs11385942 at locus 3p21.31 and rs657152 at locus 9q34.2 as a genetic risk factor for severe COVID-19 [19]. Further study is necessary to confirm whether these polymorphisms might be as susceptible factors in our patients.

Double mutations of V213A and D614G on spike protein were detected in four patients, but three of them (75%) developed severe diseases causing critical conditions and two (50%) with fatal outcome. Another interesting finding was documented from family cluster-1, in which all the virus samples in this family cluster belong to the same clade GH, but two patients died and one survived (Table 2). All samples from family cluster-1 revealed another spike protein mutation, V213A, besides D614G. However, virus samples isolated from fatal disease outcomes carried V247A mutation in the NSP2 protein, while those from the recovered patient did not. In conjunction with D614G mutation, substitution of valine (V) to alanine (A) in position 247 and 213 of NSP2 and spike protein, respectively, were detected in the patients with fatal disease outcomes. While both V and A, as well as G are in the non-polar hydrophobic amino acid group and no evidence shows that the double mutations of V213A and D614G affect the severity and lethality of COVID-19 patients, further investigations are necessary to determine whether these dual mutations (V213A and D614G in spike protein) or even triple mutations (V213A and D614G in spike protein and V47A in NS2) associated with increased risk of mortality in COVID-19 patients. Moreover, due to limited number of sample size in this study, it is very difficult to associate between the number of mutations on the spike protein or other proteins or SNPs and severity of COVID-19.

Phylogenetic analysis revealed that three viruses from family cluster-1 formed a monophyletic group. The epidemiological and genetic data indicated that local transmission occurred in family cluster-1 in which patient-1.1 (YO-UGM-10002|EPI_ISL_576114) was initially infected and then transmitted the virus to patient-1.2 (YO-UGM-10003|EPI_ISL_576115) and patient-1.3 (YO-UGM-10001|EPI_ISL_576113). Interestingly, the virus that infected patient 2.2 in family cluster-2 was genetically different from that which infected both two counterparts: patient 2.1 (YO-UGM-10006/2020 (EPI_ISL_576130) and patient 2.3 (YO-UGM-10005/2020 (EPI_ISL_576128). These viruses formed a polyphyletic group indicating there is the possibility of different sources of infection (two convergent descendants, but not their common ancestors).

Recently, more than 50% of the viral genome sequences in the UK were reported to have a new single phylogenetic cluster, i.e. B.1.1.7 variant (multiple spike protein mutations: deletion 69–70, deletion 144, N501Y, A570D, D614G, P681H, T716I, S982A, D1118H) [9]. These new variants have been associated with a higher transmissibility of SARS-CoV-2 up to 70% [9]. Until the submission date of April 2021 in GISAID, these variants were also detected in Asia, including Indonesia [11]. Interestingly, we detected other spike protein mutations in our collected virus strains, including those from the family clusters, i.e. L5F, V213A, W258R, Q677H, and K811I. Noteworthy, the V213A variant was identified in all patients from family cluster-1. V213A was detected in 4/17 (23.5%) of our samples. This variant is only found in only 0.01% of samples in four countries, including Indonesia [11]. Whether this variant is due to a founder effect needs further study.

Currently, besides the D614G variant, several mutations within the receptor binding domains (RBD) of the S protein have attracted most scientists’ attention due to their increased frequency in certain countries, including S477N (Australia and some Central European), N439K (UK and European), and N501Y (part of the new UK variant B.1.1.7, the new South Africa variant 501.V2 and the new Brazil variant P.1) [11]. These variants might be associated with some potential advantages for these viruses. While the B.1.1.7 variant has been associated with COVID-19 clinical severity [17, 18], the 501.V2 and P.1 variants have not [20].

In addition, among eight clades in the GISAID classification, we only detected five clades, i.e. L, G, GH, GR, and O, in the SARS-CoV-2 samples from Indonesia and most of them (~ 60%) contained D614G. Globally, D614G has been detected in ~ 97% samples in 182 countries [11]. While a recent study showed that D614G mutation is significantly associated with the increase of SARS-CoV-2 infectivity, competitive fitness, and transmission in primary human airway epithelial cells and hamsters [21], it does not associate with the clinical severity of COVID-19 patients [22]. Moreover, it is difficult to assess the convergent evolution of D614G mutation in our samples since all samples were from Yogyakarta and Central Java and D614G has been already found in most samples (97%) from all over the world [11]. These findings were compatible with previous studies [22, 23]. The hypothesis of convergent evolution for D614G mutation is not supported by the sequence data since almost all 614G variants derived from the same ancestor [23]. Volz et al. [22] proposed a more complex selective landscape in the spike protein for the co-occurring variants between D614G and the neighbouring sites (615 and 613).

Phylogenetic analysis showed that the full-genome sequences of SARS-CoV-2 identified within these family clusters are identical, which strongly indicates a direct transmission within these families. Moreover, our study is also able to determine the virus clades of COVID-19 cases with unknown contact history with a confirmed COVID-19 case. Our findings support a previous suggestion regarding the importance of genomic epidemiology in filling the gaps of identifying SARS-CoV-2 infection sources [6]. Therefore, a full-genome surveillance of SARS-CoV-2 in Indonesia is essential to prevent further transmission of SARS-CoV-2 and to identify any established or new variant that might affect the SARS-CoV-2 transmission and severity.

Notably, our study only included a limited number of family clusters from Yogyakarta and Central Java, Indonesia. These limitations should be considered for interpretations of our findings.

Conclusions

This is the first molecular epidemiology study associating the full-genome sequences of SARS-CoV-2 with the epidemiological and clinical data within family clusters. Phylogenetic analysis revealed that the three viruses from family cluster-1 formed a monophyletic group, whereas viruses from family cluster-2 formed a polyphyletic group indicating there is the possibility of different sources of infection. This study highlights how the same spike protein mutations among members of the same family might show different disease outcomes. Moreover, we also detected multiple spike protein mutations in our samples. Further studies are necessary to clarify the impact of these multiple spike protein mutations in the transmission and severity of SARS-CoV-2 infection, especially in Indonesia.

Availability of data and materials

All data generated or analyzed during this study are included in the submission. The sequence and metadata are shared through GISAID (www.gisaid.org).

Abbreviations

SNPs:: Single nucleotide polymorphisms

References

World Health Organization. https://www.who.int/news-room/detail/27-04-2020-who-timeline---covid-19. Accessed 9 Feb 2021.
Phelan AL, Katz R, Gostin LO. The novel coronavirus originating in Wuhan, China: challenges for global health governance. JAMA. 2020;323:709–10. https://doi.org/10.1001/jama.2020.1097.
Article CAS PubMed Google Scholar
World Health Organization. https://covid19.who.int/table. Accessed 9 Feb 2021.
Liu T, Gong D, Xiao J, Hu J, He G, Rong Z, Ma W. Cluster infections play important roles in the rapid evolution of COVID-19 transmission: a systematic review. Int J Infect Dis. 2020;99:374–80.
Article CAS PubMed PubMed Central Google Scholar
Zhang H, Hong C, Zheng Q, Zhou P, Zhu Y, Zhang Z, Bi Q, Ma T. A multi-family cluster of COVID-19 associated with asymptomatic and pre-symptomatic transmission in Jixi City, Heilongjiang, China, 2020. Emerg Microbes Infect. 2020;9:2509–14.
Article CAS PubMed PubMed Central Google Scholar
Pattabiraman C, Habib F, Harsha PK, Rasheed R, Prasad P, Reddy V, Dinesh P, Damodar T, Hosallimath K, George AK, Kiran Reddy NV, John B, Pattanaik A, Kumar N, Mani RS, Venkataswamy MM, Shahul Hameed SK, Kumar BGP, Desai A, Vasanthapuram R. Genomic epidemiology reveals multiple introductions and spread of SARS-CoV-2 in the Indian state of Karnataka. PLoS ONE. 2020;15:e0243412.
Article CAS PubMed PubMed Central Google Scholar
Lu J, du Plessis L, Liu Z, Hill V, Kang M, Lin H, Sun J, François S, Kraemer MUG, Faria NR, McCrone JT, Peng J, Xiong Q, Yuan R, Zeng L, Zhou P, Liang C, Yi L, Liu J, Xiao J, Hu J, Liu T, Ma W, Li W, Su J, Zheng H, Peng B, Fang S, Su W, Li K, Sun R, Bai R, Tang X, Liang M, Quick J, Song T, Rambaut A, Loman N, Raghwani J, Pybus OG, Ke C. Genomic epidemiology of SARS-CoV-2 in Guangdong Province. China Cell. 2020;181:997-1003.e9.
Article CAS PubMed Google Scholar
MacLean OA, Orton RJ, Singer JB, Robertson DL. No evidence for distinct types in the evolution of SARS-CoV-2. Virus Evol. 2020;6:veaa034.
Article PubMed PubMed Central Google Scholar
European Centre for Disease Prevention and Control. Rapid increase of a SARS-CoV-2 variant with multiple spike protein mutations observed in the United Kingdom—20 December 2020. Stockholm: ECDC; 2020.
Google Scholar
About UGENE - Unipro UGENE Online User Manual v. 1.30 - WIKI [Internet]. Ugene.net. 2020 [cited 22 December 2020]. https://ugene.net/wiki/display/UUOUM30/About+UGENE.
GISAID. 2020. Pandemic coronavirus causing COVID-19. https://platform.gisaid.org/epi3/cfrontend#8dc5e. Accessed 21 Dec 2020.
Gunadi WH, Marcellus HMS, Daniwijaya EW, Rizki LP, Supriyati E, Nugrahaningsih DAA, Afiahayati S, Iskandar K, Anggorowati N, Kalim AS, Puspitarani DA, Athollah K, Arguni E, Nuryastuti T, Wibawa T. Fulllength genome characterization and phylogenetic analysis of SARS-CoV-2 virus strains from Yogyakarta and Central Java. Indonesia PeerJ. 2020;8:e10575. https://doi.org/10.7717/peerj.10575.
Article CAS PubMed Google Scholar
Xia X. DAMBE7: New and improved tools for data analysis in molecular biology and evolution. Mol Biol Evol. 2018;35:1550–2.
Article CAS PubMed PubMed Central Google Scholar
Kumar S, Stecher G, Li M, Knyaz C, Tamura K. MEGA X: Molecular Evolutionary Genetics Analysis across computing platforms. Mol Biol Evol. 2018;35:1547–9.
Article CAS PubMed PubMed Central Google Scholar
Beeching NJ, Fletcher TE, Fowler R. BMJ best practice. Coronavirus Disease 2019 (COVID-19). https://bestpractice.bmj.com/topics/en-us/3000168/prognosis. Accessed 10 Oct 2020.
Mission J. Report of the WHO-China joint mission on Coronavirus Disease 2019 (COVID-19). WHO; 2020.
Davies NG, Jarvis CI, CMMID COVID-19 Working Group, Edmunds WJ, Jewell NP, Diaz-Ordaz K, Keogh RH. Increased mortality in community-tested cases of SARS-CoV-2 lineage B.1.1.7. Nature. 2021. https://doi.org/10.1038/s41586-021-03426-1.
Article PubMed PubMed Central Google Scholar
Horby P, Huntley C, Davies N, et al. NERVTAG note on B.1.1.7 severity. SAGE meeting report. January 21, 2021.
Severe Covid-19 GWAS Group, Ellinghaus D, Degenhardt F, Bujanda L, Buti M, Albillos A, et al. Genomewide association study of severe Covid-19 with respiratory failure. N Engl J Med. 2020;383(16):1522–34.
Article Google Scholar
CDC. Science Brief: Emerging SARS-CoV-2 Variants https://www.cdc.gov/coronavirus/2019-ncov/science/science-briefs/scientific-brief-emerging-variants.html#ref2. Accessed 14 Apr 2021.
Hou YJ, Chiba S, Halfmann P, Ehre C, Kuroda M, Dinnon KH 3rd, et al. SARS-CoV-2 D614G variant exhibits efficient replication ex vivo and transmission in vivo. Science. 2020;370:1464–8.
CAS PubMed PubMed Central Google Scholar
Volz E, Hill V, McCrone JT, Price A, Jorgensen D, O’Toole Á, et al. Evaluating the effects of SARS-CoV-2 spike mutation D614G on transmissibility and pathogenicity. Cell. 2021;184(1):64-75.e11.
Article CAS PubMed PubMed Central Google Scholar
van Dorp L, Richard D, Tan CCS, Shaw LP, Acman M, Balloux F. No evidence for increased transmissibility from recurrent mutations in SARS-CoV-2. Nat Commun. 2020;11:5986.
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank the technical support from Sri Fatmawati (Faculty of Medicine, Public Health and Nursing, Universitas Gadjah Mada), Safitriani and Muhammad Taufiq Soekarno (PT. Pandu Biosains). We also gratefully acknowledge the authors, the originating and submitting Laboratories for their sequence and metadata shared through GISAID. All submitters of data may be contacted directly via www.gisaid.org. The Acknowledgments Table for GISAID is reported as Additional file 2: Table S2. Yogyakarta-Central Java COVID-19 Study Group members: Elisabeth S. Herini⁷, Titis Widowati⁷, Cahya Dewi Satria⁷, Sumardi⁵, Bambang Sigit Riyanto⁵, Munawar Gani⁵, Satria Maulana⁵, Ludhang Pradipta Rizki³, Umi Solekhah Intansari⁶,‬ Elizabeth Henny Herningtiyas⁶, Nur Imma Fatimah Harahap⁶, Bagoes Poermadjaja², Sintong HMT Hutasoit², Indaryati⁸, Havid Setyawan⁸, Kemala Athollah⁴, Maria Patricia Inggriani⁴.

Funding

Our study was funded by Indonesian Ministry of Research and Technology/National Agency for Research and Innovation. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Author information

Gunadi and Hendra Wibawa contributed equally to this work

Authors and Affiliations

Pediatric Surgery Division, Department of Surgery/Genetics Working Group, Faculty of Medicine, Public Health and Nursing, Universitas Gadjah Mada/Dr, Sardjito Hospital, Jl. Kesehatan No. 1, Yogyakarta, 55281, Indonesia
Gunadi
Disease Investigation Center Wates, Directorate General of Livestock and Animal Health Services, Ministry of Agriculture, Yogyakarta, Indonesia
Hendra Wibawa, Bagoes Poermadjaja & Sintong H. M. T. Hutasoit
Department of Microbiology, Faculty of Medicine, Public Health and Nursing, Universitas Gadjah Mada, Yogyakarta, Indonesia
Mohamad Saifudin Hakim, Titik Nuryastuti, Tri Wibawa & Ludhang Pradipta Rizki
Genetics Working Group, Faculty of Medicine, Public Health and Nursing, Universitas Gadjah Mada, Yogyakarta, Indonesia
Marcellus, Dwiki Afandy, Susan Simanjaya, William Widitjiarso, Dyah Ayu Puspitarani, Fadil Fahri, Untung Riawan, Aditya Rifqi Fauzi, Alvin Santoso Kalim, Kemala Athollah & Maria Patricia Inggriani
Pulmonology Division, Department of Internal Medicine, Faculty of Medicine, Public Health and Nursing, Universitas Gadjah Mada/Dr, Sardjito Hospital, Yogyakarta, Indonesia
Ika Trisnawati, Eko Budiono, Heni Retnowulan, Nur Rahmi Ananda, Sumardi, Bambang Sigit Riyanto, Munawar Gani & Satria Maulana
Department of Clinical Pathology and Laboratory Medicine, Faculty of Medicine, Public Health and Nursing, Universitas Gadjah Mada/Dr, Sardjito Hospital, Yogyakarta, 55281, Indonesia
Riat El Khair, Yunika Puspadewi, Ira Puspitawati, Osman Sianipar, Umi Solekhah Intansari, ‬Elizabeth Henny Herningtiyas & Nur Imma Fatimah Harahap
Department of Child Health, Faculty of Medicine, Public Health and Nursing, Universitas Gadjah Mada/Dr, Sardjito Hospital, Yogyakarta, Indonesia
Rina Triasih, Amalia Setyati, Dwikisworo Setyowireni, Ida Safitri Laksanawati, Eggi Arguni, Elisabeth S. Herini, Titis Widowati & Cahya Dewi Satria
Balai Besar Teknik Kesehatan Lingkungan Dan Pengendalian Penyakit, Yogyakarta, Yogyakarta, Indonesia
Irene, Indaryati & Havid Setyawan
Department of Computer Science and Electronics Faculty of Mathematics and Natural Sciences, Universitas Gadjah Mada, Yogyakarta, Indonesia
Afiahayati
Department of Child Health/Genetics Working Group, Faculty of Medicine, Public Health and Nursing, Universitas Gadjah Mada/UGM Academic Hospital, Yogyakarta, Indonesia
Kristy Iskandar
Department of Physiology, Faculty of Medicine, Public Health and Nursing, Universitas Gadjah Mada/UGM Academic Hospital, Yogyakarta, Indonesia
Siswanto
Department of Anatomical Pathology/Genetics Working Group, Faculty of Medicine, Public Health and Nursing, Universitas Gadjah Mada, Yogyakarta, Indonesia
Nungki Anggorowati
Department of Microbiology, Faculty of Medicine, Public Health and Nursing, Universitas Gadjah Mada, UGM Academic Hospital, Yogyakarta, Indonesia
Edwin Widyanto Daniwijaya
Centre of Tropical Medicine, Faculty of Medicine, Public Health and Nursing, Universitas Gadjah Mada, Yogyakarta, Indonesia
Endah Supriyati
Department of Pharmacology and Therapy/Genetics Working Group, Faculty of Medicine, Public Health and Nursing, Universitas Gadjah Mada, Yogyakarta, Indonesia
Dwi Aris Agung Nugrahaningsih

Authors

Gunadi
View author publications
You can also search for this author in PubMed Google Scholar
Hendra Wibawa
View author publications
You can also search for this author in PubMed Google Scholar
Mohamad Saifudin Hakim
View author publications
You can also search for this author in PubMed Google Scholar
Marcellus
View author publications
You can also search for this author in PubMed Google Scholar
Ika Trisnawati
View author publications
You can also search for this author in PubMed Google Scholar
Riat El Khair
View author publications
You can also search for this author in PubMed Google Scholar
Rina Triasih
View author publications
You can also search for this author in PubMed Google Scholar
Irene
View author publications
You can also search for this author in PubMed Google Scholar
Afiahayati
View author publications
You can also search for this author in PubMed Google Scholar
Kristy Iskandar
View author publications
You can also search for this author in PubMed Google Scholar
Siswanto
View author publications
You can also search for this author in PubMed Google Scholar
Nungki Anggorowati
View author publications
You can also search for this author in PubMed Google Scholar
Edwin Widyanto Daniwijaya
View author publications
You can also search for this author in PubMed Google Scholar
Endah Supriyati
View author publications
You can also search for this author in PubMed Google Scholar
Dwi Aris Agung Nugrahaningsih
View author publications
You can also search for this author in PubMed Google Scholar
Eko Budiono
View author publications
You can also search for this author in PubMed Google Scholar
Heni Retnowulan
View author publications
You can also search for this author in PubMed Google Scholar
Yunika Puspadewi
View author publications
You can also search for this author in PubMed Google Scholar
Ira Puspitawati
View author publications
You can also search for this author in PubMed Google Scholar
Osman Sianipar
View author publications
You can also search for this author in PubMed Google Scholar
Dwiki Afandy
View author publications
You can also search for this author in PubMed Google Scholar
Susan Simanjaya
View author publications
You can also search for this author in PubMed Google Scholar
William Widitjiarso
View author publications
You can also search for this author in PubMed Google Scholar
Dyah Ayu Puspitarani
View author publications
You can also search for this author in PubMed Google Scholar
Fadil Fahri
View author publications
You can also search for this author in PubMed Google Scholar
Untung Riawan
View author publications
You can also search for this author in PubMed Google Scholar
Aditya Rifqi Fauzi
View author publications
You can also search for this author in PubMed Google Scholar
Alvin Santoso Kalim
View author publications
You can also search for this author in PubMed Google Scholar
Nur Rahmi Ananda
View author publications
You can also search for this author in PubMed Google Scholar
Amalia Setyati
View author publications
You can also search for this author in PubMed Google Scholar
Dwikisworo Setyowireni
View author publications
You can also search for this author in PubMed Google Scholar
Ida Safitri Laksanawati
View author publications
You can also search for this author in PubMed Google Scholar
Eggi Arguni
View author publications
You can also search for this author in PubMed Google Scholar
Titik Nuryastuti
View author publications
You can also search for this author in PubMed Google Scholar
Tri Wibawa
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

the Yogyakarta-Central Java COVID-19 study group

Elisabeth S. Herini
, Titis Widowati
, Cahya Dewi Satria
, Sumardi
, Bambang Sigit Riyanto
, Munawar Gani
, Satria Maulana
, Ludhang Pradipta Rizki
, Umi Solekhah Intansari
, ‬Elizabeth Henny Herningtiyas
, Nur Imma Fatimah Harahap
, Bagoes Poermadjaja
, Sintong H. M. T. Hutasoit
, Indaryati
, Havid Setyawan
, Kemala Athollah
& Maria Patricia Inggriani

Contributions

G, HW, MSH, KI, and NA conceived the study. G drafted the manuscript, and HW, MSH, RT, A, KI, S, EA, and TW critically revised the manuscript for important intellectual content. G, MSH, M, IT, REK, RT, I, S, EWD, ES, DAAD, EB, HR, YP, IP, OS, DA, SS, WW, DAP, FF, UW, ARF, ASK, NRA, AS, DS, ISL, and TA collected the data; and G, M, and HW analyzed the data. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Gunadi.

Ethics declarations

Consent to publish

All participants or guardians signed a written informed consent for participating in this study.

Competing interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S1.

Ninety-four SNPs were detected throughout the ORP of the SARS-CoV-2 virus samples with 60% (54/94) of the nucleic acid changes resulting in amino acid substitutions (missense mutations).

Additional file 2: Table S2

The Acknowledgments Table for GISAID is report.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Gunadi, Wibawa, H., Hakim, M.S. et al. Molecular epidemiology of SARS-CoV-2 isolated from COVID-19 family clusters. BMC Med Genomics 14, 144 (2021). https://doi.org/10.1186/s12920-021-00990-3

Download citation

Received: 09 February 2021
Accepted: 18 May 2021
Published: 01 June 2021
DOI: https://doi.org/10.1186/s12920-021-00990-3

Molecular epidemiology of SARS-CoV-2 isolated from COVID-19 family clusters

Abstract

Background

Methods

Results

Conclusions

Introduction

Methods

SARS-CoV-2 samples

Full-genome sequencing

Phylogenetic analysis

COVID-19 severity classifications

Results

Phylogenetic analysis

Molecular analysis

COVID-19’s severity and spike protein mutations of COVID-19 samples

Disease outcomes of COVID-19’s family clusters

Molecular characterizations of virus samples collected from family clusters

Discussion

Conclusions

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Consortia

the Yogyakarta-Central Java COVID-19 study group

Contributions

Corresponding author

Ethics declarations

Consent to publish

Competing interests

Additional information

Publisher's Note

Supplementary Information

Additional file 1: Table S1.

Additional file 2: Table S2

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Medical Genomics

Contact us