Molecular diagnostics for congenital hearing loss including 15 deafness genes using a next generation sequencing platform
- Sarah De Keulenaer1,
- Jan Hellemans1, 2,
- Steve Lefever2,
- Jean-Pierre Renard3,
- Joachim De Schrijver3,
- Hendrik Van de Voorde2,
- Mohammad Amin Tabatabaiefar5, 6,
- Filip Van Nieuwerburgh1, 4,
- Daisy Flamez1, 3,
- Filip Pattyn2,
- Bieke Scharlaken1,
- Dieter Deforce1, 4,
- Sofie Bekaert1,
- Wim Van Criekinge1, 3,
- Jo Vandesompele1, 2,
- Guy Van Camp5 and
- Paul Coucke1, 2Email author
© De Keulenaer et al.; licensee BioMed Central Ltd. 2012
Received: 12 December 2011
Accepted: 7 May 2012
Published: 18 May 2012
Hereditary hearing loss (HL) can originate from mutations in one of many genes involved in the complex process of hearing. Identification of the genetic defects in patients is currently labor intensive and expensive. While screening with Sanger sequencing for GJB2 mutations is common, this is not the case for the other known deafness genes (> 60). Next generation sequencing technology (NGS) has the potential to be much more cost efficient. Published methods mainly use hybridization based target enrichment procedures that are time saving and efficient, but lead to loss in sensitivity. In this study we used a semi-automated PCR amplification and NGS in order to combine high sensitivity, speed and cost efficiency.
In this proof of concept study, we screened 15 autosomal recessive deafness genes in 5 patients with congenital genetic deafness. 646 specific primer pairs for all exons and most of the UTR of the 15 selected genes were designed using primerXL. Using patient specific identifiers, all amplicons were pooled and analyzed using the Roche 454 NGS technology. Three of these patients are members of families in which a region of interest has previously been characterized by linkage studies. In these, we were able to identify two new mutations in CDH23 and OTOF. For another patient, the etiology of deafness was unclear, and no causal mutation was found. In a fifth patient, included as a positive control, we could confirm a known mutation in TMC1.
We have developed an assay that holds great promise as a tool for screening patients with familial autosomal recessive nonsyndromal hearing loss (ARNSHL). For the first time, an efficient, reliable and cost effective genetic test, based on PCR enrichment, for newborns with undiagnosed deafness is available.
KeywordsDeafness Next generation sequencing PCR based enrichment Genetic diagnostics
Hearing loss (HL) is the most common birth defect in industrialized countries and the most prevalent sensorineural disorder. One out of every 500 newborns has bilateral permanent sensorineural HL of more than 40 dB HL . It is estimated that in developed countries, genetic causes are responsible in at least two-thirds of prelingual cases. In most of the cases there are no clinical abnormalities other than the hearing loss (i.e. nonsyndromic hearing loss, NSHL). Inherited NSHL is monogenic, with over 100 mapped loci and 46 causally implicated genes .
GJB2 mutations are the most frequent cause of autosomal recessive non-syndromic hearing loss (ARNSHL) and account for about 20 % of the cases . Therefore, newborns that are diagnosed with severe-to-profound HL in the absence of other abnormal findings on physical examination are analyzed for mutations in the GJB2 gene. In some cases, when imaging of the inner ear shows abnormalities such as an enlarged vestibular aquaduct, the SLC26A4 gene is analyzed. Besides these genes there is hardly any other gene that is routinely analyzed in DNA diagnostics. For this reason, a positive result is only obtained in less than 20 % of deaf children for which DNA diagnostics is requested . While substantial progress has been made in recent years in identifying the responsible deafness genes, the key challenge lies in determining which gene is responsible in a patient. Sequencing of all genes by traditional DNA sequencing technology is labor intensive and not cost effective .
Recently the next generation DNA sequencing technology has come of age . Companies such as Roche (454 Genome Sequencer FLX), Applied Biosystems (Solid System) and Illumina (Genome Analyzer) have brought high throughput DNA sequencers to the market that can sequence several hundred million to a few billions of basepairs in a few days. Until now, this technology has mainly been used for research purposes. Several genes for hearing loss have been identified using this technology . Most likely, the identification of other as yet unidentified deafness genes will follow. On the other hand, applications in DNA diagnostics in general are still rare. A first genetic test encompassing the NSHL genes using massively parallel sequencing technology has been described . In these studies, an array-based enrichment approach was used on a limited number of patients. Although this new technology holds the promise to significantly reduce the cost and workload per sample, the enrichment used leads to a significant loss in sensitivity, which is deemed unacceptable according to current standards using PCR amplification. Array based enrichment procedures inherently suffer from incomplete selection for two reasons. Firstly, due to the presence of repetitive sequences, not all fragments can be included in the selection set. Secondly, hybridization based enrichment suffers from selection bias and uneven capture efficiency. Much deeper sequencing will be needed to ensure complete or sufficient coverage of the selected fragments, in comparison to unbiased selection techniques such as PCR. In combination, these factors lead to a reduced sensitivity, while sensitivity requirements for DNA diagnostics are generally required to be very high. Here, we report the evaluation of a PCR based enrichment strategy followed by a 454 NGS approach for 5 patients using NGS implicated in ARNSHL.
Five patients with familial congenital deafness were screened for 15 deafness genes on the 454 Genome Sequencer in order to evaluate if next generation sequencing enables the detection of mutations. Three of these patients are members of families in which a region of interest has previously been characterized by linkage studies. Therefore, mutations in respectively CDH23 (for patients 1 and 2) and OTOF (for patient 4) were expected. In patient 5, a known mutation (c.236 + 1 G > A) in TMC1 had to be confirmed. For patient 3, no mutation was known.
Number of exons
Number of mutations worldwide**
Number of homopolymer repeats* in CDS
Function in hearing process
hair bundle, motor protein
exocytose at auditory ribbon synapse
hair bundle, adhesion protein
extracellular matrix protein
hair bundle, cytoskeletal formation
signaling of hair cells and neurons
hair bundle, cytoskeletal formation
hair bundle, adhesion protein
hair bundle, motor proteins
Number of NGS runs
Standard 1/3 run
Standard 1/3 run
Standard 1/3 run
Standard 1/5 Titanium 1/12
Standard 1/5 Titanium 1/12
24940* (Std) 24192* (Tit)
16767* (Std) 40169* (Tit)
Total average coverage
% sequenced amplicons with coverage > 38
% sequenced amplicons with coverage > 30
% sequenced amplicons with coverage > 5
New mutation found in CDH23
New mutation found in CDH23
No mutation could be clearly identified
New mutation found in OTOF
Known mutation in TMC1 confirmed
For patients 4 and 5, 1/5 of the capacity of a Standard run with 2-lane gasket was used for each patient. The average coverage was lower than expected and therefore an additional Titanium run (1/12) was performed to have extra reads (Table 2), resulting in an average total coverage of 73 and 88 for respectively patient 4 and patient 5 (Table 2).
New variants observed in patients 1, 2 and 4
Grantham score ([0–215])
Patient 1–2 CDH23 NM_022124.4:c.5527 G > T Chr10(NCBI 36):g.73214678 G > T p.Asp1843Tyr
Most likely interfere with function (Class C65)
OTOF NM_194248.2:c.3263 T > C Chr2(NCBI 36):g.26550910 T > C p.Leu1088Pro
Less likely interfere with function (Class C0)
Variants observed in patient 3
NM_022124.5:c.8167 G > C Chr10(GRCh37):g.73566027 G > C p.Val2723Leu
NM_194248.2:c.3636_3637del Chr2(NCBI36):g.26549600_26549601del p.Phe1212fs
Frame shift (The new reading frame ends in a STOP codon 78 positions downstream.)
NM_194248.2:c.2317 C > TChr2(NCBI36):g.26553877 C > T p.Arg773Cys rs80356569
NM_194248.2:c.4936 C > T Chr2(NCBI36):g.26541265 C > T p.Pro1646Ser rs17005371
NM_001142763.1:c.1319A > C Chr10(NCBI36):g.55625450A > C p.Asp440Ala rs4935502
The variant analysis of patient 4 revealed a new homozygous missense mutation in the OTOF gene (exon 26: c.3263 T > C (p.Leu1088Pro)). This mutation is in agreement with linkage analysis, previously performed, suggesting a disease causing mutation in the OTOF gene. Prediction programs revealed the mutation as possibly damaging (PolyPhen) and not tolerated (SIFT) and therefore can most likely be considered as disease causing. The mutation was investigated in all available family members and showed full co-segregation with the deafness. The prediction results of the novel mutations, with the different software approaches, are listed in Table 3.
In patient 5, included as a control, we could confirm the heterozygous mutation in the TMC1 gene previously found with Sanger sequencing, with a relative variant frequency of 0.5. The mutation NM_138691.2:c.236 + 1 G > A was found in 13 reads out of 26. This substitution is located in the donor splice site of intron 7.
We have developed an assay to improve the molecular diagnosis of autosomal recessive nonsyndromic hearing loss (ARNSHL) by simultaneous sequencing of the exons, UTRs and alternative transcripts of 15 deafness genes. The selected list of genes includes the most frequently mutated recessive genes in patients with hearing loss (Table 1). In contrast to hybridization based capture approaches , we performed a PCR based enrichment strategy for all target regions of the 15 genes followed by sequencing with a 454 Genome Sequencer FLX. The optimization of the PCR amplification conditions made an efficient semi-automated high throughput processing of the many PCR reactions feasible and straightforward. The PCR reactions of all difficult amplicons can easily be repeated and sequenced with the conventional Sanger method. We are able to complete the screening in a relatively short period of time. The adapter ligation method (Shotgun protocol) was preferred over the fusion primer approach for amplicon sequencing, in order to reduce the sequencing cost and to obtain a more efficient workflow. When screening a limited number of genes, it is more advantageous to use gene specific primers with the forward and reverse adapters already incorporated in the oligo’s. For the 15 genes (646 amplicons) in our setting, the fusion primer approach is cost prohibitive, since the adapter sequence needs to be incorporated in every single primer set. Therefore, the adapters were ligated in one reaction to the pool of amplicons.
Two new mutations were discovered and a known heterozygous mutation could be confirmed. The depth of sequencing coverage was high (greater than 38) for over 90 % of the amplicons, indicating that the majority of amplicons are covered sufficiently. Sanger sequencing remains indispensable to confirm results, to analyze homopolymer regions and to sequence drop-out amplicons. Since this was a pilot study, we restricted the screening to 5 patients. However, by this approach we are able to screen 15 patients within a single Titanium run, making this workflow cost-effective. While further validation with a larger panel of positive controls is needed, our approach holds great promise. Although it was an advantage that the region of interest had already been localized by linkage analysis for patients 1–2 and 4; we believe that we would have found the mutation without the knowledge of the linked region. The main reason for this is that there were no other relevant mutations found in the remaining 14 analyzed genes that could be causal. The new variants that we found in CDH23 and OTOF were homozygous with a relative frequency of 1. These mutations in CDH23 and OTOF were only found in respectively patients 1–2 and 4 and not in the other analyzed deafness samples (e.g. we often did find the same SNP’s in different patients). At the same time, there was no doubt about the causality of both mutations as predicted by the different software tools.
Genetic counseling prior to analysis of genes potentially revealing Usher syndrome (CDH23, PCDH15 and MYO7A) is important and this should be explained to parents who agree to have this type of diagnostics performed for their child.
With patient 3, in whom we were not able to determine the disease causing mutation, we illustrate that the major difficulty with this kind of analysis is the interpretation of the variants. In some cases, the disease causing nature of the variant will not be convincing and additional investigations as segregation in the family or functional analysis will be essential.
Our data demonstrate that the use of NGS technology holds promises as a tool for screening congenital deafness genes. The availability of a more profound diagnostic test compared to the actual “gene by gene” analysis approach, will provide better opportunities to identify the disease causing mutation in patients permitting prompt management and accurate genetic counseling. Once a substantial number of patients originating from a specific population are analyzed, we will be able to determine the relative contribution of this set of 15 deafness genes to recessive hearing loss in this specific population, data which is currently unavailable. From a technical point of view, the screening can be performed even on a sporadic case. However less positive results will be obtained since there are a significant amount of deafness cases caused by non-genetic factors. A continued refinement of the NGS technology will further improve the sequencing accuracy and reduce the cost. At the same time bio-informatics tools will improve. This will be critical for the interpretation of NGS-data and essential for the diagnostic laboratories using NGS technology.
Genomic DNA was obtained from five patients with congenital HL. Patient 1 and 2 are the probands of a consanguineous recessive Iranian family with at least 9 affected members for which linkage to the CDH23 locus has been found previously. Patient 3, a member of a Turkish family, has congenital deafness. This patient has parents with normal hearing and GJB2 mutations have been excluded. Patient 4 is the proband of a consanguineous Iranian family with recessive deafness, for which linkage to the OTOF locus has been proven. The family had 7 affected members. Both Iranian families were seen by a geneticist and filled in a general clinical questionnaire. However, no detailed audiometric or ophthalmological examinations were performed, because the families were collected in remote locations. Both families suffered from profound early childhood hearing loss and no obvious signs of syndromal hearing loss were noted. In patient 5, a known mutation in the TMC1 gene was identified previously, and this patient served as a positive control. As patient 3 was part of routine deafness screening, specific ethical approval was not required. Both Iranian families are part of a research project at the University of Antwerp, for which ethical approval was obtained.
Primerdesign and amplicon PCR
Fifteen autosomal recessive deafness genes were selected, based on their reported mutation frequency (Table 1) . The genes with the highest frequency of mutations reported in the literature were chosen. Primer design using the in house developed primerXL pipeline (Lefever et al., in preparation), resulted in 646 oligonucleotide pairs covering all the coding sequences (CDS) and most of the UTRs of the 15 genes responsible for ARNSHL. After the first round of primer design, with the most stringent conditions (no SNPs in primer annealing region, amplicon length between 250–350 bp, GC content between 30 and 80 %), 97.1 % of all the regions of the 15 genes could be amplified successfully. The missing regions were caused by systematic drop-out of some amplicons during PCR. The primer design was optimized in different steps by accepting less stringent conditions; first by tolerating an increase in the number of generated primers (for example to allow the presence of a single SNP in the 5’ region of the primer), then by lowering the permitted amplicon length and finally by slightly varying the melting temperature (Tm) of the primers. All PCR reactions are performed using the same reaction conditions, enabling an automated workflow. The average length across the 646 amplicons is 319 bp, resulting in an aggregate target size of approximately 200 Kb. The primers used in this step are modified at their 5' end with a universal M13 linker sequence. The primer sequences were synthesized by Integrated DNA technologies and delivered in 384-well plates as a 100 μM stock solution (forward and reverse primer separately).
All 646 amplification reactions of the 15 genes were carried out as singleplex PCR in two 384-well plates per patient. A master mix for each sample was prepared and consisted of 1x Kapa Taq buffer (Sopachem), 1 mM MgCl2 (Roche Diagnostics), 0.12 mM dNTP’s (Invitrogen), 0.02 U/μl Kapa Taq polymerase (Sopachem), 0.32x LC Green Plus (Bioké) and 25 ng gDNA per reaction. Then, 1.25 μl of the forward/reverse primer mix (1 μM stock solution) was added to the 8.75 μl master mix using a Freedom EVO Tecan liquid handling robotic workstation, resulting in a final volume of 10 μl per reaction. The PCRs were first tested on Human Genomic DNA (Roche diagnostics). The amplification reactions were carried out on a C1000 real-time thermal cycler (Bio-Rad) with following cycling conditions: 95°C-5’, 94°C-30”, 58°C-30”, 72°C-50”; 40 cycles (+ plate read after each one), followed by a melt curve (65°C > 95°C for 5” increment 0.5°C). The Cq value was determined and the size of the amplified products was verified on a MultiNA (Shimadzu Biotech). To obtain an equal representation of every amplicon in the pool, PCR products were pooled in an equimolar manner for each patient, based on the end-point fluorescent values. Subsequently, 100 μl of these pools were purified with the High Pure PCR Cleanup Micro Kit (Roche Diagnostics). The quality of the PCR pools was verified on an Agilent 2100 Bioanalyzer with the DNA 1000 K chip (Agilent technologies) and the concentration was measured with the Quant-it Picogreen DNA assay (Invitrogen).
Next generation sequencing
Sanger sequencing homopolymers
We identified the homopolymer regions, defined as 6 or more repeats of the same base, located in the exonic regions of the 15 deafness genes (Table 1). These regions were analyzed with the conventional Sanger sequencing method for patient 3, since we couldn’t confirm a causal mutation in this patient.
Mapping of the sequenced reads was performed by BLAT , software that has been integrated in V.I.P. (Variant Identification Pipeline) . Variants were identified using the variant identification module included into V.I.P. with filter settings: homopolymers < 7, Quality score ≥ 30, relative variant frequency ≥ 0.35 and total coverage ≥ 20. The novel identified variants were analyzed by Alamut version 1.53 (Interactive Biosoftware).
Autosomal recessive non-syndromic hearing loss
- GS FLX:
Genome Sequencer FLX
Low molecular weight
Next generation sequencing
Polymerase chain reaction
Single nucleotide polymorphism
Variant identification pipeline.
This research has been made possible by funding from Hercules Foundation [AUGE/039] and Ghent University IOF StepStone funding. This study was supported by a research grant from ‘Action on Hearing Loss UK’ to GVC and PC. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. We thank Fatemeh Alasti for providing samples 1 and 2.
- Morton CC, Nance WE: Current concepts: Newborn hearing screening - A silent revolution. New England Journal of Medicine. 2006, 354: 2151-2164. 10.1056/NEJMra050700.View ArticlePubMedGoogle Scholar
- The Hereditary Hearing Loss Homepage. [http://hereditaryhearingloss.org]
- Putcha GV, Bejjani BA, Bleoo S, Booker JK, Carey JC, Carson N, et al: A multicenter study of the frequency and distribution of GJB2 and GJB6 mutations in a large North American cohort. Genetics in Medicine. 2007, 9: 413-426. 10.1097/GIM.0b013e3180a03276.View ArticlePubMedGoogle Scholar
- Hilgert N, Smith RJH, Van Camp G: Forty-six genes causing nonsyndromic hearing impairment: Which ones should be analyzed in DNA diagnostics?. Mutation Research-Reviews in Mutation Research. 2009, 681: 189-196.View ArticlePubMedGoogle Scholar
- Shearer AE, Deluca AP, Hildebrand MS, Taylor KR, Gurrola J, Scherer S, et al: Comprehensive genetic testing for hereditary hearing loss using massively parallel sequencing. Proceedings of the National Academy of Sciences of the United States of America. 2010, 107: 21104-21109. 10.1073/pnas.1012989107.View ArticlePubMedPubMed CentralGoogle Scholar
- Zhang J, Chiodini R, Badr A, Zhang GF: The impact of next-generation sequencing on genomics. Journal of Genetics and Genomics. 2011, 38: 95-109. 10.1016/j.jgg.2011.02.003.View ArticlePubMedPubMed CentralGoogle Scholar
- Schraders M, Haas SA, Weegerink NJD, Oostrik J, Hu H, Hoefsloot LH, et al: Next-Generation Sequencing Identifies Mutations of SMPX, which Encodes the Small Muscle Protein, X-Linked, as a Cause of Progressive Hearing Impairment. American Journal of Human Genetics. 2011, 88: 628-634. 10.1016/j.ajhg.2011.04.012.View ArticlePubMedPubMed CentralGoogle Scholar
- Rehman AU, Morell RJ, Belyantseva IA, Khan SY, Boger ET, Shahzad M, et al: Targeted Capture and Next-Generation Sequencing Identifies C9orf75, Encoding Taperin, as the Mutated Gene in Nonsyndromic Deafness DFNB79. American Journal of Human Genetics. 2010, 86: 378-388. 10.1016/j.ajhg.2010.01.030.View ArticlePubMedPubMed CentralGoogle Scholar
- Brownstein Z, Friedman LM, Shahin H, Oron-Karni V, Kol N, Rayyan AA, et al: Targeted genomic capture and massively parallel sequencing to identify genes for hereditary hearing loss in middle eastern families. Genome Biology. 2011, 12: 9.View ArticleGoogle Scholar
- De LK, De SJ, Clement L, Baetens M, Lefever S, De KS, et al: Practical tools to implement massive parallel pyrosequencing of PCR products in next generation molecular diagnostics. PLoS One. 2011, 6: e25531-10.1371/journal.pone.0025531.View ArticleGoogle Scholar
- Smith RJH, Gurrola JG: Kelley. PM. 1993Google Scholar
- Migliosi V, Modamio-Hoybjor S, Moreno-Pelayo MA, Rodriguez-Ballesteros M, Villamar M, Telleria D, et al: Q829X, a novel mutation in the gene encoding otoferlin (OTOF), is frequently found in Spanish patients with prelingual non-syndromic hearing loss. J Med Genet. 2002, 39: 502-506. 10.1136/jmg.39.7.502.View ArticlePubMedPubMed CentralGoogle Scholar
- Kent WJ: BLAT - The BLAST-like alignment tool. Genome Research. 2002, 12: 656-664.View ArticlePubMedPubMed CentralGoogle Scholar
- De Schrijver JM, De LK, Lefever S, Sabbe N, Pattyn F, Van NF, et al: Analysing 454 amplicon resequencing experiments using the modular and database oriented Variant Identification Pipeline. BMC Bioinformatics. 2010, 11: 269-10.1186/1471-2105-11-269.View ArticlePubMedPubMed CentralGoogle Scholar
- The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1755-8794/5/17/prepub
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.