- Research article
- Open Access
The identification of novel gene mutations for degenerative lumbar spinal stenosis using whole-exome sequencing in a Chinese cohort
BMC Medical Genomics volume 14, Article number: 134 (2021)
Degenerative lumbar spinal stenosis (DLSS) is a common lumbar disease that requires surgery. Previous studies have indicated that genetic mutations are implicated in DLSS. However, studies on specific gene mutations are scarce. Whole-exome sequencing (WES) is a valuable research tool that identifies disease-causing genes and could become an effective strategy to investigate DLSS pathogenesis.
From January 2016 to December 2017, we recruited 50 unrelated patients with symptoms consistent with DLSS and 25 unrelated healthy controls. We conducted WES and exome data analysis to identify susceptible genes. Allele mutations firstly identified potential DLSS variants in controls to the patients’ group. We conducted a site-based association analysis to identify pathogenic variants using PolyPhen2, SIFT, Mutation Taster, Combined Annotation Dependent Depletion, and Phenolyzer algorithms. Potential variants were further confirmed using manual curation and validated using Sanger sequencing.
In this cohort, the major classification variant was missense_mutation, the major variant type was single nucleotide polymorphism (SNP), and the major single nucleotide variation was C > T. Multiple SNPs in 34 genes were identified when filtered allele mutations in controls to retain only patient mutations. Pathway enrichment analyses revealed that mutated genes were mainly enriched for immune response-related signaling pathways. Using the Novegene database, site-based associations revealed several novel variants, including HLA-DRB1, PARK2, ACTR8, AOAH, BCORL1, MKRN2, NRG4, NUP205 genes, etc., were DLSS related.
Our study revealed that deleterious mutations in several genes might contribute to DLSS etiology. By screening and confirming susceptibility genes using WES, we provided more information on disease pathogenesis. Further WES studies incorporating larger DLSS patient cohorts are required to comprehend the genetic landscape of DLSS pathophysiology fully.
Lumbar spinal stenosis (LSS) is characterized by narrowing the lumbar spinal canal, lateral recesses, or intervertebral foramina [1, 2]. The condition may be congenital, acquired, or degenerative [3,4,5]. Congenital stenosis is rare and typically caused by achondroplasia and hypochondroplasia. Degenerative lumbar spinal stenosis (DLSS) is usually attributed to degenerative changes in intervertebral discs, facet joints, and ligamentum flavum, leading to spondylolisthesis. DLSS typically affects individuals over 50 years old and globally is one of the most common spinal conditions requiring surgery. Neurogenic claudication is the classical clinical manifestation associated with LSS. It causes diminished function and impairs quality of life [6,7,8,9].
DLSS etiology and patho-mechanisms remain unclear, but previous studies implicate genetic factors in DLSS. A Finnish study identified a splice site variant of COL9A2 associated with the condition . Seung-Jae Hyun reported a COL9A2 haplotype (HAP2) was significantly associated with DLSS in the Korean population, whereas another haplotype (HAP4) potentially exerted a protective role against DLSS development . Currently, there is limited information on susceptibility gene mutations for DLSS. Therefore, the identification of pathogenic genes will provide novel therapeutic strategies for DLSS treatment.
Whole-exome sequencing (WES) is a cost-effective, reproducible, and robust approach for the sensitive and specific identification of variants causing protein-coding changes in the human genome. Previous genetic studies were limited to identifying one or more candidate genes, whereas WES approaches can potentially identify and screen hundreds of thousands of single nucleotide polymorphisms (SNPs) . For populations with pathogenic stenosis gene mutations, preventative measures, such as lifestyle modifications and better stenosis monitoring, may be applicable. In this study, we analyzed WES data from 50 DLSS patients and 25 healthy controls to identify pathogenic genes, used Sanger DNA sequencing to validate variants identified by WES, and analyzed SNPs using bioinformatics analysis. We identified several pathogenic gene variants associated with DLSS in our cohort.
We recruited 50 consecutive, unrelated Chinese patients with symptoms consistent with DLSS and 25 unrelated healthy controls between January 2016 and December 2017 from the Orthopedic Department at the China-Japan Friendship Hospital. The ethics committee approved this study of the Hospital Institutional Review Board. All participants provided written informed consent to use samples and clinical information. All patients underwent clinical and radiological examinations to confirm the degenerative nature of their stenosis. DLSS patients met the following three criteria: (1) DLSS diagnosis by magnetic resonance imaging (MRI); (2) conservative treatment and monitoring for > three months by spinal surgeons, and (3) a history of typical LSS symptoms, consisting of self-reported intermittent neurogenic claudication, stenotic symptoms extending to the lower extremities upon extension of the lumbar spine, or numbness or weakness of the lower extremities. The study excluded patients with neurological disorders, spinal fractures, spondylolisthesis, spinal tumors, trauma, and infectious diseases. We enrolled 25 biologically unrelated healthy individuals with similar ethnic backgrounds as the control group.
Genome DNA was extracted from whole blood using the TIANamp Blood DNA kit (TIANGEN BIOTECH, Beijing, China). We performed exome capture using the Agilent SureSelect Human All Exon V6 kit (Agilent Technologies, Santa Clara, CA, USA) as per the manufacturer’s instructions. WES was performed using the Illumina HiSeq Xten platform. We processed sequencing-derived raw image files using Illumina base-calling software with default parameters and generated sequence data for each individual as paired-end reads or raw data. We conducted studies according to the manufacturer’s protocol. One sample was loaded per flow cell lane to generate a minimum 10 × read depth across ~ 96% target regions.
Exome data analysis
The bioinformatics analysis used raw sequencing data from the Illumina pipeline (Fig. 1). We processed and filtered the raw data and discarded low-quality reads based on the following criteria: (1) reads containing an abnormal sequencing adapter, (2) reads with a low-quality base ratio (base quality less than or equal to 5 that was more than 50%, and (3) reads with an unknown base (“N” base) ratio > 10%. Variants in exon or alternative splicing regions were retained. We compared the variant frequency using the 1000G SNP database (http://www.1000genomes.org/). Exome Aggregation Consortium (ExAC) (http://exac.broadinstitute.org) analysis for potential deleterious mutations was performed using different algorithms; PolyPhen2 (http://genetics.bwh.harvard.edu/pph2/) , SIFT (http://sif.jcvi.org/) , Mutation Taster (http://www.mutationtaster.org/) , and Combined Annotation Dependent Depletion (CADD). We used the identification of rare combined with at least two of these four algorithms to prioritize potential pathogenic variants. We used R software (http://www.bioconductor.org/packages/release/bioc/html/maftools.html) for cluster analyses to determine statistical significance in genotype and allele frequencies between patients and controls (false discovery rate (FDR) < 0.05). Phenolyzer (Phenotype Based Gene Analyzer, http://phenolyzer.wglab.org/) was used to identify genes based on user-specific disease/phenotype terms.
Read mapping to reference sequences
Single nucleotide variants (SNVs) in the 75 samples were generated by comparing valid sequencing data with the human reference genome (UCSC Genome Browser hg19) using Burrows–Wheeler Aligner software to derive primary mapping results. Then, we sorted the aligned data using SAM tools to select the best mapping positions. Duplicated reads were marked by Picard (http://sourceforge.net/ projects/picard/) for use in the next analysis.
Also, we conducted functional annotations to determine genetic variations associated with DLSS. Firstly, we annotated all variants using ANNOVAR software. Then, normal population variant databases, including 1000g2015aug_all, 1000g2015aug_Chinese, esp6500siv2_all, NovoDb_WES_SNP, and ExAC databases, were screened to exclude common variations occurring with no more than 1% minor allele frequency.
Candidate variants and gene selection
SIFT, PolyPhen-2, and MutationTaster analyses were performed to predict whether amino acid substitutions affected protein function. A mutation was selected as a candidate variant if one of the three software programs showed it was pathological.
Gene function enrichment analysis
The Toppgene suite was used for gene enrichment analysis, and candidate gene prioritization was based on functional annotation and a protein interaction network (https://toppgene.cchmc.org/). We used a web-based portal, Metascape (http://metascape.org/), to perform Gene Ontology (GO) enrichment analysis. Gene Set Enrichment Analysis (GSEA) software (version 3.0) with a “c2.cp.kegg.v6.1symbols.gmt” gene set database was used to perform GSEA (http://www.broadinstitute.org/gsea/index.jsp) to identify significantly enriched or depleted genes that showed statistically significant and concordant differences between two given clusters.
We performed a Fisher’s exact test to evaluate associations between rare variants and disease phenotype (case/control data) using the Benjamini and-Hochberg method, a type of FDR test used for multiple hypothesis testing to correct for multiple comparisons.
The study included 21 male and 29 female patients. Patients had an average age of 52.4 years, ranging from 46 to 70 years. Also, 10 male and 15 female controls were included, with an average age of 52.4 years, ranging from 29 to 70 years. After Principal Component Analysis, three patients were deemed as outliers. Without these outliers, the remaining 72 participants were in a random distribution. The three outliers (bijinfang, bijinfen, bijinhua) had a close genetic relationship (Additional file 1: Fig. S1, Additional file 2: Fig. S2).
General variant status of participants using WES
We performed WES on 75 DNA samples using the Illumina Hiseq Xen platform. To filter potential pathogenic variants, we focused on the identification of rare 1000G_EAS ≤ 0.005, based on the BGI database) and damaging variants predicted by at least two of four algorithms (i.e.,., SIFT, Polyphen2, Mutation Assessor, Mutation Taster, and CADD). Detrimental variants were classified as missense_mutation, frame_shift_del, nonsense_mutation, frame_shift_ins, splice_site, in_frame_ins, in_frame_del, nonstop_mutation. The basic variant status in the 75 samples is shown (Additional file 3: Fig. S3, Additional file 4: Fig. S4).
DLSS-related variant identification from 72 participants
Genes related to DLSS were processed by cluster analysis in R software. We filtered allele mutations in controls to retain only mutations in patients and identified multiple SNPs in 34 genes (FDR < 0.05) (Fig. 2, Table 1).
DLSS-related variant identification using public databases
The study used 50 disease and 2245 control cases (Novegene database) to perform site-based associations (Fig. 4). Forty-three (43) genes were filtered up to P value ≤ 0.001 and FDR_BH_Allele ≤ 0.01. After Toppgene analyses, the top two genes were HLA-DRB1 and PRKN. Table 2 shows the top four SNPs related to lumbar disease. We also performed pathway and Phenolyzer analysis to discover genes related to lumbar disease. Genes related to DLSS (Phenolyzer score ≥ 0.01) are shown (Additional file 5: Fig. S5).
After filtering data using 1000G_EAS ≤ 0.005 and damaging variants predicted by at least two of four algorithms (i.e., SIFT, Polyphen2, Mutation Assessor, Mutation Taster, and CADD), we identified 322 SNPs in 60 candidate genes, with P values ≤ 0.001. We also performed pathway and Phenolyzer analyses to discover genes related to lumbar disease. As “lumbar degeneration” was not a disease item, we inputted “lumbar disc degeneration” into Phenolyzer. Genes related to DLSS (Phenolyzer scores ≥ 0.01) are shown (Table 3 and Additional file 6: Fig. S6).
As there was a genetic relationship between bijingfang, bijinfen, and bijinhua, not all patients had a sporadic disease. We sought to determine SNPs not only in these three cases but also in the remaining 47. Six potential genes were identified associated with DLSS (Tables 4, 5).
Previous evidence has indicated that genetic factors may be implicated in DLSS; therefore, we performed WES on 50 patients and 25 controls to investigate disease contributing genes. We identified several novel candidate genes previously unconnected with DLSS.
Several studies have identified candidate genes associated with lumbar disc degeneration. Jason et al. performed a genome-wide association study and identified multiple SNPs suggesting a multifactorial basis for DLSS . In other work, the HLA-DRB1 genotype increased the risk of developing pain after surgery or lumbar disc herniation . A PARK2 gene variant was associated with lumbar disc degeneration by influencing overall PARK2 methylation . In this study, HLA-DRB1 and PARK2 were identified as susceptibility genes associated with a predisposition to DLSS. Lumbar disc degeneration causes intervertebral collapse, which may accelerate DLSS development, suggesting overlapping mechanisms exist between the two degenerative processes. Our genetic data partially agreed with previous studies; however, specific gene-related DLSS mechanisms require further investigation.
This study also identified ACTR8, AOAH, BCORL1, MKRN2, NRG4, and NUP205 genes associated with DLSS. ACTR8 has 14 exons, is located on chromosome 3p21.1, with mutations associated with lineage-specific expression in primates . AOAH is found on chromosome 7p14.2 and encodes both light and heavy acyloxyacyl hydrolaseregion subunits. AOAH polymorphisms are reportedly associated with asthma, chronic rhinosinusitis, and bronchial hyperreactivity [20, 21]. Located on chromosome Xq26.1, BCORL1 encodes a transcriptional corepressor that tethers promoter regions via DNA-binding proteins. Pathogenic BCORL1 variants reportedly underlie a newly identified X-linked epigenetic syndrome . MKRN2 is located on chromosome 3p25.2 and encodes a putative E3 ubiquitin ligase containing several zinc finger domains. The gene is involved in inflammatory response regulation and is implicated in non-small-cell lung cancer [23, 24]. NRG4 is located on chromosome 15q24.2 and is a member of the epidermal growth factor family of extracellular ligands, is highly expressed in adipose tissue, enriched in brown fat, and markedly increased during brown adipocyte differentiation . NUP205 on chromosome7q33 encodes a nucleoporin which is a subunit of the nuclear pore complex which functions in protein, RNA, and ribonucleoprotein particle active transport between the nucleus and cytoplasm. Mutations in NUP205 are associated with steroid-resistant nephrotic syndrome .
Several other DLSS candidate genes were also identified in this study, including GPRIN2, MYOT, and PDE4DIP, etc. Several pathways were enriched using differentially expressed gene analysis between patients and controls. Until now, these genes were not associated with DLSS.
Our study had several limitations. Firstly, patient numbers (50) were largely inadequate for a genetic study. However, we had enrolled three family members. In terms of future work, the candidate genes identified here warrant further investigation. Functional studies should be conducted to determine how candidate gene molecular mechanisms and pathways impact DLSS development. However, our immediate remit is to increase cohort size to increase statistical power and identify more susceptible genes.
We identified several candidate gene mutations potentially associated with DLSS in Chinese patients using WES for the first time. Further verification of our data may help develop molecular-based approaches to aid DLSS diagnosis and treatment.
Availability of data and materials
The datasets generated and/or analysed during the current study are available in the [SRA] repository, [https://www.ncbi.nlm.nih.gov/bioproject/PRJNA728520/].
Degenerative lumbar spinal stenosis
Whole exome sequencing
Magnetic resonance imaging
Single nucleotide variants
Minor allele frequency
Arabmotlagh M, Sellei RM, Vinas-Rios JM, Rauschmann M. Classification and diagnosis of lumbar spinal stenosis. Orthopade. 2019;48(10):816–23. https://doi.org/10.1007/s00132-019-03746-1.
Bagley C, MacAllister M, Dosselman L, Moreno J, Aoun SG, El Ahmadieh TY. Current concepts and recent advances in understanding and managing lumbar spine stenosis. F1000Res. 2019;8:137. https://doi.org/10.12688/f1000research.16082.1.
Sheehan JM, Shaffrey CI, Jane JA Sr. Degenerative lumbar stenosis: the neurosurgical perspective. Clin Orthop Relat Res. 2001;384:61–74.
Spivak JM. Degenerative lumbar spinal stenosis. J Bone Joint Surg Am. 1998;80(7):1053–66. https://doi.org/10.2106/00004623-199807000-00015.
Raja A, Hoang S, Patel P, Mesfin FB. Spinal stenosis. StatPearls; 2020.
Issack PS, Cunningham ME, Pumberger M, Hughes AP, Cammisa FP Jr. Degenerative lumbar spinal stenosis: evaluation and management. J Am Acad Orthop Surg. 2012;20(8):527–35. https://doi.org/10.5435/jaaos-20-08-527.
Kalff R, Ewald C, Waschke A, Gobisch L, Hopf C. Degenerative lumbar spinal stenosis in older people: current treatment options. Dtsch Arztebl Int. 2013;110(37):613–23. https://doi.org/10.3238/arztebl.2013.0613 (quiz 624).
Munakomi S, Foris LA, Varacallo M. Spinal stenosis and neurogenic claudication. StatPearls; 2020.
Deer T, Sayed D, Michels J, Josephson Y, Li S, Calodney AK. A review of lumbar spinal stenosis with intermittent neurogenic claudication: disease and diagnosis. Pain Med. 2019;20(Suppl 2):S32–44. https://doi.org/10.1093/pm/pnz161.
Noponen-Hietala N, Kyllönen E, Männikkö M, Ilkko E, Karppinen J, Ott J, Ala-Kokko L. Sequence variations in the collagen IX and XI genes are associated with degenerative lumbar spinal stenosis. Ann Rheum Dis. 2003;62(12):1208–14. https://doi.org/10.1136/ard.2003.008334.
Hyun SJ, Park BG, Rhim SC, Bae CW, Lee JK, Roh SW, Jeon SR. A haplotype at the COL9A2 gene locus contributes to the genetic risk for lumbar spinal stenosis in the Korean population. Spine. 2011;36(16):1273–8. https://doi.org/10.1097/BRS.0b013e31820e6282.
Ng SB, Turner EH, Robertson PD, Flygare SD, Bigham AW, Lee C, Shaffer T, Wong M, Bhattacharjee A, Eichler EE, Bamshad M, Nickerson DA, Shendure J. Targeted capture and massively parallel sequencing of 12 human exomes. Nature. 2009;461(7261):272–6. https://doi.org/10.1038/nature08250.
Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P, Kondrashov AS, Sunyaev SR. A method and server for predicting damaging missense mutations. Nat Methods. 2010;7(4):248–9. https://doi.org/10.1038/nmeth0410-248.
Sim NL, Kumar P, Hu J, Henikoff S, Schneider G, Ng PC. SIFT web server: predicting effects of amino acid substitutions on proteins. Nucleic Acids Res. 2012;40:W452–7. https://doi.org/10.1093/nar/gks539.
Schwarz JM, Cooper DN, Schuelke M, Seelow D. MutationTaster2: mutation prediction for the deep-sequencing age. Nat Methods. 2014;11(4):361–2. https://doi.org/10.1038/nmeth.2890.
Cheung JPY, Kao PYP, Sham P, Cheah KSE, Chan D, Cheung KMC, Samartzis D. Etiology of developmental spinal stenosis: a genome-wide association study. J Orthop Res. 2018;36(4):1262–8. https://doi.org/10.1002/jor.23746.
Dominguez CA, Kalliomäki M, Gunnarsson U, Moen A, Sandblom G, Kockum I, Lavant E, Olsson T, Nyberg F, Rygh LJ, Røe C, Gjerstad J, Gordh T, Piehl F. The DQB1 *03:02 HLA haplotype is associated with increased risk of chronic pain after inguinal hernia surgery and lumbar disc herniation. Pain. 2013;154(3):427–33. https://doi.org/10.1016/j.pain.2012.12.003.
Williams FM, Bansal AT, van Meurs JB, Bell JT, Meulenbelt I, Suri P, Rivadeneira F, Sambrook PN, Hofman A, Bierma-Zeinstra S, Menni C, Kloppenburg M, Slagboom PE, Hunter DJ, MacGregor AJ, Uitterlinden AG, Spector TD. Novel genetic variants associated with lumbar disc degeneration in northern Europeans: a meta-analysis of 4600 subjects. Ann Rheum Dis. 2013;72(7):1141–8. https://doi.org/10.1136/annrheumdis-2012-201551.
Choe SH, Park SJ, Cho HM, Park HR, Lee JR, Kim YH, Huh JW. A single mutation in the ACTR8 gene associated with lineage-specific expression in primates. BMC Evol Biol. 2020;20(1):66. https://doi.org/10.1186/s12862-020-01620-9.
Barnes KC, Grant A, Gao P, Baltadjieva D, Berg T, Chi P, Zhang S, Zambelli-Weiner A, Ehrlich E, Zardkoohi O, Brummet ME, Stockton M, Watkins T, Gao L, Gittens M, Wills-Karp M, Cheadle C, Beck LA, Beaty TH, Becker KG, Garcia JG, Mathias RA. Polymorphisms in the novel gene acyloxyacyl hydroxylase (AOAH) are associated with asthma and associated phenotypes. J Allergy Clin Immunol. 2006;118(1):70–7. https://doi.org/10.1016/j.jaci.2006.03.036.
Zhang Y, Endam LM, Filali-Mouhim A, Zhao L, Desrosiers M, Han D, Zhang L. Polymorphisms in RYBP and AOAH genes are associated with chronic rhinosinusitis in a Chinese population: a replication study. PLoS ONE. 2012;7(6):e39247. https://doi.org/10.1371/journal.pone.0039247.
Damm F, Chesnais V, Nagata Y, Yoshida K, Scourzic L, Okuno Y, Itzykson R, Sanada M, Shiraishi Y, Gelsi-Boyer V, Renneville A, Miyano S, Mori H, Shih LY, Park S, Dreyfus F, Guerci-Bresler A, Solary E, Rose C, Cheze S, Prébet T, Vey N, Legentil M, Duffourd Y, de Botton S, Preudhomme C, Birnbaum D, Bernard OA, Ogawa S, Fontenay M, Kosmider O. BCOR and BCORL1 mutations in myelodysplastic syndromes and related disorders. Blood. 2013;122(18):3169–77. https://doi.org/10.1182/blood-2012-11-469619.
Shin C, Ito Y, Ichikawa S, Tokunaga M, Sakata-Sogawa K, Tanaka T. MKRN2 is a novel ubiquitin E3 ligase for the p65 subunit of NF-κB and negatively regulates inflammatory responses. Sci Rep. 2017;7:46097. https://doi.org/10.1038/srep46097.
Jiang J, Xu Y, Ren H, Wudu M, Wang Q, Song X, Su H, Jiang X, Jiang L, Qiu X. MKRN2 inhibits migration and invasion of non-small-cell lung cancer by negatively regulating the PI3K/Akt pathway. J Exp Clin Cancer Res. 2018;37(1):189. https://doi.org/10.1186/s13046-018-0855-7.
Wang GX, Zhao XY, Meng ZX, Kern M, Dietrich A, Chen Z, Cozacov Z, Zhou D, Okunade AL, Su X, Li S, Blüher M, Lin JD. The brown fat-enriched secreted factor Nrg4 preserves metabolic homeostasis through attenuation of hepatic lipogenesis. Nat Med. 2014;20(12):1436–43. https://doi.org/10.1038/nm.3713.
Braun DA, Sadowski CE, Kohl S, Lovric S, Astrinidis SA, Pabst WL, Gee HY, Ashraf S, Lawson JA, Shril S, Airik M, Tan W, Schapiro D, Rao J, Choi WI, Hermle T, Kemper MJ, Pohl M, Ozaltin F, Konrad M, Bogdanovic R, Büscher R, Helmchen U, Serdaroglu E, Lifton RP, Antonin W, Hildebrandt F. Mutations in nuclear pore genes NUP93, NUP205 and XPO5 cause steroid-resistant nephrotic syndrome. Nat Genet. 2016;48(4):457–65. https://doi.org/10.1038/ng.3512.
Ethics approval and consent to participate
The clinical study was approved by the ethics committee of China-Japan Friendship Hospital and was conducted in accordance with the provisions of the Declaration of Helsinki. Written informed consent was obtained from all participants before enrolment.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Principal Component Analysis of 75 samples.
Principal Component Analysis of 72 samples.
Basic variant status in the 75 cases.
Basic variant status in the 75 cases.
Forty-three genes identified by Phenolyzer analysis (Phenolyzer score ≥ 0.01).
Sixty genes identified by Phenolyzer analysis (Phenolyzer score ≥ 0.01).
Pathways enriched possibly related to DLSS.
About this article
Cite this article
Jiang, X., Chen, D. The identification of novel gene mutations for degenerative lumbar spinal stenosis using whole-exome sequencing in a Chinese cohort. BMC Med Genomics 14, 134 (2021). https://doi.org/10.1186/s12920-021-00981-4
- Degenerative lumbar spinal stenosis
- Whole-exome sequencing
- Susceptible genes
- Single nucleotide polymorphisms
- Gene mutations