Skip to main content

Exploring genes for immunoglobulin A nephropathy: a summary data-based mendelian randomization and FUMA analysis



Immunoglobulin A nephropathy (IgAN) is a complex autoimmune disease, and the exact pathogenesis remains to be elucidated. This study aimed to explore genes underlying the pathogenesis of IgAN.


We conducted the summary data-based Mendelian randomization (SMR) analysis and performed functional mapping and annotation using FUMA to explore genetic loci that are potentially involved in the pathogenies of IgAN. Both analyses used summarized data of a recent genome-wide association study (GWAS) on IgANs, which included 477,784 Europeans (15,587 cases and 462,197 controls) and 175,359 East Asians (71 cases and 175,288 controls). We performed SMR analysis using Consortium for the Architecture of Gene Expression (CAGE) expression quantitative trait loci (eQTL) data and replicated the analysis using Genotype-Tissue Expression (GTEx) eQTL data.


Using the CAGE eQTL data, our SMR analysis identified 32 probes tagging 25 unique genes whose expression were pleiotropically associated with IgAN, with the top three probes being ILMN_2150787 (tagging HLA-C, PSMR= 2.10 × 10–18), ILMN_1682717 (tagging IER3, PSMR= 1.07 × 10–16) and ILMN_1661439 (tagging FLOT1, PSMR=1.16 × 10–14). Using GTEx eQTL data, our SMR analysis identified 24 probes tagging 24 unique genes whose expressions were pleiotropically associated with IgAN, with the top three probes being ENSG00000271581.1 (tagging XXbac-BPG248L24.12, PSMR= 1.44 × 10–10), ENSG00000186470.9 (tagging BTN3A2, PSMR= 2.28 × 10–10), and ENSG00000224389.4 (tagging C4B, PSMR= 1.23 × 10 –9). FUMA analysis identified 3 independent, significant and lead SNPs, 2 genomic risk loci and 39 genes that are potentially involved in the pathogenesis of IgAN.


We identified many genetic variants/loci that are potentially involved in the pathogenesis of IgAN. More studies are needed to elucidate the exact mechanisms of the identified genetic variants/loci in the etiology of IgAN.

Peer Review reports


Immunoglobulin A nephropathy (IgAN), first described by Jean Berger in 1968 [1], is one of the most common forms of glomerulonephritis (GN) in the world [2]. It is characterized by the deposition of IgA immune complexes (specifically the IgA1 subclass) in the glomerular mesangium, leading to frequent episodes of hematuria and/or proteinuria [3]. Approximately 20–40% of IgAN patients will progress to end-stage renal disease (ESRD) within 10–20 years of diagnosis [4, 5], causing a critical public health burden.

IgAN is a complex autoimmune disease with contributions from multiple factors, such as preference of salty food [6] and a family history of chronic glomerulonephritis [7]. Previous studies also indicated the important role of genetics in the etiology of IgAN. The prevalence of IgAN varies considerably across ethnicities, being the highest in Asians, moderate in Caucasians and the lowest in the African population [8], implying that both environmental and genetic factors are likely to be involved in the pathogenesis of IgAN. Genome-wide association studies (GWAS) identified several independent risk alleles for IgAN in East Asians and Europeans, such as genetic loci in CFH, TNFSF13, ST6GAL1 and ACCS [9,10,11,12,13]. Recent research also discovered two distinct genome-wide significant loci in C1GALT1 and C1GALT1C1 in association with defective O-glycosylation of serum immunoglobulin A1 (IgA1), the key pathogenic defect in IgAN [14]. However, the exact pathogenic mechanisms underlying the observed associations in general and the genetic associations in particular remains to be elucidated.

An important goal of public health is to identify modifiable risk factors of a disease or disorder to develop effective interventions and therapeutics. However, because of confounding, reverse causality, and selection bias, risk factors discovered by traditional observational epidemiology research were frequently found to be deceptive [15, 16]. Randomized controlled trials (RCTs) are often regarded as the gold standard for drawing causal conclusions because all the parameters, except for the exposure of interest, are comparable between the groups [17]. However, it is often time-consuming and expensive to conduct RCTs, and in some cases, allocation of exposure is either immoral or unfeasible.

Mendelian randomization (MR) refers to bioinformatical methods that assess pleiotropic effect of modifiable risk factors on a disease/disorder by utilizing proxy measures of these risk factors, thereby avoiding the necessity of conducting a conventional RCT [18]. In MR, instrumental variables (IVs) are used as a proxy for randomizing individuals to ensure comparable results regardless of known/unknown confounding factors. Due to the random allocation of alleles during gamete formation, which occurs well before the exposure or outcome, genetic variants are often used as the IVs. Pleiotropic associations can be estimated from MR because inherited genetic variants are independent of potential confounding factors. By using MR, confounding and reverse causation, which are commonly encountered in traditional association studies, can be greatly minimized. MR has been successful in identifying gene expression or DNA methylation sites showing pleiotropic association with various phenotypes, such as systemic lupus erythematosus (SLE), educational attainment, and severity of COVID-19 [19,20,21]. In this study, we adopted the summary data-based MR (SMR) approach integrating summarized cis-expression quantitative trait loci (cis- eQTL) data and GWAS data for IgAN to prioritize genes whose expressions are pleiotropically associated with IgAN, with gene expression being the exposure and IgAN being the outcome.

Although previous genetic studies have identified independent risk alleles for IgAN, more studies are needed to better understand the genetic mechanisms underlying IgAN, such as the roles of the non-coding regulatory regions. FUMA is an Internet-based program that utilizes multiple biological databases to provide an easy-to-use tool for functional mapping and annotation of genetic variants identified in GWAS [22]. FUMA can provide multiple post-GWAS analysis simultaneously, including functional annotation of candidate SNPs, gene mapping, tissue-expression analysis of the prioritized genes, gene set enrichment analysis (GSEA), and interactive visualization. Previously research indicated that FUMA validated known candidate genes and identified additional putative genes through eQTL mapping and chromatin interaction mapping [22], providing valuable clues for understanding the complex genetic mechanisms underlying a disease/disorder. Therefore, we also conducted FUMA analysis to further explore genetic variants and genomic loci in the pathogenesis of IgAN.


GWAS data for IgAN

The GWAS summarized data for IgAN were provided by a recent genome-wide association meta-analysis of IgAN [23]. The results were based on meta-analyses of IgAN using data from three population-based projects: The BioBank Japan (BBJ) [24], the UK Biobank [25], and GWAS summary results from FinnGen (, with the sample size being 175,359 (71 cases and 175,288 controls), 344,365 (15,418 cases and 328,947 controls), and 133,419 (169 cases and 133,250 controls) for the three projects, respectively. As a result, the meta-analysis included 477,784 Europeans (15,587 cases and 462,197 controls) and 175,359 East Asians (71 cases and 175,288 controls). IgAN was diagnosed based on International Classification of Diseases, 10th revision (ICD-10). The three projects used different genotyping flatforms and reference panels for imputation. GWAS analysis was done using generalized mixed-effects models, with adjustment of different covariates and principal components (PCs) (Additional file 1: Table S1). The GWAS summarized data can be downloaded at

eQTL data for SMR analysis

In the SMR analysis, cis-eQTL genetic variants were used as the IVs for gene expression. cis-eQTLs were defined as the eQTLs that are not more than 5 Mb away from the probes. A default cis window of 2000 kb was used. We performed separate SMR analysis using eQTL data from two sources. Specifically, we used the CAGE eQTL summarized data for whole blood, which included 2765 participants of European ancestry [26]. To see whether the significant findings can be replicated, we also performed separate SMR analysis using the V7 release of the GTEx eQTL summarized data for whole blood, which included 338 participants of European ancestry [27]. The eQTL data can be downloaded at We did not use GTEx eQTL data from kidney due to the extremely limited sample size (e.g., n = 73 for kidney cortex and n = 4 for kidney medulla).

SMR analysis

We conducted the SMR analysis as implemented in the software SMR, with cis-eQTL as the IV, gene expression as the exposure, and IgAN as the outcome. Detailed information regarding the SMR method can be found in a previous publication [28]. We conducted the heterogeneity in dependent instruments (HEIDI) test to evaluate the existence of linkage in the observed association. HEIDI uses multiple SNPs in a cis-eQTL region to distinguish pleiotropy from linkage. The null hypothesis is that there is a single causal variant underlying the observed association between gene expression and a trait. Testing against the null hypothesis is equivalent to testing whether there is heterogeneity in the estimated SMR effect for the SNPs in the cis-eQTL region. We adopted the default PHEIDI< 0.05 to indicate the existence of pleiotropy (i.e., the observed association could be due to two distinct genetic variants in high linkage disequilibrium with each other), which is a conservative approach as it retains fewer genes than when correcting for multiple testing. We adopted the default settings in SMR (Additional file 1: Table S2) and used false discovery rate (FDR) to adjust for multiple testing. The SMR analytic process is illustrated in Fig. 1.

Fig. 1
figure 1

Flow chart for the SMR analysis. A SMR analysis using eQTL data from CAGE; and B SMR analysis using eQTL data from GTEx. eQTL, expression quantitative trait loci; GWAS, genome-wide association studies; LD, linkage disequilibrium; SMR, summary data-based Mendelian randomization; SNP, single nucleotide polymorphisms

FUMA analysis

We conducted a FUMA analysis to functionally map and annotate the genetic associations to better understand the genetic mechanisms underlying IgAN. FUMA uses GWAS association results as the input and integrates information from multiple resources. It provides a friendly on-line platform for easy implementation of post-GWAS analysis, such as functional annotation and gene prioritization [22]. FUMA provides two major functions: SNP2GENE for annotating SNPs regarding their biological functions and SNP-to-genes mapping; and GENE2FUNC for annotating the mapped genes in biological contexts. In SNP2GENE, we performed both positional mapping (maximum distance 10 kb) and eQTL mapping (cis-eQTL, i.e., up to 1 Mb) using GTEx v8 of whole blood and kidney. We adopted the default settings otherwise for both SNP2GENE (e.g., maximum P value of lead SNPs being 5 × 10− 8 and r2 threshold for independent significant SNPs being 0.6) and GENE2FUNC (e.g., using FDR to correct for multiple testing in the gene-set enrichment analysis).

It is noteworthy that although both FUMA and SMR utilized eQTL information, the two methods attempted to explore the pathogenesis of IgAN through different perspectives: The SMR analysis tried to reveal gene expressions showing pleiotropic association with IgAN while FUMA attempted to pinpoint most likely relevant genetic variants in association with IgAN while taking into account the regional linkage disequilibrium (LD) patterns based on positional and eQTL information of the SNPs. Moreover, FUMA not only provided enrichment analysis of the prioritized genes in biological pathways and functional categories, it also revealed potential risk loci along the chromosomes.

Data cleaning and statistical/bioinformatical analysis was performed using R version 4.1.2 (, PLINK 1.9 (, SMR (, and FUMA (


Basic information of the summarized data

The GWAS summarized data included a total of 22,665,652 SNPs. A total of 6437 SNPs were significantly associated with IgAN using the conventional P = 5 × 10− 8 as the cut-off. The GWAS meta-analysis of IgAN identified eight significant genetic loci (Additional file 1: Table S3). After checking allele frequencies among the datasets and linkage disequilibrium (LD) pruning, we found that there were more than 6 million eligible SNPs in each SMR analysis. The CAGE eQTL has a much larger number of participants than that of the GTEx eQTL data (2765 vs. 70), so is the number of eligible probes (8534 vs. 4536). In the FUMA analysis, about 8.6 million SNPs were used as input. The detailed information is shown in Table 1.

Table 1 Basic information of the eQTL and GWAS data

Pleiotropic association with IgAN

Using the CAGE eQTL data, our SMR analysis identified 32 probes tagging 25 unique genes whose expressions were pleiotropically associated with IgAN, with the top three probes being ILMN_2150787 (tagging HLA-C, PSMR= 2.10 × 10–18), ILMN_1682717 (tagging IER3, PSMR= 1.07 × 10–16) and ILMN_1661439 (tagging FLOT1, PSMR=1.16 × 10–14; Fig. 2; Table 2, Additional file 1: Table S4). Using GTEx eQTL data, our SMR analysis identified 24 probes tagging 24 unique genes whose expressions were pleiotropically associated with IgAN, with the top three probes being ENSG00000271581.1 (tagging Xxbac-BPG248L24.12, PSMR=1.44 × 10–10), ENSG00000186470.9 (tagging BTN3A2, PSMR=2.28 × 10–10), and ENSG00000224389.4 (tagging C4B, PSMR = 1.23 × 10− 9; Fig. 3; Table 2, Additional file 1: Table S4). Note that of the 25 unique genes identified in the SMR analysis using CAGE eQTL data, four genes (FLOT1, BTN3A2, HLA-DRB6, and HLA-DRB1) were also replicated in the SMR analysis using GTEx eQTL data (Additional file 1: Table S4).

Fig. 2
figure 2

Pleiotropic association of HLA-C with IgAN. Top plot, grey dots represent the −log10(P values) for SNPs from the GWAS of IOP, with solid rhombuses indicating that the probes pass HEIDI test. Middle plot, eQTL results. Bottom plot, location of genes tagged by the probes; GWAS, genome-wide association studies; SMR, summary data-based Mendelian randomization; HEIDI, heterogeneity in dependent instruments; eQTL, expression quan-titative trait loci; IgAN, immunoglobulin A nephropathy

Table 2 The top hit probes identified in SMR analyses
Fig. 3
figure 3

Pleiotropic association of XXbac-BPG248L24.12, C4A and C4B with IgAN. Top plot, grey dots represent the −log10(P values) for SNPs from the GWAS of IOP, with solid rhombuses indicating that the probes pass HEIDI test. Middle plot, eQTL results. Bottom plot, location of genes tagged by the probes. GWAS, genome-wide association studies; SMR, summary data-based Mendelian randomization; HEIDI, heterogeneity in dependent instruments; eQTL, expression quan-titative trait loci; IgAN, immunoglobulin A nephropathy

Functional mapping and annotation

FUMA analysis identified 3 independent, significant and lead SNPs (rs2076030, rs469228, and rs1884937; Additional file 1: Tables S5–S7), and 2 genomic risk loci (Fig. 4; Additional file 1: Table S8). All the three SNPs are located on chromosome 6. In addition, FUMA identified 39 genes that are potentially involved in the pathogenesis of IgAN (Additional file 1: Table S9), which are clustered in one genomic risk locus, with the other genomic locus containing no identified genes (Fig. 4 & Additional file 1: Table S9). Of the 39 identified genes, four were also identified by SMR analysis using CAGE eQTL data, including HIST1H2BK, ZSCAN16, ZKSCAN4, and ZKSCAN3; and one (TRIM27) was identified by SMR analysis using GTEx eQTL data. Expression of the prioritized genes in 54 tissues can be found in Additional file 1: Table S10 and Figure S1.

Fig. 4
figure 4

Genetic risk loci identified by FUMA analysis using GWAS data on IgAN. Genomic risk loci are displayed in the format of ‘chromosome:start position-end position’ on the Y axis. For each genomic locus, histograms from left to right depict the size, the number of candidate SNPs, the number of mapped genes (using positional mapping and eQTL mapping), and the number of genes known to be located within the genomic locus, respectively. eQTL, expression quantitative trait loci; GWAS, genome-wide association studies; SNP, single nucleotide polymorphism; IgAN, immunoglobulin A nephropathy

GSEA was undertaken to test the possible biological mechanisms of the 39 candidate genes implicated in IgAN (Additional file 1: Table S11). A total of 310 gene sets with an adjusted P < 0.05 were identified. We found enrichment signals in intestinal immune network such as SLE (adjusted P = 9.00 × 10–23) as revealed by a recent GWAS study [13], and chromatin-related pathways such as chromatin assembly (adjusted P = 5.58 × 10–11) which are crucial regulators in cellular immunity[29].


In this study, we conducted SMR and FUMA analysis to prioritize genetic loci associated with IgAN. We identified multiple genetic variants, genes, gene sets and two genomic loci that may be involved in the pathogenesis of IgAN. These findings provided helpful leads to a better understanding of the etiology of IgAN.

We found that multiple genes in the human leukocyte antigen (HLA) complex whose expressions showed significantly pleiotropic association with IgAN using CAGE and/or GTEx eQTL data, such as HLA-A/C/E/H/J/L, HLA-DQA1/A2, and HLA-DRB1/B6. The HLA complex, known as major histocompatibility complex (MHC), plays important roles in enabling the immune system to recognize “self” versus “non-self” antigens. The first association between HLA and renal disease was reported more than 50 years ago. Since then, mounting evidence has demonstrated the importance of the HLA complex in IgAN [30]. Many genetic variants in the HLA complex have been found to be associated with the risk of IgAN [31,32,33]. Previous GWAS studies also identified a few genetic variants in MHC in association with IgAN in individuals of European and East-Asian ancestry [9, 11,12,13]. A recent study found that the expression of HLA-DQB1 and HLA-DRB1 decreased on the peripheral blood lymphocytes (PBLs) in IgAN patients, compared with the controls, and that abnormal HLA-DQB1 and HLA-DRB1 expression may aggravate the progression of IgAN [34], suggesting the possible involvement of the abnormal expression of both genes in the pathogenesis of IgAN. Abnormal mRNA expression of some HLA genes has been observed in many autoimmune diseases such as lupus and was found to be related to DNA methylation [35, 36]. It should be noted that HEIDI test was significant for some of the HLA genes except HLA-DRB6, HLA-DQA1, HLA-DRB1, HLA-DQA2 (Additional file 1: Tabel S4), indicating the existence of pleiotropy. These findings indicated that inflammation and DNA methylation might be two possible mechanisms underlying the HLA’s involvement in IgAN. More studies are needed to elucidate the exact functions of HLA genes in the pathogenesis of IgAN.

In SMR analysis, we also found that two genes in the complement component C4 family, including C4A and C4B, whose expressions showed significantly pleiotropic association with IgAN using GTEx eQTL data (Table 2). Both genes are mapped in III region of the MHC on chromosome 6p21.3 [37]. The two genes, together with three other neighboring genes including RP (serine–threonine kinase), CYP21 (steroid 21-hydroxylase) and TNX (tenascin-X), form a genetic unit called RCCX module (RP-C4A-CYP21-TNX or RP-C4B-CYP21-TNX) which determines gene copy number (GCN) variation [38]. GCN of the two genes was found to be associated with many autoimmune diseases. For example, a previous meta-analysis found that low C4A GCNs were associated with increased risk of SLE in Caucasian populations [39]. The expression of C4A and C4B was significantly upregulated in glomeruli of patients with IgAN [40]. The whole complement system can be activated by three pathways, including the classical pathway, the lectin binding pathway, and the alternative pathway, and C4A and C4B are likely involved in the classical pathway [41]. However, the exact roles of the two genes and the mechanisms underlying the pathogenesis of IgAN remain to be explored.

Our SMR analyses were based on three core assumptions: (1) The genetic variants used as IVs were associated with gene expression (i.e., concern of weak IV); (2) The genetic variants were not associated with confounders that bias the association of gene expression with IgAN; and (3) The genetic variants are related with IgAN only through their association with gene expression. Concerns about these assumptions are minimal, moderate, or cannot be verified directly. For Assumption 1, the SMR analysis assumed a P value threshold of 5 × 10− 8 to select the top associated eQTL. Therefore, the selected genetic variants indeed showed strong association with gene expression, and we believe that the concern about weak IV is minimal. The basis for Assumption 2 is that the genetic variants are not associated with socioeconomic and behavioural traits that commonly confound the effect of exposure (i.e., gene expression) on outcome (i.e., IgAN risk). Our SMR analyses used summarized data; therefore, we did not have data to directly test this assumption. Assumption 3 is regarding horizontal pleiotropy which can skew the MR results. Recent research indicated that horizontal pleiotropy was detectable in more than 48% of significant MR causalities, yielding an average bias of −131% to 201% in MR estimates. The existence of horizontal pleiotropy can induce false-positive causal findings in up to 10% of relationships [42]. For some of the identified genes, we did notice that the HEIDI test was significant, indicating that we should interpret these results with caution.

Our study has some limitations. The number of eligible probes and the sample size of the eQTL for the SMR analysis was limited, especially the GTEx eQTL. Moreover, we used FDR to correct for multiple testing. Together, we may miss some important genes that were not tagged in the eQTL data or filtered out by FDR. The HEIDI test indicated the existence of horizontal pleiotropy for some of the observed associations (Additional file 1: Table S4). Our SMR analysis used eQTL data from the blood as the sample size for eQTL data in kidney is rather limited in GTEx V7. It would be interesting to explore whether the findings still hold using kidney eQTL data which are based on larger sample sizes. Moreover, the GWAS data are based on mixed ancestries including Europeans and East Asians. We did not perform SMR analysis in subjects of European ancestry because the GWAS summarized data for European ancestry are not publicly available. In our SMR analyses, the exposure is gene expression which may be influenced by genetic variants. The SMR analyses, however, cannot distinguish between pleiotropy and causality. We did not perform genetic colocalization analysis [43], which aims to assess whether two traits are affected by the same or distinct causal variants and therefore serves as a good complement to the SMR analysis. We used the default setting in the SMR analyses; therefore, we only examined pleiotropic association in cis regions but not in trans regions. Future studies are needed to explore genes in trans regions in pleiotropic association with IgAN. The SMR approach adopted in this paper used a single instrument. Traditional methods, such as SMR, MR-Egger and median based regression, only make use of a single SNP instrument or multiple independent SNP instruments. Some new MR-based methods have been recently developed, such as the probabilistic Mendelian randomization method named PMR-Egger which can accommodate multiple correlated instruments and can control horizontal pleiotropic effects [44]. PMR-Egger was found to yield calibrated values across a wide range of scenarios and improve the power of MR analysis over existing approaches, potentially leading to better replication and experimental validation on the top identified genes. Future studies that use these novel methods are needed to validate our findings.


In summary, we performed SMR and FUMA analysis and identified many genetic variants/loci that are potentially involved in the pathogenesis of IgAN. More studies are needed to elucidate the exact mechanism of the identified genetic variants/loci in the etiology of IgAN.

Availability of data and materials

The datasets supporting the conclusions of this article are available at, and



BioBank Japan


Consortium for the architecture of gene expression


Expression quantitative trait loci


End-stage renal disease


False discovery rate




Genome-wide association study


Gene-set enrichment analysis


Genotype-tissue expression


Heterogeneity in dependent instruments


Human leukocyte antigen


Immunoglobulin A nephropathy


Linkage disequilibrium


Major histocompatibility complex


Mendelian randomization


Randomized controlled trial


Systemic lupus erythematosus


Summary data-based Mendelian randomization


  1. Berger J, Hinglais N. Intercapillary deposits of IgA–IgG. J Urol Nephrol (Paris). 1968;74(9):694–5.

    CAS  Google Scholar 

  2. D’Amico G. The commonest glomerulonephritis in the world: IgA nephropathy. Q J Med. 1987;64(245):709–27.

    Google Scholar 

  3. Roberts IS. Pathology of IgA nephropathy. Nat Rev Nephrol. 2014;10(8):445–54.

    Article  CAS  Google Scholar 

  4. Li PK, Ho KK, Szeto CC, Yu L, Lai FM. Prognostic indicators of IgA nephropathy in the Chinese–clinical and pathological perspectives. Nephrol Dial Transplant. 2002;17(1):64–9.

    Article  CAS  Google Scholar 

  5. Le W, Liang S, Hu Y, Deng K, Bao H, Zeng C, et al. Long-term renal survival and related risk factors in patients with IgA nephropathy: results from a cohort of 1155 cases in a chinese adult population. Nephrol Dial Transplant. 2012;27(4):1479–85.

    Article  CAS  Google Scholar 

  6. Wakai K, Kawamura T, Matsuo S, Hotta N, Ohno Y. Risk factors for IgA nephropathy: a case-control study in Japan. Am J Kidney Dis. 1999;33(4):738–45.

    Article  CAS  Google Scholar 

  7. Wakai K, Nakai S, Matsuo S, Kawamura T, Hotta N, Maeda K, et al. Risk factors for IgA nephropathy: a case-control study with incident cases in Japan. Nephron. 2002;90(1):16–23.

    Article  Google Scholar 

  8. Schena FP, Nistor I. Epidemiology of IgA nephropathy: a global perspective. Semin Nephrol. 2018;38(5):435–42.

    Article  Google Scholar 

  9. Feehally J, Farrall M, Boland A, Gale DP, Gut I, Heath S, et al. HLA has strongest association with IgA nephropathy in genome-wide analysis. J Am Soc Nephrol. 2010;21(10):1791–7.

    Article  CAS  Google Scholar 

  10. Gharavi AG, Kiryluk K, Choi M, Li Y, Hou P, Xie J, et al. Genome-wide association study identifies susceptibility loci for IgA nephropathy. Nat Genet. 2011;43(4):321–7.

    Article  CAS  Google Scholar 

  11. Yu XQ, Li M, Zhang H, Low HQ, Wei X, Wang JQ, et al. A genome-wide association study in Han Chinese identifies multiple susceptibility loci for IgA nephropathy. Nat Genet. 2011;44(2):178–82.

    Article  Google Scholar 

  12. Li M, Foo JN, Wang JQ, Low HQ, Tang XQ, Toh KY, et al. Identification of new susceptibility loci for IgA nephropathy in Han Chinese. Nat Commun. 2015;6:7270.

    Article  CAS  Google Scholar 

  13. Kiryluk K, Li Y, Scolari F, Sanna-Cherchi S, Choi M, Verbitsky M, et al. Discovery of new risk loci for IgA nephropathy implicates genes involved in immunity against intestinal pathogens. Nat Genet. 2014;46(11):1187–96.

    Article  CAS  Google Scholar 

  14. Kiryluk K, Li Y, Moldoveanu Z, Suzuki H, Reily C, Hou P, et al. GWAS for serum galactose-deficient IgA1 implicates critical genes of the O-glycosylation pathway. PLoS Genet. 2017;13(2):e1006609.

    Article  Google Scholar 

  15. Lawlor DA, Davey Smith G, Kundu D, Bruckdorfer KR, Ebrahim S. Those confounded vitamins: what can we learn from the differences between observational versus randomised trial evidence? Lancet. 2004;363(9422):1724–7.

    Article  Google Scholar 

  16. Lawlor DA, Smith GD. Cardiovascular risk and hormone replacement therapy. Curr Opin Obstet Gynecol. 2006;18(6):658–65.

    Article  Google Scholar 

  17. Bhide A, Shah PS, Acharya G. A simplified guide to randomized controlled trials. Acta Obstet Gynecol Scand. 2018;97(4):380–7.

    Article  Google Scholar 

  18. Davey Smith G, Hemani G. Mendelian randomization: genetic anchors for causal inference in epidemiological studies. Hum Mol Genet. 2014;23(R1):R89–98.

    Article  CAS  Google Scholar 

  19. Mo X, Guo Y, Qian Q, Fu M, Lei S, Zhang Y, et al. Mendelian randomization analysis revealed potential causal factors for systemic lupus erythematosus. Immunology. 2020;159(3):279–88.

    Article  CAS  Google Scholar 

  20. Porcu E, Rueger S, Lepik K, e QC, Consortium B, Santoni FA, et al. Mendelian randomization integrating GWAS and eQTL data reveals genetic determinants of complex and clinical traits. Nat Commun. 2019;10(1):3300.

    Article  Google Scholar 

  21. Liu D, Yang J, Feng B, Lu W, Zhao C, Li L. Mendelian randomization analysis identified genes pleiotropically associated with the risk and prognosis of COVID-19. J Infect. 2021;82(1):126–32.

    Article  CAS  Google Scholar 

  22. Watanabe K, Taskesen E, van Bochoven A, Posthuma D. Functional mapping and annotation of genetic associations with FUMA. Nat Commun. 2017;8(1):1826.

    Article  Google Scholar 

  23. Sakaue S, Kanai M, Tanigawa Y, Karjalainen J, Kurki M, Koshiba S, et al. A cross-population atlas of genetic associations for 220 human phenotypes. Nat Genet. 2021;53(10):1415–24.

    Article  CAS  Google Scholar 

  24. Nagai A, Hirata M, Kamatani Y, Muto K, Matsuda K, Kiyohara Y, et al. Overview of the BioBank Japan Project: study design and profile. J Epidemiol. 2017;27(3S):2–S8.

    Article  Google Scholar 

  25. Bycroft C, Freeman C, Petkova D, Band G, Elliott LT, Sharp K, et al. The UK Biobank resource with deep phenotyping and genomic data. Nature. 2018;562(7726):203–9.

    Article  CAS  Google Scholar 

  26. Lloyd-Jones LR, Holloway A, McRae A, Yang J, Small K, Zhao J, et al. The genetic architecture of gene expression in peripheral blood. Am J Hum Genet. 2017;100(2):371.

    Article  CAS  Google Scholar 

  27. Consortium G. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science. 2020;369(6509):1318–30.

    Article  Google Scholar 

  28. Zhu Z, Zhang F, Hu H, Bakshi A, Robinson MR, Powell JE, et al. Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat Genet. 2016;48(5):481–7.

    Article  CAS  Google Scholar 

  29. Mazzone R, Zwergel C, Artico M, Taurone S, Ralli M, Greco A, et al. The emerging role of epigenetics in human autoimmune disorders. Clin Epigenetics. 2019;11(1):34.

    Article  Google Scholar 

  30. Patel R, Mickey MR, Terasaki PI. Leucocyte antigens and disease. I. Association of HL-A2 and chronic glomerulonephritis. Br Med J. 1969;2(5654):424–6.

    Article  CAS  Google Scholar 

  31. Jiyun Y, Guisen L, Li Z, Yi S, Jicheng L, Fang L, et al. The genetic variants at the HLA-DRB1 gene are associated with primary IgA nephropathy in Han Chinese. BMC Med Genet. 2012;13(1):33.

    Article  Google Scholar 

  32. Akiyama F, Tanaka T, Yamada R, Ohnishi Y, Tsunoda T, Maeda S, et al. Single-nucleotide polymorphisms in the class II region of the major histocompatibility complex in Japanese patients with immunoglobulin A nephropathy. J Hum Genet. 2002;47(10):532–8.

    Article  CAS  Google Scholar 

  33. Cao HX, Li M, Nie J, Wang W, Zhou SF, Yu XQ. Human leukocyte antigen DRB1 alleles predict risk and disease progression of immunoglobulin A nephropathy in Han Chinese. Am J Nephrol. 2008;28(4):684–91.

    Article  CAS  Google Scholar 

  34. Zhan X, Deng F, Wang AY, Chen Q, Du Y, Wang Q, et al. HLA-DQB1 and HLA-DRB1 expression is associated with disease severity in IgAN. Ann Palliat Med. 2021;10(9):9453–66.

    Article  Google Scholar 

  35. Miller S, Tsou PS, Coit P, Gensterblum-Miller E, Renauer P, Rohraff DM, et al. Hypomethylation of STAT1 and HLA-DRB1 is associated with type-I interferon-dependent HLA-DRB1 expression in lupus CD8 + T cells. Ann Rheum Dis. 2019;78(4):519–28.

    Article  CAS  Google Scholar 

  36. Kular L, Liu Y, Ruhrmann S, Zheleznyakova G, Marabita F, Gomez-Cabrero D, et al. DNA methylation as a mediator of HLA-DRB1*15:01 and a protective variant in multiple sclerosis. Nat Commun. 2018;9(1):2397.

    Article  Google Scholar 

  37. Martinez OP, Longman-Jacobsen N, Davies R, Chung EK, Yang Y, Gaudieri S, et al. Genetics of human complement component C4 and evolution the central MHC. Front Biosci. 2001;6:D904–13.

    Article  CAS  Google Scholar 

  38. Yang Z, Mendoza AR, Welch TR, Zipf WB, Yu CY. Modular variations of the human major histocompatibility complex class III genes for serine/threonine kinase RP, complement component C4, steroid 21-hydroxylase CYP21, and tenascin TNX (the RCCX module). A mechanism for gene deletions and disease associations. J Biol Chem. 1999;274(17):12147–56.

    Article  CAS  Google Scholar 

  39. Li N, Zhang J, Liao D, Yang L, Wang Y, Hou S. Association between C4, C4A, and C4B copy number variations and susceptibility to autoimmune diseases: a meta-analysis. Sci Rep. 2017;7:42628.

    Article  CAS  Google Scholar 

  40. Park S, Yang SH, Jeong CW, Moon KC, Kim DK, Joo KW, et al. RNA-Seq profiling of microdissected glomeruli identifies potential biomarkers for human IgA nephropathy. Am J Physiol Renal Physiol. 2020;319(5):F809–F21.

    Article  CAS  Google Scholar 

  41. Selvaskandan H, Barratt J, Cheung CK. Immunological drivers of IgA nephropathy: exploring the mucosa-kidney link. Int J Immunogenet. 2022;49(1):8–21.

    Article  CAS  Google Scholar 

  42. Verbanck M, Chen CY, Neale B, Do R. Detection of widespread horizontal pleiotropy in causal relationships inferred from mendelian randomization between complex traits and diseases. Nat Genet. 2018;50(5):693–8.

    Article  CAS  Google Scholar 

  43. Zuber V, Grinberg NF, Gill D, Manipur I, Slob EAW, Patel A, et al. Combining evidence from mendelian randomization and colocalization: review and comparison of approaches. Am J Hum Genet. 2022;109(5):767–82.

    Article  CAS  Google Scholar 

  44. Yuan Z, Zhu H, Zeng P, Yang S, Sun S, Yang C, et al. Testing and controlling for horizontal pleiotropy with probabilistic mendelian randomization in transcriptome-wide association studies. Nat Commun. 2020;11(1):3861.

    Article  Google Scholar 

Download references


Not applicable.


This research was supported by National Natural Science Foundation of China (81870501) and Shanghai Science and Technology Committee (21Y11904200). Dr. Jingyun Yang’s research was supported by NIH/NIA Grants P30AG10161, R01AG15819, R01AG17917, R01AG033678, R01AG36042, U01AG61356, and 1RF1AG064312– 01.

Author information

Authors and Affiliations



MZ and JY conceptualized the study; YZ, GY and JY collected and analyzed data; QZ, KZ, JY and MZ wrote the original version of the paper. All authors contributed to subsequent revision and editing. All authors have read and agreed to the published version of the manuscript.

Corresponding authors

Correspondence to Jingyun Yang or Minmin Zhang.

Ethics declarations

Ethics approval and consent to participate

This study used only publicly available data. As a result, ethics approval and consent to participate is not needed.

Consent for publication

Not applicable.

Competing interests

No potential conflicts of interest were disclosed by the authors. All authors reviewed the study findings and read and approved the final version before submission.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

Basic information of the GWAS summary data, the default parameters of SMR analysis, and supplementary results.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, Q., Zhang, K., Zhu, Y. et al. Exploring genes for immunoglobulin A nephropathy: a summary data-based mendelian randomization and FUMA analysis. BMC Med Genomics 16, 16 (2023).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: