- Research article
- Open Access
A meta-analysis of public microarray data identifies gene regulatory pathways deregulated in peripheral blood mononuclear cells from individuals with Systemic Lupus Erythematosus compared to those without
BMC Medical Genomics volume 9, Article number: 66 (2016)
Systemic Lupus Erythematosus (SLE) is a complex, multi-systemic, autoimmune disease for which the underlying aetiological mechanisms are poorly understood. The genetic and molecular processes underlying lupus have been extensively investigated using a variety of -omics approaches, including genome-wide association studies, candidate gene studies and microarray experiments of differential gene expression in lupus samples compared to controls.
This study analyses a combination of existing microarray data sets to identify differentially regulated genetic pathways that are dysregulated in human peripheral blood mononuclear cells from SLE patients compared to unaffected controls. Two statistical approaches, quantile discretisation and scaling, are used to combine publicly available expression microarray datasets and perform a meta-analysis of differentially expressed genes.
Differentially expressed genes implicated in interferon signaling were identified by the meta-analysis, in agreement with the findings of the individual studies that generated the datasets used. In contrast to the individual studies, however, the meta-analysis and subsequent pathway analysis additionally highlighted TLR signaling, oxidative phosphorylation and diapedesis and adhesion regulatory networks as being differentially regulated in peripheral blood mononuclear cells (PBMCs) from SLE patients compared to controls.
Our analysis demonstrates that it is possible to derive additional information from publicly available expression data using meta-analysis techniques, which is particularly relevant to research into rare diseases where sample numbers can be limiting.
Systemic Lupus Erythematosus (SLE) is a multi-systemic autoimmune disease associated with high morbidity and mortality. At least nine out of ten lupus patients are female, with an onset age of between late teens and early forties. There is a significant difference in the prevalence of SLE occurring in different population and ethnic groups, and an occurrence of approximately 40 versus over 200 cases out of 100 000 persons among Northern European or black people respectively, has been reported [1, 2].
There is no predictable course for SLE, and patients can experience alternating periods of disease flare of varying severity; and remission, during which they have no obvious signs or symptoms. Lupus is also commonly misdiagnosed as it displays a broad range of clinical presentation resulting from inflammation and damage caused by the deposition of autoantibody-complexes in various tissues . There is a wide variety of contributing factors attributed to the active disease phenotype and the precise pathological mechanisms of SLE have not yet been fully elucidated. Extensive work, however, has shown its aetiology to be multifactorial, with strong evidence indicating a substantial genetic component to SLE [4–9]. Lupus is now believed to be the result of a complex model in which multiple genes influence the likelihood of establishing the disease state in response to different environmental triggers . The molecular basis underpinning this disease remains unclear, however, and more research is required to understand the mechanisms contributing to the lupus phenotype.
The genetic and molecular processes underlying lupus have been extensively investigated using a variety of -omics approaches, including genome-wide association studies, candidate gene studies and microarray experiments of differential gene expression in lupus samples compared to controls. Many of these data sets are in the public domain, and can be accessed freely by researchers [11–31]. These studies produce extensive and information-rich data sets that represent a snapshot of all genetic and/or molecular events occurring in a diseased cell at one particular point in time, and can be used to generate hypotheses on the molecular mechanisms underlying lupus.
The comparison and meta-analysis of such clinical datasets can be complicated by a variety of factors. Firstly, and especially with rare diseases such as SLE, sample size can be limiting both in the number of samples included in datasets and the number datasets within the public domain. Secondly, because participants in the study are usually in clinical care, it is often not possible to stratify cohorts by treatment regimes, particular clinical symptoms or participant demographics, and matched controls are often not available. Thirdly, the study design, participant inclusion criteria and sample type analysed are generally not standardised across studies. The wealth of information contained within these data sets is frequently underestimated, and often under-utilised once initial analyses have been completed.
In this study, we have aimed to address some of these factors by using an approach that focuses on identifying mechanistic pathways that may underlie SLE aetiology, rather than focusing on the identification of key individual genes. The benefits of addressing differential changes at a pathway level as opposed to a gene level have previously been described [32, 33]. We have analysed existing microarray data sets to identify regulatory networks and pathways that are dysregulated in SLE, including pathways that were not identified in the original individual studies. This is achieved using statistical approaches for meta-analysis of the combined results; using data from a variety of microarray expression experiments performed using different microarray platforms. We have conducted the analysis on the premise that not all genes in an aetiological pathway will necessarily show high levels of differential regulation; but that many genes of that pathway will be differentially regulated to some significant and detectable level. Our approach, therefore, uses less stringent criteria to select the differentially expressed gene lists followed by further more stringent pathway analysis. The first statistical approach to select differentially expressed genes is a binning method using quantile discretisation, and can analyse microarray datasets from different platforms; and the second independent method used to corroborate these results uses a classical scaled approach.
Our analysis demonstrates that it is possible to derive additional aetiological pathway information from publicly available expression data, using these meta-analysis techniques. This is particularly relevant to research into rare diseases where datasets tend to be fairly small, potentially limiting the statistical significance of findings from individual studies. Furthermore, this study highlights the value of sharing datasets in the public domain once primary analyses are completed.
Inclusion-exclusion criteria for datasets
Microarray studies investigating human peripheral blood mononuclear cells (PBMCs) or any subpopulation thereof (lymphocytes, monocytes or granulocytes) in at least four patients with SLE over the age of 16 were considered. SLE diagnosis satisfied the criteria set by the American College of Rheumatology  or similar. As we were particularly interested in genes differentially expressed during disease flare, lupus patients with disease activity scores of SLE Disease Activity Index (SLEDAI; [35, 36]) less than 6, or British Isles Lupus Assessment Group (BILAG) C, D or E [37, 38] were excluded. Patients on maintenance immunosuppressive treatment but still exhibiting an active disease phenotype (SLEDAI ≥ 6 and/or BILAG A or B) were included in the study. Samples that were cultured in any way after collection were excluded. Where only subsets of participants satisfying these criteria were identified, the data for these individuals were included in the analysis.
Data collection and pre-processing
The ArrayExpress database  was used to identify microarray studies investigating participants with SLE compared to healthy controls. The search term used was “systemic lupus” and results were filtered by organism (Homo sapiens) and experiment type (RNA array assay), in April 2013.
Raw data for Affymetrix data sets [GEO:GSE11909] , [GEO:GSE13887]  and [GEO:GSE38351]  were obtained from Gene Expression Omnibus (GEO; ) using the R package GEOquery . Samples fulfilling the specified criteria were Robust-Multi array Average normalised (Irizarry et al., 2003) using the simpleaffy R package [42, 43]. Another study using a custom spotted oligonucleotide array, [ArrayExpress:E-MTAB-145] , was also included. These raw data were imported into R using the ArrayExpress package , and the Limma package  was used for both print-tip loess within-array and quantile between-array normalisations of samples fulfilling the above criteria from E-MTAB-145.
Microarray probe identifiers (IDs) were converted to Ensemble IDs with a Python script accessing the Ensembl MySQL database (Ensembl 74: December 2013; ). Probe redundancy within each data set was resolved by averaging expression values for probe sets mapping to common Ensembl IDs and technical replicates were averaged, using a Python script.
Meta- and differential expression analysis of microarray studies
Common Ensembl IDs between data sets were normalised across data sets using two different methods. As the aim of this study was to identify mechanistic pathways involved in lupus aetiology as opposed to individual genes, we used a less stringent approach in the selection of genes so as to provide a larger gene set for the pathway analysis in which we applied more stringent selection criteria.
The first recently developed method for the meta-analysis of micro-array data, a binning approach, was quantile discretization . This method directly integrates disparate microarray data at the gene expression level and was first described to assess the benefit of performing supervised classification across disparate sources of microarray data . In this study, we optimised this method to perform statistical differential expression analysis with an optimum quantile discretisation range of 128.
In the second method a scaling approach was employed. We centered and scaled the expression values corresponding to each sample. The result of this operation was to get a mean equal to 0 and a sample standard deviation equal to 1 for each sample (across all genes), which tends to cancel out the inter-sample variations that are non significant for the purpose of our study on the differential expression of genes between cases and controls.
Significantly differentially expressed genes were identified using the Wilcoxon Rank Sum Test  with Benjamini-Hochberg correction for multiple testing . While we were able to apply a more stringent filter of p-value < 0.05 to the scaling dataset, we needed to relax this filter for the binning method to < 0.1 as all of the adjusted p-values obtained from this method were greater than 0.05.
Pathway and regulatory network analysis
Differentially expressed genes between SLE patients in disease flare and normal controls identified through the binning and scaling methods were investigated using Ingenuity Pathway Analysis (IPA; ). The IPA methodology compares proportional representation of genes from a defined test set in a canonical pathway (a known, well-characterised pathway), compared to the proportional representation of the pathway genes in the entire set of known genes. The p-value is calculated using a right-tailed Fisher Exact test and indicates the likelihood of the pathway association under the random model. The adjusted Benjamini-Hochberg p-value for a 5% FDR was calculated as 1.51 × 10−3 for the binning method and 2.3 × 10−3 for the scaling method, and controls for errors in selecting canonical pathways from a large set of options. The most over-represented canonical pathways enriched with differentially regulated genes were identified and an analysis of regulatory networks was undertaken. This was used to identify key common upstream transcription factors that may be driving cascades of differential gene regulation, as well as to identify any key node genes that might be crucial to the differential regulation of gene regulatory pathways and networks in the disease state.
Data collection and normalisation across studies
Four studies fulfilling the inclusion-exclusion criteria were selected for this meta-analysis study (Fig. 1). GSE11909 contributed 15 cases (no controls) from PBMCs , GSE13887  contributed 4 cases (no controls) from CD3-positive T cells , GSE38351 contributed 12 cases and 8 controls from monocytes, and E-MTAB-145 contributed 13 cases and 25 controls from PBMCs (Fig. 1, ). Normalised data sets GSE11909, GSE13887, GSE38351, E-MTAB-145 each contained 22,283, 54,675, 22,283 and 26,495 probes respectively. These probe sets were reduced to 14,806, 24,772, 14,806 and 17,407 genes (respectively), of which 11,933 genes (Ensembl ID) were common across all three data sets (Fig. 1).
Meta- and differential expression analysis of microarray studies
Two approaches were used to normalise the expression values across data sets. The binning and scaling methods resulted in lists of 749 (p-value < 0.1) and 597 (p-value < 0.05) differentially expressed genes respectively [see Additional file 1], with 458 of these genes common between the lists. We took the decision to use relatively relaxed p-value cutoffs for differential expression analysis in light of the fact that we were interested in identifying mechanistic pathways underlying lupus, rather than identifying individual key genes, and that we were using more stringent criteria in the pathway analysis component of the study.
Pathway and regulatory network analysis
The top overlapping canonical pathways identified by IPA using both binning and scaling gene lists were Agranulocyte Adhesion and Diapedesis and the Role of Pattern Recognition Receptors in Recognition of Bacteria and Viruses; both with p-values < 2 × 10−04 (Table 1). Other pathways identified included Interferon Signaling, Oxidative Phosphorylation and Toll-like Receptor Signaling from the binning method (p-value < 7 × 10−04), and the Role of Cytokines in Mediating Communication between Immune Cells, the Role of Hypercytokinemia/hyperchemokinemia in the Pathogenesis of Influenza and Role of Macrophages and Fibroblasts and Endothelial Cells in Rheumatoid Arthritis for the scaling method (p-value < 2 × 10−04). Genes assigned to the pathways are shown in Table 2. When using Benjamini-Hochberg adjusted p-values, the top five pathways from either method are significant within an FDR of 5%.
From these pathways, IPA calculated the top overlapping upstream regulators to be IFNL1, IFNA2, TNF and IRF7; all with p-values < 2 × 10−13 (Table 3). Tretinoin and IRF3 were additionally identified from the binning and scaling methods, respectively.
In this study, we have used publicly available microarray datasets to detect genes that are differentially regulated in blood cells from people with SLE when compared to controls without SLE. By combining the data from four separate gene expression studies in a meta-analysis, we hoped to be able to derive additional information from the data that possibly could not be established from the individual studies in isolation. In general, we found this to be the case, with confirmation of the previously reported findings of the separate studies; as well as some additional insights that were not reported in the individual papers. Despite the lack of control samples from two of the studies, the results of the meta-analysis do not seem to be biased towards the studies which included controls, as evidenced by identification of Toll-like Receptor Signaling and Oxidative Phosphorylation which were not identified in those original individual studies that contained controls. Although the ideal scenario would be an equal number of cases and controls where possible, using the combined meta-analysis approach and increasing the number of datasets for cases can still lead to detecting smaller effect sizes with accuracy even in the absence of a concomitant increase in number of control samples. A potential source of bias in this meta-analysis is the lack of controls for two of the studies, and because the study analyses previously published data this lack of controls for two studies cannot be addressed retrospectively. Whilst the meta-analysis including all studies increases sample size and statistical power, and the analysis is designed to control for false positives, we acknowledge that it is not possible to completely rule out the possibility of such bias. In particular, we focused our analysis on identifying aetiological pathways rather than specific aetiological genes, with inclusive criteria when selecting differentially regulated genes. The rationale behind this decision was two-fold: firstly, we aimed to select pathways for which a large proportion of genes are differentially regulated in preference to pathways where fewer genes are consistently subject to large fold changes in expression; and secondly, we aimed to accommodate the different characteristics and small sample sizes of the datasets used.
Activation of the interferon (IFN) pathway in lupus patients is well established , and is the common underlying theme found in all of the original individual studies used in this meta-analysis [16, 17, 26, 39], as well as in other similar meta-analysis studies of lupus data [53, 54]. In this meta-analysis, we similarly found that when using both the binning and scaling methods, IFN signaling was a top canonical pathway activated in lupus patients compared to controls (Table 1), providing a good positive control for our meta-analysis, and corroborating the findings of the individual analyses. Differentially regulated genes that are either integral to IFN signaling or lie directly downstream to the IFN pathways are shown in Fig. 2.
The IFN family of signaling proteins is a subset of cytokines with a protective function elicited in response to pathogenic species, such as bacteria or viruses. In lupus patients, type 1 IFN signaling stimulates persistent dendritic cell activation and has direct effects on B and T cell activation. Dendritic cells are able to selectively activate autoreactive T cells, while activated B cells seem to play a role in elevated autoantibody production and immune complex manifestation . The presence of pathogens also stimulates elevation of other cytokines that signal immune cells to migrate to places of infection. These activated immune cells are in turn stimulated to produce more cytokines thereby creating a positive feedback loop. In normal healthy individuals, this process is tightly controlled. However deregulation of this control, sometimes present in patients with rheumatic diseases, can lead to cytokine storm, or hypercytokinemia: a potentially fatal complication that can lead to severe tissue and organ damage. Interestingly, while hypercytokinemia is quite rarely found in adult lupus patients , it has been more commonly reported in juveniles with this disease . While our selection criteria only included individuals of over 16 years of age, the study by Chaussabel et al., was in fact on a group of pediatric lupus patients, of which we included 15 individuals over 16 years in our meta-analysis . This may have enriched the set of differentially regulated genes that we identified here for IFN signaling events that are implicated in hypercytokinemia.
In addition, however, our study of the combined data identified a number of other differentially regulated pathways in lupus patients compared to controls.
Of particular interest, was the detection of activated toll-like receptor (TLR) signaling (Table 1) and the role of pattern recognition receptors (PRRs) in recognition of bacteria and viruses. Importantly, the latter pathway was a similar result to Makashir et al.’s findings of enrichment of genes within the module of immune defense against extracellular organisms . This highlights the benefits of using multiple methodologies to explore existing datasets, as different analytical approaches may bring a variety of evidences together towards the same conclusions. The TLR class of proteins, a subset of PRRs, has an essential role in the mammalian innate immune response . Along with other PRRs, these proteins are able to recognise structurally conserved microbial molecular patterns, also known as pathogen-associated molecular patterns (PAMPs). Activation of TLRs leads to a complex response which involves activation of IFN signaling and can have downstream roles in both apoptosis and autophagy . While none of the original studies highlighted a change in this pathway for any of the naive patient samples, Smiljanovic et al. (2012) report TLR2 to be up-regulated in their tumor necrosis factor (TNF)-α in vitro-treated lupus monocytes compared to controls . Furthermore, polymorphisms in TLR3, TLR7 and TLR9 genes have been associated with lupus patients in some populations [60–62] and a number of in vivo and in vitro studies have strongly implicated a number of TLRs to play a role in the pathogenesis of lupus . By combining the datasets from the individual microarray studies here, we have similarly identified TLR signaling as a key regulatory mechanism that is differentially regulated in the PBMCs from lupus patients when compared to unaffected controls; and have demonstrated consistent differential regulation of TLR3.
Oxidative phosphorylation was another key pathway highlighted in our meta-analysis. This is a metabolic process through which ATP is produced by the release of energy from a series of redox reactions. These reactions involve the transfer of electrons between donor and acceptor pairs, performed by a group of protein complexes situated within the mitochondrial membrane. Simultaneously, this electron transport chain is coupled with the transport of protons across the membrane, setting up an electrochemical gradient within the inter-membrane space. ATP synthase is then able to exploit this gradient through chemiosmosis, allowing the phosphorylation of ADP, to produce ATP. Gene expression analysis by Fernandez et al., (2009) found increases in mitochondrial mass and membrane potential, as well as enhanced levels of NO production and intracellular calcium in lupus samples compared to controls. They also showed that the mammalian target of rapamycin (mTOR) activity is increased in this disease. mTOR is situated on the external mitochondrial membrane, and plays an essential role in the oxidative capacity of mitochondria, and pharmacological inhibition of mTOR by rapamycin has been shown to reduce the ATP generating capacity of mitochondria [63, 64]. Moreover, oxidative stress is known to be increased in lupus patients compared to healthy controls , and a number of oxidative stresses have been shown to influence oxidative phosphorylation [66, 67]. Taken together, it is likely that there will be an intimate interplay of changes in lupus patients between these processes and oxidative phosphorylation. Our meta-analysis has highlighted components of this pathway as being differentially regulated in PBMCs from lupus patients, suggesting that further research to explore a role for oxidative phosphorylation in SLE pathogenesis is warranted.
Diapedesis, or leukocyte extravasation, is the movement of white blood cells out of the vascular system via intact vessel walls into adjacent inflammation-affected tissue. As inflammation is a highly common disease manifestation found in lupus patients, it is reasonable to assume that diapedesis signaling will be markedly deregulated in lupus patients. This is further supported by the findings of Makashir et al., 2015 reporting an enrichment of genes involved in wounding and leukocyte cell migration modules within lupus patients, and importantly, both studies similarly identified FCGR3B as a key player within the associated pathway . Together with ITGB2, ITGAM forms the Macrophage-1 antigen (Mac-1) integrin or complement receptor 3 (CR3), known to be involved in leukocyte extravasation and phagocytosis. Notably, a number of polymorphisms within or near the gene encoding Integrin alpha M, or ITGAM, have been highly associated with genetic lupus risk factors, with the rs1143679 variant resulting in an R77H substitution in its gene product [68–71]. Interestingly, cells transfected with this mutant showed reduced ligand (ICAM-1 and ICAM-2) and complement (iC3b) binding abilities well as impaired iC3b-mediated phagocytosis, compared to those transfected with wild type ITGAM . Rhodes and co-workers confirmed most of these results in ex vivo monocytes (adhesion) and macrophages (phagocytosis) from wild type or R77H homozygous volunteers, and also reported reduced adhesion to fibrinogen and DC-SIGN, the dendritic cell-specific intercellular adhesion molecule-3Grabbing non-integrin receptor, in R77H homozygous monocytes . More recently, Fossati-Jimack and co-workers provided strong evidence suggesting that this variant’s role in lupus susceptibility is most likely due to its effects on clearance of cellular debris. They were only able to corroborate findings of reduced iC3b-directed phagocytosis in monocytes, macrophages, neutrophils and dendritic cells heterozygous for the R77H/R allele; the genotype most commonly associated with lupus disease . The dysregulation of these key functions of diapedesis and adhesion, detected in our study through differential expression of key molecules in PBMCs from SLE patients, may therefore be contributing to pathogenic mechanisms in SLE.
From the gene lists in Table 2, it can be seen that there is frequent overlap in gene membership between the different pathways identified. This reflects the strong interplay that exists between the pathway functions and regulatory mechanisms that drive dysregulated cellular mechanisms; and is consistent with the complexity of SLE aetiology whereby multiple downstream effects of gene dysregulation can result in the complex disease phenotype. Identification of common upstream regulators of these pathways, however, may provide insights into key molecules that are fundamental to the dysregulated cellular processes underlying SLE aetiology.
IFN-lambda 1 (IFNL1, also known as IL29), a type III IFN, has been proposed previously as a key molecule in renal disorder and arthritic progression in SLE . The role of IFN alpha in SLE has been extensively explored (reviewed in ), and IFN-alpha 2 (IFNA2) has recently been implicated in perpetuation of SLE disease activity . Furthermore, a number of members highlighted within the IFN pathway in this study have previously been reported in other meta-analyses, including IFIT1, IFIT3, MX1, IRF7 and OAS1 [53, 54]. TNF is already targeted by biologic therapeutic approaches , and IFN regulatory factors 3 and 7 (IRF3 and IRF7) have been implicated in the IFN signature in lupus previously [54, 79, 80]. Whilst pathway analysis highlighted tretinoin as a common upstream regulator of identified pathways, there is little information about a role for tretinoin, belonging to the family of retinoids, in SLE. Whilst some studies describe the use of retinoids for topical treatment of cutaneous lupus, for example ; and a case study describes the use of retinoids in treating patients with lupus nephritis ; any relationship that may exist between this molecule and SLE progression is, as yet, unclear.
It is important to remember that the cellular composition of PBMCs can change in the lupus disease state, with increases in myeloid and decreases in lymphoid lineages observed in lupus PBMCs compared to control samples. These changes can also bring about differences observed in gene regulation , depending on the cell population that is assayed. With this in mind, we are aware that the inclusion of specific monocyte and T cell subsets within our PBMC meta-analysis may bias the end result because of exclusion of the other cell types in generating these datasets. We think it is likely, however, that this bias would be most likely to dilute the observed differential expression between disease state and controls; rather than enriching a signal from the selected cell types. If this were the case, this would tend to result in concealment of a gene signature of interest, rather than indicating false positive differential regulation that does not occur in all datasets.
This study demonstrates the value that can be gained from using appropriate statistical methods to combine expression datasets from multiple studies that compare gene expression between PBMCs from SLE patients compared to controls. The methods used here are robust even across different microarray platforms, and the pathways that are enriched for differentially regulated gene expression are consistent with the primary findings of the individual studies. Furthermore, the meta-analysis of multiple datasets has detected gene expression signatures additional to those described by the individual studies; and these regulatory pathways have been implicated in SLE through independent research approaches, confirming the validity of this meta-analysis. It is likely that the knowledgebase of regulatory pathways has been expanded since the individual studies were undertaken, and this may contribute in part to the increased range of regulatory mechanisms that were identified by the current meta-analysis. This further indicates the value in revisiting earlier datasets to extract more information about molecular processes underlying the disease state.
Such approaches may have particular value in the study of rare diseases, where study size can be limiting; and also where available datasets are clinically heterogeneous and cannot be stratified by stringent clinical or sampling criteria. Whilst bioinformatics analyses of existing datasets are currently unlikely to provide unequivocal clinical answers to disease aetiology, we believe that expanding the range of methodologies to explore such datasets can identify new mechanistic pathways to explore in ongoing and future research. Such studies highlight the utility of controlled public access to existing datasets, where ethical requirements for data re-use can be met, in order to increase our understanding of molecular mechanisms contributing to the aetiology of rare diseases.
Gene Expression Omnibus
Ingenuity pathway analysis
Interferon regulatory factors
Mammalian target of rapamycin
Peripheral blood mononuclear cells
Pattern recognition receptors
Systemic Lupus Erythematosus
Tumor necrosis factor
Danchenko N, Satia J, Anthony M. Epidemiology of Systemic Lupus Erythematosus: a comparison of worldwide disease burden. Lupus. 2006;15:308–18.
Johnson AE, Gordon C, Palmer RG, Bacon PA. The prevalence and incidence of Systemic Lupus Erythematosus in Birmingham, England. Relationship to ethnicity and country of birth. Arthritis Rheum. 1995;38:551–8.
Rahman A, Isenberg DA. Systemic Lupus Erythematosus. N Engl J Med. 2008;358:929–39.
Alarcón-Segovia D, Alarcón-Riquelme ME, Cardiel MH, Caeiro F, Massardo L, Villa AR, et al. Familial aggregation of Systemic Lupus Erythematosus, rheumatoid arthritis, and other autoimmune diseases in 1,177 lupus patients from the GLADEL cohort. Arthritis Rheum. 2005;52:1138–47.
Block SR. A brief history of twins. Lupus. 2006;15:61–4.
Block SR, Winfield JB, Lockshin MD, D'Angelo WA, Christian CL. Studies of twins with Systemic Lupus Erythematosus. A review of the literature and presentation of 12 additional sets. Am J Med. 1975;59:533–52.
Deapen D, Escalante A, Weinrib L, Horwitz D, Bachman B, Roy-Burman P, et al. A revised estimate of twin concordance in Systemic Lupus Erythematosus. Arthritis Rheum. 1992;35:311–8.
Hochberg MC. The application of genetic epidemiology to systemic lupus erythematosus. J Rheumatol. 1987;14:867–9.
Lawrence JS, Martins CL, Drake GL. A family survey of lupus erythematosus. 1. Heritability J Rheumatol. 1987;14:913–21.
Glazier AM, Nadeau JH, Aitman TJ. Finding genes that underlie complex traits. Science. 2002;298:2345–9.
Abbas AR, Wolslegel K, Seshasayee D, Modrusan Z, Clark HF. Deconvolution of blood microarray data identifies cellular activation patterns in Systemic Lupus Erythematosus. PLoS ONE. 2009. doi:10.1371/journal.pone.0006098.
Allantaz F, Chaussabel D, Stichweh D, Bennett L, Allman W, Mejias A, et al. Blood leukocyte microarrays to diagnose systemic onset juvenile idiopathic arthritis and follow the response to IL-1 blockade. J Exp Med. 2007;204:2131–44.
Barrett T, Troup DB, Wilhite SE, Ledoux P, Evangelista C, Kim IF, et al. NCBI GEO: archive for functional genomics data sets--10 years on. Nucleic Acids Res. 2010;39:D1005–10.
Bernales I, Fullaondo A, Marín-Vidalled MJ, Ucar E, Martínez-Taboada V, López-Hoyos M, et al. Innate immune response gene expression profiles characterize primary antiphospholipid syndrome. Genes Immun. 2007;9:38–46.
Berry MPR, Graham CM, Mcnab FW, Xu Z, Bloch SAA, Oni T, et al. An interferon-inducible neutrophil-driven blood transcriptional signature in human tuberculosis. Nature. 2010;466:973–7.
Chaussabel D, Quinn C, Shen J, Patel P, Glaser C, Baldwin N, et al. A modular analysis framework for blood genomics studies: application to systemic lupus erythematosus. Immunity. 2008;29:150–64.
Fernandez DR, Telarico T, Bonilla E, Li Q, Banerjee S, Middleton FA, et al. Activation of mammalian target of rapamycin controls the loss of TCRζ in lupus T cells through HRES-1/Rab4-regulated lysosomal degradation. J Immunol. 2009;182:2063–73.
Garaud JC, Schickel JN, Blaison G, Knapp AM, Dembele D, Ruer-Laventie J, et al. B Cell Signature during Inactive Systemic Lupus Is Heterogeneous: Toward a Biological Dissection of Lupus. PLoS ONE. 2011. doi:10.1371/journal.pone.0023900.
Garcia-Romo GS, Caielli S, Vega B, Connolly J, Allantaz F, Xu Z, et al. Netting neutrophils are major inducers of type I IFN production in pediatric Systemic Lupus Erythematosus. Sci Transl Med. 2011. doi:10.1126/scitranslmed.3001201.
Hutcheson J, Scatizzi JC, Siddiqui AM, Haines GK, Wu T, Li QZ, et al. Combined deficiency of proapoptotic regulators bim and fas results in the early onset of systemic autoimmunity. Immunity. 2008;28:206–17.
Javierre BM, Fernandez AF, Richter J, Al-Shahrour F, Martin-Subero JI, Rodriguez-Ubreva J, et al. Changes in the pattern of DNA methylation associate with twin discordance in systemic lupus erythematosus. Genome Res. 2010;20:170–9.
Jeffries MA, Dozmorov M, Tang Y, Merrill JT, Wren JD, Sawalha AH. Genome-wide DNA methylation patterns in CD4+ T cells from patients with systemic lupus erythematosus. Epigenetics. 2011;6:593–601.
Kahlenberg JM, Thacker SG, Berthier CC, Cohen CD, Kretzler M, Kaplan MJ. Inflammasome activation of IL-18 results in endothelial progenitor cell dysfunction in systemic lupus erythematosus. J Immunol Baltim Md 1950. 2011;187:6143–56.
Lee HM, Mima T, Sugino H, Aoki C, Adachi Y, Yoshio-Hoshino N, et al. Interactions among type I and type II interferon, tumor necrosis factor, and β-estradiol in the regulation of immune response-related gene expressions in systemic lupus erythematosus. Arthritis Res Ther. 2009. doi:10.1186/ar2584.
Lee HM, Sugino H, Aoki C, Nishimoto N. Underexpression of mitochondrial-DNA encoded ATP synthesis-related genes and DNA repair genes in systemic lupus erythematosus. Arthritis Res Ther. 2011. doi:10.1186/ar3317.
Lyons PA, Mckinney EF, Rayner TF, Hatton A, Woffendin HB, Koukoulaki M, et al. Novel expression signatures identified by transcriptional analysis of separated leucocyte subsets in systemic lupus erythematosus and vasculitis. Ann Rheum Dis. 2010;69:1208–13.
Mckinney EF, Lyons PA, Carr EJ, Hollis JL, Jayne DRW, Willcocks LC, et al. A CD8+ T cell transcription signature predicts prognosis in autoimmune disease. Nat Med. 2010;16:586–91.
O'Hanlon TP, Rider LG, Gan L, Fannin R, Paules RS, Umbach DM, et al. Gene expression profiles from discordant monozygotic twins suggest that molecular pathways are shared among multiple systemic autoimmune diseases. Arthritis Res Ther. 2011. doi:10.1186/ar3330.
Parkinson H, Sarkans U, Kolesnikov N, Abeygunawardena N, Burdett T, Dylag M, et al. ArrayExpress update--an archive of microarray and high-throughput sequencing-based functional genomics experiments. Nucleic Acids Res. 2011. doi:10.1093/nar/gkq1040.
Thacker SG, Berthier CC, Mattinzoli D, Rastaldi MP, Kretzler M, Kaplan MJ. The detrimental effects of IFN-α on vasculogenesis in lupus are mediated by repression of IL-1 pathways: potential role in atherogenesis and renal vascular rarefaction. J Immunol. 2010;185:4457–69.
Villanueva E, Yalavarthi S, Berthier CC, Hodgin JB, Khandpur R, Lin AM, et al. Netting neutrophils induce endothelial damage, infiltrate tissues, and expose immunostimulatory molecules in Systemic Lupus Erythematosus. J Immunol. 2011;187:538–52.
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005;102:15545–50.
Azuaje F, Devaux Y, Wagner DR. Integrative pathway-centric modeling of ventricular dysfunction after myocardial infarction. PLoS ONE. 2010. doi:10.1371/journal.pone.0026963.
Tan EM, Cohen AS, Fries JF, Masi AT, Mcshane DJ, Rothfield NF, et al. The 1982 revised criteria for the classification of systemic lupus erythematosus. Arthritis Rheum. 1982;25:1271–7.
Bombardier C, Gladman DD, Urowitz MB, Caron D, Chang CH. Derivation of the SLEDAI. A disease activity index for lupus patients. The committee on prognosis studies in SLE. Arthritis Rheum. 1992;35:630–40.
Gladman DD, Ibañez D, Urowitz MB. Systemic lupus erythematosus disease activity index 2000. J Rheumatol. 2002;29:288–91.
Hay EM, Bacon PA, Gordon C, Isenberg DA, Maddison P, Snaith ML, et al. The BILAG index: a reliable and valid instrument for measuring clinical disease activity in systemic lupus erythematosus. Q J Med. 1993;86:447–58.
Yee CS, Isenberg DA, Prabu A, Sokoll K, Teh LS, Rahman A, et al. BILAG-2004 index captures systemic lupus erythematosus disease activity better than SLEDAI-2000. Ann Rheum Dis. 2008;67:873–6.
Smiljanovic B, Grün JR, Biesen R, Schulte-Wrede U, Baumgrass R, Stuhlmüller B, et al. The multifaceted balance of TNF-α and type I/II interferon responses in SLE and RA: how monocytes manage the impact of cytokines. J Mol Med Berl Ger. 2012;90:1295–309.
Edgar R, Domrachev M, Lash AE. Gene expression omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res. 2002;30:207–10.
Davis S, Meltzer PS. GEOquery: a bridge between the Gene Expression Omnibus (GEO) and BioConductor. Bioinformatics. 2007;23:1846–7.
Irizarry RA, Hobbs B, Collin F, Beazer‐Barclay YD, Antonellis KJ, Scherf U, et al. Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics. 2003;4:249–64.
Wilson CL, Miller CJ. Simpleaffy: a BioConductor package for affymetrix quality control and data analysis. Bioinformatics. 2005;21:3683–5.
Kauffmann A, Rayner TF, Parkinson H, Kapushesky M, Lukk M, Brazma A, et al. Importing Arrayexpress datasets into R/Bioconductor. Bioinformatics. 2009;25:2092–4.
Smyth GK. limma: Linear Models for Microarray Data. In: Gentleman R, Carey VJ, Huber W, Irizarry RA, Dudoit S, editors. Bioinformatics and computational biology solutions using R and bioconductor. New York: Springer; 2005. p. 397–420.
Flicek P, Ahmed I, Amode MR, Barrell D, Beal K, Brent S, et al. Ensembl 2013. Nucleic Acids Res. 2013. doi:10.1093/nar/gks1236.
Mapiye DS, Christoffels AG, Gamieldien J. Identification of phenotype-relevant differentially expressed genes in breast cancer demonstrates enhanced quantile discretization protocol’s utility in multi-platform microarray data integration. J Bioinform Comput Biol. 2016. doi:10.1142/S0219720016500220.
Warnat P, Eils R, Brors B. Cross-platform analysis of cancer microarray data improves gene expression based classification of phenotypes. BMC Bioinformatics. 2005;6:265.
Wilcoxon F. Individual comparisons by ranking methods. Biom Bull. 1945;1:80.
Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B Methodol. 1995;57:289–300.
The data were analysed and networks generated through the use of QIAGEN’s ingenuity pathway analysis (IPA®. QIAGEN: Redwood City. www.qiagen.com/ingenuity. Accessed March 2015).
Elkon KB, Stone VV. Type I interferon and systemic lupus erythematosus. J Interferon Cytokine Res. 2011;31:803–12.
Arasappan D, Tong W, Mummaneni P, Fang H, Amur S. Meta-analysis of microarray data using a pathway-based approach identifies a 37-gene expression signature for systemic lupus erythematosus in human peripheral blood mononuclear cells. BMC Med. 2011;9:65.
Makashir SB, Kottyan LC, Weirauch MT. Meta-analysis of differntial gene co-expression: application to lupus. Pac Symp Biocomput. 2015;443–54.
Ardoin SP, Pisetsky DS. Developments in the scientific understanding of lupus. Arthritis Res Ther. 2008;10:218.
Granata G, Didona D, Stifano G, Feola A, Granata M. Macrophage activation syndrome as onset of Systemic Lupus Erythematosus: a case report and a review of the literature. Case Rep Med. 2015. doi:10.1155/2015/294041.
Parodi A, Davì S, Pringe AB, Pistorio A, Ruperto N, Magni-Manzoni S, et al. Macrophage activation syndrome in juvenile systemic lupus erythematosus: a multinational multicenter study of thirty-eight patients. Arthritis Rheum. 2009;60:3388–99.
Conti F, Spinelli FR, Alessandri C, Valesini G. Toll-like receptors and lupus nephritis. Clin Rev Allergy Immunol. 2011;40:192–8.
Salaun B, Romero P, Lebecque S. Toll-like receptors’ two-edged sword: when immunity meets apoptosis. Eur J Immunol. 2007;37:3311–8.
Enevold C, Kjær L, Nielsen CH, Voss A, Jacobsen RS, Hermansen MLF, et al. Genetic polymorphisms of dsRNA ligating pattern recognition receptors TLR3, MDA5, and RIG-I. Association with systemic lupus erythematosus and clinical phenotypes. Rheumatol Int. 2014;34:1401–8.
dos Santos B, Valverde JV, Rohr P, Monticielo OA, Brenol JCT, Xavier RM, et al. TLR7/8/9 polymorphisms and their associations in systemic lupus erythematosus patients from Southern Brazil. Lupus. 2012;21:302–9.
Shen N, Fu Q, Deng Y, Qian X, Zhao J, Kaufman KM, et al. Sex-specific association of X-linked toll-like receptor 7 (TLR7) with male systemic lupus erythematosus. Proc Natl Acad Sci U S A. 2010;107:15838–43.
Cunningham JT, Rodgers JT, Arlow DH, Vazquez F, Mootha VK, Puigserver P. mTOR controls mitochondrial oxidative function through a YY1-PGC-1alpha transcriptional complex. Nature. 2007;450:736–40.
Schieke SM, Phillips D, McCoy JP, Aponte AM, Shen RF, Balaban RS, et al. The mammalian target of rapamycin (mTOR) pathway regulates mitochondrial oxygen consumption and oxidative capacity. J Biol Chem. 2006;281:27643–52.
Perl A. Oxidative stress in the pathology and treatment of systemic lupus erythematosus. Nat Rev Rheumatol. 2013;9:674–86.
Siegel MP, Kruse SE, Knowels G, Salmon A, Beyer R, Xie H, et al. Reduced coupling of oxidative phosphorylation in vivo precedes electron transport chain defects due to mild oxidative stress in mice. PLoS One. 2011. doi:10.1371/journal.pone.0026963.
Vatassery GT, Santacruz KS, Demaster EG, Quach HT, Smith WE. Oxidative stress and inhibition of oxidative phosphorylation induced by peroxynitrite and nitrite in rat brain subcellular fractions. Neurochem Int. 2004;45:963–70.
Han S, Kim-Howard X, Deshmukh H, Kamatani Y, Viswanathan P, Guthridge JM, et al. Evaluation of imputation-based association in and around the integrin-alpha-M (ITGAM) gene and replication of robust association between a non-synonymous functional variant within ITGAM and systemic lupus erythematosus (SLE). Hum Mol Genet. 2009;18:1171–80.
Hom G, Graham RR, Modrek B, Taylor KE, Ortmann W, Garnier S, et al. Association of systemic lupus erythematosus with C8orf13-BLK and ITGAM-ITGAX. N Engl J Med. 2008;358:900–9.
International Consortium for Systemic Lupus Erythematosus Genetics (SLEGEN), Harley JB, Alarcón-Riquelme ME, Criswell LA, Jacob CO, Kimberly RP, et al. Genome-wide association scan in women with systemic lupus erythematosus identifies susceptibility variants in ITGAM, PXK, KIAA1542 and other loci. Nat Genet. 2008;40:204–10.
Nath SK, Han S, Kim-Howard X, Kelly JA, Viswanathan P, Gilkeson GS, et al. A nonsynonymous functional variant in integrin-alpha(M) (encoded by ITGAM) is associated with systemic lupus erythematosus. Nat Genet. 2008;40:152–4.
Macpherson M, Lek HS, Prescott A, Fagerholm SC. A Systemic Lupus Erythematosus-associated R77H substitution in the CD11b chain of the Mac-1 integrin compromises leukocyte adhesion and phagocytosis. J Biol Chem. 2011;286:17303–10.
Rhodes B, Fürnrohr BG, Roberts AL, Tzircotis G, Schett G, Spector TD, et al. The rs1143679 (R77H) lupus associated variant of ITGAM (CD11b) impairs complement receptor 3 mediated functions in human monocytes. Ann Rheum Dis. 2012;71:2028–34.
Fossati-Jimack L, Ling GS, Cortini A, Szajna M, Malik TH, McDonald JU, et al. Phagocytosis is the main CR3-mediated function affected by the lupus-associated variant of CD11b in human myeloid cells. PLoS One. 2013. doi:10.1371/journal.pone.0057082.
Wu Q, Yang Q, Lourenco E, Sun H, Zhang Y. Interferon-lambda1 induces peripheral blood mononuclear cell-derived chemokines secretion in patients with systemic lupus erythematosus: its correlation with disease activity. Arthritis Res Ther. 2011. doi:10.1186/ar3363.
Crow MK. Type I interferon in the pathogenesis of lupus. J Immunol Baltim Md 1950. 2014;192:5459–68.
Becker-Merok A, Østli-Eilersten G, Lester S, Nossent J. Circulating interferon-α2 levels are increased in the majority of patients with systemic lupus erythematosus and are associated with disease activity and multiple cytokine activation. Lupus. 2013;22:155–63.
Stohl W. Future prospects in biologic therapy for systemic lupus erythematosus. Nat Rev Rheumatol. 2013;9:705–20.
Smith S, Gabhann JN, Higgs R, Stacey K, Wahren-Herlenius M, Espinosa A, et al. Enhanced interferon regulatory factor 3 binding to the interleukin-23p19 promoter correlates with enhanced interleukin-23 expression in systemic lupus erythematosus. Arthritis Rheum. 2012;64:1601–9.
Sweeney SE. Hematopoietic stem cell transplant for systemic lupus erythematosus: Interferon regulatory factor 7 activation correlates with the IFN signature and recurrent disease. Lupus. 2011;20:975–80.
Seiger E, Roland S, Goldman S. Cutaneous lupus treated with topical tretinoin: a case report. Cutis. 1991;47:351–5.
Kinoshita K, Kishimoto K, Shimazu H, Nozaki Y, Sugiyama M, Ikoma S, et al. Successful treatment with retinoids in patients with lupus nephritis. Am J Kidney Dis. 2010;55:344–7.
We acknowledge Prof Lyon, from study EMTAB145, for helpful advice and the Centre for Proteomic and Genomic Research for access to Ingenuity Pathway Analysis software.
WK conducted this study with funding from the Claude Leon Foundation. DM and NT are funded by the H3Africa Kidney Disease Research Network under a cooperative agreement from the National Human Genome Research Institute (grant number 1U54HG006939-01). NT and JBDE are funded by the H3Africa Bioinformatics Network (NIH Common Fund, grant number U41HG006941). NT is funded by the National Research Foundation of South Africa, and the Medical Research Council of South Africa.
Availability of data and materials
The datasets used in this study have been previously described and are available in the Gene Expression Omnibus (GEO) database , with accession numbers GSE11909 , GSE13887  and GSE38351 , as well as the ArrayExpress database with accession number E-MTAB-145 .
WK participated in the study design, performed the data collection and pre-processing, participated in the meta-analysis and differential expression analysis and drafted the manuscript. DM participated in the meta-analysis and differential expression analysis and provided statistical analysis support. JBDE participated in the meta-analysis and differential expression analysis and provided statistical analysis support. NT conceived of the study and participated in its design and coordination, performed the pathway analysis and helped to draft the manuscript. All authors contributed to, and approved the final manuscript.
The authors declare that they have no competing interests.
About this article
Cite this article
Kröger, W., Mapiye, D., Entfellner, J.D. et al. A meta-analysis of public microarray data identifies gene regulatory pathways deregulated in peripheral blood mononuclear cells from individuals with Systemic Lupus Erythematosus compared to those without. BMC Med Genomics 9, 66 (2016). https://doi.org/10.1186/s12920-016-0227-0
- Systemic Lupus Erythematosus (SLE)
- Gene expression