Skip to main content

Information encoded in a network of inflammation proteins predicts clinical outcome after myocardial infarction

Abstract

Background

Inflammation plays an important role in cardiac repair after myocardial infarction (MI). Nevertheless, the systems-level characterization of inflammation proteins in MI remains incomplete. There is a need to demonstrate the potential value of molecular network-based approaches to translational research. We investigated the interplay of inflammation proteins and assessed network-derived knowledge to support clinical decisions after MI. The main focus is the prediction of clinical outcome after MI.

Methods

We assembled My-Inflamome, a network of protein interactions related to inflammation and prognosis in MI. We established associations between network properties, disease biology and capacity to distinguish between prognostic categories. The latter was tested with classification models built on blood-derived microarray data from post-MI patients with different outcomes. This was followed by experimental verification of significant associations.

Results

My-Inflamome is organized into modules highly specialized in different biological processes relevant to heart repair. Highly connected proteins also tend to be high-traffic components. Such bottlenecks together with genes extracted from the modules provided the basis for novel prognostic models, which could not have been uncovered by standard analyses. Modules with significant involvement in transcriptional regulation are targeted by a small set of microRNAs. We suggest a new panel of gene expression biomarkers (TRAF2, SHKBP1 and UBC) with high discriminatory capability. Follow-up validations reported promising outcomes and motivate future research.

Conclusion

This study enhances understanding of the interaction network that executes inflammatory responses in human MI. Network-encoded information can be translated into knowledge with potential prognostic application. Independent evaluations are required to further estimate the clinical relevance of the new prognostic genes.

Peer Review reports

Background

Cardiovascular disease is a major cause of death worldwide. Myocardial infarction (MI) often leads to heart failure (HF), which makes MI a leading source of morbidity and hospitalizations [1]. Different inflammation biomarkers with diagnostic and prognostic applications have been deployed in the clinical setting [2, 3]. Post-MI cell death triggers the activation of several complex inflammatory processes to clear dead cells and initiate heart tissue regeneration [47]. It has been suggested that such a coordinated regulation is fundamental to enable cardiac repair and recovery after MI [8]. C-reactive protein (CRP) and other pro-inflammatory cytokines are widely-applied biomarkers in the routine clinical setting [2, 6, 9]. For example, CRP's capacity to identify patients with high risk of developing HF has been demonstrated in different studies, including the Framingham Heart Study [10], though this can be achieved at the expense of low specificity [2].

Notwithstanding the widely-accepted application of different prognostic biomarkers after MI, including indicators of inflammation, there is a need to provide new insights into the complex mechanisms leading to HF. In addition, currently there is no single biomarker capable to accurately classify patients on the basis of their post-MI outcome [2]. Standard research approaches have been based on the analysis of differential expression of putative biomarkers, using gene or protein expression values [3, 4, 11]. The need to provide mechanistic understanding at a systems level, while improving the predictive ability of candidate biomarkers, makes network-based approaches a pertinent new research direction in this clinical area. This strategy not only entails the representation and analysis of networks of putative biomarkers, but also benefits from the integration of multiple sources of omic information.

The application of systems-driven approaches to discovering prognostic biomarkers and understanding disease biology has received relatively more attention in the areas of cancer and neurodegenerative diseases [1214]. We have previously demonstrated its potential to guide new knowledge discovery in translational research of MI [1519]. For instance, we have established quantitative connections between network topological features, modularity and prognostic outcome in post-MI [16]. Moreover, we have explored the prognostic value of new biomarkers relevant to angiogenesis and cardiac repair [15, 18]. Based on these outcomes and the lack of systems biology research of inflammation in the specific context of MI, we set out to investigate a comprehensive set of inflammation-implicated protein interactions and its potential prognostic value post-MI. The main focus of this investigation was to predict clinical outcome after MI, in particular left ventricular dysfunction.

Objective

The objective of our investigation is three-fold:

  1. 1.

    To characterize a comprehensive compendium of protein interactions relevant to inflammation and (post-MI) HF, through the implementation of an integrative biological network approach.

  2. 2.

    To establish qualitative and quantitative connections between network properties, disease biology and prognostic outcomes.

  3. 3.

    To test the potential of new network-derived prognostic biomarkers of post-MI ventricular dysfunction.

Methods

Research framework

Our investigation required the integration and analysis of multiples types of omic and clinical data, including external public information repositories and data generated at our laboratory. Figure 1 synthesizes the main phases and outputs of our research. First, using expert knowledge, we listed a set of well-known biomarkers in the context of inflammation and prognosis after MI. To reduce bias towards well-characterized biomarkers and to enable the discovery of new biological knowledge, this list of traditional biomarkers was expanded into a set of network seeds. This was accomplished by retrieving other functionally- and phenotypically-related genes from different public databases. These seeds were the product of a gene prioritization, which was established on the basis of their quantitative functional similarity to the traditional biomarkers. We then interrogated multiple protein-protein interaction (PPI) databases using the seeds as query inputs. This resulted in the extraction of their experimentally-validated PPIs in human, and the construction of a global network of PPIs in the MI and inflammation setting (My-Inflamome). This network was analyzed on the basis of different topological features to identify potential critical components and sub-networks.

Figure 1
figure 1

Research framework and analytical pipeline implemented.

The next phase aimed to answer the question of whether (and how) network-based information could be applied to aid in post-MI prognostic applications. More specifically, we investigated whether the genes encoding critical network components or involved in significant molecular modules could be applied to distinguish between post-MI clinical categories. To further demonstrate this translational research application, we built different classification systems based on microarray data from a small set of clinically meaningful genes. Models were evaluated using a standard cross-validation procedure. Furthermore, an experimental follow-up, using multiplex PCR, was conducted to assess opportunities for future research. These methods are described in more detail as follows.

My-Inflamome seeds

The initial set of well-investigated inflammation biomarkers included those commonly applied in clinical practice, as well as others discussed in review articles in the post-MI context (Results) [2, 3]. Genes functionally related to these biomarkers were searched in different public databases with the Endeavour system [20], which has previously been applied to different disease network investigations [16, 21]. This is a gene prioritization task in which the training set is represented by the set of well-known biomarkers, and candidate genes originate from whole-genome searches. Functional similarity between candidate and training set genes is established using different types of omic information, such as annotation databases, gene sequences and public gene expression datasets. Thus, functional relatedness is estimated based on data source-specific similarity criteria. A global, multi-source similarity score is computed for each candidate gene across these resources, which is then used to rank candidate genes. The reader is referred to [21] for details about this gene search and prioritization procedure. We concentrated on candidate genes with global prioritization score, PS, equal to or lower than 0.01. The union of the sets of retrieved candidate genes and the well-known biomarkers represented the seeds of My-Inflamome.

My-Inflamome construction

We obtained PPIs for the seeds using different public PPI databases: DIP [22], IntAct [23] and MINT [24] databases. These datasets have been shown to be complementary with high interactome quality and coverage [25]. We only retrieved PPIs validated in humans and focused on immediate (direct) seed-protein interactions. This aids in the reduction of the likelihood of retrieving PPIs not significantly related to the post-MI domain (false positives). The union of all retrieved PPIs defined My-Inflamome. We visualized and analyzed it with Cytoscape [26] and PolarMapper [27].

My-Inflamome analysis

In My-Inflamome, proteins and their pair-wise interactions are graphically represented as network nodes and edges respectively. We analyzed fundamental network topological features at the node level: node degree and traffic. The former refers to the number of edges associated with a node. The latter, also referred to as "betweeness centrality", is the number of shortest paths that go through the node, and which link any two other nodes in the network.

A group of highly-interconnected nodes can be defined as a "module". A module may be identified through network clustering, which meets specific statistical analysis criteria as explained below. There are various statistical or topological features that can be used to characterize the networks, its modules and individual components. High traffic nodes indicate their "bottleneck" property, which has been correlated with different biologically-meaningful features, including putative disease biomarkers and drug targets [2830]. In another study, we demonstrated the predictive potential of high traffic nodes in a global PPI network to detect patients with ventricular dysfunction [16].

Network modularity is another key topological property previously linked to disease biology and biomarker discovery [16, 18, 19, 31]. A network module can be defined as a highly-interconnected sub-network, whose number of connections is larger than that expected from randomly pairing its proteins. We applied a "greedy" network clustering algorithm that maximizes a modularity score, Q, defined as: Q = (numIME/numE) - (numIME/numE) random , with numIME and numE representing the number of edges in a module and the total number of network edges respectively. We implemented network module identification and other topological analyzes with PolarMapper [27].

We characterized the resulting network modules on the basis of their association with Gene Ontology (GO) Biological Process (BP) and Cellular Component (CC) annotations. We also examined statistically detectable associations between microRNAs (miRNAs) and RNA targets in each module, and which are annotated in the miRBase database [32]. These associations were computed through (two-tailed) Fisher's exact tests and corrected to account for multiple-hypothesis testing using the Benjamini & Hochberg procedure. We concentrated on associations with (corrected) P < 0.01. We completed this functional characterization of modules with the Fatigo tool, under the Babelomics (v4.1) platform [33]. The Statistica software was used for standard statistical analysis [34].

Prognostic biomarker discovery

To investigate the prognostic potential of critical nodes and modules, we examined the ability of their corresponding genes to distinguish between poor and good outcomes after MI (specified below). This was done on (blood-derived) gene expression data generated by microarray experiments at our laboratory (Gene Expression Omnibus, GEO, accession number: GSE11947) [35]. These data were obtained from circulating white blood cells, which play prominent roles in inflammation. Thus, we hypothesized that these cells better reflect inflammatory responses than secreted proteins. We first sought high traffic nodes and network modules showing strong differential expression capability, at the gene set level, across good and poor clinical outcome. The latter was estimated using the kipuMarkers approach [17, 36]. Genes defining highest traffic nodes and predictive network modules were also used as inputs to classification models based on logistic regression (Ridge estimator value: 1.0E-8). Classification performance was summarized with the area under the receiving operating curve (AUC), and estimated with a leave-one-out cross-validation procedure (LOOCV). Classification models were implemented with the Weka system [37]. Inputs to classifiers were re-scaled so that their means and variances were equal to 0 and 1 respectively. Also note that we used the standardised outputs of the logistic regression classifiers to calculate the AUC values.

Microarray gene expression data generation

We analyzed microarray gene expression profiles from 32 patients with good and poor post-MI clinical outcome (4 months after MI, 16 patients/category). Good outcome was defined as the preservation of left ventricular (LV) systolic function and high ejection fraction (EF) after MI (EF > 40%, median 63%, range 45-73%). A patient exhibited poor outcome when presenting impaired LV function and low EF (EF ≤ 40%, median 35%, range 20-40%). Clinical characteristics of patients selected, microarray data generation and pre-processing procedures are described in [18].

Experimental verification

We also performed a follow-up experimental replication using multiplex TaqMan PCR. Out of the 32 RNA samples extracted from the patients used in the microarray experiments, 11 samples were available for quantitative PCR determination. Thus, we measured the gene expression of key putative biomarkers using this sub-set of patients (11 patients, 7 with poor clinical outcome) as a first validation step.

Multiplex TaqMan assay

Total RNA was extracted from 2.5 mL of whole blood collected the day of MI by the PAXgeneTM technology and 1 μg of total RNA was reverse transcribed using SuperScriptTM II Reverse Transcriptase (Invitrogen, Merelbeke, Belgium) according to the manufacturer's protocol. Expression levels of TRAF2 (TNF receptor-associated factor 2), UBC (ubiquitin C) and SHKBP1 (SH3KBP1 binding protein 1) were assessed by quantitative PCR, in a multiplex TaqMan assay. SF3A1 was chosen as housekeeping gene for normalization. PCR was performed in a BioRad CFX96™ apparatus using the following primers and probes: TRAF2 forward primer GGCTTCTCCAAGACCCTCCTG, TRAF2 reverse primer TTCGTGGCAGCTCTCGTATTCT, TRAF2 probe TexasRed-ACTGTGCTGCCTGTGTTCACGAG-BHQ2, UBC forward primer AGCGAGCGTCCTGATCCTTC, UBC reverse primer CACCCGGCGCGTCCTTATAT, UBC probe FAM-AGTAGTCCCTTCTCGGCGATTCTG-BHQ1, SHKBP1 forward primer GACGGAGCAAGAGCTGATGGA, SHKBP1 reverse primer GCTGCGTTGTTCAAAAGGAAGTTT, SHKBP1 probe HEX-CCAGGAACTGGTGCGGAGTGG-BHQ2, SF3A1 forward primer AAGGGTCCAGTGTCCATCAAAGT, SF3A1 reverse primer GCCATGTTGTAGTAAGCCAGTGAG, SF3A1 probe Cy5-ACCAGGTCTCTGTCATTAAGGTGAAG-BHQ3. The multiplex reaction was performed on 4 μL of cDNA (previously diluted 10×) in a 20 μL reaction mix containing 1× AmpliTaq Gold buffer (Applied Biosystems, Halle, Belgium), 5 mM of MgCl2 (Applied Biosystems), 500 μM of each dNTPs (Invitrogen), 4 U of AmpliTaq Gold (Applied Biosystems), 400 nM of each TRAF2 primers and probe, 300 nM of each SHKBP1 primers and probe, 1 μM of each UBC primers, 500 nM of UBC probe and 200 nM of each SF3A1 primers and probe. The PCR reaction was composed of 1 cycle at 95°C for 8 min, followed by 60 cycles at 95°C for 30 s, 63.5°C for 30 s and 72°C for 30 s. Primer concentrations and annealing temperature were optimized to allow for multiplex detection. Regression was used as Ct determination mode for data analysis. Expression of TRAF2, UBC and SHKBP1 genes was calculated as: gene expression = 2(Ct SF3A1 - Ct gene).

In order to select the housekeeping gene for this experiment, we measured the expression of 12 different known housekeeping genes: ACTB, GAPDH, UBC, B2M, YWHAZ, SF3A1, 18S rRNA, CYC1, EIF4A2, SDHA, TOP1 and ATP5B, using the geNorm™ Housekeeping Gene Selection Kit (PrimerDesign). Results indicated that the expression of SF3A1 was the most stable between samples. We determined the amplification efficiency of each primer pair during the set up of this multiplex TaqMan qPCR essay: efficiencies varied from 93.1% to 102.8%, depending on the primer pair studied. This indicated high amplification efficiency for each gene. Moreover, we compared the expression of each gene in the multiplex reaction to its expression in a singleplex reaction. Ct values obtained for each gene were identical between multiplex and singleplex PCR, indicating that the multiplex reaction was not rate limiting.

Results

My-Inflamome: A novel resource for inflammation and post-MI research

My-Inflamome seed set consisted of 415 proteins, including 28 known biomarkers. Examples of seed proteins are: TNF, CRP, FAS and TNFRSF10A. My-Inflamome was composed of 2595 proteins and 6181 interactions in total. The single largest, interconnected network component (island) included 2532 nodes and 6131 interactions (mean number of interactions per node: 4.25). We focused on this network for subsequent analyses (Additional file 1). A global view of My-Inflamome is displayed in Figure 2. Figure 3 depicts the correlation between node degree and traffic in My-Inflamome. Examples of known and candidate biomarkers are highlighted on the plot. Because of the observed relatively strong correlation (Spearman correlation, ρ = 0.93, P = 1E-4) and the previously demonstrated functional importance of high traffic nodes, we concentrated on top high-traffic proteins in subsequent analyses (Table 1). We note that none of the top-10 high-traffic nodes were included in the list of known inflammation biomarkers in post-MI. On the basis of their traffic values, standard inflammation biomarkers CRP, IL6 and TNF ranked 227th, 478th and 607th respectively. This further indicates that our approach is not biased toward traditional biomarkers.

Figure 2
figure 2

My-Inflamome network. A. Global view. B. Modular view.

Figure 3
figure 3

Correlation between node degree and traffic in My-Inflamome.

Table 1 Top-10 high-traffic proteins in My-Inflamome.

My-Inflamome is biologically meaningful

We detected 21 highly-interconnected modules with diverse sizes and numbers of interactions in My-Inflamome (Table 2). The largest modules are strongly implicated in post-translational protein modifications and signal transduction processes (e.g., Modules 5, 10 and 16). Different modules are directly relevant to cell death and development processes (e.g., Modules 4, 8 and 14). Others appear specialized in immune responses (e.g., Module 9). Four modules were not statistically associated with GO BP terms (Modules 11, 12, 17 and 21). Module 1 is specialized in blood coagulation and wound healing processes. My-Inflamome modules operate across different cellular regions, including the nucleus, cytoplasm and the extra-cellular space. However, 7 modules were not significantly associated with specific cellular compartments.

Table 2 Topological description of My-Inflamome modules.

Because of the therapeutic and prognostic potential of these modules, we searched for major interactions between miRNAs and targets found in each module. We found statistically detectable evidence that Modules 6, 7 and 16 may be regulated by different miRNAs. For example, hsa-miR-335*, recently implicated in breast tumor suppression [38], is substantially linked to targets in Module 7, which in turn are significantly enriched in genes with transcriptional regulatory roles. Figure 4 summarizes this functional quantitative characterization of My-Inflamome modules on the basis of the most significant associations observed. Details of module composition and functional descriptions are available in Additional file 2.

Figure 4
figure 4

Functional characterization of My-Inflamome modules. Most statistically detectable associations with GO BP, CC and microRNAs. In heat map: P is the probability associated with the functional category observed in each module, colors reflect log-transformed values of P. BP: biological process, MF: molecular function, CC: cellular component. NA: No statistically detectable association. Modules are numbered as in Figure 2. The darker the color, the more significant the statistical association.

My-Inflamome guides the discovery of candidate prognostic biomarkers

To assess the predictive potential of these results, first we established the capacity of genes encoding high-traffic proteins and of modules to differentiate prognostic categories (good vs. poor prognosis). Among the top high-traffic proteins, IKBKE (inhibitor of kappa light polypeptide gene enhancer in B-cells, kinase epsilon), TRAF2 (TNF receptor-associated factor 2) and UBC (ubiquitin C) were found statistically differentially expressed at the (nominal) P = 0.02 level (Mann-Whitney U Test). The modules showing the highest differential (mean) expressions were Module 17 (corrected P = 0.02) and Module 11 (P = 0.06). Note that these modules are not statistically linked to specific biological processes, and only Module 11 appears circumscribed to a specific cellular region (Collagen type VI). Within Module 11, SHKBP1 (SH3KBP1 binding protein 1) is differentially expressed (Mann-Whitney U Test, nominal P = 0.006).

Motivated by these findings, we evaluated these genes and modules as inputs to (supervised) computational prognostic models. Different combinations of genes and modules were used as inputs to logistic regression models (Table 3). The highest classification performance (AUC = 0.84) was obtained when using the individual expression values measured in M11 and M17 (6 genes with available microarray data: CD5, PIK3C3, FBXL2, MIA3, SHKBP1 and ZC3H7A). This was followed by a model in which the mean expression values of M11 and M17 were applied as inputs (AUC = 0.8). The combination of module-derived and high-traffic genes also offered promising results (TRAF2, SHKBP1 and UBC, AUC = 0.83).

Table 3 Summary of classification performance of models defined by My-Inflamome information.

Replication analyses

To verify the validity of these candidate biomarkers, we replicated microarray experiments using an independent platform, quantitative PCR (Methods). In this first follow-up evaluation, we decided to focus on the prognostic signature defined by TRAF2, SHKBP1 and UBC. As a first classification model verification strategy, we tested the model derived from the microarray data on the multiplex PCR data (11 samples). The classification performance on this dataset was comparable to that obtained above (AUC = 0.79). In a second verification strategy, we re-built the classification model on the 11 PCR data samples and estimated its performance using LOOCV. This evaluation also reported classification capability comparable to previous performance using this signature (AUC = 0.77). Although this does not qualify as an independent evaluation of these potential novel biomarkers, these results at least suggest that it is viable to obtain adequate predictive concordance between prognostic models when tested on different data generated by independent expression measurement platforms.

Discussion

Significance

Although inflammation biomarkers are useful to support medical decision-making in the MI setting, the complexity and roles of the global interactions of these proteins have not been adequately elucidated. This is partly explained by the lack of systems-based approaches to bridge clinical (i.e., disease phenotypes) and molecular information (i.e., different types of omic information). The rationale for applying integrative, network-driven approaches is reinforced by the knowledge that the interactive activity of inflammation proteins may be functionally critical to multiple biological processes, which cannot be pinpointed by traditional hypothesis-driven research. Moreover, such an inflammation interactome is spatially distributed in the cell and subject to different regulatory control mechanisms acting in a combinatorial fashion. With an approach solely relying on traditional hypothesis-driven research or single-source molecular data, it would be difficult to accurately establish relevant connections between biomarkers and their prognostic value, while at the same time providing adequate mechanistic visualizations. In this paper, we laid out a foundation to address these challenges and discover potentially novel translational research applications.

The inflammation-related proteins TRAF2, SHKBP1 and UBCs were here associated with clinical outcome after MI based on a multi-source, functional characterization of network-derived properties. To accomplish this, we assembled a new compendium of interacting proteins that are interrelated, and not limited to known inflammation biomarkers in post-MI. My-Inflamome represents by itself a contribution to enable future independent investigations.

Despite their established use, standard inflammation markers are poor predictors of clinical outcome after MI. Prior to this investigation we did not have evidence to suggest that known biomarkers would appear as top high-traffic nodes. Our research indicates that indeed standard biomarkers do not necessarily act as network bottlenecks in the inflammation network. This confirms the known little predictive power of widely-applied inflammation biomarkers as shown by others and our previous research. Thus, our results offer a possible explanation for the lack of predictive capability of traditional markers: They have limited roles as central mediators or coordinators of inflammation responses in MI.

Among these putative biomarkers, SHKBP1 (SH3KBP1 binding protein 1) has not been widely characterized, though SH3KBP1 (SH3-domain kinase binding protein 1) is known to be implicated in apoptosis. Moreover, SHKBP1 has not been specifically linked to cardiovascular disease prior to our study. We found this new relationship by extracting SHKBP1 from a My-Inflamome module. Although this module was not statistically associated with specific biological processes or even specific cellular localizations, the combined expression values of the genes encoding this module's members showed relatively powerful prognostic capacity. SHKBP1 is over-expressed in patients with poor prognosis, and can actually offer relatively good classification performance when used as a single biomarker (AUC = 0.75, microarray data). Although not a necessary condition for effective integrated (multi-gene) classification, relatively strong differential expression of individual inflammation biomarkers may add to the potential clinical value of the biomarkers. Inflammation is a natural reaction to myocardial injury and the duration of these events largely determine heart remodelling. Thus, future research could offer more detailed quantitative relationships between individual levels of inflammation genes and their influence on cardiac damage or remodelling.

Inflammation can cause plaque rupture in the coronary arteries, which leads to MI. However, this type of inflammation is highly localized in the heart. Therefore, we cannot confer putative causative roles to our proposed circulating biomarkers.

Good classification performance was observed when replicating results with PCR data obtained from the same cohort of patients used in the model derivation phase. These results encourage additional investigations on independent, larger cohorts.

It is also crucial to indicate that standard clinical biomarkers, N-terminal pro-brain natriuretic peptide (NT-pro-BNP) and troponin T, have already been analyzed on the same (model derivation) cohort here investigated [18]. Prognostic models based on these markers reported lower performance (LOOCV, AUC < 0.7) than those obtained here with TRAF2, SHKBP1 and UBC.

An examination of the literature and annotated disease databases indicates that TRAF2, SHKBP1 and UBC have not been widely investigated in the context of cardiovascular disease, and that no quantitative links have been made to clinical response after MI. Only UBC has been previously linked to coronary artery disease severity [39]. Also we detected promising associations between miRNAs and My-Inflamome that could guide the design of new prognostic or therapeutic interventions based on "driving" regulatory modulation strategies. Considering that we did not incorporate expression information of miRNAs, the establishment of such statistically detectable associations underscores the predictive potential of our approach.

Overall, these observations underline the value of our approach to uncover potentially clinically relevant associations, which can be both novel and unbiased.

Limitations and future research

The level of certainty of our quantitative associations is constrained by the size of the datasets used in the model derivation and verification analyses of prognostic models. Nevertheless, we expect to expand our performance estimations and comparisons with alternative models as more data become available. Moreover, in this study we showed the feasibility of producing concordant results when using different data generation platforms.

It may be argued that My-Inflamome does not include all components with relevant involvement in inflammation-mediated responses to cardiac injury and remodeling. We acknowledge that the rate of false negatives may be reduced by implementing a more inclusive network inference process, e.g., by expanding the seed set. However, at this stage we emphasized quality, rather than coverage, to control the level of potentially spurious or false positive associations.

The clinical acceptance of new biomarkers firstly depends on the successful development of larger, independent evaluations. Moreover, future assessments should involve additional comparisons with standard post-MI biomarkers, including those not directly connected to inflammatory responses (e.g., NT-pro-BNP) [3]. Future research should incorporate measurements of standard clinical biomarkers, such as CRP or TNF-α [2]. To enhance the scientific exploitation of My-Inflamome and add value to our findings, we also plan to facilitate their user-friendly access and analysis via a Web-based interface. Researchers can currently have full access to My-Inflamome in a text-formatted file format for non-commercial application (Additional file 1).

Conclusion

In this research, we demonstrated how an integrative, network-driven strategy can improve systems-level mechanistic understanding of inflammation in MI. We also showed the translation of network-derived information into new knowledge with potential applications to assess prognostic outcome after MI. In particular, we reported new prognostic associations involving three genes: TRAF2, SHKBP1 and UBC. Potential clinical implications of our results, including the proposed biomarkers, will require independent studies.

Abbreviations

AUC:

Area under the receiver operating characteristic curve

BP:

GO biology process hierarchy

GO:

Gene Ontology

HF:

Heart failure

LOOCV:

Leave-one-out cross-validation

MI:

Myocardial infarction (MI).

References

  1. Lloyd-Jones D, Adams R, Carnethon M, et al: Heart disease and stroke statistics--2009 update: a report from the American Heart Association Statistics Committee and Stroke Statistics Subcommittee. Circulation. 2009, 119: e21-181.

    Article  PubMed  Google Scholar 

  2. Braunwald E: Biomarkers in heart failure. N Engl J Med. 2008, 358: 2148-59. 10.1056/NEJMra0800239.

    Article  CAS  PubMed  Google Scholar 

  3. Gerszten RE, Wang TJ: The search for new cardiovascular biomarkers. Nature. 2008, 451: 949-52. 10.1038/nature06802.

    Article  CAS  PubMed  Google Scholar 

  4. Chan D, Ng LL: Biomarkers in acute myocardial infarction. BMC Med. 2010, 8: 34-10.1186/1741-7015-8-34.

    Article  PubMed  PubMed Central  Google Scholar 

  5. Hansson GK: Inflammation, atherosclerosis, and coronary artery disease. N Engl J Med. 2005, 352: 1685-95. 10.1056/NEJMra043430.

    Article  CAS  PubMed  Google Scholar 

  6. Holme I, Aastveit AH, Hammar N, Jungner I, Walldius G: Inflammatory markers, lipoprotein components and risk of major cardiovascular events in 65,005 men and women in the Apolipoprotein MOrtality RISk study (AMORIS). Atherosclerosis. 2010, 213: 299-305. 10.1016/j.atherosclerosis.2010.08.049.

    Article  CAS  PubMed  Google Scholar 

  7. Kalogeropoulos A, Georgiopoulou V, Psaty BM, et al: Inflammatory markers and incident heart failure risk in older adults: the Health ABC (Health, Aging, and Body Composition) study. J Am Coll Cardiol. 2010, 55: 2129-37. 10.1016/j.jacc.2009.12.045.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Jiang B, Liao R: The paradoxical role of inflammation in cardiac repair and regeneration. J Cardiovasc Transl Res. 2010, 3: 410-6. 10.1007/s12265-010-9193-7.

    Article  PubMed  Google Scholar 

  9. Lavie CJ, Milani RV, Verma A, O'Keefe JH: C-reactive protein and cardiovascular diseases--is it ready for primetime?. Am J Med Sci. 2009, 338: 486-92. 10.1097/MAJ.0b013e3181c61b66.

    Article  PubMed  Google Scholar 

  10. Vasan RS, Sullivan LM, Roubenoff R, et al: Inflammatory markers and risk of heart failure in elderly subjects without prior myocardial infarction: the Framingham Heart Study. Circulation. 2003, 107: 1486-91. 10.1161/01.CIR.0000057810.48709.F6.

    Article  CAS  PubMed  Google Scholar 

  11. Azuaje F, Devaux Y, Wagner D: Computational biology for cardiovascular biomarker discovery. Brief Bioinform. 2009, 10: 367-77. 10.1093/bib/bbp008.

    Article  CAS  PubMed  Google Scholar 

  12. Goni J, Esteban FJ, de Mendizabal NV, et al: A computational analysis of protein-protein interaction networks in neurodegenerative diseases. BMC Syst Biol. 2008, 2: 52-10.1186/1752-0509-2-52.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Hwang D, Lee IY, Yoo H, et al: A systems approach to prion disease. Mol Syst Biol. 2009, 5: 252.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Ragusa M, Avola G, Angelica R, et al: Expression profile and specific network features of the apoptotic machinery explain relapse of acute myeloid leukemia after chemotherapy. BMC Cancer. 2010, 10: 377-10.1186/1471-2407-10-377.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Azuaje F, Devaux Y, Vausort M, Yvorra C, Wagner DR: Transcriptional networks characterize ventricular dysfunction after myocardial infarction: a proof-of-concept investigation. J Biomed Inform. 2010, 43: 812-9. 10.1016/j.jbi.2010.05.012.

    Article  PubMed  Google Scholar 

  16. Azuaje F, Devaux Y, Wagner DR: Coordinated modular functionality and prognostic potential of a heart failure biomarker-driven interaction network. BMC Syst Biol. 2010, 4: 60-10.1186/1752-0509-4-60.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Azuaje F, Devaux Y, Wagner DR: Integrative pathway-centric modeling of ventricular dysfunction after myocardial infarction. PLoS One. 2010, 5: e9661-10.1371/journal.pone.0009661.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Devaux Y, Azuaje F, Vausort M, Yvorra C, Wagner DR: Integrated protein network and microarray analysis to identify potential biomarkers after myocardial infarction. Funct Integr Genomics. 2010, 10: 329-37. 10.1007/s10142-010-0169-0.

    Article  CAS  PubMed  Google Scholar 

  19. Nepomuceno-Chamorro I, Azuaje F, Devaux Y, et al: Prognostic transcriptional association networks: a new supervised approach based on regression trees. Bioinformatics. 2011, 27: 252-8. 10.1093/bioinformatics/btq645.

    Article  CAS  PubMed  Google Scholar 

  20. Aerts S, Lambrechts D, Maity S, et al: Gene prioritization through genomic data fusion. Nat Biotechnol. 2006, 24: 537-44. 10.1038/nbt1203.

    Article  CAS  PubMed  Google Scholar 

  21. Tranchevent LC, Barriot R, Yu S, et al: ENDEAVOUR update: a web resource for gene prioritization in multiple species. Nucleic Acids Res. 2008, W377-84. 36 Web Server

  22. Salwinski L, Miller CS, Smith AJ, Pettit FK, Bowie JU, Eisenberg D: The Database of Interacting Proteins: 2004 update. Nucleic Acids Res. 2004, D449-51. 32 Database

  23. Aranda B, Achuthan P, Alam-Faruque Y, et al: The IntAct molecular interaction database in 2010. Nucleic Acids Res. 2010, D525-31. 38 Database

  24. Ceol A, Chatr Aryamontri A, Licata L, et al: MINT, the molecular interaction database: 2009 update. Nucleic Acids Res. 2010, D532-9. 38 Database

  25. Salwinski L, Licata L, Winter A, et al: Recurated protein interaction datasets. Nat Methods. 2009, 6: 860-1. 10.1038/nmeth1209-860.

    Article  CAS  PubMed  Google Scholar 

  26. Cline MS, Smoot M, Cerami E, et al: Integration of biological networks and gene expression data using Cytoscape. Nat Protoc. 2007, 2: 2366-82. 10.1038/nprot.2007.324.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Goncalves JP, Graos M, Valente AX: POLAR MAPPER: a computational tool for integrated visualization of protein interaction networks and mRNA expression data. J R Soc Interface. 2009, 6: 881-96.

    Article  CAS  PubMed  Google Scholar 

  28. Florez AF, Park D, Bhak J, et al: Protein network prediction and topological analysis in Leishmania major as a tool for drug target selection. BMC Bioinformatics. 2010, 11: 484-10.1186/1471-2105-11-484.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Ozgur A, Vu T, Erkan G, Radev DR: Identifying gene-disease associations using centrality on a literature mined gene-interaction network. Bioinformatics. 2008, 24: i277-85. 10.1093/bioinformatics/btn182.

    Article  PubMed  PubMed Central  Google Scholar 

  30. Yu H, Kim PM, Sprecher E, Trifonov V, Gerstein M: The importance of bottlenecks in protein networks: correlation with gene essentiality and expression dynamics. PLoS Comput Biol. 2007, 3: e59-10.1371/journal.pcbi.0030059.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Shi Z, Derow CK, Zhang B: Co-expression module analysis reveals biological processes, genomic gain, and regulatory mechanisms associated with breast cancer progression. BMC Syst Biol. 2010, 4: 74-10.1186/1752-0509-4-74.

    Article  PubMed  PubMed Central  Google Scholar 

  32. Griffiths-Jones S, Grocock RJ, van Dongen S, Bateman A, Enright AJ: miRBase: microRNA sequences, targets and gene nomenclature. Nucleic Acids Res. 2006, D140-4. 34 Database

  33. Al-Shahrour F, Minguez P, Tarraga J, et al: BABELOMICS: a systems biology perspective in the functional annotation of genome-scale experiments. Nucleic Acids Res. 2006, W472-6. 34 Web Server

  34. Satatistica. Accessed: 21 April 2011, [http://www.statsoft.com/]

  35. GEO: Gene Expression Omnibus database. [Accessed: 21 April 2011], [http://www.ncbi.nlm.nih.gov/geo/]

  36. Azuaje F, Zheng H, Camargo A, Wang H: Systems-based biological concordance and predictive reproducibility of gene set discovery methods in cardiovascular disease. J Biomed Inform.

  37. Frank E, Hall M, Trigg L, Holmes G, Witten IH: Data mining in bioinformatics using Weka. Bioinformatics. 2004, 20: 2479-81. 10.1093/bioinformatics/bth261.

    Article  CAS  PubMed  Google Scholar 

  38. Png KJ, Yoshida M, Zhang XH, et al: MicroRNA-335 inhibits tumor reinitiation and is silenced through genetic and epigenetic mechanisms in human breast cancer. Genes Dev. 2011, 25: 226-31. 10.1101/gad.1974211.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Chen SM, Zhang HX, Li YG, Wang DM, Zhang GH, Tan CJ: Expression of ubiquitin in peripheral inflammatory cells from patients with coronary artery disease. J Int Med Res. 2008, 36: 1227-34.

    Article  CAS  PubMed  Google Scholar 

Pre-publication history

Download references

Acknowledgements and funding

We thank Malou Gloesener, Loredana Jacobs, Céline Jeanty and Mélanie Vausort for technical assistance. This work was supported by grants from the National Funds of Research, the Society for Research on Cardiovascular Diseases and the Ministry of Culture, Higher Education and Research of Luxembourg.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Francisco J Azuaje.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

FA and DW conceived the study. FA designed the discovery strategy and implemented computational analyses. LZ implemented computational analyses. SR performed expression measurement experiments with multiplex PCR. DW and YD provided biomedical feedback. FA wrote the manuscript assisted by the other authors. All the authors read and approved the manuscript.

Electronic supplementary material

Authors’ original submitted files for images

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Azuaje, F.J., Rodius, S., Zhang, L. et al. Information encoded in a network of inflammation proteins predicts clinical outcome after myocardial infarction. BMC Med Genomics 4, 59 (2011). https://doi.org/10.1186/1755-8794-4-59

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/1755-8794-4-59

Keywords