Predicting phenotypes of asthma and eczema with machine learning
- Mattia CF Prosperi1, 2Email author,
- Susana Marinho2,
- Angela Simpson2,
- Adnan Custovic†2 and
- Iain E Buchan†1
© Prosperi et al.; licensee BioMed Central Ltd. 2014
Published: 8 May 2014
There is increasing recognition that asthma and eczema are heterogeneous diseases. We investigated the predictive ability of a spectrum of machine learning methods to disambiguate clinical sub-groups of asthma, wheeze and eczema, using a large heterogeneous set of attributes in an unselected population. The aim was to identify to what extent such heterogeneous information can be combined to reveal specific clinical manifestations.
The study population comprised a cross-sectional sample of adults, and included representatives of the general population enriched by subjects with asthma. Linear and non-linear machine learning methods, from logistic regression to random forests, were fit on a large attribute set including demographic, clinical and laboratory features, genetic profiles and environmental exposures. Outcome of interest were asthma, wheeze and eczema encoded by different operational definitions. Model validation was performed via bootstrapping.
The study population included 554 adults, 42% male, 38% previous or current smokers. Proportion of asthma, wheeze, and eczema diagnoses was 16.7%, 12.3%, and 21.7%, respectively. Models were fit on 223 non-genetic variables plus 215 single nucleotide polymorphisms. In general, non-linear models achieved higher sensitivity and specificity than other methods, especially for asthma and wheeze, less for eczema, with areas under receiver operating characteristic curve of 84%, 76% and 64%, respectively. Our findings confirm that allergen sensitisation and lung function characterise asthma better in combination than separately. The predictive ability of genetic markers alone is limited. For eczema, new predictors such as bio-impedance were discovered.
More usefully-complex modelling is the key to a better understanding of disease mechanisms and personalised healthcare: further advances are likely with the incorporation of more factors/attributes and longitudinal measures.
Asthma is the most common chronic disease in developed countries, however, the drug armamentarium available to manage the condition is modest. There is increasing recognition that asthma is a heterogeneous disease with multiple endotypes, which may have similar clinical manifestations, or phenotypes, but different underlying pathophysiological causes[2, 3]. Appropriate identification of such endotypes is critical for the understanding of the disease mechanism and the development of personalised approaches to its management. Sensitisation to allergens from several sources (such as pets, dust mites, cockroaches, and pollens) has been independently associated with asthma and asthma-related symptoms[5–8], and among asthmatic patients with the severity of the disease[9–12]. Also, sensitisation to inhalant allergens has been found to be associated with diminished lung function and increased airway responsiveness.
It remains unclear to what extent allergen sensitisation and lung function markers (e.g., airway reactivity, airway inflammation), in conjunction with a broader set of other potentially relevant information (e.g. environmental exposures or genetic characteristics), contribute towards specific clinical manifestations of different atopic diseases (e.g. asthma vs. eczema). In the past decades, several approaches to predict such current or subsequent clinical manifestations, both in children and adults, have been introduced[14–22]. The performance of prediction models varies in relation to different population strata, and obviously in relation to the clinical outcome or end-point definitions. For instance, one of the earliest works, by Castro-Rodríguez et al., devised a rule-based asthma predictive index to predict subsequent asthma amongst young children with a history of wheezing, attaining sensitivity ~0.4 at ~0.8 specificity on various time points. The recent work by Chatzimichail et al. reported ~0.95 of both sensitivity and specificity in predicting current asthma in symptomatic preschool children, using a machine learning approach based on previous symptoms, medications, allergen sensitisation and lung function.
In this work, using a rich data set from an unselected cross-sectional population study, different operational definitions of current asthma, wheeze and eczema are carefully derived, and we analyse their prognostic factors from a large set of markers, which includes demographic, clinical, laboratory features, genetic profiles and environmental exposures. Of note, previous diagnoses (along with anti-asthma medication usage) are removed on purpose from the input set, as many clinical outcome definitions are recursively based on them. The aim is to identify to which extent such heterogeneous information contributes and combines towards the prediction of a specific clinical presentation - comparing linear and non-linear machine learning models fitted with different feature combinations - and eventually prepare the grounds for the deployment of a personalized diagnostic tool.
The study population comprised a cross-sectional sample of adult individuals, age ≥18 years, including representatives of the general population enriched by subjects with asthma[13, 24]. For the sample from the general population, we approached parents of children who have been under active follow-up in the Manchester Asthma and Allergy Study (population-based birth cohort study). The population of subjects with asthma included well-phenotyped adults who were identified from a clinical trials database, and had both a history of physician-diagnosed asthma and asthma symptoms within the previous 12 months[26, 27]. The study was approved by the Local Research Ethics Committee (05/Q1406/70) and is registered as N0226171141. Written informed consent was obtained from all subjects.
A total of 1,102 attributes of the study participants were collected across a large heterogeneous information spectrum, including interviewer-administered questionnaires, laboratory measurements, doctors' diagnoses and environmental exposures. The collected data included:
demographic information (e.g. gender, ethnicity, age, place of residence);
questionnaire data related to symptom presence and severity (e.g. wheeze, shortness of breath, chronic cough), previous/current diagnoses of asthma, hay fever, eczema, food allergies or other illnesses;
use of anti-asthma medications (e.g. short-acting beta agonists [SABA], long-acting beta agonists [LABA], inhaled corticosteroids [ICS]);
questionnaire data on smoking and alcohol drinking habits, current pet ownership, indoor environmental conditions (e.g. rugs, beds, type of house heating, latex usage), occupation and occupation-related accidents;
objective measures on environmental exposure to house dust mite (Der p 1), cat (Fel d 1) and dog (Can f 1) allergen determined in dust samples collected from homes using enzyme-linked immunosorbent assays (ELISAs);
objective measures on environmental exposure to endotoxin (marker of exposure to gram-negative bacteria) and beta glucan (marker of exposure to moulds) determined in dust samples collected from homes;
body measurements (e.g. height, weight, body mass index [BMI], fat percentage, whole body impedance);
lung function measurements (e.g. forced expiratory volume in 1 second [FEV1], forced vital capacity [FVC], peak expiratory flow [PEF], functional reserve capacity [FRC] and residual volume [RV], total lung capacity [TLC], forced expiratory flow 25-75% [FEF25-75] and specific airway resistance [sRaw]);
measurement of airway inflammation (exhaled nitric oxide [eNO])
measurement of airway hyper-responsiveness using methacholine challenge, expressed as a provocative concentration of methacholine needed to produce a 20% fall in FEV1 (PC20), and methacholine dose-response slope (MDRS);
assessment of atopic status using (i) skin prick tests (SPT), (ii) measurement of serum allergen-specific Immunoglobulin E values (IgE), and (iii) component resolved diagnostics using an immuno-dot blot as previously described.
In addition, as part of a candidate gene association study, subjects were genotyped for 215 single nucleotide polymorphisms (SNPs) in genes found to be associated with asthma in previous studies (including polymorphisms in chromosomal regions 20p13-p12 and 17q12-21).
We used the following (partly overlapping) definitions of asthma, wheeze and eczema.
current asthma (CA), based on De Marco et al., defined as asthma ever confirmed by a doctor and at least one symptom of wheeze, nocturnal chest tightness, asthma attack within the past 12 months, attacks of breathlessness following activity, at rest or at night-time, having taken anti-asthma medication;
level-2 ECRHS II definition (A2), as two positive answers to the questions "have you been woken by an attack of shortness of breath at any time in the last 12 months", "have you had an attack of asthma in the last 12 months", "are you currently taking any medicines including inhalers, aerosol or tablets for asthma";
level-3 ECRHS II definition (A3), as three positive answers out of the set described at the previous point.
Current wheeze (CW) was defined, according to Pekkanen et al., as the presence of wheeze/breathlessness in the previous 12 months outside colds.
Eczema was defined as self-diagnosed (SDE) or doctor-confirmed (DDE) eczema.
Out of the 1,102 original non-genetic attributes, 223 were selected by clinical researchers, excluding factors considered as irrelevant or completely redundant, and those that were defining features of diagnoses. Attributes were grouped into: demographic/environmental variables (n = 74, including age, gender, BMI, whole body impedance, housing conditions, pet ownership, plus n = 56 variables measuring environmental exposures to endotoxin, beta glucan and indoor allergens); lung function, airway inflammation and airway hyper-responsiveness markers (n = 12, including eNO, % predicted FEV1, FVC, FEV1/FVC, FEF25-75, sRaw, PEF, TLC, RV, methacholine challenge MDRS and PC20); allergen sensitization assessed either by skin prick testing, specific serum IgE measurement or component resolved diagnostics (n = 8, n = 7, n = 66, respectively), recording mean wheal diameters (MWD) and IgE levels, which were either log-transformed or discretized into ordered quartile categories (where a negative or below limit of detection result was the zero-order category). All 215 SNPs were retained and merged to the data set. Before data merge, raw SNP data were processed through linkage-disequilibrium filtering/imputation using Haploview and the method of Gabriel et al. (as described in the previous work by Marinho et al.). Other missing values were replaced by column-wise median and modes depending on the data types.
For descriptive statistics and comparison with other prediction methods, information about previous diagnoses and medication usage (ICS, SABA, LABA) was retained but not used as input for the main models.
Main-effects logistic regression (LR) models were fitted selecting features by means of the LogitBoost algorithm. For comparison purposes, a LR model made by the best single predictor according to the Akaike information criterion (named one rule, OR) was considered. A decision tree model (DT) and a decision tree ensemble, the random forest (RF, 250 trees) were also evaluated, along with the AdaBoost (AB) classifier. Goodness-of-fit functions examined were: accuracy, i.e. percentage of correctly classified cases; area under the receiver operating characteristic curve (AUROC), which is equal to the probability that a classifier ranks a randomly chosen positive instance (e.g. condition present/diagnosed) higher than a randomly chosen negative one (e.g. condition absent); sensitivity, i.e. the probability that the classification is positive when the condition is present (true positive rate); specificity, i.e. the probability that the classification is negative when the condition is not present (true negative rate). Model performance was estimated and compared as extra-sample via bootstrapping (100 replicates), considering out-of-bag distributions, and assessing significance via t-tests adjusted for sample overlap and multiple comparisons[39–41]. Attribute importance was assessed by means of RF, calculating the average re-scaled (i.e. divided by its standard error) decrease in accuracy by variable randomization (repeated for 1000 times), and comparing it against a null distribution obtained by shuffling outcome labels, calculating p-values according to the method of Altmann et al. and previous works[43, 44]. All analyses were carried out using R software (http://www.r-project.org/).
Characteristics of the study population
year of birth
body mass index (BMI)
whole body impedance
exhaled nitric oxide (eNO), ppb (loge scale)
specific airway resistance (sRaw), kPa/s (loge scale)
peak expiratory flow (PEF) % predicted
forced vital capacity (FVC) % predicted
forced expiratory volume in 1 second (FEV^ % predicted
forced expiratory flow (FEF25-75) % predicted
total lung capacity (TLC)
residual volume (RV)
provocative concentration of methacholine needed to produce a 20% fall in FEVj (PC20), of those completing the test
methacholine dose-response slope (MDRS), transformed as 100/(MdRS+10)
allergen sensitisation by skin prick test (SPT)
dust mite (mean wheal diameter >3 mm)
cat (mean wheal diameter >3 mm)
dog (mean wheal diameter >3 mm)
tree (mean wheal diameter >3 mm)
grass (mean wheal diameter >3 mm)
mould (mean wheal diameter >3 mm)
peanut (mean wheal diameter >3 mm)
medications in the past three months
short-acting beta agonists (SABA)
inhaled corticosteroids (ICS) or ICS/long-acting beta agonists (LABA)
illness or problem caused by eating a particular food or foods, ever
accident at home, work or elsewhere exposing to high levels of vapours, gas or dust
carpets in the house
gas stove in the house
electric stove in the house
job causing wheezing problems
proportion of subjects not completing PC20
proportion of subjects with current asthma (CA)
proportion of subjects with level-2 asthma (A2)
proportion of subjects with level-3 asthma (A3)
proportion of subjects with current wheeze (CW)
proportion of subjects with self-diagnosed eczema (SDE)
proportion of subjects with doctor's diagnosed eczema (DDE)
cross-tabulation of clinical outcomes (% of agreement)
Comparison of machine learning methods.
sensitivity (at 90% specificity)
sensitivity (at 80% specificity)
Doctor's Diagnosed Eczema
Comparison of random forest performance using selected input domains.
sensitivity (at 90% specificity)
sensitivity (at 80% specificity)
Doctor's Diagnosed Eczema
To compare more thoroughly RF with LR, we analysed the variable sets selected by the LogitBoost algorithm. Specifically, for CW, five predictors were selected: IgE of house dust mite (OR = 1.207 per loge higher, p = 0.005); IgE of dog (OR = 1.465 per loge higher, p < 0.0001); number of cigarettes smoked (OR = 1.032 per packages/year, p = 0.003); moving house (OR = 3.078 for moving twice or more as compared to not moving, p = 0.001); MDRS (OR = 0.794 per transformed unit p = 0.0004). For CA, nine predictors were selected: IgE of house dust mite (OR = 1.308 per loge higher, p < 0.0001); IgE of dog (OR = 1.519 per loge higher, p < 0.0001); job causing wheezing problems (OR = 13.923 for presence of condition; p < 0.0001); rs8079416 (OR = 0.502 as additive model; p = 0.002); rs11540720 (OR = 0.182 as additive model; p = 0.008); rs5743704 (OR = 0.265 as additive model; p = 0.011); rs11536889 (OR = 0.265 as additive model; p = 0.011); sRaw (OR = 6.509 per loge higher; p = 0.0009); MDRS (OR = 0.839 per transformed unit p = 0.013). For DDE, one predictor was selected, the IgE of cat (OR = 1.378 per loge higher, p < 0.0001). All features selected by LogitBoost were listed as top-ranked variables by the RF, except for SNPs in the CA outcome. Note that these LR models were obtained from one data set using a single LogitBoost selection, and - given also the degree of correlation among variables - alternative models with equal performance may be selected by varying selection heuristics.
We investigated the ability of linear and non-linear machine learning models to predict asthma, wheezing, and eczema outcomes, according to different operational definitions, with a heterogeneous set of attributes in an adult population. Models were compared in terms of performance, complexity and interpretability. Different feature groups were evaluated and combined in order to understand determinants (and combinations thereof) of asthma symptoms or the presence of eczema. The use of random forests in model building yielded better AUROC, sensitivity and specificity than other methods. This might be due to the ability of random forests to model non-linear functions and account for variable interactions, although the corrected t-test on AUROC did not show a statistically significant difference. Furthermore, the difference between random forests and logistic regression was minimal in predicting asthma phenotypes. In terms of statistical power, however, a larger sample of subjects may reveal undetected differences in the AUROC comparisons of the linear and non-linear methods.
There was a higher prevalence of eczema compared to asthma and wheeze in the population; AUROCs were higher when considering current asthma and current wheeze outcomes (0.84 and 0.76), lower for doctor's diagnosed eczema (0.64). Results show clearly that there is a benefit of merging information from different sources, e.g. lung functions, allergen sensitization tests, genetic markers, demographics and environment. However, in general all models were characterized by a relatively low sensitivity with any feature set combination. A lower sensitivity was obtained compared to that of Chatzimichail et al., who used a similar outcome definition: this is because we explicitly excluded any previous personal and familiar diagnosis of asthma, wheeze and eczema from the input set, given the fact that outcomes are often defined recursively on previous episodes. In fact, when utilising previous diagnoses (plus anti-asthma medication usage), sensitivity increased to ≥0.8 (≥0.9 when including anti-asthma medication usage variables) at a minimum specificity of 0.9. However, direct comparison with other methods is only qualitative given the different study designs and populations.
Regarding the importance of features, our findings confirm the important contribution of allergen sensitization (dust mite, dog, cat), along with lung function markers, in predicting asthma diagnoses or symptom patterns. The predictive ability of genetic markers alone is limited, although for the current asthma outcome the LogitBoost algorithm selected a few over the whole set of variables. Our AUROCs for SNPs are in line with the previous estimates of Spycher et al., who analysed the genome-wide prediction of childhood asthma and related phenotypes in a longitudinal birth cohort (reporting AUROC of 0.59 for wheeze and of 0.54 for asthma). However, our analysis was not focused on genetic markers: a limited population sample, in terms of the set candidate SNPs as well as of environmental markers, can decrease the power to look for SNP-environment interactions effectively; therefore a more accurate study design is warranted for this objective.
We observed interesting novel and biologically plausible association between bio-impedance and eczema. Previous studies have found that whole body impedance is associated with steroid treatments and several types of cutaneous reactions, including an indirect association to Filaggrin-related eczema (via stratum corneum hydration). Further investigation of this association is warranted.
Limitation of our study include the use of an in-house, rather than externally validated assay for component resolved diagnostics (however, this metric was coupled with validated skin prick testing and blood Immunoglobulin E testing), and the facts that genetic analysis was restricted to candidate genes. Another potential limitation was the naïve policy for missing value imputation; however the extent of missing information was negligible.
Being a cross-sectional study, with no longitudinal separation of predictors and outcomes, this study is not intended to assess different approaches to causal inference. However, our data demonstrate that even with cross-sectional data, there is considerable scope to build more usefully complex models to better understand asthma and other complex diseases (such as eczema). Future studies might incorporate more factors/attributes and harness longitudinal data in the prediction of later clinical outcomes.
Publication for this article has been funded by grants from J P Moulton Charitable Foundation (sponsoring the MAAS cohort), Medical Research Council (MRC) grants G0601361, MR/K002449/1, University of Manchester's Library via the Research Councils UK (for open-access publications), and by the MRC Health eResearch Centre (HeRC) grant MR/K006665/1.
This article has been published as part of BMC Medical Genomics Volume 7 Supplement 1, 2014: Selected articles from the 3rd Translational Bioinformatics Conference (TBC/ISCB-Asia 2013). The full contents of the supplement are available online at http://www.biomedcentral.com/bmcmedgenomics/supplements/7/S1.
- Papierniak ES, Lowenthal DT, Harman E: Novel therapies in asthma: leukotriene antagonists, biologic agents, and beyond. Am J Ther. 2013, 20 (1): 79-103. 10.1097/MJT.0b013e31826915c2.PubMedView ArticleGoogle Scholar
- Bacharier LB, Guilbert TW: Diagnosis and management of early asthma in preschool-aged children. J Allergy Clin Immunol. 2012, 130 (2): 287-296. 10.1016/j.jaci.2012.04.025. quiz 297-288PubMedView ArticleGoogle Scholar
- Lotvall J, Akdis CA, Bacharier LB, Bjermer L, Casale TB, Custovic A, Lemanske RF, Wardlaw AJ, Wenzel SE, Greenberger PA: Asthma endotypes: a new approach to classification of disease entities within the asthma syndrome. J Allergy Clin Immunol. 2011, 127 (2): 355-360. 10.1016/j.jaci.2010.11.037.PubMedView ArticleGoogle Scholar
- Sittka A, Vera J, Lai X, Schmeck B: Asthma phenotyping, therapy, and prevention: what can we learn from systems biology?. Pediatr Res. 2013Google Scholar
- Taylor PE, Jacobson KW, House JM, Glovsky MM: Links between pollen, atopy and the asthma epidemic. International archives of allergy and immunology. 2007, 144 (2): 162-170. 10.1159/000103230.PubMedView ArticleGoogle Scholar
- Gent JF, Belanger K, Triche EW, Bracken MB, Beckett WS, Leaderer BP: Association of pediatric asthma severity with exposure to common household dust allergens. Environmental research. 2009, 109 (6): 768-774. 10.1016/j.envres.2009.04.010.PubMedPubMed CentralView ArticleGoogle Scholar
- Wang J, Calatroni A, Visness CM, Sampson HA: Correlation of specific IgE to shrimp with cockroach and dust mite exposure and sensitization in an inner-city population. J Allergy Clin Immunol. 2011, 128 (4): 834-837. 10.1016/j.jaci.2011.07.045.PubMedPubMed CentralView ArticleGoogle Scholar
- Sordillo JE, Webb T, Kwan D, Kamel J, Hoffman E, Milton DK, Gold DR: Allergen exposure modifies the relation of sensitization to fraction of exhaled nitric oxide levels in children at risk for allergy and asthma. J Allergy Clin Immunol. 2011, 127 (5): 1165-1172 e1165. 10.1016/j.jaci.2011.01.066.PubMedPubMed CentralView ArticleGoogle Scholar
- Burrows B, Martinez FD, Halonen M, Barbee RA, Cline MG: Association of asthma with serum IgE levels and skin-test reactivity to allergens. N Engl J Med. 1989, 320 (5): 271-277. 10.1056/NEJM198902023200502.PubMedView ArticleGoogle Scholar
- Beeh KM, Ksoll M, Buhl R: Elevation of total serum immunoglobulin E is associated with asthma in nonallergic individuals. The European respiratory journal : official journal of the European Society for Clinical Respiratory Physiology. 2000, 16 (4): 609-614. 10.1034/j.1399-3003.2000.16d07.x.View ArticleGoogle Scholar
- Simpson BM, Custovic A, Simpson A, Hallam CL, Walsh D, Marolia H, Campbell J, Woodcock A: NAC Manchester Asthma and Allergy Study (NACMAAS): risk factors for asthma and allergic disorders in adults. Clinical and experimental allergy : journal of the British Society for Allergy and Clinical Immunology. 2001, 31 (3): 391-399. 10.1046/j.1365-2222.2001.01050.x.View ArticleGoogle Scholar
- Marinho S, Simpson A, Soderstrom L, Woodcock A, Ahlstedt S, Custovic A: Quantification of atopy and the probability of rhinitis in preschool children: a population-based birth cohort study. Allergy. 2007, 62 (12): 1379-1386. 10.1111/j.1398-9995.2007.01502.x.PubMedView ArticleGoogle Scholar
- Marinho S, Simpson A, Marsden P, Smith JA, Custovic A: Quantification of atopy, lung function and airway hypersensitivity in adults. Clinical and translational allergy. 2011, 1 (1): 16-10.1186/2045-7022-1-16.PubMedPubMed CentralView ArticleGoogle Scholar
- Castro-Rodriguez JA, Holberg CJ, Wright AL, Martinez FD: A clinical index to define risk of asthma in young children with recurrent wheezing. Am J Respir Crit Care Med. 2000, 162 (4 Pt 1): 1403-1406.PubMedView ArticleGoogle Scholar
- Singer F, Luchsinger I, Inci D, Knauer N, Latzin P, Wildhaber JH, Moeller A: Exhaled nitric oxide in symptomatic children at preschool age predicts later asthma. Allergy. 2013, 68 (4): 531-538. 10.1111/all.12127.PubMedView ArticleGoogle Scholar
- Greenberg S: Asthma exacerbations: predisposing factors and prediction rules. Current opinion in allergy and clinical immunology. 2013Google Scholar
- Wadsworth SJ, Sandford AJ: Personalised medicine and asthma diagnostics/management. Current allergy and asthma reports. 2013, 13 (1): 118-129. 10.1007/s11882-012-0325-9.PubMedView ArticleGoogle Scholar
- Pralong JA, Seed MJ, Yasri R, Agius RM, Cartier A, Labrecque M: A computer based asthma hazard prediction model and new molecular weight agents in occupational asthma. Occupational and environmental medicine. 2013, 70 (1): 70-10.1136/oemed-2012-101189.PubMedView ArticleGoogle Scholar
- Soyiri IN, Reidpath DD: Semistructured black-box prediction: proposed approach for asthma admissions in London. International journal of general medicine. 2012, 5: 693-705.PubMedPubMed CentralView ArticleGoogle Scholar
- Spycher BD, Henderson J, Granell R, Evans DM, Smith GD, Timpson NJ, Sterne JA: Genome-wide prediction of childhood asthma and related phenotypes in a longitudinal birth cohort. J Allergy Clin Immunol. 2012, 130 (2): 503-509 e507. 10.1016/j.jaci.2012.06.002.PubMedView ArticleGoogle Scholar
- Savenije OE, Kerkhof M, Koppelman GH, Postma DS: Predicting who will have asthma at school age among preschool children. J Allergy Clin Immunol. 2012, 130 (2): 325-331. 10.1016/j.jaci.2012.05.007.PubMedView ArticleGoogle Scholar
- Vial Dupuy A, Amat F, Pereira B, Labbe A, Just J: A simple tool to identify infants at high risk of mild to severe childhood asthma: the persistent asthma predictive score. The Journal of asthma : official journal of the Association for the Care of Asthma. 2011, 48 (10): 1015-1021. 10.3109/02770903.2011.626481.View ArticleGoogle Scholar
- Chatzimichail E, Paraskakis E, Sitzimi M, Rigas A: An intelligent system approach for asthma prediction in symptomatic preschool children. Computational and mathematical methods in medicine. 2013, 2013: 240182-PubMedPubMed CentralView ArticleGoogle Scholar
- Marinho S, Custovic A, Marsden P, Smith JA, Simpson A: 17q12-21 variants are associated with asthma and interact with active smoking in an adult population from the United Kingdom. Annals of allergy, asthma & immunology : official publication of the American College of Allergy, Asthma, & Immunology. 2012, 108 (6): 402-411 e409. 10.1016/j.anai.2012.03.002.View ArticleGoogle Scholar
- Custovic A, Simpson BM, Murray CS, Lowe L, Woodcock A, Asthma NACM, Allergy Study G: The National Asthma Campaign Manchester Asthma and Allergy Study. Pediatric allergy and immunology : official publication of the European Society of Pediatric Allergy and Immunology. 2002, 13 (Suppl 15): 32-37.View ArticleGoogle Scholar
- Langley SJ, Goldthorpe S, Craven M, Morris J, Woodcock A, Custovic A: Exposure and sensitization to indoor allergens: association with lung function, bronchial reactivity, and exhaled nitric oxide measures in asthma. J Allergy Clin Immunol. 2003, 112 (2): 362-368. 10.1067/mai.2003.1654.PubMedView ArticleGoogle Scholar
- Langley SJ, Goldthorpe S, Custovic A, Woodcock A: Relationship among pulmonary function, bronchial reactivity, and exhaled nitric oxide in a large group of asthmatic patients. Annals of allergy, asthma & immunology : official publication of the American College of Allergy, Asthma, & Immunology. 2003, 91 (4): 398-404. 10.1016/S1081-1206(10)61688-2.View ArticleGoogle Scholar
- Kidon MI, Chiang WC, Liew WK, Ong TC, Tiong YS, Wong KN, Angus AC, Ong ST, Gao YF, Reginald K, et al: Mite component-specific IgE repertoire and phenotypes of allergic disease in childhood: the tropical perspective. Pediatric allergy and immunology : official publication of the European Society of Pediatric Allergy and Immunology. 2011, 22 (2): 202-210. 10.1111/j.1399-3038.2010.01094.x.View ArticleGoogle Scholar
- de Marco R, Marcon A, Jarvis D, Accordini S, Almar E, Bugiani M, Carolei A, Cazzoletti L, Corsico A, Gislason D, et al: Prognostic factors of asthma severity: a 9-year international prospective cohort study. J Allergy Clin Immunol. 2006, 117 (6): 1249-1256. 10.1016/j.jaci.2006.03.019.PubMedView ArticleGoogle Scholar
- Siroux V, Boudier A, Anto JM, Cazzoletti L, Accordini S, Alonso J, Cerveri I, Corsico A, Gulsvik A, Jarvis D, et al: Quality-of-life and asthma-severity in general population asthmatics: results of the ECRHS II study. Allergy. 2008, 63 (5): 547-554. 10.1111/j.1398-9995.2008.01638.x.PubMedView ArticleGoogle Scholar
- Pekkanen J, Sunyer J, Anto JM, Burney P, European Community Respiratory Health S: Operational definitions of asthma in studies on its aetiology. The European respiratory journal : official journal of the European Society for Clinical Respiratory Physiology. 2005, 26 (1): 28-35. 10.1183/09031936.05.00120104.View ArticleGoogle Scholar
- Barrett JC, Fry B, Maller J, Daly MJ: Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics. 2005, 21 (2): 263-265. 10.1093/bioinformatics/bth457.PubMedView ArticleGoogle Scholar
- Gabriel SB, Schaffner SF, Nguyen H, Moore JM, Roy J, Blumenstiel B, Higgins J, DeFelice M, Lochner A, Faggart M, et al: The structure of haplotype blocks in the human genome. Science. 2002, 296 (5576): 2225-2229. 10.1126/science.1069424.PubMedView ArticleGoogle Scholar
- Landwehr N, Hall M, Frank E: Logistic model trees. Mach Learn. 2005, 59 (1-2): 161-205. 10.1007/s10994-005-0466-3.View ArticleGoogle Scholar
- Venables WN, Ripley BD: Modern Applied Statistics with S. 2002, SpringerView ArticleGoogle Scholar
- Breiman L, Friedman J, Stone C, Olshen RA: Classification and Regression Trees. 1984, Chapman and Hall/CRCGoogle Scholar
- Breiman L: Random forests. Mach Learn. 2001, 45 (1): 5-32. 10.1023/A:1010933404324.View ArticleGoogle Scholar
- Freund Y, Schapire RE: A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci. 1997, 55 (1): 119-139. 10.1006/jcss.1997.1504.View ArticleGoogle Scholar
- Hastie T, Tibshirani R, Friedman JH: The elements of statistical learning : data mining, inference, and prediction. 2009, New York, NY: Springer, 2View ArticleGoogle Scholar
- Nadeau C, Bengio Y: Inference for the Generalization Error. Mach Learn. 2003, 52 (3): 239-281. 10.1023/A:1024068626366.View ArticleGoogle Scholar
- Garcia S, Herrera F: An Extension on "Statistical Comparisons of Classifiers over Multiple Data Sets" for all Pairwise Comparisons. J Mach Learn Res. 2008, 9: 2677-2694.Google Scholar
- Altmann A, Tolosi L, Sander O, Lengauer T: Permutation importance: a corrected feature importance measure. Bioinformatics. 2010, 26 (10): 1340-1347. 10.1093/bioinformatics/btq134.PubMedView ArticleGoogle Scholar
- Nicodemus KK, Malley JD, Strobl C, Ziegler A: The behaviour of random forest permutation-based variable importance measures under predictor correlation. BMC Bioinformatics. 2010, 11: 110-10.1186/1471-2105-11-110.PubMedPubMed CentralView ArticleGoogle Scholar
- Strobl C, Boulesteix AL, Zeileis A, Hothorn T: Bias in random forest variable importance measures: illustrations, sources and a solution. BMC Bioinformatics. 2007, 8: 25-10.1186/1471-2105-8-25.PubMedPubMed CentralView ArticleGoogle Scholar
- Heitmann BL, Anhoj J, Bisgaard AM, Ward L, Bisgaard H: Changes in body water distribution during treatment with inhaled steroid in pre-school children. Annals of human biology. 2004, 31 (3): 333-341. 10.1080/0301446042000208286.PubMedView ArticleGoogle Scholar
- Nyren M, Hagstromer L, Emtestam L: On assessment of skin reactivity using electrical impedance. Ann Ny Acad Sci. 1999, 873: 214-220. 10.1111/j.1749-6632.1999.tb09469.x.PubMedView ArticleGoogle Scholar
- Nemoto-Hasebe I, Akiyama M, Nomura T, Sandilands A, McLean WHI, Shimizu H: Clinical Severity Correlates with Impaired Barrier in Filaggrin-Related Eczema. J Invest Dermatol. 2009, 129 (3): 682-689. 10.1038/jid.2008.280.PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.