Tobacco use induces anti-apoptotic, proliferative patterns of gene expression in circulating leukocytes of Caucasian males
- Peter C Charles†1, 2,
- Brian D Alder†3,
- Eleanor G Hilliard1,
- Jonathan C Schisler1,
- Robert E Lineberger1,
- Joel S Parker4,
- Sabeen Mapara1,
- Samuel S Wu2,
- Andrea Portbury1,
- Cam Patterson†1, 2Email author and
- George A Stouffer†1, 2
© Charles et al; licensee BioMed Central Ltd. 2008
Received: 14 January 2008
Accepted: 18 August 2008
Published: 18 August 2008
Strong epidemiologic evidence correlates tobacco use with a variety of serious adverse health effects, but the biological mechanisms that produce these effects remain elusive.
We analyzed gene transcription data to identify expression spectra related to tobacco use in circulating leukocytes of 67 Caucasian male subjects. Levels of cotinine, a nicotine metabolite, were used as a surrogate marker for tobacco exposure. Significance Analysis of Microarray and Gene Set Analysis identified 109 genes in 16 gene sets whose transcription levels were differentially regulated by nicotine exposure. We subsequently analyzed this gene set by hyperclustering, a technique that allows the data to be clustered by both expression ratio and gene annotation (e.g. Gene Ontologies).
Our results demonstrate that tobacco use affects transcription of groups of genes that are involved in proliferation and apoptosis in circulating leukocytes. These transcriptional effects include a repertoire of transcriptional changes likely to increase the incidence of neoplasia through an altered expression of genes associated with transcription and signaling, interferon responses and repression of apoptotic pathways.
Gene expression profiling has become a powerful approach to the study of molecular pathophysiology and is a potentially useful diagnostic tool in multiple fields . Oncologists have applied gene expression profiling to predict breast cancer aggressiveness , and microarray-driven approaches have been used to analyze cardiovascular diseases such as hypertension, heart failure, cardiac rejection, and atherosclerosis [3–5]. Ideally, gene expression profiling is performed on the specific cell type and tissue of interest, i.e. the tumor, myocardium, or atheroma. However, sampling target tissues from humans is often problematic, and data derived from tissues not routinely available to clinicians limits the diagnostic utility of this approach.
For the study of biological processes that involve an inflammatory response, gene expression profiles can be obtained from circulating leukocytes . Due to the ease of sampling, gene expression profiling of circulating leukocytes has been applied to the study of cancer , atherosclerosis [8, 9], and systemic lupus erythematosus . These studies demonstrate the utility of transcriptional analysis of peripheral blood in the study of disease states having a systemic, inflammatory component.
Tobacco use, whether by smoking or chewing, is associated with the development of many diseases. People who smoke more than 20 cigarettes per day have a 3–6 fold increased incidence of myocardial infarction  and increased overall rates of cardiovascular mortality compared to those who have never smoked . The risk of developing lung cancer is 20-fold increased in cigarette smokers , and smokers are at increased risk of developing chronic obstructive pulmonary disease, multiple cancers (e.g. esophageal, bladder, and leukemia), pneumonia, osteoporosis, and periodontal disease . Despite these major adverse health effects, more than 20% of American adults identify themselves as active smokers .
The mechanistic link between tobacco smoking and related diseases remain incompletely understood. To date, there have been numerous reports analyzing the effect that exposure to cigarette smoke has on the gene expression profiles of various cell types [15–22]. However, despite this detailed analysis, very little consensus amongst findings has been reported, even when the same cell type has been studied . This lack of significant overlap in conclusions may be the result of the considerable heterogeneity in methodology as well as the relatively small (on average 5–10 test subjects) sample populations in each study. Furthermore, many of these reports rely on the in vitro exposure of cells to cigarette smoke condensate, raising the obvious issue of physiological relevance amongst these various studies.
Here we report a novel method for analyzing the in vivo effects of tobacco use on gene expression in circulating leukocytes. The purpose of this study is not to identify biomarkers associated with tobacco use; rather, our approach is aimed at identifying changes in genes and gene sets that result from tobacco use and applying this information to identify potential cellular pathways associated with tobacco-dependent pathology. Our results indicate that tobacco use affects pathways that control cell death, response to stress, macromolecular metabolism and the inflammatory cascade, providing new insights into the systemic effects of smoking that may underlie tobacco-related diseases.
Subjects between the ages of 18 and 50 years (inclusive) referred to UNC Hospitals were considered for enrollment in this University of North Carolina Institutional Review Board-approved study (IRB 04-MED-471). Exclusion criteria included current cancer treatment, pregnancy, lymphoma, leukemia, chronic immunosuppressive therapy, infection with HIV or HCV, history of solid organ transplant, and anemia (i.e. conditions which might alter peripheral blood counts or patterns of gene expression). After obtaining informed consent for a one-time blood donation, subjects were interviewed for pertinent medical information, including a detailed history of tobacco use, family history of heart disease and diabetes. Blood cell counts including a white blood cell differential analysis was performed to ensure consistency in cell subtype number between study populations.
Blood Withdrawal and Processing
Blood (30 ml) was drawn early in the day from subjects fasted for at least 8 hours to minimize the signals associated with nutritional and diurnal cycles from the microarray data. Processing was begun within 15 minutes of the time of blood draw. Eight ml were collected into a tube containing EDTA and proteinase inhibitors (Becton, Dickinson and Co., Cockeysville, MD) to provide a sample of plasma for cotinine assays. The balance of blood was collected into Na-EDTA Vacutainer tubes (Becton, Dickinson and Co., Cockeysville, MD). Whole blood was treated with 10 volumes of carbonate-buffered 150 mM NH4Cl to lyse red blood cells. The remaining leukocytes were washed and concentrated by centrifugation [23, 24]. RNA and DNA were recovered from leukocytes using a modified one-step acid guanidinium isothiocyanate-phenol-chloroform extraction (RNA-STAT60, Tel-Test, TX). RNA was subsequently post-purified using the RNeasy Mini-kit (Qiagen, Valencia, CA). RNA quantity, purity, and integrity were assessed by spectrophotometry and microcapillary electrophoresis on an Agilent BioAnalyzer 2100. Complete processing of samples occurred within 2 hours, exceeding the standards set by the Consortium for Expression Profiles in Sepsis . Plasma cotinine levels were determined by competitive ELISA using the Serum Cotinine Assay Kit (BioQuant; San Diego, CA) essentially as described by the manufacturer.
Gene Expression Profiling
We utilized a "sample × reference" experimental design strategy in which RNA from each subject was hybridized to the microarray slide in the presence of labeled human reference RNA (UHRR, Stratagene, La Jolla, CA) [26, 27]. Briefly, total RNA (500 ng) was used for gene expression profiling following reverse transcription and T-7 polymerase-mediated amplification/labeling with Cyanine-5 CTP. Labeled subject cRNA was co-hybridized to Agilent G4112A Whole Human Genome 44 K oligonucleotide arrays with equimolar amounts of Cyanine-3 labeled UHRR. Slides were hybridized and washed, then scanned on an Axon 4000b microarray scanner. The data were processed using GenePix Pro 6 software and entered into the UNC Microarray Database .
Quantitative Real Time Polymerase Chain Reaction (qRT-PCR) analysis
Three hundred nanograms of total RNA were reverse transcribed using the iScript Synthesis cDNA Kit (Biorad, Hercules, CA). Real-time PCR reactions were performed using either the Roche Universal Probe Library (Roche Diagnostics, Mannheim, Germany) or pre-validated Taqman® assays (Applied Biosystems, Framingham, MA). Primers and probes for the indicated human transcripts were designed using Probe Finder (version 2.41, Roche Diagnostics, Mannheim, Germany): CDKN1C (left primer GAGCGAGCTAGCCAGCAG, right primer GCGACAAGACGCTCCATC, probe #77); CX3CR1 (left primer CTCTGGCTTCTGGGTGGAG, right primer AGACCACGATGTCCCCAATA, probe #30); SASH1 (left primer CAGATCCGGGTGAAGCAG, right primer GAGTCCACCACTTGGAATCG, probe #38); RPS29 (left primer CCAAGAACTGCAAAGCCATC, right primer GGCATTGGTGACTCTGATGA, probe #26); and 18S (left primer GGAGAGGGAGCCTGAGAAAC, right primer TCGGGAGTGGGTAATTTGC, probe #40). PTGDR and HRASLS3 were measured using Taqman® assays Hs00235003_m1 and Hs00272992_m1, respectively. Real-time PCR reactions were performed using the ABI PRISM® 7900 sequence detection system, software, and reagents. Relative changes in gene expression were calculated using the delta Ct method using ribosomal 18S to normalize RNA input. Statistical significance was determined using the Student's t test. P values less than 0.05 were considered significant.
Microarray data were normalized via the loess local intensity normalization [7, 29], and probes were filtered for features having a normalized intensity of < 30 aFU in either channel. Probes were removed if < 70% of the data were present across all samples. Missing data points were imputed using the k nearest-neighbors algorithm (k = 10). 18,375 probes passed these filters, and were subsequently used for analysis. Scripts written in the R Statistical Language and Environment ("R"; Version 2.2.1, build r36812, release date 2005-12-20.) and Perl (ActiveState Perl 5.8.1, build 807, release date 2003-11-6) were used to standardize (μ = 0, σ = 1) each sample in the data set.
Statistical Analysis of Microarrays (SAM)
Lists of differentially expressed genes were identified using the statistical analysis of microarray algorithm [30–32] (SAM, Version 2.21, release date 2005-8-24; typical false discovery rate of approximately 10%). Unsupervised, semi-supervised, and supervised clustering analysis was performed on gene lists essentially as described  using Cluster, version 2.11. Heat maps of cluster analyses were visualized with JavaTreeView, version 1.0.12 [35, 36].
Gene Set Analysis (GSA)
GSA [37, 38] was performed using the Molecular Signatures Database (MSigDB)  to identify gene set activity associated with cotinine levels. Mapping to gene ontology categories (GO)  and identification of putative transcription factor binding sites was performed on gene lists using the GATHER web-based analysis environment [41–43] using the TRANSFAC V7.0 (public) database [44–47].
A median-centered gene list was used for cluster analysis to identify relationships between subject samples (arrays). The clustering file was then used as the basis for a new pre-clustering file to incorporate gene annotation data. Genes were assigned to GO and TRANSFAC categories using the GATHER web interface . Categories showing statistical enrichment (p value < 0.01) were identified, and each gene in the pre-clustering file was annotated as to its membership in the appropriate category. The TRANSFAC predictions of transcription factor binding sites were designated in the pre-clustering file by the value representing the median-centered mean fold change expressed as the Log2 of the ratio of each sample to the reference for each gene. This method of indicating membership was chosen to reflect a relationship between expression level (as measured by microarray) and presence or absence of transcription factor binding sites. Gene membership in GO categories was indicated by a binary value of either 1.00 or 0.00 as a placeholder for the expression ratio. Blue color was added after the fact to heat maps indicating Gene Ontology membership to avoid confusion with expression values. The annotated pre-clustering file was then clustered on only the Y axis (genes) to preserve relationships among arrays. This technique, which we have designated "Hyperclustering," allows both the gene expression data and various other forms of annotation to be represented as a single heat map, effectively illustrating functional relationships among genes.
Results and discussion
Subject Selection for Gene Expression Analysis
Selected demographics of study subjects.
Number of subjects
Mean Age ± SD
47 ± 9
46 ± 5
Diagnosis of Diabetes (Number (% of total))
CAD Family History
Automated Differential Blood Count
White Blood Cells (× 109 /L ± SD)
8.42 ± 2.67
9.00 ± 2.41
Neutrophils (× 109 /L ± SD)
5.67 ± 2.18
5.76 ± 1.94
Lymphocytes (× 109 /L ± SD)
1.90 ± 0.68
2.31 ± 0.74
Monocytes (× 109 /L ± SD)
0.42 ± 0.18
0.46 ± 0.21
Basophils (× 109 /L ± SD)
0.06 ± 0.04
0.06 ± 0.04
Eosinophils (× 109 /L ± SD)
0.22 ± 0.18
0.26 ± 0.14
Platelets (× 109/L ± SD)
252.42 ± 73.97
250.67 ± 56.06
Tobacco Use Determination
Using these criteria, 24 subjects were classified as tobacco users and 38 as non-tobacco users, with 5 subjects having cotinine levels that fell between 50 and 100 ng/mL. These 5 intermediate subjects were removed from further consideration. Comparing each subject's plasma cotinine values with their self-reported tobacco use status revealed overall consistent results (i.e. a high cotinine value for subjects who self-reported that they were active tobacco users). Nevertheless, there were notable exceptions. Seven subjects reported that they were non-tobacco users, yet had plasma cotinine levels > 100 ng/mL. Errors in this dimension could be explained by subject misrepresentation or failure of the subjects to disclose nicotine replacement therapy as part of a smoking cessation plan (use of nicotine patches or gum). Interestingly, 3 subjects identified themselves as active smokers, yet had very low plasma cotinine levels. Rapid metabolism of nicotine, smoking of a small number of cigarettes daily, or the use of extremely low-nicotine smoking products could all account for this discrepancy. This discrepancy in self-reported tobacco use and plasma cotinine levels did not appreciably alter the results of our studies (data not shown). All subjects were categorized based only on plasma cotinine levels only. The 2 subject groups will henceforth be referred to as "high cotinine" (i.e. tobacco users) and "low cotinine" (i.e. non-tobacco users). Using this criterion, those subjects reporting to be "smokers" but who had low plasma cotinine levels were included in the low cotinine group while subjects with high cotinine levels who denied smoking were included in the high cotinine group. To ensure that patient co-morbidities did not influence the gene expression profile, we performed principal components analysis (PCA) on the expression values of genes identified in this paper using the combined significant gene list and visualized in the context of COPD, diabetes, CAD class, and smoking status (Additional File 1). As expected, the top component of variation appears to be associated only with smoking status.
Transcriptional Signals of Tobacco Use
Differentially expressed genes identified by SAM analysis.
Down-regulated in High Cotinine Subjects
Agilent Probe ID
HRAS-like suppressor 3
Chemokine (C-X3-C motif) receptor 1
G protein-coupled receptor 56
Prostaglandin D2 synthase 21kDa (brain)
Bromodomain containing 1
Benzodiazapine receptor (peripheral) associated protein 1
Nuclear DNA-binding protein
CCCTC-binding factor (zinc finger protein)
DnaJ (Hsp40) homolog, subfamily B, member 6
Guanine nucleotide binding protein (G protein), gamma
Heparan sulfate 6-O-sulfotransferase 1
IKK interacting protein
Killer cell lectin-like receptor subfamily K, member 1
V-maf musculoaponeurotic fibrosarcoma oncogene homolog (avian)
Membrane-type 1 matrix metalloproteinase cytoplasmic tail binding protein-1
Oxysterol binding protein-like 5
Protein phosphatase 1, catalytic subunit, beta isoform
Protein phosphatase 1, regulatory (inhibitor) subunit 12B
Protein phosphatase 2 (formerly 2A), regulatory subunit B (PR 52), beta isoform
Solute carrier family 25 (carnitine/acylcarnitine translocase), member 20
Solute carrier family 9 (sodium/hydrogen exchanger), isoform 3 regulator 1
Tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein, theta polypeptide
Prostaglandin D2 receptor (DP)
Up-regulated in High Cotinine Subjects
Agilent Probe ID
SAM and SH3 domain containing 1
DNA polymerase-transactivated protein 6
Core 1 UDP-galactose:N-acetylgalactosamine-alpha-R beta 1,3-galactosyltransferase
Ral guanine nucleotide dissociation stimulator-like 1
C-terminal modulator protein
Hypothetical protein LOC283174
Visual inspection of the SAM-identified genes revealed that a number of differentially expressed genes are involved in the cell cycle control Gene Ontologies. CTCF was down regulated in the high cotinine group. Mutations in this gene have been associated with a variety of cancers . Furthermore, CTCF plays an important role in the regulation and differentiation of human myeloid leukemia cells, adding another possible underlying mechanism of leukemiagenesis in tobacco users . Conversely, we found that SASH1 (which is implicated in tumorogenesis of colorectal and breast cancer) was up regulated . Interestingly, CX3CR1 was significantly down regulated in the high cotinine group. As CX3CR1 is up-regulated in atherosclerotic lesions , we expected it to be up-regulated in circulating leukocytes of tobacco users due to the increased incidence and severity of CAD in this population (reviewed by Njolstad ). However, Barlic, et al., showed that macrophage up-regulation of CX3CR1 leads to retention of those cells in vessel walls . As the kinetics of the up-regulation of this gene are ill-defined, and it is not yet clear whether circulating monocytes differentially express CX3CR1 prior to tissue macrophage transformation, considerably more study will be necessary to elucidate what role it may play in the pathogenesis of smoking-related atherosclerotic disease.
Further analysis identified genes involved in apoptotic pathways. The pro-apoptotic genes C1D, MTCBP-1, CTCF, IKIP, MAF, and YWHAQ were all significantly down regulated in the high cotinine group. C1D (also known as SUNCOR) is representative of this group. C1D is a multi-functional nuclear protein with DNA-binding properties. When C1D is experimentally over-expressed it activates DNA-PK, inducing apoptosis . On the other hand, the c-terminal modulator protein (CTMP, also known as THEM4) was significantly over-expressed in the high cotinine population. CTMP protein stimulates the phosphorylation of AKT/PKB, increasing glucose uptake and blocking apoptosis . The relative mean fold change was modest for most of these genes (Table 2); nevertheless, in subjects with high plasma cotinine the overall expression pattern of these genes is anti-apoptotic compared to low cotinine subjects. The combination of increased cell cycle activity, resistance to apoptotic triggers, increased expression of oncogenes, and decreased expression of tumor suppressor genes in circulating leukocytes suggests a mechanism responsible for the low-level, systemic, increased risk of oncogenesis in patients who use tobacco products.
Summary of GSA.
Gene Set Pathway
Genes involved in signaling via Fas and DR3, 4, and 5.
Genes involved in metastasis of solid tumors.
Interferon responsive genes upregulated by DAC treatment.
Pattern Identification viathe Hyperclustering Technique
Hyperclustered TRANSFAC and GO Category Annotations
V$ISRE_01: interferon-stimulated response element
V$MAZR_01: MAZ related factor
V$E2F1DP1_01: E2F-1:DP-1 heterodimer
Common GO Parent Node
signal transduction  GO:0007165
small GTPase mediated signal transduction
intracellular signaling cascade
programmed cell death  GO:0012501
induction of apoptosis
induction of programmed cell death
positive regulation of programmed cell death
positive regulation of apoptosis
regulation of cellular process
programmed cell death
regulation of programmed cell death
regulation of apoptosis
response to stress  GO:0006950
response to stress
macromolecule metabolic process  GO:0043170
cellular macromolecule metabolism
regulation of metabolism
transcription  GO:0006350
regulation of transcription
regulation of nucleo-base, -side, -tide and nucleic acid metabolism
regulation of transcription, DNA-dependent
cell cycle process  GO:0022402
G1/S transition of mitotic cell cycle
mitotic spindle orientation
mevalonate transport  GO:0015728
The utility of the hyperclustering technique is readily apparent: a single image indicates the relationships among the genes, lending physiological relevance to a data set. A case in point is the 'Interferon' cluster, comprised of genes that are strongly up regulated in approximately half of the subjects with the highest cotinine levels. The genes in this cluster (IFI44, IFIT1, USP18, and HERC5. Figure 2) are interferon responsive genes, and are members of the gene class forming the early response to type-I interferons, indicative of a cellular response to viral agents or very specific forms of genotoxicity. Our findings are consistent with those of Grumelli, et al. who demonstrated that lymphocytes isolated from lung samples of patients with smoking-related lung damage showed an increase in expression of multiple interferon-inducible proteins . These results indicate that induction of interferon-dependent transcription pathways appear systemically in some tobacco users. Only half of the tobacco users have this expression pattern; the mechanisms of which are unknown, but worthy of future investigation. It is tempting to speculate that these patterns of systemic interferon-responsive induction identify a group of tobacco users who may develop early and severe disease. Longitudinal studies designed to track the patterns of gene expression over time in cohorts of tobacco users and non-users will be necessary to unambiguously determine the meaning of these observations.
Real time PCR verification of differentially expressed genes
In this study we demonstrated that groups of genes in circulating human leukocytes are affected by tobacco use in vivo. We identified genes and their relationships using a combination of testing individual genes (SAM), testing gene sets (GSA), and high throughput annotation (GATHER). Hyperclustering using Gene Ontologies and transcription factor binding sites associated with these genes illuminated the functional significance of the differentially regulated genes. The resulting gene expression spectra revealed novel and under-recognized molecular pathways in the pathophysiology of diseases commonly associated with tobacco use. Genomic signals in circulating leukocytes characteristic of cellular metabolism, transcription and signaling, apoptosis, response to stress, and the interferon response were all correlated with nicotine exposure. These results strongly suggest that tobacco use promotes a pro-carcinogenic environment, predisposing individuals to develop cancers in a variety of organ systems.
Interestingly, some genes that have previously been linked to smoking were not differentially expressed in our 2 subject groups [61–63]. For example, neither CYP1B1 (a cytochrome P450 enzyme playing an important role in chemical carcinogenesis) nor SOD2 (which destroys toxic radicals normally produced within cells) had an expression profile that differed significantly between high and low cotinine groups. Although several previous reports identified these genes as being affected by smoking, design and subject pool differences used in the present study could explain the absence of these genes from our profile. CYP1B1 is expressed to a greater degree in the females than in males and our data set is all male . SOD2 gene expression declines with age . The mean age of one of the studies reporting differential regulation of SOD2 was 27 years while the mean age of our study subjects is 46.5 years, which may explain why the SOD2 gene expression ratios between the groups in our study did not vary significantly.
A significant link has been established between smoking and the development of blood-borne cancers such as acute myelogenous leukemia (AML) and acute lymphocytic leukemia (ALL) [66, 67]. Exposure to compounds derived from tobacco use is typically highest in the oral and nasal cavities, the laryngotracheobronchial tree, and the urinary system, putting these tissues at the greatest risk of developing tumors . Nevertheless, given chronic exposure to carcinogens, blood tissues are likewise at an increased risk of carcinogenesis . Sandler, et al., observed a clear dose response to smoking, with heavy smokers at the highest risk of developing leukemia . The causative mechanism for this observed increase in leukemia among smokers is unknown. Our results identify highly relevant, differentially expressed genes that may serve as the basis for future experiments aimed at addressing the molecular etiology of AML and ALL in smokers. Moreover, these gene signals were detected in an easily obtainable sample of peripheral blood.
We found a correlation between tobacco use and increased expression of interferon-inducible genes in circulating leukocyte populations. Strong induction of interferon-responsive gene expression was seen in only a subset of tobacco-using subjects, arguing that interferon induction is not a direct effect of tobacco use. The mechanism of induction of these genes is not clear from our data. Previous studies have found a strong correlation between the parenchymal destruction associated with end-stage emphysema and the presence of interferon and interferon-inducible genes in the lung . Intriguingly, 5 of the 6 subjects (83%) with a diagnosis of COPD in this study demonstrated the high-interferon response phenotype. Our observation of elevated peripheral interferon response gene expression may reflect a systemic manifestation of a destructive pulmonary inflammatory response. These observations may provide evidence of a systemic immune basis for smoking-related lung parenchymal destruction. Alternatively, the expression of interferon-responsive genes in the periphery may be secondary to the upper and lower respiratory tract infections to which smokers are prone.
Hyperclustering revealed 5 distinct, physiologically relevant gene groups in peripheral leukocytes affected by tobacco use in vivo. Furthermore, these gene groups belong to pathways and regulatory systems important to the etiology of smoking-related diseases. These novel results enhance our understanding of how tobacco use affects patterns of gene expression in leukocytes, and provide a starting point for elucidating the molecular mechanisms of tobacco-related neoplasia, atherosclerosis, and immune dysfunction. The hyperclustering visualization facilitated interpretation of microarray data by fusing the expression data with functional annotation derived through robust statistical methodology (GSA and GATHER) prior to cluster analysis. This technique is a visual representation that combines gene expression data and any form of additional annotation. Gene expression profiling of readily obtainable peripheral blood samples identified genes that regulate response to stress, macromolecular metabolism, transcription and signaling, interferon response, and cell death and resistance to apoptosis. This profile may identify some novel targets for therapeutic intervention for both smoking-related diseases and, potentially, for smoking cessation.
This study was supported in part by an American Heart Association Scientist Development Grant (0635100N) to PCC, grants from the NIH (HL072347), CDC (H75/CCH424675 and H75/CCH424677), and UNC School of Medicine ("Investments in the Future" program) to CP, and a Doris Duke Charitable Foundation Fellowship to BDA. C.P. is an established investigator of the American Heart Association, and a Burroughs Wellcome Fund Clinician Scientist in Translational Research.
- Goldsmith ZG, Dhanasekaran N: The microrevolution: applications and impacts of microarray technology on molecular biology and medicine (review). International journal of molecular medicine. 2004, 13 (4): 483-495.PubMedGoogle Scholar
- Fan C, Oh DS, Wessels L, Weigelt B, Nuyten DS, Nobel AB, van't Veer LJ, Perou CM: Concordance among gene-expression-based predictors for breast cancer. The New England journal of medicine. 2006, 355 (6): 560-569. 10.1056/NEJMoa052933.View ArticlePubMedGoogle Scholar
- Seo D, Ginsburg GS, Goldschmidt-Clermont PJ: Gene expression analysis of cardiovascular diseases: novel insights into biology and clinical applications. Journal of the American College of Cardiology. 2006, 48 (2): 227-235. 10.1016/j.jacc.2006.02.070.View ArticlePubMedGoogle Scholar
- Napoli C, Lerman LO, Sica V, Lerman A, Tajana G, de Nigris F: Microarray analysis: a novel research tool for cardiovascular scientists and physicians. Heart (British Cardiac Society). 2003, 89 (6): 597-604.View ArticleGoogle Scholar
- Tuomisto TT, Binder BR, Yla-Herttuala S: Genetics, genomics and proteomics in atherosclerosis research. Annals of medicine. 2005, 37 (5): 323-332. 10.1080/07853890510011949.View ArticlePubMedGoogle Scholar
- Liew CC, Ma J, Tang HC, Zheng R, Dempsey AA: The peripheral blood transcriptome dynamically reflects system wide biology: a potential diagnostic tool. The Journal of laboratory and clinical medicine. 2006, 147 (3): 126-132. 10.1016/j.lab.2005.10.005.View ArticlePubMedGoogle Scholar
- Burczynski ME, Twine NC, Dukart G, Marshall B, Hidalgo M, Stadler WM, Logan T, Dutcher J, Hudes G, Trepicchio WL, Strahs A, Immermann F, Slonim DK, Dorner AJ: Transcriptional profiles in peripheral blood mononuclear cells prognostic of clinical outcomes in patients with advanced renal cell carcinoma. Clin Cancer Res. 2005, 11 (3): 1181-1189.PubMedGoogle Scholar
- Alberg AJ, Samet JM: Epidemiology of lung cancer. Chest. 2003, 123 (1 Suppl): 21S-49S. 10.1378/chest.123.1_suppl.21S.View ArticlePubMedGoogle Scholar
- Patino WD, Mian OY, Kang JG, Matoba S, Bartlett LD, Holbrook B, Trout HH, Kozloff L, Hwang PM: Circulating transcriptome reveals markers of atherosclerosis. Proceedings of the National Academy of Sciences of the United States of America. 2005, 102 (9): 3423-3428. 10.1073/pnas.0408032102.View ArticlePubMedPubMed CentralGoogle Scholar
- Bennett L, Palucka AK, Arce E, Cantrell V, Borvak J, Banchereau J, Pascual V: Interferon and granulopoiesis signatures in systemic lupus erythematosus blood. The Journal of experimental medicine. 2003, 197 (6): 711-723. 10.1084/jem.20021553.View ArticlePubMedPubMed CentralGoogle Scholar
- Njolstad I, Arnesen E, Lund-Larsen PG: Smoking, serum lipids, blood pressure, and sex differences in myocardial infarction. A 12-year follow-up of the Finnmark Study. Circulation. 1996, 93 (3): 450-456.View ArticlePubMedGoogle Scholar
- Qiao Q, Tervahauta M, Nissinen A, Tuomilehto J: Mortality from all causes and from coronary heart disease related to smoking and changes in smoking during a 35-year follow-up of middle-aged Finnish men. European heart journal. 2000, 21 (19): 1621-1626. 10.1053/euhj.2000.2151.View ArticlePubMedGoogle Scholar
- Edwards R: The problem of tobacco smoking. BMJ (Clinical research ed). 2004, 328 (7433): 217-219. 10.1136/bmj.328.7433.217.View ArticleGoogle Scholar
- Tobacco use among adults – United States, 2005. Mmwr. 2006, 55 (42): 1145-1148.
- Buttner P, Mosig S, Funke H: Gene expression profiles of T lymphocytes are sensitive to the influence of heavy smoking: A pilot study. Immunogenetics. 2007, 59 (1): 37-43. 10.1007/s00251-006-0177-3.View ArticlePubMedGoogle Scholar
- van Leeuwen DM, Gottschalk RW, van Herwijnen MH, Moonen EJ, Kleinjans JC, van Delft JH: Differential gene expression in human peripheral blood mononuclear cells induced by cigarette smoke and its constituents. Toxicol Sci. 2005, 86 (1): 200-210. 10.1093/toxsci/kfi168.View ArticlePubMedGoogle Scholar
- Ryder MI, Hyun W, Loomer P, Haqq C: Alteration of gene expression profiles of peripheral mononuclear blood cells by tobacco smoke: implications for periodontal diseases. Oral microbiology and immunology. 2004, 19 (1): 39-49. 10.1046/j.0902-0055.2003.00110.x.View ArticlePubMedGoogle Scholar
- Lampe JW, Stepaniants SB, Mao M, Radich JP, Dai H, Linsley PS, Friend SH, Potter JD: Signatures of environmental exposures using peripheral leukocyte gene expression: tobacco smoke. Cancer Epidemiol Biomarkers Prev. 2004, 13 (3): 445-453.PubMedGoogle Scholar
- Harvey BG, Heguy A, Leopold PL, Carolan BJ, Ferris B, Crystal RG: Modification of gene expression of the small airway epithelium in response to cigarette smoking. Journal of molecular medicine (Berlin, Germany). 2007, 85 (1): 39-53.View ArticleGoogle Scholar
- Heguy A, O'Connor TP, Luettich K, Worgall S, Cieciuch A, Harvey BG, Hackett NR, Crystal RG: Gene expression profiling of human alveolar macrophages of phenotypically normal smokers and nonsmokers reveals a previously unrecognized subset of genes modulated by cigarette smoking. Journal of molecular medicine (Berlin, Germany). 2006, 84 (4): 318-328.View ArticleGoogle Scholar
- Lodovici M, Luceri C, De Filippo C, Romualdi C, Bambi F, Dolara P: Smokers and passive smokers gene expression profiles: correlation with the DNA oxidation damage. Free radical biology & medicine. 2007, 43 (3): 415-422. 10.1016/j.freeradbiomed.2007.04.018.View ArticleGoogle Scholar
- Maunders H, Patwardhan S, Phillips J, Clack A, Richter A: Human bronchial epithelial cell transcriptome: gene expression changes following acute exposure to whole cigarette smoke in vitro. American journal of physiology. 2007, 292 (5): L1248-1256.PubMedGoogle Scholar
- Alcorta D, Preston G, Munger W, Sullivan P, Yang JJ, Waga I, Jennette JC, Falk R: Microarray studies of gene expression in circulating leukocytes in kidney diseases. Exp Nephrol. 2002, 10 (2): 139-149. 10.1159/000049909.View ArticlePubMedGoogle Scholar
- Feezor RJ, Baker HV, Mindrinos M, Hayden D, Tannahill CL, Brownstein BH, Fay A, MacMillan S, Laramie J, Xiao W, Moldawer LL, Cobb JP, Laudanski K, Miller-Graziano CL, Maier RV, Schoenfeld D, Davis RW, Tompkins RG, Inflammation and Host Response to Injury, Large-Scale Collaborative Research Program: Whole blood and leukocyte RNA isolation for gene expression analyses. Physiological genomics. 2004, 19 (3): 247-254. 10.1152/physiolgenomics.00020.2004.View ArticlePubMedGoogle Scholar
- Feezor RJ, Cheng A, Paddock HN, Baker HV, Moldawer LL: Functional genomics and gene expression profiling in sepsis: beyond class prediction. Clin Infect Dis. 2005, 41 (Suppl 7): S427-435. 10.1086/431993.View ArticlePubMedGoogle Scholar
- Novoradovskaya N, Whitfield ML, Basehore LS, Novoradovsky A, Pesich R, Usary J, Karaca M, Wong WK, Aprelikova O, Fero M, Perou CM, Botstein D, Braman J: Universal Reference RNA as a standard for microarray experiments. BMC Genomics. 2004, 5 (1): 20-10.1186/1471-2164-5-20.View ArticlePubMedPubMed CentralGoogle Scholar
- Cronin M, Ghosh K, Sistare F, Quackenbush J, Vilker V, O'Connell C: Universal RNA reference materials for gene expression. Clin Chem. 2004, 50 (8): 1464-1471. 10.1373/clinchem.2004.035675.View ArticlePubMedGoogle Scholar
- UMD: [http://genome.unc.edu]
- Smyth GK, Speed T: Normalization of cDNA microarray data. Methods. 2003, 31 (4): 265-273. 10.1016/S1046-2023(03)00155-5.View ArticlePubMedGoogle Scholar
- Tusher VG, Tibshirani R, Chu G: Significance analysis of microarrays applied to the ionizing radiation response. Proceedings of the National Academy of Sciences of the United States of America. 2001, 98 (9): 5116-5121. 10.1073/pnas.091062498.View ArticlePubMedPubMed CentralGoogle Scholar
- Storey JD, Tibshirani R: Statistical methods for identifying differentially expressed genes in DNA microarrays. Methods Mol Biol. 2003, 224: 149-157.PubMedGoogle Scholar
- Yu H, Gao L, Tu K, Guo Z: Broadly predicting specific gene functions with expression similarity and taxonomy similarity. Gene. 2005, 352: 75-81. 10.1016/j.gene.2005.03.033.View ArticlePubMedGoogle Scholar
- Eisen MB, Spellman PT, Brown PO, Botstein D: Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci USA. 1998, 95: 14863-14868. 10.1073/pnas.95.25.14863.View ArticlePubMedPubMed CentralGoogle Scholar
- Cluster. [http://rana.lbl.gov/EisenSoftware.htm]
- Saldanha AJ: Java Treeview – extensible visualization of microarray data. Bioinformatics (Oxford, England). 2004, 20 (17): 3246-3248. 10.1093/bioinformatics/bth349.View ArticleGoogle Scholar
- JavaTreeview. [http://sourceforge.net/projects/jtreeview/]
- Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proceedings of the National Academy of Sciences of the United States of America. 2005, 102 (43): 15545-15550. 10.1073/pnas.0506580102.View ArticlePubMedPubMed CentralGoogle Scholar
- Efron B, Tibshirani R: On Testing the Significance of Gene Sets. 2006, Stanford Biostatistics DepartmentGoogle Scholar
- Subramanian A, Kuehn H, Gould J, Tamayo P, Mesirov JP: GSEA-P: a desktop application for Gene Set Enrichment Analysis. Bioinformatics (Oxford, England). 2007, 23 (23): 3251-3253. 10.1093/bioinformatics/btm369.View ArticleGoogle Scholar
- The Gene Ontology. [http://www.geneontology.org/]
- Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000, 25 (1): 25-29. 10.1038/75556.View ArticlePubMedPubMed CentralGoogle Scholar
- Chang JT, Nevins JR: GATHER: a systems approach to interpreting genomic signatures. Bioinformatics (Oxford, England). 2006, 22 (23): 2926-2933. 10.1093/bioinformatics/btl483.View ArticleGoogle Scholar
- GATHER- Gene Annotation To Help Explain Relationships.
- Matys V, Fricke E, Geffers R, Gössling E, Haubrock M, Hehl R, Hornischer K, Karas D, Kel AE, Kel-Margoulis OV, Kloos DU, Land S, Lewicki-Potapov B, Michael H, Münch R, Reuter I, Rotert S, Saxel H, Scheer M, Thiele S, Wingender E: TRANSFAC: transcriptional regulation, from patterns to profiles. Nucleic acids research. 2003, 31 (1): 374-378. 10.1093/nar/gkg108.View ArticlePubMedPubMed CentralGoogle Scholar
- Matys V, Kel-Margoulis OV, Fricke E, Liebich I, Land S, Barre-Dirrie A, Reuter I, Chekmenev D, Krull M, Hornischer K, Voss N, Stegmaier P, Lewicki-Potapov B, Saxel H, Kel AE, Wingender E: TRANSFAC and its module TRANSCompel: transcriptional gene regulation in eukaryotes. Nucleic acids research. 2006, D108-110. 10.1093/nar/gkj143. 34 Database
- Wingender E, Chen X, Fricke E, Geffers R, Hehl R, Liebich I, Krull M, Matys V, Michael H, Ohnhäuser R, Prüss M, Schacherer F, Thiele S, Urbach S: The TRANSFAC system on gene expression regulation. Nucleic acids research. 2001, 29 (1): 281-283. 10.1093/nar/29.1.281.View ArticlePubMedPubMed CentralGoogle Scholar
- TransFac. [http://www.biobase-international.com/pages/index.php?id=transfac]
- Lee KM, Kim JH, Kang D: Design issues in toxicogenomics using DNA microarray experiment. Toxicology and applied pharmacology. 2005, 207 (2S): 200-208. 10.1016/j.taap.2005.01.045.View ArticlePubMedGoogle Scholar
- Cowling DW, Johnson TP, Holbrook BC, Warnecke RB, Tang H: Improving the self reporting of tobacco use: results of a factorial experiment. Tobacco control. 2003, 12 (2): 178-183. 10.1136/tc.12.2.178.View ArticlePubMedPubMed CentralGoogle Scholar
- Baron-Epel O, Haviv-Messika A, Green MS, Kalutzki DN: Ethnic differences in reported smoking behaviors in face-to-face and telephone interviews. European journal of epidemiology. 2004, 19 (7): 679-686. 10.1023/B:EJEP.0000036792.58923.75.View ArticlePubMedGoogle Scholar
- Payne CE, Southern SJ: Urinary point-of-care test for smoking in the pre-operative assessment of patients undergoing elective plastic surgery. J Plast Reconstr Aesthet Surg. 2006, 59 (11): 1156-1161. 10.1016/j.bjps.2005.12.053.View ArticlePubMedGoogle Scholar
- Bramer SL, Kallungal BA: Clinical considerations in study designs that use cotinine as a biomarker. Biomarkers. 2003, 8 (3–4): 187-203. 10.1080/13547500310012545.View ArticlePubMedGoogle Scholar
- Wells AJ, English PB, Posner SF, Wagenknecht LE, Perez-Stable EJ: Misclassification rates for current smokers misclassified as nonsmokers. American journal of public health. 1998, 88 (10): 1503-1509.View ArticlePubMedPubMed CentralGoogle Scholar
- Recillas-Targa F, De La Rosa-Velazquez IA, Soto-Reyes E, Benitez-Bribiesca L: Epigenetic boundaries of tumour suppressor gene promoters: the CTCF connection and its role in carcinogenesis. Journal of cellular and molecular medicine. 2006, 10 (3): 554-568. 10.1111/j.1582-4934.2006.tb00420.x.View ArticlePubMedGoogle Scholar
- Torrano V, Chernukhin I, Docquier F, D'Arcy V, Leon J, Klenova E, Delgado MD: CTCF regulates growth and erythroid differentiation of human myeloid leukemia cells. The Journal of biological chemistry. 2005, 280 (30): 28152-28161. 10.1074/jbc.M501481200.View ArticlePubMedGoogle Scholar
- Rimkus C, Martini M, Friederichs J, Rosenberg R, Doll D, Siewert JR, Holzmann B, Janssen KP: Prognostic significance of downregulated expression of the candidate tumour suppressor gene SASH1 in colon cancer. British journal of cancer. 2006, 95 (10): 1419-1423. 10.1038/sj.bjc.6603452.View ArticlePubMedPubMed CentralGoogle Scholar
- Barlic J, Zhang Y, Foley JF, Murphy PM: Oxidized lipid-driven chemokine receptor switch, CCR2 to CX3CR1, mediates adhesion of human macrophages to coronary artery smooth muscle cells through a peroxisome proliferator-activated receptor gamma-dependent pathway. Circulation. 2006, 114 (8): 807-819. 10.1161/CIRCULATIONAHA.105.602359.View ArticlePubMedGoogle Scholar
- Rothbarth K, Spiess E, Juodka B, Yavuzer U, Nehls P, Stammer H, Werner D: Induction of apoptosis by overexpression of the DNA-binding and DNA-PK-activating protein C1D. Journal of cell science. 1999, 112 (Pt 13): 2223-2232.PubMedGoogle Scholar
- Barry WT, Nobel AB, Wright FA: Significance analysis of functional categories in gene expression studies: a structured permutation approach. Bioinformatics (Oxford, England). 2005, 21 (9): 1943-1949. 10.1093/bioinformatics/bti260.View ArticleGoogle Scholar
- Grumelli S, Corry DB, Song LZ, Song L, Green L, Huh J, Hacken J, Espada R, Bag R, Lewis DE, Kheradmand F: An immune basis for lung parenchymal destruction in chronic obstructive pulmonary disease and emphysema. PLoS medicine. 2004, 1 (1): e8-10.1371/journal.pmed.0010008.View ArticlePubMedPubMed CentralGoogle Scholar
- Chang TK, Chen J, Pillay V, Ho JY, Bandiera SM: Real-time polymerase chain reaction analysis of CYP1B1 gene expression in human liver. Toxicol Sci. 2003, 71 (1): 11-19. 10.1093/toxsci/71.1.11.View ArticlePubMedGoogle Scholar
- Port JL, Yamaguchi K, Du B, De Lorenzo M, Chang M, Heerdt PM, Kopelovich L, Marcus CB, Altorki NK, Subbaramaiah K, Dannenberg AJ: Tobacco smoke induces CYP1B1 in the aerodigestive tract. Carcinogenesis. 2004, 25 (11): 2275-2281. 10.1093/carcin/bgh243.View ArticlePubMedGoogle Scholar
- van Leeuwen DM, van Agen E, Gottschalk RW, Vlietinck R, Gielen M, van Herwijnen MH, Maas LM, Kleinjans JC, van Delft JH: Cigarette smoke-induced differential gene expression in blood cells from monozygotic twin pairs. Carcinogenesis. 2007, 28 (3): 691-697. 10.1093/carcin/bgl199.View ArticlePubMedGoogle Scholar
- Finnstrom N, Ask B, Dahl ML, Gadd M, Rane A: Intra-individual variation and sex differences in gene expression of cytochromes P450 in circulating leukocytes. The pharmacogenomics journal. 2002, 2 (2): 111-116. 10.1038/sj.tpj.6500086.View ArticlePubMedGoogle Scholar
- Tatone C, Carbone MC, Falone S, Aimola P, Giardinelli A, Caserta D, Marci R, Pandolfi A, Ragnelli AM, Amicarelli F: Age-dependent changes in the expression of superoxide dismutases and catalase are associated with ultrastructural modifications in human granulosa cells. Molecular human reproduction. 2006, 12 (11): 655-660. 10.1093/molehr/gal080.View ArticlePubMedGoogle Scholar
- Sandler DP, Shore DL, Anderson JR, Davey FR, Arthur D, Mayer RJ, Silver RT, Weiss RB, Moore JO, Schiffer CA, Wurster-Hill DH, McIntyre OR, Bloomfield CD: Cigarette smoking and risk of acute leukemia: associations with morphology and cytogenetic abnormalities in bone marrow. Journal of the National Cancer Institute. 1993, 85 (24): 1994-2003. 10.1093/jnci/85.24.1994.View ArticlePubMedGoogle Scholar
- Lichtman MA: Cigarette smoking, cytogenetic abnormalities, and acute myelogenous leukemia. Leukemia. 2007, 21 (6): 1137-1140. 10.1038/sj.leu.2404698.View ArticlePubMedGoogle Scholar
- Newcomb PA, Carbone PP: The health consequences of smoking. Cancer. The Medical clinics of North America. 1992, 76 (2): 305-331.View ArticlePubMedGoogle Scholar
- Lynge E, Anttila A, Hemminki K: Organic solvents and cancer. Cancer Causes Control. 1997, 8 (3): 406-419. 10.1023/A:1018461406120.View ArticlePubMedGoogle Scholar
- Gene Set Enrichment Analysis. [http://www.broad.mit.edu/gsea/]
- MSigDB Death_Pathway. [http://www.broad.mit.edu/gsea/msigdb/cards/DEATHPATHWAY.html]
- MSigDB METASTASIS_ADENOCARC_DN. [http://www.broad.mit.edu/gsea/msigdb/cards/METASTASIS_ADENOCARC_DN.html]
- MSigDB DAC_IFN_BLADDER_UP. [http://www.broad.mit.edu/gsea/msigdb/cards/DAC_IFN_BLADDER_UP.html]
- The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1755-8794/1/38/prepub