This article has Open Peer Review reports available.
Examining smoking-induced differential gene expression changes in buccal mucosa
© Kupfer et al; licensee BioMed Central Ltd. 2010
Received: 10 December 2009
Accepted: 24 June 2010
Published: 24 June 2010
Gene expression changes resulting from conditions such as disease, environmental stimuli, and drug use, can be monitored in the blood. However, a less invasive method of sample collection is of interest because of the discomfort and specialized personnel necessary for blood sampling especially if multiple samples are being collected. Buccal mucosa cells are easily collected and may be an alternative sample material for biomarker testing. A limited number of studies, primarily in the smoker/oral cancer literature, address this tissue's efficacy as an RNA source for expression analysis. The current study was undertaken to determine if total RNA isolated from buccal mucosa could be used as an alternative tissue source to assay relative gene expression.
Total RNA was isolated from swabs, reverse transcribed and amplified. The amplified cDNA was used in RT-qPCR and microarray analyses to evaluate gene expression in buccal cells. Initially, RT-qPCR was used to assess relative transcript levels of four genes from whole blood and buccal cells collected from the same seven individuals, concurrently. Second, buccal cell RNA was used for microarray-based differential gene expression studies by comparing gene expression between a group of female smokers and nonsmokers.
An amplification protocol allowed use of less buccal cell total RNA (50 ng) than had been reported previously with human microarrays. Total RNA isolated from buccal cells was degraded but was of sufficient quality to be used with RT-qPCR to detect expression of specific genes. We report here the finding of a small number of statistically significant differentially expressed genes between smokers and nonsmokers, using buccal cells as starting material. Gene Set Enrichment Analysis confirmed that these genes had a similar expression pattern to results from another study.
Our results suggest that despite a high degree of degradation, RNA from buccal cells from cheek mucosa could be used to detect differential gene expression between smokers and nonsmokers. However the RNA degradation, increase in sample variability and microarray failure rate show that buccal samples should be used with caution as source material in expression studies.
Blood has been shown to be a responsive tissue that is useful for monitoring gene expression changes due to disease, environmental, biological or drug effects. However, for studies performed in human subjects, a less invasive tissue source for biomarker monitoring is of interest due to the discomfort, required skill level, and cost of blood collection, especially for repeated-measures studies. Buccal mucosa (from cheek swabs) is an easily accessed tissue and has been used successfully to obtain DNA for genotyping studies . However, the literature is limited as to the usefulness of RNA from buccal cells as a substrate for gene expression testing, presumably due to concern regarding a high concentration of RNases in saliva which are known to rapidly degrade RNA in these cells . qPCR has been used to detect expression changes in genes from the P450 family using snap frozen surgical buccal plug samples  and from brushed exfoliated buccal cells [4, 5]. These studies suggested that buccal cells might serve as an alternative to blood in qPCR assays examining gene expression profiles after exposure to environmental toxins, tobacco smoke, drugs, nutrients, or the presence of certain cancers. With RNA purified from brushed exfoliated buccal cells, Sridhar et al.  used microarrays from smoker samples they collected and nonsmoker arrays from the Gene Expression Omnibus (GEO) collection to compare expression levels between smokers and nonsmokers, and to compare expression patterns between buccal cells and bronchial epithelium in smokers and nonsmokers from an earlier microarray-based study  by Gene Set Enrichment Analysis (GSEA) . To our knowledge, buccal cells have not been used in a whole transcriptome approach to investigate differential gene expression between smokers and nonsmokers using concurrently harvested samples in a manner which directly compares expression differences. A successful study of this type would more clearly suggest that buccal cells have efficacy as source material for biomarker discovery or in a gene expression monitoring system than earlier studies.
We describe here, both qPCR and microarray approaches. The RT-qPCR study used matched blood and brushed buccal samples from the same subjects. Relative expression levels of four genes allowed comparison of tissue sources and subject differences. RNA from buccal cells was highly degraded; nonetheless, expression could be detected by qPCR for all four transcripts tested. This was sufficient evidence of the potential of buccal cells to follow up on the work of Sridhar et al.  and use microarrays for differential gene expression analysis on the transcriptome level in smokers and nonsmokers. An important consideration was the availability of the Smoking Induced Epithelial Gene Expression Database, (SEIGE)  and smoker buccal mucosa-specific gene lists  against which results from this study could be compared for confirmation of our method.
Our data was first analyzed for differences between smokers and nonsmokers using Significance Analysis of Microarray (SAM)  and Rank Product (RP)  for detection of significant gene expression differences between smokers and nonsmokers in our study. These analyses resulted in a list of candidate marker genes from each method. Ingenuity Pathway Analysis  was used to find functional networks containing the differentially expressed genes. The gene lists were also examined for transcriptional coregulation by searching the promoters of differentially expressed genes for transcription factor binding sites (TFBS) using PAINT  to access the TRANSFAC  database of known TFBS. Specifically, we identified 103 genes with RP analysis that had increased expression in smokers. Pathway analysis showed five function networks involving 91 of the 103 target genes. Network functions included cell cycle; cell growth, proliferation and movement; gene expression; and immunological disease. Upstream sequence analysis showed 38 target genes containing binding sites for at least one of three widely expressed transcription factors. Twenty-five genes were identified using SAM analysis. Similar to the RP results, 13 of these genes fell into one of two functional networks which had in common roles in tumor morphology, metabolic disease, lipid and carbohydrate metabolism and which contained binding sites for at least one of two widely expressed transcription factors. These results suggest that many of these genes are co-regulated and that the transcriptional response affects numerous cellular functions. Both gene lists were further analyzed using GSEA, to compare the buccal dataset against the Sridhar gene sets. The comparisons showed that the genes in our buccal array data changed expression in the same direction as in the published sets.
The results of the study suggest that buccal mucosa may indeed be useful for factors selected carefully for optimum expression change in buccal tissue. However, the random degradation which may vary between subjects that we encountered suggests a loss of sensitivity, and possibly the need for multiple sampling which is costly. It also suggests that due to the extensive degradation found it seems unlikely to be a reliable source for biomarker discovery.
Smoker vs. nonsmoker demographics and nicotine/cotinine levels
Buccal samples were collected using sterile Cytobrush Plus® (Medscand Medical; Guttenberg, NJ). Subjects were asked not to eat for the 30 minutes prior to sampling and rinsed their mouths with a minimum of 20 ml of water before sample collection. Two buccal samples were collected from each subject and processed separately as either "a" or "b" samples. Cheeks were brushed for 30 seconds, and the brushes were immediately plunged into 2 ml tubes containing 1.0 ml of room temperature RNAlater (Invitrogen, Carlsbad, CA) to prevent post-sampling degradation of the RNA. The brush ends were cut off with sterile surgical scissors such that the 2 ml tubes could be capped. RNA was purified from buccal cell swabs immediately after collection.
RNA isolation from blood samples was performed according to the protocol in the PAXgene™ Blood RNA Purification Kit  with the optional on-column DNase treatment. A blood total RNA control sample was created by pooling purified RNA samples from three individuals not participating in either study.
Buccal-cell RNA was purified using the RNeasy Micro Kit (Qiagen, Valencia, CA) with the modifications found in Spivack et al.  and here. Cells were pelleted by centrifugation at 4,000 × g. The brush was removed from the tube by scraping the bristles against the lip of the tube to remove any adhered cells and the pellet reformed by centrifugation as above. RNAlater was pipetted off the pellet and the pellet washed with ice-cold PBS and the PBS removed after centrifugation, as above. Two microliters of polyC (Sigma Chemical; St. Louis MO) and 350 μl Buffer RLT (RNeasy Micro Kit) containing 10 μl/ml beta-mercaptoethanol was added and the pellet passed through a 25 ga needle to lyse the cells. The lysate was centrifuged at 20,000 × g for 3 minutes and the supernatant transferred to a fresh microfuge tube. Then 350 μl 70% ethanol was added, mixed well by pipetting and the sample applied to a MinElute column (RNeasy Micro Kit), and centrifuged at 8000 × g for 30 seconds. The column was washed twice with 350 μl of RW1 buffer (RNeasy Kit) followed by centrifugation at 8000 × g for 15 seconds. The column was placed in a fresh 2 ml collection tube and 500 ul RPE buffer (RNeasy Micro Kit) was added. The column was centrifuged at 8000 × g for 30 seconds. 500 μl of freshly prepared 80% ethanol was added the column and centrifuged for 2 minutes at 8000 × g. The column was transferred to a fresh 2 ml collection tube, with the cap open and centrifuged at 16,000 × g for 5 minutes. The RNA was eluted by adding 30 μl pre-warmed (50-55 degC) RNase-free water to the membrane. After 2 minutes incubation the column was centrifuged at 16,000 × g for 2 minutes. Spectrophotometric analysis showed a large 230 nm component, potentially salt carryover. To reduce this, the Qiagen RNeasy Micro Handbook RNA Cleanup and Concentration protocol (December 2007) was used as written by the manufacturer for sample volumes less than 100 μl.
RNA quality was assessed from Agilent Bioanalyzer 2100 (Agilent, Santa Clara, CA) traces using the Agilent RNA 6000 Nano Series II kit following manufacturer's directions with 1 μl of sample to generate a RNA Integrity Number (RIN). Yield was determined on a Nanodrop 1000 spectrophotometer (Thermo Scientific, Waltham, MA) (Additional file 1). RNA was aliquoted and stored at -80 degC.
Primers for qPCR in the matched blood and buccal portion of the study were designed using Beacon Designer 7.0 (PREMIER Biosoft International, Palo Alto, CA). Primers were synthesized and HPLC purified (Integrated DNA Technologies, Coralville IA). For three genes, integrin alpha-5/beta-1 (ITGA5), ankyrin repeat domain 28 (ANKRD28), and transmembrane protein 8 (TMEM8), multiple sets of primers were designed to span the mRNA. For ribosomal protein S3A, (RPS3A) only a single primer set was designed due to the small size of the transcript. See Additional file 2 for the primer sequences, positions of the primer sets on the respective transcript, concentrations and annealing temperatures. Template material for qPCR was prepared from 50 ng aliquots of total RNA that were reverse transcribed and amplified using either the WT-Ovation™ Pico System or the Ovation™ RNA Amplification System V2, #3300, 3100, respectively (Nugen Technologies, Inc., San Carlos, CA). All qPCR reactions were 25 μl and performed in triplicate with a SYBR® green based based assay, PerfeCta SYBR Green FastMix, Low ROX, #95074-05k (Quanta Biosciences, Gaithersburg, MD) with no additional magnesium using 1 ng of amplified template material per reaction except in the amplification comparison series where 5 ng/reaction was used. Cycling was performed on a Stratagene MX3005p (Agilent Technologies, La Jolla, CA) in a 96-well polypropylene plate using optical strip caps (#410098 and 401425 Agilent Technologies). Cycling parameters were one cycle of 2 minutes 95 degC, 40 cycles of 15 seconds 95 degC, 30 seconds optimum annealing temperature, 15 seconds 72 degC extension, followed by a dissociation curve with 1 minute 95 degC, 30 seconds at optimum annealing temperature, and dissociation ramp rate at 0.01 degree/second to 95 degC with all points data collection on. qPCR data was analyzed using qBase version 1.3.5 . qPCR product size was assessed with Agilent DNA 1000 Series II (Agilent Technologies) microfluidics chips.
A no reverse transcription control was performed in duplicate using total RNA in the amount to simulate what was used after reverse transcription and amplification from each sample. The TMEM8 3'-most primers were used. All reactions failed. The positive control gave Cts of 22.08 and 22.8.
Microarray target preparation
For microarray target material used in the smoker vs. nonsmoker portion of the study, 50 ng total RNA was reverse transcribed and amplified per the manufacturer's protocols using the Ovation™ RNA Amplification System V2 (Nugen Technologies, Inc.), fragmented and biotin labeled using the FL-Ovation™ cDNA Biotin Module V2, #4200 (Nugen Technologies, Inc.). Gene expression was determined by hybridization of the labelled template to hgU133 Plus 2.0 human microarrays (Affymetrix, Inc., Santa Clara, CA). Hybridization cocktail synthesis and post-hybridization processing was performed according to the "Affymetrix GeneChip Eukaryotic Array Analysis" protocol found in the appendix of the Nugen protocol for the fragmentation kit. Arrays were hybridized for 18 hours and washed using fluidics protocol FS450_0004 on a GeneChip Fluidic Station 450 (Affymetrix, Inc.)
Quality assessment of the arrays was performed with the tools available in the Gene Chip Operating Software, version 1.4 (Affymetrix, Inc.) and the Bioconductor packages AffyQCReport  and AffyPLM , R version 2.8, Bioconductor version 2.3 . The microarray data has been assigned series number GSE16149 in the Gene Expression Omnibus (GEO, http://www.ncbi.nlm.nih.gov/geo/).
Microarray data analysis
Array data were processed with Robust Multiarray Average (RMA)  and quantile normalized using the package available at the Automated Microarray Pipeline (AMP) . Differential expression analysis comparing smokers to nonsmokers was performed with both Significance Analysis of Microarrays (SAM)  and Rank Product Analysis (RP) , selected for their different statistical approaches. For RP analysis, samples matching the two poor quality arrays were removed as this analysis utilizes the ranked expression values from replicate samples. This left 12 arrays, six in each replicate group, a and b, for this analysis. Unsupervised hierarchical clustering, T-tests with multiple testing correction, SAM and RP were performed using the packages available in the MultiExperiment Viewer, version 4.3.01 (MeV) [21, 22] with default settings. Gene Set Enrichment Analysis, GSEA version 2.04 [8, 23] was used to test the array data for enrichment of differentially expressed genes. The default settings were used except the minimum size for gene sets was decreased to ten to allow analysis against the RP_downSm list which GSEA reduced from 17. The same microarray differential expression analysis pipeline was used on the data from series GSE8987 from the GEO database , which were designated either "mouth", "never smokers" or "current smokers".
The output gene lists of differentially expressed genes from RP and SAM were evaluated for biological significance using Ingenuity Pathway Analysis, IPA (Ingenuity Systems, Inc., Redwood City CA), for a core analysis. Promoter Analysis and Interaction Network, PAINT version 3.6  analysis using the TRANSFAC public database  was used with the same gene lists examining both strands to 2000 bases upstream looking for transcription factor binding sites and summing in TREs any potentially co-regulated genes.
Evaluation by qPCR
BioGPS expression values for blood and salivary gland for genes tested via qPCR
qPCR results comparing blood and buccal RNA across four genes with WT amplified template
When specificity of degradation was investigated, no clear pattern was evident. ITGA5 showed a 32-fold difference from 5' to 3' in buccal mucosa compared to an approximately three-fold difference in blood, but most reactions with ITGA5 primers with buccal RNA failed. ANKRD28 and TMEM8 showed no change in 5'/3' ratio in either RNA source. Due to the short transcript length of RPS3A, no 5' primer set was designed. This initial analysis of the quality of buccal RNA shows that, in general, there were lower but detectable levels of target mRNA in buccal mucosa when compared to blood (Table 3). These results do not differentiate between tissue-specific expression differences or degradation; however, when the expression data from BioGPS and the RINs are factored into our analysis, the differences in Cts are greater than expected from expression data and likely due to degradation. The variability of results from buccal cells suggests that the degradation seen in the buccal samples is not occurring in a predictable directional fashion but randomly such that transcript size has no clear effect on the level of degradation seen.
qPCR results comparing methods of template amplification
Quality assessment of the arrays
Microarray quality metrics
Microarray data analysis for differential expression
A study using Affymetrix hgU133A arrays to compare gene expression in "current smokers" and "neversmokers" using RNA from buccal mucosa and nasal swabs was published by Sridhar et al. . This group performed an extensive microarray analysis of gene expression in bronchial lavage samples from current smokers, former-smokers and never-smokers and developed a list of 314 genes differentially expressed in smokers in this tissue [7, 26]. Using Gene Set Enrichment Analysis (GSEA), Sridhar examined the smoker buccal and nasal microarray data asking whether the genes on the bronchial-314 gene list showed the same direction of change and identified three leading-edge subsets of genes from the bronchial-314 list which were changing expression in the buccal or nasal data in the same direction as in the bronchial data. These were a 74 gene subset up-regulated in buccal mucosa of smokers, a 120 gene subset up-regulated in the nasal mucosa of smokers and a 50 gene subset down-regulated in nasal mucosa of smokers. The buccal microarray cel files were downloaded from GEO and analyzed in parallel with the data from the current study (Materials and Methods). Initially, unsupervised hierarchical clustering was performed with the summarized data from the current study, termed SmvsNS, and BuccalCompare for the Sridhar study. Neither dataset showed any pattern of clustering by replicate sample (a vs. b) in the case of the SmvNS data, nor by smokers and non-smokers in either dataset.
T-tests comparing the a samples to the b samples in the SmvsNS data were done to evaluate the within-subject variability. There were 871 significant probesets out of 53,800 or 1.62%. Comparing smokers to nonsmokers using the same test gave 178 probesets or 0.33%. Applying a t-test to the BuccalCompare data gave 65 differentially expressed probesets comparing never smokers to current smokers and 66 probesets when a random grouping of odd numbered arrays against even was compared. Taken together, these results suggest that there is at least as much or greater variability among subjects than smoking introduces between the two subject types.
SAM  and RP  were used to develop lists of differentially expressed genes between smokers and nonsmokers. With the SmvNS data, SAM returned 30 significant probesets with a Q value of 0 at a 10% FDR. All 30 probesets were up-regulated in smokers. For the BuccalCompare dataset there were no significant results from the SAM analysis. With RP analysis of the SmvNS data seventeen genes were found to be down-regulated and 118 genes up-regulated in smokers (Additional file 3). RP analysis could not be performed on the BuccalCompare dataset since there were no replicates.
Using a similar analysis approach to Sridhar, both the SmvsNS and the BuccalCompare datasets were compared against six gene lists in a GSEA enrichment analysis. The gene lists were the 74 genes in Buccal_upSm, the 120 genes in Nasal_upSm and the 49 genes in Nasal_downSm defined as leading edge subsets by Sridhar , the 25 genes in SAM_upSm, the 107 RP_upSm genes, and the 17 genes in RP_downSm the three lists from the current study (Additional file 3).
When GSEA analysis of the SmvsNS dataset was performed against all six gene lists, the four lists up-regulated in smokers showed the same expression patterns in the SmvsNS dataset, and the two down-regulated gene lists likewise were down-regulated in the SmvsNS dataset. The same analysis was performed using the BuccalCompare data against the same six gene lists with the same results. This showed correlation between the SmvsNS and BuccalCompare datasets in terms of the direction of gene expression change for genes in the six sets. In the SmvsNS comparison the SAM_upSm list genes were significantly enriched in the smoker phenotype with an FDR q-value 0.029 and p-value 0.025 but not the RP_upSm genes which showed an FDR q-value 0.3. This was unexpected since the RP_upSm gene list was derived from the SmvsNS dataset. The BuccalCompare data behaved similarly with only the Buccal_upSm gene list statistically significantly enriched. This was expected since it was derived from this dataset.
As a check for reproducibility, two subjects, one smoker and one nonsmoker, both cheeks, were retested several months after the initial sampling was performed. Four arrays were generated (11Sm a, b and 12NS a, b). This small dataset was examined with GSEA against the same six gene sets. The results showed that this repeated subset had significant gene enrichment for smokers with the RP_upSm, Nasal_upSm and Buccal_upSm gene lists with a nominal p-value of 0, an indication of good reproducibility.
To further evaluate the gene lists derived from the SmvsNS dataset for biological coherence the SAM and RP gene lists were evaluated for over-representation of transcription factor binding sites in the promoters of these genes using the Promoter Analysis and Interactive Tool Set, (PAINT) [12, 28], Materials and Methods, and for shared functional interactions using Ingenuity Pathways Analysis, (IPA) (version 7.0, Copyright 2009 Ingenuity Systems, Inc., Redwood City CA). Statistically significant transcriptional regulation elements (TREs) were found with 15 of the SAM_upSm and 42 RP_upSm genes. No TREs were found for genes in the RP_downSm genes.
In IPA, 17 of the 25 genes from SAM_upSm formed two initial networks sharing broad functional categories including tumor morphology, lipid metabolism, carbohydrate metabolism, and small molecule biochemistry that could be merged into a single large network. The RP_downSm genes did not result in any functional networks when examined in IPA. However, 91 of the genes on the RP_upSm list fell into five networks which could be merged into a single large network, indicating shared function. Functional categories for this network included: cell growth, movement, development and death; cell cycle; gene expression, cancer and immunological system development and function.
This study was focused on determining whether the buccal mucosa could serve as a tissue source for total RNA to be used in relative gene expression studies and biomarker detection by qPCR and microarray analyses. Two previous studies had suggested that buccal cells had efficacy for measuring responses to tobacco smoke exposure [5, 6] and suggested extrapolation of this tissue source to other inhalation or ingestion exposures .
Our initial RNA isolations from matched blood and buccal RNA showed a marked difference in the quality of the isolated material between the two sources and showed that there was significant degradation in buccal mucosa RNA. The qPCR results from the matched samples showed an average lower copy number in buccal RNA than blood RNA for all four genes tested and greater variability between subjects (Table 3). The lower copy-number was expected as salivary glands express all four genes at the same or lower level as blood on microarrays (Table 2); however, the increased variability found between buccal samples, including duplicate samples from the same subject over blood is a concern.
The amplification protocols utilized here allow buccal cell samples to be used in repeated measures experiments removing the necessity to sample more than once to obtain sufficient template for a single microarray. The fifty nanograms of RNA we used for amplification can routinely be isolated from a single swab (Additional file 1) and the resulting amplified cDNA is sufficient for an array as well as other procedures such as qPCR. This is in contrast to the multiple sampling and pooling from the same individual required by Sridhar et al.  where amplification was not used. Additionally, there was an advantage to using 3'-amplification over a whole transcriptome approach with the degraded buccal RNA possibly due to a reduction in the rRNA contribution to the amplified product.
In most cases, the array quality was acceptable but with buccal RNA, arrays did have a higher failure rate than is typical for arrays hybridized with target material from blood RNA. Two of 16 samples failed where matching samples from the other cheek passed. This opens the possibility that samples from both cheeks would be required to insure that every sample was collected in a study. However, we found the intra-subject variability to be high as well. The availability of the Sridhar buccal dataset provided comparison data and along with the previous work from this group , also provided published lists of genes from buccal and nasal cells which change expression levels due to smoking. Gene lists developed from the current study did not overlap extensively with each other or with the Sridhar lists. However, using the independent analysis tools PAINT and IPA a cohesive function/cotranscription network was generated suggesting two non-random sets of genes upregulated in smokers. Transcription factor binding site analysis is a good complement to a functional analysis such as IPA because it has no a priori assumptions about gene function relying instead on promoter sequence alone. The analysis results suggested that using an approach which included these two complementary methods is useful for evaluating candidate genes.
The analysis conducted with GSEA was significant because there was perfect concordance between gene lists derived from each of the two datasets for the direction of change in expression between smokers and non-smokers. The results from the small repeated dataset were an indication of reproducibility with this system. This validated the methods used in the current study to discover differentially expressed genes. However, the lack of consistent statistically significant enrichment for the smoker phenotype with GSEA analysis taken with the degradation in RNA derived from buccal cells highlight the difficulties to be expected when using buccal-cell RNA for differential expression testing.
The work presented here was a straightforward evaluation of buccal mucosa as a tissue useful for evaluating relative gene expression changes using an analysis scheme containing well validated and commonly used analysis tools. Isolation and amplification techniques were successfully modified from those used with whole blood. The level of degradation found was not unexpected; nevertheless, we were able to successfully perform qPCR with the buccal RNA. Somewhat surprising was that, given the poor quality of the RNA, the quality of the majority of the microarrays was acceptable and that several lists of genes showing change in expression in smokers compared to nonsmokers resulted from statistical analysis of the arrays. There was evidence of reproducibility in expression change but the borderline statistical significance level of the lists clouds the validity of the findings. Our findings suggest that this may be a difficult tissue to use, requiring replicate sampling and arrays, which may perform better using a different technology such as an array format or amplification method designed for heavily degraded template material. However, using buccal tissue RNA with 3' amplification may be a suitable tissue choice and preparation approach when assaying specific highly differentially expressed gene targets which could overcome the limitations of subject variability and sample degradation.
Thanks to Kristina Holton, Dana Farber Cancer Research Institute, Boston MA for assistance on analysis approaches especially Rank Product.
- Thompson MD, Bowen RA, Wong YBY, Antal J, Liu Z, Yu H, Siminovitch K, Kreiger N, Rohan TE, Cole DE: Whole genome amplification of buccal cell DNA: genotyping concordance before and after multiple displacement amplification. Clin Chem Lab Med. 2005, 43 (2): 157-162. 10.1515/CCLM.2005.026.View ArticlePubMedGoogle Scholar
- Ceder O, van Dijken J, Ericson T, Kollberg H: Ribonuclease in different types of saliva from cystic fibrosis patients. Acta Paediatr Scand. 1985, 74: 102-106. 10.1111/j.1651-2227.1985.tb10928.x.View ArticlePubMedGoogle Scholar
- Vondracek M, Xi Z, Larsson P, Baker V, Mace K, Pfeifer A, Tjalve H, Donato MT, Gomez-Lechon MJ, Grafstrom RC: Cytochrome P450 expression and related metabolism in human buccal mucosa. Carcinogenesis. 2001, 22 (3): 481-488. 10.1093/carcin/22.3.481.View ArticlePubMedGoogle Scholar
- Spira A, Beane J, Schembri F, Liu G, Ding C, Gilman S, Yang X, Cantor C, Brody JS: Noninvasive method for obtaining RNA from buccal mucosa epithelial cells for gene expression profiling. BioTechniques. 2004, 36: 484-487.PubMedGoogle Scholar
- Spivack SD, Hurteau GJ, Jain R, Kumar SV, Aldous MKM, Gierthy JF, Kaminsky LS: Gene-environment interaction signatures by quantitative mRNA profiling in exfoliated buccal mucosal cells. Cancer Res. 2004, 64: 6805-6813. 10.1158/0008-5472.CAN-04-1771.View ArticlePubMedGoogle Scholar
- Sridhar S, Schembri F, Zeskind J, Shah V, Gustafson AM, Steiling K, Liu G, Dumas Y-M, Zhang X, Brody JS, Lenburg ME, Spira A: Smoking-induced gene expression changes in the bronchial airway are reflected in nasal and buccal epithelium. BMC Genomics. 2008, 9: 259-10.1186/1471-2164-9-259.View ArticlePubMedPubMed CentralGoogle Scholar
- Shah V, Sridhar S, Beane J, Brody JS, Spira A: SIEGE: Smoking induced epithelial gene expression database. Nucleic Acids Research. 2005, D573-D579. 33 databaseGoogle Scholar
- Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP: Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci. 2005, 102 (43): 15545-15550. 10.1073/pnas.0506580102.View ArticlePubMedPubMed CentralGoogle Scholar
- Tusher VG, Tibshirani R, Chu G: Significance analysis of microarrays applied to the ionizing radiation response. Proc Natl Acad Sci. 2001, 98 (9): 5116-5121. 10.1073/pnas.091062498.View ArticlePubMedPubMed CentralGoogle Scholar
- Breitling R, Armengaud P, Amtmann A, Herzyk P: Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments. FEBS Letters. 2004, 573: 83-92. 10.1016/j.febslet.2004.07.055.View ArticlePubMedGoogle Scholar
- Ingenuity Pathway Analysis (IPA). [http://www.ingenuity.com/]
- Vadigepalli R, Chakravarthula P, Zak DE, Schwaber JS, Gonye GE: PAINT: a promoter analysis and interaction network generation tool for gene regulatory network identification. OMICS. 2003, 7 (3): 235-252. 10.1089/8482623103322452378.View ArticlePubMedGoogle Scholar
- TRANSFAC database of conserved transcription factor binding sites. [http://www.gene-regulation.com/pub/databases.html]
- PreAnalytiX Technical Notes. [http://www.preanalytix.com/Tech_Notes.asp#]
- Hellemans J, Mortier G, De Paepe A, Speleman F, Vandesompele J: qBase relative quantification framework and software for management and automated analysis of real-time quantitative PCR data. Genome Biol. 2007, 8 (2): R19-10.1186/gb-2007-8-2-r19.View ArticlePubMedPubMed CentralGoogle Scholar
- Parman C, Hallman C, Gentleman R: AffyQCreport. [http://www.bioconductor.org/packages/2.4/bioc/html/affyQCReport.html]
- Bolstad BM: affyPLM. [http://www.bioconductor.org/packages/bioc/1.6/src/contrib/html/affyPLM.html]
- Bioconductor: open source software for bioinformatics. [http://www.bioconductor.org/]
- Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Schere P, Speed TP: Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics. 2003, 4 (2): 249-264. 10.1093/biostatistics/4.2.249.View ArticlePubMedGoogle Scholar
- AMP: Automated Microarray Pipeline. [http://compbio.dfci.harvard.edu/amp]
- TM4 Microarray Software Suite. [http://www.tm4.org]
- Saeed AI, Sharov V, White J, Li J, Liang W, Bhagabati N, Braisted J, Klapa M, Currier T, Thiagarajan M, Sturn A, Snuffin M, Rezantsev A, Popov D, Ryltsov A, Kostukovich E, Borisovsky I, Liu Z, Vinsavich A, Trush V, Quackenbush J: TM4: a free, open-source system for microarray data management and analysis. Biotechniques. 2003, 34 (2): 374-378.PubMedGoogle Scholar
- Gene Set Enrichment Analysis. [http://www.broad.mit.edu/gsea]
- Wu C, Orozco C, Boyer J, Leglise M, Goodale J, Batalov S, Hodge CL, Haase J, Janes J, Huss JW, Su AI: BioGPS: an extensible and customizable portal for organizing and querying gene annotation resources. Genome Biol. 2009, 10 (11): R130-10.1186/gb-2009-10-11-r130. [http://biogps.gnf.org]View ArticlePubMedPubMed CentralGoogle Scholar
- Bolstad BM, Collin F, Brettschneider J, Simpson K, Cope L, Irizarry RA, Speed TP: Quality assessment of Affymetrix GeneChip data. Bioinformatics and computational biology solutions using R and Bioconductor. Bioinformatics and Computational Biology Solutions Using R and Bioconductor. Edited by: Gentleman R, Carey V, Huber W, Irizarry RA, Dudoit S. 2005, Springer, 33-47. full_text.View ArticleGoogle Scholar
- SIEGE-Smoking induced epithelial gene expression database. [http://pulm.bumc.bu.edu/siegeDB/]
- Oliveros JC: VENNY. An interactive tool for comparing lists with Venn Diagrams. [http://bioinfogp.cnb.csic.es/tools/venny/index.html]
- PAINT: Promoter analysis and interaction network toolset (V 3.6). [http://www.dbi.tju.edu/dbi/tools/paint/]
- The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1755-8794/3/24/prepub
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.