Identification of genetic risk variants for deep vein thrombosis by multiplexed next-generation sequencing of 186 hemostatic/pro-inflammatory genes
- Luca A Lotta1, 2,
- Mark Wang2,
- Jin Yu2,
- Ida Martinelli1,
- Fuli Yu2,
- Serena M Passamonti1,
- Dario Consonni3,
- Emanuela Pappalardo1,
- Marzia Menegatti1,
- Steven E Scherer2,
- Lora L Lewis2,
- Humeira Akbar2,
- Yuanqing Wu2,
- Matthew N Bainbridge2,
- Donna M Muzny2,
- Pier M Mannucci1,
- Richard A Gibbs2 and
- Flora Peyvandi1
© Lotta et al; licensee BioMed Central Ltd. 2012
Received: 11 October 2011
Accepted: 21 February 2012
Published: 21 February 2012
Next-generation DNA sequencing is opening new avenues for genetic association studies in common diseases that, like deep vein thrombosis (DVT), have a strong genetic predisposition still largely unexplained by currently identified risk variants. In order to develop sequencing and analytical pipelines for the application of next-generation sequencing to complex diseases, we conducted a pilot study sequencing the coding area of 186 hemostatic/proinflammatory genes in 10 Italian cases of idiopathic DVT and 12 healthy controls.
A molecular-barcoding strategy was used to multiplex DNA target capture and sequencing, while retaining individual sequence information. Genomic libraries with barcode sequence-tags were pooled (in pools of 8 or 16 samples) and enriched for target DNA sequences. Sequencing was performed on ABI SOLiD-4 platforms. We produced > 12 gigabases of raw sequence data to sequence the 700-kilobase target area at high coverage (average: 42X) in 22 individuals. A total of 1876 high-quality genetic variants were identified (1778 single nucleotide substitutions and 98 insertions/deletions). Annotation against databases of genetic variation and human disease mutations revealed several novel, potentially deleterious mutations. We tested 576 common variants in a case-control association analysis, carrying the top 5 associations over to replication in up to 719 DVT cases and 719 controls. We also conducted an analysis of the burden of nonsynonymous variants in coagulation factor and anticoagulant genes. We found an excess of rare missense mutations in anticoagulant genes in DVT cases compared to controls and an association for a missense polymorphism of FGA (rs6050; p = 1.9 × 10⁻⁵, OR 1.45; 95% CI, 1.22-1.72; after replication in > 1400 individuals).
We implemented a barcode-based strategy to efficiently multiplex sequencing of hundreds of candidate genes in several individuals. In the relatively small dataset of our pilot study we were able to identify bona fide associations with DVT. Our study illustrates the potential of next-generation sequencing for the discovery of genetic variation predisposing to complex diseases.
Keywords: Deep vein thrombosis, venous thromboembolism, next-generation sequencing, target capture, multiplexing, FGA, rs6025, hemostateome, DVT, VTE
Deep vein thrombosis (DVT) of the lower extremities, a common thrombotic disease often complicated by acute pulmonary embolism, has a strong genetic component as established by family [2–4] and twin studies, with a 3-fold increase in disease risk for siblings of individuals with DVT and an estimated hereditary component of 60%. Genetic risk factors include rare mutations in PROC, PROS1 and SERPINC1 leading to deficiencies of the natural anticoagulant proteins (protein C, protein S and antithrombin, respectively) and single nucleotide polymorphisms (SNPs) of F5 (rs6025 or factor V Leiden [FVL]) and of F2 (rs1799963 or prothrombin G20210A). More recently, genome-wide association studies (GWAS) identified associations of common SNPs at 6 different genomic loci (CYP4V2, SERPINC1, GP6, F5, ABO and HIVEP1) [7–9]. In spite of these recent observations, however, genetic variants established to influence the risk for DVT explain only a fraction of disease heritability.
Next-generation DNA sequencing is opening new avenues for genetic association studies in complex diseases, making it possible to sequence large fractions of the human genome (or even the entire genome) at unprecedented speed and per-base cost. Re-sequencing the exome (i.e. the protein-coding area of the genome) or the entire genome is becoming the gold standard for the identification of disease-causing mutations in Mendelian diseases [11–13]. However, the application of these techniques to common diseases is still limited by the high costs and considerable computational burden associated with the analysis of numerous samples. Sequencing of a few hundred genes or genomic loci is a much less expensive alternative to whole-genome (or exome) sequencing, suitable for the analysis of areas of the genome that are deemed to have particular relevance in the pathophysiology of a given disease. However, this regional sequencing approach, which requires sample multiplexing in order to achieve efficiency, is technically challenging. Multiplexing on next-generation sequencing platforms entails the use of DNA pools, which limits downstream genetic association analysis. These limitations include (a) loss of sensitivity to detect rare genetic variants, (b) uncertainty in allele frequency estimates caused by the unequal representation of samples within the pool, and (c) loss of individual sequence data, which hampers direct genotype-phenotype associations.
In this pilot study, we used genomic libraries with barcode sequence-tags as a means to overcome the limitations of pooling. Using this approach we were able to multiplex DNA-target capture and SOLiD sequencing while retaining individual sequence information. In what is one of the first practical applications of this technique, we sequenced the coding area of 186 genes involved in blood hemostasis/inflammation (i.e. two pivotal pathophysiological mechanisms of DVT) in 10 Italian patients with early-onset idiopathic DVT and 12 thrombosis-free controls. The goal of this pilot study was to develop pipelines for the application of next-generation sequencing in the setting of DVT and to test different analytical approaches for the identification of disease-associated variants.
Cases of DVT were selected from 2139 unrelated patients referred to the Angelo Bianchi Bonomi Hemophilia and Thrombosis Center, Milan (Italy) for diagnostic workup and thrombophilia testing after a first episode of DVT of the lower limbs in the years 1995-2010. Patients were asked to bring to the center the diagnostic documentation of their thrombotic episodes and underwent a clinical interview. DVT had been diagnosed by compression ultrasonography or venography. All patients underwent a complete thrombophilia screening, including measurement of natural anticoagulant proteins, genotyping of FVL and prothrombin G20210A and search for antiphospholipid autoantibodies. Coagulation factor VIII and fibrinogen coagulant activities were also measured.
A patient selection flow-chart is shown in Additional file 1 Figure S1. Patients selected for next-generation sequencing were required to have (a) a history of idiopathic DVT of the lower limbs, (b) age at disease onset < 55 years, (c) wild-type FVL and prothrombin G20210A genotypes, (d) no natural anticoagulant deficiencies, (e) a negative search for anti-phospholipid autoantibodies, and (f) birth in Lombardy, the Italian region of 10 million people that has Milan as its regional capital. Since 42 patients matched these criteria, patients were further prioritized based on age of onset, family history of venous or arterial thrombosis, completeness of clinical information, and DNA amount and quality. Patients included in the replication were all the remaining 719 patients with a first episode of idiopathic lower-limb DVT and available DNA. Controls included in the study were selected from a total of 1938 healthy Italian individuals recruited among friends and non-consanguineous relatives who accompanied patients to the Hemophilia and Thrombosis Center and agreed to be tested for thrombophilia. Previous arterial or venous thrombosis in the controls was excluded using a validated questionnaire. Controls were of similar age to the idiopathic DVT cases (± 5 years) and of the same gender. Controls who underwent next-generation sequencing were matched with the patients for geographic origin (born in Lombardy), in order to minimize population stratification. When more than one control matched the same patient, the control was chosen at random. The study was approved by the Institutional Review Board of the Fondazione IRCCS Ca' Granda - Ospedale Maggiore Policlinico and all subjects gave their informed consent. Patient recruitment, sampling and thrombophilia screening were carried out at the Angelo Bianchi Bonomi Hemophilia and Thrombosis Center, Milan (Italy).
Next-generation DNA analysis, replication PCR and Sanger sequencing, and the associated analyses were carried out at the Human Genome Sequencing Center (HGSC), Baylor College of Medicine, Houston (USA).
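The control-matching rule described above (same gender, age within 5 years, same birth region, random choice among eligible controls) can be sketched in Python as follows. This is only an illustration: the record fields, the one-control-per-case design and the fixed random seed are assumptions, not details given in the text.

```python
import random


def match_controls(cases, controls, max_age_gap=5, seed=0):
    """Pick one unused control per case (hypothetical 1:1 design).

    A control is eligible if it has the same sex and birth region as the
    case and an age within max_age_gap years; ties are broken at random.
    """
    rng = random.Random(seed)
    available = list(controls)
    pairs = {}
    for case in cases:
        eligible = [c for c in available
                    if c["sex"] == case["sex"]
                    and c["region"] == case["region"]
                    and abs(c["age"] - case["age"]) <= max_age_gap]
        if eligible:
            chosen = rng.choice(eligible)
            available.remove(chosen)  # each control is used at most once
            pairs[case["id"]] = chosen["id"]
    return pairs
```

Cases without any eligible control are simply left out of the returned mapping.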
Target area selection
The protein coding exons, 3' and 5' UTRs, and the intron-exon boundaries of 186 genes were chosen as the target area. Target genes included all coagulation factor genes, anticoagulant genes and genes involved in fibrinolysis, platelet adhesion and aggregation, cell-cell interaction, endothelial activation and inflammation. The genomic coordinate intervals corresponding to the target area were obtained from the UCSC Genome Browser database and sent to NimbleGen for probe design. Probes were designed for all of the submitted intervals, with slightly different genomic coordinates (i.e. tiled regions). The final target area spanned 644,472 bp. Probes were arrayed on Roche NimbleGen HD2 2.1 M-probe custom chips. The complete lists of target and tiled coordinate intervals and of the target genes are in Additional file 2 and Additional file 1 Table S1.
Two experiments were carried out. In the first experiment, genomic libraries from 4 DVT patients (DVT_P_01, DVT_P_02, DVT_P_03, DVT_P_04) and 4 controls (DVT_C_01, DVT_C_02, DVT_C_03, DVT_C_04) were captured on the same Roche NimbleGen HD2 chip and sequenced in the same ABI SOLiD 4 spot (one quarter of a slide). In the second, libraries from 8 cases (DVT_P_01, DVT_P_05, DVT_P_06, DVT_P_07, DVT_P_08, DVT_P_09, DVT_P_10, DVT_P_11) and 8 controls (DVT_C_05, DVT_C_06, DVT_C_07, DVT_C_08, DVT_C_09, DVT_C_10, DVT_C_11, DVT_C_12) were captured on one Roche NimbleGen HD2 chip and sequenced in two ABI SOLiD 4 spots. Sample DVT_P_11 failed capture during the second experiment. Patient DVT_P_01 was sequenced twice as a quality control procedure.
Genomic library preparation, barcoding and enrichment of target DNA sequences
Two micrograms of genomic DNA were used to prepare libraries of DNA fragments that were ligated with ABI SOLiD P1- and P2-adaptors. Different modified P2-adaptors, each containing a specific DNA-tag sequence (molecular barcode), were used for each individual library. Libraries with barcodes were used to prepare equimolar DNA pools of 8 and 16 samples. Four micrograms of DNA from each pool were hybridized on one Roche NimbleGen HD2 chip.
ABI SOLiD 4 sequencing
Capture products underwent emulsion PCR with P1-adaptor mediated attachment of clonally-amplified templates to loading beads. Beads were covalently attached on glass slides at the P2-adaptor extremity. Sequencing by oligonucleotide ligation and detection (SOLiD) was performed on ABI SOLiD 4 platforms.
Read barcode assignment and mapping to the reference genome
Reads with barcodes were assigned to the corresponding sample using custom Perl scripts and mapped to the human reference genome, NCBI36/hg18, using BFAST software. Reads that mapped to the same start and end coordinates, considered likely to be PCR duplicates, were marked in the binary alignment/mapping (BAM) files, where mapping information was stored.
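The barcode-assignment step can be illustrated with a minimal Python sketch. It is not the authors' Perl code, which is not shown in the text. The barcode sequences, the one-mismatch tolerance and the representation of each read as a (tag, sequence) pair are all assumptions made for illustration, and real SOLiD barcodes are read in color space rather than as plain letters.

```python
from collections import defaultdict

# Hypothetical barcode table: tag sequence -> sample ID.
BARCODES = {
    "ACGTAC": "DVT_P_01",
    "TGCATG": "DVT_C_01",
    "GATCGA": "DVT_P_02",
}


def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def assign_sample(tag, barcodes=BARCODES, max_mismatches=1):
    """Return the sample whose barcode uniquely matches the tag, else None."""
    hits = [sample for bc, sample in barcodes.items()
            if len(bc) == len(tag) and hamming(bc, tag) <= max_mismatches]
    return hits[0] if len(hits) == 1 else None


def demultiplex(reads, **kwargs):
    """Group (tag, sequence) pairs by sample; unassignable reads go under None."""
    bins = defaultdict(list)
    for tag, seq in reads:
        bins[assign_sample(tag, **kwargs)].append(seq)
    return dict(bins)
```

Requiring a unique match means that a tag equally close to two barcodes is discarded rather than guessed, which keeps sample cross-contamination low at the cost of a few reads.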
Genetic variant calls and quality control (QC)
Sorted BAM files were processed in a variant-calling pipeline consisting of a BAM filtering process and a variant calling process. In the first step, duplicate reads were eliminated from the BAM files, retaining only the read with the top mapping quality at each pair of start and end mapping coordinates. Also, reads with a mapping quality score of less than 50 were removed from the BAM files. In the variant-calling step, Samtools was used to generate PILEUP files with read information at sites where mismatches from the reference sequence were detected. Consensus in the presence of mismatches and read- and base-quality parameters were used as criteria to distinguish genetic variants from sequencing errors, filtering high-quality calls. In a final QC process, this set of calls was further cleared of variants with an allele balance (variant allele reads/total number of reference plus variant allele reads) below 20% and/or with significant strand bias (i.e. sites at which the variant sequence did not appear on both forward and reverse reads) in spite of high variant quality.
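The read-filtering and QC rules in this paragraph can be expressed as a short Python sketch. The thresholds (mapping quality 50, allele balance 20%) come from the text. The `Read` and `Call` records are hypothetical simplifications of BAM and PILEUP entries, and the strand-bias test is reduced to "variant seen on both strands", which is an assumption about how the criterion was applied.

```python
from collections import namedtuple

# Hypothetical, simplified records standing in for BAM and PILEUP entries.
Read = namedtuple("Read", "start end mapq")
Call = namedtuple("Call", "ref_reads var_fwd var_rev")

MIN_MAPQ = 50              # reads below this mapping quality are dropped
MIN_ALLELE_BALANCE = 0.20  # calls below this allele balance are dropped


def filter_reads(reads, min_mapq=MIN_MAPQ):
    """Keep the top-quality read per (start, end) pair, then apply the
    mapping-quality cutoff."""
    best = {}
    for r in reads:
        key = (r.start, r.end)
        if key not in best or r.mapq > best[key].mapq:
            best[key] = r
    return [r for r in best.values() if r.mapq >= min_mapq]


def allele_balance(call):
    """Variant-allele reads over reference plus variant-allele reads."""
    var = call.var_fwd + call.var_rev
    total = call.ref_reads + var
    return var / total if total else 0.0


def passes_qc(call, min_balance=MIN_ALLELE_BALANCE):
    """True if the call has enough allele balance and support on both strands."""
    both_strands = call.var_fwd > 0 and call.var_rev > 0
    return allele_balance(call) >= min_balance and both_strands
```

For instance, a call with 90 reference reads and 10 variant reads has an allele balance of 0.10 and would be removed, whatever its quality score.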
Genetic variants were annotated against the RefSeq database, dbSNP129 and the 1000 Genomes pilot release (March 2010) using Annovar software. Missense variants were also annotated with SIFT and Polyphen 2.
Several software packages have been developed to analyze the genotyping results of commercial SNP-genotyping arrays (e.g. those used for GWAS), in particular the successful PLINK package. On the other hand, few tools are available for genetic association analysis of next-generation sequencing datasets. For this reason we developed Nxtgen2plink.rb, a program that generates PLINK-compatible input files with phenotypic information on sequenced individuals and genotype calls at each site that was found to be variable in at least one of the sequenced individuals. The software uses read alignment and coverage information (BAM files), genetic-variant calls (PILEUP files) and quality control information on variants with low quality (also in the PILEUP file format) to generate genotype calls across all individuals. Details of the workflow of the Nxtgen2plink.rb software are presented in Additional file 1 Figure S2. Association analysis was then carried out by PLINK and custom Perl and Python scripts.
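To make the idea concrete, the following Python sketch shows one way per-sample variant calls, low-quality flags and coverage could be combined into PLINK PED and MAP rows. It is not Nxtgen2plink.rb, whose workflow is described only in Additional file 1 Figure S2. The decision rules, the 10X depth threshold for calling a site homozygous reference, and all data structures and site names are assumptions; only the PED/MAP column layout and the PLINK codes (2 = affected, 1 = unaffected, `0 0` = missing genotype) are standard.

```python
MIN_DEPTH = 10  # assumed depth needed to call a site homozygous reference


def genotype(site, sample, calls, low_quality, depth):
    """Return a PLINK genotype string for one sample at one site.

    calls:       {(sample, site_id): "het" or "hom"} high-quality variant calls
    low_quality: set of (sample, site_id) whose call failed QC
    depth:       {(sample, site_id): read depth at the site}
    """
    key = (sample, site["id"])
    ref, alt = site["ref"], site["alt"]
    if key in calls:
        return f"{ref} {alt}" if calls[key] == "het" else f"{alt} {alt}"
    if key in low_quality or depth.get(key, 0) < MIN_DEPTH:
        return "0 0"  # missing: failed QC or too little coverage
    return f"{ref} {ref}"  # well covered and no variant seen


def ped_line(sample, is_case, sites, calls, low_quality, depth):
    """One PED row: family, individual, father, mother, sex, phenotype, genotypes."""
    pheno = "2" if is_case else "1"
    fields = [sample, sample, "0", "0", "0", pheno]
    fields += [genotype(s, sample, calls, low_quality, depth) for s in sites]
    return " ".join(fields)


def map_line(site):
    """One MAP row: chromosome, variant ID, genetic distance, base-pair position."""
    return f"{site['chrom']} {site['id']} 0 {site['pos']}"
```

The key design point is the use of coverage to separate "homozygous reference" from "unknown": without it, every sample lacking a variant call at a site would be counted as reference, which would bias allele frequencies at poorly covered sites.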
In our dataset we chose to carry out (a) association analysis by Fisher's exact test of all identified variants that had a minor allele frequency (MAF) of at least 8% in the entire dataset of cases plus controls (i.e. the threshold MAF enabling a variant to reach statistical significance at p < 0.05 given the sample size) and a genotype missingness of less than 10%, and (b) a 'collapsing' analysis comparing the total number of nonsynonymous single nucleotide variants (SNVs) in coagulation factor and anticoagulant protein genes in DVT cases and controls.
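The reasoning behind the 8% MAF threshold can be checked with a small Python sketch of an allelic Fisher's exact test. It assumes 10 cases and 12 controls with no missing genotypes (20 and 24 chromosomes) and a two-sided test that sums all tables no more likely than the observed one; the authors' exact settings in PLINK are not stated. Under these assumptions, 8% of 44 chromosomes is about 3.5, so a variant needs at least 4 minor-allele copies, and 4 copies all in cases give p ≈ 0.036, whereas 3 copies cannot do better than p ≈ 0.086.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact p-value for the 2x2 table [[a, b], [c, d]].

    Rows are groups (cases, controls); columns are alleles (minor, major).
    Sums the hypergeometric probabilities of all tables with the same
    margins that are no more likely than the observed table.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Small relative tolerance so ties with the observed table are included.
    return sum(prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-9))


def smallest_p(minor_alleles, case_chroms=20, control_chroms=24):
    """Best achievable p-value over all splits of the minor-allele copies."""
    best = 1.0
    for in_cases in range(minor_alleles + 1):
        in_controls = minor_alleles - in_cases
        if in_cases > case_chroms or in_controls > control_chroms:
            continue
        p = fisher_exact_two_sided(in_cases, case_chroms - in_cases,
                                   in_controls, control_chroms - in_controls)
        best = min(best, p)
    return best
```

Calling `smallest_p(k)` for increasing `k` shows the smallest allele count at which any association could reach p < 0.05, which is the sense in which the MAF filter removes variants that cannot be significant at this sample size.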
We chose to carry the top 5 variants from the single-variant association analysis over to replication in up to 719 idiopathic DVT patients and 719 matched controls. Replication was carried out by PCR and Sanger sequencing, which were chosen as replication techniques given the high-throughput capacity of HGSC, their accuracy and their potential to reveal neighboring genetic variation in linkage disequilibrium with, or with similar effect to, the variant undergoing replication. Genotype calls were performed automatically by SNP-detector and 10% of the calls were verified manually on chromatograms. Replication comprised two stages: (1) initial replication in 284 individuals and (2) full replication in 1438 individuals. We selected for stage 2 the variants that, given their allele frequency and the effect size estimated in stage 1, had an 80% chance of being replicated at p < 0.005 in the entire cohort of 1438 individuals. Genetic association analysis was carried out by PLINK. The interaction of rs6050 with FVL and prothrombin G20210A was tested using PLINK with the epistasis option. Association of rs6050 with DVT after adjustment for covariates was assessed by multivariable logistic regression using STATA 10 software.
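As a consistency check on the replication result reported for rs6050 in the abstract (OR 1.45; 95% CI, 1.22-1.72; p = 1.9 × 10⁻⁵), the Python sketch below back-calculates a Wald z statistic and a two-sided p-value from the odds ratio and its confidence interval. It assumes a normal approximation on the log-odds scale, which may differ from the test the authors actually ran, so only agreement in order of magnitude should be expected.

```python
from math import erfc, log, sqrt


def z_from_or_ci(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Back-calculate the Wald z statistic from an odds ratio and its 95% CI."""
    # The CI spans 2 * z_crit standard errors on the log-odds scale.
    se = (log(ci_high) - log(ci_low)) / (2 * z_crit)
    return log(odds_ratio) / se


def two_sided_p(z):
    """Two-sided p-value under the standard normal distribution."""
    return erfc(abs(z) / sqrt(2))


z = z_from_or_ci(1.45, 1.22, 1.72)
p = two_sided_p(z)
print(f"z = {z:.2f}, p = {p:.1e}")
```

The resulting z of roughly 4.2 and p of roughly 2 × 10⁻⁵ are in line with the reported p-value, so the three reported figures are mutually consistent under this approximation.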
Patient selection and characteristics
Table: Characteristics of the individuals included in the replication stages of the study. Column: patients with idiopathic DVT. Rows: geographic origin, n (%) (Northern Italy other than Lombardy; Southern Italy and islands); male gender, n (%); age, mean years (standard deviation); pulmonary embolism, n (%); referred for more than one episode, n (%); body mass index, kg/m²; factor V Leiden (rs6025) genotype, n (%); prothrombin G20210A (rs1799963) genotype, n (%); natural anticoagulant deficiencies, n (%); laboratory measurements, mean (standard deviation): protein C, %; protein S, %; prothrombin time, international normalized ratio; activated partial thromboplastin time, ratio; coagulation factor VIII coagulant activity, %; fibrinogen coagulant activity, %.
To sequence the 700-kilobase target area at an average depth of coverage of 42X (after removal of duplicate reads) in 22 samples, 12,040,000,000 bp and ~240 million reads of raw sequence data were generated. Reads were mapped to the human reference genome, NCBI36/hg18, and non-duplicate reads were retained and used for genotype calls. An average of 7% of the reads mapped to the target region, corresponding to an enrichment of more than 300-fold (the target area constitutes 0.02% of the 3-gigabase human genome). On average, 98.4% of the target was covered at least once, 91% of the target area had at least the 10X coverage required for confident variant calls and 80% had coverage > 20X, indicating homogeneous high coverage of the target area. Coverage statistics and representative histograms can be found in Additional file 1 (Table S3 and S4; Figure S3).
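The enrichment figure follows from simple arithmetic, sketched here in Python using the numbers given in the text (7% of reads on target, a 644,472 bp final target area, a 3-gigabase genome). The depth estimate is a rough upper bound that additionally assumes every raw base was mapped, that the 7% applies to bases as well as reads, and that the data were split evenly over 22 samples; none of these is stated in the text.

```python
GENOME_BP = 3_000_000_000   # approximate genome size used in the text
TARGET_BP = 644_472         # final target area from the probe design
RAW_BP = 12_040_000_000     # raw sequence generated
ON_TARGET_FRACTION = 0.07   # average fraction of reads mapping to the target
N_SAMPLES = 22


def fold_enrichment(on_target_fraction, target_bp, genome_bp):
    """Observed on-target fraction divided by the fraction expected by chance."""
    return on_target_fraction / (target_bp / genome_bp)


def mean_on_target_depth(raw_bp, on_target_fraction, n_samples, target_bp):
    """Rough per-sample depth over the target before duplicate removal."""
    return raw_bp * on_target_fraction / n_samples / target_bp


enrichment = fold_enrichment(ON_TARGET_FRACTION, TARGET_BP, GENOME_BP)
depth = mean_on_target_depth(RAW_BP, ON_TARGET_FRACTION, N_SAMPLES, TARGET_BP)
print(f"enrichment ~{enrichment:.0f}-fold, raw on-target depth ~{depth:.0f}X")
```

Under these assumptions the enrichment comes out at roughly 326-fold, in agreement with "more than 300-fold", and the raw on-target depth at roughly 59X, which is plausibly higher than the 42X reported after duplicate removal.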
Type and frequency of identified genetic variants
Disease-associated variants in the target genes
In order to search for the presence of disease-associated genetic variants, all identified SNVs were annotated against the Human Gene Mutation Database (HGMD®), a database that includes published mutations responsible for human diseases. A total of 27 variants were present in HGMD® and are listed in Additional file 1 Table S8. Of these variants, 9 had been reported in association with thrombotic diseases (either venous or arterial thrombosis) and 7 in association with DVT or DVT-associated intermediate phenotypes (i.e. obesity and von Willebrand factor [VWF] levels). Among these variants were a missense variant of the CYP4V2 gene (rs13146272) reported to protect from DVT (allele count in DVT cases vs controls of this study: 3 vs 6), a 5'UTR variant of the FGA gene found to increase DVT risk (rs2070011) (allele count in DVT cases vs controls: 12 vs 7) and a variant reported to decrease circulating VWF levels (rs216321) (i.e. expected to have a protective effect since high VWF levels increase DVT risk; allele count in DVT cases vs controls: 0 vs 3).
Nonsynonymous variation in coagulation genes
Table: Nonsynonymous single nucleotide variants in anticoagulant genes. Columns include the allele frequency in the 1000 Genomes CEU population. Nucleotide changes listed: C > T, C > A, G > A, C > G, G > A.
Association analysis of common variants
Table: Common variant association results. Columns include, for replication stages 1 and 2 (combined), the effective sample size and MAF, % (n), in cases and in controls. Listed entries, in source order: T > C; exon - missense; 1.9 × 10⁻⁵; A > G; exon - missense; T > C; G > A; exon - synonymous; T > C.
This pilot study is one of the first applications of next-generation sequencing in DVT. Sequencing by oligonucleotide ligation and detection was used to resequence the protein-coding areas of 186 hemostatic/pro-inflammatory genes in cases and controls of DVT. This regional sequencing approach enabled the simultaneous analysis of several dozen genes in many samples at a fraction of the cost and computation required for whole-exome and whole-genome analysis. At the current level of multiplexing (i.e. up to 64 samples per SOLiD slide, 16 per spot), the analysis of our target area in one sample costs one tenth as much as high-coverage exome sequencing and one hundredth as much as high-coverage whole-genome sequencing, with similar proportions in the saving of computational time (3 hours for read mapping vs a few days for whole-exome or more than 1 week for whole-genome datasets) and storage requirements (300 MB per BAM file vs 40 GB per BAM file for whole-exome or 400 GB for whole-genome sequencing). For these reasons, regional sequencing is ideal for interrogating relatively small genomic areas deemed of particular functional relevance in a disease. Potential applications of this approach are the sequencing of positional candidate genes or genomic loci identified by genome-wide linkage or association analysis, the large-scale replication of the initial findings of exome and whole-genome resequencing studies, and the rapid screening of disease genes in those Mendelian diseases that have several different causal genes.
In the field of DVT, the importance of being able to analyze all known hemostatic genes (i.e. the 'hemostateome') has been recently highlighted by Fechtel et al. The simultaneous sequencing of all hemostatic genes in affected individuals is ideal to study specific combinations of variants in the hemostatic pathway acting in synergy to confer DVT-predisposition. This type of approach has the potential to reveal new disease-predisposing mechanisms, besides the identification of isolated variants that show statistical association with the disease. In this study, the use of molecular barcoding allowed multiplexing without loss of individual sequence information, which is required to fully exploit the potential of sequence data. Sequencing of the 186 genes in 22 individuals yielded more than 1700 genetic variants of different functional type and frequency. Annotation of identified variants revealed several disease-associated variants, a proportion of which had already been reported in association with DVT. Many novel variants with potentially deleterious effect on the function of key hemostatic proteins were also found. These results are consistent with the recent report by Dewey et al. of the genome sequence of a family quartet in which the father had a history of DVT and pulmonary embolism. In that study, the authors identified four different novel nonsynonymous variants in DVT-risk genes and other known thrombophilia-associated variants.
In our study, we adopted different analytical approaches to reveal potential associations with the disease at both single-variant and gene/pathway levels. Although the number of analyzed individuals was very small, it was possible to find statistically significant and biologically plausible associations with DVT. An increased burden of rare missense mutations in anticoagulant genes was found in DVT cases compared to controls. Single-variant association analysis followed by replication genotyping in > 1400 individuals identified an association for the rs6050 SNP in FGA. The excess of rare missense mutations in anticoagulant genes in patients in whom deficiencies of natural anticoagulants had been excluded with biochemical assays suggests that a fraction of idiopathic DVT cases might be affected by 'unrecognized' anticoagulant deficiencies, caused by mutations that impart functional effects to which currently used biochemical assays are not sensitive. The association of rs6050, already described by previous candidate SNP studies, was here identified by an agnostic screening of several dozen genes. The association of rs6050 was reported in 4 studies focusing on venous thromboembolism (VTE), therefore including cases of pulmonary embolism without a diagnosis of DVT [25, 32–34]. Two of these studies were very small, with a total sample size (cases and controls) of less than 500 individuals [25, 32]. Ours is the second largest of all the studies on rs6050 both in terms of DVT cases investigated and overall statistical power. Thus, along with previous reports, this study makes rs6050 one of the most widely replicated variants in DVT. The mechanisms for the association of rs6050 with DVT/VTE are not fully understood. FGA rs6050 was reported to result in enhanced coagulation factor XIII-mediated cross-linking of the fibrin alpha-chain and to be associated with increased FGA transcription. In this report we found no association of FGA rs6050 with plasma fibrinogen activity, but the fact that rs6050 is a missense SNP and that the amino acid substitution is considered 'possibly damaging' by Polyphen 2 suggests that the risk for DVT is conferred by an alteration of fibrinogen alpha-chain function.
The approach proposed in this study has limitations. The use of DNA target capture may introduce bias in the detection of genetic variants by allele-selective capture. Both the use of target capture and the short-read length of next-generation sequencing platforms constrain comprehensive analysis of copy number variation (CNV), an important source of genetic variability. However, these limitations can be minimized by increasing sequence coverage and by using complementary genetic analyses (e.g. array-based CNV-ascertainment). Restricting the analysis to a few hundred candidate genes may limit the chance to detect novel associations. On the other hand, the efficiency and cost effectiveness of regional sequencing are ideal for the thorough, large-scale analysis of genetic variation in specific biological pathways such as hemostasis. This approach may provide deep understanding of the mechanisms by which multiple variants in the same biological pathway shape individual predisposition to complex phenotypes.
Using a molecular-barcode based technique for sample multiplexing on next-generation DNA sequencing platforms we sequenced the coding areas of 186 hemostatic/proinflammatory genes in cases and controls of DVT. We were able to detect known disease-associated variants as well as novel potentially-deleterious variants in disease-associated genes. Our results illustrate the potential of next-generation gene sequencing for the discovery of genetic variation predisposing to common diseases and for the study of inherited thrombophilia.
The authors would like to thank Dr. Luigi F Ghilardini for the help in drafting figures and tables, all the former and current members of HGSC MedSeq and Library laboratories for sample analysis, Kyle Chang, Jennifer Drummond and Nipun Kakkar for help in genetic variant calling and annotation, Yu-Ye Wen for help in study design. FP is recipient of Bayer Hemophilia Special Project Award 2011. LAL is recipient of the Bayer Hemophilia Clinical Training Award 2009 and of the Associazione Italiana Centri Emofilia (AICE) - De Biasi Prize. Financial support from the Italian Ministry of Health (Grant n. RF-2009-1530493) is gratefully acknowledged. This work has been supported by Fondazione Cariplo (Grant n. 2011-0524).
- Tapson VF: Acute pulmonary embolism. N Engl J Med. 2008, 358: 1037-52. 10.1056/NEJMra072753.View ArticlePubMedGoogle Scholar
- Sørensen HT, Riis AH, Diaz LJ, Andersen EW, Baron JA, Andersen PK: Familial risk of venous thromboembolism: a nationwide cohort study. J Thromb Haemost. 2011, 9: 320-4. 10.1111/j.1538-7836.2010.04129.x.View ArticlePubMedGoogle Scholar
- Souto JC, Almasy L, Borrell M, Blanco-Vaca F, Mateo J, Soria JM, Coll I, Felices R, Stone W, Fontcuberta J, Blangero J: Genetic susceptibility to thrombosis and its relationship to physiological risk factors: the GAIT study. Genetic Analysis of Idiopathic Thrombophilia. Am J Hum Genet. 2000, 67: 1452-9.View ArticlePubMedGoogle Scholar
- Heit JA, Phelps MA, Ward SA, Slusser JP, Petterson TM, De Andrade M: Familial segregation of venous thromboembolism. J Thromb Haemost. 2004, 2: 731-6. 10.1111/j.1538-7933.2004.00660.x.View ArticlePubMedGoogle Scholar
- Larsen TB, Sørensen HT, Skytthe A, Johnsen SP, Vaupel JW, Christensen K: Major genetic susceptibility for venous thromboembolism in men: a study of Danish twins. Epidemiology. 2003, 14: 328-32.PubMedGoogle Scholar
- Dahlbäck B: Advances in understanding pathogenic mechanisms of thrombophilic disorders. Blood. 2008, 112: 19-27. 10.1182/blood-2008-01-077909.View ArticlePubMedGoogle Scholar
- Bezemer ID, Bare LA, Doggen CJ, Arellano AR, Tong C, Rowland CM, Catanese J, Young BA, Reitsma PH, Devlin JJ, Rosendaal FR: Gene variants associated with deep vein thrombosis. JAMA. 2008, 299: 1306-14. 10.1001/jama.299.11.1306.View ArticlePubMedGoogle Scholar
- Trégouët DA, Heath S, Saut N, Biron-Andreani C, Schved JF, Pernod G, Galan P, Drouet L, Zelenika D, Juhan-Vague I, Alessi MC, Tiret L, Lathrop M, Emmerich J, Morange PE: Common susceptibility alleles are unlikely to contribute as strongly as the FV and ABO loci to VTE risk: results from a GWAS approach. Blood. 2009, 113: 5298-303. 10.1182/blood-2008-11-190389.View ArticlePubMedGoogle Scholar
- Morange PE, Bezemer I, Saut N, Bare L, Burgos G, Brocheton J, Durand H, Biron-Andreani C, Schved JF, Pernod G, Galan P, Drouet L, Zelenika D, Germain M, Nicaud V, Heath S, Ninio E, Delluc A, Münzel T, Zeller T, Brand-Herrmann SM, Alessi MC, Tiret L, Lathrop M, Cambien F, Blankenberg S, Emmerich J, Trégouët DA, Rosendaal FR: A follow-up study of a genome-wide association scan identifies a susceptibility locus for venous thrombosis on chromosome 6p24.1. Am J Hum Genet. 2010, 86: 592-5. 10.1016/j.ajhg.2010.02.011.View ArticlePubMedPubMed CentralGoogle Scholar
- Morange PE, Tregouet DA: Deciphering the molecular basis of venous thromboembolism: where are we and where should we go?. Br J Haematol. 2010, 148: 495-506. 10.1111/j.1365-2141.2009.07975.x.View ArticlePubMedGoogle Scholar
- Ng SB, Turner EH, Robertson PD, Flygare SD, Bigham AW, Lee C, Shaffer T, Wong M, Bhattacharjee A, Eichler EE, Bamshad M, Nickerson DA, Shendure J: Targeted capture and massively parallel sequencing of 12 human exomes. Nature. 2009, 461: 272-6. 10.1038/nature08250.View ArticlePubMedPubMed CentralGoogle Scholar
- Ng SB, Buckingham KJ, Lee C, Bigham AW, Tabor HK, Dent KM, Huff CD, Shannon PT, Jabs EW, Nickerson DA, Shendure J, Bamshad MJ: Exome sequencing identifies the cause of a mendelian disorder. Nat Genet. 2010, 42: 30-5. 10.1038/ng.499.View ArticlePubMedGoogle Scholar
- Lupski JR, Reid JG, Gonzaga-Jauregui C, Rio Deiros D, Chen DC, Nazareth L, Bainbridge M, Dinh H, Jing C, Wheeler DA, McGuire AL, Zhang F, Stankiewicz P, Halperin JJ, Yang C, Gehman C, Guo D, Irikat RK, Tom W, Fantin NJ, Muzny DM, Gibbs RA: Whole-genome sequencing in a patient with Charcot-Marie-Tooth neuropathy. N Engl J Med. 2010, 362: 1181-91. 10.1056/NEJMoa0908094.View ArticlePubMedPubMed CentralGoogle Scholar
- Frezzato M, Tosetto A, Rodeghiero F: Validated questionnaire for the identification of previous personal or familial venous thromboembolism. Am J Epidemiol. 1996, 143: 1257-65.View ArticlePubMedGoogle Scholar
- Homer N, Merriman B, Nelson SF: BFAST: An alignment tool for large scale genome resequencing. PLoS ONE. 2009, 4: e7767-10.1371/journal.pone.0007767.View ArticlePubMedPubMed CentralGoogle Scholar
- Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup: The Sequence alignment/map (SAM) format and SAMtools. Bioinformatics. 2009, 25: 2078-9. 10.1093/bioinformatics/btp352.View ArticlePubMedPubMed CentralGoogle Scholar
- 1000 Genomes Project Consortium, Durbin RM, Abecasis GR, Altshuler DL, Auton A, Brooks LD, Durbin RM, Gibbs RA, Hurles ME, McVean GA: A map of human genome variation from population-scale sequencing. Nature. 2010, 467: 1061-73. 10.1038/nature09534.View ArticleGoogle Scholar
- Wang K, Li M, Hakonarson H: ANNOVAR: Functional annotation of genetic variants from next-generation sequencing data. Nucleic Acids Research. 2010, 38: e164-10.1093/nar/gkq603.View ArticlePubMedPubMed CentralGoogle Scholar
- Ng PC, Henikoff S: Accounting for human polymorphisms predicted to affect protein function. Genome Res. 2002, 12: 436-46. 10.1101/gr.212802.View ArticlePubMedPubMed CentralGoogle Scholar
- Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P, Kondrashov AS, Sunyaev SR: A method and server for predicting damaging missense mutations. Nat Methods. 2010, 7: 248-249. 10.1038/nmeth0410-248.View ArticlePubMedPubMed CentralGoogle Scholar
- Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ, Sham PC: PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007, 81: 559-75. 10.1086/519795.
- Zhang J, Wheeler DA, Yakub I, Wei S, Sood R, Rowe W, Liu PP, Gibbs RA, Buetow KH: SNPdetector: a software tool for sensitive and accurate SNP detection. PLoS Comput Biol. 2005, 1: e53. 10.1371/journal.pcbi.0010053.
- Stenson PD, Mort M, Ball EV, Howells K, Phillips AD, Thomas NS, Cooper DN: The Human Gene Mutation Database: 2008 update. Genome Med. 2009, 1: 13. 10.1186/gm13.
- Ko YL, Hsu LA, Hsu TS, Tsai CT, Teng MS, Wu S, Chang CJ, Lee YS: Functional polymorphisms of FGA, encoding alpha fibrinogen, are associated with susceptibility to venous thromboembolism in a Taiwanese population. Hum Genet. 2006, 119: 84-91. 10.1007/s00439-005-0102-0.
- Vaidya D, Yanek LR, Herrera-Galeano JE, Mathias RA, Moy TF, Faraday N, Becker LC, Becker DM: A common variant in the Von Willebrand factor gene is associated with multiple functional consequences. Am J Hematol. 2010, 85: 971-3. 10.1002/ajh.21859.
- Smith NL, Rice KM, Bovill EG, Cushman M, Bis JC, McKnight B, Lumley T, Glazer NL, van Hylckama Vlieg A, Tang W, Dehghan A, Strachan DP, O'Donnell CJ, Rotter JI, Heckbert SR, Psaty BM, Rosendaal FR: Genetic variation associated with plasma von Willebrand factor levels and the risk of incident venous thrombosis. Blood. 2011, 117: 6007-11. 10.1182/blood-2010-10-315473.
- Momozawa Y, Mni M, Nakamura K, Coppieters W, Almer S, Amininejad L, Cleynen I, Colombel JF, de Rijk P, Dewit O, Finkel Y, Gassull MA, Goossens D, Laukens D, Lémann M, Libioulle C, O'Morain C, Reenaers C, Rutgeerts P, Tysk C, Zelenika D, Lathrop M, Del-Favero J, Hugot JP, de Vos M, Franchimont D, Vermeire S, Louis E, Georges M: Resequencing of positional candidates identifies low frequency IL23R coding variants protecting against inflammatory bowel disease. Nat Genet. 2011, 43: 43-7. 10.1038/ng.733.
- Shearer AE, DeLuca AP, Hildebrand MS, Taylor KR, Gurrola J, Scherer S, Scheetz TE, Smith RJ: Comprehensive genetic testing for hereditary hearing loss using massively parallel sequencing. Proc Natl Acad Sci USA. 2010, 107: 21104-9. 10.1073/pnas.1012989107.
- Fechtel K, Osterbur ML, Kehrer-Sawatzki H, Stenson PD, Cooper DN: Delineating the Hemostaseome as an aid to individualize the analysis of the hereditary basis of thrombotic and bleeding disorders. Hum Genet. 2011, 130: 149-66. 10.1007/s00439-011-0984-y.
- Dewey FE, Chen R, Cordero SP, Ormond KE, Caleshu C, Karczewski KJ, Whirl-Carrillo M, Wheeler MT, Dudley JT, Byrnes JK, Cornejo OE, Knowles JW, Woon M, Sangkuhl K, Gong L, Thorn CF, Hebert JM, Capriotti E, David SP, Pavlovic A, West A, Thakuria JV, Ball MP, Zaranek AW, Rehm HL, Church GM, West JS, Bustamante CD, Snyder M, Altman RB, Klein TE, Butte AJ, Ashley EA: Phased whole-genome genetic risk in a family quartet using a major allele reference sequence. PLoS Genet. 2011, 7: e1002280. 10.1371/journal.pgen.1002280.
- Carter AM, Catto AJ, Kohler HP, Ariëns RA, Stickland MH, Grant PJ: alpha-fibrinogen Thr312Ala polymorphism and venous thromboembolism. Blood. 2000, 96 (3): 1177-9.
- Rasmussen-Torvik LJ, Cushman M, Tsai MY, Zhang Y, Heckbert SR, Rosamond WD, Folsom AR: The association of alpha-fibrinogen Thr312Ala polymorphism and venous thromboembolism in the LITE study. Thromb Res. 2007, 121: 1-7. 10.1016/j.thromres.2007.02.008.
- Arellano AR, Bezemer ID, Tong CH, Catanese JJ, Devlin JJ, Reitsma PH, Bare LA, Rosendaal FR: Gene variants associated with venous thrombosis: confirmation in the MEGA study. J Thromb Haemost. 2010, 8: 1132-4.
- Standeven KF, Grant PJ, Carter AM, Scheiner T, Weisel JW, Ariëns RA: Functional analysis of the fibrinogen alpha Thr312Ala polymorphism: effects on fibrin structure and function. Circulation. 2003, 107: 2326-30. 10.1161/01.CIR.0000066690.89407.CE.
- The pre-publication history for this paper can be accessed here: http://www.biomedcentral.com/1755-8794/5/7/prepub