Identification of breast cancer associated variants that modulate transcription factor binding

@article{Liu2017IdentificationOB,
  title={Identification of breast cancer associated variants that modulate transcription factor binding},
  author={Yunxian Liu and Ninad M Walavalkar and Mikhail G. Dozmorov and Stephen S. Rich and Mete Civelek and Michael J. Guertin},
  journal={PLoS Genetics},
  year={2017},
  volume={13}
}
Genome-wide association studies (GWAS) have discovered thousands loci associated with disease risk and quantitative traits, yet most of the variants responsible for risk remain uncharacterized. The majority of GWAS-identified loci are enriched for non-coding single-nucleotide polymorphisms (SNPs) and defining the molecular mechanism of risk is challenging. Many non-coding causal SNPs are hypothesized to alter transcription factor (TF) binding sites as the mechanism by which they affect… 

Figures from this paper

Elucidating the Underlying Functional Mechanisms of Breast Cancer Susceptibility Through Post-GWAS Analyses
TLDR
Identifying the true causal variants underlying breast cancer susceptibility and their function not only provides better estimates of the explained familial relative risk thereby improving polygenetic risk scores (PRSs), it also increases the understanding of the biological mechanisms responsible for causing susceptibility to breast cancer.
Detecting genome-wide directional effects of transcription factor binding on polygenic disease risk
TLDR
Signed linkage disequilibrium profile regression is a new method for detecting directional effects of genomic annotations on disease risk that implicate new causal disease genes and can suggest mechanisms underlying the effects of causal genes on disease.
Integrating genome-wide association study with regulatory SNP annotation information identified candidate genes and pathways for schizophrenia
TLDR
A large-scale integrative analysis of two genome-wide association studies of schizophrenia with functional annotation datasets of regulatory single-nucleotide polymorphism found multiple candidate genes, gene ontology terms and pathways for schizophrenia, supporting the functional importance of rSNPs, and providing novel clues for understanding the genetic architecture of schizophrenia.
An integrated platform to systematically identify causal variants and genes for polygenic human traits
TLDR
Applying a platform using ATAC-seq, DNaseI footprints, NG Capture-C and machine learning to red blood cell traits identifies a significant proportion of known causative variants and their effector genes which can be validated by direct in vivo modelling.
Risks and Function of Breast Cancer Susceptibility Alleles
TLDR
The application of these multidisciplinary approaches to understand genetic risk for breast cancer can be used to identify individuals in the population that may benefit from clinical interventions including screening for early detection and prevention, and treatment strategies to reduce breast cancer-related mortalities.
Annotations capturing cell-type-specific TF binding explain a large fraction of disease heritability
TLDR
The results show that intersecting sequence-based TF predictions with cell-type-specific chromatin information can help refine genome-wide association signals.
Annotations capturing cell-type-specific TF binding explain a large fraction of disease heritability.
TLDR
TF binding annotations constructed by intersecting sequence-based TF binding predictions with cell-type-specific chromatin data explain a large fraction of heritability across a broad set of diseases and corresponding cell-types and can help refine genome-wide association signals.
funMotifs: Tissue-specific transcription factor motifs
TLDR
A framework to identify tissue-specific functional motifs (funMotifs) across the genome based on thousands of annotation tracks obtained from large-scale genomics projects including ENCODE, RoadMap Epigenomics and FANTOM, which enabled us to measure tissue specificity of different TFs and to identify candidate functional variants in TF motifs.
SMADs binding site polymorphisms rs9911630 is associated with susceptibility but not prognosis of gastric cancer: a case control study
TLDR
The results suggested that rs9911630 polymorphism in SMADs target site might influence susceptibility but not prognosis of gastric cancer.
Non-coding mutations reveal cancer driver cistromes in luminal breast cancer
TLDR
This work reveals an enrichment of non-coding mutations in cis-regulatory elements that cover a subset of transcription factors linked to tumor progression in luminal breast cancers and presents a strategy to find cancer driver cistromes relying on quantifying the enrichment ofNon-Coding mutations over cis- Regulatory elements concatenated into a functional unit drawn from an accessible chromatin catalogue derived from primary cancer tissues.
...
...

References

SHOWING 1-10 OF 94 REFERENCES
Breast cancer risk-associated SNPs modulate the affinity of chromatin for FOXA1 and alter gene expression
TLDR
The results show that breast cancer risk–associated SNPs are enriched in the cistromes of FOXA1 and ESR1 and the epigenome of histone H3 lysine 4 monomethylation in a cancer- and cell type–specific manner.
Comprehensive Functional Annotation of Seventy-One Breast Cancer Risk Loci
TLDR
It is proposed that BCa risk resides in a subgroup of SNPs that functionally affects breast biology, and a shortlist of these SNPs is proposed to aid in framing hypotheses to prioritize a manageable number of likely disease-causing SNPs.
Systematic Localization of Common Disease-Associated Variation in Regulatory DNA
TLDR
P pervasive involvement of regulatory DNA variation in common human disease and provide pathogenic insights into diverse disorders are suggested.
Genome-wide association analysis of more than 120,000 individuals identifies 15 new susceptibility loci for breast cancer
TLDR
15 new loci associated with breast cancer at P < 5 × 10−8 are identified, and one association appears to be driven by an amino acid substitution encoded in EXO1, which is found in women of European ancestry.
Principles for the post-GWAS functional characterization of cancer risk loci
Genome wide association studies (GWAS) have identified more than 200 mostly new common low-penetrance susceptibility loci for cancers. The predicted risk associated with each locus is generally
Prediction of Causal Candidate Genes in Coronary Artery Disease Loci
TLDR
It is concluded that the great majority of causal variations affecting CAD risk occur in noncoding regions, with 41% affecting gene expression robustly versus 6% leading to amino acid changes.
Noncoding somatic and inherited single-nucleotide variants converge to promote ESR1 expression in breast cancer
TLDR
This work reports significant enrichment of somatic mutations within the set of regulatory elements (SRE) regulating ESR1 in 7% of E SR1-positive breast cancers, highlighting the importance of considering the combinatorial activity of Regulatory elements as a single unit to delineate the impact of noncoding genetic alterations on single genes in cancer.
Functional interpretation of non-coding sequence variation: Concepts and challenges
TLDR
This article summarizes concepts for the systematic interpretation of non‐coding genetic signals using genome annotation data sets in different cellular systems and outlines strategies for the global analysis of multiple association intervals and the in‐depth molecular investigation of individual intervals.
Large-scale genotyping identifies 41 new loci associated with breast cancer risk
TLDR
A meta-analysis of 9 genome-wide association studies, including 10,052 breast cancer cases and 12,575 controls of European ancestry, and identified 29,807 SNPs for further genotyping suggests that more than 1,000 additional loci are involved in breast cancer susceptibility.
...
...