COSMIC: exploring the world's knowledge of somatic mutations in human cancer

  title={COSMIC: exploring the world's knowledge of somatic mutations in human cancer},
  author={Simon A. Forbes and David Beare and Prasad Gunasekaran and Kenric Leung and Nidhi Bindal and Harry Boutselakis and Minjie Ding and Sally Bamford and Charlotte Cole and Sari Ward and Chai Yin Kok and Mingming Jia and Tisham De and Jon W. Teague and Michael R. Stratton and Ultan McDermott and Peter J. Campbell},
  journal={Nucleic Acids Research},
  pages={D805 - D811}
COSMIC, the Catalogue Of Somatic Mutations In Cancer ( is the world's largest and most comprehensive resource for exploring the impact of somatic mutations in human cancer. Our latest release (v70; Aug 2014) describes 2 002 811 coding point mutations in over one million tumor samples and across most human genes. To emphasize depth of knowledge on known cancer genes, mutation information is curated manually from the scientific literature, allowing very precise… 

Figures and Tables from this paper

COSMIC: somatic cancer genetics at high-resolution

COSMIC v78 contains wide resistance mutation profiles across 20 drugs, detailing the recurrence of 301 unique resistance alleles across 1934 drug-resistant tumours.

Pan-cancer integrative analysis of whole-genome De novo somatic point mutations reveals 17 cancer types

A new machine learning pipeline to identify protein-coding genes mutated in many samples to identify cancer subtypes is proposed, which offers a novel opportunity for cancer drug development based on the mutational profile of patients.

Advances in computational approaches for prioritizing driver mutations and significantly mutated genes in cancer genomes

A review of methods and computational tools developed during the past several years for detecting potential driver or actionable mutations in rapidly emerging precision cancer medicine may help investigators find an appropriate tool.

Finding cancer driver mutations in the era of big data research

This review considers both coding and non-coding driver mutations, and discusses how such mutations might be identified from cancer sequencing datasets, and some of the tools and database that are available for the annotation of somatic variants and the identification of cancer driver genes.

Whole genome sequencing analysis for cancer genomics and precision medicine

Explosive advances in next‐generation sequencer (NGS) and computational analyses have enabled exploration of somatic protein‐altered mutations in most cancer types, with coding mutation data

Utilizing somatic mutation data from numerous studies for cancer research: proof of concept and applications

It is shown that somatic gene mutations alone can reliably and specifically predict cancer subtypes, and curation efforts, such as COSMIC, have great added and complementary value even in the era of large international cancer projects.

mSignatureDB: a database for deciphering mutational signatures in human cancers

Abstract Cancer is a genetic disease caused by somatic mutations; however, the understanding of the causative biological processes generating these mutations is limited. A cancer genome bears the

Computational methods and resources for the interpretation of genomic variants in cancer

This article describes the available repositories of cancer genome data and provides an overview of the methods for the detection of genetic variation and computational tools for the prioritization of cancer related genes and causative somatic variations.

Beyond the exome: the role of non-coding somatic mutations in cancer.

  • S. PirainoS. Furney
  • Biology
    Annals of oncology : official journal of the European Society for Medical Oncology
  • 2016
An overview of insights gleaned from the analysis of mutations from sequenced cancer genomes is presented and the mechanisms by which non-coding mutations can play a role in cancer are reviewed.



Discrepancies in cancer genomic sequencing highlight opportunities for driver mutation discovery.

It is highlighted that cross-referencing between genomic databases is required to comprehensively assess genomic alterations in commonly used cell lines and that there are still significant opportunities to identify novel drivers of tumorigenesis in poorly sequenced areas of the exome.

Data mining using the Catalogue of Somatic Mutations in Cancer BioMart

The data available in COSMIC is described along with examples of how to successfully mine and integrate data sets using COS MICMart, and a flexible way to mine these data and combine somatic mutations with other biological relevant data sets is provided.

The COSMIC (Catalogue of Somatic Mutations in Cancer) database and website

The COSMIC (Catalogue of Somatic Mutations in Cancer) database and website have been developed to store somatic mutation data in a single location and display the data and other information related to human cancer.

Predicting the functional consequences of cancer-associated amino acid substitutions

An adaptation of the original algorithm that incorporates a cancer-specific model to potentiate the functional analysis of driver mutations is described, which observed improved performances when distinguishing between driver mutations and other germ line variants (both disease-causing and putatively neutral mutations).

Mapping the cancer genome. Pinpointing the genes involved in cancer will help chart a new course across the complex landscape of human malignancies.

Less than three years after the Human Genome Project’s completion, the National Institutes of Health has officially launched the pilot stage of an effort to create a comprehensive catalogue of the genomic changes involved in cancer: The Cancer Genome Atlas (TCGA).

Functional impact bias reveals cancer drivers

This work hypothesized that any bias toward the accumulation of variants with high functional impact observed in a gene or group of genes may be an indication of positive selection and can thus be used to detect candidate driver genes or gene modules and developed a method to measure this bias.

Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells

The Genomics of Drug Sensitivity in Cancer (GDSC) provides a unique resource incorporating large drug sensitivity and genomic datasets to facilitate the discovery of new therapeutic biomarkers for cancer therapies.

Mapping the cancer genome

Less than three years after the Human Genome Project’s completion, the National Institutes of Health has officially launched the pilot stage of an effort to create a comprehensive catalogue of the genomic changes involved in cancer: The Cancer Genome Atlas (TCGA).

A census of human cancer genes

A 'census' of cancer genes is conducted that indicates that mutations in more than 1% of genes contribute to human cancer.

Deriving the consequences of genomic variants with the Ensembl API and SNP Effect Predictor

Summary: A tool to predict the effect that newly discovered genomic variants have on known transcripts is indispensible in prioritizing and categorizing such variants. In Ensembl, a web-based tool