Identifying known unknowns using the US EPA’s CompTox Chemistry Dashboard

@article{McEachran2016IdentifyingKU,
  title={Identifying known unknowns using the US EPA’s CompTox Chemistry Dashboard},
  author={Andrew D. McEachran and J. Sobus and Antony J. Williams},
  journal={Analytical and Bioanalytical Chemistry},
  year={2016},
  volume={409},
  pages={1729--1735}
}
Abstract

Chemical features observed using high-resolution mass spectrometry can be tentatively identified using online chemical reference databases by searching molecular formulae and monoisotopic masses and then rank-ordering of the hits using appropriate relevance criteria. The most likely candidate “known unknowns,” which are those chemicals unknown to an investigator but contained within a reference database or literature source, rise to the top of a chemical list when rank-ordered by the…
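The search-then-rank idea in the abstract can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the `Candidate` record, the `relevance` field (a stand-in for whatever relevance criterion a database exposes, since the abstract is truncated before naming it), the 5 ppm tolerance, and the toy database with invented isomer entries and invented scores. It is not the Dashboard's API or the authors' implementation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    name: str
    formula: str
    monoisotopic_mass: float  # neutral monoisotopic mass in Da
    relevance: int            # hypothetical relevance score, higher = more likely


def ppm_error(observed: float, theoretical: float) -> float:
    """Signed mass error in parts per million."""
    return (observed - theoretical) / theoretical * 1e6


def rank_candidates(observed_mass, database, tolerance_ppm=5.0):
    """Keep records within the mass tolerance, then sort by relevance (descending)."""
    hits = [
        c for c in database
        if abs(ppm_error(observed_mass, c.monoisotopic_mass)) <= tolerance_ppm
    ]
    return sorted(hits, key=lambda c: c.relevance, reverse=True)


# Toy database: the isomer entries and all relevance scores are invented.
TOY_DATABASE = [
    Candidate("caffeine", "C8H10N4O2", 194.08038, 120),
    Candidate("isomer A (made up)", "C8H10N4O2", 194.08038, 7),
    Candidate("isomer B (made up)", "C8H10N4O2", 194.08038, 31),
    Candidate("glucose", "C6H12O6", 180.06339, 95),
]

ranked = rank_candidates(194.0805, TOY_DATABASE)
for c in ranked:
    print(c.name, c.formula, c.relevance)
```

A mass search alone cannot separate the three isomers, because they share a formula and therefore a monoisotopic mass; the ordering among them comes entirely from the relevance score, which is why the choice of ranking criterion matters.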

Linking in silico MS/MS spectra with chemistry data to improve identification of unknowns
TLDR
The generation en masse of predicted MS/MS spectra for the entirety of the US EPA’s DSSTox database is described using competitive fragmentation modelling and a freely available open source tool, CFM-ID.
Open Science for Identifying "Known Unknown" Chemicals.
TLDR
Challenges facing comprehensive suspect screening include increasing chemicals of interest, as well as ever-decreasing detection limits, leading to increased false positives, and open science is poised to play a pivotal role in the evolution of suspect screening.
Using the US EPA CompTox Chemicals Dashboard to interpret targeted and non-targeted GC–MS analyses from human breath and other biological media
TLDR
Specific procedures using the Dashboard as a first-stop tool for exploring both targeted and non-targeted results from GC–MS analyses of chemicals found in breath, exhaled breath condensate, and associated aerosols are described.
The CompTox Chemistry Dashboard: a community data resource for environmental chemistry
TLDR
The U.S. Environmental Protection Agency’s web-based CompTox Chemistry Dashboard is addressing needs by integrating diverse types of relevant domain data through a cheminformatics layer, built upon a database of curated substances linked to chemical structures.
EPA’s DSSTox database: History of development of a curated chemistry resource supporting computational toxicology research
TLDR
The legacy, manually curated DSSTox_V1 content was migrated to a MySQL data model, with modern cheminformatics tools supporting both manual and automated curation processes to increase efficiencies.
In silico MS/MS spectra for identifying unknowns: a critical examination using CFM-ID algorithms and ENTACT mixture samples
TLDR
The abilities of in silico spectra are shown to correctly identify true positives in complex samples (at rates comparable to those observed with reference spectra), and efficiently filter large numbers of potential false positives from further consideration.
“MS-Ready” structures for non-targeted high-resolution mass spectrometry screening studies
TLDR
The workflow for the generation and linking of ~700,000 MS-Ready structures as well as download, search and export capabilities to serve structure identification using HRMS are described.
Revisiting Five Years of CASMI Contests with EPA Identification Tools
TLDR
The results suggest that Dashboard data and tools would enhance chemical identification capabilities for practitioners of HRMS-based NTA, and an in-depth review of the CASMI structure sets made these reviewed sets available via the Dashboard.
A comparison of three liquid chromatography (LC) retention time prediction models.
TLDR
The OPERA-RT model, generated as a proof-of-concept using a limited set of open source data, performed as well as the commercial tool ACD/ChromGenius when constrained to the same small training and test sets and is expected to increase in predictive ability as the availability of RT data increases.
Integrating tools for non-targeted analysis research and chemical safety evaluations at the US EPA
TLDR
Current applications of NTA methods, challenges to their effective use in chemical screening studies, and ways in which shared resources can advance their use in risk-based chemical prioritization are discussed.

References

SHOWING 1-10 OF 33 REFERENCES
Identification of “Known Unknowns” Utilizing Accurate Mass Data and ChemSpider
TLDR
These approaches were shown to be successful in identifying “known unknowns” noted in the laboratory and for compounds of interest to others.
Identification of “Known Unknowns” Utilizing Accurate Mass Data and Chemical Abstracts Service Databases
TLDR
These approaches were shown to be successful in identifying “known unknowns” noted in LC-MS and even GC-MS analyses in the laboratory and were demonstrated in the identification of a variety of compounds of interest to others.
Is nontarget screening of emerging contaminants by LC-HRMS successful? A plea for compound libraries and computer tools
TLDR
The advantages and future needs of publicly available MS and MS/MS reference databases and libraries, which have mostly been created for the metabolomic field, are discussed; the availability of comprehensive MS libraries with a focus on environmental contaminants would tremendously improve the situation.
In silico fragmentation for computer assisted identification of metabolite mass spectra
TLDR
A method is presented that is able to identify small molecules from tandem MS measurements, even without spectral reference data or a large set of fragmentation rules.
Identifying small molecules via high resolution mass spectrometry: communicating confidence.
TLDR
A level system is proposed, which arose from intense discussions within the department, to ease the communication of identification confidence and form the basis of further discussions on this topic, and specifically covers the new possibilities in HR-MS-based analysis.
Facilitating high resolution mass spectrometry data processing for screening of environmental water samples: An evaluation of two deconvolution tools.
TLDR
This work has shown that the use of deconvolution tools facilitates non-target screening and enables the identification of a higher number of compounds.
MassBank: a public repository for sharing mass spectral data for life sciences.
TLDR
MassBank is the first public repository of mass spectra of small chemical compounds for life sciences and provides a merged spectrum for each compound prepared by merging the analyzed ESI-MS(2) data on an identical compound under different collision-induced dissociation conditions.
PubChem Substance and Compound databases
TLDR
An overview of the PubChem Substance and Compound databases is provided, including data sources and contents, data organization, data submission using PubChem Upload, chemical structure standardization, web-based interfaces for textual and non-textual searches, and programmatic access.
Linking high resolution mass spectrometry data with exposure and toxicity forecasts to advance high-throughput environmental monitoring.
TLDR
A novel suspect screening methodology to prioritize chemicals of interest for subsequent targeted analysis is demonstrated; it relies on strategic integration of available public resources and should be considered in future non-targeted and suspect screening assessments of environmental and biological media.
Non-target screening with high-resolution mass spectrometry: critical review using a collaborative trial on water analysis
TLDR
A dataset from a collaborative non-target screening trial organised by the NORMAN Association is used to review the state-of-the-art and discuss future perspectives of non-target screening using high-resolution mass spectrometry in water analysis.