Exploring Chemical Space using Natural Language Processing Methodologies for Drug Discovery

@article{ztrk2020ExploringCS,
  title={Exploring Chemical Space using Natural Language Processing Methodologies for Drug Discovery},
  author={Hakime {\"O}zt{\"u}rk and Arzucan {\"O}zg{\"u}r and P. Schwaller and T. Laino and Elif Ozkirimli},
  journal={Drug discovery today},
  year={2020}
}
Text-based representations of chemicals and proteins can be thought of as unstructured languages codified by humans to describe domain-specific knowledge. Advances in natural language processing (NLP) methodologies in the processing of spoken languages accelerated the application of NLP to elucidate hidden knowledge in textual representations of these biochemical entities and then use it to construct models to predict molecular properties or to design novel molecules. This review outlines the… Expand

Figures, Tables, and Topics from this paper

Advanced machine-learning techniques in drug discovery.
A Corpus of Very Short Scientific Summaries
...
1
2
...

References

SHOWING 1-10 OF 189 REFERENCES
ChemBoost: A Chemical Language Based Approach for Protein – Ligand Binding Affinity Prediction
Machine learning in chemoinformatics and drug discovery.
Generating Focused Molecule Libraries for Drug Discovery with Recurrent Neural Networks
Linking the Neural Machine Translation and the Prediction of Organic Chemistry Reactions
A High Efficient Biological Language Model for Predicting Protein–Protein Interactions
Protein Word Detection using Text Segmentation Techniques
Foundations of statistical natural language processing
...
1
2
3
4
5
...