Published January 1, 2020 | Version v1
Journal article Open

Exploring chemical space using natural language processing methodologies for drug discovery

  • 1. Bogazici Univ, Dept Comp Engn, Istanbul, Turkey
  • 2. IBM Res Zurich, Saumerstr 4, CH-8803 Ruschlikon, Switzerland

Description

Text-based representations of chemicals and proteins can be thought of as unstructured languages codified by humans to describe domain-specific knowledge. Advances in natural language processing (NLP) methodologies in the processing of spoken languages accelerated the application of NLP to elucidate hidden knowledge in textual representations of these biochemical entities and then use it to construct models to predict molecular properties or to design novel molecules. This review outlines the impact made by these advances on drug discovery and aims to further the dialogue between medicinal chemists and computer scientists.

Files

bib-cd228ddf-8346-4321-9c26-da3ff40ebe73.txt

Files (202 Bytes)

Name Size Download all
md5:6015fe70fdaf1440ae9bc5f00da9ab93
202 Bytes Preview Download