The Signature Molecular Descriptor. 2. Enumerating Molecules from Their Extended Valence Sequences


We present a new algorithm that enumerates molecular structures matching a predefined extended valence sequence or signature. The algorithm can construct molecular structures composed of about 50 non-hydrogen atoms in CPU seconds time scale. The algorithm is run to produce all molecular structures matching the binding affinities (IC(50)) of some HIV-1 protease inhibitors. The algorithm is also used to compute the degeneracy, or the number of molecular structures, corresponding to a given signature. Signature degeneracy is systematically studied for varying signature heights on four molecular series, alkanes, alcohols, fullerene-type structures, and peptides. Signature degeneracy is compared with similar results obtained with popular topological indices (TIs). As a general rule, we find that signature degeneracy decreases as the signature height increases. We also find that alkanes, alcohols, and fullerene-type structures comprising n non-hydrogen atoms are uniquely characterized by signatures of height n/4, while peptides up to 4000 amino acids can be singled out with signatures of heights as small as 2 and 3.

DOI: 10.1021/ci020346o

Extracted Key Phrases

21 Figures and Tables

Citations per Year

Citation Velocity: 6

Averaging 6 citations per year over the last 3 years.

Learn more about how we calculate this metric in our FAQ.

Cite this paper

@article{Faulon2003TheSM, title={The Signature Molecular Descriptor. 2. Enumerating Molecules from Their Extended Valence Sequences}, author={Jean-Loup Faulon and Carla J. Churchwell and Donald P. Visco}, journal={Journal of chemical information and computer sciences}, year={2003}, volume={43 3}, pages={721-34} }