Rule Extraction from a Mutagenicity Data Set Using Adaptively Grown Phylogenetic-like Trees

Publication: J Chem Inf Model
Software: MedChem Studio™

Abstract

A public bacterial mutagenicity database was classified into 2-D structural families using a set of specific algorithms and clustering techniques that find overlapping classes of compounds based upon chemical substructures. Structure−activity relationships were learned from the biological activity of the compounds within each class and used to identify rules that define substructures potentially responsible for mutagenic activity. In addition, this method of analysis was used to compare the pharmacologically relevant substructure of test compounds with their potential toxic substructures making this a potentially valuable in silico profiling tool for lead selection and optimization.