Emerging pattern mining to aid toxicological knowledge discovery

Article, English, open access
Gillet, V.J. ; Webb, S.J. ; Sherhod, R. ; Judson, P.N. ; Thierry, H. ; Vessey, J.D. (2014)
  • Publisher: American Chemical Society

Knowledge-based systems for toxicity prediction are typically based on rules, known as structural alerts, that describe relationships between structural features and different toxic effects. Identifying the structural features associated with toxicological activity can be time-consuming and often requires significant input from domain experts. Here, we describe an emerging pattern mining method for the automated identification of activating structural features in toxicity data sets, designed to help expedite alert development. We apply the contrast pattern tree mining algorithm to generate a set of emerging patterns of structural fragment descriptors. Using the emerging patterns, it is possible to form hierarchical clusters of compounds that are defined by the presence of common structural features and represent distinct chemical classes. The method has been tested on a large public in vitro mutagenicity data set and a public hERG channel inhibition data set, and is shown to be effective at identifying common toxic features and recognizable classes of toxicants. We also describe how knowledge developers can use emerging patterns to improve the specificity and sensitivity of an existing expert system.
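To make the idea of an emerging pattern concrete, the following is a minimal sketch, not the contrast pattern tree algorithm used in the article. It assumes each compound is reduced to a set of fragment descriptors and enumerates small fragment combinations by brute force, keeping those whose support among actives is much higher than among inactives. The fragment names, the toy data, the thresholds (`min_support`, `min_growth`, `max_size`), and all function names are hypothetical and chosen only for illustration.

```python
from itertools import combinations


def support(pattern, dataset):
    """Fraction of compounds whose fragment set contains every fragment in pattern."""
    if not dataset:
        return 0.0
    return sum(1 for frags in dataset if pattern <= frags) / len(dataset)


def growth_rate(pattern, actives, inactives):
    """Ratio of support in the active class to support in the inactive class."""
    s_act = support(pattern, actives)
    s_inact = support(pattern, inactives)
    if s_inact == 0.0:
        # Pattern never occurs in inactives: a "jumping" emerging pattern.
        return float("inf") if s_act > 0.0 else 0.0
    return s_act / s_inact


def emerging_patterns(actives, inactives, min_support=0.3, min_growth=2.0, max_size=2):
    """Brute-force search for fragment combinations enriched in the active class."""
    fragments = sorted(set().union(*actives))
    found = {}
    for size in range(1, max_size + 1):
        for combo in combinations(fragments, size):
            pattern = frozenset(combo)
            if support(pattern, actives) < min_support:
                continue  # too rare among actives to be useful
            rate = growth_rate(pattern, actives, inactives)
            if rate >= min_growth:
                found[pattern] = rate
    return found


# Hypothetical toy data: each compound is a set of fragment descriptors.
actives = [
    {"nitro", "aromatic_ring"},
    {"nitro", "aromatic_ring", "halide"},
    {"epoxide"},
    {"nitro", "aromatic_ring"},
]
inactives = [
    {"aromatic_ring"},
    {"aromatic_ring", "halide"},
    {"hydroxyl"},
    {"nitro"},
]

patterns = emerging_patterns(actives, inactives)
for pattern, rate in patterns.items():
    print(sorted(pattern), rate)
```

In this toy set, the combination of a nitro group with an aromatic ring never appears among the inactives, so it is reported with an infinite growth rate, while the aromatic ring alone is too common in both classes to qualify. Exhaustive enumeration scales poorly with the number of fragments, which is one reason a tree-based miner is preferable on real data sets.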
