Impurity Profiling

Overview

Impurity profiling module is a result of the collaboration between ACD/Labs and FDA Center for Food Safety and Nutrition (CFSAN). Evaluation of genotoxic and/or carcinogenic potential is based on a battery of probabilistic models for bioassays reflecting different mechanisms of hazardous activity. A knowledge-based expert system identifies potentially hazardous structural fragments that could be responsible for carcinogenic activity of the test molecule.

The toxicity predictions in the Impurities Package offer greater insight into the safety of impurities, providing detailed information on toxic endpoints, reflecting various mechanisms of hazardous activity including:

Mutagenicity (Ames test, Mouse Lymphoma Assay, and other standard assays)
Clastogenicity (Micronucleus test, Chromosomal Aberrations)
DNA damage (Unscheduled DNA Synthesis)
Carcinogenicity (FDA rodent carcinogenicity data)
Endocrine disruption mechanisms (estrogen receptor binding)

The impurities package offers probabilistic predictive models for 21 different endpoints that cover various mechanisms of hazardous activity presented above. These predictors are supplemented with a knowledge-based expert system that identifies potentially hazardous structural fragments that could be responsible for genotoxic and/or carcinogenic activity of the compound of interest.

Features

Predict the genotoxic and carcinogenic effects of an impurity from simple structure input (name, 2D structure, SMILES string), with a reliability index generated by the probabilistic models
Identify potentially hazardous structural fragments responsible for carcinogenic and genotoxic activity
Gain insight into the possible mechanisms of toxic effects
See a display of up to 5 similar structures with experimental results in relevant bioassays

Interface

Hover over the name of an alert to highlight the alerting group on the structure of the molecule
The list of all alerting groups found in the molecule. Each alert is supplied with statistical data regarding distrubution of positive and negative compounds possessing this hazardous fragment in all considered databases, along with the respective z-scores. Z-scores show whether the presence of the fragment leads to a statistically significant increase in proportion of compounds with a positive test result for a particular assay. This information provides further evidence regarding the possible mechanisms of action.
Each hazardous fragment is provided with a short description of its mechanism of action and literature references.
The output of probabilistic models is presented in the form of a tree view, where the nodes corresponding to individual endpoints are grouped into higher level nodes according to species/test system and mechanism of action. The output for each endpoint consists of the following parts:
- p-value – probability that a compound will result in a positive test in the respective assay
- Coverage – an indication whether the compound belongs to Model Applicability Domain according to calculated RI value
- Call – (+ or –) if the compound can be reliably classified on the basis of p and RI values, “Undefined” otherwise.
Clicking on a tree node brings up 5 most similar structures in the respective training set with names, CAS numbers and experimental results (positive or negative, as well as quantitative TD50 values and tumour target sites in case of carcinogenicity)

Technical information

ACD/Labs Package for Toxicity Screening of Impurities provides a battery of in silico tests to accurately assess the genotoxic and carcinogenic potential of impurities and degradants, found to be below the threshold of toxicological concern in drug products, helping companies remain compliant with regulatory submission requirements.
Profile impurities using predictions for genotoxic and carcinogenic endpoints, quickly determine if an impurity is likely to pose a safety risk, and identify potentially hazardous structural fragments responsible for toxic activity.

The expert system contains a list of 67 alerting groups of toxicophores, 53 of which account for point mutational and/or clastogenic mechanisms of DNA damage, while the remaining 14 substructures detect carcinogens acting by non-genotoxic mechanisms. The expert system was able to recognize >94% of mutagens in ACD/Ames test database, and >90% of compounds marked as potent carcinogens in the FDA's OFAS Food-Additive Knowledgebase.

Experimental Data

A complete list of modeled endpoints is provided in Table 1, while the data sources are briefly described below.

Genetic toxicity: data sets for standard assays reflecting different mechanisms of genetic damage were obtained from the FDA. Gene mutation tests and techniques detecting clastogenic/aneugenic effects are included. Data was collected from EPA GENE-TOX database and scientific literature [1].

Carcinogenicity: results of chronic (two-year term) carcinogenicity studies in rats and mice were received from FDA. This data was based on NTP technical reports, IARC monographs, Carcinogenic Potency DataBase [2] and other publicly available sources. Raw data was converted to binary classification using a weight of evidence (WOE) approach [1]. Classification using the WOE threshold corresponding to “potent carcinogens” was used to build the models in the current study.

Reproductive toxicity: experimental data characterizing the potential for endocrine system disruption due to Estrogen receptor α binding were acquired from ChEMBL database [3]. Compounds were classified as binders/non-binders on the basis of their relative binding affinities (RBA) compared to reference ligand estradiol. Two cut-offs were used: LogRBA > -3 (“general binding”), and LogRBA > 0 (“strong binding”)

Methods

Probabilistic predictive models for all considered endpoints were developed using GALAS modeling methodology [4]. Each GALAS model consists of two parts:

Global (baseline) model that reflects general trends in the property of interest. Baseline models were built using binomial PLS method based on fragmental descriptors.
Local corrections were applied to baseline predictions using a special similarity-based routine, after performing an analysis for the most similar compounds used in the training set. The local part of the model provides the basis for the calculation of the Reliability index (RI), a value ranging from 0 to 1 that provides a quantitative estimate of prediction accuracy.

A single baseline model was derived for each group of endpoints representing the same mechanism of hazardous action. Such model reflects a “cumulative” toxicity potential of chemicals in these assays. Experimental values specific for a particular assay were used during the local part of the modeling to yield final GALAS model for that endpoint.

Genotoxicity/Carcinogenicity Hazards

The knowledge-based expert system that identifies structural fragments potentially responsible for genotoxic effect is an extension of the previously described Ames mutagenicity hazards system [5]. The list of alerting groups was augmented with structural moieties that are frequently present in compounds tested positive in chromosomal damage assays, eucaryote gene mutation tests, as well as in carcinogens acting by non-genotoxic (epigenetic) mechanisms. The final list included 67 structural alerts, 14 of which represent epigenetic carcinogens (androgens, peroxisome proliferators, etc.).
Overall, the expert system was able to detect 94% of mutagens in the Ames test DB and 90% of compounds labeled as potent carcinogens by FDA.

The alert list is not limited to directly acting substructures, such as planar polycyclic arenes, aromatic amines, quinones, N-nitro and N-nitroso groups, but also includes various fragments that may undergo biotransformation to reactive intermediates. As an example, troglizatone, a thiazolidinedione class antidiabetic drug, was classified by the FDA as a potent carcinogen and has since been withdrawn from the USA market. The carcinogenic effect of this drug is mediated by several reactive metabolites. In human liver microsomes, the chromane ring of troglitazone is metabolized by CYP3A4 to form quinone and quinone-methide products. Furthermore, oxidative cleavage of thiazolidinedione ring results in a reactive sulfenic acid metabolite that also contains an isocyanate moiety [6].

References

[1] Matthews EJ et al. Regul Toxicol Pharmacol. 2007, 47, 115.
[2] Gold LS et al. Toxicol Sci. 2005, 85, 747.
[3] ChEMBL DB (https://www.ebi.ac.uk/chembldb). Target ID 206.
[4] Didziapetris R et al. J Comput Aided Mol Des. 2010, 24, 891.
[5] Didziapetris R et al. Toxicol Lett. 2008, 180, S152.
[6] Mansuy D & Dansette PM. Arch Biochem Biophys. 2011, 507, 1745.

Impurity Profiling

Contents

Overview

Features

Interface

Technical information

Experimental Data

Methods

Genotoxicity/Carcinogenicity Hazards

References

Navigation menu

Impurity Profiling

Overview

Features

Interface

Technical information

Experimental Data

Methods

Genotoxicity/Carcinogenicity Hazards

References

Navigation menu

Search