Improving the usefulness of molecular similarity-based chemical prioritization strategies

C Li, L M Colosi

doi:10.1080/1062936x.2013.792876

Abstract

Quantitative molecular similarity analysis (QMSA) is a seemingly useful tool for estimating environmental properties for the hundreds of emerging contaminants that have not yet been fully evaluated. Moreover, calibrated QMSA models are also useful for prioritizing research among currently unmeasured chemicals of interest. Previous work has demonstrated that prioritization based on molecular ‘representativeness’, as parameterized using summed Euclidean distances in n dimensions corresponding to n molecular descriptors, improves the prediction accuracy of QMSA models compared to random selection of compounds to be measured. In this study, we use two datasets of environmental parameters (i.e. in vitro oestrogenicity and sorption distribution coefficient Kd) to demonstrate that maximizing representativeness alone cannot deliver optimal improvement in prediction accuracy if many of the chemicals that have already been measured are themselves highly representative. Thus, proper QMSA-based prioritization among unmeasured chemicals constitutes a balance between maximizing representativeness and minimizing redundancy. It is demonstrated that redundancy considerations are especially critical for highly heterogeneous datasets, and some discussion about achieving a proper balance between the two prioritization criteria is presented.
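
The two prioritization criteria named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that a smaller summed Euclidean distance to all other chemicals means higher representativeness, and it treats redundancy as closeness to the nearest already-measured chemical. The function names, the normalization, and the linear `weight` blend are hypothetical choices made only for illustration; the abstract does not specify how the balance is struck.

```python
import math


def summed_distances(descriptors):
    """For each chemical, sum its Euclidean distances to every chemical
    in n-dimensional descriptor space (distance to itself is zero)."""
    return [sum(math.dist(a, b) for b in descriptors) for a in descriptors]


def prioritize(descriptors, measured, weight=0.5):
    """Rank unmeasured chemicals, best candidate first.

    weight=1.0 uses representativeness only; weight=0.0 uses only
    non-redundancy relative to the already-measured chemicals.
    The linear blend is an illustrative assumption, not the paper's rule.
    """
    totals = summed_distances(descriptors)
    max_total = max(totals) or 1.0  # guard against all-identical points
    max_pair = max(math.dist(a, b) for a in descriptors for b in descriptors) or 1.0
    measured = set(measured)

    scores = {}
    for i, point in enumerate(descriptors):
        if i in measured:
            continue
        # Smaller summed distance -> more central -> more representative.
        representativeness = 1.0 - totals[i] / max_total
        # Far from every measured chemical -> less redundant.
        if measured:
            nearest = min(math.dist(point, descriptors[j]) for j in measured)
            novelty = nearest / max_pair
        else:
            novelty = 1.0
        scores[i] = weight * representativeness + (1.0 - weight) * novelty

    return sorted(scores, key=lambda i: -scores[i])


# Toy descriptor vectors (made up for illustration); chemical 1 is already measured.
toy = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (10.0, 0.0)]
by_representativeness = prioritize(toy, measured=[1], weight=1.0)
by_novelty = prioritize(toy, measured=[1], weight=0.0)
```

In this toy set, ranking by representativeness alone favours a chemical next to the one already measured, whereas ranking by non-redundancy alone favours the outlier. An intermediate `weight` trades the two off, which is the kind of balance the abstract argues for.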

Journal: SAR and QSAR in Environmental Research	Publication Date: May 28, 2013
Citations: 1

Similar Papers

QSAR Studies of Antiviral Agents Using Molecular Similarity Analysis and Structure-Activity Maps
R R Parakulam ... C Tsai
SAR and QSAR in Environmental Research | VOL. 10
01 Jul 1999

Additional file 1: of The prediction accuracy of dynamic mixed-effects models in clustered data
01 Jan 2015

Optimal neighbor selection in molecular similarity: comparison of arbitrary versus tailored prediction spaces†
B D Gute ... S C Basak
SAR and QSAR in Environmental Research | VOL. 17
01 Feb 2006

Quantitative molecular similarity analysis (QMSA) methods for property estimation: A comparison of property-based, arbitrary, and tailored similarity spaces
S.C Basak ... D Mills
SAR and QSAR in Environmental Research | VOL. 13
01 Jan 2002
