A metric for pairwise similarity analysis of binary cheminformatics data

2nd International Conference on Chemo and Bioinformatics ICCBIKG 2023 (593-596)

AUTHOR(S): Izudin Redžepović

E-MAIL: iredzepovic@np.ac.rs

DOI: 10.46793/ICCBI23.593R

ABSTRACT:

This paper presents an evaluation of a novel similarity measure for pairwise comparison of compounds. The measure, called the Substructure Similarity Index, is based on comparing the substructures identified within the compounds. An evaluation on a large dataset of drugs, together with a comparison against other commonly used indices, shows that the Substructure Similarity Index is suitable for molecular similarity calculations because it provides information that cannot be obtained from the available measures.
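
For context on what a pairwise comparison of binary fingerprints looks like, the sketch below computes the Tanimoto (Jaccard) coefficient, one of the commonly used indices for such data. It is not the Substructure Similarity Index, whose definition is not given in this abstract. The function name `tanimoto`, the 8-bit example fingerprints, and the convention for two all-zero vectors are assumptions made for illustration only.

```python
def tanimoto(x, y):
    """Tanimoto (Jaccard) coefficient for two equal-length binary vectors."""
    if len(x) != len(y):
        raise ValueError("fingerprints must have the same length")
    # Number of positions where both fingerprints have the bit set.
    both = sum(1 for a, b in zip(x, y) if a and b)
    # Number of positions where at least one fingerprint has the bit set.
    either = sum(1 for a, b in zip(x, y) if a or b)
    # Assumed convention: two all-zero vectors are treated as identical.
    return 1.0 if either == 0 else both / either


# Hypothetical fingerprints: each position flags the presence of one substructure.
fp_a = [1, 1, 0, 1, 0, 0, 1, 0]
fp_b = [1, 0, 0, 1, 0, 1, 1, 0]

print(tanimoto(fp_a, fp_b))  # 3 shared bits / 5 bits set in either = 0.6
```

The coefficient ranges from 0 (no shared substructures) to 1 (identical fingerprints), so every pair of compounds is summarised by a single number.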

KEYWORDS:

molecular similarity, molecular structure, binary vectors, molecular fingerprints, similarity measure
