November 25, 2024, 05:51:39 PM
Forum Rules: Read This Before Posting


Topic: Molecule Similarity  (Read 3236 times)

0 Members and 1 Guest are viewing this topic.

Offline JulesMhz

  • Very New Member
  • *
  • Posts: 1
  • Mole Snacks: +0/-0
Molecule Similarity
« on: May 17, 2023, 10:54:11 AM »
Hello everyone,

I have a question regarding molecule similarity computation. I'm more from computation than chemistry, so it is a fairly new topic for me, and I'm actually working with a (quantum computing) algorithm for molecule similarity computation.

So here is my question, given these molecules:
- niacin c1(cccnc1)C(=O)O, hereafter "reference molecule"
- 4-CARBOXYPIPERIDINE c1(ccncc1)C(=O)O, hereafter "molecule 1"
- nicotinamide c1cc(cnc1)C(=O)N, hereafter "molecule 2"
- P modified nicotinamide c1cc(cnc1)C(=O)P, hereafter "molecule 3"

If I compute the Tanimoto similarity between reference and molecule 1, I have 0.419.
If I compute the Tanimoto similarity between reference and molecule 2, I have 0.633.

What I observe is that Tanimoto similarity considers that molecule 2 is more similar to reference molecule than molecule 1, but if we look at molecule illustrations, we notice that molecule 1 differs from reference by one N atom moved by one position, whereas molecule 2 differs from reference by one molecule which is not the same.

So, in an algorithmic point of view, it makes sense that molecule 1 has two molecule differences (one N replaced by C, and one C replaced by N) whilst molecule 2 has only one molecule difference (OH replaced by NH2) so the similarity is lower for molecule 1.
But, in a chemical point of view, does this also make sense ? I mean, why just moving one N atom is less similar than changing one atom by an other ? In other word, is the chemical function of molecule 2 more similar than molecule 1 to reference molecule ?

An other observation, if I compute the Tanimoto similarity between reference and molecule 3, I have 0.633 (like for molecule 2), so Tanimoto distance does not take in account the fact that one atom differs between molecule 2 and 3, whilst my "non-chemical-specialist" mind would guess than one is more similare than the other as they are not equivalent ?

Finally, is there a "chemical" process (by chemical, I mean not algorithmic) to compare molecules in order to have "chemical function" similarity I can refer to ?

Thank you for your help, I hope my questions are well formulated.

Offline Borek

  • Mr. pH
  • Administrator
  • Deity Member
  • *
  • Posts: 27864
  • Mole Snacks: +1813/-412
  • Gender: Male
  • I am known to be occasionally wrong.
    • Chembuddy
Re: Molecule Similarity
« Reply #1 on: May 17, 2023, 02:23:17 PM »
No idea. But in general similarity is a poorly defined concept, so you won't get any exact answers.

What is more similar to a square - a triangle, or a pentagram?

Sure, you can choose some set of rules to calculate "similarity" index, but it will be always arbitrary and as such can work for some applications and not work for others. This is a can of worms if you want to pretend there is any strict science behind.
ChemBuddy chemical calculators - stoichiometry, pH, concentration, buffer preparation, titrations.info

Offline Corribus

  • Chemist
  • Sr. Member
  • *
  • Posts: 3551
  • Mole Snacks: +545/-23
  • Gender: Male
  • A lover of spectroscopy and chocolate.
Re: Molecule Similarity
« Reply #2 on: May 22, 2023, 02:23:16 PM »
But, in a chemical point of view, does this also make sense ? I mean, why just moving one N atom is less similar than changing one atom by an other ? In other word, is the chemical function of molecule 2 more similar than molecule 1 to reference molecule ?
It would also be important to address the question: similar with respect to what? 1 and 2 may be, for example, the most similar in terms of boiling point, but the least similar in terms of reactivity with an acid. (Just a hypothetical, I didn't look anything up.) You need to define your property of interest; similarity between chemical structures means nothing in any general sense.
What men are poets who can speak of Jupiter if he were like a man, but if he is an immense spinning sphere of methane and ammonia must be silent?  - Richard P. Feynman

Offline clarkstill

  • Chemist
  • Full Member
  • *
  • Posts: 477
  • Mole Snacks: +77/-4
Re: Molecule Similarity
« Reply #3 on: May 24, 2023, 09:23:17 AM »
Note, either the structure or the name are incorrect for molecule 1 - the image is of a pyridine while you have called it a piperidine.

Sponsored Links