Text this: The study of probability model for compound similarity searching