Cao Y, Jiang T and Girke T (2010).
“Accelerated similarity searching and clustering of large compound sets by geometric embedding and locality sensitive hashing.”
Bioinformatics, 26(7), pp. 953–959.
doi: 10.1093/bioinformatics/btq067, https://doi.org/10.1093/bioinformatics/btq067.