code
代码说明:
Minhashing和LSH算法。用于查找和比较一个pair是否是相似的。并分析false negative和false positive的值(Minhashing and Locality-Sensitive-Hashing (LSH). The algorithms are approximate in that they find only candidate pairs that are likely similar. Therefore, there are two types of error. A false positive (FP) is a candidate pair that is not actually similar according to the given similarity threshold. A false negative (FN) is a similar pair that is not a candidate pair. In this project, we will implement the Minhashing and LSH algorithms, apply them on data sets, and draw observations about when these algorithms perform. )
下载说明:请别用迅雷下载,失败请重下,重下不扣分!


