Commit d02a0f1

Merge pull request #36 from wsdonny/patch-1
Typo fix
2 parents 5c4652f + 7232169 commit d02a0f1

1 file changed, +1 -1 lines changed

README.md

Lines changed: 1 addition & 1 deletion
@@ -89,7 +89,7 @@ A lot of nearest-neighbor search algorithms and indexing structures rely on the
 ## Shingles (n-gram) based similarity and distance
 A few algorithms work by converting strings into sets of n-grams (sequences of n characters, also sometimes called k-shingles). The similarity or distance between the strings is then the similarity or distance between the sets.

-Some ot them, like jaccard, consider strings as sets of shingles, and don't consider the number of occurences of each shingle. Others, like cosine similarity, work using what is sometimes called the profile of the strings, which takes into account the number of occurences of each shingle.
+Some of them, like jaccard, consider strings as sets of shingles, and don't consider the number of occurences of each shingle. Others, like cosine similarity, work using what is sometimes called the profile of the strings, which takes into account the number of occurences of each shingle.

 For these algorithms, another use case is possible when dealing with large datasets:
 1. compute the set or profile representation of all the strings
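
The set-versus-profile distinction described in the changed README paragraph can be illustrated with a minimal Python sketch. The function names (`shingles`, `jaccard_similarity`, `cosine_similarity`), the default `k=2`, and the handling of strings too short to produce any shingle are assumptions made for illustration only; this is not the library's API or implementation.

```python
from collections import Counter
from math import sqrt


def shingles(s: str, k: int = 2) -> list[str]:
    """Return every k-character substring of s, in order, repeats included."""
    return [s[i:i + k] for i in range(len(s) - k + 1)]


def jaccard_similarity(a: str, b: str, k: int = 2) -> float:
    """Set-based: each distinct shingle counts once, however often it occurs."""
    sa, sb = set(shingles(a, k)), set(shingles(b, k))
    if not sa and not sb:
        return 1.0  # assumption: two strings without shingles count as identical
    return len(sa & sb) / len(sa | sb)


def cosine_similarity(a: str, b: str, k: int = 2) -> float:
    """Profile-based: the occurrence count of each shingle is a vector component."""
    pa, pb = Counter(shingles(a, k)), Counter(shingles(b, k))
    dot = sum(count * pb[sh] for sh, count in pa.items())
    norm = sqrt(sum(c * c for c in pa.values())) * sqrt(sum(c * c for c in pb.values()))
    return dot / norm if norm else 0.0  # assumption: an empty profile gives 0.0


print(jaccard_similarity("abab", "ab"), jaccard_similarity("ababab", "ab"))
print(cosine_similarity("abab", "ab"), cosine_similarity("ababab", "ab"))
```

With these definitions, `"abab"` and `"ababab"` have the same shingle set (`ab`, `ba`), so their Jaccard similarity to `"ab"` is 0.5 in both cases. Their profiles differ, so the cosine similarity changes from about 0.894 to about 0.832.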
