Skip to content

Commit 1a172b7

Browse files
Fixed the issue toLower in de-duplication query
1 parent 1d9ac23 commit 1a172b7

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

backend/src/graphDB_dataAccess.py

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -399,9 +399,9 @@ def get_duplicate_nodes_list(self):
399399
AND
400400
(
401401
// either contains each other as substrings or has a text edit distinct of less than 3
402-
(size(toString(other.id)) > 2 AND toLower(n.id) CONTAINS toLower(other.id)) OR
403-
(size(toString(n.id)) > 2 AND toLower(other.id) CONTAINS toLower(n.id))
404-
OR (size(toString(n.id))>5 AND apoc.text.distance(toLower(n.id), toLower(other.id)) < $duplicate_text_distance)
402+
(size(toString(other.id)) > 2 AND toLower(toString(n.id)) CONTAINS toLower(toString(other.id))) OR
403+
(size(toString(n.id)) > 2 AND toLower(toString(other.id)) CONTAINS toLower(toString(n.id)))
404+
OR (size(toString(n.id))>5 AND apoc.text.distance(toLower(toString(n.id)), toLower(toString(other.id))) < $duplicate_text_distance)
405405
OR
406406
vector.similarity.cosine(other.embedding, n.embedding) > $duplicate_score_value
407407
)] as similar

0 commit comments

Comments
 (0)