Skip to content

flow tests of tiered with dbpedia to run bm (including flat buffer limit) #366

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft
wants to merge 14 commits into
base: feature_HNSW_tiered_index
Choose a base branch
from

Conversation

meiravgri
Copy link
Collaborator

Describe the changes in the pull request

A clear and concise description of what the PR is solving.

Which issues this PR fixes

  1. #...
  2. MOD...

Main objects this PR modified

  1. ...
  2. ...

Mark if applicable

  • This PR introduces API changes
  • This PR introduces serialization changes

Base automatically changed from tiered_index_enable_sync_mode to feature_HNSW_tiered_index May 7, 2023 09:38
GuyAv46 and others added 7 commits May 8, 2023 12:56
* small modification to bf batch iterator

* remove promise of perfect score in HNSW multi batch

* implement batch iterator for tiered and some needed helpers

* fix for merge results

* make the iterator a nested class, fix and modify logic

* added first unit test

* some fixes and more tests

* another test

* first overlapping vector tests

* fix a bug on reallocation

* added an edge cases test

* added comments

* added `BY_SCORE_THEN_ID` order and sorter

* make BF batch iterator use it in select-base search

* modification to the BI to handle resize while alive, and use BY_SCORE_THEN_ID

* added dynamic parallel test

* move iterator from generic vec_sim_tiered to hnsw_tiered

* leak fixes

* fix clang build

* minor test refactor

* review fixes

* decrease index size

* move some array logic to arr_cpp.h

* after rebase fixes

* review fixes
This reverts commit fa7307e, reversing
changes made to b9a12a6.
* implemented `rangeQuery` for VecSimTieredIndex,
... including needed utility functions

* renaming `merge_results.h` and moving `filter_results` to it

* fix build

* first test and some fixes

* improved test and added a parallel test

* fix a bug where we safely get (from `safeGetEntryPoint`)
the old entry point but then we get the new max
level when trying to search using the old one

* fix tests

* Update comments

* review fixes

* after rebase fixes

* added a general comment on tiered index's guarantees
…s requires changing a bit the logic of updating jobs id after removing a vector from flat buffer and swap ids. (#367)
@meiravgri meiravgri force-pushed the meiravg_bufferlimit_bm branch from 1d73f9f to 8adef94 Compare May 8, 2023 14:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants