
Conversation


@Dilaxn Dilaxn commented Apr 13, 2025

Is your feature request related to a problem?
Yes. The current grouping logic uses a standard priority queue, so removing an arbitrary element runs in O(n). This creates a significant performance bottleneck, especially in scenarios with frequent updates or large data sets.

What solution would you like?
Introduce an Indexed Priority Queue in the query grouping logic. By allowing direct lookup of an element's position in the heap, arbitrary removals and updates can be done in O(log n) time instead of O(n). This significantly boosts performance for workloads with frequent insertions and removals.
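The idea can be sketched as follows. This is a minimal illustration, not the PR's actual implementation: class and method names are made up, and it assumes distinct, immutable, comparable elements.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Minimal indexed min-heap: indexMap gives O(1) position lookup, so an
// arbitrary element can be removed with O(log n) sifts instead of an
// O(n) scan.
public class IndexedMinPq<T extends Comparable<T>> {
    private final List<T> heap = new ArrayList<>();
    private final Map<T, Integer> indexMap = new HashMap<>();

    public void insert(T item) {
        heap.add(item);
        indexMap.put(item, heap.size() - 1);
        siftUp(heap.size() - 1);
    }

    public T peek() {
        return heap.isEmpty() ? null : heap.get(0);
    }

    // O(log n): locate via indexMap, swap to the end, then re-heapify.
    public boolean remove(T item) {
        Integer i = indexMap.get(item);
        if (i == null) {
            return false;
        }
        int last = heap.size() - 1;
        swap(i, last);                // no-op when item is already last
        heap.remove(last);
        indexMap.remove(item);
        if (i < last) {               // restore heap order at the vacated slot
            siftDown(i);
            siftUp(i);
        }
        return true;
    }

    private void siftUp(int i) {
        while (i > 0) {
            int parent = (i - 1) / 2;
            if (heap.get(i).compareTo(heap.get(parent)) >= 0) break;
            swap(i, parent);
            i = parent;
        }
    }

    private void siftDown(int i) {
        int n = heap.size();
        while (true) {
            int l = 2 * i + 1, r = l + 1, smallest = i;
            if (l < n && heap.get(l).compareTo(heap.get(smallest)) < 0) smallest = l;
            if (r < n && heap.get(r).compareTo(heap.get(smallest)) < 0) smallest = r;
            if (smallest == i) break;
            swap(i, smallest);
            i = smallest;
        }
    }

    private void swap(int a, int b) {
        T tmp = heap.get(a);
        heap.set(a, heap.get(b));
        heap.set(b, tmp);
        indexMap.put(heap.get(a), a);
        indexMap.put(heap.get(b), b);
    }
}
```

The key invariant is that every mutation of `heap` is mirrored in `indexMap`, which is what makes the O(log n) arbitrary removal possible.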

What alternatives have you considered?

1. Binary heap without indexing: simpler to implement, but removing an arbitrary element still requires an O(n) scan.

2. Fibonacci heap or pairing heap: these can offer better theoretical bounds for certain operations, but they are more complex and less practical for our specific use case.

Additional context
I have reviewed the existing code and tested how the new Indexed Priority Queue integrates with the grouping logic. Early benchmarks show a noticeable reduction in latency for frequent queue updates. I plan to add thorough test coverage to validate correctness and performance improvements before merging.

@Dilaxn Dilaxn force-pushed the feature/indexed-priority-queue branch 2 times, most recently from a65e75d to 9502ce3 Compare April 13, 2025 17:54
@kkhatua
Member

kkhatua commented Apr 14, 2025

@Dilaxn Thank you for taking interest in improving performance. Unfortunately we don't have any performance tests as yet to validate this. Would you be able to help share some numbers so as to showcase the performance gains? (cc: @deshsidd )

Member

@ansjcy ansjcy left a comment

@Dilaxn Thanks for your work on this! I've added several comments regarding the generic type handling and thread safety.

@deshsidd, could you please also take a look to verify the functional correctness of the Indexed Priority Queue logic? Thanks!

@@ -99,7 +99,7 @@ public class TopQueriesService {
/**
* The internal thread-safe store that holds the top n queries insight data
*/
-    private final PriorityBlockingQueue<SearchQueryRecord> topQueriesStore;
+    private final IndexedPriorityQueue<SearchQueryRecord> topQueriesStore;
Member

We don't do deletion for data stored in the PriorityQueue in TopQueriesService, I think the performance of PriorityBlockingQueue and IndexedPriorityQueue would be the same here.

Author

We're already handing the very same topQueriesStore into the MinMaxHeapQueryGrouper, so splitting it into two separate queues would just add needless complexity. By switching both the service and the grouper to a single IndexedPriorityQueue, we get one shared heap: no extra merge logic, no change in performance characteristics, and all inserts, polls, and snapshots operate on the same data structure. For those reasons, I'd stick with the IPQ here.

public class IndexedPriorityQueue<T> {

private final List<T> heap;
private final Map<T, Integer> indexMap;
Member

Here we use T as the map key, which requires T to implement equals/hashCode. Also, if T is mutable (which is true in our case), its equals and hashCode can change after the data is inserted, leaving the indexMap inconsistent (e.g. contains and remove might fail). The ordering also depends on the comparator T implements, though that should be fine in our case.
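To make the hazard concrete, here is a small standalone demo. MutableRecord is a hypothetical stand-in for a mutable record type, not the plugin's actual SearchQueryRecord.

```java
import java.util.HashMap;
import java.util.Map;

// Demonstrates the hazard: if a map key's hashCode changes after
// insertion, HashMap lookups and removals silently fail, because the
// entry still sits in the bucket chosen by the old hash.
public class MutableKeyHazard {
    static final class MutableRecord {
        int latency;
        MutableRecord(int latency) { this.latency = latency; }
        @Override public boolean equals(Object o) {
            return o instanceof MutableRecord && ((MutableRecord) o).latency == latency;
        }
        @Override public int hashCode() { return latency; }
    }

    public static void main(String[] args) {
        Map<MutableRecord, Integer> indexMap = new HashMap<>();
        MutableRecord rec = new MutableRecord(100);
        indexMap.put(rec, 0);

        rec.latency = 250; // key mutates after insertion

        // Both fail: the entry is still in the bucket for hashCode 100.
        System.out.println(indexMap.containsKey(rec)); // false
        System.out.println(indexMap.remove(rec));      // null
    }
}
```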

I think we should change IndexedPriorityQueue to IndexedPriorityQueue<K, V>, make the index map Map<K, Integer> indexMap, and make the heap List<Entry<K, V>> heap. We should also pass a comparator in the constructor, IMO.
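One possible shape for that suggestion, sketched under the assumption that K is an immutable identifier (class and method names here are illustrative, not the PR's actual code):

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Sketch of the suggested generic shape: the indexMap keys on a stable
// identifier K, values V may be mutable, and ordering comes from a
// comparator supplied at construction.
public class KeyedIpqSketch<K, V> {
    private static final class Entry<K, V> {
        final K key;
        V value;
        Entry(K key, V value) { this.key = key; this.value = value; }
    }

    private final List<Entry<K, V>> heap = new ArrayList<>();
    private final Map<K, Integer> indexMap = new HashMap<>();
    private final Comparator<V> comparator;

    public KeyedIpqSketch(Comparator<V> comparator) {
        this.comparator = comparator;
    }

    public void insert(K key, V value) {
        heap.add(new Entry<>(key, value));
        indexMap.put(key, heap.size() - 1);
        siftUp(heap.size() - 1);
    }

    // Re-prioritize an existing key in O(log n); insert if absent.
    public void update(K key, V value) {
        Integer i = indexMap.get(key);
        if (i == null) {
            insert(key, value);
            return;
        }
        heap.get(i).value = value;
        siftUp(i);
        siftDown(indexMap.get(key)); // sift from the key's current slot
    }

    public K peekKey() {
        return heap.isEmpty() ? null : heap.get(0).key;
    }

    private void siftUp(int i) {
        while (i > 0) {
            int parent = (i - 1) / 2;
            if (comparator.compare(heap.get(i).value, heap.get(parent).value) >= 0) break;
            swap(i, parent);
            i = parent;
        }
    }

    private void siftDown(int i) {
        int n = heap.size();
        while (true) {
            int l = 2 * i + 1, r = l + 1, smallest = i;
            if (l < n && comparator.compare(heap.get(l).value, heap.get(smallest).value) < 0) smallest = l;
            if (r < n && comparator.compare(heap.get(r).value, heap.get(smallest).value) < 0) smallest = r;
            if (smallest == i) break;
            swap(i, smallest);
            i = smallest;
        }
    }

    private void swap(int a, int b) {
        Entry<K, V> tmp = heap.get(a);
        heap.set(a, heap.get(b));
        heap.set(b, tmp);
        indexMap.put(heap.get(a).key, a);
        indexMap.put(heap.get(b).key, b);
    }
}
```

Because the heap only ever hashes on K, mutating a V in place (followed by an update call) can no longer corrupt the indexMap.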

Author

Done: keys are now of type K, separate from the mutable V values, so we only ever hash or compare on a stable identifier.

this.indexMap = new HashMap<>(initialCapacity);
}

public synchronized boolean insert(T item) {
Member

Thinking about it more: using synchronized on each method is a good start for thread safety, but it could become a bottleneck under high contention. It would make more sense to use finer-grained locking rather than synchronizing every public method. The main challenge is keeping the heap and the indexMap consistent (e.g. in siftUp, updates to the heap and the indexMap should be atomic).
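As a sketch of that direction (illustrative names, not the PR's final code): every mutation holds one write lock while it touches both the heap and the indexMap, so the two structures can never be observed out of sync, while read-only calls share a read lock.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.concurrent.locks.ReadWriteLock;
import java.util.concurrent.locks.ReentrantReadWriteLock;

// Read/write locking around the heap + indexMap pair: writers are
// exclusive, so heap and indexMap are always updated atomically with
// respect to readers; concurrent readers do not block each other.
public class LockedIpq<T extends Comparable<T>> {
    private final List<T> heap = new ArrayList<>();
    private final Map<T, Integer> indexMap = new HashMap<>();
    private final ReadWriteLock lock = new ReentrantReadWriteLock();

    public void insert(T item) {
        lock.writeLock().lock();
        try {
            heap.add(item);
            indexMap.put(item, heap.size() - 1);
            siftUp(heap.size() - 1); // heap + indexMap updated under one lock
        } finally {
            lock.writeLock().unlock();
        }
    }

    public T peek() {
        lock.readLock().lock();
        try {
            return heap.isEmpty() ? null : heap.get(0);
        } finally {
            lock.readLock().unlock();
        }
    }

    public int size() {
        lock.readLock().lock();
        try {
            return heap.size();
        } finally {
            lock.readLock().unlock();
        }
    }

    // Callers already hold the write lock.
    private void siftUp(int i) {
        while (i > 0) {
            int parent = (i - 1) / 2;
            if (heap.get(i).compareTo(heap.get(parent)) >= 0) break;
            T tmp = heap.get(i);
            heap.set(i, heap.get(parent));
            heap.set(parent, tmp);
            indexMap.put(heap.get(i), i);
            indexMap.put(heap.get(parent), parent);
            i = parent;
        }
    }
}
```

This buys concurrent reads at the cost of some lock overhead on writes; whether it beats plain synchronized depends on the read/write ratio under real contention.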

Author

Implemented separate locks for read and write operations.

@ansjcy
Member

ansjcy commented Apr 15, 2025

Also the unit test testAddRecordToLimitAndDrain is failing, could you double check?
https://github.com/opensearch-project/query-insights/actions/runs/14431905665/job/40541189714?pr=299

You can run ./gradlew test on your local env to validate your changes as well.

@Dilaxn
Author

Dilaxn commented Apr 15, 2025

Thank you @ansjcy and @kkhatua for your feedback. I'll work through the review comments and address them.

@Dilaxn Dilaxn force-pushed the feature/indexed-priority-queue branch from 9502ce3 to 06fd7bc Compare April 19, 2025 19:24
@Dilaxn Dilaxn force-pushed the feature/indexed-priority-queue branch from 06fd7bc to 4909bf7 Compare April 19, 2025 19:34
@Dilaxn
Author

Dilaxn commented Apr 19, 2025

Hi @ansjcy, I've updated the code per your review:
- Replaced T with Entry<K, V>.
- Refined the locking mechanism.
- Fixed the unit tests.

Please verify the changes. Thank you for your time and consideration.

@ansjcy
Member

ansjcy commented Apr 28, 2025

As @kkhatua mentioned:

> Would you be able to help share some numbers so as to showcase the performance gains?

We might need to do some benchmarking tests and CPU profiling to understand the performance gains.

@deshsidd
Collaborator

deshsidd commented May 1, 2025

@Dilaxn Hope you are able to make progress on this PR. Please let us know if you need any guidance :)

@Dilaxn
Author

Dilaxn commented May 2, 2025

@deshsidd @ansjcy
Thank you for your patience. I’ve been spending additional time testing both functionality and performance to ensure the solution is solid. I’ll continue working on it over the weekend and will post an update soon.
Appreciate the support and guidance!

@Dilaxn
Author

Dilaxn commented May 6, 2025

@ansjcy @kkhatua
PQ.json
IPQ.json
Attached the benchmark results comparing the PriorityBlockingQueue (PQ) version of the Query Insights plugin with the updated IndexedPriorityQueue (IPQ) version.

Benchmark Setup:
Benchmark tool: OpenSearch Benchmark
Workload: nyc_taxis
Pipeline: benchmark-only
Environment: Local, external provisioned OpenSearch cluster (3.0.0-SNAPSHOT)
Metric source: JSON results from benchmark-results.json captured after each run
Plugin versions: plugin_version: pq vs. plugin_version: ipq (tagged via --user-tag)

Here's a summary of key performance metrics:

| Metric | PQ | IPQ | Δ |
|---|---|---|---|
| Index throughput (mean) | 50,856 docs/s | 61,300 docs/s | ↑ 20.5% |
| p90 latency | 1,203 ms | 804 ms | ↓ 33% |
| p99 latency | 5,929 ms | 2,043 ms | ↓ 65% |
| Indexing duration | 3,532 s | 2,825 s | ↓ 20% |
| Distance agg service time (mean) | 6,250 ms | 11,173 ms | ↑ 78% |
| Merge time | 6,422 s | 5,886 s | ↓ 8% |
| Segment count | 27 | 21 | ↓ 22% |
| Young GC time | 167 s | 145 s | ↓ 13% |

Key Takeaways:

The IPQ-based version significantly improves indexing throughput and reduces latency, especially at p99.

A notable increase in distance aggregation service time (↑ 78% mean) was observed and may warrant further investigation.

Overall, the switch to IPQ results in lower GC pressure, faster merging, and fewer segments.

Please feel free to correct me if there’s any mistake. Thank you!

@Dilaxn
Author

Dilaxn commented May 9, 2025

✅ Functional Parity & Concurrency Benchmark Validation for IPQ

Hi @deshsidd ,

I've completed a comprehensive validation of our IndexedPriorityQueue (IPQ) against the suggested IndexMinPQ implementation.
🔍 Functional Parity

✅ Verified 1:1 behavioral parity using:

- 1,000 randomized functional test scenarios
- Property-based tests (250+ diverse operation sequences)
- Edge-case coverage (insert, remove, peek, poll correctness)

🎯 Both queues produced identical sorted outputs and consistent state transitions.

⚙️ Concurrency Comparison

Benchmarked concurrent insertions (8 threads, 100,000 entries) against sequential IndexMinPQ.

Observation: IPQ performs slightly slower than the non-thread-safe version in microbenchmarks, primarily due to the overhead introduced by ReentrantReadWriteLock — which is expected for thread-safe implementations.

Advantage: IPQ is concurrency-safe by design and scales better under real-world multithreaded conditions (read/write workloads).

For this, I created a separate Gradle project with the help of GPT: https://github.com/Dilaxn/indexed-priority-queue-compat

Thanks again for the guidance!

Best regards,
Dilakshan

@ansjcy
Member

ansjcy commented Jun 3, 2025

@Dilaxn This is very impressive, thanks for doing the performance benchmarking! Let me take a final check on the PR.
Also, since this change mainly affects the search path, I'm wondering why we are seeing an improvement in indexing. Do you have more performance numbers on the search path?

@Dilaxn
Author

Dilaxn commented Jun 7, 2025

> @Dilaxn This is very impressive, thanks for doing the performance benchmarking! Let me take a final check on the PR. Also, since this change mainly affects search path, I'm wondering why are we seeing an improvement on indexing. Do you have more performance numbers on the search path?

Hi @ansjcy, Thanks for the feedback!
I’ve attached the benchmark reports for the nyc_taxis workload using both IndexedPriorityQueue (the new change) and PriorityBlockingQueue (master).
IndexedPriorityQueue.json
master benchmark.txt
PriorityBlockingQueue.json

@deshsidd
Collaborator

@Dilaxn Thanks for the parity checks and the benchmarks. This is pretty impressive! Let me take a final look today.

Collaborator

@deshsidd deshsidd left a comment

LGTM overall. Thanks again for the change. Some minor comments, @Dilaxn.
Once addressed, we can aim to rebase, approve, and merge.

@ansjcy
Member

ansjcy commented Jun 12, 2025

Hmm, looking at the full benchmark run results, I see latency increases with the indexed PQ on almost all search workloads. I'd like us to do some extra benchmarks and CPU profiling. Let's hold off on this PR for now until we fully understand the performance impact.
