Name: Accelerating Aggregations with Lucene-Aware Execution in OpenSearch - Ankit Jain, Amazon Web Services
Start: 2026-04-16T15:50:00+0200
End: 2026-04-16T16:30:00+0200

16-17 April 2026 | Prague, Czechia

Accelerating Aggregations with Lucene-Aware Execution in OpenSearch - Ankit Jain, Amazon Web Services

Thursday April 16, 2026 15:50 - 16:30 CEST

Bohemia 2

Aggregation performance in OpenSearch is often limited by JVM execution overhead rather than by the aggregation logic itself. Most queries process one document at a time (DAAT) through Lucene collectors and rely on deep virtual call chains, which restrict JIT optimizations and waste CPU resources at scale.

This talk explores how Lucene-aware, bulk-oriented execution can significantly speed up aggregation processing. We will show how batching document processing, simplifying collector and doc values access patterns, and reducing virtual method dispatch lead to more JVM-friendly execution paths. We will also touch on how Lucene skip indexes built on sorted doc values can be used to skip or bulk-process ranges of documents during aggregation, reducing unnecessary work when data can be summarized at a higher level.
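The idea of skipping or bulk-processing ranges of documents can be illustrated with a minimal, language-neutral sketch. This is not Lucene or OpenSearch code: `BlockSummary`, `build_summaries`, `sum_in_range_per_doc`, and `sum_in_range_bulk` are hypothetical names, and the per-block minimum, maximum, and sum are an assumed stand-in for what a skip index over sorted doc values might expose. The sketch only counts how many values are visited individually; it does not model JIT behavior or virtual dispatch.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple

BLOCK_SIZE = 4  # hypothetical block size, kept tiny for the demonstration


@dataclass(frozen=True)
class BlockSummary:
    """Hypothetical per-block summary standing in for a skip-index entry."""
    start: int      # index of the first value in the block
    end: int        # one past the index of the last value in the block
    min_value: int
    max_value: int
    total: int      # precomputed sum of the block's values


def build_summaries(values: Sequence[int],
                    block_size: int = BLOCK_SIZE) -> List[BlockSummary]:
    """Summarize consecutive blocks of values (assumed to be built at index time)."""
    summaries = []
    for start in range(0, len(values), block_size):
        block = values[start:start + block_size]
        summaries.append(BlockSummary(start, start + len(block),
                                      min(block), max(block), sum(block)))
    return summaries


def sum_in_range_per_doc(values: Sequence[int], lo: int, hi: int) -> int:
    """Baseline: one range check and one add per document."""
    total = 0
    for v in values:
        if lo <= v <= hi:
            total += v
    return total


def sum_in_range_bulk(values: Sequence[int], summaries: Sequence[BlockSummary],
                      lo: int, hi: int) -> Tuple[int, int]:
    """Return (sum, number of values visited individually)."""
    total = 0
    visited = 0
    for s in summaries:
        if s.max_value < lo or s.min_value > hi:
            continue              # no value in the block can match: skip it
        if lo <= s.min_value and s.max_value <= hi:
            total += s.total      # every value matches: use the summary
            continue
        for v in values[s.start:s.end]:  # partial overlap: fall back to per-value
            visited += 1
            if lo <= v <= hi:
                total += v
    return total, visited


values = list(range(1, 17))       # 1..16, sorted, as sorted doc values would be
summaries = build_summaries(values)
print(sum_in_range_per_doc(values, 6, 12))          # 63
print(sum_in_range_bulk(values, summaries, 6, 12))  # (63, 4)
```

In this toy input, both functions return the same sum, but the bulk version inspects only the 4 values of the one partially overlapping block instead of all 16. How much real aggregations benefit depends on how well the data is clustered, which is one of the trade-offs a bulk-oriented design has to weigh.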

Using real OpenSearch aggregation pipelines as examples, the session presents early performance results and discusses practical trade-offs when moving from fine-grained per-document logic to bulk processing. The talk is aimed at engineers working on search engines, Lucene-based systems, or JVM performance.

Speakers

Ankit Jain

Lucene Committer | OpenSearch Maintainer | AWS OpenSearch, Amazon

Ankit Jain is a Software Engineer on the Amazon OpenSearch Service team, leading performance and scalability initiatives for search infrastructure. He is an active maintainer and committer for the Apache Lucene and OpenSearch projects, with hands-on experience operating large-scale...

BulkCollection Europe pdf

Search & Apache Lucene

OpenSearchCon Europe 2026
