re: Investigating query performance issue in RavenDB

Oct 27 2020

reInvestigating query performance issue in RavenDB

time to read 3 min | 533 words

At the beginning of the year, we run into a problematic query. The issue was the use of an in clause vs. a series of OR. You can see the previous investigation results here. We were able to pinpoint the issue pretty well, very deep in the guts of Lucene, our query engine.

Fast Query	Slow Query

Time: 1 – 2 ms	Time: 60 – 90 ms

The key issue for this query was simple. There are over 600,000 orders with the relevant statuses, but there are no orders for CustomerId “customers/100”. In the OR case, we would evaluate the query lazily. First checking the CustomerId, and given that there have been no results, short circuiting the process and doing no real work for the rest of the query. The IN query, on the other hand, would do things eagerly. That would mean that it would build a data structure that would hold all 600K+ documents that match the query, and then would throw that all away because no one actually needed that.

In order to resolve that, I have to explain a bit about the internals of Lucene. As its core, you can think of Lucene in terms of sorted lists inside dictionaries. I wrote a series of posts on the topic, but the gist of it is:

Note that the ids for documents containing a particular term are sorted. That is important for a lot of optimizations in Lucene, which is also a major problem for the in query. The problem is that each component in the query pipeline needs to maintain this invariant. But when we use an IN query, we need to go over potentially many terms. And then we need to get the results in the proper order to the calling code. I implemented a tiered approach. If we are using an IN clause with a small number of terms in it (under 128), we will use a heap to manage all the terms and effectively do a merge sort on the results.

When we have more than 128 terms, that stops being very useful, however. Instead, we’ll create a bitmap for the possible results and scan through all the terms, filling the bitmap. That can be expensive, of course, so I made sure that this is done lazily by RavenDB.

The results are in:

	OR Query	IN Query
Invalid CustomerId	1.39 – 1.5 ms	1.33 – 1.44 ms
Valid CustomerId	17.5 ms	12.3 ms

For the first case, this is now pretty much a wash. The numbers are slightly in favor of the IN query, but it is within the measurement fluctuations.

For the second case, however, there is a huge performance improvement for the IN query. For that matter, the cost is going to be more noticeable the more terms you have in the IN query.

I’m really happy about this optimization, it ended up being quite elegant.

Tweet Share Share 4 comments

Tags:

More posts in "re" series:

(05 Dec 2025) Build AI that understands your business
(02 Dec 2025) From CRUD TO AI – building an intelligent Telegram bot in < 200 lines of code with RavenDB
(29 Sep 2025) How To Run AI Agents Natively In Your Database
(22 Sep 2025) How To Create Powerful and Secure AI Agents with RavenDB
(29 May 2025) RavenDB's Upcoming Optimizations Deep Dive
(30 Apr 2025) Practical AI Integration with RavenDB
(19 Jun 2024) Building a Database Engine in C# & .NET
(05 Mar 2024) Technology & Friends - Oren Eini on the Corax Search Engine
(15 Jan 2024) S06E09 - From Code Generation to Revolutionary RavenDB
(02 Jan 2024) .NET Rocks Data Sharding with Oren Eini
(01 Jan 2024) .NET Core podcast on RavenDB, performance and .NET
(28 Aug 2023) RavenDB and High Performance with Oren Eini
(17 Feb 2023) RavenDB Usage Patterns
(12 Dec 2022) Software architecture with Oren Eini
(17 Nov 2022) RavenDB in a Distributed Cloud Environment
(25 Jul 2022) Build your own database at Cloud Lunch & Learn
(15 Jul 2022) Non relational data modeling & Database engine internals
(11 Apr 2022) Clean Architecture with RavenDB
(14 Mar 2022) Database Security in a Hostile World
(02 Mar 2022) RavenDB–a really boring database

Comments

27 Oct 2020
12:24 PM

Nicholas Paldino

Curious, why isn't the second query transformed into the first query? Aren't they logically equivalent?

28 Oct 2020
08:00 AM

Dalibor Čarapić

Just a small correction: Your image of the 'slow' code is the same as the image of the 'fast' code.
Aside: Your sign in to comment with Google still doesn't seem to work. I always have to enter my name and email before posting a comment :(

28 Oct 2020
15:54 PM

Oren Eini

Dalibor,

Thanks, I fixed the issue with the images.

28 Oct 2020
15:55 PM

Oren Eini

Nicholas,

Yes, you can convert one to the other. In practice, this is what we are doing internally for small queries, although we do that in a deeper mode than just converting the query.

Comment preview

Comments have been closed on this topic.

Oren Eini

Oren Eini

CEO of RavenDB