Voron Internals: The diff is the way

Sep 13 2016

Voron InternalsThe diff is the way

time to read 3 min | 486 words

I talked about the goals of using diffs for the journals in a previous post, but in this one, I want to talk about what it actually did. To start with, it turn out that using diffs in the journal invalidates quite a few optimizations that we had. We had a whole bunch of stuff that we could do to avoid writing data to the data file if there are additional modifications to it. When using diffs, we can’t do that, because the next diff is building on the previous version. That actually ended up improving the code quality, because a bunch of pretty tricky code had to go away.

It was tricky code because we tried to reduce the data file I/O, but only in the cases that it was allowed, which wasn’t always. Now we always write the latest version of all the modified pages, which might add some additional I/O in some cases, but in practice, this didn’t show up in our benchmarks. And the reduction of complexity might have been worth it even if it was.

We actually have two diff implementation, one that tests two different versions of a page and find the differences, and one that test a page against zeros. That, in addition to collapsing runs of zeros, is the entire implementation. We are actually not as small as we could get, because we insert some metadata into the diff stream to allow easier debugging and error detection. But the nice thing about the diff output is that we are still compressing it, so working extra hard to reduce the uncompressed data isn’t that important if compression will cover it anyway.

We tested before & after of using diffs for journal writes, and we found the following:

Journal size (inserts)	37% reduction in size
Journal size (updates)	80% reduction in size
Bulk insert speed	5% – 10% speed improvement *
Writes / sec (inserts)	12% improvement
Writes / sec (updates)	18% improvement

* If the sizes went down so significantly, why haven’t the insert & update speed improve by a comparable amount?

The answer to that is that for the most part, reducing the amount of data we are writing is awesome, a major limiting factor is the number of times we write to the journal, rather than the number of bytes. And we have to write once per transaction, so that is a limiting factor.

However, the current numbers we are looking at is a benchmark showing roughly 30,000 write requests / second and are unable to get anything higher, because we have saturated the local network. We are currently setting up a dedicated testing environment to see how far we can push this.

Tweet Share Share 8 comments

Tags:

More posts in "Voron Internals" series:

(13 Sep 2016) The diff is the way
(07 Sep 2016) Reducing the journal
(31 Aug 2016) I/O costs analysis
(30 Aug 2016) The transaction journal & recovery
(29 Aug 2016) Cleaning up scratch buffers
(26 Aug 2016) MVCC - All the moving parts

Comments

14 Sep 2016
08:53 AM

alex

That sounds like goal achieved. Seems that the changes to make were rather involved as you initially mentioned, so it is nice that simpler code was the byproduct.

That you are now able to saturate the network and be I/O bound by the network bandwidth is rather funny. That is a good place to be for a database product. And well, removing one performance bottleneck will obviously reveal the next one. Also, having a journal size that is substantially reduced should help reduce network bandwidth requirements in a log-shipping replication setup.

I am curious what the results will be in your new testing environment.

23 Sep 2016
17:03 PM

Vlad

Offtopic: I'd love to read your thoughts on this one: https://jeremydmiller.com/2016/09/23/why-you-should-give-marten-a-look-before-adopting-an-orm/ and this one: https://jeremydmiller.com/2016/08/18/moving-from-ravendb-to-marten/ Thanks

20 Oct 2016
15:02 PM

Eleasar

Hmm - where did it go? I got a review in my feed reader and it was available at this URL but it is gone now: https://ayende.com/blog/175553/ravendb-restorspective

It did talk about differences in approaches e.g. RavenDB focusing on not shooting yourself in the foot with ounbound result sets.

20 Oct 2016
15:17 PM

Oren Eini

Eleasar, That was a draft post that was accidentally published.

06 Dec 2016
14:09 PM

Eleasar

Has been a while that it was published shortly. Will this be published anytime soon? I mean i have it but i would like to see about discussions and experience from others. Thanks!

06 Dec 2016
18:13 PM

Oren Eini

Eleasar, This is going into RavenDB 4.0, we'll have a preview for that shortly.

08 Dec 2016
08:46 AM

Eleasar

Hi sorry - was not clear enough. I meant your article about comparing RavenDB to Marten that was published accidentally. If/when you plan to publish that article as i would be interested on the discussion and maybe your latest thoughts.

08 Dec 2016
08:50 AM

Oren Eini

Eleasar, That was something for our internal consumption, it wasn't intended to be externally published

Comment preview

Comments have been closed on this topic.

Oren Eini

Oren Eini

CEO of RavenDB