RavenDB 4.0: The etag simplification

Jun 13 2017

A seemingly small change in RavenDB 4.0 is the way we implement the etag. In RavenDB 3.x and earlier we used a 128-bit number, divided into 8 bits of type, 56 bits of restart counter and 64 bits of changes within the current restart period. Visually, this looks like a GUID: 01000000-0000-0018-0000-000000000002.
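To make the old layout concrete, here is a minimal Python sketch of packing those three fields into 128 bits and printing them in the GUID-like form. The function names are made up for illustration, and the field order (type first, then restarts, then changes) is an assumption read off the example value above, not taken from the actual Etag class.

```python
def pack_etag_3x(kind: int, restarts: int, changes: int) -> int:
    """Pack the three fields into one 128-bit integer (assumed layout)."""
    if not (0 <= kind < 2**8 and 0 <= restarts < 2**56 and 0 <= changes < 2**64):
        raise ValueError("field out of range")
    # 8 bits of type | 56 bits of restarts | 64 bits of changes
    return (kind << 120) | (restarts << 64) | changes


def format_etag_3x(value: int) -> str:
    """Render the 128-bit value as 32 hex digits grouped like a GUID."""
    h = f"{value:032x}"
    return "-".join((h[0:8], h[8:12], h[12:16], h[16:20], h[20:32]))


def unpack_etag_3x(text: str) -> tuple[int, int, int]:
    """Split a GUID-like etag string back into (type, restarts, changes)."""
    value = int(text.replace("-", ""), 16)
    return value >> 120, (value >> 64) & (2**56 - 1), value & (2**64 - 1)


print(format_etag_3x(pack_etag_3x(1, 0x18, 2)))
# 01000000-0000-0018-0000-000000000002
print(unpack_etag_3x("01000000-0000-0018-0000-000000000002"))
# (1, 24, 2)
```

Read this way, the example etag is type 1, after 24 restarts, at the second change since the last restart.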

The advantage of this format is that it is always increasing, very cheap to handle and requires very little persistent data. The disadvantage is that it is very big and not very human readable, and because the number of changes resets on every restart, you can't make meaningful deductions about the relative distance between any two etags.

In RavenDB 4.0 we shifted to using a single 64-bit number for all etag calculations. That means that we can just expose a long (no need for the Etag class), which is more natural for most usages. This decision also means that we need to store a lot less information, and etags are one of those things that we go over a lot. A really nice side effect, which was totally planned, is that we can now take two etags, subtract them and get a pretty good idea about the range that needs to be traversed.
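A small Python sketch of why the subtraction only became useful with the single counter. The helper name and the sample numbers are hypothetical, and treating the difference as an upper bound (rather than an exact count) is my assumption about how "a pretty good idea" should be read.

```python
def changes_between(older: int, newer: int) -> int:
    """Rough size of the etag range (older, newer] that a scan has to cover."""
    if newer < older:
        raise ValueError("etags must be passed oldest first")
    return newer - older


# 4.0 style: plain 64-bit counters, so the difference is directly meaningful.
print(changes_between(1_500, 2_750))  # 1250

# 3.x style: restarts live in the high bits, changes in the low 64 bits.
before_restart = (24 << 64) | 1_000_000  # a million changes, then a restart
after_restart = (25 << 64) | 2           # two changes after the restart
# The raw difference is a huge number unrelated to the amount of work done.
print(after_restart - before_restart == 2**64 - 999_998)  # True
```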

Another important decision is that everything uses the same etag range. So documents, revisions, attachments and everything else share the same etag sequence, which makes it very simple to scan through and find the relevant item based on a single number. This makes it very easy to implement replication, for example, because the wire protocol and persistence format remain the same.
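As a toy illustration of the shared range (this is not how RavenDB stores anything; the ToyStore class and its methods are invented for the example), a single counter hands out etags to every kind of item, so "give me everything after etag N" is one scan over one ordering:

```python
import itertools
from dataclasses import dataclass


@dataclass(frozen=True)
class Item:
    etag: int
    kind: str  # "document", "revision", "attachment", ...
    key: str


class ToyStore:
    """Hypothetical in-memory store with one etag counter for all item kinds."""

    def __init__(self) -> None:
        self._next_etag = itertools.count(1)
        self._items: list[Item] = []  # stays in etag order, etags only grow

    def put(self, kind: str, key: str) -> Item:
        item = Item(next(self._next_etag), kind, key)
        self._items.append(item)
        return item

    def items_after(self, etag: int) -> list[Item]:
        # A consumer (for example, replication) only has to remember one number.
        return [item for item in self._items if item.etag > etag]


store = ToyStore()
store.put("document", "users/1")
store.put("attachment", "users/1/photo.png")
store.put("revision", "users/1")
print([(i.etag, i.kind) for i in store.items_after(1)])
# [(2, 'attachment'), (3, 'revision')]
```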

I hadn't planned to write about this, since it seemed like too small a topic for a post, but there was some interest in it on the mailing list, and after enumerating all the reasons, it suddenly seems like it isn't such a small topic.

Update: I forgot to mention, a really important factor in this decision is that we can do this:

[Screenshot: indexing progress indicator]

So we can give detailed information and expected timeframes easily.
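A rough sketch of how such a progress report could be derived from nothing but etags. All names here are hypothetical, and the calculation assumes the work left is proportional to the etag distance, which is only an approximation.

```python
def indexing_progress(last_indexed_etag: int, last_etag: int, start_etag: int = 0) -> float:
    """Fraction (0.0 to 1.0) of the etag range the index has already covered."""
    total = last_etag - start_etag
    if total <= 0:
        return 1.0  # nothing to index
    done = min(max(last_indexed_etag - start_etag, 0), total)
    return done / total


def estimate_seconds_left(last_indexed_etag: int, last_etag: int, etags_per_second: float) -> float:
    """Expected time to catch up, given an observed (assumed steady) indexing rate."""
    if etags_per_second <= 0:
        raise ValueError("rate must be positive")
    remaining = max(last_etag - last_indexed_etag, 0)
    return remaining / etags_per_second


print(indexing_progress(250, 1000))          # 0.25
print(estimate_seconds_left(250, 1000, 50))  # 15.0
```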

Comments

14 Jun 2017
05:52 AM

Marc

Would this make it possible to get some sort of progress indicator for the indexing?

14 Jun 2017
06:12 AM

Oren Eini

Marc, yes, I totally forgot that we made this into a feature. I updated the post and you can see what this looks like.

14 Jun 2017
08:54 AM

XiniX00

Is it also exposed in the API (databases/Northwind/stats)? This way we could give an indication to the user when updating indexes, or adding them in a new release. Right now they are blind and have to wait (in our case) until all indexes are non-stale.

14 Jun 2017
09:56 AM

Oren Eini

Yes, this is exposed. You can ask for details on a per-index basis if needed.

14 Jun 2017
06:28 PM

Marc

This is great!

Oren Eini

CEO of RavenDB