World’s Smallest No SQL Database: Concurrency

architecture (612) rss
bugs (451) rss
challanges (123) rss
community (380) rss
databases (481) rss
design (895) rss
development (642) rss
hibernating-practices (71) rss
miscellaneous (592) rss
performance (397) rss
programming (1085) rss
raven (1450) rss
ravendb.net (534) rss
reviews (184) rss

2025
- June (7)
- May (10)
- April (10)
- March (10)
- February (7)
- January (12)
2024
- December (3)
- November (2)
- October (1)
- September (3)
- August (5)
- July (10)
- June (4)
- May (6)
- April (2)
- March (8)
- February (2)
- January (14)
2023
- December (4)
- October (4)
- September (6)
- August (12)
- July (5)
- June (15)
- May (3)
- April (11)
- March (5)
- February (5)
- January (8)
2022
- December (5)
- November (7)
- October (7)
- September (9)
- August (10)
- July (15)
- June (12)
- May (9)
- April (14)
- March (15)
- February (13)
- January (16)
2021
- December (23)
- November (20)
- October (16)
- September (6)
- August (16)
- July (11)
- June (16)
- May (4)
- April (10)
- March (11)
- February (15)
- January (14)
2020
- December (10)
- November (13)
- October (15)
- September (6)
- August (9)
- July (9)
- June (17)
- May (15)
- April (14)
- March (21)
- February (16)
- January (13)
2019
- December (17)
- November (14)
- October (16)
- September (10)
- August (8)
- July (16)
- June (11)
- May (13)
- April (18)
- March (12)
- February (19)
- January (23)
2018
- December (15)
- November (14)
- October (19)
- September (18)
- August (23)
- July (20)
- June (20)
- May (23)
- April (15)
- March (23)
- February (19)
- January (23)
2017
- December (21)
- November (24)
- October (22)
- September (21)
- August (23)
- July (21)
- June (24)
- May (21)
- April (21)
- March (23)
- February (20)
- January (23)
2016
- December (17)
- November (18)
- October (22)
- September (18)
- August (23)
- July (22)
- June (17)
- May (24)
- April (16)
- March (16)
- February (21)
- January (21)
2015
- December (5)
- November (10)
- October (9)
- September (17)
- August (20)
- July (17)
- June (4)
- May (12)
- April (9)
- March (8)
- February (25)
- January (17)
2014
- December (22)
- November (19)
- October (21)
- September (37)
- August (24)
- July (23)
- June (13)
- May (19)
- April (24)
- March (23)
- February (21)
- January (24)
2013
- December (23)
- November (29)
- October (27)
- September (26)
- August (24)
- July (24)
- June (23)
- May (25)
- April (26)
- March (24)
- February (24)
- January (21)
2012
- December (19)
- November (22)
- October (27)
- September (24)
- August (30)
- July (23)
- June (25)
- May (23)
- April (25)
- March (25)
- February (28)
- January (24)
2011
- December (17)
- November (14)
- October (24)
- September (28)
- August (27)
- July (30)
- June (19)
- May (16)
- April (30)
- March (23)
- February (11)
- January (26)
2010
- December (29)
- November (28)
- October (35)
- September (33)
- August (44)
- July (17)
- June (20)
- May (53)
- April (29)
- March (35)
- February (33)
- January (36)
2009
- December (37)
- November (35)
- October (53)
- September (60)
- August (66)
- July (29)
- June (24)
- May (52)
- April (63)
- March (35)
- February (53)
- January (50)
2008
- December (58)
- November (65)
- October (46)
- September (48)
- August (96)
- July (87)
- June (45)
- May (51)
- April (52)
- March (70)
- February (43)
- January (49)
2007
- December (100)
- November (52)
- October (109)
- September (68)
- August (80)
- July (56)
- June (150)
- May (115)
- April (73)
- March (124)
- February (102)
- January (68)
2006
- December (95)
- November (53)
- October (120)
- September (57)
- August (88)
- July (54)
- June (103)
- May (89)
- April (84)
- March (143)
- February (78)
- January (64)
2005
- December (70)
- November (97)
- October (91)
- September (61)
- August (74)
- July (92)
- June (100)
- May (53)
- April (42)
- March (41)
- February (84)
- January (31)
2004
- December (49)
- November (26)
- October (26)
- September (6)
- April (10)

RavenDB - High-Performance NoSQL Document Database

Jul 11 2013

World’s Smallest No SQL DatabaseConcurrency

time to read 14 min | 2667 words

I am pretty sure that it would surprise you, but the World’s Smallest No SQL Database has a well defined concurrency model. Basically, it is using Last Write Wins. And we are safe from any concurrency issues. That is pretty much it, right?

Well, not really. In a real world system, you actually need to do a lot more with concurrency. Some obvious examples:

Create this value only if it doesn’t exists already.
Update this value only if it didn’t change since I last saw it.

Implementing those is actually going to be pretty simple. All you need to do is to have a metadata field, version, that is incremented on every change. Here is the change that we need to make:

   1: public class Data

   2: {

   3:     public byte[] Value;

   4:     public int Version;

   5: }

6:

   7: static readonly ConcurrentDictionary<string, Data> data =

   8:    new ConcurrentDictionary<string, Data>(StringComparer.InvariantCultureIgnoreCase);

9:

  10:  public HttpResponseMessage Get(string key)

  11:  {

  12:      Data value;

  13:      if(data.TryGetValue(key, out value) == false)

  14:          return new HttpResponseMessage(HttpStatusCode.NotFound);

15:

  16:      return new HttpResponseMessage

  17:          {

  18:              Headers = { "Version", value.Version },

  19:             Content = new ByteArrayContent(value.Value)

  20:          };

  21:  }

22:

  23: public void Put(string key, [FromBody]byte[] value, int version)

  24: {

  25:     data.AddOrUpdate(key, () =>

  26:     { // create

  27:        if(version != 0)

  28:            throw new ConcurrencyException();

  29:        return new Data{ Value = value, Version = 1 };

  30:     }, (_, prev) =>

  31:     { // update

  32:         if(prev.Version != version)

  33:           throw new ConcurrencyException();

  34:         return new Data{ Value = value, Version = prev.Version +1 };

  35:     });

  36: }

As you can see, it merely doubled the amount of code that we had to write, but it is pretty obvious how it works. RavenDB actually uses something very similar to that for concurrency control for writes, although the RavenDB ETag mechanism is alos doing a lot more.

But the version system that we have above is actually not enough, it only handle concurrency control for updates. What about concurrency controls for reads?

In particular, how are we going to handle non repeatable reads or phantom reads?

Non repeatable reads happen when you are reading a value, it is then deleted, and when you try to read it again, it is gone.
Phantom read is the other way around, first you tried, but didn’t find anything, then it was created, and you read it again and find it.

This is actually interesting, because you only care about those for the duration of a single operation / transaction / session. As it stand now, we actually have no way to handle either issue. This can lead to… interesting bugs that only happen under very specific scenarios.

With RavenDB, we actually handle both cases. In a session lifetime, you are guaranteed that if you saw a document, you’ll continue to see this document until the end of the session, which deals with the issue of non repeatable read. Conversely, if you didn’t see a document, you will continue to not see it until the session is closed. This is done for Load, queries are a little bit different.

Another aspect of concurrency that we need to deal with is Locking. Sometimes a user has a really good reason why they want to lock a record for a period of time. This is pretty much the only way to handle “checking out” of a record in a scenario where you have to multiple users wanting to make changes to a record concurrently. Locks can be Write Or ReadWrite locks. A Write lock allows users to read the data, but prevent them from changing that. When used in practice, this is usually going to immediately fail an operation, rather than make you wait for it.

The reasoning behind immediate fail for write is that if you encountered a record with a write lock, it means that it was either already written to or is about to be written to. At that case, your write is going to be operating on stale data, so we might was well fail you immediately. For ReadWrite locks, the situation is a bit different. In this case, we want to also prevent readers from moving on. This is usually done to ensure consistent state system wise, and basically, any operation on the record would have to wait until the lock is removed.

In practice,ReadWrite locks can cause a lot of issues. The moment that you have people start placing locks, you have to deal with lock expiration, manual unlocking, abandoned lock detection, lock maintenance, etc. About the only thing that they are good for is to allow the user to make a set of changes and present them as one unit, if we don’t have better transaction support. But I’ll discuss that in another post. In the meantime, just keep in mind that from my point of view, ReadWrite locks are pretty useless all around.

Tweet Share Share 16 comments

Tags:

nosql

Comments

11 Jul 2013
10:18 AM

Jacob Rohde

Great series! Love it.

11 Jul 2013
11:22 AM

Tucaz

Loving the series. Very instructive, but I would like to see how you test it and make sure it does what it says it does. Thank you!

11 Jul 2013
11:39 AM

tobi

The ConcurrentDictionary strikes again! Multiple concurrent put-update calls can happen at the same time for the same key because the user delegate is not being called under a lock. This can cause different data to be read for the same version.

11 Jul 2013
12:14 PM

Patrick Huizinga

@tobi,

If that happens, the ConcurrentDictionary will call the update delegate again and again until it can finally replace the _prev_ you got with the updated value you returned.

And unless your update delegate has side effects, no one will be any wiser.

Also I'm not sure what you mean with "This can cause different data to be read for the same version." I don't see how that could happen with Ayende's code.

11 Jul 2013
12:14 PM

Ayende Rahien

Tobi, To my knowledge, only one of those updates will actually be made visible. So you won't see different data for the same version.

11 Jul 2013
12:16 PM

Patrick Huizinga

great, apparently both underscores and asterisks make your text italic.

Let's see %if% I %%can%% $find$ $$a$$ way #to# ##make## text __bold__.

11 Jul 2013
16:19 PM

Ryan Heath

Hmm, maybe Tobi is on to something ...

I think its possible to get to the point that two threads try to update the same version. One of them will win but the other will never know it has lost. Ie no concurrencyException is thrown.

// Ryan

11 Jul 2013
16:35 PM

Ayende Rahien

Ryan, The calling code doesn't actually care. Even if the value is there, it can't assume that it will stay there (another request may come in).

11 Jul 2013
16:57 PM

Ryan Heath

Hmm, are we on the same page?

I am talking about two threads trying to update with the same version. One of them should/must get the concurrencyException but I think it's possible, due to the impl of concurrentdictionary, both think they succeeded.

// Ryan

11 Jul 2013
16:59 PM

Ayende Rahien

No, that won't happen. The CD will take care or retrying the second one.

12 Jul 2013
08:47 AM

Ryan Heath

I did some testing and can confirm the update will work indeed! Seems the impl of concurrentdictionary has some versioning bookkeeping too :)

However, when two threads want to insert the same key at the precise same time, one of them will win and the other will not know his insert has been overwritten. I do not see a way how to fix that since the dictionary will not expose the data of the other thread yet ...

// Ryan

12 Jul 2013
09:26 AM

Matt Warren

Take a look at AddOrUpdate in Reflector, it looks like this:

do
{
    TValue local3;
    while (this.TryGetValue(key, out local3))
    {
        local = updateValueFactory(key, local3);
        if (this.TryUpdate(key, local, local3))
        {
            return local;
        }
    }
    local = addValueFactory(key);
}
while (!this.TryAddInternal(key, local, false, true, out local2));
return local2;

When it tries an update (the inner while loop) it will only succeeded if another Update hasn't happened in the mean time as it uses TryUpdate(.., local3) to write it back at the end.

For the thread that fails, the UpdateFunc will then get called a 2nd time and the code that Ayende has ("if(prev.Version != version)") will be triggered as the current value (prev in this case) will be different.

12 Jul 2013
09:44 AM

Patrick Huizinga

Ryan,

I decompiled CD.AddOrUpdate and this is basically what happens:

do { while (!this.TryGetValue(key, out comparisonValue)) { // attempt to add, and return if successful } newValue = updateValueFactory(key, comparisonValue); } while (!this.TryUpdate(key, newValue, comparisonValue));

Strangely the remarks of this method only mention that addValueFactory will be called as many times as needed.

12 Jul 2013
09:49 AM

Patrick Huizinga

sigh defeated by the blog formatting again. Ayende, could you please implement a preview function?

Looking at Matt's post, it's funny that Reflector and dotPeek (which I used) apparently decompiled the loops the other way around. Or maybe the implementation changed in between versions (I looked at v4.0.30319).

12 Jul 2013
13:36 PM

Ryan Heath

Matt & Patrick,

Yup, that behavior was exposed by my tests. The delegates are called for a second time when needed.

Ayende's code works as expected with updates. However with inserts there is no way to determine when to throw a concurrentyException because of a second run of the delegate.

// Ryan

12 Jul 2013
13:43 PM

Ryan Heath

I take that back, inserts works as good as updates. My tests for inserts had a bug ...

// Ryan

Comment preview

Comments have been closed on this topic.

Markdown turns plain text formatting into fancy HTML formatting.

Phrase Emphasis

*italic*   **bold**
_italic_   __bold__

Links

Inline:

An [example](http://url.com/ "Title")

Reference-style labels (titles are optional):

An [example][id]. Then, anywhere
else in the doc, define the link:
  [id]: http://example.com/  "Title"

Images

Inline (titles are optional):

![alt text](/path/img.jpg "Title")

Reference-style:

![alt text][id]
[id]: /url/to/img.jpg "Title"

Headers

Setext-style:

Header 1
========
Header 2
--------

atx-style (closing #'s are optional):

# Header 1 #
## Header 2 ##
###### Header 6

Lists

Ordered, without paragraphs:

1.  Foo
2.  Bar

Unordered, with paragraphs:

*   A list item.
    With multiple paragraphs.
*   Bar

You can nest them:

*   Abacus
    * answer
*   Bubbles
    1.  bunk
    2.  bupkis
        * BELITTLER
    3. burper
*   Cunning

Blockquotes

> Email-style angle brackets
> are used for blockquotes.
> > And, they can be nested.
> #### Headers in blockquotes
> 
> * You can quote a list.
> * Etc.

Horizontal Rules

Three or more dashes or asterisks:

---
* * *
- - - -

Manual Line Breaks

End a line with two or more spaces:

Roses are red,   
Violets are blue.

Fenced Code Blocks

Code blocks delimited by 3 or more backticks or tildas:

```
This is a preformatted
code block
```

Header IDs

Set the id of headings with {#<id>} at end of heading line:

## My Heading {#myheading}

Tables

Fruit    |Color
---------|----------
Apples   |Red
Pears	 |Green
Bananas  |Yellow

Definition Lists

Term 1
: Definition 1
Term 2
: Definition 2

Footnotes

Body text with a footnote [^1]
[^1]: Footnote text here

Abbreviations

MDD <- will have title
*[MDD]: MarkdownDeep

Oren Eini

Oren Eini

CEO of RavenDB

World’s Smallest No SQL DatabaseConcurrency

More posts in "World’s Smallest No SQL Database" series:

Comments

Comment preview

FUTURE POSTS

RECENT SERIES

RECENT COMMENTS

Syndication

Main feed
Comments feed

Oren Eini

CEO of RavenDB

More posts in "World’s Smallest No SQL Database" series:

Comments

Comment preview

Markdown formatting

Phrase Emphasis

Links

Images

Headers

Lists

Blockquotes

Horizontal Rules

Manual Line Breaks

Fenced Code Blocks

Header IDs

Tables

Definition Lists

Footnotes

Abbreviations

FUTURE POSTS

RECENT SERIES

RECENT COMMENTS

Syndication