Find the bug: RavenDB HiLo implementation
The following code is part of the RavenDB HiLo implementation:
private long NextId()
{
    long incrementedCurrentLow = Interlocked.Increment(ref currentLo);
    if (incrementedCurrentLow > capacity)
    {
        lock (generatorLock)
        {
            if (Thread.VolatileRead(ref currentLo) > capacity)
            {
                currentHi = GetNextHi();
                currentLo = 1;
                incrementedCurrentLow = 1;
            }
        }
    }
    return (currentHi - 1) * capacity + incrementedCurrentLow;
}
It contains a bug. Can you see it? It took me a long time to figure out, I am ashamed to say.
BTW, you can safely assume that GetNextHi is correct.
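To experiment with the snippet above, here is a hypothetical self-contained repro: the `NextId()` from the post wrapped in a class, with assumed field declarations and a stand-in `GetNextHi()` (the real one talks to the server and is assumed correct, per the post), plus a harness that hammers it from many threads. Any duplicate ID it reports is the bug manifesting.

```csharp
using System;
using System.Collections.Concurrent;
using System.Collections.Generic;
using System.Threading;
using System.Threading.Tasks;

class HiLoGenerator
{
    // Field names and initial values are assumptions; GetNextHi is a stand-in.
    private long currentLo;
    private long currentHi = 1;
    private long nextHi = 1;
    private readonly long capacity = 32;
    private readonly object generatorLock = new object();

    private long GetNextHi() => Interlocked.Increment(ref nextHi);

    public long NextId()
    {
        long incrementedCurrentLow = Interlocked.Increment(ref currentLo);
        if (incrementedCurrentLow > capacity)
        {
            lock (generatorLock)
            {
                if (Thread.VolatileRead(ref currentLo) > capacity)
                {
                    currentHi = GetNextHi();
                    currentLo = 1;
                    incrementedCurrentLow = 1;
                }
            }
        }
        return (currentHi - 1) * capacity + incrementedCurrentLow;
    }
}

class StressTest
{
    static void Main()
    {
        var generator = new HiLoGenerator();
        var seen = new ConcurrentDictionary<long, bool>();
        long duplicates = 0;

        // Hammer NextId() from many threads; any duplicate ID proves the bug.
        Parallel.For(0, 100_000, _ =>
        {
            if (!seen.TryAdd(generator.NextId(), true))
                Interlocked.Increment(ref duplicates);
        });

        Console.WriteLine(duplicates == 0 ? "no duplicates" : duplicates + " duplicate ids");
    }
}
```

Single-threaded, the code is perfectly correct; the duplicates only show up under contention, which is what makes the bug so easy to miss.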
More posts in "Find the bug" series:
- (29 Feb 2016) When you can't rely on your own identity
- (05 Jan 2016) The case of the degrading system–Answer
- (04 Jan 2016) The case of the degrading system
- (11 Sep 2015) The concurrent memory buster
- (20 Apr 2011) Why do I get a Null Reference Exception?
- (25 Nov 2010) A broken tree
- (13 Aug 2010) RavenDB HiLo implementation
- (25 Jul 2010) Accidental code reviews
Comments
The if statement within the lock block is missing an else clause that gets a new incrementedCurrentLow. The if statement detects whether another thread has already gotten a new high, but then the already invalidated incrementedCurrentLow is used to calculate the ID, which results in the same ID being generated more than once.
When 2 threads are waiting at lock (generatorLock), the second thread will get the same Id as the first thread.
// Ryan
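A sketch of the fix Ryan describes (field names assumed, `GetNextHi` a stand-in): when a thread reaches the lock, it must take a fresh low value under the lock instead of reusing the stale one it incremented before the range was reset.

```csharp
using System;
using System.Threading;

class HiLoGeneratorFixed
{
    // Assumed declarations; GetNextHi stands in for the real server call.
    private long currentLo;
    private long currentHi = 1;
    private long nextHi = 1;
    private readonly long capacity = 32;
    private readonly object generatorLock = new object();

    private long GetNextHi() => Interlocked.Increment(ref nextHi);

    public long NextId()
    {
        long incrementedCurrentLow = Interlocked.Increment(ref currentLo);
        if (incrementedCurrentLow > capacity)
        {
            lock (generatorLock)
            {
                if (Thread.VolatileRead(ref currentLo) > capacity)
                {
                    currentHi = GetNextHi();
                    currentLo = 0; // reset to 0 so the re-increment below yields 1
                }
                // The missing clause Ryan points out, generalized: always take
                // a fresh low under the lock so it matches the current high.
                incrementedCurrentLow = Interlocked.Increment(ref currentLo);
                return (currentHi - 1) * capacity + incrementedCurrentLow;
            }
        }
        return (currentHi - 1) * capacity + incrementedCurrentLow;
    }
}
```

Note that this only addresses the stale-low reuse; the fast path still reads currentHi without synchronization, which is the separate issue raised in the next comment.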
There's no synchronization on currentHi or capacity.
The result is calculated from these outside of the lock, and hence may not be thread safe.
If the bug is not in the HiLo algorithm and has to do with the language itself I would change the code to use Monitor.Enter's overload that takes a boolean parameter indicating if the lock was actually taken in case of an exception. For volatile fields I would use the Interlocked constructs to ensure that increment/decrement will be done in atomic fashion.
Boolean lockTaken = false;
long incrementedCurrentLow = Interlocked.Increment(ref currentLo);
It's useless to use Interlocked.
Actually, you enter the lock twice in case of counter adjustments, and you enter the lock anyway.
I'm really not sure Interlocked.Increment is faster than Monitor.Enter().
If two threads increment together while you are around capacity, they both do GetNextHi().
oops. I was wrong about two GetNextHi().
Someone will get an overflowed incrementedCurrentLow and two threads will get the same Id.
If currentLo is a long, then currentLo = 1 is not atomic on a 32-bit machine; this does not play well with the atomic increment outside the lock.
If currentLo == capacity - 1, and two threads enter NextId() simultaneously, the first one to do the Interlocked.Increment will have incrementedCurrentLow == capacity, while it could have the same currentHi as the second thread.
If a thread context switch is made for the first thread between the increment and the return, the second thread has the opportunity to do a GetNextHi() in between.
Assuming currentHi is a volatile field, the return for the first thread will get the currentHi as the second thread set it. If it's not a volatile field, well then all bets are off for what the value of currentHi is.
@Andrew Borodin
Interlocked.Increment is a lot faster than using a Monitor. When two threads attempt to increment simultaneously, the Monitor will cause a context switch for the unlucky thread, which you don't want. And all that just because you were a few processor cycles too soon or too late compared to the other thread.
If you want to make multithreaded code faster, use volatile and Interlocked instead of Monitor. Be prepared to deal with the added complexity (and therefore development time) that such lock free code brings with it.
And as Oren showed with his example, it's sometimes very hard, if not impossible, to get (partially) lock free correct.
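A rough micro-benchmark sketch of the Interlocked-vs-Monitor difference Patrick describes (the numbers vary by machine, contention level, and Parallel.For scheduling overhead, so treat it as illustrative only):

```csharp
using System;
using System.Diagnostics;
using System.Threading;
using System.Threading.Tasks;

class CounterComparison
{
    public static long interlockedCounter;
    public static long lockedCounter;
    static readonly object counterLock = new object();

    public static void Main()
    {
        const int iterations = 1_000_000;

        // Lock-free increment: a single atomic CPU instruction per call.
        var sw = Stopwatch.StartNew();
        Parallel.For(0, iterations, _ => Interlocked.Increment(ref interlockedCounter));
        Console.WriteLine("Interlocked: " + sw.ElapsedMilliseconds + " ms");

        // Monitor-based increment: contended acquires can block the thread.
        sw.Restart();
        Parallel.For(0, iterations, _ => { lock (counterLock) { lockedCounter++; } });
        Console.WriteLine("Monitor:     " + sw.ElapsedMilliseconds + " ms");
    }
}
```

Both variants produce the same final count; the difference is only in how threads wait when they collide.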
Oren,
You don't have to do a Thread.VolatileRead inside a lock. Monitor.Enter already generates a MemoryBarrier, which will guarantee you get the latest value of currentLo.
I see two ways of making NextId() correct and lock free.
so the method NextId() would become "return Interlocked.Increment(ref currentId);"
the property CurrentLo would become "get { return CurrentId % capacity; }"
and the property CurrentHi would become "get { return CurrentId / capacity + 1; }"
I will leave the lock free set { } implementations for CurrentLo, CurrentHi and Capacity as an exercise for the reader. ;-)
Of course there's always the option of using a big honking lock that wraps the entire NextId() method. :-)
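Patrick's single-counter idea could be sketched as below. The "- 1 ... + 1" adjustment is my assumption, added so CurrentLo stays in the range 1..capacity (as in the original code) rather than 0..capacity-1:

```csharp
using System.Threading;

class SingleCounterHiLo
{
    private long currentId;               // the only piece of mutable state
    private readonly long capacity = 32;  // assumed value for illustration

    // The whole of NextId collapses to one atomic increment.
    public long NextId() => Interlocked.Increment(ref currentId);

    // Lo and Hi become derived, read-only views of the single counter.
    public long CurrentLo => (Interlocked.Read(ref currentId) - 1) % capacity + 1;
    public long CurrentHi => (Interlocked.Read(ref currentId) - 1) / capacity + 1;
}
```

Note this trades away the point of HiLo (reserving a range of low values per server round-trip): Hi is derived locally rather than obtained from the server, so the lock-free setters Patrick leaves as an exercise would still need to coordinate with the server somehow.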
whoops, it seems I lost all the < and > in the above code.
Take a look at the html source if you want to know the secret ingredients of LockFreeUpdate(). I don't feel like making it readable myself. :-P
The problem sits with the "if (Thread.VolatileRead(ref currentLo) > capacity)" line.
What happens if it is no longer > capacity when, as stated earlier, 2 threads make the call to Interlocked.Increment when it is equal to capacity and both enter the lock?
To follow on from my above comment: if "(Thread.VolatileRead(ref currentLo) > capacity)" is no longer true when the second thread enters, then it will simply use the value it has in incrementedCurrentLow, which is the wrong value, as it was meant to be reset.
That just doesn't seem to read right. You're incrementing currentLo in an atomic operation so it cannot be interrupted by a thread swap. That makes sense, but then afterwards you are taking a lock and inspecting the incremented value to see if it's rolled over the capacity.
This looks to me that a race could result in-between the increment and the generator lock.
My guess would be that this would do the job:
long result;
lock (generatorLock)
{
    // everything, including the capacity check, happens under the lock
    currentLo++;
    if (currentLo > capacity)
    {
        currentHi = GetNextHi();
        currentLo = 1;
    }
    result = (currentHi - 1) * capacity + currentLo;
}
return result;
The bug may be that while you are guarding the increment as an atomic unit, the operation to calculate the result isn't guarded.
@Steve Py,
But in your implementation it will lock every time you generate a new ID. Ayende's implementation will only lock when generating a new high value. The real issue (I think) is what @Frank said. If the current thread is the second one to hit the lock, it will not end up executing the code in the if (incrementedCurrentLow > capacity) block so it will use the stale value of incrementedCurrentLow with the newly updated value of CurrentHi. Later on, another thread will eventually request the same ID.
@Patrick Huizinga:
A compare-and-swap implementation of Interlocked.Increment() could be slower. It can be easily googled.
Many concurrent CAS operations on different CPUs may never agree. It would be better for someone to give up his CPU in a spinlock.
Of course, it's arguable; it's just not the case we deal with. Anyway, the aggregate CPU time this code runs for is a lot smaller than what we are talking about. Monitor.Enter() here will never be a performance bottleneck.
@Andrew Borodin
I know the CAS is slower than a simple increment. I didn't mean to replace just the 'increment Lo' with the CAS, but the entire NextId method, including checking capacity and generating a new Hi.
I didn't know concurrent CAS operations could result in a livelock. I always assumed the CPU would guarantee at least one winner.
And as far as I know the SpinLock was introduced in .Net 4.0.
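A guess at the shape of Patrick's LockFreeUpdate (whose body the comment form ate along with the angle brackets): the classic CAS retry loop. Each failed CompareExchange means another thread won and we must recompute, which is where the contention cost Andrew describes comes from.

```csharp
using System;
using System.Threading;

static class LockFreeUpdateSketch
{
    // Read a snapshot, compute the new value, and attempt to swap it in;
    // retry whenever another thread changed the location in between.
    public static long Update(ref long location, Func<long, long> transform)
    {
        while (true)
        {
            long snapshot = Interlocked.Read(ref location);
            long updated = transform(snapshot);
            if (Interlocked.CompareExchange(ref location, updated, snapshot) == snapshot)
                return updated; // our swap won; no other thread interfered
        }
    }
}
```

The transform must be side-effect free, since it may run more than once per successful update.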