Code review challenge: The concurrent dictionary refactoring
In a recent code review, I modified the following code:
_freeSegments.AddOrUpdate(memoryDataForPointer.SizeInBytes,
    x =>
    {
        var newQueue = new ConcurrentQueue<AllocatedMemoryData>();
        newQueue.Enqueue(memoryDataForPointer);
        return newQueue;
    },
    (x, queue) =>
    {
        queue.Enqueue(memoryDataForPointer);
        return queue;
    });
Into this code:
var q = _freeSegments.GetOrAdd(memoryDataForPointer.SizeInBytes,
size => new ConcurrentQueue<AllocatedMemoryData>());
q.Enqueue(memoryDataForPointer);
Can you tell me why?
More posts in "Code review challenge" series:
- (31 Dec 2015) The concurrent dictionary refactoring–answer
- (30 Dec 2015) The concurrent dictionary refactoring
Comments
The create action can be called multiple times concurrently, and then all but one of the results will be discarded?
In the first code, the update function can be called multiple times, and then the same item is queued multiple times.
The first code would have been correct if you had been using an ImmutableQueue.
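For illustration, here is a minimal sketch of that idea (the field declaration, the class names, and the int key type are my own assumptions, not code from the post): with an immutable value, both callbacks only build new values and have no side effects, so any extra invocations whose results the dictionary discards are harmless.

using System;
using System.Collections.Concurrent;
using System.Collections.Immutable;

class AllocatedMemoryData { public int SizeInBytes; }

class FreeList
{
    // Assumed declaration; the post never shows the field's type.
    private readonly ConcurrentDictionary<int, ImmutableQueue<AllocatedMemoryData>> _freeSegments =
        new ConcurrentDictionary<int, ImmutableQueue<AllocatedMemoryData>>();

    public void Return(AllocatedMemoryData memoryDataForPointer)
    {
        _freeSegments.AddOrUpdate(memoryDataForPointer.SizeInBytes,
            _ => ImmutableQueue.Create(memoryDataForPointer),     // build a fresh one-element queue
            (_, queue) => queue.Enqueue(memoryDataForPointer));   // Enqueue returns a new queue
    }
}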
The documentation for ConcurrentDictionary notes that its atomicity guarantees do not extend to callbacks. It also notes that the add callback may be called multiple times and that not all the values returned will necessarily get added to the dictionary. It does not say anything about the update method, so it's not entirely clear whether that might see multiple calls, but if that is a possibility, it's important to move the call to Enqueue out of that callback.
(I'm guessing this is probably not the answer though, because the documentation around the race conditions that afflict AddOrUpdate and GetOrAdd are entirely concerned with calls to the Add callback whose results are ultimately abandoned, and not the update callback. I'd be surprised if you ever did see a double update callback in practice - if there is a scenario in which that happens, it's not evident from the documentation.)
Improved readability, reduced duplication. Depending on how things are called, you could have a modified closure around memoryDataForPointer iirc.
Other people have probably answered the real issue, but the GetOrAdd version has the added benefit of one less allocation for delegates, in fact since the delegate in question captures no state it can be allocated just once. Meaning only the queue will be allocated if it did not exist already.
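As a sketch of that point (the names here are mine, and newer C# compilers already cache non-capturing lambdas on their own), the factory can be hoisted into a static field so the delegate is created once, and the only allocation on a miss is the queue itself:

private static readonly Func<int, ConcurrentQueue<AllocatedMemoryData>> CreateQueue =
    size => new ConcurrentQueue<AllocatedMemoryData>();

public void Return(AllocatedMemoryData memoryDataForPointer)
{
    // GetOrAdd only invokes CreateQueue when the key is missing.
    var q = _freeSegments.GetOrAdd(memoryDataForPointer.SizeInBytes, CreateQueue);
    q.Enqueue(memoryDataForPointer);
}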
There is no need to lock the dictionary for the duration of the update.
It's evident that you want to add a queue to the dictionary if it's not there, or to update the content of the queue that is already in the dictionary.
The second variant of the code has a performance benefit for two reasons.
First: there is no need to hold a lock for the duration of updating the value. Since your code is not replacing the reference, and since the object in question is a ConcurrentQueue, we are free to release the dictionary before updating the queue.
Second: the lock duration for adding a value can be shortened by adding just an empty queue. We can release the dictionary and insert new items into the ConcurrentQueue afterwards.
Wouldn't it be even more performant to do the following?
var newQueue = new ConcurrentQueue<AllocatedMemoryData>();
var q = _freeSegments.GetOrAdd(memoryDataForPointer.SizeInBytes, newQueue);
q.Enqueue(memoryDataForPointer);
My guess is that in that case we save a method invocation when adding and hold the lock for a shorter time.
After inspecting ConcurrentDictionary with ILSpy, I see that it doesn't actually hold locks while executing delegates.
So I'm correcting myself. The first version of the code locks the dictionary in either case, whether a value is being added or updated. The second version locks the dictionary only when adding a new key/value pair. When updating, there is no need for locking; update concurrency is handled by the ConcurrentQueue itself.
Three reasons I can see immediately.
1) Removal of duplicate code.
2) The less work performed in a concurrent function, the less time resources have to be locked.
3) Since ConcurrentQueue is itself concurrent, there is no need to use Enqueue() from within an atomic callback.
Cheers.
The Add delegate will not be synchronized by the ConcurrentDictionary which can cause multiple instances of the newQueue variable to be created in multiple threads.
Rafal, That is an issue, yes, but it isn't an important one. It might cause a few allocations, but not significant ones. That is a relatively rare occurrence, and it won't actually impact the system.
Patrick, You are correct in principle, but that isn't actually how it works. See the code here: http://referencesource.microsoft.com/#mscorlib/system/Collections/Concurrent/ConcurrentDictionary.cs,1148
In this case, we are returning the same value, so that will result in an always successful update. So this ends up behaving correctly, although that is not ensured by the contract, mind.
Note that this is still doing a lot more work than we want it to do, because the TryUpdate needs to take a lock: http://referencesource.microsoft.com/#mscorlib/system/Collections/Concurrent/ConcurrentDictionary.cs,562
This is typically an uncontended lock, though
HarryDev, Yes, that is what we are aiming at. This method has no locks, no allocations, and it is very cheap
Nikola, There are no locks held during the update or add. And your code will require us to do an extra allocation (of a relatively large object) on each call
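To make that contrast concrete (a sketch with an assumed size variable, not code from the post): passing a value to GetOrAdd means the queue is constructed on every call, even when the key already exists, while passing a factory defers construction to the miss path.

// Allocates a new ConcurrentQueue on every call, even when one already exists for size:
var q1 = _freeSegments.GetOrAdd(size, new ConcurrentQueue<AllocatedMemoryData>());

// Allocates only when the key is missing (possibly more than once under a race,
// with the losers discarded):
var q2 = _freeSegments.GetOrAdd(size, _ => new ConcurrentQueue<AllocatedMemoryData>());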
Because it's easier to read.
Updating with the same value is pointless. When a key exists in the dictionary, you don't want to associate a different queue with the key; in other words, you don't want to update the dictionary entry. What you really want is to add a new entry to the queue associated with the key, so AddOrUpdate makes no sense. Additionally, it's more complex. The refactored code is to the point, simpler, more readable, and I bet it's faster.
I see two reasons for making the change. 1. SRP violation in the old AddOrUpdate() version. 2. Duplicate code for adding a new item to the dictionary.