Buffer allocation strategies: Explaining the solution
In my previous post, I threw a bunch of code at you, with no explanation, and asked you to discuss it.
Here is the code, with full discussion below.
[ThreadStatic]
private static Stack<byte[]>[] _buffersBySize;

private static byte[] GetBuffer(int requestedSize)
{
    if (_buffersBySize == null)
        _buffersBySize = new Stack<byte[]>[32];

    var actualSize = PowerOfTwo(requestedSize);
    var pos = MostSignificantBit(actualSize);
    if (_buffersBySize[pos] == null)
        _buffersBySize[pos] = new Stack<byte[]>();

    if (_buffersBySize[pos].Count == 0)
        return new byte[actualSize];

    return _buffersBySize[pos].Pop();
}

private static void ReturnBuffer(byte[] buffer)
{
    var actualSize = PowerOfTwo(buffer.Length);
    if (actualSize != buffer.Length)
        return; // can't put a buffer of strange size here (probably an error)

    if (_buffersBySize == null)
        _buffersBySize = new Stack<byte[]>[32];

    var pos = MostSignificantBit(actualSize);
    if (_buffersBySize[pos] == null)
        _buffersBySize[pos] = new Stack<byte[]>();

    _buffersBySize[pos].Push(buffer);
}
There are a couple of interesting things going on here. First, we round allocations up to a power of two, which reduces the number of different sizes we have to deal with. We store the buffers in a small array of stacks, using the most significant bit of the rounded size to index into the array.
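The code relies on two helpers, PowerOfTwo and MostSignificantBit, that aren't shown. Here is a minimal sketch of what they might look like (my reconstruction, not the post's actual implementation):

```csharp
// Rounds up to the next power of two, e.g. 5000 -> 8192 (4096 stays 4096).
private static int PowerOfTwo(int size)
{
    if (size <= 0)
        return 1;
    size--;
    size |= size >> 1;
    size |= size >> 2;
    size |= size >> 4;
    size |= size >> 8;
    size |= size >> 16;
    return size + 1;
}

// Index of the highest set bit, e.g. 8192 -> 13; used to index the stacks array.
private static int MostSignificantBit(int value)
{
    var pos = -1;
    while (value != 0)
    {
        value >>= 1;
        pos++;
    }
    return pos;
}
```

Because sizes are powers of two, MostSignificantBit gives each size class its own slot, and 32 slots cover every possible int-sized buffer.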
In practice, we'll use a very small number of sizes, typically 4KB – 32KB. The basic idea is that you pull an array from the pool; if a suitable one is available, we save an allocation. If not, we allocate a new one and hand it to the user.
Once we have given the user a buffer, we don't keep track of it. If they return it to us, great; if not, the GC will clean it up. This is important: if we did track handed-out buffers, forgetting to call ReturnBuffer would create what is effectively a memory leak.
Another thing to notice is that we don't require the same thread to get and return a buffer. It is fine to get it on one thread and return it on another, which means async code that hops between threads will still work with this buffer pool. We also use a stack so that the most recently returned buffer is handed out first, keeping it warm in the CPU cache.
Note that this is notepad code, so there are probably issues with it.
In fact, there is a big issue here that will only show up in particular usage patterns. Can you see it? I’ll talk about it in my next post.
More posts in "Buffer allocation strategies" series:
- (09 Sep 2015) Bad usage patterns
- (08 Sep 2015) Explaining the solution
- (07 Sep 2015) A possible solution
Comments
"This means that async code will work well with thread hopping and this buffer pool."
I assume by "work well" you mean "be quite random in nature and cost more memory than you'd expect". IO completion ports have their own dedicated threads in the thread pool, which means that continuations from IO completion routines (socket / file IO) might release buffers into thread statics that are never acquirable by a normal worker thread. Besides that, the solution is very non-deterministic in its effectiveness in async code and is bound to leave us asking "why does my memory footprint seem to grow to infinity?"
See: https://msdn.microsoft.com/en-us/library/system.threading.threadpool.setmaxthreads(v=vs.110).aspx
Yes, if the set of allocating threads and the set of releasing threads aren't the same then the {releasing, non-allocating} threads are now a memory leak. That could be somewhat dealt with by setting a cap on the number of buffers that you keep in the pool.
But if you implement that generally, you lose the locality that you liked - unless you also switch to deques instead of stacks so that, on hitting the limit, you can discard the LRU buffer, rather than the current one (in the release method).
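The cap idea from the comment above could be sketched as a self-contained variant of the post's pool (the limit of 16 buffers per size class is an arbitrary number I picked, and the helper implementations are my own reconstruction):

```csharp
using System;
using System.Collections.Generic;

static class CappedBufferPool
{
    private const int MaxBuffersPerSize = 16; // arbitrary cap; tune for your workload

    [ThreadStatic] private static Stack<byte[]>[] _buffersBySize;

    private static int PowerOfTwo(int size) // round up to the next power of two
    {
        var result = 1;
        while (result < size)
            result <<= 1;
        return result;
    }

    private static int MostSignificantBit(int value) // index of highest set bit
    {
        var pos = -1;
        while (value != 0) { value >>= 1; pos++; }
        return pos;
    }

    public static byte[] GetBuffer(int requestedSize)
    {
        if (_buffersBySize == null)
            _buffersBySize = new Stack<byte[]>[32];
        var actualSize = PowerOfTwo(requestedSize);
        var pos = MostSignificantBit(actualSize);
        if (_buffersBySize[pos] == null)
            _buffersBySize[pos] = new Stack<byte[]>();
        if (_buffersBySize[pos].Count == 0)
            return new byte[actualSize];
        return _buffersBySize[pos].Pop();
    }

    public static void ReturnBuffer(byte[] buffer)
    {
        var actualSize = PowerOfTwo(buffer.Length);
        if (actualSize != buffer.Length)
            return;
        if (_buffersBySize == null)
            _buffersBySize = new Stack<byte[]>[32];
        var pos = MostSignificantBit(actualSize);
        if (_buffersBySize[pos] == null)
            _buffersBySize[pos] = new Stack<byte[]>();
        // Once the cap is hit, drop the buffer and let the GC reclaim it,
        // bounding how much memory a releasing-only thread can pin.
        if (_buffersBySize[pos].Count >= MaxBuffersPerSize)
            return;
        _buffersBySize[pos].Push(buffer);
    }

    public static int PooledCount(int size) =>
        _buffersBySize == null || _buffersBySize[MostSignificantBit(PowerOfTwo(size))] == null
            ? 0
            : _buffersBySize[MostSignificantBit(PowerOfTwo(size))].Count;
}
```

Note this drops the newest buffer on overflow; the LRU/deque variant from the comment would instead discard the oldest, preserving cache locality for the hot buffers.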
Some things worth looking into for improving Async support for this over ThreadStatic, depending on what version of the framework you're targeting and what platforms you need to run on:
https://msdn.microsoft.com/en-us/library/system.runtime.remoting.messaging.callcontext.logicalgetdata(v=vs.110).aspx https://msdn.microsoft.com/en-us/library/dn906268(v=vs.110).aspx
To get around the issue Damien is talking about (one thread allocates and another releases), I would think about replacing
[ThreadStatic] private static Stack<byte[]>[]
with
System.Collections.Concurrent.ConcurrentBag<byte[]>[]
I have no idea about your specific use case or your special performance requirements, but I've read about the concurrent collections in the past; they are known to be mostly lock-free, and they use thread-local AND global storage under the covers, so (at least in my head) they will generally perform better than your thread-local stacks.
Yes, large issues are:
1) If all allocations are done from a set of threads different from the ones releasing, you have a hard effective memory leak that will grow without bounds
2) If memory is usually used in bursts - for example, threads use very large buffers at startup, then release them and only a few small buffers are needed after that - your memory usage will be much higher than required, because the memory used by your pool during all of runtime will be equal to the maximum ever needed at once.
3) as noted above, I/O threads get their own pool, so if all allocations are done from the worker pool and then released in the I/O pool, you have an increasing amount of memory used and never reused.
4) In the odd case where you're not using the thread pool (or any type of thread pooling), this code will be useless. Hopefully this is a non-issue in most cases.
5) Finally, you'll have to audit all your code (and check any framework code you're using) to ensure you're not relying on buffer.Length. Because you're not clearing the contents of the buffer, it could be a random bug factory at best, and a security vulnerability at worst.
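To illustrate point 5: because the pool rounds requests up, a buffer is usually larger than what was asked for, so callers must track the number of valid bytes themselves instead of trusting buffer.Length. A small standalone demo of the pattern (MemoryStream stands in for real I/O, and the 8192-byte array stands in for a pooled buffer):

```csharp
using System;
using System.IO;

class LengthPitfallDemo
{
    static void Main()
    {
        // A pooled buffer for a 5000-byte request is actually 8192 bytes, so
        // anything past the data read is stale content from a previous user.
        var buffer = new byte[8192]; // stands in for GetBuffer(5000)
        var stream = new MemoryStream(new byte[] { 1, 2, 3, 4, 5 });
        var read = stream.Read(buffer, 0, buffer.Length); // read == 5, not 8192
        // Pass the valid slice explicitly instead of the whole array.
        var view = new ArraySegment<byte>(buffer, 0, read);
        Console.WriteLine(view.Count); // prints 5
    }
}
```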
Oh yeah, you get 250 IO threads by default, so that means, if you allocate, say, 256MB in an IO callback every so often, once your application has been up long enough, you'll have allocated 64GB of memory. Not ideal.
What if somebody returns MAX_INT number of tiny sized buffers because they happen to read a very large number of tiny files and then they never exhibit that usage pattern again? Oompf. Whatever wraps this data structure should periodically expire buffers when they idle too long and it should use a minimum core pool size defined by the user (or a sane default) to keep some buffers idle and at the ready, but let the GC take care of the rest.
Eli, The whole point of buffer pool is that you take and return buffers from it. To take 2 billion buffers and not return them is something very strange.
Truly. I was just using shorthand for "really big number" here. The point was that pools usually have bounds to prevent callers from doing outrageous things.
Eli, If you are allocating a lot of small buffers, you want us to pool them. Otherwise you are going to force the GC to do a LOT of work.