The hidden costs of allocations


I mentioned in a previous post that we have a code quality gateway to make sure that all logging statements are wrapped in an if statement, to reduce the number of allocations when the user has logging turned off. This is done because logging can be expensive, and is often turned off, so there is no point in paying a penalty for something that isn’t in use.

There seems to be some confusion about why this is done. Let us assume that we have the following logging code:

void Debug(string msg)
{
	if(IsDebugEnabled)
		Console.WriteLine(msg);
}

void Debug(string format, params object[] args)
{
	if(IsDebugEnabled)
		Console.WriteLine(format, args);
}

void Debug(Func<string> generateMsg)
{
	if(IsDebugEnabled)
		Console.WriteLine(generateMsg());
}

Now, the obvious bad example would be to use:

Debug("Hello "+ user.Name);

That is going to allocate a new string, and it will do so regardless of whether logging is enabled or not. On high-frequency call sites, this can end up allocating a lot of useless garbage.
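To make the point concrete, here is a minimal sketch that counts how often the message gets built while logging is off. The EagerArgumentDemo class, the MessagesBuilt counter and the BuildGreeting helper are made up for illustration; BuildGreeting just stands in for the inline "Hello " + user.Name expression.

```csharp
using System;

public static class EagerArgumentDemo
{
    public static bool IsDebugEnabled;   // logging is off by default
    public static int MessagesBuilt;     // counts how often the message string was built

    public static void Debug(string msg)
    {
        if (IsDebugEnabled)
            Console.WriteLine(msg);
    }

    // Hypothetical helper standing in for the inline "Hello " + user.Name expression.
    public static string BuildGreeting(string userName)
    {
        MessagesBuilt++;
        return "Hello " + userName;
    }

    public static void Greet(string userName)
    {
        // The argument is evaluated before Debug is entered,
        // so the string is built even though it is never written.
        Debug(BuildGreeting(userName));
    }
}
```

After a call to Greet with logging disabled, MessagesBuilt is already incremented, even though nothing was written.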

So we will move to this mode:

Debug("Hello {0}", user.Name);

And we saved the allocation, right? Except that this actually generates the following code. Do you see the allocation now?

Debug("Hello {0}", new object[] { user.Name });

So let us introduce a better option, shall we? We’ll add a few common overloads without the use of params.

void Debug(string format, object p1);
void Debug(string format, object p1, object p2);
void Debug(string format, object p1, object p2, object p3);

And now we saved the allocation. Unless…

int requestNumber = ...;
Debug("Request # {0}", requestNumber);

Do you see the allocation now? We pass an int to an object parameter, which requires boxing, and boxing is an allocation.

So let us try using the lambda method. This way nothing is executed!
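If you want to convince yourself that boxing really creates a new object every time, here is a small sketch. BoxingDemo and BoxesAreDistinct are names I made up for the example, and the Debug overload is an empty stand-in with the same shape as the one above.

```csharp
using System;

public static class BoxingDemo
{
    // Same shape as the Debug(string, object) overload above; the body is irrelevant here.
    public static void Debug(string format, object p1) { }

    public static bool BoxesAreDistinct(int requestNumber)
    {
        // Each conversion from int to object copies the value into a new heap object.
        object first = requestNumber;
        object second = requestNumber;
        return !ReferenceEquals(first, second);
    }

    public static void Caller(int requestNumber)
    {
        // What the compiler effectively emits for Debug("Request # {0}", requestNumber):
        Debug("Request # {0}", (object)requestNumber);
    }
}
```

The two boxed values are different objects, which is another way of saying that each call site that passes an int as object pays for one allocation per call.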

Debug(() => "Hello " + user.Name);

Except… this is actually translated to:

var loggerMsg = new LoggerMessage(user);
Func<string> func = new Func<string>(loggerMsg.Write);
Debug(func);

Here are all the allocations.
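Spelled out in full, a hand-written equivalent might look roughly like the sketch below. The names are hypothetical: the real compiler-generated closure class has a name you cannot type in C#, and User, LambdaDemo and the Invocations counter exist only to make the example compile on its own.

```csharp
using System;

public sealed class User
{
    public string Name = "";
}

// Hand-written stand-in for the compiler-generated closure class.
public sealed class LoggerMessage
{
    public User user;

    public LoggerMessage(User user) { this.user = user; }

    public string Write() { return "Hello " + user.Name; }
}

public static class LambdaDemo
{
    public static bool IsDebugEnabled;  // logging is off by default
    public static int Invocations;      // counts how often the lambda body actually ran

    public static void Debug(Func<string> generateMsg)
    {
        if (IsDebugEnabled)
        {
            Invocations++;
            Console.WriteLine(generateMsg());
        }
    }

    public static void Greet(User user)
    {
        var loggerMsg = new LoggerMessage(user);                // allocation 1: the closure object
        Func<string> func = new Func<string>(loggerMsg.Write);  // allocation 2: the delegate
        Debug(func);                                            // the body never runs when logging is off
    }
}
```

So the lambda does keep the string concatenation from running, but we still pay for two objects on every call.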

There is another issue with logging via lambdas. Consider the following code:

void Index(JsonDocument[] docs)
{
	var batch = new IndexBatchStats();
	database.Stats.Add(batch);// long lived
	batch.Completed += () => database.Stats.IncrmentCompletedBatches(batch);
	Log(() => "Indexing " + docs.Length + " documents");
}

You might notice that we have two lambdas here. C# optimizes the number of types generated, and will generally output a single type for all the lifted members of all the lambdas in the method. This means that we have:

void Index(JsonDocument[] docs)
{
	var batch = new IndexBatchStats();
	database.Stats.Add(batch);// long lived

	var args = new { database, batch, docs }; // all lifted members

	batch.Completed += (args) => args.database.Stats.IncrmentCompletedBatches(args.batch);
	Log((args) => "Indexing " + args.docs.Length + " documents");
}

As you can see, we have a long lived lambda, which we think is using only other long lived objects (database & batch), but is actually holding a reference to the docs, which are VERY large.
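One way around this is to give the logging lambda a method of its own, so it gets its own closure. The sketch below assumes stub types (JsonDocument, IndexBatchStats, DatabaseStats, Indexer) that I invented to keep it self-contained, and Completed is a plain delegate field rather than an event to keep things short.

```csharp
using System;
using System.Collections.Generic;

public sealed class JsonDocument { }

public sealed class IndexBatchStats
{
    public Action Completed; // plain delegate field so the sketch stays short
}

public sealed class DatabaseStats
{
    public readonly List<IndexBatchStats> Batches = new List<IndexBatchStats>();
    public int CompletedBatches;

    public void Add(IndexBatchStats batch) { Batches.Add(batch); }
    public void IncrmentCompletedBatches(IndexBatchStats batch) { CompletedBatches++; }
}

public sealed class Indexer
{
    public readonly DatabaseStats Stats = new DatabaseStats();
    public static bool IsDebugEnabled;

    public IndexBatchStats Index(JsonDocument[] docs)
    {
        var batch = new IndexBatchStats();
        Stats.Add(batch); // long lived

        // The only lambda in this method, so its closure captures just this and batch.
        batch.Completed += () => Stats.IncrmentCompletedBatches(batch);

        LogIndexing(docs);
        return batch;
    }

    // The logging lambda lives in its own method and gets its own closure over docs,
    // which is not referenced by anything long lived.
    private static void LogIndexing(JsonDocument[] docs)
    {
        Log(() => "Indexing " + docs.Length + " documents");
    }

    private static void Log(Func<string> generateMsg)
    {
        if (IsDebugEnabled)
            Console.WriteLine(generateMsg());
    }
}
```

With this split, the closure behind batch.Completed no longer has a field for docs, so the long lived batch does not keep the documents alive.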

Except for the last issue, which requires moving the logging to a separate method to avoid this optimization, all of the issues outlined above can be handled by explicitly checking IsDebugEnabled in the calling code.
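Here is a minimal sketch of what the guarded call site looks like. GuardedLog, RequestHandler and the Calls counter are hypothetical names; the counter is only there so you can check that Debug is never entered while logging is off.

```csharp
using System;

public static class GuardedLog
{
    // Assumed to be a cheap flag read.
    public static bool IsDebugEnabled { get; set; }

    public static int Calls; // counts how many times Debug was entered, for illustration only

    public static void Debug(string format, params object[] args)
    {
        Calls++;
        if (IsDebugEnabled)
            Console.WriteLine(format, args);
    }
}

public static class RequestHandler
{
    public static void Handle(int requestNumber, string userName)
    {
        // With logging off, nothing inside the if runs: no params array,
        // no boxing of requestNumber, no string concatenation.
        if (GuardedLog.IsDebugEnabled)
            GuardedLog.Debug("Request # {0} from {1}", requestNumber, "user " + userName);
    }
}
```

The cost when logging is disabled is a single boolean check, no matter how expensive the arguments are to prepare.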

And this is why we require it.


Tags:

development

Comments

18 Nov 2015
07:19 AM

Rafal

Thanks for the clarification - I actually thought params just pretends to put arguments in an array, but it actually does ;) BTW, after all these rounds of low-level optimizations, don't you think the CLR is only standing in the way (at least for the crucial low-level stuff)?

18 Nov 2015
07:22 AM

Oren Eini

Rafal, There are similar tradeoffs in pretty much any language / platform. Those kinds of things aren't too hard for us to deal with. What is painful is that we don't get a better way to control memory. For example, I would really love this: https://github.com/dotnet/coreclr/issues/1235

18 Nov 2015
10:32 AM

Ryan Heath

You seem to say it in the last sentence but would it not be better to wrap all your debugcalls into an if debugEnabled? Solves all those gotchas.

int requestNumber = ...;
if (Debug.IsEnabled)
{
	Debug("Request # {0}", requestNumber);
}

// Ryan

18 Nov 2015
10:33 AM

Ryan Heath

Whoah, formatting got screwed up ...

// Ryan

18 Nov 2015
10:53 AM

Jan

What about generics with overloading? Instead of

void Debug(string format, object p1);
void Debug(string format, object p1, object p2);

use

void Debug<T1>(string format, T1 p1);
void Debug<T1, T2>(string format, T1 p1, T2 p2);

? As far as I know, code will be bigger (because e.g. primitive data types like int, double need their own generated code), but this should avoid allocations for boxing...

18 Nov 2015
11:04 AM

meow

I wonder if this is a good use case for ConditionalAttribute (https://msdn.microsoft.com/en-us/library/aa664622(v=vs.71).aspx)? It seems that it gives you all the performance benefits of if (isDebug) and is also much easier and less cumbersome to use.

On the other hand, enabling/disabling logging must be done at compile time (and you need a full rebuild).

18 Nov 2015
11:23 AM

Viacheslav Ivanov

What about Debug("Request # {0}", requestNumber.ToString());

18 Nov 2015
11:45 AM

Oren Eini

Ryan, Yes, this is a continuation of the previous post, which said that we have a check for this sort of thing

18 Nov 2015
11:47 AM

Oren Eini

Jan, You can do it with generics, yes, but it has its own costs, as you noted. It also doesn't help if the things you need are more complex.

If I need to compute something to log it, for example

18 Nov 2015
11:48 AM

Oren Eini

meow, Conditional is great, but re-compiling is not something that we can allow

18 Nov 2015
13:28 PM

Stan

Have you considered Fody for doing this work instead of manually adding log level checks to code?

18 Nov 2015
14:21 PM

Matt Warren

If you want a way to pick up these "hidden allocations" whilst writing/reading code, you should check out the "Roslyn Clr Heap Allocation Analyzer" https://github.com/mjsabby/RoslynClrHeapAllocationAnalyzer.

I know you can profile your code and see them that way, but sometimes it's easier to see the issues when you are writing the code.

18 Nov 2015
15:30 PM

Oren Eini

Stan, Yes, we did consider that, but Fody doesn't help for the more complex stuff. We have logging stuff that takes multiple statements to prepare

18 Nov 2015
15:30 PM

Oren Eini

Matt, We use a R# addon that does the same

23 Nov 2015
21:05 PM

Kurt

Or use a real logging library that short circuits and early exits any logging calls if the log level is lower than the desired one. It basically does what Ryan Heath said.

24 Nov 2015
12:40 PM

Oren Eini

Kurt, Um... nope. What we are showing here is the cost of calling the log library, not the cost of the log library itself

24 Nov 2015
13:39 PM

Kurt

I know what you meant by this post. For instance, Boost::Logging short circuits any calling costs.

For example, in pseudo-code:

logger.set_level(info);
logger.debug_msg( "sleep for 3 seconds %d", sleep(3) ); // this returns instantly, doesn't sleep

24 Nov 2015
13:50 PM

Oren Eini

Kurt, Is this done during compilation, or at runtime? If the latter, how is debug_msg declared?

12 Jan 2016
18:38 PM

Nathan

Good write up! I don't quite follow the section below however.

var loggerMsg = new LoggerMessage(user);
Func<string> func = new Func<string>(loggerMsg.Write);
Debug(() => return "Hello " + user.Name);

Why would C# initialize a function that it won't be using anywhere? I'm a bit uncertain why this LoggerMessage class is being created too. Surely it would do something like this instead?

Func<string> func = () => "Hello " + user.Name;
Debug(func);

13 Jan 2016
00:35 AM

Oren Eini

Nathan, That was a typo on my part, I fixed it in the post


Oren Eini

CEO of RavenDB