Using GOTO in C#

time to read 2 min | 309 words

After talking about GOTO in C, I thought that I should point out some interesting use cases for using GOTO in C#. Naturally, since C# actually have proper methods for resource cleanups (IDisposable and using), the situation is quite different.

Here is one usage of GOTO in RavenDB’s codebase:

This is used for micro optimization purposes. The idea is that we put the hot spots of this code first, and only jump to the rare parts of the code if the list is full. This keep the size of the method very small, it allow us to inline it in many cases and can substantially improve performance.

Here is another example, which is a bit crazier:

As you can see, this is a piece of code that is full of gotos, and there is quite a bit of jumping around. The answer to why we are doing this is again, performance. In particular, this method is located in a very important hot spot in our code, as you can imagine. Let’s consider a common usage of this:

var val = ReadNumber(buffer, 2);

What would be the result of this call? Well, we asked the JIT to inline the method, and it is small enough that it would comply. We are also passing a constant to the method, so the JIT can simplify it further by checking the conditions. Here is the end result in assembly:

Of course, this is the best (and pretty common for us) case where we know what the size would be. If we have to send a variable, we need to include the checks, but that is still very small.

In other words, we use GOTO to direct as much as possible the actual output of the machine code, explicitly trying to be more friendly toward the machine at the expense of readability in favor of performance.

Tweet Share Share 15 comments

Tags:

Comments

27 Jun 2018
10:36 AM

Diego F.

I'd be interested in hearing why the first example would be preferred over this:

[MethodImpl(MethodImplOptions.AggressiveInlining)]
public void Add(T item)
{
    if (_size != _items.Length) 
    {
        _items[_size++] = item;
        _version++;
        return;
    }
    
    AddUnlikely(item, (int)_size + 1);
}

Or even a else condition.

27 Jun 2018
10:39 AM

Diego F.

I mean, it's not clear to me how it actually leads to a performance optimisation. :-)

27 Jun 2018
12:12 PM

Federico Lois

Hi Diego,

The underlying reason is you want to keep hot code sequentially to better use the frontend and avoid tripping over cache lines. Most of those optimizations are instances of a more general class of optimization known as code layout optimizations. I talked about it here: https://youtu.be/DD3w66Ff8Ms?t=20436

27 Jun 2018
13:01 PM

svick

Federico, but hot code is sequential in Diego's version of the code. Diego's code actually produces exactly the same IL as the original version.

So, using goto doesn't improve anything in this case.

27 Jun 2018
13:30 PM

svick

As for ReadNumber, if I use multiple returns instead of gotos, the assembly is the same in the constant case (at least on .Net Core 2.1).

In the variable case, I see different assembly than what's in the gist. And goto does produce more efficient assembly than multiple returns, but not by much (the only different is that where goto has je, multiple returns has jne jmp).

27 Jun 2018
14:25 PM

Federico Lois

Sorry, didn't notice the actual code difference in the first message.

That's a simple example, in that case, there is no difference (it used to have not long ago); but we have a much larger codebase with a much bigger surface and those are JIT based optimization that when the JIT improves they shouldn't exist; so all of them follow the same pattern in order to be able to roll them back when the issues that prevent the JIT to take the right choice get fixed (which in this case is the Unlikely part: https://github.com/dotnet/coreclr/issues/6024). For example, no long ago, having multiple returns would mess your code layout. You would use a GOTO to avoid code repetition (pop repetition), and as soon as we are sure noone is using those versions anymore they will get rolled back in bulk.

So the question of why we use one way and not the other is because of consistency; even though the goto-less version is equivalent.

27 Jun 2018
14:38 PM

Federico Lois

Moreover, the second code is more or less in the same venue. It has been fixed at 2.1 after a few PRs spanning the 1.1, 2.0 and 2.1 release. For our purposes, it is effectively solved, but we cannot roll it back until we do not need to support those targets anymore. First, they solved the marking the throw only method as cold code (which has a very important impact on highly inlined code), then they solved the multiple returns the goto Successful is not needed anymore and they went the extra mile to solve that also for loops (which are not showcased here).

28 Jun 2018
12:03 PM

Very uncommon in C# but a common practice in C++ (Linux kernel etc.). If one is surprised by such issues, here is an interesting blog post on this subject: http://250bpm.com/blog:6

28 Jun 2018
14:10 PM

Anthony Nichols

I still don't see the point to this - even in the 2nd example you can just return the value instead of using goto. It also means I don't have to scroll down to see what goto actually does.

In a more complex example you should just create a method that does what is in the goto sections. Nobody has ever been able to show me a good use case example of goto that either produced more readable code or more optimized without sacrificing readability.

28 Jun 2018
14:17 PM

Oren Eini

Anthony, Method call are costly, we want to avoid that.

28 Jun 2018
20:47 PM

Ivan

I agree with previous opinions, I'm not convinced that using a GOTO have any advantages over standard if/else workflow. I personally think that if/else is much easier to read and follow the structure. GOTO is a tool which might be useful in some cases (haven't seen any), but it's an odd one which much easier to misuse than to use properly.

29 Jun 2018
15:09 PM

Alex

As previously established, just because one .NET JIT implementation/version happens to produce the same machine code for an if and tail calls as it does for gotos, it doesn't mean that all will. When you've got a loop being run tens or hundreds of millions of times per second, eliminating individual instructions in a consistent and reliable manner across all supported versions of .NET is worth gold.

Readability is subjective. A 20-line method being inlined into a very tight loop is not somewhere readability is going to suffer if a few gotos are deployed judiciously.

And quite frankly, anyone who's scared off by a goto under those conditions has no business maintaining that code anyway.

30 Jun 2018
04:18 AM

Federico Lois

No goto has ever gone into the codebase without a vtune profiling run showing it improves by a decent enough amount. That some gotos are not needed anymore, on certain newer CLRs, doesn't preclude they are still needed in our codebase for the cases where a client is still using an older one (we have clients using RavenDB 2.5 still, which is 6 years old). And definitely no, the kind of codebase that should use goto out of necessity is far away from the usual code. And they are also a rarely seen in the wild kind; in most cases, they are locked down behind walls.

20 Jul 2018
15:00 PM

Mark Brents

In the ReadNumber, why not just replace the "goto Error" with the code from the Error label?

16 Aug 2018
19:04 PM

Federico Lois

@Mark Because the preparation work needed to throw an exception would interrupt the flow of execution and cause a cache miss. In that way you get a single jump instruction which will play nice with the instruction prefetcher.

Comment preview

Comments have been closed on this topic.

Markdown turns plain text formatting into fancy HTML formatting.

Phrase Emphasis

*italic*   **bold**
_italic_   __bold__

Links

Inline:

An [example](http://url.com/ "Title")

Reference-style labels (titles are optional):

An [example][id]. Then, anywhere
else in the doc, define the link:
  [id]: http://example.com/  "Title"

Images

Inline (titles are optional):

![alt text](/path/img.jpg "Title")

Reference-style:

![alt text][id]
[id]: /url/to/img.jpg "Title"

Headers

Setext-style:

Header 1
========
Header 2
--------

atx-style (closing #'s are optional):

# Header 1 #
## Header 2 ##
###### Header 6

Lists

Ordered, without paragraphs:

1.  Foo
2.  Bar

Unordered, with paragraphs:

*   A list item.
    With multiple paragraphs.
*   Bar

You can nest them:

*   Abacus
    * answer
*   Bubbles
    1.  bunk
    2.  bupkis
        * BELITTLER
    3. burper
*   Cunning

Blockquotes

> Email-style angle brackets
> are used for blockquotes.
> > And, they can be nested.
> #### Headers in blockquotes
> 
> * You can quote a list.
> * Etc.

Horizontal Rules

Three or more dashes or asterisks:

---
* * *
- - - -

Manual Line Breaks

End a line with two or more spaces:

Roses are red,   
Violets are blue.

Fenced Code Blocks

Code blocks delimited by 3 or more backticks or tildas:

```
This is a preformatted
code block
```

Header IDs

Set the id of headings with {#<id>} at end of heading line:

## My Heading {#myheading}

Tables

Fruit    |Color
---------|----------
Apples   |Red
Pears	 |Green
Bananas  |Yellow

Definition Lists

Term 1
: Definition 1
Term 2
: Definition 2

Footnotes

Body text with a footnote [^1]
[^1]: Footnote text here

Abbreviations

MDD <- will have title
*[MDD]: MarkdownDeep

Oren Eini

Oren Eini

CEO of RavenDB

Using GOTO in C#

Comments

Comment preview

FUTURE POSTS

RECENT SERIES

RECENT COMMENTS

Syndication

Main feed
Comments feed

Oren Eini

CEO of RavenDB

Comments

Comment preview

Markdown formatting

Phrase Emphasis

Links

Images

Headers

Lists

Blockquotes

Horizontal Rules

Manual Line Breaks

Fenced Code Blocks

Header IDs

Tables

Definition Lists

Footnotes

Abbreviations

FUTURE POSTS

RECENT SERIES

RECENT COMMENTS

Syndication