Raven Situational Awareness
There is a whole set of features that require collaboration from a set of servers. For example, when talking about auto scaling scenarios, you really want the servers to figure things out on their own, without needing administrators to hold their hands and murmur sweet nothings at 3 AM.
We needed this feature in Raven DB, Raven MQ and probably in Raven FS, so I sat down and thought about what is actually needed and whether I could package that in a re-usable form. I have been on a roll for the last few days, and something that I estimated would take a week or two took me about six hours, all told.
At any rate, I realized that the important parts of this feature set are the ability to detect siblings on the same network, the ability to detect failure of those siblings, and the ability to dynamically select the master node. The code is available here: https://github.com/hibernating-rhinos/Raven.SituationalAwareness under the AGPL license. If you want to use this code commercially, please contact me for commercial licensing arrangements.
Let us see what is actually involved here:
var presence = new Presence("clusters/commerce", new Dictionary<string, string>
    {
        { "RavenDB-Endpoint", new UriBuilder("http", Environment.MachineName, 8080).Uri.ToString() }
    }, TimeSpan.FromSeconds(3));

presence.TopologyChanged += (sender, nodeMetadata) =>
{
    switch (nodeMetadata.ChangeType)
    {
        case TopologyChangeType.MasterSelected:
            Console.WriteLine("Master selected {0}", nodeMetadata.Uri);
            break;
        case TopologyChangeType.Discovered:
            Console.WriteLine("Found {0}", nodeMetadata.Uri);
            break;
        case TopologyChangeType.Gone:
            Console.WriteLine("Oh no, {0} is gone!", nodeMetadata.Uri);
            break;
        default:
            throw new ArgumentOutOfRangeException();
    }
};

presence.Start();
As you can see, we are talking about a single class that is exposed to your code. You need to provide the cluster name, which allows us to run multiple clusters on the same network without conflicts. (For example, in the code above, we have a set of servers for the commerce service, and another for the billing service, etc.) Each node also exposes metadata to the entire cluster. In the code above, we share the endpoint of our RavenDB server. The TimeSpan parameter determines the heartbeat frequency for the cluster (how often it will check for failing nodes).
We have a single event that we can subscribe to, which lets us know about changes in the system topology. Discovered and Gone are pretty self-explanatory, I think. But MasterSelected is more interesting.
After automatically discovering all the siblings on the network, Raven Situational Awareness will use the Paxos algorithm to decide who should be the master. The MasterSelected event is raised when a quorum of the nodes selects a master. You can then proceed with your own logic based on that. If the master fails, the nodes will convene again and the quorum will select a new master.
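For instance, a node will usually want to know whether it is the one that was just selected. The following is only a sketch, not something the library ships: it assumes that the Uri on the event argument identifies the selected node, so we can compare it against the endpoint we announced when we started.

// Sketch only: assumes nodeMetadata.Uri identifies the node the quorum picked.
var myEndpoint = new UriBuilder("http", Environment.MachineName, 8080).Uri;

presence.TopologyChanged += (sender, nodeMetadata) =>
{
    if (nodeMetadata.ChangeType != TopologyChangeType.MasterSelected)
        return;

    // Compare as strings to avoid assuming the exact type of nodeMetadata.Uri.
    if (nodeMetadata.Uri.ToString() == myEndpoint.ToString())
        Console.WriteLine("This node was selected as master, taking on coordination duties");
    else
        Console.WriteLine("Deferring to the master at {0}", nodeMetadata.Uri);
};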
With the network topology detection and the master selection out of the way (and all of that with due consideration for failure conditions), the task of actually implementing a distributed server system just became significantly easier.
Comments
How is the split-brain ( http://en.wikipedia.org/wiki/Split-brain_(Computing) ) problem dealt with by Raven.SituationalAwareness?
Is the implementation of the Paxos protocol internal to the framework, or is it based on an external dependency?
Roja,
Nodes will be notified about the split and about the joining. Masters will be assigned for each split, and when the split is merged, they will be merged as well.
What they do about it is outside the scope of RSA and in the realm of the actual server running it.
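To make that concrete, here is one purely illustrative policy the consuming server could apply on top of the events RSA already raises: track how many siblings it currently sees, and treat a master selection with suspicion when it can only see a minority of the largest cluster it has observed. None of this is part of the library; it is just a sketch of the kind of decision that lives in the server.

// Illustrative only; this policy belongs to the application, not to RSA.
var knownSiblings = new ConcurrentDictionary<string, bool>();
var largestClusterSeen = 1; // counting this node

presence.TopologyChanged += (sender, nodeMetadata) =>
{
    switch (nodeMetadata.ChangeType)
    {
        case TopologyChangeType.Discovered:
            knownSiblings[nodeMetadata.Uri.ToString()] = true;
            largestClusterSeen = Math.Max(largestClusterSeen, knownSiblings.Count + 1);
            break;
        case TopologyChangeType.Gone:
            bool removed;
            knownSiblings.TryRemove(nodeMetadata.Uri.ToString(), out removed);
            break;
        case TopologyChangeType.MasterSelected:
            var seeMajority = (knownSiblings.Count + 1) * 2 > largestClusterSeen;
            if (seeMajority == false)
                Console.WriteLine("Master {0} selected, but we only see a minority of the cluster; holding off on writes", nodeMetadata.Uri);
            break;
    }
};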
Paxos is implemented internally, solely for the purpose of master detection
There is a threading bug here: the timer can fire on multiple threads at the same time (for example, if your callback takes over 3 seconds to run). Your callback must be thread-safe, probably by just ignoring concurrent timer events. Also, after disposing the timer, events will still be delivered, potentially many times, because they might have queued up on the thread pool. I ran into this once and solved it with the following timer wrapper class, which I license to you without restrictions:
The callback executes non-concurrently and stops firing once the Timer is disposed. After disposal only a single superfluous callback run can happen.
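A minimal sketch of a wrapper with those properties (a hypothetical SafeTimer built on System.Threading.Timer, not the exact class referred to above) could look like this:

using System;
using System.Threading;

// Hypothetical sketch: callbacks never overlap (late ticks are skipped), and
// once Dispose() returns the user callback does not run again.
public class SafeTimer : IDisposable
{
    private readonly Timer timer;
    private readonly Action callback;
    private readonly object sync = new object();
    private bool disposed;

    public SafeTimer(Action callback, TimeSpan period)
    {
        this.callback = callback;
        timer = new Timer(OnTick, null, period, period);
    }

    private void OnTick(object state)
    {
        // If a previous tick is still running, skip this one rather than
        // invoking the callback concurrently.
        if (Monitor.TryEnter(sync) == false)
            return;
        try
        {
            if (disposed)
                return;
            callback();
        }
        finally
        {
            Monitor.Exit(sync);
        }
    }

    public void Dispose()
    {
        // Taking the lock waits for a currently running callback to finish;
        // ticks already queued on the thread pool will see disposed == true
        // and return without invoking the user callback.
        lock (sync)
        {
            disposed = true;
            timer.Dispose();
        }
    }
}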
Tobi,
Where is the bug? Are you talking about the usage of the Timer in there?
There is no bug here. We might be running concurrently, but the code is safe for multi-threaded access.
Is topologyState accessed thread-safely? It is being accessed and modified in the callback concurrently. I must say that I did not completely digest the code, but this "smelled" of a threading bug.
topologyState is a ConcurrentDictionary
I cannot find anything concrete that is wrong with the code so it was probably a false alarm.
I know it's a breaking change, but you have a typo in the namespace and the project files:
Raven.SituationaAwareness is missing an "l".
@hangy
This is a standard trick. You cannot copyright common words but you can copyright misspelled ones :)
Google uses Paxos in its BigTable implementation. I came across this not via Wikipedia, but by reading the BigTable paper. Pretty interesting.
Hi Ayende, could you share with us how you tested it? I find it very interesting.
Hi Ayende,
Why are you using UDP with WCF? Isn't that scenario a typical one for Rhino ESB? Discovery, heartbeat, etc. This all screams Rhino ESB ;)
Daniel
Daniel,
Discovery & heartbeat have nothing to do with Rhino ESB
I thought this scenario was ideal to be integrated with Rhino ESB and messaging...