Rhino Divan DB – Design - Ayende @ Rahien

Mar 01 2010

Rhino Divan DB – Design

time to read 3 min | 474 words

One of the things that I wanted to do with RDB is to create an explicit actor model inside the codebase. I have been using a similar structure inside NH Prof, and it has been quite successful. The design goals for RDB is:

Assumptions for the database cosntruction

Get / Put / Delete semantics for Json documents.

All those operations can access batches of documents to work on. Those operations fully implement ACID. Which means that if you got a successful response for a document Put, you can rely on the document always being there.

Those operations should be considered cheap.

Reboot / crash resistant

The DB can crash / restart, but no lose of functionality may occur, but as soon as it restarts, everything goes on as usual. There can be no in memory data structures / work that cannot be recovered from persistent structure.

Views for searching

The DB use views, defined using linq expressions, for supporting search capabilities. Those views are background indexed (so no holding up request processing for views). When you get a result from a queue you always know if the result is stale or not.

Adding a view to an existing database is a cheap operation, regardless of the database size. During view construction, the view can be queried (but its results will be considered stale). Reboot during view construction will not impact the construction process.

Indexing a document twice is a stable operation, which means that a view can always choose to re-index things if it so choose.

Overall design

RDB stores two major pieces of information in transactional storage.

Documents, obviously, which are stored in a format that allows to send the document content to the user quickly, and tasks.

Tasks are how RDB maintains state over crashes / reboots, and they also form the base of async work of the database. Any work that is going to take some time for the database to perform is written to transactional storage as a task. Those tasks are things like: “View ‘peopleByName’ should index documents 1 – 42'”.

There are background threads working of off this tasks queue, performing the work and removing the task when they are completed.

The results of each view is written to a Lucene index (one per view).

So far i have the entire structure done, I need to some polishing, and I have a different OSS strategy to go with, but thinks are looking good.

Tweet Share Share 18 comments

Tags:

Rhino DivanDB

Comments

01 Mar 2010
10:21 AM

Rafal

Can you justify using Esent as a transactional store? Many applications will use a relational database along with the 'nosql' document db, so it would be much simpler to use the same database server for storing both relational data and DivanDB documents. It would simplify many tasks like administration, backup/restore, debugging etc.

01 Mar 2010
11:12 AM

Ayende Rahien

Rafal,

I want it to be a single thing, not something that relies on a lot of external components, requires complex installation, etc.

Administration - there should be none

Backup/Restore - esentutl.exe is already part of windows

Debugging - you don't do that, because the server has no logic

Index checking - there is Luke

01 Mar 2010
11:48 AM

j23tom

hmm ... does it mean that this project will never run under linux/mono ?

01 Mar 2010
13:12 PM

j23tom,

It would, when someone would port the storage to BDB

01 Mar 2010
13:18 PM

Ben

Ayende, is the code available to start poking around?

01 Mar 2010
13:31 PM

Steve

Good stuff

Let's say I use this for my 'command storage' in a command-query pattern, one piece that would be essential to me would be to figure out how to update my query database (let's say MS SQL)

01 Mar 2010
13:39 PM

Ben,

Not at the moment, we are working on a different release plan

01 Mar 2010
13:40 PM

You have a message dispatched to a consumer whenever a command is executed?

01 Mar 2010
14:53 PM

Jan Limpens

Ayende, under which license do you plan to release all this? Or is this a commercial project?

01 Mar 2010
16:06 PM

Thanks Ayende. Hurry up ;) I was all set to start with CouchDB or MongoDB but this sounds like it'd be a better fit for what I need. Like Jan, I'm curious on your release plan... if you're going dual license, commercial, purely oss, what.

01 Mar 2010
16:12 PM

Set

Does it support multiple documents commit?

01 Mar 2010
16:38 PM

Vadim Kantorov

I've heard some ramblings that there's a database size limit in Esent. Is there really?

01 Mar 2010
16:59 PM

Vadim,

Individual columns can be up to 2GB in size. A database can be up to 16TB in size.

Copied from: blogs.msdn.com/.../...-api-in-the-windows-sdk.aspx

So, yes, there is a limit but, from a practical standpoint, probably a livable one for most apps.

01 Mar 2010
18:41 PM

Jan,

This will be OSS, but I am not sure under what license.

01 Mar 2010
18:45 PM

I am willing to give source access in exchange for work on the project.

Set,

Not at the moment, it will soon.

01 Mar 2010
20:06 PM

Ayende,

I can certainly help with anything that doesn't require a big brain ;)

02 Mar 2010
04:03 AM

Kerja Jawatan Kosong

Ayende.. im currenty using xoops for my own project..Do u think this storage working with it??

02 Mar 2010
07:54 AM

It would be possible, yes.

We have a fully functional JSON / HTTP API

Comment preview

Comments have been closed on this topic.

Oren Eini

Oren Eini

CEO of RavenDB