Distributed in memory cache / storage
Let us start by defining the difference between a cache and a storage. A cache may decide to evict items whenever it feels like it; a storage will only do so at well defined points.
As a simple example, this is legal for a cache:
```
PUT "foo", "item data"
result = GET "foo"
assert result is null
```
The cache contract means that it might preserve values, or it might not; that is the cache's choice. A storage contract makes the above code illegal: it may never happen. Most cache solutions have a way to specify priorities, including a "do not evict" flag, which makes them into good storage solutions.
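To make the contract difference concrete, here is a minimal sketch of a cache that exercises its right to evict. The class name and the random-eviction trigger are purely illustrative; real caches evict under memory pressure, not at random, but random eviction makes the point that a GET after a PUT may legally return nothing.

```python
import random

class EvictingCache:
    """Toy cache (hypothetical sketch): the contract allows it to
    forget any item at any time."""

    def __init__(self):
        self._items = {}

    def put(self, key, data):
        self._items[key] = data

    def get(self, key):
        # The cache is free to evict the item before we read it back.
        # Random eviction here simply simulates that freedom.
        if random.random() < 0.5:
            self._items.pop(key, None)
        return self._items.get(key)

cache = EvictingCache()
cache.put("foo", "item data")
result = cache.get("foo")
# Either outcome is legal for a cache; a storage must return "item data".
assert result in ("item data", None)
```

With the "do not evict" flag mentioned above, the eviction branch would simply be skipped for flagged items, turning the same structure into a storage.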
For now, we will assume the same API for both storage and cache, and assume that we have a flag to turn it one way or the other. We need to figure out what contract we are working with in the first place. Distributed and in-memory means that we have no practical memory limit and that we do not care much about disk access times. Distributed does mean that we pay a remote call penalty, but in general it means that we don't have to wait for disk, as we would in many DB based solutions.
We will go over the API first. Note that I am using an abstract API (suspiciously similar to the Memcached one, I know), which most cache/storage solutions would support.
- PUT key, data, expiration
  Will fail if the item is already in the cache
- GET key
  Will return null if the item is not in the cache or has expired
- DEL key
  Deletes the key from the cache
- UPDATE key, data, version
  Updates the item if the version matches
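The API above can be sketched as a single-node, in-memory implementation. This is a hypothetical sketch, not any particular product's API: the class name is invented, expiration handling is omitted for brevity, and the version counter implements the optimistic concurrency implied by UPDATE.

```python
class KeyValueStore:
    """Illustrative sketch of the abstract PUT/GET/DEL/UPDATE API.
    Versions give us optimistic concurrency: UPDATE succeeds only
    when the caller's version matches the stored one."""

    def __init__(self):
        self._items = {}   # key -> (data, version, expiration)

    def put(self, key, data, expiration=None):
        if key in self._items:
            return False            # PUT fails if the item already exists
        self._items[key] = (data, 1, expiration)
        return True

    def get(self, key):
        item = self._items.get(key)
        if item is None:
            return None, None       # missing (a real impl also checks expiry)
        data, version, _ = item
        return data, version

    def delete(self, key):
        self._items.pop(key, None)

    def update(self, key, data, version):
        item = self._items.get(key)
        if item is None or item[1] != version:
            return False            # stale version: someone else wrote first
        self._items[key] = (data, version + 1, item[2])
        return True
```

Note that UPDATE is the only operation that gives us any concurrency safety at all, and it only protects a single key; that limitation is exactly what the rest of the post is about.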
We assume that the storage resides on multiple machines, which means that we have several interesting problems to deal with:
- We are, by definition, working in a threaded environment.
- The cache offers no way to lock items.
- The set of items we need to perform a single operation may reside on different machines.
- As a result, each operation is distinct from any other operation; there is no way to batch several operations into a single atomic operation. In other words, we don't have transactions.
Think about the implications of this for a moment, will you?
Let us say that I have the following piece of code:
```
employee, employee_version = GET "employee#"+id
salary, salary_version = GET "salary#"+id
# business logic
UPDATE "employee#"+id, employee, employee_version
UPDATE "salary#"+id, salary, salary_version
```
At any point, another client's operations might interleave with ours, which will lead to data corruption. Even the safe, versioned update that we have is no good here: assume that we fail to update the salary. At that point, we have already updated the employee; now we need to roll it back, but another client might have added their changes in the meantime...
To make things worse, the employee and salary might very well reside on different machines. Trying to build transactions on top of this is possible, but extremely expensive in terms of performance.
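The partial-failure scenario can be demonstrated directly. This is a self-contained sketch, assuming a toy dict-based versioned store (not any real product); the keys and values are invented for illustration. It plays out exactly the interleaving described above: another client writes between our reads and our writes, our first UPDATE lands, and our second fails, leaving the two keys inconsistent.

```python
# Toy versioned store: key -> (data, version)
store = {}

def get(key):
    data, version = store.get(key, (None, None))
    return data, version

def update(key, data, version):
    # Versioned update: refuse to write over a newer version.
    if key not in store or store[key][1] != version:
        return False
    store[key] = (data, version + 1)
    return True

store["employee#1"] = ({"salary_grade": 3}, 1)
store["salary#1"] = ({"amount": 50_000}, 1)

# Our client reads both items...
employee, employee_version = get("employee#1")
salary, salary_version = get("salary#1")

# ...and another client sneaks in a salary change before we write back.
update("salary#1", {"amount": 55_000}, 1)

# Our first UPDATE succeeds...
assert update("employee#1", {"salary_grade": 4}, employee_version)
# ...but the second fails on the stale version. The employee is now
# changed while the salary is not, and no transaction ties them together.
assert not update("salary#1", {"amount": 60_000}, salary_version)
```

Rolling back the employee write is not safe either, since yet another client may have updated it between our write and our rollback, which is precisely the problem the post is pointing at.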
We need a better solution, one that builds on top of the existing API we have and gives us a way to handle this properly.
I will touch on that in a future post.
Comments
The way I've been dealing with memcache to get around this is that memcache is just a first place to look for read-only data. I have a single service that uses DB transactions for writes, and it populates memcache on writes and on cache miss requests. Yeah, that means that the DB is once again the write bottleneck, so this is more useful for scenarios where reads greatly outweigh writes.
And yes there's still stale cache issues of the single process writing the new data out in a non-transactional manner.
Staying tuned to your approach (my vote would be to build something on mnesia)
Hi Ayende. Do you know why an in-memory cache can use much more memory than the database itself? I read an Oracle consultant's post where he says that a 1 GB database could use more than 100 GB in memory. What do you think? Where can I read more about caches and distributed caches?
I already searched in Google but I didn't find what I was exactly looking for.
Thank you.
I would like to see where that consultant said that; it doesn't make sense.
1 GB of data on disk would consume about 1 GB of data in memory.
There are issues of the model used. Key/value vs. RDBMS.
In general, it is the old issue of memory utilization vs. processing time. You can trade one for the other. When using the disk, you tend to prefer to minimize size, because it ends up more efficient this way, but two orders of magnitude bigger is way beyond what I would expect.
Hi again. I finally found the post, but unfortunately it is not exactly like I said. He says something about multi-versioning that I sincerely don't know what it is.
Here is the link.
http://asktom.oracle.com/pls/asktom/f?p=100:11:0::::P11_QUESTION_ID:821088900346364898
But Oracle has a product called TimesTen that is an in-memory database. Do you know anything about it?
Thank you.
It is bull, basically.
What he is talking about is an approach to concurrency that requires several versions of a row to exist at the same time.
As a simple example, I read the customer row in one transaction, and you modify it in another transaction.
I can re-read the same row in my transaction and get the values that I got when I first touched that row.
This is a base feature in Oracle, in SQL Server it is called Snapshot Isolation.
This requires keeping several versions of the same row in the DB.
In general, unless the DB is implemented by idiots, there isn't any way that you will get to that point.
hi genius,
Thanks
Best regards
RAC? I am not familiar with the term.
I recently implemented a couple of distributed caches, you can find the references in the blog (NMemcache)
I mean under the Real Application Cluster.
I can't wait to take a look at it. Thanks very much!