What is new in RavenDB 3.0: RavenFS
A frequent request from RavenDB users was the ability to store binary data. Be that actual documents (PDF, Word), images (user’s photo, accident images, medical scans) or very large items (videos, high resolution aerial photos).
RavenDB can do that, sort of, with attachments. But attachments were never a first class feature in RavenDB.
With RavenFS, files now have first class support. Here is a small screenshot; a detailed description of how it works follows below.
The Raven File System exposes a set of files, which are binary data with a specific key. However, unlike a simple key/value store, RavenFS does much more than just store the binary values.
It was designed upfront to handle very large files (multiple GBs) efficiently, at both the API and storage layers. It goes so far as to find common data patterns in distinct files (or even within the same file) and store a single reference to them, instead of duplicating the information. RavenFS is a replicated and highly available system; updating a file will only send the changes made to that file between the two nodes, not the full file. This lets you update very large files and replicate only the changes. This works even if you upload the file from scratch; you don't have to deal with that manually.
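The "send only the changes" idea can be sketched with a simple fixed-size block signature scheme, in the spirit of rsync. This is an illustrative sketch only, not the actual RavenFS algorithm (which works at its storage layer with its own chunking); `DeltaSyncSketch` and its block size are invented for the example:

```csharp
using System;
using System.Collections.Generic;
using System.Linq;
using System.Security.Cryptography;

// Hypothetical sketch of delta replication: hash the blocks the other
// node already has, then send only the blocks it is missing.
public static class DeltaSyncSketch
{
    // Hash each fixed-size block of the file the destination already holds.
    public static HashSet<string> ComputeSignatures(byte[] data, int blockSize)
    {
        var sigs = new HashSet<string>();
        using (var md5 = MD5.Create())
        {
            for (int i = 0; i < data.Length; i += blockSize)
            {
                int len = Math.Min(blockSize, data.Length - i);
                sigs.Add(Convert.ToBase64String(md5.ComputeHash(data, i, len)));
            }
        }
        return sigs;
    }

    // Return only the blocks of the updated file whose hashes the
    // destination does not already know; those bytes must be sent.
    public static List<byte[]> BlocksToSend(byte[] newData, int blockSize, HashSet<string> knownSigs)
    {
        var toSend = new List<byte[]>();
        using (var md5 = MD5.Create())
        {
            for (int i = 0; i < newData.Length; i += blockSize)
            {
                int len = Math.Min(blockSize, newData.Length - i);
                var sig = Convert.ToBase64String(md5.ComputeHash(newData, i, len));
                if (!knownSigs.Contains(sig))
                    toSend.Add(newData.Skip(i).Take(len).ToArray());
            }
        }
        return toSend;
    }
}
```

With a scheme like this, appending a few bytes to a multi-GB file only costs the transfer of the blocks that actually changed, which is the effect the post describes.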
Files aren’t just binary data. Files have metadata associated with them, and that metadata is available for searching. If you want to find all of Joe’s photos from May 2014, you can do that easily. The client API was carefully structured to give you full functionality even when sitting in a backend server: you can stream a value from one end of the system to the other without having to do any buffering.
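Conceptually, metadata search is just filtering files by their key/value pairs. Here is a tiny self-contained sketch in plain C# (no RavenFS types involved; the file names and metadata keys are invented for illustration) of what a query like "Joe's photos from May 2014" selects:

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Self-contained illustration of metadata-driven lookup: files are
// located by querying their key/value metadata, not by path.
public static class MetadataSearchSketch
{
    // Given a map of file name -> metadata, return the names of files
    // whose metadata contains the given key with the given value.
    public static List<string> FindFiles(
        Dictionary<string, Dictionary<string, string>> files,
        string key, string value)
    {
        return files
            .Where(f => f.Value.TryGetValue(key, out var v) && v == value)
            .Select(f => f.Key)
            .OrderBy(name => name)
            .ToList();
    }
}
```

In RavenFS itself this filtering happens server-side over indexed metadata, as the client code below shows.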
Let us see how this works from the client side, shall we?
var fileStore = new FilesStore
{
    Url = "http://localhost:8080",
    DefaultFileSystem = "Northwind-Assets"
};

using (var fileSession = fileStore.OpenAsyncSession())
{
    var stream = File.OpenRead("profile.png");
    var metadata = new RavenJObject
    {
        {"User", "users/1345"},
        {"Formal", true}
    };
    fileSession.RegisterUpload("images/profile.png", stream, metadata);
    await fileSession.SaveChangesAsync(); // actually upload the file
}

using (var fileSession = fileStore.OpenAsyncSession())
{
    var file = await fileSession.Query()
        .WhereEquals("Formal", true)
        .FirstOrDefaultAsync();

    var stream = await fileSession.DownloadAsync(file.Name);
    using (var localFile = File.Create("profile.png"))
    {
        await stream.CopyToAsync(localFile);
    }
}
First of all, you start by creating a FilesStore, similar to RavenDB’s DocumentStore, and then open a session. RavenFS is fully async; we don’t provide any sync API. The common scenario involves large files, where blocking operations are simply not going to cut it.
Now we upload a file to the server. Note that at no point do we need to actually have the file in memory: we open a stream to the file and register that stream to be uploaded. Only when we call SaveChangesAsync will we actually read from that stream and write to the file store. You can also see that we are specifying metadata on the file; later, we are going to be searching on that metadata. The result of the search is a FileHeader object, which is useful if you want to show the user a list of matching files. To actually get the contents of a file, you call DownloadAsync. Here, again, we don’t load the entire file into memory; instead, you get a stream for the contents of the file that you can send to its final destination.
A pretty simple and highly efficient process, overall.
RavenFS also has all the usual facilities you need from a data storage system, including full & incremental backups, full replication and high availability features. And while it has the usual file system folder model, to encourage familiarity, the most common usage is actually as a metadata driven system, where you locate a desired file by searching its metadata.
More posts in "What is new in RavenDB 3.0" series:
- (24 Sep 2014) Meta discussion
- (23 Sep 2014) Operations–Optimizations
- (22 Sep 2014) Operations–the nitty gritty details
- (22 Sep 2014) Operations–production view
- (19 Sep 2014) Operations–the pretty pictures tour
- (19 Sep 2014) SQL Replication
- (18 Sep 2014) Queries improvements
- (17 Sep 2014) Query diagnostics
- (17 Sep 2014) Indexing enhancements
- (16 Sep 2014) Indexing backend
- (15 Sep 2014) Simplicity
- (15 Sep 2014) JVM Client API
- (12 Sep 2014) Client side
- (11 Sep 2014) The studio
- (11 Sep 2014) RavenFS
- (10 Sep 2014) Voron
Comments
What license is RavenFS? I found the github repository, but there's no license info in there...
This is, I mean, yeah, this is pretty impressive. A lot.
Matthijs, There isn't a separate repository for RavenFS any more. What you are looking at is a very old remnant. RavenFS is licensed under the same license as RavenDB.
In your code example you're not using the metadata object while storing so I don't think this will work like this. Also it doesn't compile I guess (missing a semicolon at the end).
Koen, Thanks, I updated the code sample.
This is very cool. Will there be any tool to upgrade/migrate existing attachments in Raven 2.5 to RavenFS?
Great news - I've been eagerly awaiting this feature. There are many things we're planning to do with it :-)
"If you want to find all of Joe’s photos from May 2014, you can do that easily."
Damn it, iCloud.
This looks really great! Have you thought about how you will price this on RavenHQ?
Mike, Yes, there is such a tool, it is part of the 3.0 release dist.
Olav, This is just the 3.0 release status. The RavenHQ stuff and especially billing is something that will be handled separately.
Cool. Just wondering, if my client already purchased Raven DB 2.x, will they need to purchase a new license for 3.0?
Mike, That depends on the type of license they purchased. If they went with the subscription model, they can just upgrade and there is no issue with versioning. If your client purchased a one time license, that requires a new license purchase (we do provide a 15% discount).
What about file versioning, will RavenFS support that as well?
Steven, Currently we haven't implemented automatic versioning. Considering the fact that we are looking at handling very large files, that is something that we wanted the user to have a choice about. It is probable we'll add that once we have enough customer feedback.
Ayende, What would happen if the file was edited at the same time at different sites? How does RavenFS handle this?
Matt, That would generate a conflict, just like in RavenDB. You would be asked to resolve that conflict, and everything would go on as usual.
Thanks for the quick response. Did you ever consider the ability to 'lock' a file?
Matt, When you upload a file to a node, that is locked _on that node_. We don't do distributed locks, however.
Have you considered adding WebDav support?
Rik, Not at the moment, no.
This is going to be a killer feature I think, even on projects using an sql database.
I've been on a few projects recently where we wanted file storage like this but things like amazon weren't an option
Ayende, do you see RavenFS replacing the need for, say, Azure blob storage, or AWS's S3 or other storage option? If so, up to what point? What would be use case threshold where one might say, "O.K., we've outstripped RavenFS's ability to meet demand; time to move to Azure or AWS"?
Eric, RavenFS is there to provide more than just blob storage. To start with, it is replicated and has rich metadata capabilities (including searching).
The use case is quite different. A common use case can be distributing large files across multiple nodes (data bus), storing information in a way that allows you to do fast queries on their metadata, and avoiding moving the data to a remote location.
Where are the files stored? And can we control where? Just thinking that where I have the RavenDB server may not have the ability to store gigabytes of data.
Shmueli, We store them in a data directory, inside a single file on the file system. You can control where that happens, yes.
Ayende, can I implement sharding with RavenFS? How?
Nannez, That is probably something that we need to discuss over email. It can be done, yes, but you need to handle the distribution yourself right now.