Request for comments: Removing graph queries from RavenDB

Mar 10 2022

In version 4.2 we added an experimental feature to RavenDB: Graph Queries. It took quite a bit of effort and we were really excited about it. The feature was marked as experimental and has remained in the product in that state for the past 4 years or so.

Unfortunately, while quite impressive, it didn’t graduate from an experimental feature to a stable one, mostly because there wasn’t enough usage of graph queries to warrant it. We have seen it used in some cases, but it seems that our target audience isn’t interested in graph queries for RavenDB.

Given that there isn’t much use of graph queries, we also aren’t spending much time on them. We are looking at the 6.0 release (scheduled around July 2022) and we realize that this feature makes our life more complicated and that the support burden of keeping it outweighs its benefits.

For that reason, we have made the decision to remove the experimental Graph Queries from RavenDB in the 6.0 release. Before we actually pull the trigger on that, I wanted to get your feedback on the feature and its usage. In particular, are you using it, and if so, what are you using it for?

The most common scenarios for this feature are already covered via projection queries in RavenDB, which often can be easier to express for developers.

Regardless, the feature will remain in the 5.x branch, and the 5.2 LTS version will support it until at least 2024.

Comments

10 Mar 2022
13:04 PM

Milosz

Please don't do it.
It's a great selling point when choosing a database - even though you could do a lot of things without it, the official support for graph queries gives peace of mind that once it is needed, it is there.
And the competition has it too:
https://www.mongodb.com/databases/mongodb-graph-database
https://docs.microsoft.com/en-us/azure/cosmos-db/graph/graph-introduction
https://age.apache.org/
https://docs.microsoft.com/en-us/sql/relational-databases/graphs/sql-graph-overview

10 Mar 2022
14:09 PM

Jon

Graph storage and queries are super cool, but niche in their application/usage scenarios. I wouldn't mind if the feature is separated out to a new project, or is dropped.

Though I am still waiting for easier sharding support (like the Raven 3.x style) to return...

10 Mar 2022
18:41 PM

Uri

Great decision, you don't need more features. Keep focus and do what you're best at: give me the fastest database, period. Replacing Lucene is a big enough challenge; you don't need more weight.

This reminds me of IE6 support: yes, there are some users with it, but for 99% of users it's not relevant.

10 Mar 2022
23:26 PM

Jason

Our product doesn't use that feature yet, so the impact on us is none. Would we choose another database if RavenDB removed it? The answer is no. It is not the core value for which we initially chose RavenDB.

Also, I would prefer a specialized database over a jack of all trades, master of none.

I remember that the original implementation of Twitter used multiple databases for different purposes. The same goes for RavenDB. As long as ETL can be done easily and accurately on the database, the data can be exported and reshaped into a graph-specific database, which is more specialized and more performant. Even Microsoft's Data Lake + Power BI can be a lower-performance alternative for the job of generating user reports.

The same can be said of full text search. Of course, having it as a baseline makes RavenDB shine, and we do depend on it, but there are other options, such as ETL to other full text search services. That does raise the cost, but some of those services are easy for ordinary developers to use. They can also support different languages easily, including languages that do not rely on spaces to separate words, such as Asian languages.

11 Mar 2022
01:42 AM

Trev

I'll admit we fall into the class of users that have intentions to use graph capabilities for one of our use cases, but haven't gotten around to it yet. That said, even then I don't believe we need the full capabilities that graph queries cover. So yeah, cull it.

As a token replacement though, a nice addition may be a more intrinsic way of creating labelled relationships between documents. No, I'm not talking about a relational DB. Imagine an ecommerce DB with a list of a few million products. Some of these products can be linked together in some novel ways:

The basic linking is on an attribute on the product itself. Products by the same manufacturer. Easily queryable with an index.
A more complex M-M style linking with another document. Product categories for example. A category document exists, and products can exist in multiple categories. Again queryable if the list of categories that the product is in is stored on the product document.

So far, so good. All doable within standard modelling and queries. But what about going far beyond that and being able to create an arbitrary number of different types of links between different products and then navigating those links. For example:

People who bought this product also bought these products...
This product was reviewed alongside these products and came in 3rd place...
This product and these other products are part of this promotion...
This product is complemented by these other products... n, ....

These can be modeled and queried using a collection of documents per relationship type and product (e.g. for the first case, a document per product in a PWBAB collection that contains a list of other products). It could also be modeled in a graph db with labelled edges between nodes. But it can get a bit clunky in a pure document db maintaining all those relationship documents because there's no referential integrity between the docs - deleting a product and cleaning up the relationships it is involved in falls onto application logic. So we CAN do relationships, we just need to manage them in the app, not the DB.
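To make the maintenance burden concrete, here is a minimal sketch of the "document per relationship type and product" model, with the cleanup on delete handled in application code. It uses plain in-memory dictionaries as hypothetical stand-ins for collections; the names `products`, `relationships`, `related` and `delete_product` are assumptions for illustration and are not RavenDB APIs.

```python
from collections import defaultdict

# Hypothetical in-memory stand-ins for document collections; not a RavenDB API.
products = {
    "products/1": {"name": "Kettle"},
    "products/2": {"name": "Teapot"},
    "products/3": {"name": "Mug"},
}

# One "relationship document" per (relationship type, product),
# holding the ids of the linked products.
relationships = defaultdict(dict)
relationships["PWBAB"]["products/1"] = ["products/2", "products/3"]
relationships["PWBAB"]["products/2"] = ["products/1"]
relationships["Promotion"]["products/3"] = ["products/1", "products/2"]


def related(rel_type, product_id):
    """Return the ids linked to product_id under one relationship type."""
    return list(relationships[rel_type].get(product_id, []))


def delete_product(product_id):
    """Delete a product and clean up every relationship that mentions it."""
    products.pop(product_id, None)
    for docs in relationships.values():
        # Drop the product's own relationship document for this type.
        docs.pop(product_id, None)
        # Remove dangling references from every other relationship document.
        for source, targets in docs.items():
            docs[source] = [t for t in targets if t != product_id]


delete_product("products/1")
print(related("PWBAB", "products/2"))
print(related("Promotion", "products/3"))
```

Every new relationship type adds another place that `delete_product` has to visit, which is the clunkiness described above.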

Maybe that's ok. Maybe adding some kind of first-class-citizen reference between documents is a step too close to being a relational DB. Maybe people who come from a relational background would end up abusing it. You're probably absolutely right in https://ayende.com/blog/4584/ravendb-includes ("disallow associations between documents").

But also, maybe some enhancements to includes could replace graph queries with something more lightweight. Maybe the referential cleanup could be BASE instead of ACID. Maybe there could be a more intrinsic way of loading 2-3 levels deep of a relationship or querying what relationships a product has. Maybe it's yet another differentiator from other document DBs without as big a footprint as graph queries.

Just thinking out loud about how we'll eventually implement our product db without graph queries. It'll be easy, but maybe it could be easier in some simple ways.

11 Mar 2022
05:33 AM

Milosz

Jon's idea of extraction to a separate project would be a nice compromise indeed - the weight of keeping it up-to-date could be moved onto the community. Keeping it (at least as a separate project) would also allow retaining the current version of https://ravendb.net/why-ravendb/multi-model

11 Mar 2022
06:19 AM

Milosz

Jason's idea to use ETL to send data to a "real" graph database is somewhat calming. Obviously, using an additional database increases the cost of development and administration, but maybe it is an inevitable trade-off.

11 Mar 2022
08:31 AM

Oren Eini

Milosz,

Features have cost, and they have to pay for themselves. Note that we have the same set of features with the graph API or not. We can do the same sort of lookups, it is an issue of what runs the query and what we are guiding for.

And as a personal note, I would rather RavenDB be an awesome database for its core competencies rather than provide an 80% solution.

11 Mar 2022
08:34 AM

Oren Eini

Jon,

Part of the reason for dropping graph queries is actually that the 6.0 edition has built-in sharding support. Providing sharded graph queries is a huge cost, and we didn't see the pickup on the feature to justify it.

11 Mar 2022
08:35 AM

Oren Eini

Jason,

I certainly agree with you about master of none. In addition, I just wanted to point out that we now also have ETL to Elasticsearch, if you want to go that route. And we do support full text search on non-Latin languages, including Asian ones.

11 Mar 2022
08:45 AM

Oren Eini

Trev,

Those are actually possible right now.

Take a look at this post, which seems to be almost exactly what you are looking for:

https://ravendb.net/articles/product-recommendations-in-ravendb

Note that this is actually a map/reduce, so you aren't querying the raw data (which is good, since you get faster responses).

Note that for your needs, those aren't actually associations between documents. Those are emerging relationships, not between two documents, but between classes of documents. That is why an index approach works better in that regard.
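As a rough illustration of the map/reduce idea, the following sketch counts how often products are bought together. It is plain Python over hypothetical order data, not the index from the linked article and not RavenDB's index syntax; `orders`, `map_order`, `reduce_counts` and `also_bought` are names assumed for this example.

```python
from collections import Counter
from itertools import permutations

# Hypothetical orders; each is the list of product ids bought together.
orders = [
    ["products/1", "products/2"],
    ["products/1", "products/2", "products/3"],
    ["products/2", "products/3"],
]


def map_order(order):
    """Map step: emit ((product, other_product), 1) for every ordered pair."""
    for product, other in permutations(sorted(set(order)), 2):
        yield (product, other), 1


def reduce_counts(mapped):
    """Reduce step: sum the counts per (product, other_product) key."""
    totals = Counter()
    for key, count in mapped:
        totals[key] += count
    return totals


totals = reduce_counts(pair for order in orders for pair in map_order(order))


def also_bought(product_id):
    """Products bought together with product_id, most frequent first."""
    hits = [(other, n) for (p, other), n in totals.items() if p == product_id]
    return sorted(hits, key=lambda item: (-item[1], item[0]))


print(also_bought("products/1"))
```

The query side only reads the aggregated totals, which is why answering from the reduced results is faster than scanning the raw orders.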

11 Mar 2022
08:49 AM

Jason

I am aware of ETL to Elasticsearch; we could also export to Azure Search. Currently our product focuses on English only, so it shouldn't be an issue for us.

When I say full text search on Asian languages, I mean specialized tokenizers, like Microsoft's tokenizers for different languages, e.g. the Azure Search tokenizers.

I know you can use NGramAnalyzer, but that's not optimal for Asian or Latin languages; it trades space for capability.
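The space-for-capability trade can be seen in a small sketch. This is a simplified character n-gram splitter written for illustration, not the Lucene NGramAnalyzer implementation; the function names and the `min_gram`/`max_gram` defaults are assumptions.

```python
def whitespace_terms(text):
    """Index terms produced by splitting on whitespace only."""
    return text.split()


def ngram_terms(text, min_gram=2, max_gram=3):
    """Index terms produced by sliding character windows over each token."""
    terms = []
    for token in text.split():
        for size in range(min_gram, max_gram + 1):
            for start in range(len(token) - size + 1):
                terms.append(token[start:start + size])
    return terms


text = "全文検索"  # "full-text search" in Japanese, written without spaces

print(whitespace_terms(text))  # ['全文検索']
print(ngram_terms(text))       # ['全文', '文検', '検索', '全文検', '文検索']

# A search for "検索" ("search") only matches when n-grams are indexed.
print("検索" in whitespace_terms(text))  # False
print("検索" in ngram_terms(text))       # True
```

One four-character token becomes five index terms here, so the index grows while partial matches become possible; a language-specific tokenizer would instead emit only the meaningful words.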

I remember asking you about language-specific tokenizers, and you said that I can create my own tokenizer and that providing them is not currently RavenDB's goal, which is understandable. Each language-specific tokenizer requires language experts, unless you integrate with another service that already has it.

11 Mar 2022
08:54 AM

Oren Eini

Jason,

You can do that with RavenDB by using analyzers, for example, this one: https://lucenenet.apache.org/docs/3.0.3/d2/dab/_chinese_analyzer_8cs_source.html

You can add those analyzers to RavenDB and utilize them. The issue is that we aren't providing them OOTB, but they exist.

11 Mar 2022
08:57 AM

Jason

That's good to know. Thanks for info.

21 Mar 2022
17:51 PM

Alexandru

I had a use-case recently for which I would have liked to use RavenDB, but I ended up going with Neo4j since it had a lot of built-in graph processing functions such as page-rank. I'm considering replicating data between RavenDB and Neo4j as I continue the project (Raven for main data and map/reduce, and Neo4j for fancy calculations). A built-in replication/processing pipeline would be neat. This is the fancy part of my current usage:

https://github.com/ops-ai/PageRank-Crawler/blob/develop/PageRank-Crawler/Program.cs#L82

Comments have been closed on this topic.

Oren Eini

CEO of RavenDB
