Graphs in RavenDB: Query results

architecture (618) rss
bugs (451) rss
challanges (123) rss
community (381) rss
databases (481) rss
design (896) rss
development (647) rss
hibernating-practices (72) rss
miscellaneous (592) rss
performance (397) rss
programming (1093) rss
raven (1459) rss
ravendb.net (545) rss
reviews (184) rss

2025
- August (6)
- July (7)
- June (7)
- May (10)
- April (10)
- March (10)
- February (7)
- January (12)
2024
- December (3)
- November (2)
- October (1)
- September (3)
- August (5)
- July (10)
- June (4)
- May (6)
- April (2)
- March (8)
- February (2)
- January (14)
2023
- December (4)
- October (4)
- September (6)
- August (12)
- July (5)
- June (15)
- May (3)
- April (11)
- March (5)
- February (5)
- January (8)
2022
- December (5)
- November (7)
- October (7)
- September (9)
- August (10)
- July (15)
- June (12)
- May (9)
- April (14)
- March (15)
- February (13)
- January (16)
2021
- December (23)
- November (20)
- October (16)
- September (6)
- August (16)
- July (11)
- June (16)
- May (4)
- April (10)
- March (11)
- February (15)
- January (14)
2020
- December (10)
- November (13)
- October (15)
- September (6)
- August (9)
- July (9)
- June (17)
- May (15)
- April (14)
- March (21)
- February (16)
- January (13)
2019
- December (17)
- November (14)
- October (16)
- September (10)
- August (8)
- July (16)
- June (11)
- May (13)
- April (18)
- March (12)
- February (19)
- January (23)
2018
- December (15)
- November (14)
- October (19)
- September (18)
- August (23)
- July (20)
- June (20)
- May (23)
- April (15)
- March (23)
- February (19)
- January (23)
2017
- December (21)
- November (24)
- October (22)
- September (21)
- August (23)
- July (21)
- June (24)
- May (21)
- April (21)
- March (23)
- February (20)
- January (23)
2016
- December (17)
- November (18)
- October (22)
- September (18)
- August (23)
- July (22)
- June (17)
- May (24)
- April (16)
- March (16)
- February (21)
- January (21)
2015
- December (5)
- November (10)
- October (9)
- September (17)
- August (20)
- July (17)
- June (4)
- May (12)
- April (9)
- March (8)
- February (25)
- January (17)
2014
- December (22)
- November (19)
- October (21)
- September (37)
- August (24)
- July (23)
- June (13)
- May (19)
- April (24)
- March (23)
- February (21)
- January (24)
2013
- December (23)
- November (29)
- October (27)
- September (26)
- August (24)
- July (24)
- June (23)
- May (25)
- April (26)
- March (24)
- February (24)
- January (21)
2012
- December (19)
- November (22)
- October (27)
- September (24)
- August (30)
- July (23)
- June (25)
- May (23)
- April (25)
- March (25)
- February (28)
- January (24)
2011
- December (17)
- November (14)
- October (24)
- September (28)
- August (27)
- July (30)
- June (19)
- May (16)
- April (30)
- March (23)
- February (11)
- January (26)
2010
- December (29)
- November (28)
- October (35)
- September (33)
- August (44)
- July (17)
- June (20)
- May (53)
- April (29)
- March (35)
- February (33)
- January (36)
2009
- December (37)
- November (35)
- October (53)
- September (60)
- August (66)
- July (29)
- June (24)
- May (52)
- April (63)
- March (35)
- February (53)
- January (50)
2008
- December (58)
- November (65)
- October (46)
- September (48)
- August (96)
- July (87)
- June (45)
- May (51)
- April (52)
- March (70)
- February (43)
- January (49)
2007
- December (100)
- November (52)
- October (109)
- September (68)
- August (80)
- July (56)
- June (150)
- May (115)
- April (73)
- March (124)
- February (102)
- January (68)
2006
- December (95)
- November (53)
- October (120)
- September (57)
- August (88)
- July (54)
- June (103)
- May (89)
- April (84)
- March (143)
- February (78)
- January (64)
2005
- December (70)
- November (97)
- October (91)
- September (61)
- August (74)
- July (92)
- June (100)
- May (53)
- April (42)
- March (41)
- February (84)
- January (31)
2004
- December (49)
- November (26)
- October (26)
- September (6)
- April (10)

RavenDB - High-Performance NoSQL Document Database

Oct 22 2018

Graphs in RavenDBQuery results

time to read 2 min | 381 words

We run into an interesting design issue when building graph queries for RavenDB. The problem statement is fairly easy. Should a document be allowed to be bound to multiple aliases in the query results, or just one? However, without context, the problem statement in not meaningful, so let’s talk about what the actual problem is. Consider the graph on the right. We have three documents, Arava, Oscar and Phoebe and the following edges:

Arava Likes Oscar
Phoebe Likes Oscar

We now run the following query:

This query asks for a a dog that likes another dog that is liked by a dog. Another way to express the same sentiment (indeed, how RavenDB actually considers this type of query) is to write it as follows:

When processing the and expression, we require that documents that match to the same alias will be the same. Given the graph that we execute this on, what would you consider the right result?

Right now, we have the first option, in which a document can be match to multiple different alias in the same result, which would lead to the following results:

Note that in this case, the first and last entries match A and C to the same document.

The second option is to ensure that a document can only be bound to a single alias in the result, which would remove the duplicate results above and give us only:

Note that in either case, position matters, and the minimum number of results this query will generate is two, because we need to consider different starting points for the pattern match on the graph.

What do you think should we do in such a case? Are there reasons to want this behavior or that and should it be something that the user select?

Tweet Share Share 18 comments

Tags:

raven
design

Comments

22 Oct 2018
13:32 PM

Jan

Here is related corner case which I'm thinking about. Let's have following graph:
1. Arava likes Oscar.
2. Arava likes Arava (well, he/she is a narcissist :-) )

Now I want following query:

match (a:Dogs)-[:Likes]->(b:Dogs) and (a:Dogs)-[:Likes]->(c:Dogs)-[:Likes]->(b:Dogs)
select a.Name, b.Name

With "1 entity to multiple alias matching", this query shall have this output:
Arava , Oscar

Without such matching, there is no result - and this is a case which I'm considering wrong.

22 Oct 2018
13:46 PM

Damien

I may be showing my relational leanings but I'd prefer to have all of the results (presuming there's an easy way to take on something along the line of a <> c as an additional part of the query, if I don't want the "duplicates").

Say that the query is asymmetric, such that there's a further link hanging off of c. And say that, for the purposes of following that link to d, I do want a to be treated identically to any other c. How do I express that if a and c are automatically precluded from being the same?

22 Oct 2018
14:37 PM

peter

is the problem the same "document" or is the problem that the first resultset is showing the same relationship, which is NOT what the query is trying to find? In relational sql, when doing a self join we would prevent the same row by adding e.g. where a.id != b.id. However, sometimes the same row can match itself, depending on the columns being compared.

22 Oct 2018
14:40 PM

Oren Eini

Jan, Thanks for the interesting query. Right now, this query will return:

+-------+-------+-------+
| a     | b     | c     |
+-------+-------+-------+
| Arava | Oscar | Arava |
+-------+-------+-------+
| Arava | Arava | Arava |
+-------+-------+-------+

The problem I have is with the second result, which make sense, but is probably not expected

22 Oct 2018
14:41 PM

Oren Eini

Damien, What we are considering is adding a where clause after the match, which will allow you to define exclusions like that. I'm not sure that I'm following the point on asymmetric query.

22 Oct 2018
14:41 PM

Jorge

I believe it makes more sense to return all the results. If we need, we can always add a "where a.Id <> b.Id" clause.

22 Oct 2018
14:42 PM

Oren Eini

Peter, See my example to Jan, which may explain it better. Yes, the problem is that it is matched to itself

22 Oct 2018
14:47 PM

Damien

Okay, so for the "asymmetric" query, I was thinking something along the lines of "Find all dogs which are two likes removed from dogs that Arava likes". if we express that as (a:Dogs) -[:Likes] -> (b:Dogs) <-[:Likes]- (c:Dogs) <- [:Likes] - (d:Dogs). d contains the dogs that are two likes removed, except those that like Arava, if we auto-filter to prevent c being equal to a.

22 Oct 2018
15:00 PM

Oren Eini

Damien, Oh, I see. Yes. The end result is that we must implement some manner to allow the user to filter these.

22 Oct 2018
15:21 PM

wqw

where a.Id <> b.Id is not very convenient but where a.Id > b.Id is more general for de-duping so that triples can follow the same pattern: a.Id > b.Id > c.Id etc.

23 Oct 2018
06:21 AM

Oren Eini

wqw, I'm not sure that I follow why you would want a.Id > b.Id here. In particular, note that in RavenDB, ids are usually strings without lexical sorting, so I don't see how greater then would help

23 Oct 2018
10:42 AM

Pop Catalin

This query asks for a dog that likes another dog that is liked by a dog I think this gives the clearest answer of how the results should be returned.

Returning all results would mean:

This query asks for a dog that likes another dog that is liked by a dog or by itself."

Which is a rather clear violation of the "principle of least astonishment". In my view, it's imperative that features behave as closely as possible to the human logical understanding of the feature and not necessarily with the widest mathematical understanding. The problem with allowing A to alias B is in multiple operand queries there would have to be lot's of filters added, which can lead to the number of filter clauses to become factorial to the number of operands:

IE: (a:Dogs) -[:Likes] -> (b:Dogs) <-[:Likes]- (c:Dogs) <- [:Likes] - (d:Dogs) where a != c and a != d and b != d

IF someone whishes to express (a:Dogs) -[:Likes] -> (b:Dogs) <-[:Likes]- (c:Dogs) as a can euqual to b then it can do it (a:Dogs) -[:Likes] -> (b:Dogs) <-[:Likes]- (c:Dogs) or (a:Dogs) -[:Likes] -> (b:Dogs) <-[:Likes]- (a:Dogs) but this query makes little sense to me.

23 Oct 2018
11:53 AM

Oren Eini

Pop, A better example might be something like:

(e:Employees)-[:Manager]->(m:Employees (Nice = true) )

How would you handle the case where you have a self managed employee that is nice?

23 Oct 2018
12:07 PM

Pop Catalin

@Oren

by default, I would like e and m to be exclusive but for other purposes, I would like some form of set algebraic operators to indicate relationships between a and m, something like 'e supersetof m', which I personally think it's more expressive than boolean where conditions between elements (But this is my opinion, I could be biased here).

23 Oct 2018
12:11 PM

Oren Eini

Pop,

The problem is that users, I believe, will expect to match this behavior: select e.Id, m.Id from Employees e join Employees m where e.Manager = m.Id

23 Oct 2018
12:37 PM

Pop Catalin

@Ayende,

While the expectation is true, the self-managed employee is usually the case of bugs in software. Whenever it happens to have a self-managed employee there's usually some bugs to go along with it. Some of which I've encountered myself:

Weekly email with the title: Bad employee notification, Employees who did not close timesheet by Friday 5:30 ( Was funny when people received this email about themselves, it actually made higher management apologize and change the title of the email.
Billing misalignment between systems (One system was counting one extra employee per team, and for a while, bills could not be created without executive override because the system saw 50% capacity being billed)

While I fully agree about the expectation of the orthogonality of the feature: graph query should behave like Linq counterpart, Is still think to work with multiple relationships over a set, it a nuisance. I don't think there's a wrong or right way from the general sense, just a matter of preference for various types of users.

23 Oct 2018
21:40 PM

Oren Eini

Pop, I'm not sure if you mean that graph queries should behave like Linq or should not, but I'm pretty sure that writing the above query with Linq would produce the same result, unless we explicitly do something to prevent it, no?

23 Oct 2018
22:00 PM

Pop Catalin

Yes, Makes sense. If the relationship exists it should appear in the Query.

Comment preview

Comments have been closed on this topic.

Markdown turns plain text formatting into fancy HTML formatting.

Phrase Emphasis

*italic*   **bold**
_italic_   __bold__

Links

Inline:

An [example](http://url.com/ "Title")

Reference-style labels (titles are optional):

An [example][id]. Then, anywhere
else in the doc, define the link:
  [id]: http://example.com/  "Title"

Images

Inline (titles are optional):

![alt text](/path/img.jpg "Title")

Reference-style:

![alt text][id]
[id]: /url/to/img.jpg "Title"

Headers

Setext-style:

Header 1
========
Header 2
--------

atx-style (closing #'s are optional):

# Header 1 #
## Header 2 ##
###### Header 6

Lists

Ordered, without paragraphs:

1.  Foo
2.  Bar

Unordered, with paragraphs:

*   A list item.
    With multiple paragraphs.
*   Bar

You can nest them:

*   Abacus
    * answer
*   Bubbles
    1.  bunk
    2.  bupkis
        * BELITTLER
    3. burper
*   Cunning

Blockquotes

> Email-style angle brackets
> are used for blockquotes.
> > And, they can be nested.
> #### Headers in blockquotes
> 
> * You can quote a list.
> * Etc.

Horizontal Rules

Three or more dashes or asterisks:

---
* * *
- - - -

Manual Line Breaks

End a line with two or more spaces:

Roses are red,   
Violets are blue.

Fenced Code Blocks

Code blocks delimited by 3 or more backticks or tildas:

```
This is a preformatted
code block
```

Header IDs

Set the id of headings with {#<id>} at end of heading line:

## My Heading {#myheading}

Tables

Fruit    |Color
---------|----------
Apples   |Red
Pears	 |Green
Bananas  |Yellow

Definition Lists

Term 1
: Definition 1
Term 2
: Definition 2

Footnotes

Body text with a footnote [^1]
[^1]: Footnote text here

Abbreviations

MDD <- will have title
*[MDD]: MarkdownDeep

Oren Eini

Oren Eini

CEO of RavenDB

Graphs in RavenDBQuery results

More posts in "Graphs in RavenDB" series:

Comments

Comment preview

FUTURE POSTS

RECENT SERIES

RECENT COMMENTS

Syndication

Main feed
Comments feed

Oren Eini

CEO of RavenDB

Related posts that you may find interesting:

More posts in "Graphs in RavenDB" series:

Comments

Comment preview

Markdown formatting

Phrase Emphasis

Links

Images

Headers

Lists

Blockquotes

Horizontal Rules

Manual Line Breaks

Fenced Code Blocks

Header IDs

Tables

Definition Lists

Footnotes

Abbreviations

FUTURE POSTS

RECENT SERIES

RECENT COMMENTS

Syndication