Raven’s Scripted Index Results

Scripted Index Results (I wish it had a better name) is a really interesting new feature in RavenDB 2.5. As the name implies, it allows you to attach scripts to indexes. Those scripts can operate on the results of the indexing.

Sounds boring, right? But the options that it opens up are anything but. Using Scripted Index Results you can get recursive map/reduce indexes, for example. But we won’t be doing that today. Instead, I’ll show how you can enhance entities with additional information from other sources.

Our sample database is Northwind, and we have defined the following index to get some statistics about our customers:

And we can query it like this:

However, what we want to do is to be able to embed those values inside the company document, so we won’t have to query for it separately. Here is how we can use the new Scripted Index Results bundle to do this:

Once we have defined that, whenever the index finishes indexing, it will run these scripts, and that, in turn, means that this is what our dear ALFKI looks like:
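To make the flow concrete, here is a toy Python model of the idea. It is only a sketch and not RavenDB's API: the document ids, the order totals, and the function names `orders_by_company` and `on_index_result` are invented for illustration. The one detail taken from the discussion is the shape of the embedded value, an `Orders` property holding a count and a total.

```python
from collections import defaultdict

# Toy stand-ins for Northwind documents. Ids and numbers are made up.
documents = {
    "companies/ALFKI": {"Name": "Alfreds Futterkiste"},
    "orders/1": {"Company": "companies/ALFKI", "Total": 100.0},
    "orders/2": {"Company": "companies/ALFKI", "Total": 50.0},
}


def orders_by_company(docs):
    """Model of the map/reduce index: one result per company."""
    results = defaultdict(lambda: {"Count": 0, "Total": 0.0})
    for doc_id, doc in docs.items():
        if doc_id.startswith("orders/"):
            entry = results[doc["Company"]]
            entry["Count"] += 1
            entry["Total"] += doc["Total"]
    return dict(results)


def on_index_result(docs, company_id, result):
    """Model of the attached script: embed the index result in the company."""
    docs[company_id]["Orders"] = {
        "Count": result["Count"],
        "Total": result["Total"],
    }


# "Whenever the index is done", run the script for every result it produced.
for company_id, result in orders_by_company(documents).items():
    on_index_result(documents, company_id, result)

print(documents["companies/ALFKI"])
```

In this model the company document ends up carrying its own order statistics, so reading the company no longer requires a separate query against the index.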

I’ll leave recursive map/reduce as a tidbit for my dear readers :)

Tags:

raven

Comments

30 May 2013
12:17 PM

Khalid Abuhakmeh

I have a few questions about this feature:

What makes this better than using a transformer?
What kind of performance hit does this put on the indexing process?

I'm sure I will have more, but those are the first two that come to mind.

30 May 2013
14:22 PM

Ayende Rahien

Khalid, 1) This happens during indexing, and the scripts can update document(s), so you can index those updated items. 2) It would slow indexing down a bit, but I don't expect it to be by much.

30 May 2013
14:30 PM

Khalid Abuhakmeh

Your answer to question one seems very interesting. So how do you handle this scenario?

An Order is added.
Orders/ByCompany is updated, which updates the Companies collection.

Does step 2 have to run all indexes again, and can you cause a weird cyclical issue with indexes, where they will constantly be running?

Index A is dependent on Collection A and updates Collection B, and Index B is dependent on collection B which updates Collection A.

How do you prevent something like that from happening, or at least warn the developer that they are doing something stupid?

30 May 2013
14:40 PM

Ayende Rahien

The Companies doc gets updated, and the relevant indexes get run. You cannot modify a document that will be indexed by the same index that triggered this operation. The reason for that is to avoid infinite recursion.

If you have it in two indexes, yes, you have a problem.
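The two cases in this exchange can be played out in a small Python simulation. This is a toy model, not how RavenDB works internally: it assumes each index reads one collection and its script writes to one collection, and the names `check_index` and `count_index_runs` are hypothetical.

```python
def check_index(name, reads, writes):
    """Model of the single-index rule: a script may not write to documents
    that its own index covers (modeled here at the collection level)."""
    if reads == writes:
        raise ValueError(f"{name} would re-trigger itself via '{writes}'")


def count_index_runs(indexes, first_write, max_steps=10):
    """Follow one write through the indexes and count index runs.

    Stops once the count reaches max_steps, which means the writes
    never settled within the budget.
    """
    pending = [first_write]
    runs = 0
    while pending and runs < max_steps:
        collection = pending.pop()
        for name, (reads, writes) in indexes.items():
            if reads == collection:
                runs += 1
                pending.append(writes)
    return runs


# One index: an order is added, companies get updated, then things settle.
single = {"Orders/ByCompany": ("orders", "companies")}

# Two indexes feeding each other: A reads a and writes b, B reads b and writes a.
cyclic = {"A": ("a", "b"), "B": ("b", "a")}

for name, (reads, writes) in {**single, **cyclic}.items():
    check_index(name, reads, writes)  # each index passes when checked alone

settled_runs = count_index_runs(single, "orders")
cyclic_runs = count_index_runs(cyclic, "a", max_steps=10)
print(settled_runs, cyclic_runs)
```

Each index passes the per-index check on its own, yet the pair keeps triggering each other until the step budget runs out, which is exactly the two-index problem described above.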

30 May 2013
14:45 PM

Khalid Abuhakmeh

Do you and the team have any ideas on how to prevent someone from shooting themselves in the foot, or is that just accepted as collateral damage?

It seems like an issue that could be common across a development team. Two developers could be working on separate but related features on separate branches, where the issue would manifest after the two features were merged into the same branch.

30 May 2013
14:51 PM

Ayende Rahien

Khalid, I don't know how to handle that scenario. In order to do that, you would have to keep track of every action by every index for all time. We do keep track of that for a single index, though.

30 May 2013
15:02 PM

Khalid Abuhakmeh

I am just thinking out loud, but what if during the indexing process you also tracked the source of what caused that indexing to happen?

External Document Put
Internal Document Put (script) with Source

You could do pattern matching based on Index, Source, Document Id that caused the index, and frequency within a certain time. If you hit a threshold, you can log a warning in the Management Studio. In addition, you could stop the indexes to save the system.

"Woah we sure touched this one document a lot within a certain time, and it seems what is affecting it is internal to RavenDB, we think there might be something wrong with these indexes: {Index}, {Source}."

You don't have to save this data for a long period of time, you just need a buffered window according to your frequency window (1 minute)? If an item falls out of the window, just throw away that info. Funny enough I think this is a good use case for your previous idea of an event stream.

Not sure if this is possible, just thinking out loud.
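As a rough sketch of the idea in this comment, here is a sliding-window counter in Python. Nothing here is a RavenDB feature: the class name `WriteLoopDetector`, the default threshold, and the choice to key on document id plus source index are all assumptions made for illustration. Only internal (script) writes are recorded, and timestamps are passed in explicitly so the behavior is easy to follow.

```python
from collections import defaultdict, deque


class WriteLoopDetector:
    """Flags documents that scripts touch too often within a time window."""

    def __init__(self, window_seconds=60.0, threshold=100):
        self.window_seconds = window_seconds
        self.threshold = threshold
        # (document id, source index) -> timestamps of recent internal writes
        self._events = defaultdict(deque)

    def record(self, doc_id, source_index, now):
        """Record an internal write; return True if a warning is due."""
        events = self._events[(doc_id, source_index)]
        events.append(now)
        # Anything that fell out of the window is simply thrown away.
        while events and now - events[0] > self.window_seconds:
            events.popleft()
        return len(events) >= self.threshold


detector = WriteLoopDetector(window_seconds=60.0, threshold=3)
warnings = [
    detector.record("companies/ALFKI", "Orders/ByCompany", now=t)
    for t in (0.0, 10.0, 20.0, 100.0)
]
print(warnings)
```

The third write inside the same minute trips the threshold; by the fourth write the earlier ones have aged out of the window, so the counter starts over. Memory stays bounded by the window, though it still means recording the source of every script write.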

30 May 2013
15:05 PM

Ayende Rahien

Khalid, And now you need to track a WHOLE lot of information in the system. Not only that, but you need to track the source of each write, and it is pretty expensive and hard to do. For something that is purely theoretical right now.

Also, there might be valid reasons why you would want to do that (if you know that you do recursion only to a certain level, for example).

05 Jun 2013
15:20 PM

Simone

Hey,

but shouldn't the Orders class with count and total from the main document already be there (and empty) in the .NET class?

If not, how will that information be deserialized?

06 Jun 2013
12:08 PM

Ayende Rahien

Simone, If the properties aren't there, they will be ignored and removed on the next save.
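The behavior described in this answer can be modeled in a few lines of Python. This is a toy model and not the RavenDB client: the `Company` class and the `load` and `save` helpers are hypothetical, and the stored values are invented.

```python
from dataclasses import asdict, dataclass, fields


@dataclass
class Company:
    # The entity class knows nothing about the embedded "Orders" property.
    # The field name mirrors the JSON property name.
    Name: str


def load(cls, stored):
    """Deserialize, silently ignoring properties the class does not declare."""
    known = {f.name for f in fields(cls)}
    return cls(**{key: value for key, value in stored.items() if key in known})


def save(entity):
    """Serialize only what the class declares, so ignored properties are lost."""
    return asdict(entity)


stored = {
    "Name": "Alfreds Futterkiste",
    "Orders": {"Count": 2, "Total": 150.0},  # written by the index script
}

company = load(Company, stored)
saved = save(company)
print(saved)
```

Loading works without the extra property, but saving the entity writes the document back without `Orders`. To keep the embedded values across a save, the entity class would need a matching property.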

Oren Eini

CEO of RavenDB