NHibernate – The difference between Get, Load and querying by id

time to read 3 min | 450 words

One of the more common mistakes that I see people doing with NHibernate is related to how they are loading entities by the primary key. This is because there are important differences between the three options.

The most common mistake that I see is using a query to load by id. in particular when using Linq for NHibernate.

var customer = (
	select customer from s.Linq<Customer>()
	where customer.Id = customerId
	select customer
	).FirstOrDefault();

Every time that I see something like that, I wince a little inside. The reason for that is quite simple. This is doing a query by primary key. The key word here is a query.

This means that we have to hit the database in order to get a result for this query. Unless you are using the query cache (which by default you won’t), this force a query on the database, bypassing both the first level identity map and the second level cache.

Get and Load are here for a reason, they provide a way to get an entity by primary key. That is important for several aspects, most importantly, it means that NHibernate can apply quite a few optimizations for this process.

But there is another side to that, there is a significant (and subtle) difference between Get and Load.

Load will never return null. It will always return an entity or throw an exception. Because that is the contract that we have we it, it is permissible for Load to not hit the database when you call it, it is free to return a proxy instead.

Why is this useful? Well, if you know that the value exist in the database, and you don’t want to pay the extra select to have that, but you want to get that value so we can add that reference to an object, you can use Load to do so:

s.Save(
	new Order
	{
		Amount = amount,
		customer = s.Load<Customer>(1)
	}
);

The code above will not result in a select to the database, but when we commit the transaction, we will set the CustomerID column to 1. This is how NHibernate maintain the OO facade when giving you the same optimization benefits of working directly with the low level API.

Get, however, is different. Get will return null if the object does not exist. Since this is its contract, it must return either the entity or null, so it cannot give you a proxy if the entity is not known to exist. Get will usually result in a select against the database, but it will check the session cache and the 2nd level cache first to get the values first.

So, next time that you need to get some entity by its primary key, just remember the differences…

Tweet Share Share 25 comments

Tags:

NHibernate

Comments

30 Apr 2009
07:14 AM

should the where clause in the first snippet

("where customer.Id = customerId")

really be:

where customer.Id == customerId

30 Apr 2009
07:34 AM

Anders

Would this be catched by NHProf?

30 Apr 2009
07:45 AM

So,

Session.Delete(Session.Load <customer(1))

Only goes to the DB once? Cool. I thought it went twice, which always bugged me.

30 Apr 2009
09:06 AM

Ayende Rahien

MF,

Yes, it should be

30 Apr 2009
09:07 AM

Ayende Rahien

Andres,

Not currently, but that is a good suggestion.

Ng,

That depends on a lot of things, mostly if you have cascade associations.

30 Apr 2009
10:09 AM

Valeriu

Get will bring back an initialized entity and will eager load all associations?

Or loading associations will depend on your explicit mappings?

30 Apr 2009
10:10 AM

Ayende Rahien

Valeriu,

Get works based on your mapping, it does't do any eager loading outside of what is defined there

30 Apr 2009
14:03 PM

Will Shaver

Another difference -

Query based selects such as the first example will include all defined and active filters on the entity.

Load / Get will IGNORE all filters for that entity. Filters set up on the entity's relationships will still be used when loading sets/references on the entity returned from Load / Get.

This can cause quite a bit of headache if you're making heavy use of filters and not expecting this behavior.

30 Apr 2009
15:36 PM

Neil Mosafi

I have seen this before:

s.Save(

new Order

{

    Amount = amount,

    customer = new Customer { Id = 1 }

}

);

I think it works, but I assume that's not recommended?

30 Apr 2009
16:23 PM

Ayende Rahien

Will,

I haven't even considered that, but of course, you are right.

30 Apr 2009
16:26 PM

Ayende Rahien

Neil,

Yuck, that is likely to cause "an object with the same id but with different reference is already associated with the current session"

30 Apr 2009
17:01 PM

Rob

Very enlightening. How does this all relate to custom fetching strategies? I can't see a way to apply them using anything but a criteria/query.

30 Apr 2009
17:07 PM

Ayende Rahien

Rob,

It doesn't apply. If you need custom fetching, you need to use a query.

If you consider the reasons for Get / Load, you would see that it make sense that a custom fetching strategy would require a query.

There is no way for Get or Load to handle that.

30 Apr 2009
17:13 PM

Rob

That's what I thought. Just wanted to make sure that I wasn't missing something.

30 Apr 2009
17:51 PM

Anthony Dewhirst

I know that you said that "if you know that the value exist in the database" but, thinking about concurrency, what if in your call it did exist but another user has deleted it before you make your call, will NHibernate still check with a select before insert or trust you and hope that you have used a FK constraint in the DB otherwise?

30 Apr 2009
17:53 PM

Ayende Rahien

Anthony,

That is why we have FK for

30 Apr 2009
22:32 PM

Neil Mosafi

Makes sense, but only if the object has already been loaded into the session I presume? I have seen it on some projects I worked on and always thought it looked a bit smelly! Having read your post I can see that calling Load is definitely the way to do it.

Cheers

Neil

01 May 2009
06:30 AM

Ayende Rahien

Neil,

Yes, if it is already there, it might cause that.

Another nasty side effect that can happen is if there is cascade defined on the object, which might cause NH to initialize all columns to null / default because of this trick

04 May 2009
00:00 AM

Jon Kruger

Would it be bad to always use Load() and never use Get()? Or is there some scenario where you should use Get() over Load()? It sounds like Load() lets NHibernate figure out how to best handle the loading.

04 May 2009
02:07 AM

Ayende Rahien

Jon,

You should use Get() if you don't know that the entity exists.

Because Load will always return a value, Get will return null if the value does not exists

06 May 2009
09:58 AM

Ryan Heath

Perhaps this will show I do not use NH (yet!), but how do you handle a case where you are 95% sure the object is in the database (ie a blogpost) and it better be loaded from the cache instead from the database (it will not change frequently). How to handle the 5% that is not in the database (deleted, wrong id through url hacking, google finds an old url, etc etc)? It feels wrong you have to catch the specific Load exception instead of checking null in order to return a nice 404 instead of an aweful 502 ...

// Ryan

06 May 2009
10:14 AM

Ayende Rahien

Ryan,

You use a Get

06 May 2009
10:48 AM

Ryan Heath

Aaah, I somehow had the impression Get never checked the cache, but upon rereading your post this sentence "Get will usually result in a select against the database, but it will check the session cache and the 2nd level cache first to get the values first." makes me happy again :)

// Ryan

03 Jun 2009
07:04 AM

Suiden

Which of the 3 ways is best to load an Author by Id AND his blogs (mapped one-to-many, lazy)?

Note: author.Blogs is lazy because 95% these are not needed on-the-fly; this scenario is about the other 5% when blogs must be available without making another round-trip to database..

In general: how to best load an entity by Id and a series of associations at the same time?

03 Jun 2009
10:49 AM

Ayende Rahien

Suiden,

HQL or Criteria are the things to use

Comment preview

Comments have been closed on this topic.

Markdown turns plain text formatting into fancy HTML formatting.

Phrase Emphasis

*italic*   **bold**
_italic_   __bold__

Links

Inline:

An [example](http://url.com/ "Title")

Reference-style labels (titles are optional):

An [example][id]. Then, anywhere
else in the doc, define the link:
  [id]: http://example.com/  "Title"

Images

Inline (titles are optional):

![alt text](/path/img.jpg "Title")

Reference-style:

![alt text][id]
[id]: /url/to/img.jpg "Title"

Headers

Setext-style:

Header 1
========
Header 2
--------

atx-style (closing #'s are optional):

# Header 1 #
## Header 2 ##
###### Header 6

Lists

Ordered, without paragraphs:

1.  Foo
2.  Bar

Unordered, with paragraphs:

*   A list item.
    With multiple paragraphs.
*   Bar

You can nest them:

*   Abacus
    * answer
*   Bubbles
    1.  bunk
    2.  bupkis
        * BELITTLER
    3. burper
*   Cunning

Blockquotes

> Email-style angle brackets
> are used for blockquotes.
> > And, they can be nested.
> #### Headers in blockquotes
> 
> * You can quote a list.
> * Etc.

Horizontal Rules

Three or more dashes or asterisks:

---
* * *
- - - -

Manual Line Breaks

End a line with two or more spaces:

Roses are red,   
Violets are blue.

Fenced Code Blocks

Code blocks delimited by 3 or more backticks or tildas:

```
This is a preformatted
code block
```

Header IDs

Set the id of headings with {#<id>} at end of heading line:

## My Heading {#myheading}

Tables

Fruit    |Color
---------|----------
Apples   |Red
Pears	 |Green
Bananas  |Yellow

Definition Lists

Term 1
: Definition 1
Term 2
: Definition 2

Footnotes

Body text with a footnote [^1]
[^1]: Footnote text here

Abbreviations

MDD <- will have title
*[MDD]: MarkdownDeep

Oren Eini

Oren Eini

CEO of RavenDB