Ayende @ Rahien

Hi!
My name is Oren Eini
Founder of Hibernating Rhinos LTD and RavenDB.
You can reach me by phone or email:

ayende@ayende.com

+972 52-548-6969

, @ Q c

Posts: 18 | Comments: 82

filter by tags archive

Strange production errors

time to read 1 min | 135 words

The following code cause a really strange error in production:

new MailAddress("test@gmail.​com");

The specified string is not in the form required for an e-mail address.

Huh?!

Obviously it is!

After immediately leaping to the conclusion that .NET is crap and I should immediately start writing my own virtual machine, I decided to dig a little deeper:

Character Code
t 116
e 101
s 115
t 116
@ 64
g 103
m 109
a 97
i 105
l 108
. 46
? 8203
c 99
o 111
m 109

8203 stands for U+200B or zero width space.

I guess that someone with a software testing background decided to get medieval on one of our systems.


Comments

Phil Jones

Holy crap!

I just debugged the exact same issue on my client's system.

We were all similarly scratching our heads till I had to use to view source.

My solution:

// Remove HTML characters email = Regex.Replace(email, "&#[0-9]+;", "");

(A big hacky)

Itamar

This usually happens when you copy-paste from Word. That guy isn't too sophisticated, he is just lazy...

Nic Wise

We've just been dealing with something similar.

select id, catnum from table;

1 ABCD-1234 2 ABCD-1234

select id, '[' + catnum + ']' from table; 1 [ABCD-1234] 2 [ABCD-1234

(catnum is ment to be unique, too!)

Got some unicode nonsense going on in there somewhere.... I suspect a newline, but we still can't find it.

Mikkel Christensen

Another problem to watch out for when using the MailAddress constructor:

http://social.msdn.microsoft.com/forums/en-US/netfxnetcom/thread/2217c413-968f-4dcf-8035-45eaf2a3c609

Matt

I get this quite a lot in our databases. The source is usually legacy processes that rely on Excel spreadsheets/vba for data loading (yuck).

Sina

This is a quite valid and common character in some languages, such as Persian (it is called Zero-Width Non Jointer) and joins different parts of a single word, when you don't want it get separated when word-wrapping happens. E.g., the following word contains a ZWNJ: می‌روم

Since it is a very common character for some languages it may happen usually that somebody changes the keyboard language accidentally and enter it without purpose.

Comment preview

Comments have been closed on this topic.

FUTURE POSTS

  1. The insidious cost of allocations - 5 hours from now
  2. Buffer allocation strategies: A possible solution - 3 days from now
  3. Buffer allocation strategies: Explaining the solution - 4 days from now
  4. Buffer allocation strategies: Bad usage patterns - 5 days from now
  5. The useless text book algorithms - 6 days from now

And 1 more posts are pending...

There are posts all the way to Sep 11, 2015

RECENT SERIES

  1. Find the bug (5):
    20 Apr 2011 - Why do I get a Null Reference Exception?
  2. Production postmortem (10):
    03 Sep 2015 - The industry at large
  3. What is new in RavenDB 3.5 (7):
    12 Aug 2015 - Monitoring support
  4. Career planning (6):
    24 Jul 2015 - The immortal choices aren't
View all series

Syndication

Main feed Feed Stats
Comments feed   Comments Feed Stats