Sometimes, reading a candidate's answer is just something that I know is going to piss me off.
We have a question that goes something like this (the actual question is much more detailed):
We have a 15TB CSV file that contains web logs. The entries are sorted by date (since this is how they were entered). Find all the log entries within a given date range. You may not read more than 32 MB.
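Because the entries are sorted, the question practically begs for a binary search over byte offsets rather than a scan. What follows is only a minimal sketch of that idea in Python, not the expected answer and not anything a candidate submitted. It assumes things the question above does not state: each line starts with a fixed-width ISO-8601 timestamp followed by a comma, there is no header row, and no field contains an embedded newline. The function names are made up for illustration.

```python
import os


def line_start_at_or_after(f, pos):
    """Return the offset of the first line that starts at or after pos."""
    if pos == 0:
        return 0
    # Step back one byte and read through the next newline. If byte pos-1
    # is itself a newline, this lands exactly on pos.
    f.seek(pos - 1)
    f.readline()
    return f.tell()


def first_offset_at_or_after(f, size, target):
    """Binary search for the first line whose timestamp is >= target."""
    lo, hi = 0, size
    while lo < hi:
        mid = (lo + hi) // 2
        start = line_start_at_or_after(f, mid)
        if start >= size:
            # No line starts at or after mid, so the answer is not later.
            hi = mid
            continue
        f.seek(start)
        # Assumed layout: "<timestamp>,<rest of the row>"
        key = f.readline().split(b",", 1)[0]
        if key >= target:
            hi = mid
        else:
            lo = mid + 1
    return line_start_at_or_after(f, lo)


def entries_in_range(path, start, end):
    """Yield rows with start <= timestamp < end, reading only what is needed."""
    size = os.path.getsize(path)
    with open(path, "rb") as f:
        begin = first_offset_at_or_after(f, size, start)
        stop = first_offset_at_or_after(f, size, end)
        f.seek(begin)
        while f.tell() < stop:
            yield f.readline().rstrip(b"\r\n")
```

Each probe reads a line or two, and halving a 15TB range takes roughly 44 probes per boundary, so locating both ends of the range touches only a tiny fraction of the file. Whether the matching rows themselves count against the 32 MB budget depends on the full question, which is not reproduced here.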
A candidate replied with an answer that had the following code:
My reply was:
The data file is 15TB in size; if the data is beyond the first 32MB, it won't be found.
The candidate then fixed his code. It now includes:
Yep, this is on a 15TB file.
Now I’m going to have to lie down for a bit. I am not feeling so good.