﻿<?xml version="1.0" encoding="utf-8"?><rss version="2.0"><channel><title>Ayende @ Rahien</title><link>http://ayende.com</link><description>Ayende @ Rahien</description><copyright>Copyright (C) Ayende Rahien  2004 - 2021 (c) 2026</copyright><ttl>60</ttl><item><title>Gian Maria commented on Watch your 6, or is it your I/O? It is the I/O, yes</title><description>Having such huge amount of file is a pain for every filesystem, because you have files that are smaller than the size of a cluster in the disk, and this mean wasted space and wasted time in reading and writing on disk. The picture of ayende shows how the real file size on disk is four time bigger than file size. 

Having them stored as a single compressed file is a good strategy, because you are trading disk activity with CPU activity to decompress the file and this is a gain because disk is much more slower. The good part is that all compression libraries have the option to create an archive at 0 compression level, this permits to avoid spending CPU cycle for decompression, but with the advantage of handling everything as a single file.  

</description><link>http://ayende.com/154465/watch-your-6-or-is-it-your-i-o-it-is-the-i-o-yes#comment9</link><guid>http://ayende.com/154465/watch-your-6-or-is-it-your-i-o-it-is-the-i-o-yes#comment9</guid><pubDate>Fri, 06 Apr 2012 07:47:23 GMT</pubDate></item><item><title>configurator commented on Watch your 6, or is it your I/O? It is the I/O, yes</title><description>@Dan, Patrick, Darius: Are you suggesting that Ayende should tell all his customers to install SSDs on their DB servers instead of enabling his software to read tarballs?</description><link>http://ayende.com/154465/watch-your-6-or-is-it-your-i-o-it-is-the-i-o-yes#comment8</link><guid>http://ayende.com/154465/watch-your-6-or-is-it-your-i-o-it-is-the-i-o-yes#comment8</guid><pubDate>Fri, 30 Mar 2012 08:03:12 GMT</pubDate></item><item><title>Darius commented on Watch your 6, or is it your I/O? It is the I/O, yes</title><description>&gt;&gt;How about just getting a good SSD? :D
How about getting a decent file system?</description><link>http://ayende.com/154465/watch-your-6-or-is-it-your-i-o-it-is-the-i-o-yes#comment7</link><guid>http://ayende.com/154465/watch-your-6-or-is-it-your-i-o-it-is-the-i-o-yes#comment7</guid><pubDate>Thu, 29 Mar 2012 15:23:43 GMT</pubDate></item><item><title>Patrick Huizinga commented on Watch your 6, or is it your I/O? It is the I/O, yes</title><description>@Dan: +1 Because if Ayende had gotton a good SSD, the Size on disk of all his files would've been less than 743 MB.</description><link>http://ayende.com/154465/watch-your-6-or-is-it-your-i-o-it-is-the-i-o-yes#comment6</link><guid>http://ayende.com/154465/watch-your-6-or-is-it-your-i-o-it-is-the-i-o-yes#comment6</guid><pubDate>Thu, 29 Mar 2012 07:24:04 GMT</pubDate></item><item><title>Rafal commented on Watch your 6, or is it your I/O? It is the I/O, yes</title><description>Yes, it's a shame modern 'high performance' filesystems (NTFS, i'm talking to you) can't handle 50 thousand small files in a single directory. Some time ago a bug in my company's application that resulted in creation of thousands of temp files brought down the whole server - IIS, the application and even file explorer. And it was hard to find too because the filesystem behaved as if the hardware was failing or the disk logical structure was corrupt.</description><link>http://ayende.com/154465/watch-your-6-or-is-it-your-i-o-it-is-the-i-o-yes#comment5</link><guid>http://ayende.com/154465/watch-your-6-or-is-it-your-i-o-it-is-the-i-o-yes#comment5</guid><pubDate>Wed, 28 Mar 2012 18:00:38 GMT</pubDate></item><item><title>Dan commented on Watch your 6, or is it your I/O? It is the I/O, yes</title><description>How about just getting a good SSD? :D</description><link>http://ayende.com/154465/watch-your-6-or-is-it-your-i-o-it-is-the-i-o-yes#comment4</link><guid>http://ayende.com/154465/watch-your-6-or-is-it-your-i-o-it-is-the-i-o-yes#comment4</guid><pubDate>Wed, 28 Mar 2012 17:53:27 GMT</pubDate></item><item><title>configurator commented on Watch your 6, or is it your I/O? It is the I/O, yes</title><description>@Steve: ext* would still be a lot slower to read millions of files in sequence than using an archive, assuming you don't care about the order in which you read files. It's because a file's size is measured in pages, and you get lots of free space between the files, so even if the files are all sequential on disk there's a lot of seek time. I think even an uncompressed tar file would be faster than pretty much any OS.

Not to mention that most compression schemes (e.g. zip, gzip, but not bz2) can be decompressed faster than they are read from disk on modern machines, meaning I/O is still the bottleneck even on compressed files.</description><link>http://ayende.com/154465/watch-your-6-or-is-it-your-i-o-it-is-the-i-o-yes#comment3</link><guid>http://ayende.com/154465/watch-your-6-or-is-it-your-i-o-it-is-the-i-o-yes#comment3</guid><pubDate>Wed, 28 Mar 2012 15:53:32 GMT</pubDate></item><item><title>Steve Wagner commented on Watch your 6, or is it your I/O? It is the I/O, yes</title><description>My thought is that NTFS just plain sucks with that much files even if they are large. UNIX filesystem dose not have that much problems with it.</description><link>http://ayende.com/154465/watch-your-6-or-is-it-your-i-o-it-is-the-i-o-yes#comment2</link><guid>http://ayende.com/154465/watch-your-6-or-is-it-your-i-o-it-is-the-i-o-yes#comment2</guid><pubDate>Wed, 28 Mar 2012 15:31:42 GMT</pubDate></item><item><title>Falhar commented on Watch your 6, or is it your I/O? It is the I/O, yes</title><description>Damn, I thought about that too. Now I'm sad I didn't mention it in previous thread.</description><link>http://ayende.com/154465/watch-your-6-or-is-it-your-i-o-it-is-the-i-o-yes#comment1</link><guid>http://ayende.com/154465/watch-your-6-or-is-it-your-i-o-it-is-the-i-o-yes#comment1</guid><pubDate>Wed, 28 Mar 2012 10:17:37 GMT</pubDate></item></channel></rss>