Exploring the data, extracting series from my blog posts

time to read 2 min | 256 words

I’m pretty bad when it comes to actually organizing my blog. I just like to write stuff out, I don’t like to do things like properly setting things up in series. Mostly because I usually think about one post at a time, or three at the most.

I did notice that I usually use something like “Series name: post name” convention when writing series of posts. So I decided to write the following index to check the data out:

image

As you can see, this is pretty simple way of doing things. And that lead to the following data.

image

Some of those are obviously false positives, and we have things like this, which are obviously out:

image

But it looks like important series are also spread over time:

image

I think that I’m going to have to do a new blog feature, to highlight those emergent series.