Ayende @ Rahien

Hi!
My name is Ayende Rahien
Founder of Hibernating Rhinos LTD and RavenDB.
You can reach me by phone or email:

ayende@ayende.com

+972 52-548-6969

@

Posts: 5,947 | Comments: 44,540

filter by tags archive

Extracting a list of committers from Git


There doesn’t seem to be a nice way to getting stats like “who are the committers in this project” from Git. There is probably some fancy script that does sed/awk magic to do so, but I went with a simpler alternative:

git log --raw > log.txt

var logFile = @"C:\work\ravendb\log.txt";

var committers = from line in File.ReadAllLines(logFile)
                 where line.StartsWith("Author: ")
                 let author = line.Substring("Author: ".Length)
                 group author by author
                 into g
                 let result = new {Committer = g.Key, CommitsCount = g.Count()}
                 orderby result.CommitsCount descending
                 select result;

foreach (var committer in committers)
{
    Console.WriteLine(committer);
}

Running this on Raven’s code produces:

{ Committer = Ayende Rahien <ayende>, CommitsCount = 555 }
{ Committer = Ayende Rahien <Ayende>, CommitsCount = 477 }
{ Committer = unknown <Ayende, CommitsCount = 72 }
{ Committer = Paul B <github, CommitsCount = 24 }
{ Committer = Andrew Stewart <Andrew.Stewart, CommitsCount = 24 }
{ Committer = Benny Thomas <jan.thomas, CommitsCount = 19 }
{ Committer = Luke Hertert <lukehertert, CommitsCount = 15 }
{ Committer = Bobby Johnson <bobby.johnson, CommitsCount = 13 }
{ Committer = Rob Ashton <robashton, CommitsCount = 11 }
{ Committer = unknown <LukeHertert, CommitsCount = 7 }
{ Committer = Andrew Theken <theken.1, CommitsCount = 6 }
{ Committer = AndyStewart <Andy.Stewart, CommitsCount = 3 }
{ Committer = Steve Strong <steve, CommitsCount = 3 }
{ Committer = unknown <Aaron Weiker, CommitsCount = 1 }
{ Committer = Emil Cardell <emil.cardell, CommitsCount = 1 }
{ Committer = bjarte.skogoy <bjarte.skogoy, CommitsCount = 1 }
{ Committer = unknown <henke, CommitsCount = 1 }


Comments

Martin Larsen

git log --raw | grep "^Author: " | sort | uniq -c

Masklinn

There doesn’t seem to be a nice way to getting stats like “who are the committers in this project” from Git.

The gitstat tool from the internets looks nice, considering git apparently doesn't have the equivalent of hg churn.

Frank Quednau

For completeness a powershell version that uses what is contained in the < and >:

git log --raw | Select-String -Pattern "^Author.<(.)>" | % { $_.Matches[

0].Groups[1].Value } | Group-Object | select count, name

Chris

+1 Larsen's solution

Ayende Rahien

Just one problem with all of the things that were suggested.

They requires me to THINK about those.

My solution was coded under heavy time pressure and I just needed the answers.

Going with the route of least resistance was the easiest and faster way to get things done.

Johannes Rudolph

That's why I couldn't live without linqPad anymore!

Martin Larsen

Ah... Cosmin 0wns us all. :-)

Marcin Floryan

Nice one, but how about streaming from a file? You'd be better off not having to read the whole file into memory which is precisely what LINQ allows you to do.

public IEnumerator

<string GetEnumerator()

    {

        using (TextReader reader = dataSource())

        {

            string line;

            while ((line = reader.ReadLine()) != null)

            {

                yield return line;

            }

        }

    }

Comment preview

Comments have been closed on this topic.

FUTURE POSTS

No future posts left, oh my!

RECENT SERIES

  1. RavenDB Sharding (2):
    21 May 2015 - Adding a new shard to an existing cluster, the easy way
  2. The RavenDB Comic Strip (2):
    20 May 2015 - Part II – a team in trouble!
  3. Challenge (45):
    28 Apr 2015 - What is the meaning of this change?
  4. Interview question (2):
    30 Mar 2015 - fix the index
  5. Excerpts from the RavenDB Performance team report (20):
    20 Feb 2015 - Optimizing Compare – The circle of life (a post-mortem)
View all series

RECENT COMMENTS

Syndication

Main feed Feed Stats
Comments feed   Comments Feed Stats