Ayende @ Rahien

Hi!
My name is Oren Eini
Founder of Hibernating Rhinos LTD and RavenDB.
You can reach me by phone or email:

ayende@ayende.com

+972 52-548-6969

, @ Q c

Posts: 5,953 | Comments: 44,409

filter by tags archive

Extracting a list of committers from Git


There doesn’t seem to be a nice way to getting stats like “who are the committers in this project” from Git. There is probably some fancy script that does sed/awk magic to do so, but I went with a simpler alternative:

git log --raw > log.txt

var logFile = @"C:\work\ravendb\log.txt";

var committers = from line in File.ReadAllLines(logFile)
                 where line.StartsWith("Author: ")
                 let author = line.Substring("Author: ".Length)
                 group author by author
                 into g
                 let result = new {Committer = g.Key, CommitsCount = g.Count()}
                 orderby result.CommitsCount descending
                 select result;

foreach (var committer in committers)
{
    Console.WriteLine(committer);
}

Running this on Raven’s code produces:

{ Committer = Ayende Rahien <ayende>, CommitsCount = 555 }
{ Committer = Ayende Rahien <Ayende>, CommitsCount = 477 }
{ Committer = unknown <Ayende, CommitsCount = 72 }
{ Committer = Paul B <github, CommitsCount = 24 }
{ Committer = Andrew Stewart <Andrew.Stewart, CommitsCount = 24 }
{ Committer = Benny Thomas <jan.thomas, CommitsCount = 19 }
{ Committer = Luke Hertert <lukehertert, CommitsCount = 15 }
{ Committer = Bobby Johnson <bobby.johnson, CommitsCount = 13 }
{ Committer = Rob Ashton <robashton, CommitsCount = 11 }
{ Committer = unknown <LukeHertert, CommitsCount = 7 }
{ Committer = Andrew Theken <theken.1, CommitsCount = 6 }
{ Committer = AndyStewart <Andy.Stewart, CommitsCount = 3 }
{ Committer = Steve Strong <steve, CommitsCount = 3 }
{ Committer = unknown <Aaron Weiker, CommitsCount = 1 }
{ Committer = Emil Cardell <emil.cardell, CommitsCount = 1 }
{ Committer = bjarte.skogoy <bjarte.skogoy, CommitsCount = 1 }
{ Committer = unknown <henke, CommitsCount = 1 }


Comments

Martin Larsen

git log --raw | grep "^Author: " | sort | uniq -c

Masklinn

There doesn’t seem to be a nice way to getting stats like “who are the committers in this project” from Git.

The gitstat tool from the internets looks nice, considering git apparently doesn't have the equivalent of hg churn.

Frank Quednau

For completeness a powershell version that uses what is contained in the < and >:

git log --raw | Select-String -Pattern "^Author.<(.)>" | % { $_.Matches[

0].Groups[1].Value } | Group-Object | select count, name

Chris

+1 Larsen's solution

Ayende Rahien

Just one problem with all of the things that were suggested.

They requires me to THINK about those.

My solution was coded under heavy time pressure and I just needed the answers.

Going with the route of least resistance was the easiest and faster way to get things done.

Johannes Rudolph

That's why I couldn't live without linqPad anymore!

Martin Larsen

Ah... Cosmin 0wns us all. :-)

Marcin Floryan

Nice one, but how about streaming from a file? You'd be better off not having to read the whole file into memory which is precisely what LINQ allows you to do.

public IEnumerator

<string GetEnumerator()

    {

        using (TextReader reader = dataSource())

        {

            string line;

            while ((line = reader.ReadLine()) != null)

            {

                yield return line;

            }

        }

    }

Comment preview

Comments have been closed on this topic.

FUTURE POSTS

No future posts left, oh my!

RECENT SERIES

  1. The RavenDB Comic Strip (3):
    28 May 2015 - Part III – High availability & sleeping soundly
  2. Special Offer (2):
    27 May 2015 - 29% discount for all our products
  3. RavenDB Sharding (3):
    22 May 2015 - Adding a new shard to an existing cluster, splitting the shard
  4. Challenge (45):
    28 Apr 2015 - What is the meaning of this change?
  5. Interview question (2):
    30 Mar 2015 - fix the index
View all series

Syndication

Main feed Feed Stats
Comments feed   Comments Feed Stats