Good Practices for Software Analytics Papers

26 June 2014 by Martin P. Robillard with Audris Mockus

Much effort is currently invested in increasing our understanding of software development by analyzing large data sets, such as those from GitHub and StackOverflow (or their corporate equivalents). This type of effort is now known as Software Analytics (or, more generally, business analytics applied to software development). Unlike data collected as part of controlled experiments, software analytics data is collected indirectly, so we have to be careful in how we interpret it.

During a recent Dagstuhl seminar, I sat down with Audris Mockus and we synthesized some of the discussions into a list of desirable practices for reporting on software analytics projects.