Professional Documents
Culture Documents
17 de noviembre de 2011
Daniel Izquierdo
(cc) 2011 Daniel Izquierdo. Some rights reserved. This document is distributed under the Creative Commons Attribution-ShareAlike 3.0 licence, available in http://creativecommons.org/licenses/by-sa/3.0/
Daniel Izquierdo
Index
Introduction
Data Sources
References
Daniel Izquierdo
Global software development implies a distributed environment of work In the specic case of FLOSS projects, specic infrastructure is used as communication channels.
Daniel Izquierdo
That infrastructure is generally speaking, publicly available Thanks to this open nature of the repositories of information, it is possible to study them
Daniel Izquierdo
Index
Introduction
Data Sources
References
Daniel Izquierdo
Source code management system Mailing lists Bug tracking system Source code
Daniel Izquierdo
This is used to manage le versions during the development process It is possible to obtain the state of the source code at any time during its history
Daniel Izquierdo
Daniel Izquierdo
There are two main types: centralized and distributed Centralized: CVS or SVN Distributed: Git or Mercurial
Daniel Izquierdo
Daniel Izquierdo
Used to organized libre software projects Asynchronous way of communication Basically, those forward received e-mail messages to subscribed e-mail addresses Usually stored in specic formats such as RFC 4155
Daniel Izquierdo
Daniel Izquierdo
This type of tool is also used as a peer review process of the changes done in the SCM
Lists of Commits-watchers Patches attached
Daniel Izquierdo
This tool is used to manage the evolution of the several errors detected in the source code
Daniel Izquierdo
Some examples:
Bugzilla Sourceforge tracker GNATS: used by the FSF Jira Mantis
Daniel Izquierdo
Daniel Izquierdo
Daniel Izquierdo
Generally obtained by means of the releases of source code bug also from the trunk or master branch of the SCMs
Daniel Izquierdo
There exist a hierarchical structure It is possible to discriminate by type of le (translation, source code, binary, makele, etc). Denitively, several types of les, and not all of them are useful for evaluating software!
Daniel Izquierdo
Also interesting to evaluate the copyright and authors of the several les This makes possible similar studies about ownership in the source code as done in the SCM
Daniel Izquierdo
Also interesting to study licenses (in the end, the license is found in the source code). Between the source code and what a developer claims, the source code will provide the actual information.
Daniel Izquierdo
Index
Introduction
Data Sources
References
Daniel Izquierdo
References
Producing OSS by Karl Fogel Tools and datasets for mining libre software repositories, by Gregorio Robles, Jess M. Gonzlez-Barahona, Daniel u a Izquierdo-Cortzar and Israel Herraiz a The promises and perils of mining git by Christian Bird, Peter Rigby, Earl Barr, David Hamilton, Daniel German, and Premkumar Devanbu
Daniel Izquierdo