• Publications
  • Influence
A Comprehensive Study of Software Forks: Dates, Reasons and Outcomes
TLDR
The investigation shows, among other results, that forks occur in every software domain, that they have become more frequent in recent years, and that very few forks merge with the original project.
Applying Social Network Analysis to the Information in CVS Repositories
TLDR
This paper proposes the use of a well known set of methodologies (social network analysis) for characterizing libre software projects, their evolution over time and their internal structure, and shows how such methodologies have applied to real cases, and some preliminary conclusions from that experience are extracted.
On the reproducibility of empirical software engineering studies based on data retrieved from development repositories
TLDR
Which elements can be considered to characterize the reproducibility of a study in this area, and how they can be analyzed to better understand the type of reproduction studies they enable or obstruct, are explored.
On the Inequality of Contributions to Wikipedia
TLDR
This paper analyzes the trends in the inequality of distributions for the ten biggest language editions of Wikipedia, and their evolution over time, and finds large differences in the number of contributions by different authors, and a trend to stable patterns of inequality in the long run.
Comparison between SLOCs and number of files as size metrics for software evolution analysis
TLDR
This study shows that in those cases the evolution patterns in both cases (counting SLOCs or files) is the same, and that some patterns not conforming to Lehman's laws are indeed apparent.
Evolution and growth in large libre software projects
TLDR
It is observed in the studied sample that super-linearity occurs only exceptionally, that many of the systems follow a linear growth pattern and that smooth growth is not that common, which differ from the ones found generally in classical software evolution studies.
Remote analysis and measurement of libre software systems by means of the CVSAnalY tool
TLDR
This paper proposes a methodology for measuring and analyzing remotely big libre software projects using publicly-available data from their version control repositories, and implements a tool called CVSAnalY that has been implemented following this methodology.
Applying Social Network Analysis Techniques to Community-Driven Libre Software Projects
TLDR
Methodology is proposed, based on well known concepts from the social networks analysis field, which can be used to study the relationships among developers and how they collaborate in different parts of a project.
Developer identification methods for integrated data from various sources
TLDR
This paper proposes an approach, based on the application of heuristics, to identify the many identities of developers in such cases, and a data structure for allowing both the anonymized distribution of information, and the tracking of identities for verification purposes.
...
...