Qualitas Corpus: A Curated Collection of Java Code for Empirical Studies

Title: Qualitas Corpus: A Curated Collection of Java Code for Empirical Studies
Publication Type: Conference Paper
Year of Publication: 2010
Authors: Tempero E, Anslow C, Dietrich J, Han T, Li J, Lumpe M, Melton H, Noble J
Conference Name: 2010 Asia Pacific Software Engineering Conference (APSEC 2010)
Keywords: curated code corpus, empirical studies, experimental infrastructure
Abstract:

In order to increase our ability to use measurement to support software development practice, we need to do more analysis of code. However, empirical studies of code are expensive and their results are difficult to compare. We describe the Qualitas Corpus, a large curated collection of open source Java systems. The corpus reduces the cost of performing large empirical studies of code and supports comparison of measurements of the same artifacts. We discuss its design, organisation, and issues associated with its development.

URL: http://qualitascorpus.com/docs/citation.html
Refereed Designation: Refereed