Takeshi Kakimoto

Learn More
The goal of this paper is to improve the prediction performance of fault-prone module prediction models (fault-proneness models) by employing over/under sampling methods, which are preprocessing procedures for a fit dataset. The sampling methods are expected to improve prediction performance when the fit dataset is unbalanced, i.e. there exists a large(More)
<b>Background:</b> Any software project dataset sometimes includes outliers which affect the accuracy of effort estimation. Outlier deletion methods are often used to eliminate them. However, there are few case studies which apply outlier deletion methods to analogy-based estimation, so it is not clear which method is more suitable for analogy-based(More)
Software engineers have to select some appropriate development technologies to use in the work; however, engineers sometimes cannot find the appropriate technologies because there are vast amount of options today. To solve this problem, we propose a software technology recommendation method based on collaborative filtering (CF). In the proposed method, at(More)
The scale-free network shown in the small world phenomenon indicates that our human society consists of a small number of people who play the role of hubs linked with many nodes (persons) and a large number of people as nodes linked with few nodes. From our analysis of a large-scale online community - SourceForge.net - which has a large number of developers(More)
Knowledge collaboration is the key for success of open source software (OSS) communities, because not all members have knowledge and skills necessary for software development. Generally, members in OSS communities communicate for knowledge collaboration using communication tools (e.g. mailing lists, discussion forums, bug tracking systems, and so on) so(More)
Many software development platforms provide a large number of library components to make it easy to build high quality software. On the other hand, it became more and more difficult for developers to find useful components in each development context because the amount of components provided became too large today. This paper proposes a recommender system(More)
Novel tricyclic 3,3a,5,9b-tetrahydro-2H-furo[3,2-c][2]benzopyran (TFB) derivatives were synthesized, and their herbicidal activities were elucidated. They were synthesized from D-glucose as a natural chiral source. The formation of the TFB skeleton was achieved by a Friedel-Crafts type intramolecular cyclization of methyl(More)
We view software development as a system of evolution consisting of the three elements: (1) artifacts, (2) individual developers, and (3) a community of developers. An individual’s determining what artifacts to contribute and how, with whom to communicate by asking or answering, and which role to play within the community affects the quality of software to(More)
2. INPUT DATA We used mailing lists (MLs) archives of PostgreSQL, downloaded from http://www.postgresql.org/community/lists/. The MLs mainly consist of user lists and developer lists. We used developer lists archive since we needed developers’ working time. Table 1 explains details of each ML. Figure 1 shows amounts of messages of each ML in the developer(More)
Analogy-based cost estimation provides a useful and intuitive means to support decision making in software project management. It derives a cost estimate required for completing a project from information about similar past projects, namely the analogues. While on average this method provides a relatively accurate cost estimate there remains a possibility(More)