Published January 1, 2010
| Version v1
Journal article
Open
A new perspective on data homogeneity in software cost estimation: a study in the embedded systems domain
Creators
- 1. Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey
- 2. Natl Res Council Canada, Inst Informat Technol, Software Engn Grp, Ottawa, ON K1A 0R6, Canada
Description
Cost estimation and effort allocation are the key challenges for successful project planning and management in software development. Therefore, both industry and the research community have been working on various models and techniques to accurately predict the cost of projects. Recently, researchers have started debating whether the prediction performance depends on the structure of data rather than the models used. In this article, we focus on a new aspect of data homogeneity, "cross-versus within-application domain'', and investigate what kind of training data should be used for software cost estimation in the embedded systems domain. In addition, we try to find out the effect of training dataset size on the prediction performance. Based on our empirical results, we conclude that it is better to use cross-domain data for embedded software cost estimation and the optimum training data size depends on the method used.
Files
bib-82ccc7e9-8803-4786-8fcb-fc1a22e95e95.txt
Files
(188 Bytes)
| Name | Size | Download all |
|---|---|---|
|
md5:447ee2ecb3c1d7358cf0ac2e62629ce2
|
188 Bytes | Preview Download |