Natural Language Processing and Machine Learning Methods for
Software Development Effort Estimation
Vlad-Sebastian IONESCU1, Horia DEMIAN2, Istvan-Gergely CZIBULA1
1 Babeş-Bolyai University,
1, M. Kogălniceanu Street, Cluj-Napoca, 400084, Romania.
2 University of Oradea,
1, Universității Street, Oradea, Romania.
ABSTRACT: The growing complexity and number of software projects requires both increasingly more experienced developers, testers and other specialists as well as a larger number of persons to fill these roles. This leads to increased personnel and management costs and also makes effort and cost estimation at task and activity levels more difficult for software development companies. An automated solution for software development effort estimation based on text descriptions of tasks and activities, combined with available metrics, is introduced. A real world case study consisting of data from a software company whose activity spans a rich development spectrum is conducted. The results obtained are very encouraging and surpass the few similar approaches available in research literature.
KEYWORDS: Software development effort estimation, Machine learning, Word embeddings, doc2vec, Support vector machines, Gaussian Naive Bayes.
>>FULL TEXT: PDF
CITE THIS PAPER AS:
Vlad-Sebastian Ionescu, Horia Demian, Istvan-Gergely Czibula, Natural Language Processing and Machine Learning Methods for Software Development Effort Estimation, Studies in Informatics and Control, ISSN 1220-1766, vol. 26(2), pp. 219-228, 2017. https://doi.org/10.24846/v26i2y201710