A Framework for Early Detection of Agile Software Development Project Failures using Machine Learning Algorithms
Abstract
In the realm of software development, the early identification of project failures is crucial for ensuring project success and minimizing risks. This study developed an early detection framework using various machine learning models to anticipate potential failures. Agile software projects were used for the study. The framework employed a range of machine learning models including decision tree, bagging classifier, weighted bagging classifier, random forest classifier, weighted random forest classifier, decision tree estimator, and bagging estimator. These models are trained and tested using a dataset comprising 13,238 observations from 12 different software companies, each with 15 variables relevant to project performance and outcomes. Initial training of the different models yielded promising results, with performance ranging between 45% to 55% accuracy during testing. Despite attempts to enhance the model's performance, including refinement of features and algorithms, there were no significant improvements observed. The evaluation results highlight the need for further refinement and optimization of the models used in the framework. In conclusion, while the decision tree classifier, bagging classifier, and random forest exhibited outstanding performance in the training results, the overall evaluation suggests that more work is required to improve the effectiveness of the early detection framework for Agile software project failures. Further research and refinement of the models are necessary to enhance accuracy and reliability in identifying potential project failures early in the Agile software development lifecycle.
References
B. Alajaleen and A. Alhroob, ‘Failure Prediction Approach in Agile Software Development’, Int. J. Softw. Innov. IJSI, vol. 10, no. 1, pp. 1–11, 2022.
J. L. Alonso, L. Belache, and D. R. Avresky, ‘Predicting Software Anomalies Using Machine Learning Techniques’, in Network Computing and Applications (NCA), 10th IEEE International Symposium on Software Aging and Rejuvenation, 2011.
B. O. Akumba, S. U. Otor, I. Agaji, and B. T. Akumba, ‘A Predictive Risk Model for Software Projects’ Requirement Gathering Phase’, Int. J. Innov. Sci. Res. Technol., vol. 5, no. 6, pp. 231–236, 2020.
D. Chiller and K. Sharma, ‘Proposed T-Model to Cover 4S Quality Metrics Based on an Empirical Study of the Root Cause of Software Failures’, Int. J. Electr. Comput. Eng., vol. 9, no. 2, 2019.
A. C. Eberendu, ‘Evaluation of Software Project Failure and Abandonment in Tertiary Institutions in Nigeria’, J. Inf. Knowl. Manag., vol. 5, no. 4, 2015.
Z. Slimi and B. V. Carballido, ‘Systematic review: AI’s impact on higher education-learning, teaching, and career opportunities’, TEM J., vol. 12, no. 3, p. 1627, 2023.
A. Nizam, ‘Software Project Failure Process Definition’, IEEE Access, vol. 10, pp. 34428–34441, 2022.
E. N. Osegi, V. I. Anireh, and C. G. Onukwugha, ‘PCWoT-MOBILE: A Collaborative Web-Based Platform for Real-Time Control in the Smart Space’, in iSTEAMS SMART-MIINDs Conference YABATECH, Lagos, Nigeria, 2018.
L. Zhang and P. N. Suganthan, ‘Random Forests with Ensemble of Feature Spaces’, Pattern Recognit., vol. 47, no. 10, pp. 3429–3437, 2014.
J. G. Rivera-Ibarra, G. Borrego, and R. R. Palacio, ‘Early Estimation in Agile Software Development Projects: A Systematic Mapping Study’, Informatics, vol. 11, no. 4, p. 81, 2024.
I. K. Hamdan, W. Aziguli, D. Zhang, E. Sumarliah, and K. Usmanova, ‘Forecasting blockchain adoption in supply chains based on machine learning: evidence from Palestinian food SMEs’, Br. Food J., vol. 124, no. 12, pp. 4592–4609, 2022.
N. Syam and A. Sharma, ‘Waiting for a sales renaissance in the fourth industrial revolution: Machine learning and artificial intelligence in sales research and practice’, Ind. Mark. Manag., vol. 69, pp. 135–146, 2018.
K. Singla, J. Bose, and C. Naik, ‘Analysis of software engineering for agile machine learning projects’, in 2018 15th IEEE India Council International Conference (INDICON), IEEE, 2018, pp. 1–5. Accessed: Mar. 03, 2025. [Online]. Available: https://ieeexplore.ieee.org/abstract/document/8987154/
S. M. Satapathy and S. K. Rath, ‘Empirical assessment of machine learning models for agile software development effort estimation using story points’, Innov. Syst. Softw. Eng., vol. 13, no. 2–3, pp. 191–200, Sep. 2017, doi: 10.1007/s11334-017-0288-z.
P. Pospieszny, B. Czarnacka-Chrobot, and A. Kobylinski, ‘An effective approach for software project effort and duration estimation with machine learning algorithms’, J. Syst. Softw., vol. 137, pp. 184–196, 2018.
M. H. Mahmud, M. T. H. Nayan, D. M. N. A. Ashir, and M. A. Kabir, ‘Software risk prediction: systematic literature review on machine learning techniques’, Appl. Sci., vol. 12, no. 22, p. 11694, 2022.