Architecting Autonomous Testing Pipelines for Cloud-Native Systems Using AI-Driven Fault Injection and Predictive Analytics

Authors

  • Pranay Kale Automation Architect, USA. Author

DOI:

https://doi.org/10.63282/3050-9416.IJAIBDCMS-V6I1P121

Keywords:

Cloud-Native Systems, Autonomous Testing, AI-Driven Fault Injection, Predictive Analytics, CI/CD Pipelines, Resilience Engineering, Machine Learning, DevOps

Abstract

The rapid adoption of cloud-native architectures, characterized by microservices, containerization, and dynamic orchestration, has significantly increased the complexity of software systems. Conventional methods of testing do not always suffice to tackle the issues related to distributed environments, scaling dynamically and deploying continuously. In this paper, the author introduces an autonomous testing pipeline architecture which combines AI-based fault injection and predictive analytics to promote the reliability, resilience, and performance of a cloud-native system. The suggested model establishes a smart fault injection engine that can emulate realistic and adaptive failure modes according to past system dynamics and run-time environment. Simultaneously, a predictive analytics engine is used to predict possible failures, high-risk components, and prioritize test execution using machine learning models. These modules are easily incorporated in CI/CD pipelines by the use of a centralized test orchestrator that facilitates the execution of continuous, automated, and proactive tests in dynamic settings. Also, the architecture includes a monitoring and observability layer and a feedback and learning module that creates a closed-loop system that recursively optimizes testing strategies and promotes self-healing. Experimental evidence shows that it has a large improvement on fault-detection rates, prediction accuracy, and system resilience, without incurring a large performance overhead. The study will help to improve the intelligent DevOps in the future by facilitating self-adaptive and scalable efficient testing pipelines, which will lead to the development of robust and reliable delivery of cloud-native applications.

References

1. Gangolli, A., Mahmoud, Q. H., & Azim, A. (2022). A systematic review of fault injection attacks on IoT systems. Electronics, 11(13), 2023. https://doi.org/10.3390/electronics11132023

2. Malik, S., Naqvi, M. A., & Moonen, L. (2023). CHESS: A framework for evaluation of self-adaptive systems based on chaos engineering. arXiv. https://arxiv.org/abs/2303.07283

3. Flora, J., Gonçalves, P., Teixeira, M., & Antunes, N. (2022). A study on the aging and fault tolerance of microservices in kubernetes. IEEE Access, 10, 132786-132799.

4. Kratzke, N. (2022). Cloud-native observability: The many-faceted benefits of structured and unified logging—A multi-case study. Future Internet, 14(10), 274. https://doi.org/10.3390/fi14100274

5. Harve, B. M., Bidkar, D. M., Krishnappa, M. S., Pandy, G., Jayaram, V., Veerapaneni, P. K., & Mehta, G. (2024, December). The cloud-native revolution: Microservices in a cloud-driven world. In 2024 International Conference on Intelligent Cybernetics Technology & Applications (ICICyTA) (pp. 1043-1048). IEEE.

6. Nikolaidis, F., Chazapis, A., Marazakis, M., & Bilas, A. (2021). Frisbee: Automated testing of cloud-native applications in Kubernetes. arXiv. https://arxiv.org/abs/2109.10727

7. Bakshi, K. (2017, March). Microservices-based software architecture and approaches. In 2017 IEEE aerospace conference (pp. 1-8). IEEE.

8. Oyeniran, O. C., Adewusi, A. O., Adeleke, A. G., Akwawa, L. A., & Azubuko, C. F. (2024). Microservices architecture in cloud-native applications: Design patterns and scalability. International Journal of Advanced Research and Interdisciplinary Scientific Endeavours, 1(2), 92-106.

9. Avireneni, R. T., Koneru, S. H., Yelkoti, N. K. K. R., Khaga, S. Y., & Nelavelli, S. (2022). Cloud Orchestration with Kubernetes/Docker. American International Journal of Computer Science and Technology, 4(1), 24-34.

10. Campos, J., Arcuri, A., Fraser, G., & Abreu, R. (2014, September). Continuous test generation: Enhancing continuous integration with automated test generation. In Proceedings of the 29th ACM/IEEE international conference on Automated software engineering (pp. 55-66).

11. Bandi Sudakara, B. (2023). Integrating cloud-native testing frameworks with DevOps pipelines for healthcare applications. International Journal of Research Publications in Engineering, Technology and Management, 6(5), 9309–9316. https://doi.org/10.15662/IJRPETM.2023.0605004

12. Shahin, M., Babar, M. A., & Zhu, L. (2017). Continuous integration, delivery and deployment: a systematic review on approaches, tools, challenges and practices. IEEE access, 5, 3909-3943.

13. Rosenthal, C., & Jones, N. (2020). Chaos engineering: system resiliency in practice. O'Reilly Media.

14. Mulla, N., & Jayakumar, N. (2021). Role of Machine Learning & Artificial Intelligence Techniques in Software Testing. Turkish Journal of Computer and Mathematics Education, 12(6), 2913-2921.

15. Kumar, V., & Pham, H. (Eds.). (2022). Predictive analytics in system reliability. Springer Nature.

16. Gangina, P. (2022). Resilience engineering principles for distributed cloud-native applications under chaos. International Journal of Computer Technology and Electronics Communication, 5(5), 5760-5770.

17. Panichella, A., Kifetew, F. M., & Tonella, P. (2018). Automated test case generation as a many-objective optimisation problem with dynamic selection of the targets. IEEE Transactions on Software Engineering, 44(2), 122–158. https://doi.org/10.1109/TSE.2017.2663435

18. Ahern, M., O’sullivan, D. T., & Bruton, K. (2022). Development of a framework to aid the transition from reactive to proactive maintenance approaches to enable energy reduction. Applied Sciences, 12(13), 6704.

19. Kosińska, J., Baliś, B., Konieczny, M., Malawski, M., & Zieliński, S. (2023). Toward the observability of cloud-native applications: The overview of the state-of-the-art. IEEE Access, 11, 73036-73052.

20. Moradi, M., Fabarisov, T., Challenger, M., & Denil, J. (2024). Model-implemented fault injection in cyber-physical systems: A systematic literature review. SSRN. https://doi.org/10.2139/ssrn.4813763

Downloads

Published

2025-03-31

Issue

Section

Articles

How to Cite

1.
Kale P. Architecting Autonomous Testing Pipelines for Cloud-Native Systems Using AI-Driven Fault Injection and Predictive Analytics. IJAIBDCMS [Internet]. 2025 Mar. 31 [cited 2026 Apr. 29];6(1):207-16. Available from: https://ijaibdcms.org/index.php/ijaibdcms/article/view/538