Self-Healing Microservices for Insurance Platforms: A Fault-Tolerant Architecture Using AWS and PostgreSQL

Authors

  • Gowtham Reddy Enjam Independent Researcher, USA. Author
  • Komal Manohar Tekale Independent Researcher, USA. Author

DOI:

https://doi.org/10.63282/3050-9416.IJAIBDCMS-V5I1P113

Keywords:

Self-healing microservices, fault tolerance, insurance platforms, AWS, PostgreSQL, resilience, cloud-native architecture

Abstract

Insurance: The global insurance industry is in the process of a digital transformation driven by microservices, cloud-native computing, and distributed data platforms. Traditional monolithic insurance platforms are becoming problematic as they scale, become unmaintainable and unreliable, and adversely impact the customer experience and the continuity of operations. This paper describes a self-healing microservices-based architecture using Amazon Web Services (AWS) and PostgreSQL to build a fault-tolerant, resilient insurance system. The architecture that we propose combines auto-scaling, monitoring, service mesh orchestration and automated failover for database availability. Unlike traditional fault-tolerance methods, the architecture uses self-healing concepts based on proactive error-monitoring and reactive recovery mechanisms incorporated into microservices themselves. The key contributions in this paper are: (1) systematic framework for fault detection and automated healing in insurance microservices; (2) the integration of PostgreSQL replication and failover with AWS-native services (CloudWatch, EKS, RDS Multi-AZ); and (3) proof of decreased downtime and increased resiliency metrics vs. legacy architectures. Undergoing synthetic workloads that included claim processing, underwriting, and policy management, the system was able to maintain 99.95% uptime, 40% reduction in mean time to recovery (MTTR), and an improvement of 32% throughput during failure scenarios. The proposed architecture can be of much benefit to insurance platforms by ensuring business continuity, regulatory compliance and enhanced customer satisfaction. This paper describes the theoretical foundations, design methodology in practice, experimental assessment, and implications for future large-scale deployments

References

1. Newman, 2015 – Building Microservices: Designing Fine-Grained Systems (O'Reilly Media, 2015) – Provides foundational insight into microservices architecture. Wikipedia

2. Dragoni et al., 2017 – Microservices: How to Make Your Application Scale (ArXiv, Feb 2017) – Highlights microservices’ scalability improvements over SOA. arXiv

3. Shadija, Rezai & Hill, 2017 – Towards an Understanding of Microservices (ArXiv, Sep 2017) – Compares microservices with SOA and explores their flexibility in dynamic domains. arXiv

4. Castro & Liskov, 1999 – Byzantine Fault Tolerance – A seminal work addressing fault tolerance in distributed systems. (Classical reference; foundational theory.)

5. De Florio, 2016 – Application-layer Fault-Tolerance Protocols (ArXiv, Nov 2016) – Discusses methods of embedding fault-tolerance at the application layer. arXiv

6. Microservices in Life Insurance: Enhancing Scalability and Agility in Legacy Systems. Malali, Nihar. University of Texas at Dallas, March 2022.

7. “Towards Resilient Method,” 2021 – Toward a Resilient Method: An exhaustive survey of fault tolerance methods in the cloud computing environment (Computer Science Review, May 2021). Categorizes reactive, proactive, and resilient methods. ScienceDirectSpringerLink

8. Khiat, 2021 – Cloud-oriented fault tolerance techniques. Proposes adaptive frameworks combining checkpoint/restart and replication for reactive fault tolerance. SAGE Journals

9. Somasekaram, Calinescu & Buyya, 2021 – High-Availability Clusters: A Taxonomy, Survey, and Future Directions (ArXiv, Sep 2021) – Comprehensive survey of high-availability cluster techniques. arXiv

10. Percona Blog, 2018 – High Availability for Enterprise-Grade PostgreSQL Environments – Introduces solutions like Patroni, Stolon, repmgr, PAF, pglookout, and pgPool-II. Percona

11. Magableh, B., & Almiani, M. “A Self Healing Microservices Architecture: A Case Study in Docker Swarm Cluster.” In Advanced Information Networking and Applications – AINA 2019, AINA 2019. Advances in Intelligent Systems and Computing, Vol. 926. Springer, Cham, 2020.

12. Amarjeet Singh; Alok Aggarwal. “Artificial Intelligence Self Healing Capability Assessment in Microservices Applications Deployed in AWS using CloudWatch and Hystrix.” Australian Journal of Machine Learning Research & Applications, before 2023.

13. Microservice Disaster Crash Recovery: A Weak Global Referential Integrity Management. Maude Manouvrier; Cesare Pautasso; Marta Rukoz. In Computational Science – ICCS 2020, pp. 482 495, 2020.

14. Gotti, 2016 – Introduction to Stolon: cloud native PostgreSQL high availability – Introduces Stolon’s architecture (keeper, sentinel, proxy) and cloud-native durability. Simone GottiGitHub

15. Rusum, G. P., Pappula, K. K., & Anasuri, S. (2020). Constraint Solving at Scale: Optimizing Performance in Complex Parametric Assemblies. International Journal of Emerging Trends in Computer Science and Information Technology, 1(2), 47-55. https://doi.org/10.63282/3050-9246.IJETCSIT-V1I2P106

16. Pappula, K. K., & Anasuri, S. (2020). A Domain-Specific Language for Automating Feature-Based Part Creation in Parametric CAD. International Journal of Emerging Research in Engineering and Technology, 1(3), 35-44. https://doi.org/10.63282/3050-922X.IJERET-V1I3P105

17. Rahul, N. (2020). Optimizing Claims Reserves and Payments with AI: Predictive Models for Financial Accuracy. International Journal of Emerging Trends in Computer Science and Information Technology, 1(3), 46-55. https://doi.org/10.63282/3050-9246.IJETCSIT-V1I3P106

18. Pappula, K. K., Anasuri, S., & Rusum, G. P. (2021). Building Observability into Full-Stack Systems: Metrics That Matter. International Journal of Emerging Research in Engineering and Technology, 2(4), 48-58. https://doi.org/10.63282/3050-922X.IJERET-V2I4P106

19. Pedda Muntala, P. S. R., & Karri, N. (2021). Leveraging Oracle Fusion ERP’s Embedded AI for Predictive Financial Forecasting. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 2(3), 74-82. https://doi.org/10.63282/3050-9262.IJAIDSML-V2I3P108

20. Rahul, N. (2021). Strengthening Fraud Prevention with AI in P&C Insurance: Enhancing Cyber Resilience. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 2(1), 43-53. https://doi.org/10.63282/3050-9262.IJAIDSML-V2I1P106

21. Rusum, G. P. (2022). WebAssembly across Platforms: Running Native Apps in the Browser, Cloud, and Edge. International Journal of Emerging Trends in Computer Science and Information Technology, 3(1), 107-115. https://doi.org/10.63282/3050-9246.IJETCSIT-V3I1P112

22. Pappula, K. K. (2022). Architectural Evolution: Transitioning from Monoliths to Service-Oriented Systems. International Journal of Emerging Research in Engineering and Technology, 3(4), 53-62. https://doi.org/10.63282/3050-922X.IJERET-V3I4P107

23. Jangam, S. K. (2022). Self-Healing Autonomous Software Code Development. International Journal of Emerging Trends in Computer Science and Information Technology, 3(4), 42-52. https://doi.org/10.63282/3050-9246.IJETCSIT-V3I4P105

24. Anasuri, S. (2022). Adversarial Attacks and Defenses in Deep Neural Networks. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 3(4), 77-85. https://doi.org/10.63282/xs971f03

25. Pedda Muntala, P. S. R. (2022). Anomaly Detection in Expense Management using Oracle AI Services. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 3(1), 87-94. https://doi.org/10.63282/3050-9262.IJAIDSML-V3I1P109

26. Rahul, N. (2022). Automating Claims, Policy, and Billing with AI in Guidewire: Streamlining Insurance Operations. International Journal of Emerging Research in Engineering and Technology, 3(4), 75-83. https://doi.org/10.63282/3050-922X.IJERET-V3I4P109

27. Rusum, G. P., & Anasuri, S. (2023). Composable Enterprise Architecture: A New Paradigm for Modular Software Design. International Journal of Emerging Research in Engineering and Technology, 4(1), 99-111. https://doi.org/10.63282/3050-922X.IJERET-V4I1P111

28. Pappula, K. K. (2023). Reinforcement Learning for Intelligent Batching in Production Pipelines. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 4(4), 76-86. https://doi.org/10.63282/3050-9262.IJAIDSML-V4I4P109

29. Jangam, S. K., & Pedda Muntala, P. S. R. (2023). Challenges and Solutions for Managing Errors in Distributed Batch Processing Systems and Data Pipelines. International Journal of Emerging Research in Engineering and Technology, 4(4), 65-79. https://doi.org/10.63282/3050-922X.IJERET-V4I4P107

30. Anasuri, S. (2023). Secure Software Supply Chains in Open-Source Ecosystems. International Journal of Emerging Trends in Computer Science and Information Technology, 4(1), 62-74. https://doi.org/10.63282/3050-9246.IJETCSIT-V4I1P108

31. Pedda Muntala, P. S. R., & Karri, N. (2023). Leveraging Oracle Digital Assistant (ODA) to Automate ERP Transactions and Improve User Productivity. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 4(4), 97-104. https://doi.org/10.63282/3050-9262.IJAIDSML-V4I4P111

32. Rahul, N. (2023). Transforming Underwriting with AI: Evolving Risk Assessment and Policy Pricing in P&C Insurance. International Journal of AI, BigData, Computational and Management Studies, 4(3), 92-101. https://doi.org/10.63282/3050-9416.IJAIBDCMS-V4I3P110

33. Pappula, K. K., & Rusum, G. P. (2020). Custom CAD Plugin Architecture for Enforcing Industry-Specific Design Standards. International Journal of AI, BigData, Computational and Management Studies, 1(4), 19-28. https://doi.org/10.63282/3050-9416.IJAIBDCMS-V1I4P103

34. Rahul, N. (2020). Vehicle and Property Loss Assessment with AI: Automating Damage Estimations in Claims. International Journal of Emerging Research in Engineering and Technology, 1(4), 38-46. https://doi.org/10.63282/3050-922X.IJERET-V1I4P105

35. Pappula, K. K., & Rusum, G. P. (2021). Designing Developer-Centric Internal APIs for Rapid Full-Stack Development. International Journal of AI, BigData, Computational and Management Studies, 2(4), 80-88. https://doi.org/10.63282/3050-9416.IJAIBDCMS-V2I4P108

36. Pedda Muntala, P. S. R., & Jangam, S. K. (2021). End-to-End Hyperautomation with Oracle ERP and Oracle Integration Cloud. International Journal of Emerging Research in Engineering and Technology, 2(4), 59-67. https://doi.org/10.63282/3050-922X.IJERET-V2I4P107

37. Rahul, N. (2021). AI-Enhanced API Integrations: Advancing Guidewire Ecosystems with Real-Time Data. International Journal of Emerging Research in Engineering and Technology, 2(1), 57-66. https://doi.org/10.63282/3050-922X.IJERET-V2I1P107

38. Pappula, K. K., & Rusum, G. P. (2021). Designing Developer-Centric Internal APIs for Rapid Full-Stack Development. International Journal of AI, BigData, Computational and Management Studies, 2(4), 80-88. https://doi.org/10.63282/3050-9416.IJAIBDCMS-V2I4P108

39. Pedda Muntala, P. S. R., & Jangam, S. K. (2021). End-to-End Hyperautomation with Oracle ERP and Oracle Integration Cloud. International Journal of Emerging Research in Engineering and Technology, 2(4), 59-67. https://doi.org/10.63282/3050-922X.IJERET-V2I4P107

40. Rahul, N. (2021). AI-Enhanced API Integrations: Advancing Guidewire Ecosystems with Real-Time Data. International Journal of Emerging Research in Engineering and Technology, 2(1), 57-66. https://doi.org/10.63282/3050-922X.IJERET-V2I1P107

41. Rusum, G. P., & Pappula, kiran K. . (2022). Event-Driven Architecture Patterns for Real-Time, Reactive Systems. International Journal of Emerging Research in Engineering and Technology, 3(3), 108-116. https://doi.org/10.63282/3050-922X.IJERET-V3I3P111

42. Pappula, K. K. (2022). Containerized Zero-Downtime Deployments in Full-Stack Systems. International Journal of AI, BigData, Computational and Management Studies, 3(4), 60-69. https://doi.org/10.63282/3050-9416.IJAIBDCMS-V3I4P107

43. Jangam, S. K., & Karri, N. (2022). Potential of AI and ML to Enhance Error Detection, Prediction, and Automated Remediation in Batch Processing. International Journal of AI, BigData, Computational and Management Studies, 3(4), 70-81. https://doi.org/10.63282/3050-9416.IJAIBDCMS-V3I4P108

44. Anasuri, S. (2022). Formal Verification of Autonomous System Software. International Journal of Emerging Research in Engineering and Technology, 3(1), 95-104. https://doi.org/10.63282/3050-922X.IJERET-V3I1P110

45. Pedda Muntala, P. S. R. (2022). Natural Language Querying in Oracle Fusion Analytics: A Step toward Conversational BI. International Journal of Emerging Trends in Computer Science and Information Technology, 3(3), 81-89. https://doi.org/10.63282/3050-9246.IJETCSIT-V3I3P109

46. Rahul, N. (2022). Optimizing Rating Engines through AI and Machine Learning: Revolutionizing Pricing Precision. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 3(3), 93-101. https://doi.org/10.63282/3050-9262.IJAIDSML-V3I3P110

47. Rusum, G. P., & Anasuri, S. (2023). Synthetic Test Data Generation Using Generative Models. International Journal of Emerging Trends in Computer Science and Information Technology, 4(4), 96-108. https://doi.org/10.63282/3050-9246.IJETCSIT-V4I4P111

48. Pappula, K. K. (2023). Edge-Deployed Computer Vision for Real-Time Defect Detection. International Journal of AI, BigData, Computational and Management Studies, 4(3), 72-81. https://doi.org/10.63282/3050-9416.IJAIBDCMS-V4I3P108

49. Jangam, S. K. (2023). Data Architecture Models for Enterprise Applications and Their Implications for Data Integration and Analytics. International Journal of Emerging Trends in Computer Science and Information Technology, 4(3), 91-100. https://doi.org/10.63282/3050-9246.IJETCSIT-V4I3P110

50. Anasuri, S., Rusum, G. P., & Pappula, K. K. (2023). AI-Driven Software Design Patterns: Automation in System Architecture. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 4(1), 78-88. https://doi.org/10.63282/3050-9262.IJAIDSML-V4I1P109

51. Pedda Muntala, P. S. R., & Karri, N. (2023). Managing Machine Learning Lifecycle in Oracle Cloud Infrastructure for ERP-Related Use Cases. International Journal of Emerging Research in Engineering and Technology, 4(3), 87-97. https://doi.org/10.63282/3050-922X.IJERET-V4I3P110

52. Rahul, N. (2023). Personalizing Policies with AI: Improving Customer Experience and Risk Assessment. International Journal of Emerging Trends in Computer Science and Information Technology, 4(1), 85-94. https://doi.org/10.63282/3050-9246.IJETCSIT-V4I1P110

53. Pappula, K. K. (2021). Modern CI/CD in Full-Stack Environments: Lessons from Source Control Migrations. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 2(4), 51-59. https://doi.org/10.63282/3050-9262.IJAIDSML-V2I4P106

54. Pedda Muntala, P. S. R. (2021). Integrating AI with Oracle Fusion ERP for Autonomous Financial Close. International Journal of AI, BigData, Computational and Management Studies, 2(2), 76-86. https://doi.org/10.63282/3050-9416.IJAIBDCMS-V2I2P109

55. Rusum, G. P., & Pappula, K. K. (2022). Federated Learning in Practice: Building Collaborative Models While Preserving Privacy. International Journal of Emerging Research in Engineering and Technology, 3(2), 79-88. https://doi.org/10.63282/3050-922X.IJERET-V3I2P109

56. Jangam, S. K., & Pedda Muntala, P. S. R. (2022). Role of Artificial Intelligence and Machine Learning in IoT Device Security. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 3(1), 77-86. https://doi.org/10.63282/3050-9262.IJAIDSML-V3I1P108

57. Anasuri, S. (2022). Next-Gen DNS and Security Challenges in IoT Ecosystems. International Journal of Emerging Research in Engineering and Technology, 3(2), 89-98. https://doi.org/10.63282/3050-922X.IJERET-V3I2P110

58. Pedda Muntala, P. S. R. (2022). Enhancing Financial Close with ML: Oracle Fusion Cloud Financials Case Study. International Journal of AI, BigData, Computational and Management Studies, 3(3), 62-69. https://doi.org/10.63282/3050-9416.IJAIBDCMS-V3I3P108

59. Rusum, G. P. (2023). Secure Software Supply Chains: Managing Dependencies in an AI-Augmented Dev World. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 4(3), 85-97. https://doi.org/10.63282/3050-9262.IJAIDSML-V4I3P110

60. Jangam, S. K., & Karri, N. (2023). Robust Error Handling, Logging, and Monitoring Mechanisms to Effectively Detect and Troubleshoot Integration Issues in MuleSoft and Salesforce Integrations. International Journal of Emerging Research in Engineering and Technology, 4(4), 80-89. https://doi.org/10.63282/3050-922X.IJERET-V4I4P108

61. Anasuri, S. (2023). Synthetic Identity Detection Using Graph Neural Networks. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 4(4), 87-96. https://doi.org/10.63282/3050-9262.IJAIDSML-V4I4P110

62. Pedda Muntala, P. S. R. (2023). AI-Powered Chatbots and Digital Assistants in Oracle Fusion Applications. International Journal of Emerging Trends in Computer Science and Information Technology, 4(3), 101-111. https://doi.org/10.63282/3050-9246.IJETCSIT-V4I3P111

Downloads

Published

2024-03-30

Issue

Section

Articles

How to Cite

1.
Enjam GR, Tekale KM. Self-Healing Microservices for Insurance Platforms: A Fault-Tolerant Architecture Using AWS and PostgreSQL. IJAIBDCMS [Internet]. 2024 Mar. 30 [cited 2025 Oct. 29];5(1):127-36. Available from: https://ijaibdcms.org/index.php/ijaibdcms/article/view/252