AI-Powered ETL Automation for Compliant Data Migration

Authors

  • Anusha Joodala Independent Researcher, USA. Author

DOI:

https://doi.org/10.63282/3050-9416.IJAIBDCMS-V6I4P116

Keywords:

ETL, Data Migration, GDPR, HIPAA, AI, Schema Matching, Data Lineage, DataOps, Compliance Automation

Abstract

Data governance poses an increasingly complex challenge for organizations that operate on multiple heterogeneous systems, including various clouds, to ensure compliance with numerous regulations, like GDPR or HIPAA. Whereas traditional approaches to ETL (Extract–Transform–Load) have problems with manual schema mapping, weak transformation logic, and traceability, AutoETL-C provides a solution to these problems with a suggested depreciation of ETL, as it incorporates AI to deliver an automated and policy-compliant ETL. The AI to automate ETL creates a more seamless experience for users, combining (i IJAIBDCMS-V6I4P116  neural schema matching via domain-adapted language models, (ii) hybrid transformation through rules and learning, automated constraint discovery, and invasive trace law, (iii) data quality monitoring with continuous anomaly detection, (iv) and full-coverage lineage capture, including all governance and compliance limited controls. AutoETL-C was tested on three migration scenarios—core-to-lakehouse for banking, EHR modernization for healthcare, and ERP-to-warehouse for retail—public and synthetic datasets as well as anonymized enterprise data were used. AutoETL-C outperformed its manually configured ETL, template-based ETL and ML-assisted mapping, setting a new industry standard while proving compliance with Shelby Control Model documentation, HIPAA Security Rule controls, and data maps. Overall, results suggest that migration risk and time-to-value can be substantially improved with augmented, privacy-aware data engineering

References

1. Goel, P., Jain, P., Pasman, H. J., Pistikopoulos, E. N., & Datta, A. (2020). “Integration of data analytics with cloud services for safer process systems, application examples and implementation challenges.” In Journal of Loss Prevention in the Process Industries (Vol. 68, p. 104316). Elsevier BV. https://doi.org/10.1016/j.jlp.2020.104316

2. A. Ramachandran, "AI-Driven Approaches to Enterprise Data Migration: A Comparative Analysis," ResearchGate, 2024. [Online]. Available: https://www.researchgate.net/publication/383450441_Harnessing_Ad vanced_Artificial_Intelligence_for_Enhanced_Enterprise_Data_Migr ation_A_Comprehensive_Analysis.

3. B. K. Pandey, A. Tanikonda, S. R. Peddinti, and S. R. Katragadda, "AI- Driven Methodologies for Mitigating Technical Debt in Legacy Systems," Journal of Science & Technology (JST), vol. 2, no. 2, pp. 1- 10, Apr.-Jul. 2021.

4. V. B. R. Soperla, "AI-Enhanced Data Migration Strategy for Legacy Systems," International Journal of Research in Computer Applications and Information Technology (IJRCAIT), vol. 8, no. 2, pp. 55-89, 2025.

5. Shubhodip Sasmal, "AI-powered Data Migration: Challenges and Solutions, ", ResearchGate, 2022, https://www.researchgate.net/publication/379036031_AI-powered_Data_Migration_Challenges_and_Solutions

6. O. Gierszal, "Data Migration Strategy for a Legacy App: Step-by-Step Guide," Brainhub, 2024.

7. R. Dhall and R. Sharma, "Mitigating the Challenges of Legacy Modernization and Fast-Tracking Outcomes with High-Value Generative AI Use Cases," Birlasoft, 2024.

8. V. B. R. Soperla, "AI-Enhanced Data Migration Strategy for Legacy Systems," International Journal of Research in Computer Applications and Information Technology (IJRCAIT), vol. 8, no. 2, pp. 55-89, 2025.

9. B. K. Pandey, A. Tanikonda, S. R. Peddinti, and S. R. Katragadda, "AI- Driven Methodologies for Mitigating Technical Debt in Legacy Systems," Journal of Science & Technology (JST), vol. 2, no. 2, pp. 1- 10, Apr.-Jul. 2021.

10. Miyamoto N., Higuchi K., Tsuji T. Incremental Data Migration for Multi-database Systems Based on MySQL with Spider Storage Engine. In: IIAI 3rd International Conference on Advanced Applied Informatics. 2014. p. 745-750.

11. Hussain Sh. Beyond Theory: Practical Approaches to Modern Data Migration Challenges. 2025. Available at: https://www.researchgate.net/

12. Thalheim B., Wang Q. Towards a Theory of Refinement for Data Migration // Part of the Lecture Notes in Computer Science book series (LNISA, volume 6998). 2011. P. 1-14. DOI: 10.1016/j.datak.2012.12.003 18.

13. Bahssas D.M., AlBar A.M., Hoque R. Enterprise Resource Planning (ERP) Systems: Design, Trends and Deployment. The International Technology Management Review. 2015;5:72-81. DOI: 10.2991/itmr.2015.5.2.2

14. Mason R.T. NoSQL databases and data modeling techniques for a document-oriented NoSQL database. In: Informing Science & IT Education Conference (InSITE). 2015. P. 259-268. DOI: 10.1109/BigDataCongress.2016.16

15. Loginovsky O.V., Maksimov A.A., Burkov V.N., Burkova I.V., Gelrud Ya.D., Korennaya K.A., Shestakov A.L. Upravlenie promyshlennymi predpriyatiyami: strategii, mekhanizmy, sistemy: monogr. [Industrial Enterprise Management: Strategies, Mechanisms, Systems. Monograph]. Moscow: Infra-M Publ.; 2018. 410 p.

16. Kanji, R. K. (2021). Real-Time Big Data Processing with Edge Computing. European Journal of Advances in Engineering and Technology, 8(11), 152-155.

Downloads

Published

2025-11-16

Issue

Section

Articles

How to Cite

1.
Joodala A. AI-Powered ETL Automation for Compliant Data Migration. IJAIBDCMS [Internet]. 2025 Nov. 16 [cited 2026 Jan. 13];6(4):142-53. Available from: https://ijaibdcms.org/index.php/ijaibdcms/article/view/343