Automated Root Cause Analysis in Distributed Cloud Environments: An Unsupervised AIOps Approach Using BigQuery ML

Authors

  • Arun Harikrishnan

DOI:

https://doi.org/10.22399/ijcesen.4996

Keywords:

AIOps, Unsupervised Learning, Anomaly Detection, Root Cause Analysis, Cloud Operations

Abstract

Modern distributed cloud environments present unprecedented challenges for operational monitoring and incident management, where traditional rule-based systems fail to effectively handle the scale and complexity of containerized microservices architectures. This article presents an advanced AIOps framework that leverages unsupervised machine learning techniques through BigQuery ML to automate anomaly detection and root cause analysis in distributed cloud systems. The framework aggregates heterogeneous telemetry data from container orchestration platforms, application traces, and network flow logs into unified analytics pipelines that establish dynamic operational baselines without requiring labeled training datasets. The implementation employs clustering algorithms and statistical deviation detection mechanisms to identify anomalous patterns across multiple system layers while suppressing operational noise and false positive alerts. Experimental evaluation in production environments demonstrates substantial improvements in detection accuracy, significant reductions in mean time to detection and resolution, and enhanced operational efficiency for site reliability engineering teams. The SQL-based machine learning implementation enables operations teams to deploy sophisticated analytics without specialized data science expertise, providing scalable anomaly detection capabilities that adapt to evolving operational patterns in dynamic cloud environments.

References

[1] David Bernstein, "Containers and Cloud: From LXC to Docker to Kubernetes," IEEE Cloud Computing, 2014. Available: https://ieeexplore.ieee.org/document/7036275

[2] Varun Chandola et al., "Anomaly Detection: A Survey," ACM Computing Surveys, 2009. Available: https://www.researchgate.net/publication/220565847_Anomaly_Detection_A_Survey

[3] Yingnong Dang et al., "AIOps: Real-World Challenges and Research Innovations," IEEE Network, 2018. Available: https://ieeexplore.ieee.org/document/8802836

[4] Xue Yang et al., "ADT: Time series anomaly detection for cyber-physical systems via deep reinforcement learning," ScienceDirect, 2024. Available: https://www.sciencedirect.com/science/article/pii/S0167404824001263

[5] Hui Kang et al., "Container and Microservice Driven Design for Cloud Infrastructure DevOps," IEEE International Conference on Cloud Computing Technology and Science, 2016. Available: https://ieeexplore.ieee.org/document/7484185

[6] Albert Bifet et al., "Machine Learning for Data Streams with Practical Examples in MOA," ResearchGate, 2019. Available: https://www.researchgate.net/publication/323993210_Machine_Learning_for_Data_Streams_with_Practical_Examples_in_MOA

[7] I. Gethzi Ahila Poornima et al., "Anomaly detection in wireless sensor network using machine learning algorithm," ScienceDirect, 2020. Available: https://www.sciencedirect.com/science/article/abs/pii/S0140366419309673

[8] Miguel G. Xavier et al., "Performance Evaluation of Container-Based Virtualization for High Performance Computing Environments," IEEE Xplore, 2013. Available: https://ieeexplore.ieee.org/document/6498558

[9] Mohiuddin Ahmed et al., "A survey of network anomaly detection techniques," ScienceDirect, 2016. Available: https://www.sciencedirect.com/science/article/abs/pii/S1084804515002891

[10] Robin Sommer, Vern Paxson, "Outside the Closed World: On Using Machine Learning for Network Intrusion Detection," IEEE Symposium on Security and Privacy, 2010. Available: https://ieeexplore.ieee.org/document/5504793

Downloads

Published

2026-03-05

How to Cite

Arun Harikrishnan. (2026). Automated Root Cause Analysis in Distributed Cloud Environments: An Unsupervised AIOps Approach Using BigQuery ML. International Journal of Computational and Experimental Science and Engineering, 12(1). https://doi.org/10.22399/ijcesen.4996

Issue

Section

Research Article