Observability in Microservices: From Traditional Monitoring to Distributed System Intelligence
DOI:
https://doi.org/10.15680/IJCTECE.2022.0506023Keywords:
PGP Key Management, B2B Data Exchange, Cryptographic Governance, Regulatory Compliance, Auditability, Key Lifecycle Automation, Secure IntegrationAbstract
The adoption of microservices structures has transformed the world of software development with unmatched scalability and responsiveness through which monolithic software can be broken down to separate units of manageable software. This change, however, has also added complexities of operations that can best be accommodated by traditional monitoring tools whose design is mainly done in good, host-based settings. This paper discusses how the reactive monitoring can be transformed into proactive and intelligent observability. We compile the findings of 30 current research papers to analyze the shortcomings of legacy systems, the principles of observability (logs, metrics, and traces) and the application of the artificial intelligence to handle high-cardinality telemetry data. The paper draws out the importance of distributed tracing and service mesh telemetry because they have led to the need to have visibility of the inter-service dependencies. We suggest an Observability Maturity Model, which helps organizations to overcome simple infrastructure monitoring to self-healing and predictive systems. The paper argues the role of advanced observability in determining the minimization of Mean Time to Recovery (MTTR) and enhancing the reliability of the system through different case studies.
References
[1] Gupta, A. (2021, September 28). Scalable distributed tracing and performance optimization in microservices. https://espjeta.org/jeta-v1i1p122
[2] Li, B., Peng, X., Xiang, Q., Wang, H., Xie, T., Sun, J., & Liu, X. (2021). Enjoy your observability: an industrial survey of microservice tracing and analysis. Empirical Software Engineering, 27(1), 25. https://doi.org/10.1007/s10664-021-10063-9
[3] Saminathan, M., Bhattacharyya, S., & Bairi, A. R. (2021, June 17). End-to-End observability in Cloud-Native systems: integrating distributed tracing and Real-Time analytics. https://thesciencebrigade.org/jst/article/view/566
[4] Narayanan, K. & Pondicherry University. (2021). Observability-Driven optimization of cloud and distributed systems. In International Journal of Science, Engineering and Technology (Vols. 9–6, p. 2). https://www.ijset.in/wp-content/uploads/IJSET_V9_issue6_538.pdf
[5] Waseem, M., Liang, P., Shahin, M., Amleto, D. S., & Márquez, G. (2021). Design, monitoring, and testing of Microservices Systems: The Practitioners’ Perspective. arXiv (Cornell University). https://doi.org/10.48550/arxiv.2108.03384
[6] Gupta, A. (2021). Scalable distributed tracing and performance optimization in microservices [Original Article]. ESP Journal of Engineering & Technology Advancements, 1–1(1), 210–224. https://espjeta.org/Volume1-Issue1/JETA-V1I1P122.pdf
[7] Burramukku, N. R. & IJSRET. (2021). Cloud-Native Network Monitoring: tools, architectures, and best practices. In International Journal of Scientific Research & Engineering Trends (Vol. 7, Issue 5) [Journal-article]. https://ijsret.com/wp-content/uploads/IJSRET_V7_issue5_726.pdf
[8] Niedermaier, S., Koetter, F., Freymann, A., & Wagner, S. (2019). On Observability and Monitoring of Distributed Systems: An Industry Interview study. arXiv (Cornell University). https://doi.org/10.48550/arxiv.1907.12240
[9] Thalheim, J., Rodrigues, A., Akkus, I. E., Bhatotia, P., Chen, R., Viswanath, B., Jiao, L., & Fetzer, C. (2017). Sieve: Actionable Insights from Monitored Metrics in Microservices. arXiv (Cornell University). https://doi.org/10.48550/arxiv.1709.06686
[10] Ramamoorthi, V. (2020, January 15). Machine learning models for anomaly detection in microservices. https://vectoral.org/index.php/QJETI/article/view/145

