Method-Level Performance Instrumentation for Production Java Microservices

Authors

  • Tejendra Patel

DOI:

https://doi.org/10.22399/ijcesen.5035

Keywords:

Bytecode Instrumentation, Adaptive Sampling, Performance Regression Detection, Cost Attribution, Runtime Class Transformation

Abstract

Production Java microservices have an observability gap: infrastructure cost cannot be traced down to the individual method affected by a source code change. Existing Java profilers offer either low-overhead sampling with short retention and unqueryable flame-graph output, or rich event capture with overhead high enough that it must be enabled manually in production. Neither approach supports the automated regression-detection workflows that fast-moving engineering teams need. This paper presents a runtime instrumentation architecture comprising (1) configuration-driven bytecode instrumentation, (2) adaptive per-method sampling that calibrates how much telemetry is collected for each application method to that method's invocation rate, (3) fail-open streaming telemetry that decouples telemetry transport from application execution, and (4) SQL-driven automated regression detection over a columnar data warehouse with extended retention. The system delivers cost and latency telemetry at the method-call level with low overhead, in a structured, queryable form with long-term storage. It automatically detects performance regressions, attributes cost per method, and enables systematic performance governance without source-level modification or manual profiling of the code under assessment.
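The adaptive per-method sampling of component (2) can be sketched roughly as follows. This is a minimal illustration under assumptions, not the paper's implementation: the class name `AdaptiveSampler`, the fixed per-method sample budget, and the windowed recalibration scheme are all hypothetical. The idea is that each instrumented method keeps a sampling probability that a background timer recalibrates from the observed call rate, so hot methods emit roughly a constant number of samples per second while cold methods are recorded in full.

```java
import java.util.concurrent.ThreadLocalRandom;
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical sketch of adaptive per-method sampling: one instance per
// instrumented method. shouldSample() runs on the hot path; recalibrate()
// is driven by a background timer once per window.
public class AdaptiveSampler {
    private final double targetSamplesPerSecond; // telemetry budget per method
    private final AtomicLong callsInWindow = new AtomicLong();
    private volatile double probability = 1.0;   // start by sampling everything

    public AdaptiveSampler(double targetSamplesPerSecond) {
        this.targetSamplesPerSecond = targetSamplesPerSecond;
    }

    /** Called on every instrumented method entry; returns true if this
     *  invocation should record telemetry. */
    public boolean shouldSample() {
        callsInWindow.incrementAndGet();
        return ThreadLocalRandom.current().nextDouble() < probability;
    }

    /** Recalibrates the sampling probability from the call rate observed
     *  over the last window (windowSeconds long), then resets the counter. */
    public void recalibrate(double windowSeconds) {
        long calls = callsInWindow.getAndSet(0);
        double callsPerSecond = calls / windowSeconds;
        probability = callsPerSecond <= targetSamplesPerSecond
                ? 1.0                                      // cold method: keep everything
                : targetSamplesPerSecond / callsPerSecond; // hot method: thin samples out
    }

    public double probability() {
        return probability;
    }
}
```

With a budget of 10 samples/second, a method observed at 1,000 calls/second would be recalibrated to a sampling probability of 0.01, capping its telemetry volume regardless of further traffic growth.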

References

[1] Google Cloud, "Cloud Profiler Overview," 2024. [Online]. Available: https://docs.cloud.google.com/profiler/docs/about-profiler

[2] Oracle, "About Java Flight Recorder," 2024. [Online]. Available: https://docs.oracle.com/javacomponents/jmc-5-4/jfr-runtime-guide/about.htm

[3] Peter Sestoft, "Runtime Code Generation with JVM and CLR," Royal Veterinary and Agricultural University, Copenhagen, and IT University of Copenhagen, Tech. Draft, ver. 1.00, Oct. 2002. [Online]. Available: https://www.itu.dk/~sestoft/rtcg/rtcg.pdf

[4] Oracle, "Java SE Documentation: ThreadMXBean," 2024. [Online]. Available: https://docs.oracle.com/javase/8/docs/api/java/lang/management/ThreadMXBean.html

[5] Tianqi Chen and Carlos Guestrin, "XGBoost: A Scalable Tree Boosting System," in Proc. 22nd ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining, San Francisco, CA, USA, 2016, pp. 785–794. [Online]. Available: https://dl.acm.org/doi/10.1145/2939672.2939785

[6] Scott M. Lundberg and Su-In Lee, "A Unified Approach to Interpreting Model Predictions," in Advances in Neural Information Processing Systems, vol. 30, 2017, pp. 4765–4774. [Online]. Available: https://arxiv.org/pdf/1705.07874

[7] Muhammad Hanif, et al., "SLA-based adaptation schemes in distributed stream processing engines," Applied Sciences, vol. 9, no. 6, p. 1045, 2019. [Online]. Available: https://doi.org/10.3390/app9061045

[8] Sergey Melnik, et al., "Dremel: Interactive analysis of web-scale datasets," Proc. VLDB Endowment, vol. 3, no. 1–2, pp. 330–339, 2010. [Online]. Available: https://doi.org/10.14778/1920841.1920886

[9] Philipp Leitner, et al., "An Exploratory Study of the State of Practice of Performance Testing in Java-Based Open Source Projects," in ICPE '17: Proceedings of the 8th ACM/SPEC on International Conference on Performance Engineering, 2017. [Online]. Available: https://doi.org/10.1145/3030207.3030213

[10] D. Sculley et al., "Hidden technical debt in machine learning systems," in Advances in Neural Information Processing Systems, vol. 28, 2015. [Online]. Available: https://proceedings.neurips.cc/paper_files/paper/2015/file/86df7dcfd896fcaf2674f757a2463eba-Paper.pdf

Published

2026-03-11

How to Cite

Tejendra Patel. (2026). Method-Level Performance Instrumentation for Production Java Microservices. International Journal of Computational and Experimental Science and Engineering, 12(1). https://doi.org/10.22399/ijcesen.5035

Section

Research Article