Chaos Engineering for API-Centric Systems in Telecom & eCommerce

Authors

  • Priyadarshini Jayakumar Independent Researcher, USA. Author

DOI:

https://doi.org/10.63282/3050-922X.IJERET-V6I2P112

Keywords:

Chaos Engineering, API Middleware, Distributed Systems, Resilience Engineering, Telecom Networks, eCommerce Platforms, Cloud-Native Architectures

Abstract

As modern telecom and eCommerce platforms increasingly rely on distributed, API-centric architectures, ensuring system resilience and reliability has become a critical challenge. Chaos engineering offers a proactive approach by deliberately injecting controlled failures into systems to uncover weaknesses before they impact customers. This white paper examines the role of chaos engineering in large-scale telecom and eCommerce environments, focusing on its application to orchestration layers, control planes, service meshes, and edge routing. It identifies five key domains where chaos engineering delivers measurable value to API-driven systems: resiliency testing, observability and monitoring, deployment and release reliability, performance and scalability, and platform evolution and change management. By simulating real-world failure scenarios, organizations can validate assumptions, improve fault tolerance, and strengthen operational readiness. The paper demonstrates how chaos engineering functions not only as a technical practice, but as a strategic capability for safeguarding business continuity in high-scale digital ecosystems.

References

[1] A. Basiri, N. Behnam, R. de Rooij, L. Hochstein, L. Kosewski, J. Reynolds, and C. Rosenthal, “Chaos Engineering,” IEEE Software, vol. 33, no. 3, pp. 35–41, 2016.

[2] J. Owotogbe, I. Kumara, W. van den Heuvel, and D. Tamburri, “Chaos Engineering: A Multi-Vocal Literature Review,” arXiv preprint, arXiv:2412.01416, 2024.

[3] H. Jernberg, P. Runeson, and E. Engström, “Getting Started with Chaos Engineering: Design of an Implementation Framework in Practice,” Proc. ACM/IEEE Int. Symp. Empirical Software Engineering and Measurement (ESEM), pp. 1–10, 2020.

[4] D. Dedousis et al., “Chaos Engineering for Microservices Architectures and Fault Tolerance,” arXiv preprint, 2023.

[5] D. Cotroneo et al., “Chaos Engineering Implementation for Distributed Systems Resilience,” arXiv preprint, 2022.

[6] F. Fogli et al., “Advanced Chaos Engineering Techniques for Cloud-Native Applications,” arXiv preprint, 2023.

[7] Y. Zhang et al., “Chaos Engineering Platforms and Tooling: A Series of Studies,” arXiv preprints, 2019, 2021, 2023.

[8] C. Frank et al., “Chaos Engineering Tools Comparison and Evaluation,” arXiv preprints, 2021, 2023.

[9] J. Simonsson et al., “Chaos Engineering Tooling and Best Practices,” arXiv preprint, 2021.

[10] K. Torkura et al., “Security-Focused Chaos Engineering,” IEEE Conf. Proc., 2020, 2021.

[11] Gremlin Inc., “Chaos Engineering: Finding Failures Before They Become Outages,” Industry Whitepaper, 2023.

[12] LTIMindtree, “Adopting Chaos Engineering – Part II,” Technical Report, 2023.

[13] Amazon Web Services, “Chaos Engineering in the Cloud,” AWS Technical Guide, 2022.

[14] S. Palani and R. Gupta, “Chaos Engineering Frameworks: A Technical Overview,” arXiv preprint, 2023.

[15] R. Rivera et al., “Measuring Resiliency of Systems Using Chaos Engineering Experiments,” Riverside Research Technical Report, 2023.

[16] I. Konstantinou et al., “Chaos Engineering for Kubernetes and Container Orchestration,” arXiv preprint, 2021.

[17] O. Bedoya et al., “Chaos Engineering Evaluation Methodologies,” arXiv preprint, 2023.

Downloads

Published

2025-05-29

Issue

Section

Articles

How to Cite

1.
Jayakumar P. Chaos Engineering for API-Centric Systems in Telecom & eCommerce. IJERET [Internet]. 2025 May 29 [cited 2026 Jan. 26];6(2):98-110. Available from: https://ijeret.org/index.php/ijeret/article/view/391