AdaptCacheAI: Adaptive Hybrid Caching with Machine-Learned Eviction for Dynamic Cloud Workloads
DOI:
https://doi.org/10.63282/3050-922X.IJERET-V4I1P118Keywords:
Hybrid Caching, Cloud Computing, Machine Learning, Eviction Policy, Adaptive Systems, Dynamic Workloads, Cache Optimization, Predictive Modeling, Multi-Tier StorageAbstract
Dynamic cloud workloads bring challenges such as high variability, multitenant interference, and heterogeneous access patterns, which make it necessary to have effective cache eviction to retain low latency and cost efficiency. Conventional eviction algorithms like LRU, LFU, ARC, and W-TinyLFU, which are based on static heuristics, give good results under stable conditions but their performance decreases significantly when request patterns change rapidly or differ across applications. AdaptCacheAI, to solve these problems, features an adaptive hybrid caching framework that combines machine learning–driven eviction with a multi-tier cache architecture going from memory to SSD layers. A predictive eviction model that, in real time, calculates the probability of reuse of an object is at the center, supported by a workload classification engine that is capable of recognizing patterns like temporal hotspots, bursty traffic, and tenant-specific behaviors. The continuous feedback loop that accounts for signals such as latency, eviction regret, and tier pressure refurbishes the decisions and thus allows AdaptCacheAI to modify policies on a dynamic basis. Tests on diversified traces confirm the substantial improvements that traditional algorithms can make through the employment of AdaptCacheAI, such as increased hit rates, decreased tail latencies, and lower SSD write amplification, all resulting in tangible performance gains and cost savings. AdaptCacheAI serves as a demonstration of the feasibility of employing machine learning for intelligent cache management and also it paves the way for the potential of future enhancements such as reinforcement learning–based optimization and cross-layer coordination in cloud systems.
References
[1] Singh, G., Nadig, R., Park, J., Bera, R., Hajinazar, N., Novo, D., & Mutlu, O. (2022, June). Sibyl: Adaptive and extensible data placement in hybrid storage systems using online reinforcement learning. In Proceedings of the 49th Annual International Symposium on Computer Architecture (pp. 320-336).
[2] Sethi, K., & Suri, M. (2020). NV-Fogstore: Device-aware hybrid caching in fog computing environments. arXiv preprint arXiv:2010.10562.
[3] Guim, F., Metsch, T., Moustafa, H., Verrall, T., Carrera, D., Cadenelli, N., ... & Prats, R. G. (2021). Autonomous lifecycle management for resource-efficient workload orchestration for green edge computing. IEEE Transactions on Green Communications and Networking, 6(1), 571-582.
[4] Datla, Lalith Sriram. “Identity Threat Detection: Techniques for Preventing Credential Abuse in Cloud Systems”. International Journal of Emerging Trends in Computer Science and Information Technology, vol. 2, no. 4, Dec. 2021, pp. 95-104
[5] Li, P., Guo, Y., & Gu, Y. (2022, November). Predicting reuse interval for optimized web caching: an lstm-based machine learning approach. In SC22: International Conference for High Performance Computing, Networking, Storage and Analysis (pp. 1-15). IEEE.
[6] Guntupalli, Bhavitha. "Unit Testing in ETL Workflows: Why It Matters and How to Do It." International Journal of Artificial Intelligence, Data Science, and Machine Learning 2.4 (2021): 38-50.
[7] Meoni, M., Perego, R., & Tonellotto, N. (2018). Dataset popularity prediction for caching of CMS big data. Journal of Grid Computing, 16(2), 211-228.
[8] Cavdar, D., Chen, L. Y., & Alagoz, F. (2015). Priority scheduling for heterogeneous workloads: Tradeoff between evictions and response time. IEEE Systems Journal, 11(2), 684-695.
[9] Fernando, D., Bagdi, H., Hu, Y., Yang, P., Gopalan, K., Kamhoua, C., & Kwiat, K. (2016, December). Quick eviction of virtual machines through proactive live snapshots. In Proceedings of the 9th International Conference on Utility and Cloud Computing (pp. 99-107).
[10] Cidon, A., Eisenman, A., Alizadeh, M., & Katti, S. (2015). Dynacache: Dynamic cloud caching. In 7th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud 15).
[11] Parakala, Adityamallikarjunkumar. "Role Evolution: Developer, Analyst, Lead, Senior." American International Journal of Computer Science and Technology 4.3 (2022): 11-19.
[12] Yang, F., Pang, B., Zhang, J., Qiao, B., Wang, L., Couturier, C., ... & Zhang, D. (2022, April). Spot virtual machine eviction prediction in microsoft cloud. In Companion Proceedings of the Web Conference 2022 (pp. 152-156).
[13] Panneerselvam, J., Liu, L., Antonopoulos, N., & Trovati, M. (2016, March). Latency-aware empirical analysis of the workloads for reducing excess energy consumptions at cloud datacentres. In 2016 IEEE Symposium on Service-Oriented System Engineering (SOSE) (pp. 44-52). IEEE.
[14] Parakala, Adityamallikarjunkumar, and Rangaram Pothula. "AI+ Document Understanding in UiPath: Solving Real Government Problems." International Journal of Artificial Intelligence, Data Science, and Machine Learning 3.3 (2022): 111-122.
[15] Xu, Y., Musgrave, Z., Noble, B., & Bailey, M. (2014). Workload-aware provisioning in public clouds. IEEE internet computing, 18(4), 15-21.
[16] Cheng, D., Wang, Y., & Dai, D. (2021). Dynamic resource provisioning for iterative workloads on apache spark. IEEE Transactions on Cloud Computing, 11(1), 639-652.
[17] Guntupalli, Bhavitha. "Writing Maintainable Code in Fast-Moving Data Projects." International Journal of Emerging Trends in Computer Science and Information Technology 3.2 (2022): 65-74.
[18] Lykouris, T., & Vassilvitskii, S. (2018). Better Caching with Machine Learned Advice.
[19] Rohatgi, D. (2020). Near-optimal bounds for online caching with machine learned advice. In Proceedings of the Fourteenth Annual ACM-SIAM Symposium on Discrete Algorithms (pp. 1834-1845). Society for Industrial and Applied Mathematics.
[20] Babaioff, M., Lempel, R., Lucier, B., Menache, I., Slivkins, A., & Wong, S. C. W. (2022, April). Truthful online scheduling of cloud workloads under uncertainty. In Proceedings of the ACM Web Conference 2022 (pp. 151-161).
[21] Gali, V. K. (2021). Predictive Forecasting and Strategic Approach in Oracle Fusion ERP: Intelligent Planning Models. International Journal of AI, BigData, Computational and Management Studies, 2(3), 82-92. https://doi.org/10.63282/3050-9416.IJAIBDCMS-V2I3P110