Leveraging Cloud Object Storage Mechanisms for Analyzing Massive Datasets

Authors

  • Sarbaree Mishra Program Manager at Molina Healthcare Inc., USA. Author

DOI:

https://doi.org/10.63282/3050-922X.IJERET-V2I1P106

Keywords:

Cloud object storage, big data analytics, massive datasets, scalability, unstructured data, data analysis, cost-effectiveness, data storage, cloud storage, data management, performance optimization, data processing, infrastructure efficiency, cloud-based solutions, data access, data scalability, storage cost reduction, big data management, flexible storage

Abstract

Cloud object storage has become an important part of managing & analyzing these huge datasets. It gives businesses a flexible & useful way to store & analyze their both structured & unstructured information. The huge and growing volume of data in these various fields sometimes makes it impossible for ordinary data storage systems to handle it all. Cloud object storage overcomes these problems by being scalable, durable & cost-effective. It gives businesses a single place to store and access huge amounts of data easily. This article talks about how cloud object storage & modern data analytics tools may help businesses get more crucial information from huge datasets. Cloud object storage is built to handle a lot of the information well, with flexible access & robust security features. Cloud object storage is important for big data analytics because it lets you handle more enormous amounts of the information & use advanced analytical approaches like ML & AI. Cloud object storage employs data lakes & associated their distributed computing frameworks to make sure that big datasets are always ready for actual time analysis. This helps people make decisions faster & makes it easier to make better ones. The research looks at strategies to make cloud object storage work better, such as data tiering & the caching, which make it faster to access and cheaper to use. It also speaks about how businesses in banking, healthcare & e-commerce use cloud object storage to have an advantage over their competition. Organizations may go beyond what conventional storage can do by using the flexibility of their cloud storage. This gives rise to the latest opportunities for development and the latest ideas. Cloud object storage helps organizations make choices based on their information in a digital environment that is continually changing. It also makes it simpler to handle these huge datasets

References

[1] Rupprecht, L., Zhang, R., Owen, B., Pietzuch, P., & Hildebrand, D. (2017, April). SwiftAnalytics: Optimizing object storage for big data analytics. In 2017 IEEE International Conference on Cloud Engineering (IC2E) (pp. 245-251). IEEE.

[2] Chen, H. M., Chang, K. C., & Lin, T. H. (2016). A cloud-based system framework for performing online viewing, storage, and analysis on big data of massive BIMs. Automation in Construction, 71, 34-48.

[3] Mohammad, Abdul Jabbar. “Sentiment-Driven Scheduling Optimizer”. International Journal of Emerging Research in Engineering and Technology, vol. 1, no. 2, June 2020, pp. 50-59

[4] Dey, S., Chakraborty, A., Naskar, S., & Misra, P. (2012, October). Smart city surveillance: Leveraging benefits of cloud data stores. In 37th Annual IEEE Conference on Local Computer Networks-Workshops (pp. 868-876). IEEE.

[5] Sai Prasad Veluru. “Real-Time Fraud Detection in Payment Systems Using Kafka and Machine Learning”. JOURNAL OF RECENT TRENDS IN COMPUTER SCIENCE AND ENGINEERING ( JRTCSE), vol. 7, no. 2, Dec. 2019, pp. 199-14

[6] Armbrust, M., Das, T., Sun, L., Yavuz, B., Zhu, S., Murthy, M., ... & Zaharia, M. (2020). Delta lake: high-performance ACID table storage over cloud object stores. Proceedings of the VLDB Endowment, 13(12), 3411-3424.

[7] Manda, Jeevan Kumar. "Cloud Security Best Practices for Telecom Providers: Developing comprehensive cloud security frameworks and best practices for telecom service delivery and operations, drawing on your cloud security expertise." Available at SSRN 5003526 (2020).

[8] Belcastro, L., Marozzo, F., Talia, D., & Trunfio, P. (2017). Big data analysis on clouds. Handbook of big data technologies, 101-142.

[9] Shaik, Babulal. "Network Isolation Techniques in Multi-Tenant EKS Clusters." Distributed Learning and Broad Applications in Scientific Research 6 (2020).

[10] Adedugbe, O., Benkhelifa, E., Campion, R., Al-Obeidat, F., Bani Hani, A., & Jayawickrama, U. (2020). Leveraging cloud computing for the semantic web: review and trends. Soft Computing, 24(8), 5999-6014.

[11] Allam, Hitesh. Exploring the Algorithms for Automatic Image Retrieval Using Sketches. Diss. Missouri Western State University, 2017.

[12] Qolomany, B., Al-Fuqaha, A., Gupta, A., Benhaddou, D., Alwajidi, S., Qadir, J., & Fong, A. C. (2019). Leveraging machine learning and big data for smart buildings: A comprehensive survey. IEEE access, 7, 90316-90356.

[13] Nookala, G. (2020). Automation of privileged access control as part of enterprise control procedure. Journal of Big Data and Smart Systems, 1(1).

[14] Jani, Parth. "Privacy-Preserving AI in Provider Portals: Leveraging Federated Learning in Compliance with HIPAA." The Distributed Learning and Broad Applications in Scientific Research 6 (2020): 1116-1145.

[15] Veluru, Sai Prasad. "Threat Modeling in Large-Scale Distributed Systems." International Journal of Emerging Research in Engineering and Technology 1.4 (2020): 28-37.

[16] Cai, H., Xu, B., Jiang, L., & Vasilakos, A. V. (2016). IoT-based big data storage systems in cloud computing: perspectives and challenges. IEEE Internet of Things Journal, 4(1), 75-87.

[17] Manda, Jeevan Kumar. "5G Network Slicing: Use Cases and Security Implications." Available at SSRN 5003611 (2021).

[18] Chen, J., Douglas, C., Mutsuzaki, M., Quaid, P., Ramakrishnan, R., Rao, S., & Sears, R. (2012, May). Walnut: a unified cloud object store. In Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data (pp. 743-754).

[19] Patel, Piyushkumar. "Remote Auditing During the Pandemic: The Challenges of Conducting Effective Assurance Practices." Distributed Learning and Broad Applications in Scientific Research 6 (2020): 806-23.

[20] Immaneni, J. (2021). Scaling Machine Learning in Fintech with Kubernetes. International Journal of Digital Innovation, 2(1).

[21] Fazio, M., Celesti, A., Puliafito, A., & Villari, M. (2015). Big data storage in the cloud for smart environment monitoring. Procedia Computer Science, 52, 500-506.

[22] Jani, Parth. "Real-Time Patient Encounter Analytics with Azure Databricks during COVID-19 Surge." The Distributed Learning and Broad Applications in Scientific Research 6 (2020): 1083-1115.

[23] Nookala, Guruprasad. "End-to-End Encryption in Data Lakes: Ensuring Security and Compliance." Journal of Computing and Information Technology 1.1 (2021).

[24] Demirkan, H., & Delen, D. (2013). Leveraging the capabilities of service-oriented decision support systems: Putting analytics and big data in cloud. Decision Support Systems, 55(1), 412-421.

[25] Immaneni, J. (2020). Building MLOps Pipelines in Fintech: Keeping Up with Continuous Machine Learning. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 1(2), 22-32.

[26] Brim, M. J., Dillow, D. A., Oral, S., Settlemyer, B. W., & Wang, F. (2013, November). Asynchronous object storage with QoS for scientific and commercial big data. In Proceedings of the 8th parallel data storage workshop (pp. 7-13).

[27] Jani, Parth, and Sarbaree Mishra. "Data Mesh in Federally Funded Healthcare Networks." The Distributed Learning and Broad Applications in Scientific Research 6 (2020): 1146-1176.

[28] Arugula, Balkishan, and Sudhkar Gade. “Cross-Border Banking Technology Integration: Overcoming Regulatory and Technical Challenges”. International Journal of Emerging Research in Engineering and Technology, vol. 1, no. 1, Mar. 2020, pp. 40-48

[29] Yaseen, M. U., Anjum, A., Rana, O., & Hill, R. (2018). Cloud-based scalable object detection and classification in video streams. Future Generation Computer Systems, 80, 286-298.

[30] Manda, Jeevan Kumar. "Securing Remote Work Environments in Telecom: Implementing Robust Cybersecurity Strategies to Secure Remote Workforce Environments in Telecom, Focusing on Data Protection and Secure Access Mechanisms." Focusing on Data Protection and Secure Access Mechanisms (April 04, 2020) (2020).

[31] Patel, Piyushkumar, and Hetal Patel. "Lease Modifications and Rent Concessions under ASC 842: COVID-19’s Lasting Impact on Lease Accounting." Distributed Learning and Broad Applications in Scientific Research 6 (2020): 824-41.

[32] Atitallah, S. B., Driss, M., Boulila, W., & Ghézala, H. B. (2020). Leveraging Deep Learning and IoT big data analytics to support the smart cities development: Review and future directions. Computer Science Review, 38, 100303.

[33] Ahmed, E. S. A., & Saeed, R. A. (2014). A survey of big data cloud computing security. International Journal of Computer Science and Software Engineering (IJCSSE), 3(1), 78-85.

[34] Sreejith Sreekandan Nair, Govindarajan Lakshmikanthan (2020). Beyond VPNs: Advanced Security Strategies for the Remote Work Revolution. International Journal of Multidisciplinary Research in Science, Engineering and Technology 3 (5):1283-1294.

Downloads

Published

2021-03-30

Issue

Section

Articles

How to Cite

1.
Mishra S. Leveraging Cloud Object Storage Mechanisms for Analyzing Massive Datasets. IJERET [Internet]. 2021 Mar. 30 [cited 2025 Oct. 12];2(1):47-56. Available from: https://ijeret.org/index.php/ijeret/article/view/229