Data-Driven Pharmaceutical Sales Analysis and Prediction in the Healthcare Industry
DOI:
https://doi.org/10.63282/3050-922X.IJERET-V7I2P120Keywords:
Sales Forecasting, Machine Learning, Time Series Analysis, Pharmaceutical Industry, Seasonality EffectsAbstract
Forecasting pharmaceutical drug sales is a significant challenge for healthcare organizations and pharmaceutical companies due to factors such as seasonality, weather conditions, local health crises, import issues, currency fluctuations, and economic instability. These factors can contribute to shortages of drugs or a surplus of drugs, impacting both drug availability and drug operations. The challenges discussed above have been addressed in this study by presenting an efficient model for Pharmaceutical Sales Analysis and Forecasting using a combination of Hybrid Random Forest (RF) and Gated Recurrent Unit (RF+GRU) model on the Pharmaceutical sales dataset available from Kaggle. The data was preprocessed using techniques like outlier detection, handling missing data, label encoding, normalization, feature extraction, and data balancing using SMOTE to improve data quality and predictive accuracy. The hybrid model proposed is built by combining the best of both worlds from the feature extraction capabilities of RF and the sequential learning capability of GRU in order to increase forecast accuracy and minimize forecast error. A 96.1% accuracy (ACC) rate, 96.8% precision (PRE), 96% recall (REC), and 96.4% F1-score (F1) were attained by the suggested RF+GRU model in comparison to the other models, including XGBoost, SVM, and KNN, according to the experimental data. The results demonstrate that the hybrid forecasting model is reliable and effective for predicting sales for the pharmaceutical industry, and can be utilized to support the inventory management system and healthcare decision-making system.
References
[1] S. Dharmavaram and P. Bhanushali, “Machine Intelligence-Driven Forecasting for ED Triage and Dynamic Hospital Patient Routing,” Feb. 2026. doi: 10.64898/2026.02.18.26346566.
[2] A. G. Kravets, M. A. Al-Gunaid, V. I. Loshmanov, S. S. Rasulov, and L. B. Lempert, “Model of medicines sales forecasting taking into account factors of influence,” J. Phys. Conf. Ser., vol. 1015, p. 032073, May 2018, doi: 10.1088/1742-6596/1015/3/032073.
[3] M. R. Anand, “Enhancing Pharmaceutical Supply Chains with Densenet-121 and 1D CNN Integration,” in 2025 Global Conference in Emerging Technology (GINOTECH), IEEE, May 2025, pp. 1–7. doi: 10.1109/GINOTECH63460.2025.11076754.
[4] J. Jiménez-Luna, F. Grisoni, N. Weskamp, and G. Schneider, “Artificial intelligence in drug discovery: recent advances and future perspectives,” Expert Opin. Drug Discov., vol. 16, no. 9, pp. 949–959, Sep. 2021, doi: 10.1080/17460441.2021.1909567.
[5] R. S. Snehamruth, “Data-Driven Optimization of Pharmaceutical Manufacturing Processes using Quality by Design ( QbD ) Frameworks,” Int. J. Curr. Eng. Technol., vol. 14, no. 6, pp. 557–566, 2024, doi: 10.14741/ijcet/v.14.6.19.
[6] P. Kumar, “Edge Computing and IoT for Real-Time Healthcare Data Processing and Integration,” in 2025 4th International Conference on Applied Artificial Intelligence and Computing (ICAAIC), IEEE, Dec. 2025, pp. 105–110. doi: 10.1109/ICAAIC64647.2025.11331211.
[7] C. Tayal and S. Murumkar, “Patient Identity Protection and Duplicate Record Prevention in Electronic Health Record (EHR) Systems,” in 2026 18th International Conference on Knowledge and Smart Technology (KST), 2026, pp. 458–464. doi: 10.1109/KST67832.2026.11431915.
[8] S. Besma, C. Rachid, and K. Abdelaziz, “For an Effective Management of the Functional Capacities of Companies: A Study of Pharmaceutical Companies,” Int. J. Saf. Secur. Eng., vol. 11, no. 5, pp. 557–563, Oct. 2021, doi: 10.18280/ijsse.110507.
[9] F. Silva-Aravena, I. Ceballos-Fuentealba, and E. Álvarez-Miranda, “Inventory Management at a Chilean Hospital Pharmacy: Case Study of a Dynamic Decision-Aid Tool,” Mathematics, vol. 8, no. 11, p. 1962, Nov. 2020, doi: 10.3390/math8111962.
[10] J. W. Sajja and G. B. Komarina, “Enhancing compliance and data integrity in life sciences and healthcare with S/4HANA: A data management and governance framework,” World J. Adv. Eng. Technol. Sci., vol. 15, no. 2, pp. 2816–2827, May 2025, doi: 10.30574/wjaets.2025.15.2.0843.
[11] S. Mahmud, “AI and Data Analytics for Enhancing Home Healthcare: Optimizing Patient Outcomes and Resource Allocation,” Front. Appl. Eng. Technol., vol. 2, no. 1, pp. 23–100, 2025, doi: 10.70937/faet.v2i01.61.
[12] J. A. Kachhia, “Healthcare Predictive Analytics Based on Machine Learning Techniques for Identifying Cardiovascular Risk Screening,” Int. J. Curr. Eng. Technol., vol. 13, no. 6, pp. 635–642, Dec, 2023, doi: https://doi.org/10.14741/IJCET/V.13.6.17.
[13] S. Golriz Khatami, S. Mubeen, V. S. Bharadhwaj, A. T. Kodamullil, M. Hofmann-Apitius, and D. Domingo-Fernández, “Using predictive machine learning models for drug response simulation by calibrating patient-specific pathway signatures,” npj Syst. Biol. Appl., vol. 7, no. 1, p. 40, oct. 2021, doi: 10.1038/s41540-021-00199-1.
[14] K.-K. Mak and M. R. Pichika, “Artificial intelligence in drug development: present status and future prospects,” Drug Discov. Today, vol. 24, no. 3, pp. 773–780, Mar. 2019, doi: 10.1016/j.drudis.2018.11.014.
[15] P. Kelle, J. Woosley, and H. Schneider, “Pharmaceutical supply chain specifics and inventory solutions for a hospital case,” Oper. Res. Heal. Care, vol. 1, no. 2–3, pp. 54–63, Jun. 2012, doi: 10.1016/j.orhc.2012.07.001.
[16] A. Aliper et al., “Prediction of Clinical Trials Outcomes Based on Target Choice and Clinical Trial Design with Multi-Modal Artificial Intelligence,” Clin. Pharmacol. Ther., vol. 114, no. 5, pp. 972–980, 2023, doi: 10.1002/cpt. 3008.
[17] P. Kumar, “Leveraging Generative AI for Automated Data Standardization and Interoperability in Healthcare,” in 2025 4th International Conference on Applied Artificial Intelligence and Computing (ICAAIC), IEEE, Dec. 2025, pp. 99–104. doi: 10.1109/ICAAIC64647.2025.11330217.
[18] M. R. Anand and A. K. S, “Temporal Fusion Transformer Forecasting and MILP Prescriptive Optimization for Hospital Pharmacy Supply Chain Orchestration,” in 2025 9th International Conference on Electronics, Communication and Aerospace Technology (ICECA), IEEE, Nov. 2025, pp. 1206–1213. doi: 10.1109/ICECA66444.2025.11382695.
[19] H. Abbasimehr, M. Shabani, and M. Yousefi, “An optimized model using LSTM network for demand forecasting,” Comput. Ind. Eng., vol. 143, no. July 2019, p. 106435, May 2020, doi: 10.1016/j.cie.2020.106435.
[20] M. Azadi, S. Yousefi, R. Farzipoor Saen, H. Shabanpour, and F. Jabeen, “Forecasting sustainability of healthcare supply chains using deep learning and network data envelopment analysis,” J. Bus. Res., vol. 154, p. 113357, Jan. 2023, doi: 10.1016/j.jbusres.2022.113357.
[21] R. Snehamrutha, “Patient Engagement Strategies in Community Pharmacies and their Effect on Vaccination Uptake and Medication Synchronizations,” ESP J. Eng. Technol. Adv., vol. 3, no. 3, pp. 163–173, 2023, doi: 10.56472/25832646/JETA-V3I7P120.
[22] M. R. Anand and K. Abhilash, “Transforming Energy-Intensive Smart Factories with AI: TCN-based Forecasting and DQN-Driven Operational Optimization for Healthcare Manufacturing,” in International Conference on Intelligent Computing, Information and Control Systems (ICOIICS-2025), IEEE, 2025, pp. 508–515, November. doi: 10.1109/ICOIICS67115.2025.11390244.
[23] F. Pang, X. Zhou, T. Bai, K. Wen, J. Zhu, and B. Wu, “A multi-level time-series analysis for forecasting and operational planning in pharmaceutical services: A case study,” Jul. 2025. doi: 10.21203/rs. 3. rs-7114272/v1.
[24] H. N. Dinh, T. H. Do, and T. B. Nguyen, “An Efficiency Improvement of the N‐Beats Model for Sale Forecast Problem,” in Creative Approaches Towards Development of Computing and Multidisciplinary IT Solutions for Society, Wiley, 2024, pp. 251–263. doi: 10.1002/9781394272303.ch15.
[25] N. Qassrawi, M. Azzeh, and M. Hijjawi, “Drug sales forecasting in the pharmaceutical market using deep neural network algorithms,” Int. J. Syst. Innov., vol. 8, no. 3, pp. 63–83, 2024, doi: 10.6977/IJoSI.202409_8(3).0006.
[26] S. R. Dutta, S. Das, and P. Chatterjee, “Smart Sales Prediction of Pharmaceutical Products,” in 2022 8th International Conference on Smart Structures and Systems (ICSSS), IEEE, Apr. 2022, pp. 1–6. doi: 10.1109/ICSSS54381.2022.9782271.
[27] K. Konar and H. Pitroda, “Analyzing and Predicting the Impact of COVID-19 on Online Pharmaceuticals Sectors and Pathological Services in India,” in 2022 IEEE 7th International Conference for Convergence in Technology (I2CT), IEEE, Apr. 2022, pp. 1–7. doi: 10.1109/I2CT54291.2022.9824883.
[28] F. Mbonyinshuti, J. Nkurunziza, J. Niyobuhungiro, and E. Kayitare, “The Prediction of Essential Medicines Demand: A Machine Learning Approach Using Consumption Data in Rwanda,” Processes, vol. 10, no. 1, p. 26, Dec. 2021, doi: 10.3390/pr10010026.
[29] R. Pall, Y. Gauthier, S. Auer, and W. Mowaswes, “Predicting drug shortages using pharmacy data and machine learning,” Health Care Manag. Sci., vol. 26, no. 3, pp. 395–411, set. 2023, doi: 10.1007/s10729-022-09627-y.
[30] Y. Xiong, “Development of an AI-Driven Model for Drug Sales Prediction Using Enhanced Golden Eagle Optimization and XGBoost Algorithm,” Informatica, vol. 49, no. 17, Mar. 2025, doi: 10.31449/inf.v49i17.7491.
[31] M. Husban, A. Mir, and I. Yustiana, “Predicting Big Mart Sales with Machine Learning,” in The 7th International Global Conference Series on ICT Integration in Technical Education & Smart Society, Basel Switzerland: MDPI, Sep. 2025, p. 95. doi: 10.3390/engproc2025107095.