Optimizing Pre-Trained Models for Medical Dataset Classification with a Fine-Tuning Approach
DOI:
https://doi.org/10.62760/iteecs.4.1.2025.126Keywords:
Medical Dataset Classification, Real-Time Clinical Implementation, Convolutional Neural Network, Stacked Autoencoders, Logistic Regression classification modelAbstract
Medical organizations struggle to deal with huge high-dimensional datasets that need powerful machine learning systems to produce precise healthcare outcomes. Traditional analytical techniques prove inadequate when dealing with extraction from features and performance of classifiers in this specific setting. The research introduces an algorithm which enhances Stacked Autoencoders (SAEs) by combining them with a customized Logistic Regression model intended for medical high-dimensional data analysis. This approach implements a Hybrid Imputation Method using MICE and KNN Imputation which precedes other stages and helps process missing values and outliers in medical data. We use CNNs and SAEs together for deep feature extraction before using Feature Fusion to assemble a robust feature collection. A set of the most important features is identified by executing Advanced Ensemble Feature Selection (EFS) procedures which include Few-shot Learning and Model-Agnostic Meta-Learning Algorithm (MAML) and Genetic Algorithm-Based Feature Selection (GAFS). The procedure of fine-tuning pre-trained models represents an effective enhancement for classification tasks particularly in situations of limited dataset availability. The experimental outcomes demonstrate remarkable performance gains in terms of accuracy and sensitivity and specificity as well as reduced execution time as compared to current techniques. Upcoming work for this study involves speeding up algorithm processing abilities and scalability alongside the integration of robust deep learning structures with self-supervised learning methodologies together with upgrade transfer learning approaches for medical dataset variety applications. The study will concentrate on enhancing model transparency through explainable AI and real-time validation for clinical deployment and ethical and regulatory compliance to develop this technique for practical healthcare settings.
References
A. Gupta, and S. Gupta “Enhanced Classification of Imbalanced Medical Datasets using Hybrid Data-Level, Cost-Sensitive and Ensemble Methods”, International Research Journal of Multidisciplinary Technovation, Vol.6, No.3 pp. 58-76, 2024.
https://doi.org/10.54392/irjmt2435
S. Moon, T. S. Kim, J. Ryu and W. H. Lee, “Federated Learning for Sleep Stage Classification on Edge Devices via a Model-Agnostic Meta-Learning-Based Pre-Trained Model”, 2023 IEEE 13th International Conference on Consumer Electronics - Berlin (ICCE-Berlin), Berlin, Germany, pp. 188-192, 2023.
https://doi.org/10.1109/ICCE-Berlin58801.2023.10375664
M. H. Javaid, I. A. Shah, M. S. Javaid, U. Bin Irshad and Z. Halim “Model Agnostic Meta Learning for EEG Classification: Multitask Approach”, 2023 IEEE IAS Global Conference on Emerging Technologies (GlobConET), London, United Kingdom, pp. 1-4, 2023.
https://doi.org/10.1109/GlobConET56651.2023.10150186
Z. Hu, L. Shen, Z. Wang, T. Liu, C. Yuan and D. Tao “Architecture, Dataset and Model-Scale Agnostic Data-free Meta-Learning”, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada, pp. 7736-7745, 2023.
https://doi.org/10.1109/CVPR52729.2023.00747
G. H. De Rosa, M. Roder, J. P. Papa and C. F. G. Dos Santos “Improving Pre- Trained Weights through Meta - Heuristics Fine- Tuning”, 2021 IEEE Symposium Series on Computational Intelligence (SSCI), Orlando, FL, USA, pp. 1-8, 2021.
https://doi.org/10.1109/SSCI50451.2021.9659945
T. Christopher, N. Kumar “Optimization Based Feature Selection Algorithm with Twin-Bounded Support Vector Machine for Medical Dataset Classification”, Journal of Survey in Fisheries Sciences, Vol. 10, No. 4S, pp.1079-96, 2023. [CrossRef]
T. Christopher, N. Kumar. Medical Dataset Classification Using Ensemble Feature Selection and Back Propagation Neural Network Algorithm”, International Journal of Intelligent Systems and Applications in Engineering, Vol. 12, No. 22S, pp. 1403-1420, 2024. [CrossRef]
P. Kumar et al. "Feature subset selection using filter, heuristic and meta-heuristic approaches using binary encoded diabetes dataset." AIP Conference Proceedings. Vol. 2555. No. 1, 2022.
https://doi.org/10.1063/5.0108858
K. B. Nahato, KH. Nehemiah, and A. Kannan “Hybrid approach using fuzzy sets and extreme learning machine for classifying clinical datasets”, Informatics in Medicine Unlocked, Vol. 2 pp.1-11, 2016.
https://doi.org/10.1016/j.imu.2016.01.001
N. Spolaôr, H. D. Lee, A. I. Mendes, C. V. Nogueira, A. R. S. Parmezan, W. S. R. Takaki, C. S. R. Coy, F. C. Wu, and R. F. Pinto “Fine-tuning pre-trained neural networks for medical image classification in small clinical datasets”, Multimedia Tools and Applications, Vol. 83, No. 9, pp. 27305-27329, 2024.
https://doi.org/10.1007/s11042-023-16529-w
T. Chauhan, H. Palivela, S. Tiwari “Optimization and fine-tuning of DenseNet model for classification of COVID-19 cases in medical imaging”, International Journal of Information Management and Data Insights, Vol. 1, No. 2, art. no. 100020, 2021.
https://doi.org/10.1016/j.jjimei.2021.100020
S. Mohammadian, A. Karsaz and Y. M. Roshan, “Comparative Study of Fine-Tuning of Pre-Trained Convolutional Neural Networks for Diabetic Retinopathy Screening”, 2017 24th National and 2nd International Iranian Conference on Biomedical Engineering (ICBME), Tehran, pp. 1-6, 2017.
https://doi.org/10.1109/ICBME.2017.8430269
SM. Roshan, A. Karsaz, AH. Vejdani, YM. Roshan “Fine-tuning of pre-trained convolutional neural networks for diabetic retinopathy screening: a clinical study”, International Journal of Computational Science and Engineering, Vol. 21, No. 4, 564-573, 2020.
https://doi.org/10.1504/IJCSE.2020.106869
JP. Villa-Pulgarin, AA. Ruales-Torres, D. Arias-Garzon, MA. Bravo-Ortiz, HB. Arteaga-Arteaga, A. Mora-Rubio, et al “Optimized convolutional neural network models for skin lesion classification”, Computers Materials and Continua, Vol. 70, No. 2, pp. 2131-2148, 2022.
https://doi.org/10.32604/cmc.2022.019529
X. Liu, C. Wang, J. Bai, G. Liao “Fine-tuning pre-trained convolutional neural networks for gastric precancerous disease classification on magnification narrow band imaging images”, Neurocomputing, Vol. 392, pp. 253-267, 2020.
https://doi.org/10.1016/j.neucom.2018.10.100
M. Bal-Ghaoui, A. HEY, A. Jilbab, A. Bourouhou “Optimizing ultrasound image classification through transfer learning: fine-tuning strategies and classifier impact on pre-trained inner-layers”, Informatyka Automatyka Pomiary Gospod Ochronie Srodowiska, Vol. 13, No. 4, 2023.
https://doi.org/10.35784/iapgos.4464
L. Alkema, A. E. Raftery, P. Gerland, S. J. Clark F. Pelletier “Estimating trends in the total fertility rate with uncertainty using imperfect data: examples from West Africa”, Demographic research, Vol. 26, pp. 331-362, 2012.
https://doi.org/10.4054/DemRes.2012.26.15
M. Ala’raj, M. Majdalawieh, M. F. Abbod “Improving binary classification using filtering based on k-NN proximity graphs”, Journal of Big Data, Vol. 7, art. no. 15, 2020.
https://doi.org/10.1186/s40537-020-00297-7
D. A. Adeniyi, Z. Wei, Y. Yongquan “Automated web usage data mining and recommendation system using K-nearest neighbor (KNN) classification method”, Applied Computing and Informatics, Vol. 12, No. 1, pp. 90-108, 2016.
https://doi.org/10.1016/j.aci.2014.10.001
S. A. Abdullah “Hybrid model based on ReliefF algorithm and K-nearest neighbor for erythemato-squamous diseases forecasting”, Arabian Journal for Science and Engineering, Vol. 47, No. 2, pp. 1299-307, 2022.
https://doi.org/10.1007/s13369-021-05921-z
M. Wu, J. Zhou, Y. Peng, S. Wang, and Y. Zhang “Deep learning for image classification: a review”, International Conference on Medical Imaging and Computer-Aided Diagnosis, pp. 352-362, 2024.
https://doi.org/10.1007/978-981-97-1335-6_31
L. Wang, R. Tao, H. Hu, Y. R. Zeng “Effective wind power prediction using novel deep learning network: stacked independently recurrent autoencoder”, Renewable Energy, Vol. 164, pp. 642-655, 2021.
https://doi.org/10.1016/j.renene.2020.09.108
A. Sumathi, S. Meganathan, B. V. Ravisankar “An intelligent gestational diabetes diagnosis model using deep stacked autoencoder”, Computers Materials and Continia, Vol. 69, No. 3, pp. 3109-3126, 2021.
https://doi.org/10.32604/cmc.2021.017612
D. O'Neill, A. Lensen, B. Xue and M. Zhang, “Particle Swarm Optimisation for Feature Selection and Weighting in High-Dimensional Clustering”, 2018 IEEE Congress on Evolutionary Computation (CEC), Rio de Janeiro, Brazil, pp. 1-8, 2018.
https://doi.org/10.1109/CEC.2018.8477974
Y. V. S. Murthy, S. G. Koolagudi, T. K. J. Raja “Singer identification for Indian singers using convolutional neural networks”, International Journal of Speech Technology, Vol. 24, No. 3, pp. 781-796, 2021.
https://doi.org/10.1007/s10772-021-09849-5
C. Finn, P. Abbeel, S. Levine “Model-agnostic meta-learning for fast adaptation of deep networks”, Proceedings of the 34th International Conference on Machine Learning, Vol. 70, pp. 1126-1135, 2017. [CrossRef]
A. Santoro, S. Bartunov, M. Botvinick, D. Wierstra and T. Lillicrap “Meta-learning with memory-augmented neural networks”, Proceedings of the 33rd International Conference on Machine Learning, Vol. 48, pp. 1842-1850, 2016.
[CrossRef]
H. J. Ye, L. Ming, D. C. Zhan and W. L. Chao, “Few-Shot Learning with a Strong Teacher”, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 46, No. 3, pp. 1425-1440, 2024.
https://doi.org/10.1109/TPAMI.2022.3160362
X. Liu, C. Wang, J. Bai, G. Liao “Fine-tuning pre-trained convolutional neural networks for gastric precancerous disease classification on magnification narrow-band imaging images”, Neurocomputing, Vol. 392, pp. 253-267, 2020.
https://doi.org/10.1016/j.neucom.2018.10.100
B. Gunel, J. Du, A. Conneau, A. Stoyanov “Supervised contrastive learning for pre-trained language model fine-tuning”, arXiv preprint arXiv: 2011.01403, 2021.
https://doi.org/10.48550/arXiv.2011.01403
J. Pan “Feature-based transfer learning with real-world applications”, Hong Kong University of Science and Technology, 2010.
https://doi.org/10.14711/thesis-b1118218
Y. Wang, S. Nazir, M. Shafiq “An overview on analyzing deep learning and transfer learning approaches for health monitoring”, Computational and Mathematical Methods in Medicine, Vol. 2021, No. 1, art. no. 5552743, 2021.
https://doi.org/10.1155/2021/5552743
B. Tan, Y. Zhang, S. Pan, Q. Yang “Distant domain transfer learning”, Proceedings of the AAAI Conference on Artificial Intelligence. Vol. 31, No. 1, 2017.
https://doi.org/10.1609/aaai.v31i1.10826
M. H. Van den Berg, A. Overbeek, H. J. van der Pal, A. B. Versluys, D. Bresters, F. E. van Leeuwen, C. B. Lambalk, G. J. L Kaspers, and E V D D Broeder “Using web-based and paper-based questionnaires for collecting data on fertility issues among female childhood cancer survivors: differences in response characteristics”, Journal of medical Internet research, Vol. 13, No. 3, art. no. e1707, 2011.
Additional Files
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 N. Kumar, T. Christopher

This work is licensed under a Creative Commons Attribution 4.0 International License.
This Journal and its metadata are licenced under a