CNN-Based SIBI Sign Language Recognition Alphabet: Exploring the Impact of Hardware on Model Training

Aris Rakhmadi; Anton Yudhana; Sunardi Sunardi

doi:10.37385/jaets.v7i1.7071

Authors

Aris Rakhmadi Universitas Muhammadiyah Surakarta
Anton Yudhana Universitas Ahmad Dahlan
Sunardi Sunardi Universitas Ahmad Dahlan

DOI:

https://doi.org/10.37385/jaets.v7i1.7071

Keywords:

Sign Language Recognition, Alphabets, CNN, Hardware, Impact

Abstract

The recognition of Sign Language Alphabets (SLA) plays a vital role in human-computer interaction, especially for individuals with auditory disabilities. This study aims to evaluate the impact of different hardware configurations—specifically CPU, GPU, and memory setups—on the training efficiency and recognition performance of a Convolutional Neural Network (CNN)-based model for SLA using the SIBI dataset. The novelty of this research lies in its focus on hardware-aware deep learning optimization for Indonesian sign language (SIBI), an underexplored area. The model was trained on 3,468 labeled hand gesture images representing 24 SIBI alphabet signs. Experiments were conducted on CPU (Intel Xeon 2.00 GHz) and GPU (Nvidia Tesla T4) platforms using a consistent CNN architecture. The training time was significantly reduced by 45.5%, from 1 hour 39 minutes to just 54 minutes, while the accuracy remained consistent at 96.7%, showing no significant change between the two setups. These results demonstrate the significance of parallel processing and memory bandwidth in enhancing model convergence and generalization. The findings are relevant for real-time SLA deployment with hardware constraints on embedded or mobile platforms. Overall, the study underscores the importance of hardware optimization in accelerating CNN training and improving performance in sign language recognition systems.

Downloads

Download data is not yet available.

References

Abdalla, A., Alsereidi, A., Alyammahi, N., Qehaizel, F. B., Ignatious, H. A., & El-Sayed, H. (2023). An innovative Arabic text sign language translator. Procedia Computer Science, 224, 425–430. https://doi.org/10.1016/j.procs.2023.09.059

Abdallah, M. S., Hemayed, E., Abdalla, M. S., & Hemayed, E. E. (2013). Dynamic hand gesture recognition of Arabic sign language using hand motion trajectory features. https://www.researchgate.net/publication/258172682

Abdallah, M. S., Samaan, G. H., Wadie, A. R., Makhmudov, F., & Cho, Y. I. (2023). Light-weight deep learning techniques with advanced processing for real-time hand gesture recognition. Sensors, 23(1). https://doi.org/10.3390/s23010002

Abdullah, B. A. Al, Amoudi, G. A., & Alghamdi, H. S. (2024). Advancements in sign language recognition: A comprehensive review and future prospects. IEEE Access, 12, 128871–128895. https://doi.org/10.1109/ACCESS.2024.3457692

Al Moustafa, A. M. J., Shafry, M., Rahim, M., Khattab, M. M., Zeki, A. M., Matter, S. S., Soliman, A. M., & Ahmed, A. M. (2024). Arabic sign language recognition systems: A systematic review. Indian Journal of Computer Science and Engineering, 15. https://doi.org/10.21817/indjcse/2024/v15i1/241501008

Alsharif, B., Altaher, A. S., Altaher, A., Ilyas, M., & Alalwany, E. (2023a). Deep learning technology to recognize American sign language alphabet. Sensors, 23(18). https://doi.org/10.3390/s23187970

Alsharif, B., Altaher, A. S., Altaher, A., Ilyas, M., & Alalwany, E. (2023b). Deep learning technology to recognize American sign language alphabet. Sensors, 23(18). https://doi.org/10.3390/s23187970

Arya, N., Soni, T., Pattanaik, M., & Sharma, G. K. (2022). Energy-efficient logarithmic-based approximate divider for ASIC and FPGA-based implementations. Microprocessors and Microsystems, 90, 104498. https://doi.org/10.1016/j.micpro.2022.104498

Ben Atitallah, B., Hu, Z., Bouchaala, D., Hussain, M. A., Ismail, A., Derbel, N., & Kanoun, O. (2022). Hand sign recognition system based on EIT imaging and robust CNN classification. IEEE Sensors Journal, 22(2), 1729–1737. https://doi.org/10.1109/JSEN.2021.3130982

Bryzgalov, P., & Maeda, T. (2024). Using benchmarking and regression models for predicting CNN training time on a GPU. Proceedings of the 4th Workshop on Performance Engineering, 8–15. https://doi.org/10.1145/3660317.3660323

Choi, J., Lee, H. J., Sohn, K., Yu, H. S., & Rhee, C. E. (2024). Accelerating CNN training with concurrent execution of GPU and processing-in-memory. IEEE Access. https://doi.org/10.1109/ACCESS.2024.3488004

Darmawan, I. D. M. B. A., Linawati, L., Sukadarmika, G., Wirastuti, N. M. A. E. D., Pulungan, R., Mulyanto, & Hariyanti, N. K. D. (2023). Advancing total communication in SIBI: A proposed conceptual framework for sign language translation. ICSGTEIS 2023, 23–28. https://doi.org/10.1109/ICSGTEIS60500.2023.10424020

Duarte, A., Palaskar, S., Ventura, L., Ghadiyaram, D., Dehaan, K., Metze, F., Torres, J., & Giro-I-Nieto, X. (2021). How2Sign: A large-scale multimodal dataset for continuous American sign language. http://how2sign.github.io/

Fadlilah, U., Mahamad, A. K., & Handaga, B. (2021a). The development of Android for Indonesian sign language using Tensorflow Lite and CNN: An initial study. Journal of Physics: Conference Series, 1858(1). https://doi.org/10.1088/1742-6596/1858/1/012085

Fadlilah, U., Mahamad, A. K., & Handaga, B. (2021b). The development of Android for Indonesian sign language using Tensorflow Lite and CNN: An initial study. Journal of Physics: Conference Series, 1858(1). https://doi.org/10.1088/1742-6596/1858/1/012085

Fadlilah, U., Prasetyo, R. A. R., Mahamad, A. K., Handaga, B., Saon, S., & Sudarmilah, E. (2022). Modelling of basic Indonesian sign language translator based on Raspberry Pi technology. Scientific and Technical Journal of Information Technologies, Mechanics and Optics, 22(3), 574–584. https://doi.org/10.17586/2226-1494-2022-22-3-574-584

Fadlilah, U., Wismoyohadi, D., Mahamad, A. K., & Handaga, B. (2019). BISINDO information system as potential daily sign language learning. AIP Conference Proceedings, 2114. https://doi.org/10.1063/1.5112492

Ferraz, O., Subramaniyan, S., Chinthala, R., Andrade, J., Cavallaro, J. R., Nandy, S. K., Silva, V., Zhang, X., Purnaprajna, M., & Falcao, G. (2022). A survey on high-throughput non-binary LDPC decoders. IEEE Communications Surveys & Tutorials, 24(1), 524–556. https://doi.org/10.1109/COMST.2021.3126127

Handayani, A. N., Akbar, M. I., Ar-Rosyid, H., Ilham, M., Asmara, R. A., & Fukuda, O. (2022). Design of SIBI sign language recognition using artificial neural network backpropagation. ICICyTA 2022, 192–197. https://doi.org/10.1109/ICICyTA57421.2022.10038205

Hashi, A. O., Hashim, S. Z. M., & Asamah, A. B. (2024). A systematic review of hand gesture recognition: An update from 2018 to 2024. IEEE Access. https://doi.org/10.1109/ACCESS.2024.3421992

Hekmat, A., Ali, M., Abbas, H. H., & Shahadi, I. (2022). Sign language recognition and hand gestures review. https://kjes.uokerbala.edu.iq/

Hu, Y., Liu, Y., & Liu, Z. (2022). A survey on convolutional neural network accelerators: GPU, FPGA and ASIC. ICCRD 2022, 100–107. https://doi.org/10.1109/ICCRD54409.2022.9730377

Huang, H., Li, Y., & Zhou, X. (2023). Accelerating point clouds classification in dynamic graph CNN with GPU Tensor Core. ICPADS, 1725–1732. https://doi.org/10.1109/ICPADS60453.2023.00240

Hussin, S. K., Mohamed, O., Mustafa, Ahmed, E., & Mahmoud, O. (2021). Real-time Arabic sign language translator using MediaPipe and LSTM. PLOMS Journal of Artificial Intelligence, 1–13. https://plomscience.com/journals/index.php/PLOMSAI/index

Irvanizam, I., Horatius, I., & Sofyan, H. (2023). Applying artificial neural network based on backpropagation method for Indonesian sign language recognition. International Journal of Computing and Digital Systems, 14(1), 975–985. https://doi.org/10.12785/ijcds/140176

Kwenda, C., Gwetu, M., & Fonou-Dombeu, J. V. (2023). Ontology with deep learning for forest image classification. Applied Sciences, 13(8). https://doi.org/10.3390/app13085060

Latif, G., Mohammad, N., Alghazo, J., & AlKhalaf, R. (2019). ArASL: Arabic alphabets sign language dataset. Data in Brief, 23. https://doi.org/10.1016/j.dib.2019.103777

Li, H., Wang, Z., Yue, X., Wang, W., Tomiyama, H., & Meng, L. (2023). An architecture-level analysis on deep learning models for low-impact computations. Artificial Intelligence Review, 56(3), 1971–2010. https://doi.org/10.1007/s10462-022-10221-5

Li, L., Li, Y., & Tan, H. (2024). DeepDecompose: A distributed inference framework for CNN on GPU-equipped edge clusters. ICCNIT 2024, 540–544. https://doi.org/10.1145/3670105.3670200

Lu, C., Kozakai, M., & Jing, L. (2023). Sign language recognition with multimodal sensors and deep learning methods. Electronics, 12(23). https://doi.org/10.3390/electronics12234827

Luong, S. (2023). Video sign language recognition using pose extraction and deep learning models [Thesis]. https://doi.org/10.31979/etd.jm4c-myd4

Mohammadi, M., Chandarana, P., Seekings, J., Hendrix, S., & Zand, R. (2022). Static hand gesture recognition for American Sign Language using neuromorphic hardware. Neuromorphic Computing and Engineering, 2(4). https://doi.org/10.1088/2634-4386/ac94f3

Mohammadi, M., Smith, H., Khan, L., & Zand, R. (2023). Facial expression recognition at the edge: CPU vs GPU vs VPU vs TPU. GLSVLSI 2023, 243–248. https://doi.org/10.1145/3583781.3590245

Muis, A., Sunardi, S., & Yudhana, A. (2024). CNN-based approach for enhancing brain tumor image classification accuracy. International Journal of Engineering, 37(5), 984–996. https://doi.org/10.5829/ije.2024.37.05b.15

Nadir, M. N., Siraj Rathore, M., Hayat, A., & Mansoor, J. A. (2024). CPU vs GPU: Performance comparison of OpenCL applications on a heterogeneous architecture. Journal of Computing & Biomedical Informatics, 7(2). https://doi.org/10.56979/702/2024

Nguyen, P. T., Nguyen, T. H., Hoang, N. X. N., Phan, H. T. B., Vu, H. S. H., & Huynh, H. N. (2023). Exploring MediaPipe optimization strategies for real-time sign language recognition. CTU Journal of Innovation and Sustainable Development, 15(ISDS), 142–152. https://doi.org/10.22144/ctujoisd.2023.045

Olisah, C. C., Smith, L., & Smith, M. (2022). Diabetes mellitus prediction and diagnosis from a data preprocessing and machine learning perspective. Computer Methods and Programs in Biomedicine, 220, 106773. https://doi.org/10.1016/j.cmpb.2022.106773

Pacini, F., Pacini, T., Lai, G., Zocco, A. M., & Fanucci, L. (2024). Design and evaluation of CPU-, GPU-, and FPGA-based deployment of a CNN for motor imagery classification. Electronics, 13(9). https://doi.org/10.3390/electronics13091646

Ping, Y., Jiang, H., Liu, X., Zhao, Z., Zhou, Z., & Chen, X. (2024). Latency-based inter-operator scheduling for CNN inference acceleration on GPU. IEEE Transactions on Services Computing, 17(1), 277–290. https://doi.org/10.1109/TSC.2023.3345952

Pitonak, R., Mucha, J., Dobis, L., Javorka, M., & Marusin, M. (2022). CloudSatNet-1: FPGA-based hardware-accelerated quantized CNN for satellite on-board cloud coverage classification. Remote Sensing, 14(13). https://doi.org/10.3390/rs14133180

Rajput, S. H., Mukherji, P., Avachat, S., Chitnis, A., Deodhar, N., & Godse, P. (2024). Comparative analysis of efficient implementation of CNN on CPU and GPU for image processing. International Journal of Microsystems and IoT, 2(2), 538–547. https://doi.org/10.5281/zenodo.10792152

Rakhmadi, A., & Ariyanto, R. (2021). Measurement motoric system of cerebral palsy disability using gross motor function measure (GMFM). Khazanah Informatika, 7(1), 32–37.

Rakhmadi, A., Yudhana, A., & Sunardi, S. (2024). Virtual reality and augmented reality in sign language recognition: A review of current approaches. International Journal of Informatics and Computation, 6(2). https://doi.org/10.35842/ijicom

Rasch, M. J., Mackin, C., Le Gallo, M., Chen, A., Fasoli, A., Odermatt, F., Li, N., Nandakumar, S. R., Narayanan, P., Tsai, H., Burr, G. W., Sebastian, A., & Narayanan, V. (2023). Hardware-aware training for large-scale and diverse deep learning inference workloads using in-memory computing-based accelerators. Nature Communications, 14(1). https://doi.org/10.1038/s41467-023-40770-4

Rawal, V., Prajapati, P., & Darji, A. (2023). Hardware implementation of 1D-CNN architecture for ECG arrhythmia classification. Biomedical Signal Processing and Control, 85, 104865. https://doi.org/10.1016/j.bspc.2023.104865

Rico, M.-U.-K. (2023). Performance analysis of CNN model for image classification with Intel OpenVINO on CPU and GPU [Thesis]. University of Windsor.

Safitri, M., Yuniarno, E. M., & Rachmadi, R. F. (2024). Indonesian sign language (SIBI) recognition and extraction using CNN-symmetric deletion spelling correction. ISITIA 2024, 220–225. https://doi.org/10.1109/ISITIA63062.2024.10667714

Saha, A., Rahman, M., & Wu, F. (2025). Benchmarking CPU vs GPU on LSTM model using groundwater dataset. SoutheastCon 2025, 1318–1319. https://doi.org/10.1109/SoutheastCon56624.2025.10971639

Salcedo-Navarro, A., Gutierrez-Aguado, J., & Garcia-Pineda, M. (2025). UHD video encoding in CPU versus GPU: Quality and performance trade-offs. IEEE Access, 13, 55115–55129. https://doi.org/10.1109/ACCESS.2025.3553634

Schulder, M., Bigeard, S., Hanke, T., & Kopf, M. (2023). The sign language interchange format: Harmonising sign language datasets for computational processing. ICASSPW 2023, 1–5. https://doi.org/10.1109/ICASSPW59220.2023.10193022

Siek, M. (2023). Benchmarking CPU vs GPU performance in building predictive LSTM deep learning models. AIP Conference Proceedings, 2510(1), 030017. https://doi.org/10.1063/5.0128638

Staka, Z., Misic, M., & Tomasevic, M. (2025). CPU vs GPU: Performance evaluation of classical machine and deep learning algorithms. INFOTEH 2025, 1–6. https://doi.org/10.1109/INFOTEH64129.2025.10959248

Subandi, R., H., & Yudhana, A. (2024). Pneumonia medical image classification using AlexNet and GoogleNet. International Journal of Computing and Digital Systems, 16(1), 1675–1684. https://doi.org/10.12785/ijcds/1601124

Subburaj, S., & Murugavalli, S. (2022). Survey on sign language recognition in context of vision-based and deep learning. Measurement: Sensors, 23. https://doi.org/10.1016/j.measen.2022.100385

Subramanian, B., Olimov, B., Naik, S. M., Kim, S., Park, K. H., & Kim, J. (2022). An integrated MediaPipe-optimized GRU model for Indian sign language recognition. Scientific Reports, 12(1). https://doi.org/10.1038/s41598-022-15998-7

Suharjito, Thiracitta, N., & Gunawan, H. (2021). SIBI sign language recognition using CNN combined with transfer learning. Procedia Computer Science, 179, 72–80. https://doi.org/10.1016/j.procs.2020.12.011

Sultan, A., Makram, W., Kayed, M., & Ali, A. A. (2022). Sign language identification and recognition: A comparative study. Open Computer Science, 12(1), 191–210. https://doi.org/10.1515/comp-2022-0240

Svendsen, B., & Kadry, S. (2023). Comparative analysis of image classification models for Norwegian sign language recognition. Technologies, 11(4). https://doi.org/10.3390/technologies11040099

Tang, X., Chang, X., Chen, N., Ni, Y. (MaoMao), LC, R. A. Y., & Tong, X. (2023). Community-driven information accessibility: Online sign language content creation within d/Deaf communities. CHI 2023. https://doi.org/10.1145/3544548.3581286

Tarek, H., Aly, H., Eisa, S., & Abul-Soud, M. (2022). Optimized deep learning algorithms for tomato leaf disease detection with hardware deployment. Electronics, 11(1). https://doi.org/10.3390/electronics11010140

Tasnim, T., Rahman, M., & Wu, F. (2024). A comparative analysis of CPU and GPU-based cloud platforms for CNN binary classification. IARIA Congress 2024. https://www.thinkmind.org

Waluyo, W. N., Isnanto, R. R., & Rochim, A. F. (2023). Comparison of Mycobacterium tuberculosis image detection accuracy using CNN and CNN-KNN. Jurnal RESTI, 7(1), 80–87. https://doi.org/10.29207/resti.v7i1.4626

Wijaya, F., Dahendra, L., Purwanto, E. S., & Ario, M. K. (2024). Quantitative analysis of sign language translation using artificial neural network model. Procedia Computer Science, 245, 998–1009. https://doi.org/10.1016/j.procs.2024.10.328

Wu, X., Feng, Y., Xu, H., Lin, Z., Chen, T., Li, S., Qiu, S., Liu, Q., Ma, Y., & Zhang, S. (2023). CTransCNN: Combining transformer and CNN in multilabel medical image classification. Knowledge-Based Systems, 281. https://doi.org/10.1016/j.knosys.2023.111030

Yang, D., Li, J., Hao, G., Chen, Q., Wei, X., Dai, Z., Hou, Z., Zhang, L., & Li, X. (2024). Hardware accelerator for high accuracy sign language recognition with residual network based on FPGAs. IEICE Electronics Express, 21(4). https://doi.org/10.1587/elex.21.20230579

Zaitsev, D. A., Ajima, Y., Bartlett, J. F. C., & Kumar, A. (2025). 3D multicore CPU vs GPU on sparse patterns of Sleptsov net virtual machine. International Journal of Parallel, Emergent and Distributed Systems. https://doi.org/10.1080/17445760.2025.2490148

Zhang, Y., Pandey, D., Wu, D., Kundu, T., Li, R., & Shu, T. (2023). Accuracy-constrained efficiency optimization and GPU profiling of CNN inference for detecting drainage crossing locations. SC’23 Workshops, 1780–1788. https://doi.org/10.1145/3624062.3624260

Zhang, Z., Gao, J., Dhaliwal, R. S., & Li, T. J.-J. (2023). VISAR: A human-AI argumentative writing assistant. UIST 2023. https://doi.org/10.1145/3586183.3606800

CNN-Based SIBI Sign Language Recognition Alphabet: Exploring the Impact of Hardware on Model Training

Authors

DOI:

Keywords:

Abstract

Downloads

References

Downloads

Published

How to Cite

Issue

Section

Current Issue

Information

Developed By