AI Driven Deep Neural Network forGesture and Sign Language Recognition-A Survey Approach

  • Shrikanta Jogar Department. of Computer Science &Engineering, Bapuji Institute of Engineering &Technology, Davanagere
  • Dr. Prashantha G. R Dept. of Computer Science &Engineering (Data Science), Bapuji Institute of Engineering &Technology, Davanagere
Keywords: AI Driven deep Neural network, Artificial intelligence, ST-GCN.

Abstract

Sign language recognition plays a key role in bridging communication gaps between hearing-impaired individuals and society. This survey paper offers a comprehensive examination of AI-driven deep neural network (DNN) architectures developed for the purpose of gesture recognition and sign language identification. Human communication relies profoundly on non-verbal channels, and for individuals with hearing or speech impairments, sign language constitutes the central mode of expressing thought, need, and emotion. Bridging the communication divide between the Deaf community and the hearing world through intelligent automated systems represents one of the most socially meaningful directions in contemporary artificial intelligence research. This paper consolidates findings from a wide spectrum of studies spanning classical machine learning, convolutional neural networks, recurrent sequence models, attention-based transformers, graph convolutional networks, and multimodal fusion architectures.

References

[1] A. S. M. Miah, M. A. M. Hasan, S. Nishimura, and J. Shin, "Sign language recognition using graph and general deep neural network based on large scale dataset," IEEE Access, vol. 12, pp. 1–15, 2024. doi: 10.1109/ACCESS.2024.3456765.
[2] Z. Wang, D. Li, R. Jiang, and M. Okumura, "Continuous sign language recognition with multi-scale spatial-temporal feature enhancement," IEEE Access, vol. 13, pp. 5491–5506, 2025. doi: 10.1109/ACCESS.2025.3526330.
[3] A. Khan, S. Jin, G. H. Lee, G. E. Arzu, T. N. Nguyen, and L. M. Dang, "Deep learning approaches for continuous sign language recognition: a comprehensive review," IEEE Access, vol. 13, pp. 1–1, 2025. doi: 10.1109/ACCESS.2025.3535001.
[4] W. Zhao, Y. Zhang, L. Li, X. Chen, and Q. Wang, "MASA: Motion-aware masked autoencoder with semantic alignment for sign language recognition," IEEE Transactions on Circuits and Systems for Video Technology, 2024. doi: 10.1109/TCSVT.2024.3409728.
[5] Z. Liu, J. Wu, Z. Shen, X. Chen, Q. Wu, Z. Gui, L. Senhadji, and H. Shu, "Improving end-to-end sign language translation with adaptive video representation enhanced transformer," IEEE Transactions on Circuits and Systems for Video Technology, vol. 34, pp. 8327–8342, 2024. doi: 10.1109/TCSVT.2024.3398422.
[6] Y. Nakamura and L. Jing, "Skeleton-based data augmentation for sign language recognition using adversarial learning," IEEE Access, vol. 12, pp. 1–12, 2024. doi: 10.1109/ACCESS.2024.3481254.
[7] M. A. Ihsan, A. F. Eram, L. Nahar, and M. A. Kadir, "MediSign: an attention-based CNN-BiLSTM approach of classifying word level signs for patient-doctor interaction in deaf community," IEEE Access, vol. 12, pp. 33803–33815, 2024. doi: 10.1109/ACCESS.2024.3370921.
[8] Y. Zhang, Y. Han, Z. Zhu, X. Jiang, and Y. Zhang, "Artificial intelligence in sign language recognition," Computers and Electrical Engineering, vol. 120, p. 109854, 2024. doi: 10.1016/j.compeleceng.2024.109854.
[9] Y. Zhang and X. Jiang, "Recent advances on deep learning for sign language recognition," Computer Modeling in Engineering and Sciences, vol. 139, no. 3, pp. 2399–2450, 2024. doi: 10.32604/cmes.2023.045731.
[10] V. Leiva et al., "A real-time intelligent system based on machine-learning methods for improving communication in sign language," IEEE Access, vol. 13, pp. 22055–22073, 2025. doi: 10.1109/ACCESS.2025.3532001.
[11] Z. Guan, Y. Hu, H. Jiang, Y. Sun, and B. Yin, "Multi-view isolated sign language recognition based on cross-view and multi-level transformer," Multimedia Systems, vol. 31, no. 3, 2025. doi: 10.1007/s00530-025-01799-1.
[12] Y. Yu et al., "OLMD: Orientation-aware long-term motion decoupling for continuous sign language recognition," Proceedings of the AAAI Conference on Artificial Intelligence, vol. 39, no. 9, pp. 9707–9715, 2025. doi: 10.1609/aaai.v39i9.33052.
[13] J. Debnath and P. J. Rao, "Real-time gesture based sign language recognition system," in Proc. 2024 IEEE International Conference on Advances in Data Engineering and Intelligent Computing Systems (ADICS), pp. 1–6, 2024. doi: 10.1109/ADICS58448.2024.10533401.
[14] A. S. Selvi, G. Ratiraju, A. Jeevan, J. S. K. Reddy, and M. V. Krishna, "Real-time sign language detection using deep learning," in Proc. 2025 IEEE International Conference on Computational, Communication and Information Technology (ICCCIT), pp. 99–103, 2025.
[15] R. Damdoo and P. Kumar, "SignEdgeLVM transformer model for enhanced sign language translation on edge devices," Discover Computing, vol. 28, p. 15, 2025. doi: 10.1007/s10791-025-09509-1.
[16] J. Huang and V. Chouvatut, "Video-based sign language recognition via ResNet and LSTM network," Journal of Imaging, vol. 10, no. 6, p. 149, 2024. doi: 10.3390/jimaging10060149.
Published
2026-04-19
How to Cite
Jogar, S., & G. R, D. P. (2026). AI Driven Deep Neural Network forGesture and Sign Language Recognition-A Survey Approach. Asian Journal For Convergence In Technology (AJCT) ISSN -2350-1146, 12(1), 134-140. Retrieved from https://www.asianssr.org/index.php/ajct/article/view/1531

Most read articles by the same author(s)

Obs.: This plugin requires at least one statistics/report plugin to be enabled. If your statistics plugins provide more than one metric then please also select a main metric on the admin's site settings page and/or on the journal manager's settings pages.