Character-level Arabic text generation from sign language video using encoder–decoder model
References
Li, 2019, Visual to text: Survey of image and video captioning, IEEE Trans. Emerg. Top. Comput. Intell., 3, 297, 10.1109/TETCI.2019.2892755
Alsmadi, 2020, Content-based image retrieval using color, shape and texture descriptors and features, Arab. J. Sci. Eng., 45, 3317, 10.1007/s13369-020-04384-y
Bodini, 2019, A review of facial landmark extraction in 2D images and videos using deep learning, Big Data Cogn. Comput., 3, 14, 10.3390/bdcc3010014
Simonyan, 2013
Boukdir, 2022, Isolated video-based Arabic sign language recognition using convolutional and recursive neural networks, Arab. J. Sci. Eng., 47, 2187, 10.1007/s13369-021-06167-5
Mishra, 2021, A Hindi image caption generation framework using deep learning, Trans. Asian Low-Resour. Lang. Inf. Process., 20, 1, 10.1145/3432246
Daskalakis, 2018, Learning deep spatiotemporal features for video captioning, Pattern Recognit. Lett., 116, 143, 10.1016/j.patrec.2018.09.022
Yang, 2018, Video captioning by adversarial LSTM, IEEE Trans. Image Process., 27, 5600, 10.1109/TIP.2018.2855422
Xu, 2018, Dual-stream recurrent neural network for video captioning, IEEE Trans. Circuits Syst. Video Technol., 29, 2482, 10.1109/TCSVT.2018.2867286
Jin, 2019, Recurrent convolutional video captioning with global and local attention, Neurocomputing, 370, 118, 10.1016/j.neucom.2019.08.042
D. Guo, S. Tang, M. Wang, Connectionist Temporal Modeling of Video and Language: a Joint Model for Translation and Sign Labeling, in: IJCAI, 2019, pp. 751–757.
Guo, 2019, Hierarchical recurrent deep fusion using adaptive clip summarization for sign language translation, IEEE Trans. Image Process., 29, 1575, 10.1109/TIP.2019.2941267
Wang, 2020, Sequence in sequence for video captioning, Pattern Recognit. Lett., 130, 327, 10.1016/j.patrec.2018.07.024
Vinodhini, 2020, A deep structured model for video captioning, Int. J. Gaming Comput.-Mediat. Simul. (IJGCMS), 12, 44, 10.4018/IJGCMS.2020040103
Nabati, 2020, Video captioning using boosted and parallel long short-term memory networks, Comput. Vis. Image Underst., 190, 10.1016/j.cviu.2019.102840
Nabati, 2020, Multi-sentence video captioning using content-oriented beam searching and multi-stage refining algorithm, Inf. Process. Manage., 57, 10.1016/j.ipm.2020.102302
K. Papineni, S. Roukos, T. Ward, W.-J. Zhu, Bleu: a method for automatic evaluation of machine translation, in: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, 2002, pp. 311–318.