Identification of Multilingual Offense and Troll from Social Media Memes Using Weighted Ensemble of Multimodal Features

Eftekhar Hossain1, Omar Sharif2, Mohammed Moshiul Hoque2, M. Ali Akber Dewan3, Nazmul Siddique4, Md. Azad Hossain1
1Department of Electronics and Telecommunication Engineering, Chittagong University of Engineering & Technology, Chittagong 4349, Bangladesh
2Department of Computer Science and Engineering, Chittagong University of Engineering & Technology, Chittagong 4349, Bangladesh
3School of Computing and Information Systems, Faculty of Science and Technology, Athabasca University, Athabasca, AB T9S 3A3, Canada
4School of Computing, Engineering and Intelligent Systems, Ulster University, Londonderry BT47 7JL, UK

References

Akiwowo, S., Vidgen, B., Prabhakaran, V., Waseem, Z. (Eds.), 2020. In: Proceedings of the Fourth Workshop on Online Abuse and Harms, Association for Computational Linguistics, Online. URL: https://www.aclweb.org/anthology/2020.alw-1.0

Bannink, 2014, Cyber and traditional bullying victimization as a risk factor for mental health problems and suicidal ideation in adolescents, PLOS ONE, 9, 1, 10.1371/journal.pone.0094026

Basile, 2019, SemEval-2019 task 5: multilingual detection of hate speech against immigrants and women in Twitter, 54

Bharathi, Agnusimmaculate, S., 2021. SSNCSE_NLP@DravidianLangTech-EACL2021: Meme classification for Tamil using machine learning approach. In: Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages, Association for Computational Linguistics, Kyiv. pp. 336–339. URL: https://aclanthology.org/2021.dravidianlangtech-1.49.

Bosco, C., Felice, D., Poletto, F., Sanguinetti, M., Maurizio, T., 2018. Overview of the EVALITA 2018 hate speech detection task. In: EVALITA 2018-Sixth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian, vol. 2263, CEUR, pp. 1–9.

Chen, 2020, UNITER: universal image-text representation learning, 104

Devlin, J., Chang, M.-W., Lee, K., Toutanova, K., 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Association for Computational Linguistics, Minneapolis, Minnesota. pp. 4171–4186. doi:10.18653/v1/N19-1423. URL: https://www.aclweb.org/anthology/N19-1423.

Drakett, 2018, Old jokes, new media–online sexism and constructions of gender in internet memes, Fem. Psychol., 28, 109, 10.1177/0959353517727560

Duggan, M., 2017. Men, women experience and view online harassment differently. Pew Research Center. Published July 14.

Fortuna, 2018, A survey on automatic detection of hate speech in text, ACM Comput. Surv., 51, 10.1145/3232676

Frenda, 2022, The unbearable hurtfulness of sarcasm, Expert Syst. Appl., 193, 10.1016/j.eswa.2021.116398

Gambäck, B., Sikdar, U.K., 2017. Using convolutional neural networks to classify hate-speech. In: Proceedings of the First Workshop on Abusive Language Online, Association for Computational Linguistics, Vancouver, BC, Canada, pp. 85–90. doi:10.18653/v1/W17-3013. URL: https://www.aclweb.org/anthology/W17-3013

Huang, B., Bai, Y., 2021. HUB@DravidianLangTech-EACL2021: Meme classification for Tamil text-image fusion. In: Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages, Association for Computational Linguistics, Kyiv. pp. 210–215. URL: https://aclanthology.org/2021.dravidianlangtech-1.28.

Jørgensen, 2020, Private governance of freedom of expression on social media platforms: EU content regulation through the lens of human rights standards, Nord. Rev., 41, 51, 10.2478/nor-2020-0003

Kumar, R., Ojha, A.K., Zampieri, M., Malmasi, S. (Eds.), 2018. In: Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018), Association for Computational Linguistics, Santa Fe, New Mexico, USA. URL: https://www.aclweb.org/anthology/W18-4400

Kumar, R., Ojha, A.K., Lahiri, B., Zampieri, M., Malmasi, S., Murdock, V., Kadar, D. (Eds.), 2020. In: Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying, European Language Resources Association (ELRA), Marseille, France. URL: https://www.aclweb.org/anthology/2020.trac-1.0

Kumari, K., Singh, J.P., 2021. Identification of cyberbullying on multi-modal social media posts using genetic algorithm. Trans. Emerg. Telecommun. Technol. 32(2), e3907. doi: 10.1002/ett.3907.

Kumari, 2019, Aggressive social media post detection system containing symbolic images, 415

Kumari, 2021, Multi-modal aggression identification using convolutional neural network and binary particle swarm optimization, Future Gener. Comput. Syst., 118, 187, 10.1016/j.future.2021.01.014

Li, Z., 2021. Codewithzichao@DravidianLangTech-EACL2021: Exploring multimodal transformers for meme classification in Tamil language. In: Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages, Association for Computational Linguistics, Kyiv. pp. 352–356. URL: https://www.aclweb.org/anthology/2021.dravidianlangtech-1.52.

Li, 2019

Mandl, T., Modha, S., Kumar, A.M., Chakravarthi, B.R., 2020. Overview of the HASOC track at FIRE 2020: hate speech and offensive language identification in Tamil, Malayalam, Hindi, English and German. In: Forum for Information Retrieval Evaluation, FIRE 2020, Association for Computing Machinery, New York, NY, USA, pp. 29–32. doi:10.1145/3441501.3441517.

Mihaylov, T., Georgiev, G., Nakov, P., 2015. Finding opinion manipulation trolls in news community forums. In: Proceedings of the Nineteenth Conference on Computational Natural Language Learning, Association for Computational Linguistics, Beijing, China, pp. 310–314. doi:10.18653/v1/K15-1032. URL: https://www.aclweb.org/anthology/K15-1032.

Mikolov, 2013, Efficient estimation of word representations in vector space, ICLR

Mishra, A.K., Saumya, S., 2021. IIIT_DWD@EACL2021: identifying troll meme in Tamil using a hybrid deep learning approach. In: Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages, Association for Computational Linguistics, Kyiv, pp. 243–248. URL: https://www.aclweb.org/anthology/2021.dravidianlangtech-1.33

Mouzannar, 2018

O’Malley, T., Bursztein, E., Long, J., Chollet, F., Jin, H., Invernizzi, L., et al., 2019. Keras tuner. URL: https://github.com/keras-team/keras-tuner.

Ou, X., Li, H., 2020. YNU@Dravidian-CodeMix-FIRE2020: XLM-RoBERTa for multi-language sentiment analysis. In: FIRE.

Pamungkas, E.W., Patti, V., 2019. Cross-domain and cross-lingual abusive language detection: a hybrid approach with deep learning and a multilingual lexicon. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, Association for Computational Linguistics, Florence, Italy, pp. 363–370. doi:10.18653/v1/P19-2051. URL: https://www.aclweb.org/anthology/P19-2051.

Que, Q., 2021. Simon @ DravidianLangTech-EACL2021: Meme classification for Tamil with BERT. In: Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages, Association for Computational Linguistics, Kyiv. pp. 287–290. URL: https://aclanthology.org/2021.dravidianlangtech-1.41.

Roberts, S.T., Tetreault, J., Prabhakaran, V., Waseem, Z. (Eds.), 2019. In: Proceedings of the Third Workshop on Abusive Language Online, Association for Computational Linguistics, Florence, Italy. URL: https://www.aclweb.org/anthology/W19-3500

Russakovsky, 2015, ImageNet large scale visual recognition challenge, Int. J. Comput. Vision, 115, 211, 10.1007/s11263-015-0816-y

Sadiq, 2021, Aggression detection through deep neural model on Twitter, Future Gener. Comput. Syst., 114, 120, 10.1016/j.future.2020.07.050

Saha, D., Paharia, N., Chakraborty, D., Saha, P., Mukherjee, A., 2021. Hate-alert@DravidianLangTech-EACL2021: Ensembling strategies for transformer-based offensive language detection. In: Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages, Association for Computational Linguistics, Kyiv, pp. 270–276. URL: https://www.aclweb.org/anthology/2021.dravidianlangtech-1.38

Sharif, O., Hossain, E., Hoque, M.M., 2021. Combating hostility: COVID-19 fake news and hostile post detection in social media. arXiv:2101.03291.

Singh, V.K., Ghosh, S., Jose, C., 2017. Toward multimodal cyberbullying detection. In: Proceedings of the 2017 CHI Conference Extended Abstracts on Human Factors in Computing Systems, CHI EA ’17, Association for Computing Machinery. New York, NY, USA. pp. 2090–2099. doi:10.1145/3027063.3053169.

Su, W., Zhu, X., Cao, Y., Li, B., Lu, L., Wei, F., Dai, J., 2019. VL-BERT: Pre-training of generic visual-linguistic representations. arXiv preprint arXiv:1908.08530.

Suryawanshi, S., Chakravarthi, B.R., Verma, P., Arcan, M., McCrae, J.P., Buitelaar, P., 2020. A dataset for troll classification of TamilMemes. In: Proceedings of the WILDRE5– 5th Workshop on Indian Language Data: Resources and Evaluation, European Language Resources Association (ELRA), Marseille, France, pp. 7–13. URL: https://www.aclweb.org/anthology/2020.wildre-1.2

Szegedy, 2015, Going deeper with convolutions, 1

Vidgen, B., Harris, A., Nguyen, D., Tromble, R., Hale, S., Margetts, H., 2019. Challenges and frontiers in abusive content detection. In: Proceedings of the Third Workshop on Abusive Language Online, Association for Computational Linguistics, Florence, Italy, pp. 80–93. doi:10.18653/v1/W19-3509. URL: https://www.aclweb.org/anthology/W19-3509

Zampieri, 2019, Predicting the type and target of offensive posts in social media, 1415

Zampieri, 2020, SemEval-2020 task 12: multilingual offensive language identification in social media (OffensEval 2020), 1425

Zhou, 2009, Ensemble learning, Encyclopedia Biometrics, 1, 270, 10.1007/978-0-387-73003-5_293

Zhou, 2020, Deep learning based fusion approach for hate speech detection, IEEE Access, 8, 128923, 10.1109/ACCESS.2020.3009244

Zou, 2018