Mining Text in Online News Reports of COVID-19 Virus: Key Phrase Extractions and Graphic Modeling
Abstract
The emergence and spread of COVID-19 have altered the way the world operates. As the pandemic runs its course, language educators and learners around the world face a unique set of challenges. No news stories are currently more relevant, pressing, or internationally ubiquitous than those related to COVID-19, and for L2 learners to have a seat at the global table, learning languages through such news stories is necessary. The current study therefore applied text mining techniques to explore and identify patterns among news stories related to COVID-19. A corpus of online news reports about COVID-19 was analyzed. Several R packages, including readtext, tidytext, ggplot2, and ggraph, were jointly employed to extract key phrases and to construct a graphic model of the news corpus. Term frequency–inverse document frequency (TF-IDF), a term-extraction method widely used in text mining, was used to extract the key phrases from the news reports. A wordnet structure was then established to uncover potentially salient thematic components. Pedagogical implications for language education and vocabulary assessment are also discussed.
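To make the TF-IDF step concrete, the following is a minimal sketch, not the study's own code. The study used R packages (tidytext among them); this illustration uses plain Python instead, and the function name tf_idf, the three toy "reports", and the whitespace tokenization are all assumptions introduced here. It uses the common definition in which a term's frequency within a document is multiplied by the natural log of the number of documents divided by the number of documents containing the term.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Score each term in each document by TF-IDF.

    docs: dict mapping a document name to its list of tokens.
    Returns a dict mapping each document name to {term: score}.
    (Hypothetical helper for illustration only.)
    """
    n_docs = len(docs)

    # Document frequency: in how many documents does each term occur?
    doc_freq = Counter()
    for tokens in docs.values():
        doc_freq.update(set(tokens))

    scores = {}
    for name, tokens in docs.items():
        total = len(tokens)
        if total == 0:
            scores[name] = {}
            continue
        counts = Counter(tokens)
        scores[name] = {
            # tf = share of the document's tokens; idf = ln(N / df)
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        }
    return scores


# Toy corpus (invented for this sketch, not taken from the study's data).
docs = {
    "report_a": "virus cases rise as lockdown begins".split(),
    "report_b": "vaccine trial begins as virus spreads".split(),
    "report_c": "schools close as cases rise".split(),
}

scores = tf_idf(docs)

# Highest-scoring terms of one report: candidates for its "key" terms.
top_terms = sorted(scores["report_b"].items(), key=lambda kv: kv[1], reverse=True)[:3]
```

In this toy corpus the word "as" occurs in every report, so its score is zero, while a word found in only one report, such as "vaccine", scores highest there. This is the property that makes TF-IDF useful for surfacing terms that distinguish one news report from the rest of a corpus.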