A code-mixed task-oriented dialog dataset for medical domain

Computer Speech & Language - Tập 78 - Trang 101449 - 2023
Suman Dowlagar1, Radhika Mamidi1
1Language Technologies Research Center, International Institute of Information Technology, Hyderabad, 506002, Telangana, India

Tài liệu tham khảo

Aguilar, 2019 Aguilar, G., Kar, S., Solorio, T., 2020. LinCE: A centralized benchmark for linguistic code-switching evaluation. In: Proceedings of the 12th Language Resources and Evaluation Conference. pp. 1803–1813. Aguilar, G., Solorio, T., 2020. From english to code-switching: Transfer learning with strong morphological clues. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. pp. 8033–8044. Alsentzer, E., Murphy, J., Boag, W., Weng, W.-H., Jindi, D., Naumann, T., McDermott, M., 2019. Publicly available clinical BERT embeddings. In: Proceedings of the 2nd Clinical Natural Language Processing Workshop. pp. 72–78. Angara, 2017, Foodie fooderson a conversational agent for the smart kitchen, 247 Bai, 2022, Incremental intent detection for medical domain with contrast replay networks, 3549 Bali, K., Sharma, J., Choudhury, M., Vyas, Y., 2014 “I am borrowing ya mixing?” An analysis of English-Hindi code mixing in facebook. In: Proceedings of the First Workshop on Computational Approaches to Code Switching. pp. 116–126. Ball, K., Garrette, D., 2018. Part-of-Speech tagging for code-switched, transliterated texts without explicit language identification. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. pp. 3084–3089. Banerjee, 2018, A dataset for building code-mixed goal oriented conversation systems, 3766 Barman, 2016, Part-of-speech tagging of code-mixed social media content: Pipeline, stacking and joint modelling, 30 Basu, 2022, Strategies to improve few-shot learning for intent classification and slot-filling, 17 Basu, 2022, Strategies to improve few-shot learning for intent classification and slot-filling, 17 Bhargava, 2013, Easy contextual intent prediction and slot detection, 8337 Bickmore, 2016, Improving access to online health information with conversational agents: a randomized controlled experiment, J. Med. Internet Res., 18, 10.2196/jmir.5239 Bohra, 2018, A dataset of Hindi-English code-mixed social media text for hate speech detection, 36 Brown, 2003, Outcomes of patient-provider interaction, 155 Budzianowski, P., Wen, T.-H., Tseng, B.-H., Casanueva, I., Ultes, S., Ramadan, O., Gasic, M., 2018. MultiWOZ-A large-scale multi-domain wizard-of-oz dataset for task-oriented dialogue modelling. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. pp. 5016–5026. Byrne, 1976 Chakravarthi, B.R., 2020. HopeEDI: A multilingual hope speech detection dataset for equality, diversity, and inclusion. In: Proceedings of the Third Workshop on Computational Modeling of People’s Opinions, Personality, and Emotion’s in Social Media. pp. 41–53. Chakravarthi, 2020, A sentiment analysis dataset for code-mixed malayalam-english, 177 Chakravarthi, 2020, A sentiment analysis dataset for code-mixed malayalam-english, 177 Chakravarthi, 2021, Overview of the HASOC-DravidianCodeMix shared task on offensive language detection in tamil and malayalam Chakravarthi, B.R., Muralidaran, V., 2021. Findings of the shared task on hope speech detection for equality, diversity, and inclusion. In: Proceedings of the First Workshop on Language Technology for Equality, Diversity and Inclusion. pp. 61–72. Chakravarthi, 2021, Findings of the shared task on offensive language identification in tamil, malayalam, and kannada, 133 Chakravarthi, 2021 Chelba, 2003, Speech utterance classification, I Chen, 2016, End-to-end memory networks with knowledge carryover for multi-turn spoken language understanding, 3245 Chen, 2020, A simple framework for contrastive learning of visual representations, 1597 Chen, 2019 Clift, 2016 Colby, 1971, Artificial paranoia, Artificial Intelligence, 2, 1, 10.1016/0004-3702(71)90002-6 Conneau, 2020, Unsupervised cross-lingual representation learning at scale, 8440 Conneau, 2017 Core, M.G., Allen, J., 1997. Coding dialogs with the DAMSL annotation scheme. In: AAAI Fall Symposium on Communicative Action in Humans and Machines, Vol. 56. Boston, MA, pp. 28–35. Cote, 1986, Architecture of SNOMED: its contribution to medical language processing, 74 Coucke, 2018 Dabre, 2022, IndicBART: A pre-trained model for indic natural language generation, 1849 Devlin, J., Chang, M.-W., Lee, K., Toutanova, K., 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). pp. 4171–4186. Dowlagar, 2020 Dowlagar, S., Mamidi, R., 2021a. Gated convolutional sequence to sequence based learning for english-hingilsh code-switched machine translation. In: Proceedings of the Fifth Workshop on Computational Approaches to Linguistic Code-Switching. pp. 26–30. Dowlagar, S., Mamidi, R., 2021b. Graph convolutional networks with multi-headed attention for code-mixed sentiment analysis. In: Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages. pp. 65–72. Dowlagar, S., Mamidi, R., 2021c. A pre-trained transformer and CNN model with joint language ID and part-of-speech tagging for code-mixed social-media text. In: Proceedings of the 2021 Conference on Recent Advances in Natural Language Processing. pp. 367–374. Dowlagar, 2022, CMNEROne at SemEval-2022 task 11: Code-mixed named entity recognition by leveraging multilingual data, 1556 FB, 1963, Medical subject headings, Bull. Med. Libr. Assoc., 51, 114 Feng, 2022, Dynamic schema graph fusion network for multi-domain dialogue state tracking, 115 Feng, 2021, A sequence-to-sequence approach to dialogue state tracking, 1714 Feng, 2022, Language-agnostic BERT sentence embedding, 878 Fleiss, 1971, Measuring nominal scale agreement among many raters., Psychol. Bull., 76, 378, 10.1037/h0031619 Gambäck, B., Das, A., 2014. On measuring the complexity of code-mixing. In: Proceedings of the 11th International Conference on Natural Language Processing, Goa, India. pp. 1–7. Gao, 2019, Neural approaches to conversational ai, Found. Trends Inf. Retr., 13, 127, 10.1561/1500000074 Gehring, J., Auli, M., Grangier, D., Yarats, D., Dauphin, Y.N., 2017. Convolutional sequence to sequence learning. In: Proceedings of the 34th International Conference on Machine Learning-Volume 70. pp. 1243–1252. Gerz, 2021, Multilingual and cross-lingual intent detection from spoken data, 7468 Goo, C.-W., Gao, G., Hsu, Y.-K., Huo, C.-L., Chen, T.-C., Hsu, K.-W., Chen, Y.-N., 2018. Slot-gated modeling for joint slot filling and intent prediction. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers). pp. 753–757. Grave, E., Bojanowski, P., Gupta, P., Joulin, A., Mikolov, T., 2018. Learning Word vectors for 157 languages. In: Proceedings of the International Conference on Language Resources and Evaluation (LREC 2018). Gu, J., Wu, Q., Wu, C., Shi, W., Yu, Z., 2021. PRAL: A tailored pre-training model for task-oriented dialog generation. In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 2: Short Papers). pp. 305–313. Gumperz, 1964, Hindi-Punjabi code-switching in Delhi Gumperz, 1977, The sociolinguistic significance of conversational code-switching, RELC J., 8, 1, 10.1177/003368827700800201 Gundapu, 2020 Gupta, 2018, Uncovering code-mixed challenges: A framework for linguistically driven question generation and neural based question answering, 119 Gupta, A., Zhang, P., Lalwani, G., Diab, M., 2019. CASA-NLU: Context-aware self-attentive natural language understanding for task-oriented chatbots. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). pp. 1285–1290. Haffner, 2003, Optimizing SVMs for complex call classification, I He, 2021, From context-aware to knowledge-aware: Boosting OOV tokens recognition in slot tagging with background knowledge, Neurocomputing, 445, 267, 10.1016/j.neucom.2021.01.134 Hemphill, 1990, The ATIS spoken language systems pilot corpus Henderson, 2014, The second dialog state tracking challenge, 263 Heredia, 2001, Bilingual language mixing: Why do bilinguals code-switch?, Curr. Dir. Psychol. Sci., 10, 164, 10.1111/1467-8721.00140 Heritage, 2006, Problems and prospects in the study of physician-patient interaction: 30 years of research, Annu. Rev. Sociol., 32, 351, 10.1146/annurev.soc.32.082905.093959 Holmes, 2017 Hosseini-Asl, 2020, A simple language model for task-oriented dialogue, Adv. Neural Inf. Process. Syst., 33, 20179 Hoxha, 2016, DREAM: Classification scheme for dialog acts in clinical research query mediation, J. Biomed. Inform., 59, 89, 10.1016/j.jbi.2015.11.011 Hu, 2020, Xtreme: A massively multilingual multi-task benchmark for evaluating cross-lingual generalisation, 4411 Jitta, 2017, “Nee intention enti?” towards dialog act recognition in code-mixed conversations, 243 Johnson, 2016, MIMIC-III, a freely accessible critical care database, Scientific Data, 3, 1, 10.1038/sdata.2016.35 Jordan, 1997, Serial order: A parallel distributed processing approach, 471, 10.1016/S0166-4115(97)80111-2 Joshi, 1982, Processing of sentences with intra-sentential code-switching Jurafsky, 2000 Kachru, 1978 Kakwani, D., Kunchukuttan, A., Golla, S., Gokul, N., Bhattacharyya, A., Khapra, M.M., Kumar, P., 2020. iNLPSuite: Monolingual corpora, evaluation benchmarks and pre-trained multilingual language models for Indian languages. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings. pp. 4948–4961. Kanakagiri, T., Radhakrishnan, K., 2021. Task-oriented dialog systems for dravidian languages. In: Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages. pp. 85–93. Khanuja, 2021 Khanuja, S., Dandapat, S., Srinivasan, A., Sitaram, S., Choudhury, M., 2020. GLUECoS: An evaluation benchmark for code-switched NLP. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. pp. 3575–3585. Lample, 2017 Laranjo, 2018, Conversational agents in healthcare: a systematic review, J. Am. Med. Inform. Assoc., 25, 1248, 10.1093/jamia/ocy072 Li, X., Chen, Y.-N., Li, L., Gao, J., Celikyilmaz, A., 2017. End-to-end task-completion neural dialogue systems. In: Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 1: Long Papers). pp. 733–743. Li, 2022, BiERU: Bidirectional emotional recurrent unit for conversational sentiment analysis, Neurocomputing, 467, 73, 10.1016/j.neucom.2021.09.057 Li, 2021, Retrieve & memorize: Dialog policy learning with multi-action memory, 447 Liu, 2016, Attention-based recurrent neural network models for joint intent detection and slot filling, 685 Louvan, 2020, Recent neural methods on slot filling and intent classification for task-oriented dialogue systems: A survey, 480 Luo, 2021, Promoting physical activity through conversational agents: Mixed methods systematic review, J. Med. Internet Res., 23, 10.2196/25486 Madotto, A., Wu, C.-S., Fung, P., 2018. Mem2Seq: Effectively incorporating knowledge bases into end-to-end task-oriented dialog systems. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). pp. 1468–1478. Mager, M., Çetinoğlu, Ö., Kann, K., 2019. Subword-level language identification for intra-word code-switching. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). pp. 2005–2011. Mave, D., Maharjan, S., Solorio, T., 2018. Language identification and analysis of code-switched social media text. In: Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching. pp. 51–61. Mesnil, 2013, Investigation of recurrent-neural-network architectures and learning methods for spoken language understanding, 3771 Myers-Scotton, 1993, Common and uncommon ground: Social and structural factors in codeswitching, Lang. Soc., 22, 475, 10.1017/S0047404500017449 Myers-Scotton, 1997 Ni, J., Pandelea, V., Young, T., Zhou, H., Cambria, E., 2022. HiTKG: Towards goal-oriented conversations via multi-hierarchy learning. In: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 36. pp. 11112–11120. Papangelis, 2019 Pfeiffer, J., Kamath, A., Rücklé, A., Cho, K., Gurevych, I., 2021. AdapterFusion: Non-destructive task composition for transfer learning. In: Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume. pp. 487–503. Pfeiffer, J., Rücklé, A., Poth, C., Kamath, A., Vulić, I., Ruder, S., Cho, K., Gurevych, I., 2020. AdapterHub: A framework for adapting transformers. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations. pp. 46–54. Pooja, 2016, Code-mixing and code-switching in Telugu t.v. channels, Int. J. Engl. Lang. Lit. Transl. Stud. (IJELR), 3, 608 Poplack, 1978 Priyadharshini, R., Chakravarthi, B.R., Cn, S., Durairaj, T., Subramanian, M., Shanmugavadivel, K., Hegde, S.U., Kumaresan, P., 2022. Overview of abusive comment detection in Tamil-ACL 2022. In: Proceedings of the Second Workshop on Speech and Language Technologies for Dravidian Languages. pp. 292–298. Priyadharshini, 2020, Named entity recognition for code-mixed Indian corpus using meta embedding, 68 Ramshaw, 1999, Text chunking using transformation-based learning, 157 Raymond, 2007, Generative and discriminative algorithms for spoken language understanding Robinson, 2003, An interactional structure of medical activities during acute visits and its implications for patients’ participation, Health Commun., 15, 27, 10.1207/S15327027HC1501_2 Rojowiec, R., Roth, B., Fink, M., 2020. Intent recognition in doctor-patient interviews. In: Proceedings of the 12th Language Resources and Evaluation Conference. pp. 702–709. Ruder, 2019, A survey of cross-lingual word embedding models, J. Artificial Intelligence Res., 65, 569, 10.1613/jair.1.11640 S, 2022, TamilATIS: Dataset for task-oriented dialog in tamil, 25 Sacks, 1978, A simplest systematics for the organization of turn taking for conversation, 7 Sampath, 2022, Findings of the shared task on emotion analysis in tamil, 279 Sane, 2019 Sane, 2019 Schegloff, 1977, The preference for self-correction in the organization of repair in conversation, Language, 53, 361, 10.1353/lan.1977.0041 Schuster, S., Gupta, S., Shah, R., Lewis, M., 2019. Cross-lingual transfer learning for multilingual task oriented dialog. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). pp. 3795–3805. Scotton, 1977 Sharma, 2015, Pos tagging for code-mixed indian social media text: Systems from iiit-h for icon nlp tools contest Shen, T., Zhou, T., Long, G., Jiang, J., Pan, S., Zhang, C., 2018. Disan: Directional self-attention network for rnn/cnn-free language understanding. In: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32. Sitaram, 2019 Solorio, 2008, Learning to predict code-switching points, 973 Sravani, D., Kameswari, L., Mamidi, R., 2021a. Political discourse analysis: A case study of code mixing and code switching in political speeches. In: Proceedings of the Fifth Workshop on Computational Approaches to Linguistic Code-Switching. pp. 1–5. Sravani, 2021, Political discourse analysis: A case study of code mixing and code switching in political speeches, 1 Srirangam, V.K., Reddy, A.A., Singh, V., Shrivastava, M., 2019. Corpus creation and analysis for named entity recognition in Telugu-English code-mixed social media data. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop. pp. 183–189. Srivastava, V., Singh, M., 2020. PHINC: A parallel hinglish social media code-mixed corpus for machine translation. In: Proceedings of the Sixth Workshop on Noisy User-Generated Text (W-NUT 2020). pp. 41–49. Stalnaker, 1978, Assertion, 315 Stewart, 1995, Effective physician-patient communication and health outcomes: a review, CMAJ: Can. Med. Assoc. J., 152, 1423 Stivers, 2005, Domains of knowledge and responsibility: Questioning in acute pediatric encounters Stolcke, 2000, Dialogue act modeling for automatic tagging and recognition of conversational speech, Comput. Linguist., 26, 339, 10.1162/089120100561737 Strong, 2018 Tur, 2011 Valizadeh, M., Parde, N., 2022. The AI doctor is in: A survey of task-oriented dialogue systems for healthcare applications. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). pp. 6638–6660. Varshney, 2022 Vaswani, 2017, Attention is all you need, 5998 Vickers, 2015, Third party interaction in the medical context: Code-switching and control, J. Pragmat., 84, 154, 10.1016/j.pragma.2015.05.009 Vyas, 2014, Pos tagging of english-hindi code-mixed social media content, 974 Wang, 2005, Spoken language understanding, IEEE Signal Process. Mag., 22, 16, 10.1109/MSP.2005.1511821 Wang, 2022, LUNA: Learning slot-turn alignment for dialogue state tracking, 3319 Wei, 2018, Task-oriented dialogue system for automatic diagnosis, 201 Weizenbaum, 1966, ELIZA—a computer program for the study of natural language communication between man and machine, Commun. ACM, 9, 36, 10.1145/365153.365168 Weld, 2021 Wen, T.-H., Vandyke, D., Mrkšić, N., Gasic, M., Barahona, L.M.R., Su, P.-H., Ultes, S., Young, S., 2017. A network-based end-to-end trainable task-oriented dialogue system. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers. pp. 438–449. West, 1984 Winata, 2019, Learning multilingual meta-embeddings for code-switching named entity recognition, 181 Wood, 2019, Departing from doctor-speak: A perspective on code-switching in the medical setting, J. Gen. Intern. Med., 34, 464, 10.1007/s11606-018-4768-0 Wu, 2020, SlotRefine: A fast non-autoregressive model for joint intent detection and slot filling, 1932 Xu, 2020 Xu, 2013, Convolutional neural network based triangular crf for joint intent detection and slot filling, 78 Yan, Z., Duan, N., Chen, P., Zhou, M., Zhou, J., Li, Z., 2017. Building task-oriented dialogue systems for online shopping. In: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 31. Yang, 2019, Improving multilingual sentence embedding using bi-directional dual encoder with additive margin softmax, 5370 Yang, Y., Li, Y., Quan, X., 2021. Ubar: Towards fully end-to-end task-oriented dialog system with gpt-2. In: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. pp. 14230–14238. Yao, 2013, Recurrent neural networks for language understanding, 2524 Yirmibeşoğlu, 2018, Detecting code-switching between turkish-english language pair, 110 Young, T., Cambria, E., Chaturvedi, I., Zhou, H., Biswas, S., Huang, M., 2018. Augmenting end-to-end dialogue systems with commonsense knowledge. In: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32. Young, 2020, Dialogue systems with audio context, Neurocomputing, 388, 102, 10.1016/j.neucom.2019.12.126 Young, T., Xing, F., Pandelea, V., Ni, J., Cambria, E., 2022. Fusing task-oriented and open-domain dialogues in conversational agents. In: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 36. pp. 11622–11629. Zeng, 2020, MedDialog: A large-scale medical dialogue dataset, 9241 Zhang, 2020, DIALOGPT : Large-scale generative pre-training for conversational response generation, 270 Zhang, 2020, Recent advances and challenges in task-oriented dialog systems, Sci. China Technol. Sci., 1 Zhang, 2022, Efficient dialog policy learning by reasoning with contextual knowledge, 11667 Zhang, 2022, New intent discovery with pre-training and contrastive learning, 256 Zhong, P., Wang, D., Li, P., Zhang, C., Wang, H., Miao, C., 2021. Care: Commonsense-aware emotional response generation with latent concepts. In: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. pp. 14577–14585. Zhou, 2019 Zhu, 2020, Crosswoz: A large-scale chinese cross-domain task-oriented dialogue dataset, Trans. Assoc. Comput. Linguist., 8, 281, 10.1162/tacl_a_00314