A code-mixed task-oriented dialog dataset for medical domain
Tài liệu tham khảo
Aguilar, 2019
Aguilar, G., Kar, S., Solorio, T., 2020. LinCE: A centralized benchmark for linguistic code-switching evaluation. In: Proceedings of the 12th Language Resources and Evaluation Conference. pp. 1803–1813.
Aguilar, G., Solorio, T., 2020. From english to code-switching: Transfer learning with strong morphological clues. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. pp. 8033–8044.
Alsentzer, E., Murphy, J., Boag, W., Weng, W.-H., Jindi, D., Naumann, T., McDermott, M., 2019. Publicly available clinical BERT embeddings. In: Proceedings of the 2nd Clinical Natural Language Processing Workshop. pp. 72–78.
Angara, 2017, Foodie fooderson a conversational agent for the smart kitchen, 247
Bai, 2022, Incremental intent detection for medical domain with contrast replay networks, 3549
Bali, K., Sharma, J., Choudhury, M., Vyas, Y., 2014 “I am borrowing ya mixing?” An analysis of English-Hindi code mixing in facebook. In: Proceedings of the First Workshop on Computational Approaches to Code Switching. pp. 116–126.
Ball, K., Garrette, D., 2018. Part-of-Speech tagging for code-switched, transliterated texts without explicit language identification. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. pp. 3084–3089.
Banerjee, 2018, A dataset for building code-mixed goal oriented conversation systems, 3766
Barman, 2016, Part-of-speech tagging of code-mixed social media content: Pipeline, stacking and joint modelling, 30
Basu, 2022, Strategies to improve few-shot learning for intent classification and slot-filling, 17
Basu, 2022, Strategies to improve few-shot learning for intent classification and slot-filling, 17
Bhargava, 2013, Easy contextual intent prediction and slot detection, 8337
Bickmore, 2016, Improving access to online health information with conversational agents: a randomized controlled experiment, J. Med. Internet Res., 18, 10.2196/jmir.5239
Bohra, 2018, A dataset of Hindi-English code-mixed social media text for hate speech detection, 36
Brown, 2003, Outcomes of patient-provider interaction, 155
Budzianowski, P., Wen, T.-H., Tseng, B.-H., Casanueva, I., Ultes, S., Ramadan, O., Gasic, M., 2018. MultiWOZ-A large-scale multi-domain wizard-of-oz dataset for task-oriented dialogue modelling. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. pp. 5016–5026.
Byrne, 1976
Chakravarthi, B.R., 2020. HopeEDI: A multilingual hope speech detection dataset for equality, diversity, and inclusion. In: Proceedings of the Third Workshop on Computational Modeling of People’s Opinions, Personality, and Emotion’s in Social Media. pp. 41–53.
Chakravarthi, 2020, A sentiment analysis dataset for code-mixed malayalam-english, 177
Chakravarthi, 2020, A sentiment analysis dataset for code-mixed malayalam-english, 177
Chakravarthi, 2021, Overview of the HASOC-DravidianCodeMix shared task on offensive language detection in tamil and malayalam
Chakravarthi, B.R., Muralidaran, V., 2021. Findings of the shared task on hope speech detection for equality, diversity, and inclusion. In: Proceedings of the First Workshop on Language Technology for Equality, Diversity and Inclusion. pp. 61–72.
Chakravarthi, 2021, Findings of the shared task on offensive language identification in tamil, malayalam, and kannada, 133
Chakravarthi, 2021
Chelba, 2003, Speech utterance classification, I
Chen, 2016, End-to-end memory networks with knowledge carryover for multi-turn spoken language understanding, 3245
Chen, 2020, A simple framework for contrastive learning of visual representations, 1597
Chen, 2019
Clift, 2016
Colby, 1971, Artificial paranoia, Artificial Intelligence, 2, 1, 10.1016/0004-3702(71)90002-6
Conneau, 2020, Unsupervised cross-lingual representation learning at scale, 8440
Conneau, 2017
Core, M.G., Allen, J., 1997. Coding dialogs with the DAMSL annotation scheme. In: AAAI Fall Symposium on Communicative Action in Humans and Machines, Vol. 56. Boston, MA, pp. 28–35.
Cote, 1986, Architecture of SNOMED: its contribution to medical language processing, 74
Coucke, 2018
Dabre, 2022, IndicBART: A pre-trained model for indic natural language generation, 1849
Devlin, J., Chang, M.-W., Lee, K., Toutanova, K., 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). pp. 4171–4186.
Dowlagar, 2020
Dowlagar, S., Mamidi, R., 2021a. Gated convolutional sequence to sequence based learning for english-hingilsh code-switched machine translation. In: Proceedings of the Fifth Workshop on Computational Approaches to Linguistic Code-Switching. pp. 26–30.
Dowlagar, S., Mamidi, R., 2021b. Graph convolutional networks with multi-headed attention for code-mixed sentiment analysis. In: Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages. pp. 65–72.
Dowlagar, S., Mamidi, R., 2021c. A pre-trained transformer and CNN model with joint language ID and part-of-speech tagging for code-mixed social-media text. In: Proceedings of the 2021 Conference on Recent Advances in Natural Language Processing. pp. 367–374.
Dowlagar, 2022, CMNEROne at SemEval-2022 task 11: Code-mixed named entity recognition by leveraging multilingual data, 1556
FB, 1963, Medical subject headings, Bull. Med. Libr. Assoc., 51, 114
Feng, 2022, Dynamic schema graph fusion network for multi-domain dialogue state tracking, 115
Feng, 2021, A sequence-to-sequence approach to dialogue state tracking, 1714
Feng, 2022, Language-agnostic BERT sentence embedding, 878
Fleiss, 1971, Measuring nominal scale agreement among many raters., Psychol. Bull., 76, 378, 10.1037/h0031619
Gambäck, B., Das, A., 2014. On measuring the complexity of code-mixing. In: Proceedings of the 11th International Conference on Natural Language Processing, Goa, India. pp. 1–7.
Gao, 2019, Neural approaches to conversational ai, Found. Trends Inf. Retr., 13, 127, 10.1561/1500000074
Gehring, J., Auli, M., Grangier, D., Yarats, D., Dauphin, Y.N., 2017. Convolutional sequence to sequence learning. In: Proceedings of the 34th International Conference on Machine Learning-Volume 70. pp. 1243–1252.
Gerz, 2021, Multilingual and cross-lingual intent detection from spoken data, 7468
Goo, C.-W., Gao, G., Hsu, Y.-K., Huo, C.-L., Chen, T.-C., Hsu, K.-W., Chen, Y.-N., 2018. Slot-gated modeling for joint slot filling and intent prediction. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers). pp. 753–757.
Grave, E., Bojanowski, P., Gupta, P., Joulin, A., Mikolov, T., 2018. Learning Word vectors for 157 languages. In: Proceedings of the International Conference on Language Resources and Evaluation (LREC 2018).
Gu, J., Wu, Q., Wu, C., Shi, W., Yu, Z., 2021. PRAL: A tailored pre-training model for task-oriented dialog generation. In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 2: Short Papers). pp. 305–313.
Gumperz, 1964, Hindi-Punjabi code-switching in Delhi
Gumperz, 1977, The sociolinguistic significance of conversational code-switching, RELC J., 8, 1, 10.1177/003368827700800201
Gundapu, 2020
Gupta, 2018, Uncovering code-mixed challenges: A framework for linguistically driven question generation and neural based question answering, 119
Gupta, A., Zhang, P., Lalwani, G., Diab, M., 2019. CASA-NLU: Context-aware self-attentive natural language understanding for task-oriented chatbots. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). pp. 1285–1290.
Haffner, 2003, Optimizing SVMs for complex call classification, I
He, 2021, From context-aware to knowledge-aware: Boosting OOV tokens recognition in slot tagging with background knowledge, Neurocomputing, 445, 267, 10.1016/j.neucom.2021.01.134
Hemphill, 1990, The ATIS spoken language systems pilot corpus
Henderson, 2014, The second dialog state tracking challenge, 263
Heredia, 2001, Bilingual language mixing: Why do bilinguals code-switch?, Curr. Dir. Psychol. Sci., 10, 164, 10.1111/1467-8721.00140
Heritage, 2006, Problems and prospects in the study of physician-patient interaction: 30 years of research, Annu. Rev. Sociol., 32, 351, 10.1146/annurev.soc.32.082905.093959
Holmes, 2017
Hosseini-Asl, 2020, A simple language model for task-oriented dialogue, Adv. Neural Inf. Process. Syst., 33, 20179
Hoxha, 2016, DREAM: Classification scheme for dialog acts in clinical research query mediation, J. Biomed. Inform., 59, 89, 10.1016/j.jbi.2015.11.011
Hu, 2020, Xtreme: A massively multilingual multi-task benchmark for evaluating cross-lingual generalisation, 4411
Jitta, 2017, “Nee intention enti?” towards dialog act recognition in code-mixed conversations, 243
Johnson, 2016, MIMIC-III, a freely accessible critical care database, Scientific Data, 3, 1, 10.1038/sdata.2016.35
Jordan, 1997, Serial order: A parallel distributed processing approach, 471, 10.1016/S0166-4115(97)80111-2
Joshi, 1982, Processing of sentences with intra-sentential code-switching
Jurafsky, 2000
Kachru, 1978
Kakwani, D., Kunchukuttan, A., Golla, S., Gokul, N., Bhattacharyya, A., Khapra, M.M., Kumar, P., 2020. iNLPSuite: Monolingual corpora, evaluation benchmarks and pre-trained multilingual language models for Indian languages. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings. pp. 4948–4961.
Kanakagiri, T., Radhakrishnan, K., 2021. Task-oriented dialog systems for dravidian languages. In: Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages. pp. 85–93.
Khanuja, 2021
Khanuja, S., Dandapat, S., Srinivasan, A., Sitaram, S., Choudhury, M., 2020. GLUECoS: An evaluation benchmark for code-switched NLP. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. pp. 3575–3585.
Lample, 2017
Laranjo, 2018, Conversational agents in healthcare: a systematic review, J. Am. Med. Inform. Assoc., 25, 1248, 10.1093/jamia/ocy072
Li, X., Chen, Y.-N., Li, L., Gao, J., Celikyilmaz, A., 2017. End-to-end task-completion neural dialogue systems. In: Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 1: Long Papers). pp. 733–743.
Li, 2022, BiERU: Bidirectional emotional recurrent unit for conversational sentiment analysis, Neurocomputing, 467, 73, 10.1016/j.neucom.2021.09.057
Li, 2021, Retrieve & memorize: Dialog policy learning with multi-action memory, 447
Liu, 2016, Attention-based recurrent neural network models for joint intent detection and slot filling, 685
Louvan, 2020, Recent neural methods on slot filling and intent classification for task-oriented dialogue systems: A survey, 480
Luo, 2021, Promoting physical activity through conversational agents: Mixed methods systematic review, J. Med. Internet Res., 23, 10.2196/25486
Madotto, A., Wu, C.-S., Fung, P., 2018. Mem2Seq: Effectively incorporating knowledge bases into end-to-end task-oriented dialog systems. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). pp. 1468–1478.
Mager, M., Çetinoğlu, Ö., Kann, K., 2019. Subword-level language identification for intra-word code-switching. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). pp. 2005–2011.
Mave, D., Maharjan, S., Solorio, T., 2018. Language identification and analysis of code-switched social media text. In: Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching. pp. 51–61.
Mesnil, 2013, Investigation of recurrent-neural-network architectures and learning methods for spoken language understanding, 3771
Myers-Scotton, 1993, Common and uncommon ground: Social and structural factors in codeswitching, Lang. Soc., 22, 475, 10.1017/S0047404500017449
Myers-Scotton, 1997
Ni, J., Pandelea, V., Young, T., Zhou, H., Cambria, E., 2022. HiTKG: Towards goal-oriented conversations via multi-hierarchy learning. In: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 36. pp. 11112–11120.
Papangelis, 2019
Pfeiffer, J., Kamath, A., Rücklé, A., Cho, K., Gurevych, I., 2021. AdapterFusion: Non-destructive task composition for transfer learning. In: Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume. pp. 487–503.
Pfeiffer, J., Rücklé, A., Poth, C., Kamath, A., Vulić, I., Ruder, S., Cho, K., Gurevych, I., 2020. AdapterHub: A framework for adapting transformers. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations. pp. 46–54.
Pooja, 2016, Code-mixing and code-switching in Telugu t.v. channels, Int. J. Engl. Lang. Lit. Transl. Stud. (IJELR), 3, 608
Poplack, 1978
Priyadharshini, R., Chakravarthi, B.R., Cn, S., Durairaj, T., Subramanian, M., Shanmugavadivel, K., Hegde, S.U., Kumaresan, P., 2022. Overview of abusive comment detection in Tamil-ACL 2022. In: Proceedings of the Second Workshop on Speech and Language Technologies for Dravidian Languages. pp. 292–298.
Priyadharshini, 2020, Named entity recognition for code-mixed Indian corpus using meta embedding, 68
Ramshaw, 1999, Text chunking using transformation-based learning, 157
Raymond, 2007, Generative and discriminative algorithms for spoken language understanding
Robinson, 2003, An interactional structure of medical activities during acute visits and its implications for patients’ participation, Health Commun., 15, 27, 10.1207/S15327027HC1501_2
Rojowiec, R., Roth, B., Fink, M., 2020. Intent recognition in doctor-patient interviews. In: Proceedings of the 12th Language Resources and Evaluation Conference. pp. 702–709.
Ruder, 2019, A survey of cross-lingual word embedding models, J. Artificial Intelligence Res., 65, 569, 10.1613/jair.1.11640
S, 2022, TamilATIS: Dataset for task-oriented dialog in tamil, 25
Sacks, 1978, A simplest systematics for the organization of turn taking for conversation, 7
Sampath, 2022, Findings of the shared task on emotion analysis in tamil, 279
Sane, 2019
Sane, 2019
Schegloff, 1977, The preference for self-correction in the organization of repair in conversation, Language, 53, 361, 10.1353/lan.1977.0041
Schuster, S., Gupta, S., Shah, R., Lewis, M., 2019. Cross-lingual transfer learning for multilingual task oriented dialog. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). pp. 3795–3805.
Scotton, 1977
Sharma, 2015, Pos tagging for code-mixed indian social media text: Systems from iiit-h for icon nlp tools contest
Shen, T., Zhou, T., Long, G., Jiang, J., Pan, S., Zhang, C., 2018. Disan: Directional self-attention network for rnn/cnn-free language understanding. In: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32.
Sitaram, 2019
Solorio, 2008, Learning to predict code-switching points, 973
Sravani, D., Kameswari, L., Mamidi, R., 2021a. Political discourse analysis: A case study of code mixing and code switching in political speeches. In: Proceedings of the Fifth Workshop on Computational Approaches to Linguistic Code-Switching. pp. 1–5.
Sravani, 2021, Political discourse analysis: A case study of code mixing and code switching in political speeches, 1
Srirangam, V.K., Reddy, A.A., Singh, V., Shrivastava, M., 2019. Corpus creation and analysis for named entity recognition in Telugu-English code-mixed social media data. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop. pp. 183–189.
Srivastava, V., Singh, M., 2020. PHINC: A parallel hinglish social media code-mixed corpus for machine translation. In: Proceedings of the Sixth Workshop on Noisy User-Generated Text (W-NUT 2020). pp. 41–49.
Stalnaker, 1978, Assertion, 315
Stewart, 1995, Effective physician-patient communication and health outcomes: a review, CMAJ: Can. Med. Assoc. J., 152, 1423
Stivers, 2005, Domains of knowledge and responsibility: Questioning in acute pediatric encounters
Stolcke, 2000, Dialogue act modeling for automatic tagging and recognition of conversational speech, Comput. Linguist., 26, 339, 10.1162/089120100561737
Strong, 2018
Tur, 2011
Valizadeh, M., Parde, N., 2022. The AI doctor is in: A survey of task-oriented dialogue systems for healthcare applications. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). pp. 6638–6660.
Varshney, 2022
Vaswani, 2017, Attention is all you need, 5998
Vickers, 2015, Third party interaction in the medical context: Code-switching and control, J. Pragmat., 84, 154, 10.1016/j.pragma.2015.05.009
Vyas, 2014, Pos tagging of english-hindi code-mixed social media content, 974
Wang, 2005, Spoken language understanding, IEEE Signal Process. Mag., 22, 16, 10.1109/MSP.2005.1511821
Wang, 2022, LUNA: Learning slot-turn alignment for dialogue state tracking, 3319
Wei, 2018, Task-oriented dialogue system for automatic diagnosis, 201
Weizenbaum, 1966, ELIZA—a computer program for the study of natural language communication between man and machine, Commun. ACM, 9, 36, 10.1145/365153.365168
Weld, 2021
Wen, T.-H., Vandyke, D., Mrkšić, N., Gasic, M., Barahona, L.M.R., Su, P.-H., Ultes, S., Young, S., 2017. A network-based end-to-end trainable task-oriented dialogue system. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers. pp. 438–449.
West, 1984
Winata, 2019, Learning multilingual meta-embeddings for code-switching named entity recognition, 181
Wood, 2019, Departing from doctor-speak: A perspective on code-switching in the medical setting, J. Gen. Intern. Med., 34, 464, 10.1007/s11606-018-4768-0
Wu, 2020, SlotRefine: A fast non-autoregressive model for joint intent detection and slot filling, 1932
Xu, 2020
Xu, 2013, Convolutional neural network based triangular crf for joint intent detection and slot filling, 78
Yan, Z., Duan, N., Chen, P., Zhou, M., Zhou, J., Li, Z., 2017. Building task-oriented dialogue systems for online shopping. In: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 31.
Yang, 2019, Improving multilingual sentence embedding using bi-directional dual encoder with additive margin softmax, 5370
Yang, Y., Li, Y., Quan, X., 2021. Ubar: Towards fully end-to-end task-oriented dialog system with gpt-2. In: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. pp. 14230–14238.
Yao, 2013, Recurrent neural networks for language understanding, 2524
Yirmibeşoğlu, 2018, Detecting code-switching between turkish-english language pair, 110
Young, T., Cambria, E., Chaturvedi, I., Zhou, H., Biswas, S., Huang, M., 2018. Augmenting end-to-end dialogue systems with commonsense knowledge. In: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32.
Young, 2020, Dialogue systems with audio context, Neurocomputing, 388, 102, 10.1016/j.neucom.2019.12.126
Young, T., Xing, F., Pandelea, V., Ni, J., Cambria, E., 2022. Fusing task-oriented and open-domain dialogues in conversational agents. In: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 36. pp. 11622–11629.
Zeng, 2020, MedDialog: A large-scale medical dialogue dataset, 9241
Zhang, 2020, DIALOGPT : Large-scale generative pre-training for conversational response generation, 270
Zhang, 2020, Recent advances and challenges in task-oriented dialog systems, Sci. China Technol. Sci., 1
Zhang, 2022, Efficient dialog policy learning by reasoning with contextual knowledge, 11667
Zhang, 2022, New intent discovery with pre-training and contrastive learning, 256
Zhong, P., Wang, D., Li, P., Zhang, C., Wang, H., Miao, C., 2021. Care: Commonsense-aware emotional response generation with latent concepts. In: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. pp. 14577–14585.
Zhou, 2019
Zhu, 2020, Crosswoz: A large-scale chinese cross-domain task-oriented dialogue dataset, Trans. Assoc. Comput. Linguist., 8, 281, 10.1162/tacl_a_00314