Những bước đầu tiên hướng tới tương tác giữa con người và robot giống như tự nhiên

Autonomous Robots - Tập 22 - Trang 411-423 - 2006
Matthias Scheutz1, Paul Schermerhorn1, James Kramer1, David Anderson1
1Artificial Intelligence and Robotics Laboratory, Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, USA

Tóm tắt

Tương tác giữa con người và robot giống như con người (NHL-HRI) tự nhiên đòi hỏi robot phải có khả năng nhận diện và tạo ra nhiều hành vi tinh tế của con người, thường bị coi nhẹ bởi con người. Chúng tôi đề xuất một sự phân chia sơ bộ các yêu cầu đối với NHL-HRI thành ba nhóm thuộc tính: (1) hành vi xã hội, (2) nhận thức hướng đến mục tiêu, và (3) trí tuệ vững chắc, đồng thời trình bày kiến trúc DIARC mới cho các robot tình cảm phức tạp nhằm tương tác giữa con người và robot, với mục tiêu đáp ứng một số yêu cầu đó. Chúng tôi mô tả ngắn gọn các thuộc tính chức năng của DIARC và việc triển khai nó trong hệ thống ADE của chúng tôi. Sau đó, chúng tôi báo cáo kết quả từ các đánh giá của đối tượng con người trong phòng thí nghiệm cũng như những trải nghiệm của chúng tôi với robot chạy ADE tại cuộc thi Robot AAAI 2005 trong sự kiện Tương tác Mở và Triển lãm Robot.

Từ khóa

#tương tác con người-robot #robot tình cảm #kiến trúc DIARC #nhận thức xã hội #trí tuệ nhân tạo

Tài liệu tham khảo

Andronache, V. and Scheutz, M. 2004. Integrating theory and practice: The agent architecture framework APOC and its development environment ADE. In Proceedings of AAMAS 2004. Andronache, V. and Scheutz, M. 2006. ADE—a tool for the development of distributed architectures for virtual and robotic agents. In Petta, P., Müller, J. (Eds.), Best of at2ai-4, vol. 20. Bless, H., Schwarz, N., and Wieland, R. 1996. Mood and the impact of category membership and individuating information. European Journal of Social Psychology, 26:935–959. Breazeal, C., Hoffman, G., and Lockerd, A. 2004. Teaching and working with robots as a collaboration. In Proceedings of AAMAS 2004. Breazeal, C. and Scassellati, B. 1999. How to build robots that make friends and influence people. In Iros, pp. 858–863. Breazeal, C.L. 2002. Designing Sociable Robots. MIT Press. Burkhart, F. 2005. Emofilt: the simulation of emotional speech by prosody-transformation. In Proceedings of Interspeech 2005. Byers, Z., Dixon, M., Goodier, K., Grimm, C.M. and Smart, W.D. 2003. An autonomous robot photographer. In Proceedings of Iros 2003. Las Vegas, NV. Clore, G., Gasper, K., and Conway, H. 2001. Affect as information. In Forgas, J. (Ed.), Handbook of Affect and Social Cognition, Erlbaum, Mahwah, NJ, pp. 121–144. The CMU Sphinx group open source speech recognition engines. 2004. http://cmusphinx.sourceforge.net/html/cmusphinx.php. Desai, M. and Yanco, H.A. 2005. August. Blending human and robot inputs for sliding scale autonomy. In Proceedings of the 14th IEEE International Workshop on Robot and Human Interactive Communication. Nashville, TN. Ekman, P. 1993. Facial expression and emotion. American Psychologist, 48(4):384–392. Ekman, P. and Friesen, W.V. 1977. Manual for the Facial Action Coding System (FACS). Consulting Psychologists Press, Palo Alto. The festival speech synthesis system. 2004. http://www.cstr.ed.ac.uk/projects/festival/. Centre for Speech Technology Research. Fillmore, C.J., Baker, C.F., and Sato, H. 2002. The framenet database and software tools. In Proceedings of the Third International Conference on Language Resources and Evaluation (LREC). Las Palmas, Spain, pp. 1157–1160. Grosz, B.L. and Sidner, C.L. 1990. Plans for discourse. In Cohen, P.R., Morgan, J. and Pollack, M.E. (Eds.), Intentions in Communication. MIT Press, MA, pp. 417–444. Hanson, D., Olney, A., Prilliman, S., Mathews, E., Zielke, M., Hammons, D., Fernandez, R., and Stephanou, H. 2005. July. Upending the uncanny valley. In Proceedings of the AAAI-05 Robot Workshop. Pittsburgh, PA. Ichise, R., Shapiro, D., and Langley, P. 2002. Learning hierarchical skills from observation. In Proceedings of the Fifth International Conference on Discovery Science, pp. 247–258. Kanda, T., Iwase, K., Shiomi, M., and Ishiguro, H. 2005. A tension-moderating mechanism for promoting speech-based human-robot interaction. In Iros, pp. 527–532. Kipper, K., Dang, H., and Palmer, M. 2005. Class-based construction of a verb lexicon. In Proceedings of AAAI 2000. Kramer, J. and Scheutz, M. 2006a. Reflection and reasoning for system integrity inan agent architecture infrastructure. (Under review) Kramer, J. and Scheutz, M. 2006b. ADE: A framework for robust complex robotic architectures. (Under review) Lisetti, C.L., Brown, S., Alvarez, K., and Marpaung, A. 2004. A social informatics approach to human-robot interaction with an office service robot. IEEE Transactions on Systems, Man, and Cybernetics–Special Issue on Human Robot Interaction, 34(2):195–209. Lowe, D.G. 2004. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60(2):91–110. Michaud, F., Brosseau, Y., Côté, C., Létourneau, D., Moisan, P., Ponchon, A., Raievsky, C., Valin, J., Beaudry, É., and Kabanza, F. 2005. Modularity and integration in the design of a socially interactive robot. In Proceedings IEEE International Workshop on Robot and Human Interactive Communication, pp. 172–177. Michaud, F., Duquette, A., and Nadeau, I. 2003. Characteristics of mobile robotic toys for children with pervasive developmental disorders. In Proceedings IEEE Conference on Systems, Man, and Cybernetics. Middendorff, C. and Scheutz, M. 2006. Real-time evolving swarms for rapid pattern detection and tracking. In Proceedings of Artificial Life x. Mueller, E.T. 1998. Natural Language Processing with Thoughttreasure. Signiform, New York. Murphy, R.R., Lisetti, C., Tardif, R., Irish, L., and Gage, A. 2002. Emotion-based control of cooperating heterogeneous mobile robots. IEEE Transactions on Robotics and Automation, 18(5):744–757. Ortony, A., Norman, D., and Revelle, W. 2005. Effective functioning: A three level model of affect, motivation, cognition, and behavior. In Fellous, J. and Arbib, M. (Eds.), Who Needs Emotions? The Brain Meets the Machine. Oxford University Press, New York. Pellom, B. and Hacioglu, K. 2003. Recent improvements in the CU SONIC ASR system for noisy speech: The SPINE task. In Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Picard, R. 1997. Affective Computing. MIT Press, Cambridge, Mass, London, England. Rani, P., Sarkar, N., and Smith, C.A. 2003. Affect-sensitive human-robot cooperation-theory and experiments. In Proceedings of IEEE International Conference on Robotics and Automation (ICRA), pp. 2382–2387. Schank, R. and Abelson, R.R. 1977. Scripts, Plans, Goals, and Understanding. Lawrence Erlbaum Associates, Hillsdale, NJ. Scheutz, M. 2000. Surviving in a hostile multiagent environment: How simple affective states can aid in the competition for resources. In Hamilton, H.J. (Ed.), Advances in Artificial Intelligence, 13th Biennial Conference of the Canadian Society for Computational Studies of Intelligence, ai 2000, Montréal, Quebec, Canada, may 14-17, 2000, Proceedings, Springer, vol. 1822, pp. 389–399. Scheutz, M. 2004. Useful roles of emotions in artificial agents: A case study from artificial life. In Proceedings of AAAI 2004. Scheutz, M. 2006. ADE–steps towards a distributed development and runtime environment for complex robotic agent architectures. Applied Artificial Intelligence, 20(4–5). Scheutz, M. and Andronache, V. 2003. APOC—a framework for complex agents. In Proceedings of the AAAI Spring Symposium. AAAI Press. Scheutz, M. and Andronache, V. 2004. Architectural mechanisms for dynamic changes of behavior selection strategies in behavior-based systems. IEEE Transactions of System, Man, and Cybernetics Part B, 34(6):2377–2395. Scheutz, M. and Logan, B. 2001. Affective versus deliberative agent control. In Colton, S. (Ed.), Proceedings of the AISB’01 Symposium on Emotion, Cognition and Affective Computing. York: Society for the Study of Artificial Intelligence and the Simulation of Behaviour, pp. 1–10. Scheutz, M., McRaven, J., and Cserey, G. 2004. Fast, reliable, adaptive, bimodal people tracking for indoor environments. In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). Scheutz, M., Schermerhorn, P., Kramer, J., and Middendorff, C. 2006. The utility of affect expression in natural language interactions in joint human-robot tasks. In Proceedings of the ACM Conference on Human-Robot Interaction (HRI2006). Sleator, D. and Temperley, D. 1993. Parsing english with a link grammar. In Proceedings of the Third International Workshop on Parsing Technologies. Yamamoto, S., Nakadai, K., Valin, J., Rouat, J., Michaud, F., Komatani, K., Ogata, T., and Okuno, H. 2005. Making a robot recognize three simultaneous sentences in real-time. In Proceedings IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 897–902. Yang, M.-H., Kriegman, D.J. and Ahuja, N. 2002. Detecting faces in images: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(1):34–58.