Explanation in artificial intelligence: Insights from the social sciences

Artificial Intelligence - Volume 267 - Pages 1-38 - 2019
Tim Miller¹
¹ School of Computing and Information Systems, University of Melbourne, Melbourne, Australia

References

Allemang, 1987, Computational complexity of hypothesis assembly, vol. 87, 1112

Angwin, 2016, Machine bias, ProPublica

Antaki, 1992, Explaining in conversation: towards an argument model, Eur. J. Soc. Psychol., 22, 181, 10.1002/ejsp.2420220206

Arioua, 2015, Formalizing explanatory dialogues, 282

Aronson, 1971, On the grammar of ‘cause’, Synthese, 22, 414, 10.1007/BF00413436

Baehrens, 2010, How to explain individual classification decisions, J. Mach. Learn. Res., 11, 1803

Bekele, 2018, Human-level explanatory biases for person re-identification

Besnard, 2008

Biran, 2017, Explanation and justification in machine learning: a survey, 8

Boonzaier, 2005, Distinguishing the effects of beliefs and preconditions: the folk psychology of goals and actions, Eur. J. Soc. Psychol., 35, 725, 10.1002/ejsp.280

Brafman, 2008, From one to many: planning for loosely coupled multi-agent systems, 28

Broekens, 2010, Do you get it? User-evaluated explainable BDI agents, 28

Bromberger, 1966, Why-questions, 68

Buchanan, 1984

Burguet, 2004, Effets de contexte sur l'explication causale, 219

Byrne, 1991, The construction of explanations, 337

Cawsey, 1991, Generating interactive explanations, 86

Cawsey, 1992

Cawsey, 1993, Planning interactive explanations, Int. J. Man-Mach. Stud., 38, 169, 10.1006/imms.1993.1009

Cawsey, 1993, User modelling in interactive explanations, User Model. User-Adapt. Interact., 3, 221, 10.1007/BF01257890

Chakraborti, 2017, Plan explanations as model reconciliation: moving beyond explanation as soliloquy

Chan, 2002, Comparison of machine learning and traditional classifiers in glaucoma diagnosis, IEEE Trans. Biomed. Eng., 49, 963, 10.1109/TBME.2002.802012

Chandrasekaran, 1989, Explaining control strategies in problem solving, IEEE Expert, 4, 9, 10.1109/64.21896

Charniak, 1991, A probabilistic model of plan recognition, 160

Chen, 2014

Chevaleyre, 2007, A short introduction to computational social choice, 51

Chin-Parker, 2010, Background shifts affect explanatory style: how a pragmatic theory of explanation accounts for background effects in the generation of explanations, Cogn. Process., 11, 227, 10.1007/s10339-009-0341-4

Chin-Parker, 2017, Contrastive constraints guide explanation-based category learning, Cogn. Sci., 41, 1645, 10.1111/cogs.12405

Chockler, 2004, Responsibility and blame: a structural-model approach, J. Artif. Intell. Res., 22, 93, 10.1613/jair.1391

Cimpian, 2014, The inherence heuristic: an intuitive means of making sense of the world, and a potential precursor to psychological essentialism, Behav. Brain Sci., 37, 461, 10.1017/S0140525X13002197

Cooper, 2004, The inmates are running the asylum: why high-tech products drive us crazy and how to restore the sanity, Sams, Indianapolis, IN, USA

DARPA Explainable

Davey, 1991, Characteristics of individuals with fear of spiders, Anxiety Res., 4, 299, 10.1080/08917779208248798

de Graaf, 2017, How people explain action (and autonomous intelligent systems should too)

Dennett, 1989

Dennett, 2017

Dignum, 2014, From autistic to social agents, 1161

Dodd, 1980, Leading questions and memory: pragmatic constraints, J. Mem. Lang., 19, 695

Dowe, 1992, Wesley Salmon's process theory of causality and the conserved quantity theory, Philos. Sci., 59, 195, 10.1086/289662

Eiter, 2002, Complexity results for structure-based causality, Artif. Intell., 142, 53, 10.1016/S0004-3702(02)00271-0

Eiter, 2006, Causes and explanations in the structural-model approach: tractable cases, Artif. Intell., 170, 542, 10.1016/j.artint.2005.12.003

Fagin, 1995

Fair, 1979, Causation and the flow of energy, Erkenntnis, 14, 219, 10.1007/BF00174894

Fischer, 2001, User modeling in human–computer interaction, User Model. User-Adapt. Interact., 11, 65, 10.1023/A:1011145532042

Fox, 2007, Argumentation-based inference and decision making—a medical perspective, IEEE Intell. Syst., 22, 34, 10.1109/MIS.2007.102

Fox, 2017, Explainable planning

Frosst

Gerstenberg, 2010, Spreading the blame: the allocation of responsibility amongst multiple agents, Cognition, 115, 166, 10.1016/j.cognition.2009.12.011

Gerstenberg, 2017, Eye-tracking causality, Psychol. Sci., 28, 1731, 10.1177/0956797617713053

Ghallab, 2004

Gilbert, 1995, The correspondence bias, Psychol. Bull., 117, 21, 10.1037/0033-2909.117.1.21

Ginet, 2008, In defense of a non-causal account of reasons explanations, J. Ethics, 12, 229, 10.1007/s10892-008-9033-z

Giordano, 2004, Conditional logic of actions and causation, Artif. Intell., 157, 239, 10.1016/j.artint.2004.04.009

Girotto, 1991, Event controllability in counterfactual thinking, Acta Psychol., 78, 111, 10.1016/0001-6918(91)90007-M

Greaves, 2000, What is a conversation policy?, 118

Grice, 1975, Logic and conversation, 41

Halpern, 2000, Axiomatizing causal reasoning, J. Artif. Intell. Res., 12, 317, 10.1613/jair.648

Halpern, 2005, Causes and explanations: a structural-model approach. Part I: causes, Br. J. Philos. Sci., 56, 843, 10.1093/bjps/axi147

Halpern, 2005, Causes and explanations: a structural-model approach. Part II: explanations, Br. J. Philos. Sci., 56, 889, 10.1093/bjps/axi148

Hankinson, 2001

Hanson, 1965

Harman, 1965, The inference to the best explanation, Philos. Rev., 74, 88, 10.2307/2183532

Harradon

Hart, 1985

Hayes, 2017, Improving robot controller transparency through autonomous policy explanation, Proceedings of the 12th ACM/IEEE International Conference on Human–Robot Interaction (HRI 2017)

Heider, 1958

Heider, 1944, An experimental study of apparent behavior, Am. J. Psychol., 57, 243, 10.2307/1416950

Hempel, 1948, Studies in the logic of explanation, Philos. Sci., 15, 135, 10.1086/286983

Hesslow, 1988, The problem of causal selection, 11

Hilton, 2017, Social attribution and explanation, 645

Hilton, 1988, Logic and causal attribution, 33

Hilton, 1990, Conversational processes and causal explanation, Psychol. Bull., 107, 65, 10.1037/0033-2909.107.1.65

Hilton, 1996, Mental models and causal explanation: judgements of probable cause and explanatory relevance, Think. Reasoning, 2, 273, 10.1080/135467896394447

Hilton, 2005, Counterfactuals, conditionals and causality: a social psychological perspective, 44

Hilton, 2010, Selecting explanations from causal chains: do statistical principles explain preferences for voluntary causes?, Eur. J. Soc. Psychol., 40, 383, 10.1002/ejsp.623

Hilton, 2005, The course of events: counterfactuals, causal sequences and explanation

Hilton, 1986, Knowledge-based causal attribution: the abnormal conditions focus model, Psychol. Rev., 93, 75, 10.1037/0033-295X.93.1.75

Hoffman, 2017, Explaining explanation, part 1: theoretical foundations, IEEE Intell. Syst., 32, 68, 10.1109/MIS.2017.54

Hume, 2000

Jaspars, 1988, Mental models of causal reasoning, 335

Josephson, 1996

Kahneman, 2011

Kahneman, 1982, The simulation heuristic

Kashima, 1998, The category of the mind: folk psychology of belief, desire, and intention, Asian J. Social Psychol., 1, 289, 10.1111/1467-839X.00019

Kass, 1987

Kelley, 1967, 192

Kelley, 1972

Knobe, 2003, Intentional action and side effects in ordinary language, Analysis, 63, 190, 10.1093/analys/63.3.190

Kulesza, 2015, Principles of explanatory debugging to personalize interactive machine learning, 126

Kulesza, 2013, Too much, too little, or just right? Ways explanations impact end users' mental models, 3

Kulesza, 2011, Why-oriented end-user debugging of naive Bayes text classification, ACM Trans. Interact. Intell. Syst. (TiiS), 1, 2

Lagnado, 2008, Judgments of cause and blame: the effects of intentionality and foreseeability, Cognition, 108, 754, 10.1016/j.cognition.2008.06.009

Langley, 2017, Explainable agency for intelligent autonomous systems

Leake, 1991, Goal-based explanation evaluation, Cogn. Sci., 15, 509

Leake, 1995, Abduction, experience, and goals: a model of everyday abductive explanation, J. Exp. Theor. Artif. Intell., 7, 407, 10.1080/09528139508953820

Leddo, 1984, Conjunctive explanations: when two reasons are better than one, J. Pers. Soc. Psychol., 47, 933, 10.1037/0022-3514.47.5.933

Levesque, 1989, A knowledge-level account of abduction, 1061

Lewis, 1974, Causation, J. Philos., 70, 556

Lewis, 1986, Causal explanation, Philos. Pap., 2, 214

Lim, 2009, Assessing demand for intelligibility in context-aware applications, 195

Linegang, 2006, Human-automation collaboration in dynamic mission planning: a challenge requiring an ecological approach, Proc. Human Factors Ergonom. Soc. Annual Meeting, 50, 2482, 10.1177/154193120605002304

Lipton, 1990, Contrastive explanation, R. Inst. Philos. Suppl., 27, 247, 10.1017/S1358246100005130

Lipton

Lombrozo, 2006, The structure and function of explanations, Trends Cogn. Sci., 10, 464, 10.1016/j.tics.2006.08.004

Lombrozo, 2007, Simplicity and probability in causal explanation, Cogn. Psychol., 55, 232, 10.1016/j.cogpsych.2006.09.006

Lombrozo, 2009, Explanation and categorization: how “why?” informs “what?”, Cognition, 110, 248, 10.1016/j.cognition.2008.10.007

Lombrozo, 2010, Causal-explanatory pluralism: how intentions, functions, and mechanisms influence causal ascriptions, Cogn. Psychol., 61, 303, 10.1016/j.cogpsych.2010.05.002

Lombrozo, 2012, Explanation and abductive inference, 260

Lombrozo, 2014, Explanation and inference: mechanistic and functional explanations guide property generalization, Front. Human Neurosci., 8, 700, 10.3389/fnhum.2014.00700

Mackie, 1980

Malle, 1999, How people explain behavior: a new theoretical framework, Personal. Soc. Psychol. Rev., 3, 23, 10.1207/s15327957pspr0301_2

Malle, 2004

Malle, 2011, Attribution theories: how people make sense of behavior, 72

Malle, 2011, Time to give up the dogmas of attribution: an alternative theory of behavior explanation, Adv. Exp. Soc. Psychol., 44, 297, 10.1016/B978-0-12-385522-0.00006-8

Malle, 1997, The folk concept of intentionality, J. Exp. Soc. Psychol., 33, 101, 10.1006/jesp.1996.1314

Malle, 2000, Conceptual structure and social functions of behavior explanations: beyond person–situation attributions, J. Pers. Soc. Psychol., 79, 309, 10.1037/0022-3514.79.3.309

Malle, 2007, Actor-observer asymmetries in explanations of behavior: new answers to an old question, J. Pers. Soc. Psychol., 93, 491, 10.1037/0022-3514.93.4.491

Malle, 2001, Attention to behavioral events during interaction: two actor-observer gaps and three attempts to close them, J. Pers. Soc. Psychol., 81, 278, 10.1037/0022-3514.81.2.278

Marr, 1982

Marr, 1976

McCloy, 2000, Counterfactual thinking about controllable events, Mem. Cogn., 28, 1071, 10.3758/BF03209355

McClure, 2002, Goal-based explanations of actions and outcomes, Eur. Rev. Soc. Psychol., 12, 201, 10.1080/14792772143000067

McClure, 1997, For you can't always get what you want: when preconditions are better explanations than goals, Br. J. Soc. Psychol., 36, 223, 10.1111/j.2044-8309.1997.tb01129.x

McClure, 2001, When rich or poor people buy expensive objects: is the question how or why?, J. Lang. Soc. Psychol., 20, 229, 10.1177/0261927X01020003004

McClure, 1998, Are goals or preconditions better explanations? It depends on the question, Eur. J. Soc. Psychol., 28, 897, 10.1002/(SICI)1099-0992(1998110)28:6<897::AID-EJSP902>3.0.CO;2-P

McClure, 2003, The role of goal-based explanations, vol. 5, 306

McGill, 1993, Contrastive and counterfactual reasoning in causal judgment, J. Pers. Soc. Psychol., 64, 897, 10.1037/0022-3514.64.6.897

Menzies, 1993, Causation as a secondary quality, Br. J. Philos. Sci., 44, 187, 10.1093/bjps/44.2.187

Mercado, 2016, Intelligent agent transparency in human–agent teaming for multi-UxV management, Hum. Factors, 58, 401, 10.1177/0018720815621206

Mill, 1973, vol. III

Miller, 1990, Temporal order and the perceived mutability of events: implications for blame assignment, J. Pers. Soc. Psychol., 59, 1111, 10.1037/0022-3514.59.6.1111

Miller, 2017, Explainable AI: beware of inmates running the asylum, 36

Mitchell, 1986, Explanation-based generalization: a unifying view, Mach. Learn., 1, 47, 10.1007/BF00116250

Moore, 1993, Planning text for advisory dialogues: capturing intentional and rhetorical information, Comput. Linguist., 19, 651

Muise, 2015, Planning over multi-agent epistemic states: a classical planning approach, 1

Nott

O'Laughlin, 2002, How people explain actions performed by groups and individuals, J. Pers. Soc. Psychol., 82, 33, 10.1037/0022-3514.82.1.33

Overton, 2011, Scientific explanation and computation, 41

Overton, 2012

Overton, 2013, “Explain” in scientific discourse, Synthese, 190, 1383, 10.1007/s11229-012-0109-8

Pearl, 2018

Peirce, 1903, Harvard lectures on pragmatism, vol. 5

Petrick, 2016, Using general-purpose planning for action selection in human–robot interaction

Poole, 1989, Normality and faults in logic-based diagnosis, vol. 89, 1304

Pople, 1973, On the mechanization of abductive logic, vol. 73, 147

Popper, 2005

Prakken, 2006, Formal systems for persuasion dialogue, Knowl. Eng. Rev., 21, 163, 10.1017/S0269888906000865

Prasada, 2017, The scope of formal explanation, Psychon. Bull. Rev., 1

Prasada, 2006, Principled and statistical connections in common sense conception, Cognition, 99, 73, 10.1016/j.cognition.2005.01.003

Preston, 2005, Explanations versus applications: the explanatory power of valuable beliefs, Psychol. Sci., 16, 826, 10.1111/j.1467-9280.2005.01621.x

Ranney, 1988, Explanatory coherence and belief revision in naive physics, 426

Rao, 1995, BDI agents: from theory to practice, vol. 95, 312

Read, 1993, Explanatory coherence in social explanations: a parallel distributed processing account, J. Pers. Soc. Psychol., 65, 429, 10.1037/0022-3514.65.3.429

Rehder, 2003, A causal-model theory of conceptual representation and categorization, J. Exp. Psychol. Learn. Mem. Cogn., 29, 1141, 10.1037/0278-7393.29.6.1141

Rehder, 2006, When similarity and causality compete in category-based property generalization, Mem. Cogn., 34, 3, 10.3758/BF03193382

Reiter, 1987, A theory of diagnosis from first principles, Artif. Intell., 32, 57, 10.1016/0004-3702(87)90062-2

Ribeiro, 2016, Why should I trust you?: explaining the predictions of any classifier, 1135

Robnik-Šikonja, 2008, Explaining classifications for individual instances, IEEE Trans. Knowl. Data Eng., 20, 589, 10.1109/TKDE.2007.190734

Salmon, 2006

Samland, 2016, The role of prescriptive norms and knowledge in children's and adults' causal selection, J. Exp. Psychol. Gen., 145, 125, 10.1037/xge0000138

Samland, 2014, Do social norms influence causal inferences?, 1359

Scriven, 1972, The concept of comprehension: from semantics to software, 31

Shams, 2016, Normative practical reasoning via argumentation and dialogue

Singh, 2018, Combining planning with gaze for online human intention recognition

Slugoski, 1993, Attribution in conversational context: effect of mutual knowledge on explanation-giving, Eur. J. Soc. Psychol., 23, 219, 10.1002/ejsp.2420230302

Stubbs, 2007, Autonomy and common ground in human–robot interaction: a field study, IEEE Intell. Syst., 22, 42, 10.1109/MIS.2007.21

Susskind, 1999, Perceiving individuals and groups: expectancies, dispositional inferences, and causal attributions, J. Pers. Soc. Psychol., 76, 181, 10.1037/0022-3514.76.2.181

Swartout, 1993, Explanation in second generation expert systems, 543

Tetlock, 1989, Accountability: a social magnifier of the dilution effect, J. Pers. Soc. Psychol., 57, 388, 10.1037/0022-3514.57.3.388

Tetlock, 1996, The dilution effect: judgemental bias, conversational convention, or a bit of both?, Eur. J. Soc. Psychol., 26, 915, 10.1002/(SICI)1099-0992(199611)26:6<915::AID-EJSP797>3.0.CO;2-W

Thagard, 1989, Explanatory coherence, Behav. Brain Sci., 12, 435, 10.1017/S0140525X00057046

Trabasso, 2003, Story understanding and counterfactual reasoning, J. Exp. Psychol. Learn. Mem. Cogn., 29, 904, 10.1037/0278-7393.29.5.904

Tversky, 1983, Extensional versus intuitive reasoning: the conjunction fallacy in probability judgment, Psychol. Rev., 90, 293, 10.1037/0033-295X.90.4.293

Uttich, 2010, Norms inform mental state ascriptions: a rational explanation for the side-effect effect, Cognition, 116, 87, 10.1016/j.cognition.2010.04.003

Van Bouwel, 2002, Remote causes, bad explanations?, J. Theory Soc. Behav., 32, 437, 10.1111/1468-5914.00197

Van Fraassen, 1977, The pragmatics of explanation, Am. Philos. Q., 14, 143

Vasilyeva, 2015, Goals affect the perceived quality of explanations, 2469

von der Osten, 2017, The minds of many: opponent modelling in a stochastic game, 3845

Von Wright, 1971

Walton, 2004, A new dialectical theory of explanation, Philos. Explor., 7, 71, 10.1080/1386979032000186863

Walton, 2006, Examination dialogue: an argumentation framework for critically questioning an expert opinion, J. Pragmat., 38, 745, 10.1016/j.pragma.2005.01.016

Walton, 2007, Dialogical models of explanation, 1

Walton, 2011, A dialogue system specification for explanation, Synthese, 182, 349, 10.1007/s11229-010-9745-z

Walton, 1984

Weiner, 1980, BLAH, a system which explains its reasoning, Artif. Intell., 15, 19, 10.1016/0004-3702(80)90021-1

Weld

Wendt, 1998, On constitution and causation in international relations, Rev. Int. Stud., 24, 101, 10.1017/S0260210598001028

Wilkenfeld, 2015, Inference to the best explanation (IBE) versus explaining for the best inference (EBI), Sci. Educ., 24, 1059, 10.1007/s11191-015-9784-4

Williams, 2013, The hazards of explanation: overgeneralization in the face of exceptions, J. Exp. Psychol. Gen., 142, 1006, 10.1037/a0030996

Winikoff, 2017, Debugging agent programs with why?: questions, 251

Woodward, 2005

Woodward, 2006, Sensitive and insensitive causation, Philos. Rev., 115, 1, 10.1215/00318108-2005-001