How do we gauge understanding? Tests of understanding, such as Turing's imitation game, are numerous; yet, attempts to achieve a state of understanding are not satisfactory assessments. Intelligent agents designed to pass one test of understanding often fall short of others. Rather than approaching understanding as a system state, in this paper, we argue that understanding is a process that changes over time and experience. The only window into the process is through the lens of natural language. Usefully, failures of understanding reveal breakdowns in the process. We propose a set of natural language-based probes that can be used to map the degree of understanding a human or intelligent system has achieved through combinations of successes and failures.
Keywords: behavioral measurement; common ground; explainable AI; human-machine teaming; human-robot interaction; mental models; mutual understanding; natural language processing.
Copyright © 2022 Blaha, Abrams, Bibyk, Bonial, Hartzler, Hsu, Khemlani, King, St. Amant, Trafton and Wong.