People Teach With Rewards and Punishments as Communication, Not Reinforcements

doi:10.1037/xge0000569

People Teach With Rewards and Punishments as Communication, Not Reinforcements

Mark K. Ho, F. Cushman, M. Littman

Oct 29, 2018

Citations

Influential Citations

Citations

Quality indicators

Journal

Journal of Experimental Psychology: General

Full text

Semantic Scholar PDF

PDF

Key Takeaway

Key Takeaway: People are strongly biased to use rewards and punishments as communication, rather than reinforcement, in teaching new behaviors to other organisms.

Abstract

Carrots and sticks motivate behavior, and people can teach new behaviors to other organisms, such as children or nonhuman animals, by tapping into their reward learning mechanisms. But how people teach with reward and punishment depends on their expectations about the learner. We examine how people teach using reward and punishment by contrasting two hypotheses. The first is evaluative feedback as reinforcement, where rewards and punishments are used to shape learner behavior through reinforcement learning mechanisms. The second is evaluative feedback as communication, where rewards and punishments are used to signal target behavior to a learning agent reasoning about a teacher’s pedagogical goals. We present formalizations of learning from these 2 teaching strategies based on computational frameworks for reinforcement learning. Our analysis based on these models motivates a simple interactive teaching paradigm that distinguishes between the two teaching hypotheses. Across 3 sets of experiments, we find that people are strongly biased to use evaluative feedback communicatively rather than as reinforcement.

copied to clipboard