Shaping and Reinforcement Schedules

In this lesson, you will discover how marine mammal trainers use shaping to build complex behaviors by rewarding small steps toward the desired action. You will also learn about different schedules of reinforcement and how these schedules influence how quickly a behavior is learned and how well it is maintained.

Learning Objectives

  • Define shaping and explain its role in training new behaviors.
  • Identify the components of a shaping plan (target behavior, approximations, reinforcers).
  • Differentiate between continuous and intermittent reinforcement schedules.
  • Describe the effects of different reinforcement schedules (fixed ratio, variable ratio, fixed interval) on behavior.

Text-to-Speech

Listen to the lesson content

Lesson Content

What is Shaping?

Shaping is the process of teaching a new behavior by rewarding successive approximations of the desired behavior. It's like guiding an animal toward a target, rewarding them for getting closer and closer to that target until they finally perform the complete behavior. Think of teaching a dolphin to jump through a hoop. You wouldn't expect them to do it perfectly on the first try! Instead, you'd start by rewarding them for facing the hoop, then for moving towards it, then for touching it, and finally for jumping through it.

Example: Imagine you want to teach a sea lion to touch a target (a ball on a stick). You might initially reward the sea lion for simply looking at the target. Then, you'd only reward when it moves its nose toward the target. Next, you'd reward only when it touches the target. Finally, you would reward only when it touches the target with its nose.

Components of a Shaping Plan

A successful shaping plan includes three key elements:

  • Target Behavior: The complete, desired behavior you want the animal to perform (e.g., jumping through a hoop).
  • Approximations: The smaller, simpler steps that gradually lead to the target behavior. These are the successive 'baby steps' you'll reward.
  • Reinforcers: These are the things that the animal finds rewarding, such as food (fish), toys, or praise. Reinforcers are used to motivate the animal to perform the behaviors.

Example: Hoop Jump
* Target Behavior: Jump through the hoop.
* Approximations: Facing the hoop, moving towards the hoop, touching the hoop, jumping near the hoop, jumping through the hoop.
* Reinforcers: Fish, praise, or a favorite toy.

Reinforcement Schedules: Timing is Everything!

A reinforcement schedule describes when and how often you provide reinforcement. The timing of reinforcement has a big impact on how quickly an animal learns a behavior and how well it maintains that behavior. There are two main types of reinforcement schedules:

  • Continuous Reinforcement: Reinforcing the behavior every time it occurs. (e.g., giving a fish every time the sea lion touches the target). This is great for teaching a new behavior quickly.
  • Intermittent Reinforcement: Reinforcing the behavior only some of the time. This is used to maintain a behavior that has already been learned. Intermittent reinforcement leads to behaviors that are more resistant to extinction (stopping the behavior). There are several types of intermittent schedules:
    • Fixed Ratio: Reinforcement is given after a fixed number of behaviors (e.g., giving a treat after the sea lion touches the target 3 times - FR3).
    • Variable Ratio: Reinforcement is given after a variable number of behaviors (e.g., giving a treat after the sea lion touches the target sometimes after 2 touches, sometimes after 4 touches, sometimes after 6 touches - VR). This is the most effective schedule for maintaining behavior and making it resistant to extinction.
    • Fixed Interval: Reinforcement is given after a fixed amount of time, provided the behavior has occurred (e.g., giving a treat if the sea lion touches the target after 30 seconds - FI30s). Animals often 'pause' after reinforcement and then perform the behavior more as the time interval nears.
    • Variable Interval: Reinforcement is given after a variable amount of time, provided the behavior has occurred (e.g., giving a treat if the sea lion touches the target after a variable amount of time, like after 15 seconds, then after 45 seconds, then after 20 seconds - VI). This schedule also produces consistent behavior, but the timing is unpredictable.
Progress
0%