Lesson 3: Shaping and Reinforcement Schedules

Lesson Content

What is Shaping?

Shaping is the process of teaching a new behavior by rewarding successive approximations of the desired behavior. It's like guiding an animal toward a target, rewarding them for getting closer and closer to that target until they finally perform the complete behavior. Think of teaching a dolphin to jump through a hoop. You wouldn't expect them to do it perfectly on the first try! Instead, you'd start by rewarding them for facing the hoop, then for moving towards it, then for touching it, and finally for jumping through it.

Example: Imagine you want to teach a sea lion to touch a target (a ball on a stick). You might initially reward the sea lion for simply looking at the target. Then, you'd only reward when it moves its nose toward the target. Next, you'd reward only when it touches the target. Finally, you would reward only when it touches the target with its nose.

Components of a Shaping Plan

A successful shaping plan includes three key elements:

Target Behavior: The complete, desired behavior you want the animal to perform (e.g., jumping through a hoop).
Approximations: The smaller, simpler steps that gradually lead to the target behavior. These are the successive 'baby steps' you'll reward.
Reinforcers: These are the things that the animal finds rewarding, such as food (fish), toys, or praise. Reinforcers are used to motivate the animal to perform the behaviors.

Example: Hoop Jump
* Target Behavior: Jump through the hoop.
* Approximations: Facing the hoop, moving towards the hoop, touching the hoop, jumping near the hoop, jumping through the hoop.
* Reinforcers: Fish, praise, or a favorite toy.

Reinforcement Schedules: Timing is Everything!

A reinforcement schedule describes when and how often you provide reinforcement. The timing of reinforcement has a big impact on how quickly an animal learns a behavior and how well it maintains that behavior. There are two main types of reinforcement schedules:

Continuous Reinforcement: Reinforcing the behavior every time it occurs. (e.g., giving a fish every time the sea lion touches the target). This is great for teaching a new behavior quickly.
Intermittent Reinforcement: Reinforcing the behavior only some of the time. This is used to maintain a behavior that has already been learned. Intermittent reinforcement leads to behaviors that are more resistant to extinction (stopping the behavior). There are several types of intermittent schedules:
- Fixed Ratio: Reinforcement is given after a fixed number of behaviors (e.g., giving a treat after the sea lion touches the target 3 times - FR3).
- Variable Ratio: Reinforcement is given after a variable number of behaviors (e.g., giving a treat after the sea lion touches the target sometimes after 2 touches, sometimes after 4 touches, sometimes after 6 touches - VR). This is the most effective schedule for maintaining behavior and making it resistant to extinction.
- Fixed Interval: Reinforcement is given after a fixed amount of time, provided the behavior has occurred (e.g., giving a treat if the sea lion touches the target after 30 seconds - FI30s). Animals often 'pause' after reinforcement and then perform the behavior more as the time interval nears.
- Variable Interval: Reinforcement is given after a variable amount of time, provided the behavior has occurred (e.g., giving a treat if the sea lion touches the target after a variable amount of time, like after 15 seconds, then after 45 seconds, then after 20 seconds - VI). This schedule also produces consistent behavior, but the timing is unpredictable.

Deep Dive

Explore advanced insights, examples, and bonus exercises to deepen understanding.

Day 3: Marine Mammal Trainer - Beyond the Basics of Shaping & Reinforcement

Welcome back! Today, we're going beyond the fundamentals of shaping and reinforcement schedules. We'll explore the nuances of these techniques and how they're applied in the exciting world of marine mammal training. Get ready to dive deeper!

Deep Dive Section: Shaping and the Power of Context

We know that shaping relies on rewarding successive approximations of a desired behavior. But what happens when the context of the training changes? Consider a dolphin that has flawlessly learned to jump through a hoop. If you then introduce a new context, such as a different location in the pool or the presence of novel objects, the dolphin's performance may temporarily suffer. This is because the context itself can influence behavior.

Experienced trainers proactively manage context by:

Generalization: Gradually introducing variations in the environment during training (e.g., training in different parts of the pool, with different trainers, under varying weather conditions). This helps the animal learn that the desired behavior is the same regardless of context.
Stimulus Control: Utilizing specific cues (visual, auditory, or tactile) to signal the desired behavior. By consistently pairing a cue with the behavior, the trainer establishes a strong link between the cue and the action, reducing the impact of contextual changes.

Understanding the impact of context is crucial for maintaining well-trained behaviors across various situations, such as shows, medical procedures, or husbandry practices.

Bonus Exercises

Let's put your knowledge to the test!

Exercise 1: Shaping a Seal's "Wave"

Imagine you're training a seal to "wave." Create a shaping plan, outlining at least three approximations and the reinforcers you might use. Consider the seal's natural behaviors and how you'd build towards the final target.

Exercise 2: Identifying Reinforcement Schedules

You're working with a beluga whale on a new behavior.

For the first few days, you reward every successful attempt (continuous reinforcement).
Then, you switch to rewarding after every 5 successful attempts.
After that, you give a reward on a seemingly random schedule, averaging about every 10 attempts (but sometimes more or less).
Finally, you reward the whale on average every 2 minutes for performing the behavior.

Identify the reinforcement schedule in each of these scenarios. Explain how the whale's behavior might change as a result of switching from one schedule to another.

Real-World Connections

The principles of shaping and reinforcement extend far beyond marine mammal training. Consider these examples:

Dog Training: Shaping is used to teach complex tricks, from "shake" to agility courses. Reinforcement schedules are key for maintaining those skills.
Child Development: Parents use shaping to teach children new skills, like potty training or learning to ride a bike. Reinforcement (praise, treats) is used to encourage desired behaviors.
Business & Workplace: Managers can use reinforcement schedules to improve employee performance and morale. Recognizing and rewarding employees for achieving milestones is a form of positive reinforcement.

Challenge Yourself

Think about a complex behavior you'd like to teach your pet, a child, or even yourself (e.g., learning a new skill). Create a detailed shaping plan, including specific approximations, the types of reinforcement you'll use, and how you'll manage the context. Then, try it out and track your progress!

Further Learning

Ready to keep learning? Explore these topics:

Punishment vs. Reinforcement: Delve into the ethical considerations and effectiveness of using punishment in animal training.
Classical Conditioning (Pavlovian Conditioning): Learn how animals learn through association, often used in conjunction with operant conditioning (shaping and reinforcement).
Applied Behavior Analysis (ABA): Explore how behavioral principles are applied in various settings, from autism therapy to organizational management.

Continue your research through documentaries, online courses, and books about animal training and behavioral psychology!

Interactive Exercises

Shaping a New Trick

Imagine you want to teach a dolphin to wave (raising a flipper). Outline a shaping plan, including: 1) Target Behavior, 2) Three Approximations, and 3) Example Reinforcers. (Write your answer down or think about it before moving on.)

Reinforcement Schedule Scenarios

Match each of the following scenarios to the reinforcement schedule it represents. (e.g. A sea lion gets a fish after every time it barks - Continuous Reinforcement). 1) A sea lion gets a fish after performing a trick an average of 5 times. 2) A sea lion gets a fish after completing the trick exactly 10 times. 3) A sea lion gets a fish if they complete the trick after waiting 1 minute. 4) The sea lion gets a fish if they complete the trick an average of every 2 minutes.

Real-Life Examples of Shaping

Think about how you've used shaping, or seen it used, in everyday life. For example, how did you learn to ride a bike? How is a young child taught to read? Share your findings in the online classroom or with a classmate. (e.g. When learning to ride a bike I started by getting use to the bike - holding it and walking around. Next I would try to go a short distance and once I mastered that, I went further and further)

Cookie Preferences

Regenerating Content

Shaping and Reinforcement Schedules

Learning Objectives

Text-to-Speech

Lesson Content

What is Shaping?

Components of a Shaping Plan

Reinforcement Schedules: Timing is Everything!

Deep Dive

Day 3: Marine Mammal Trainer - Beyond the Basics of Shaping & Reinforcement

Deep Dive Section: Shaping and the Power of Context

Bonus Exercises

Exercise 1: Shaping a Seal's "Wave"

Exercise 2: Identifying Reinforcement Schedules

Real-World Connections

Challenge Yourself

Further Learning

Interactive Exercises

Shaping a New Trick

Reinforcement Schedule Scenarios

Real-Life Examples of Shaping

Practical Application

Key Takeaways

Next Steps

Your Progress is Being Saved!

Extended Learning Content

Extended Resources

Extended Resources

Question 2: Which of the following is NOT a key component of a shaping plan?

Question 3: A trainer gives a dolphin a fish every time it clicks its beak (a behavior they want to establish). What type of reinforcement schedule is this?

Question 4: A trainer gives a sea lion a treat after it completes a trick, on average, every 4 times (sometimes after 3, sometimes after 5). What reinforcement schedule is this?

Question 5: Which reinforcement schedule is most appropriate for maintaining a behavior that has already been learned?

Congratulations!

Cookie Preferences

Upgrade to Premium

Premium Benefits: