What does shaping mean in operant conditioning?

Shaping is a method for building a complex behavior by reinforcing small steps toward the final desired response. For example: first reinforcing movement toward a lever then reinforcing touching the lever then reinforcing pressing it This is useful because many behaviors do not appear fully formed at the start. Shaping allows them to be developed gradually through reinforcement.

Why did Skinner often use rats and pigeons in his studies?

Rats and pigeons were practical research animals because they: learn simple response patterns quite quickly can be kept in controlled conditions produce clear, countable behaviors such as lever pressing or key pecking Skinner wanted behavior that could be measured accurately. These animals made it easier to study changes in response rate over time.

Why can some behaviors continue even when reinforcement is not given every time?

Behavior can become very persistent when reinforcement is not delivered after every single response. This is because the organism learns that reinforcement is still possible, even if it does not happen right away. As a result: responding may continue for longer the behavior may be harder to eliminate the learner may keep trying because reinforcement is unpredictable This helps explain why some habits are especially resistant to stopping.

Operant conditioning, reinforcement and Skinner (5.1.2) | AQA A-Level Psychology Notes

Q: What is the difference between primary and secondary reinforcers?

A primary reinforcer is naturally rewarding because it satisfies a biological need, such as food or water. A secondary reinforcer becomes rewarding through association with a primary reinforcer or other rewards. Examples include money, grades, or tokens. In Skinner’s work, food was often used because it is a clear primary reinforcer for hungry animals.

Q: What is extinction in operant conditioning?

Extinction happens when a previously reinforced behavior is no longer reinforced, so the behavior gradually decreases. For example, if a lever press stops producing food, the animal will usually press the lever less often over time. Extinction does not always happen immediately. The organism may continue responding for a while before the behavior weakens.

AQA Syllabus focus:

'Operant conditioning, types of reinforcement and Skinner’s research.'

Operant conditioning explains how behavior is learned through consequences. In the behaviorist tradition, Skinner argued that actions followed by reinforcement become more likely, helping psychologists explain how behavior is acquired and maintained.

Operant conditioning

Operant conditioning focuses on behavior that an organism actively produces and that can be shaped by what happens after the response.

Operant conditioning is a type of learning in which behavior is strengthened or weakened by the consequences that follow it.

In this approach, the learner is not a passive receiver of stimulation. Instead, the organism acts on the environment, and the environment then provides a consequence. If that consequence is favorable, the behavior is more likely to happen again. Over time, responses that are followed by reinforcement become more frequent, while responses that are not reinforced become less common. This means learning is based on the relationship between a response and its consequence.

A central idea in operant conditioning is reinforcement, because reinforcement is what makes a response more likely to be repeated.

Reinforcement is any consequence of behavior that increases the likelihood of that behavior occurring again.

Operant conditioning is often described as trial-and-error learning. An organism may produce a range of behaviors, but the behaviors that lead to reinforcement are selected and repeated. This makes operant conditioning a powerful explanation for how everyday habits develop. It also fits the behaviorist emphasis on observable behavior, because psychologists can measure the frequency of a response before and after reinforcement. The main concern is not what a person is thinking, but what they do and how consequences alter the future probability of that behavior.

Types of reinforcement

The AQA specification requires knowledge of the types of reinforcement. Both types increase behavior, but they do so in different ways.

Positive reinforcement

Positive reinforcement occurs when a behavior is followed by the addition of a desirable consequence, making that behavior more likely in the future. The word positive means that something is added, not that the experience is morally good. For example, a student who studies and then receives praise may be more likely to study again. In the same way, a rat pressing a lever and receiving food is more likely to repeat the lever-pressing response. Positive reinforcement therefore strengthens behavior by producing a rewarding outcome after the response.

Positive reinforcement is especially important because it shows clearly how new behavior can be built up. If a response is followed consistently by a pleasant consequence, the organism quickly learns the connection between behavior and reward.

Negative reinforcement

Negative reinforcement occurs when a behavior is followed by the removal of something unpleasant, making that behavior more likely in the future. The word negative means that something is taken away. For example, if fastening a seat belt stops an annoying buzzer, seat-belt use is strengthened because the unpleasant stimulus has been removed.

This is often confused with punishment, but they are not the same. Negative reinforcement still increases behavior, because the organism learns that performing the response helps it escape or avoid an unpleasant situation. In operant conditioning, both positive and negative reinforcement strengthen behavior. The key difference is whether reinforcement works through adding a desirable stimulus or removing an aversive one.

Operant-conditioning decision tree that classifies consequences by two questions: whether the behavior becomes more or less likely, and whether something is added or removed. This makes the logic of “positive vs. negative” (add vs. remove) explicit while keeping the core AQA focus on reinforcement clear. Source

Understanding this distinction is essential. Many students assume that negative means “bad,” but in psychology it refers only to the removal of a stimulus. As long as the behavior becomes more frequent, reinforcement has taken place.

Skinner’s research

The Skinner box

B. F. Skinner investigated operant conditioning using highly controlled laboratory research. His most famous apparatus was the Skinner box, a chamber containing an animal such as a rat or pigeon. Inside the box, the animal could make a simple response, such as pressing a lever or pecking a key. When the correct response was made, a reinforcer was delivered, often food.

Diagram of a rat in a Skinner box (operant conditioning chamber), highlighting the manipulandum (lever) and the reinforcer delivery system (food dispenser). It concretely illustrates how a measurable response can be followed immediately by a consequence, allowing response rates to be studied under tightly controlled conditions. Source

In some studies, the response could also remove an unpleasant condition, demonstrating negative reinforcement.

The box allowed Skinner to record behavior very precisely.

Photograph of a cumulative recorder used with operant-conditioning apparatus to produce a continuous record of responding over time. It reinforces the behaviorist emphasis on objective measurement by showing how response rates could be captured mechanically rather than inferred subjectively. Source

Because the environment was controlled, he could observe how reinforcement changed the rate of responding. This made it possible to study learning scientifically, using objective and measurable data.

What Skinner showed

Skinner’s research showed that behavior can be conditioned by reinforcement. If a lever press was followed by food, the rat gradually pressed the lever more often. If a pigeon’s pecking response was reinforced, pecking increased. The response did not have to be present at a high level at the start. Through repeated reinforcement, the desired behavior became established.

His findings supported the behaviorist view that behavior is shaped mainly by environmental consequences. According to Skinner, much of what humans and animals do can be explained by reinforcement histories. A behavior that has been rewarded in the past is likely to be repeated in similar situations. This makes learning predictable and open to scientific investigation.

Skinner’s studies also emphasized that reinforcement must follow behavior in a way that the organism can detect. When the consequence clearly follows the response, the connection between action and outcome is learned more effectively.

Why Skinner’s research matters

Skinner’s work is important because it provided strong experimental support for the principles of operant conditioning. It showed that behavior can be studied in a systematic, controlled way and that reinforcement can reliably alter responding. For AQA Psychology, Skinner’s research is central because it demonstrates how the behaviorist approach explains learning through observable relationships between behavior and consequences, especially through positive reinforcement and negative reinforcement.

Practice Questions

Outline what is meant by negative reinforcement. (2 marks)

1 mark for stating that an unpleasant or aversive stimulus is removed after a behavior.
1 mark for stating that this removal increases the likelihood of the behavior being repeated.

Explain Skinner’s research into operant conditioning. (6 marks)

Award 1 mark for each relevant point, up to 6 marks. Credit any of the following:

Skinner used controlled laboratory experiments.
He used apparatus known as a Skinner box.
The box usually contained a rat or pigeon.
The animal could make a response such as pressing a lever or pecking a key.
A reinforcer such as food was delivered after the response.
This increased the frequency of the behavior.
Skinner also demonstrated negative reinforcement through removal of an unpleasant stimulus.
The research showed that consequences shape future behavior.
Learning was measured using observable, objective behavior.

Try All Topic Practice Questions

FAQ

A primary reinforcer is naturally rewarding because it satisfies a biological need, such as food or water.

A secondary reinforcer becomes rewarding through association with a primary reinforcer or other rewards. Examples include money, grades, or tokens.

In Skinner’s work, food was often used because it is a clear primary reinforcer for hungry animals.

Shaping is a method for building a complex behavior by reinforcing small steps toward the final desired response.

For example:

first reinforcing movement toward a lever
then reinforcing touching the lever
then reinforcing pressing it

This is useful because many behaviors do not appear fully formed at the start. Shaping allows them to be developed gradually through reinforcement.

Rats and pigeons were practical research animals because they:

learn simple response patterns quite quickly
can be kept in controlled conditions
produce clear, countable behaviors such as lever pressing or key pecking

Skinner wanted behavior that could be measured accurately. These animals made it easier to study changes in response rate over time.

Extinction happens when a previously reinforced behavior is no longer reinforced, so the behavior gradually decreases.

For example, if a lever press stops producing food, the animal will usually press the lever less often over time.

Extinction does not always happen immediately. The organism may continue responding for a while before the behavior weakens.

Behavior can become very persistent when reinforcement is not delivered after every single response.

This is because the organism learns that reinforcement is still possible, even if it does not happen right away.

As a result:

responding may continue for longer
the behavior may be harder to eliminate
the learner may keep trying because reinforcement is unpredictable

This helps explain why some habits are especially resistant to stopping.

Written by:

Valentina

Profile

Oxford University - Experimental Psychology

Valentina is an Oxford-educated psychologist. Experienced in creating educational resources, she has dedicated the past 5 years to nurturing future minds as an A-Level and IB Psychology tutor.