AQA Syllabus focus:
'Dealing with offending behaviour, including behaviour modification in custody.'
Behavior modification in custody applies learning principles inside prisons to strengthen prosocial conduct. AQA mainly expects knowledge of token economies, how they operate, and their strengths and limitations.
What behavior modification in custody means
Behavior modification in custody is based on operant conditioning, the idea that behavior followed by rewarding consequences is more likely to be repeated.

Diagram comparing classical and operant conditioning. The operant side visually summarises how consequences (reinforcement vs. punishment) follow behaviour and change its future likelihood. This supports understanding why prison token economies focus on strengthening observable prosocial actions through planned consequences. Source
In prisons, the aim is to increase desirable behaviors such as following rules, completing work, keeping cells clean, cooperating with staff, and controlling aggression. Rather than focusing on deep personal insight, it tries to shape daily conduct through planned reinforcement.
Behavior modification in custody is the use of learning principles in prison to encourage desirable behavior and discourage antisocial or disruptive behavior.
This approach is usually associated with token economy programs. These are common because they provide a structured and visible system of rewards, making it clear to inmates which behaviors will be reinforced.
Token economy programs
A token economy gives inmates tokens immediately after they show an approved behavior. The token itself may be a point, mark, card, or plastic chip, but it has value because it can later be exchanged for rewards.
Token economy is a system in which a person receives tokens for target behaviors, and the tokens are later exchanged for backup reinforcers such as privileges or goods.
The crucial idea is that the token becomes a secondary reinforcer because it is linked to meaningful rewards. In custody, these rewards might include extra phone time, access to preferred activities, extra recreational time, or other privileges allowed by the institution.
How the system is set up
For a token economy to work, staff must first identify target behaviors. These need to be clear, observable, and realistic. Examples include:
getting up on time
attending education or work sessions
showing respectful interactions with staff and other inmates
keeping living areas tidy
completing agreed treatment or training activities
Staff then decide:
how many tokens each behavior earns
when tokens will be awarded
which behaviors will not be rewarded
what rewards can be bought with accumulated tokens
The system works best when reinforcement is immediate and consistent. If tokens are delayed or given unpredictably, inmates may not connect the reward to the behavior. Complex behaviors can also be built gradually through shaping, where small steps toward the desired behavior are rewarded first.
Why token economies are used in prisons
Prisons need order, safety, and cooperation. Token economies are attractive because they give staff a practical method for managing large groups and reducing disruptive behavior. They can:
increase compliance with rules
reduce aggression and conflict
promote self-care and responsibility
make expectations transparent
provide a nonviolent way of controlling conduct
Because rewards are tied to specific actions, inmates receive clear feedback about what the institution values. This can make behavior management more predictable and less arbitrary.
Evidence for effectiveness
Research suggests token economies can improve behavior inside institutions. A commonly cited study is Hobbs and Holt (1976), who introduced a token economy into a young offenders’ institution. Inmates in the token economy condition showed more desirable behavior than those in a control condition, especially those who had been the most disruptive at the start. This supports the idea that reinforcement can modify institutional behavior.
However, evidence of short-term improvement in prison routines does not automatically show a lasting reduction in offending. Much of the success of token economies is measured by better behavior inside custody, such as rule-following or task completion, rather than by lower rates of reoffending after release.
Evaluation
One strength of behavior modification in custody is that it is practical and measurable.
Staff can define target behaviors clearly and record whether they occur. This makes progress easier to monitor than with approaches that depend on private thoughts or feelings. It can also produce relatively quick improvements in day-to-day prison management.
Another strength is that the method is rooted in well-established learning theory. The basic principle of reinforcement has strong psychological support, and the structured nature of token economies means the intervention can be standardized across units or institutions.
A major limitation is that behavior may be situation-specific. Inmates may learn to behave appropriately in a highly controlled environment because rewards are immediately available, but the same behavior may not continue after release, when the tokens and prison routines disappear. This means behavior modification may manage conduct in custody without changing the underlying causes of offending.
A related criticism is that token economies may rely too heavily on extrinsic motivation. If inmates act appropriately only to gain rewards, the behavior may weaken when reinforcement stops. Long-term change is more difficult if offenders do not internalize the values behind the behavior.
There are also ethical concerns. Prisons involve unequal power relationships, so even a reward system can become coercive if basic privileges are controlled too tightly. Critics argue that inmates may be manipulated into compliance rather than genuinely rehabilitated. Fairness is also important: if staff are inconsistent or biased in awarding tokens, the system may appear unjust and lose effectiveness.
Finally, success depends heavily on staff training and commitment. A token economy is not self-operating. If officers and prison workers do not apply the rules consistently, inmates may stop responding to the program. Different inmates may also value different rewards, so backup reinforcers must be meaningful for the system to work well.
Practice Questions
Outline what is meant by a token economy in custody. (2 marks)
1 mark for identifying that inmates receive tokens or points for desirable/target behavior.
1 mark for stating that tokens are later exchanged for rewards, privileges, or backup reinforcers.
No credit for vague answers that mention “rewards” without linking them to tokens and target behavior.
Discuss behavior modification in custody as a way of dealing with offending behavior. (6 marks)
AO1: 3 marks
1 mark for explaining that behavior modification in custody is based on operant conditioning/reinforcement.
1 mark for describing the token economy system, including target behaviors.
1 mark for explaining that tokens act as secondary reinforcers and are exchanged for privileges.
AO3: 3 marks Award 1 mark each for any relevant evaluative point, such as:
research support, for example Hobbs and Holt, showing improved institutional behavior
behavior change may be limited to the prison environment and may not continue after release
the approach may encourage extrinsic rather than internalized motivation
ethical concerns about control, fairness, or coercion
effectiveness depends on consistent staff implementation
FAQ
Rewards are usually chosen by balancing three factors: security, practicality, and motivational value.
Common principles include:
rewards must be allowed by prison rules
they should be available often enough to maintain interest
different inmates may need different reward options
Programs often work better when inmates can choose from a menu of approved rewards, because choice increases the reinforcing value of the tokens.
A program can weaken if tokens stop feeling valuable.
This may happen when:
rewards are too limited or repetitive
inmates need too many tokens before they can buy anything
staff delay giving tokens
inmates think the system is unfair
To keep the system effective, prisons may update reward menus, review token prices, and monitor whether staff are applying the rules consistently.
Yes. The same basic principles can be used, but the design often needs adjustment.
For example:
younger offenders may respond better to more immediate and frequent reinforcement
inmates with low literacy may need simple visual systems
units with high conflict may need fewer, very clear target behaviors at first
Adaptation matters because a one-size-fits-all system is less likely to stay motivating or fair across different prison populations.
Programs usually reduce this risk by tightly controlling how tokens are recorded and exchanged.
Methods include:
staff-only recording systems
named accounts or digital point systems
regular audits
fixed exchange times
rules against trading or stealing tokens
The more transparent the system is, the easier it is to detect disputes and maintain trust in the program.
Resistance is not always about disliking rewards. Sometimes it reflects prison culture.
Possible reasons include:
not wanting to appear compliant in front of peers
distrust of staff motives
believing the rewards are too small to matter
feeling the system is childish or insulting
This means non-participation does not always show that learning principles are wrong; it may show that social identity and group norms affect how the program is received.
