How Rewards Shape Human Behaviour
Every behaviour you perform — from brushing your teeth to your career choices to the relationships you sustain — has been shaped, in part, by reward. This is not a cynical observation about human nature. It is one of the most fundamental and rigorously established findings in all of psychology.
Understanding how rewards actually work — not in the intuitive way most people assume, but in the precise way science has revealed — is one of the most practically useful things you can learn about your own behaviour and the behaviour of everyone around you.
The Foundation: Operant Conditioning
The scientific study of how rewards shape behaviour began in earnest with B.F. Skinner’s work on operant conditioning in the mid-twentieth century. Skinner demonstrated, through meticulous experimentation, that behaviour is fundamentally governed by its consequences.
Behaviours followed by positive consequences are more likely to be repeated. Behaviours followed by negative consequences are less likely to be repeated. This elegantly simple principle — sometimes called the Law of Effect, first articulated by Edward Thorndike — underlies everything from animal training to classroom management to the design of every app on your phone.
The Four Mechanisms of Reinforcement
| Mechanism | What Happens | Effect on Behaviour | Example |
|---|---|---|---|
| Positive Reinforcement | Something desirable is added | Behaviour increases | Praise after good work |
| Negative Reinforcement | Something unpleasant is removed | Behaviour increases | Taking painkillers removes pain |
| Positive Punishment | Something unpleasant is added | Behaviour decreases | Fines for speeding |
| Negative Punishment | Something desirable is removed | Behaviour decreases | Losing screen time privileges |
The Most Powerful Schedule: Variable Ratio Reinforcement
Not all rewards are equally effective. Skinner’s research on reinforcement schedules revealed that the timing and predictability of rewards matters as much as the rewards themselves — often more.
The four primary reinforcement schedules produce dramatically different patterns of behaviour:
- Fixed ratio: Reward after every X responses. Produces high, consistent behaviour but with pauses after each reward.
- Variable ratio: Reward after an unpredictable number of responses. Produces the highest, most persistent behaviour. Most resistant to extinction. This is the schedule used by slot machines, social media likes, and loot boxes.
- Fixed interval: Reward after a fixed time period. Produces low behaviour between rewards, with a spike just before the reward is due.
- Variable interval: Reward after unpredictable time periods. Produces steady, persistent behaviour. Email checking behaviour follows this schedule.
The slot machine is not designed to make you win. It is designed to make you keep playing. Variable ratio reinforcement is the most behaviour-sustaining schedule ever discovered.
Intrinsic vs. Extrinsic Rewards
One of the most counterintuitive findings in reward psychology is the overjustification effect: adding external rewards to an already intrinsically motivated behaviour can actually reduce motivation over time.
Classic research by Lepper, Greene, and Nisbett (1973) demonstrated that children who enjoyed drawing and were given rewards for it subsequently showed less interest in drawing than children who had never been rewarded. The external reward shifted their perceived reason for drawing from intrinsic enjoyment to extrinsic gain — and when the reward stopped, so did the behaviour.
This has profound implications for education, workplace management, and parenting. Reward systems that override intrinsic motivation are, paradoxically, counterproductive in the long term.
Dopamine: The Reward Prediction System
At the neurological level, rewards work through the brain’s dopaminergic system. But as neuroscientist Wolfram Schultz established, dopamine responds not to rewards themselves but to reward prediction errors — the difference between expected and actual outcomes.
- When a reward is better than expected: dopamine spikes. Strong learning signal. Behaviour strongly reinforced.
- When a reward is exactly as expected: dopamine holds steady. Mild reinforcement.
- When an expected reward does not arrive: dopamine drops below baseline. Behaviour is suppressed. Strong learning signal in the opposite direction.
This means the brain is not simply rewarding behaviour — it is continuously updating predictions about the world, using reward outcomes as the data. The system is optimised for learning, not for pleasure.
How Rewards Shape Behaviour in Everyday Contexts
In Relationships
Attachment patterns, communication styles, and even the people we choose to be with are profoundly shaped by reward histories. Partners who intermittently validate and withdraw become powerfully compelling — for neurologically the same reasons as a variable-ratio slot machine.
In the Workplace
Meaningful recognition, autonomy, and mastery are more potent long-term motivators than financial rewards alone. Salary increases operate on a fixed schedule and quickly become baseline — their motivational power declines rapidly through hedonic adaptation.
In Digital Life
Every major platform deploys sophisticated reward schedules: variable likes, unpredictable content, streaks, badges, and notifications. The behavioural engineering is not accidental — it is the deliberate application of operant conditioning principles at industrial scale.
Using Reward Psychology Deliberately
For Building Habits
Attach an immediate, genuine reward to the behaviour you want to establish — before the long-term benefits accrue. The brain learns from immediate feedback, not deferred consequences.
For Maintaining Motivation
Preserve intrinsic motivation by not over-rewarding behaviours you already find meaningful. Add variety and unpredictability to reward systems to leverage variable-ratio dynamics without exploitation.
For Understanding Others
Every persistent behaviour in others — including ones that frustrate you — is being reinforced somehow. Identifying the reinforcement is the key to understanding (and, where appropriate, changing) the behaviour.
Key Takeaways
- All behaviour is shaped by its consequences — this is the Law of Effect
- Variable ratio reinforcement is the most powerful and persistent reward schedule
- External rewards can undermine intrinsic motivation through the overjustification effect
- Dopamine responds to prediction errors, not to rewards themselves
- Digital platforms, relationships, and workplaces all operate on reward schedule principles
- Understanding reinforcement is one of the most practical tools in behavioural psychology