How Rewards Shape Human Behaviour

April 22, 2026 | 5 min read | By admin

Every behaviour you perform — from brushing your teeth to your career choices to the relationships you sustain — has been shaped, in part, by reward. This is not a cynical observation about human nature. It is one of the most fundamental and rigorously established findings in all of psychology.

Understanding how rewards actually work — not in the intuitive way most people assume, but in the precise way science has revealed — is one of the most practically useful things you can learn about your own behaviour and the behaviour of everyone around you.

The Foundation: Operant Conditioning

The scientific study of how rewards shape behaviour began in earnest with B.F. Skinner’s work on operant conditioning in the mid-twentieth century. Skinner demonstrated, through meticulous experimentation, that behaviour is fundamentally governed by its consequences.

Behaviours followed by positive consequences are more likely to be repeated. Behaviours followed by negative consequences are less likely to be repeated. This elegantly simple principle — sometimes called the Law of Effect, first articulated by Edward Thorndike — underlies everything from animal training to classroom management to the design of every app on your phone.

The Four Mechanisms of Reinforcement

Mechanism	What Happens	Effect on Behaviour	Example
Positive Reinforcement	Something desirable is added	Behaviour increases	Praise after good work
Negative Reinforcement	Something unpleasant is removed	Behaviour increases	Taking painkillers removes pain
Positive Punishment	Something unpleasant is added	Behaviour decreases	Fines for speeding
Negative Punishment	Something desirable is removed	Behaviour decreases	Losing screen time privileges

The Most Powerful Schedule: Variable Ratio Reinforcement

Not all rewards are equally effective. Skinner’s research on reinforcement schedules revealed that the timing and predictability of rewards matters as much as the rewards themselves — often more.

The four primary reinforcement schedules produce dramatically different patterns of behaviour:

Fixed ratio: Reward after every X responses. Produces high, consistent behaviour but with pauses after each reward.
Variable ratio: Reward after an unpredictable number of responses. Produces the highest, most persistent behaviour. Most resistant to extinction. This is the schedule used by slot machines, social media likes, and loot boxes.
Fixed interval: Reward after a fixed time period. Produces low behaviour between rewards, with a spike just before the reward is due.
Variable interval: Reward after unpredictable time periods. Produces steady, persistent behaviour. Email checking behaviour follows this schedule.

The slot machine is not designed to make you win. It is designed to make you keep playing. Variable ratio reinforcement is the most behaviour-sustaining schedule ever discovered.

Intrinsic vs. Extrinsic Rewards

One of the most counterintuitive findings in reward psychology is the overjustification effect: adding external rewards to an already intrinsically motivated behaviour can actually reduce motivation over time.

Classic research by Lepper, Greene, and Nisbett (1973) demonstrated that children who enjoyed drawing and were given rewards for it subsequently showed less interest in drawing than children who had never been rewarded. The external reward shifted their perceived reason for drawing from intrinsic enjoyment to extrinsic gain — and when the reward stopped, so did the behaviour.

This has profound implications for education, workplace management, and parenting. Reward systems that override intrinsic motivation are, paradoxically, counterproductive in the long term.

Dopamine: The Reward Prediction System

At the neurological level, rewards work through the brain’s dopaminergic system. But as neuroscientist Wolfram Schultz established, dopamine responds not to rewards themselves but to reward prediction errors — the difference between expected and actual outcomes.

When a reward is better than expected: dopamine spikes. Strong learning signal. Behaviour strongly reinforced.
When a reward is exactly as expected: dopamine holds steady. Mild reinforcement.
When an expected reward does not arrive: dopamine drops below baseline. Behaviour is suppressed. Strong learning signal in the opposite direction.

This means the brain is not simply rewarding behaviour — it is continuously updating predictions about the world, using reward outcomes as the data. The system is optimised for learning, not for pleasure.

How Rewards Shape Behaviour in Everyday Contexts

In Relationships

Attachment patterns, communication styles, and even the people we choose to be with are profoundly shaped by reward histories. Partners who intermittently validate and withdraw become powerfully compelling — for neurologically the same reasons as a variable-ratio slot machine.

In the Workplace

Meaningful recognition, autonomy, and mastery are more potent long-term motivators than financial rewards alone. Salary increases operate on a fixed schedule and quickly become baseline — their motivational power declines rapidly through hedonic adaptation.

In Digital Life

Every major platform deploys sophisticated reward schedules: variable likes, unpredictable content, streaks, badges, and notifications. The behavioural engineering is not accidental — it is the deliberate application of operant conditioning principles at industrial scale.

Using Reward Psychology Deliberately

For Building Habits

Attach an immediate, genuine reward to the behaviour you want to establish — before the long-term benefits accrue. The brain learns from immediate feedback, not deferred consequences.

For Maintaining Motivation

Preserve intrinsic motivation by not over-rewarding behaviours you already find meaningful. Add variety and unpredictability to reward systems to leverage variable-ratio dynamics without exploitation.

For Understanding Others

Every persistent behaviour in others — including ones that frustrate you — is being reinforced somehow. Identifying the reinforcement is the key to understanding (and, where appropriate, changing) the behaviour.

Key Takeaways

All behaviour is shaped by its consequences — this is the Law of Effect
Variable ratio reinforcement is the most powerful and persistent reward schedule
External rewards can undermine intrinsic motivation through the overjustification effect
Dopamine responds to prediction errors, not to rewards themselves
Digital platforms, relationships, and workplaces all operate on reward schedule principles
Understanding reinforcement is one of the most practical tools in behavioural psychology

admin

Psychology researcher and writer at Psychology Lab. Passionate about translating complex science into accessible, practical knowledge for everyday readers.