How dopamine and serotonin drive learning, motivation, and decision-making

Executive overview

Dopamine is not a reward molecule — it is a learning signal that encodes the difference between successive expectations, not just the gap between expectation and final outcome. This temporal-difference model, borrowed from reinforcement learning algorithms now powering AI breakthroughs, is installed in every mobile creature on the planet, from honeybees to humans.

Serotonin runs in opposition to dopamine: it rises when negative events are anticipated and falls when things go well. SSRIs complicate this by pushing serotonin into dopamine terminals, which can blunt the rewarding properties of positive events — a mechanism largely ignored in clinical practice.

Dopamine is a currency: a common value signal that lets the brain compare dissimilar outcomes, set motivation levels, and update behavior step by step through the world.

Dopamine as a learning rule, not just a reward signal

  • Dopamine fluctuations encode temporal-difference reward prediction errors — the difference between your current expectation and your next expectation, not just expectation vs. outcome.
  • This allows the brain to chain events: a sound predicts a light, the light predicts reward; dopamine credits the sound even before reward arrives.
  • The same algorithm (Sutton & Barto, 1988) powers AlphaGo, AlphaGo Zero, and AlphaFold — AI systems that beat world champions and won Nobel Prizes.
  • Dopamine fluctuates constantly as you move through the world, tracking a running estimate of where things are heading.
  • "Dopamine hits" is not wrong, but it is a blunt and incomplete description; the ongoing updating matters as much as the terminal reward.

Tonic vs. phasic dopamine and motivation

  • Tonic dopamine sets a baseline level — a "water level" on which phasic spikes sit.
  • Parkinson's disease illustrates what happens when tonic dopamine collapses: the brain cannot differentiate the value of actions, so movement freezes — an active, rational freezing rather than a passive failure.
  • Dopamine is involved in movement, learning, and motivation for the same reason: all three are about valuing options and committing to action.
  • A sense of urgency — persistent, resilient drive to act — is a better everyday concept than "motivation," because it captures the readiness to move thoughts and body toward a goal.

The explore–exploit spectrum and ADHD

  • Bees exist on a spectrum from highly focused foragers (exploit known nectar sources) to distractible explorers (discover new ones); humans have both modes internally.
  • Dopamine-related chemicals (octopamine/tyramine ratio in bees; dopamine/norepinephrine in humans) set where on that spectrum an individual operates.
  • ADHD drugs raise dopamine and norepinephrine, stabilizing brain-state sequences and narrowing the foraging path — trading exploration for sustained exploitation.
  • Rapid short-form media may strengthen the exploratory, fast-updating mode while weakening the slow, goal-chaining mode — but direct evidence in humans is limited.
  • Effort and slower pacing appear to support deeper learning, though whether effort itself or the slower tempo is the active ingredient is unresolved.

Serotonin as the opponent system

  • Serotonin rises when negative outcomes are anticipated and falls when positive ones are expected; dopamine does the reverse — a clear opponent relationship confirmed in real-time human recordings.
  • This opponency is evident in economic exchange games recorded directly from deep brain structures in epilepsy and Parkinson's patients.
  • Serotonin signals "wait" and encodes aversive prediction — the active inhibition needed to delay action despite temptation.
  • When hunger or extreme stress flips the animal into emergency mode, dopamine shifts to encode aversive prediction errors rather than rewarding ones — prioritizing survival over reward-seeking.

SSRIs and the dopamine terminal problem

  • SSRIs block serotonin reuptake, raising serotonin in the synapse — but that excess serotonin is taken up by dopamine transporters and accumulates in dopamine terminals.
  • A 2005 paper (John Danny, Neuron) showed a 40% shift in serotonin into the dopamine system following SSRI treatment.
  • Result: neurons that normally signal positive outcomes now release the "negative juice," potentially impairing learning about rewarding events and, in some cases, contributing to anhedonia or accelerated suicidality.
  • SSRIs remain life-changing for some patients; heterogeneity of response is high, and placebo effects account for 50–80% of variance in psychotropic drug outcomes.
  • The path forward is direct neurochemical measurement in humans, not blunt receptor-level hypotheses.

Measuring dopamine and serotonin in living humans

  • Real-time sub-second dopamine and serotonin measurement in humans requires piggybacking on deep-brain electrodes implanted for Parkinson's or essential tremor treatment, using electrochemical detection algorithms.
  • A newer, minimally invasive approach (Christina Zalano, Northwestern): thread FDA-approved depth electrodes up the nose to rest against the olfactory epithelium — allows recordings in healthy, consenting participants.
  • Nasal recordings show the same opponent dopamine/serotonin dynamics seen in deep-brain recordings, and neurotransmitter fluctuations track the breathing cycle in synchrony with task demands.
  • During an ultimatum game, peak oxygen consumption (peroxide signal) and dopamine fluctuations align with the moments requiring the most learning updates — breathing, mitochondrial activity, and dopamine appear coupled.
  • A spin-out company (Nebula Neuro) aims to commercialize nasal neurochemical monitoring for consumer use, potentially allowing individuals to observe their own dopamine and serotonin in real time.

Dopamine, time perception, and AI convergence

  • Dopamine is critical for interval timing — anticipating events at specific moments; drugs that alter dopamine (cannabis, methylphenidate) distort subjective time.
  • The reinforcement learning algorithms discovered in biology have now been externalized into AI systems that outperform their biological source — AlphaGo Zero trained from scratch, AlphaFold solved protein structure, both using temporal-difference learning.
  • AI will increasingly help decode which brain states and neurotransmitter patterns correspond to optimal learning, concentration, and mental health — data-driven neuroscience rather than receptor-hypothesis medicine.
  • What was dismissed as backwater 30 years ago (neural networks, brain-machine interfaces) is now the leading edge — a pattern worth noting for any currently unfashionable idea in science or health.

Practical implications for everyday life

  • Resisting low-effort, high-frequency stimulation (short-form video, infinite scroll) may protect the slow goal-chaining circuits; physical separation from devices reduces their cognitive drag even when unused.
  • The dopamine system can learn to reward resistance — athletes who relish early mornings while others sleep, wrestlers who stay calm when air is cut off — suggesting deliberate exposure to discomfort trains the reward system itself.
  • Competitive sport is one of the few modern environments that reliably imposes losing, sustained effort, and recovery — training the expectation–disappointment–motivation loop that transfers broadly.
  • Feeding hungry people before important conversations or decisions is not trivial: hunger can flip dopamine into an aversion-prediction mode, distorting judgment toward negative outcomes.
  • Sleep and meditation allow the computational device to consolidate, erase stale predictions, and replenish transmitter resources; even short sleep can suffice if the individual's baseline need is genuinely lower.

More like this — when you're ready for early access.

Join the waitlist for a personal account and content recommendations based on what you're working on.

No spam. Unsubscribe at any time.

You're on the list. We'll be in touch before launch.

Get early access to the full library.

Join the waitlist for a personal account and content recommendations based on what you're working on.

No spam. Unsubscribe at any time.

You're on the list. We'll be in touch before launch.

Be among the first to get personalised recommendations tailored to your stage in business.

No spam.

You're on the list. We'll be in touch before launch.

Be among the first to get personalised recommendations tailored to your stage in business.

No spam.

You're on the list. We'll be in touch before launch.