acquisition and responding for conditioned …...operant responses which result in its presentation...

59
Acquisition and Responding for Conditioned Reinforcement in the Mouse: Effects of Methylphenidate, and the Role of the Dopamine Transporter by James Donald Caleb Browne A thesis submitted in conformity with the requirements for the degree of Masters of Arts Graduate Department of Psychology, Program in Neuroscience University of Toronto © Copyright by J. D. Caleb Browne 2012

Upload: others

Post on 11-Aug-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

Acquisition and Responding for Conditioned Reinforcement in the Mouse: Effects of Methylphenidate,

and the Role of the Dopamine Transporter

by

James Donald Caleb Browne A thesis submitted in conformity with the requirements

for the degree of Masters of Arts Graduate Department of Psychology, Program in Neuroscience

University of Toronto

© Copyright by J. D. Caleb Browne 2012

Page 2: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

ii

Acquisition and Responding for Conditioned Reinforcement in the

Mouse: Effects of Methylphenidate, and the Role of the Dopamine

Transporter

J. D. Caleb Browne

Masters of Arts

Department of Psychology, Program in Neuroscience

University of Toronto

2012

Abstract

This work characterized the ability of mice to respond for conditioned reinforcement, a

phenomenon that can be used to investigate neural substrates of incentive learning. In both

C57Bl/6 and CD1 mice, a reward-associated stimulus acted as a conditioned reinforcer (CR).

Responding was stable over multiple test days, enhanced in CD1 mice by the dopamine

transporter (DAT) blocker methylphenidate, and was extinguished when responding no longer

produced the CR. However, transgenic C57Bl/6 mice overexpressing DAT, which decreased

striatal dopamine by 40% responded normally for CR. Therefore, these results suggest that mice

can be used to study brain mechanisms of incentive motivation. However, the choice of mouse

strain in this paradigm is important as outbred CD1 mice appeared more susceptible to a DAT

blocker compared to the inbred C57Bl/6 strain. These results also suggest that selective

responding for a CR remains intact in a chronically hypodopaminergic state.

Page 3: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

iii

Acknowledgements

I would like to extend my sincerest thanks to Paul Fletcher for his invaluable assistance and

expertise in carrying out the present experiments and immense help in writing this thesis. Also, I

would like to thank Suzanne Erb for her helpful comments in the preparation of this thesis and

Junchul Kim for his time in reading and defending this thesis. I would also like to thank my lab

mates Ashlie Soko, Elizabeth Guy, and Christie Burton for their technical assistance and

discussions about the topics covered in this research. Further, I would like to thank my family for

their continued support and Raena Dempsey for her patience and putting up with my incessant

“working on my thesis” statements.

Page 4: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

iv

Table of Contents

Chapter 1: Introduction 1

1 Incentive motivation and salience attribution 1

2 Role of the dopamine neurons in system in incentive motivation 1

3 Dopamine and conditioned reinforcement 2

4 Drugs of abuse and conditioned reinforcers 3

5 The nucleus accumbens and conditioned reinforcement 4

6 The basolateral amygdala and conditioned reinforcement 4

7 The dorsal striatum and conditioned reinforcement 5

8 The role of dopamine receptors in conditioned reinforcement 6

9 The dopamine transporter and its role in incentive motivation 6

10 Experimental framework and hypotheses 7

10.1 Major goals of these experiments 8

10.2 Hypotheses 9

Chapter 2: Methods 11

1 Subjects 11

2 Apparatus 11

2.1 Conditioned reinforcement 11

2.2 Locomotor activity 12

3 General behavioural procedures 12

3.1 Pre-training 12

3.2 Phase 1: Pavlovian conditioning (training) phase 12

3.3 Phase 2: Operant conditioning (testing) phase 13

4 Experiment 1a: Acquisition, expression, and amplification of responding for a

conditioned reinforcer 14

5 Experiment 1b: Temporal stability and subsequent amplification of responding for a

conditioned reinforcer 14

6 Experiment 1c: Extinction of responding for a conditioned reinforcer 15

6.1 Removal of water restriction as a motivating factor 15

6.2 Extinction training - CS not presented upon CR responding 15

Page 5: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

v

7 Experiment 2: Locomotor activation by 3.5 and 5 mg/kg doses of MPH 15

8 Experiment 3: Assessment of explicitly unpaired CS and US presentations and

responding for a non-conditioned stimulus in mice 15

9 Experiment 4a: Effect of whole-brain DA transporter overexpression on

responding for a conditioned reinforcer 16

10 Experiment 4b: Effect of MPH and AMPH on responding for a CR in DAT

overexpressing mice 16

11 Statistical Analyses 16

Chapter 3: Results 17

1 Experiment 1a: Acquisition, expression, and amplification of responding for a

conditioned reinforcer 17

1.1 Training 17

1.2 Test of responding for a CR 17

2 Experiment 1b: Temporal stability and subsequent amplification of responding

for a conditioned reinforcer 18

2.1 Extended CR responding 18

2.2 MPH test 2 18

3 Experiment 1c: Extinction of responding for a conditioned reinforcer 19

3.1 Removal from water restriction 19

3.2 Operant responding during extinction conditions 20

4 Experiment 2: Locomotor activation by 3.5 and 5 mg/kg doses of MPH 20

5 Experiment 3: Assessment of explicitly unpaired CS and US presentations and

responding for a non-conditioned stimulus in mice 20

5.1 Training 20

5.2 Test of responding for a CR 21

6 Experiment 4a: Effect of whole-brain DA transporter overexpression on responding

for a conditioned reinforcer 21

6.1 Training 21

6.2 Test of responding for a CR 21

7 Experiment 4b: Effect of MPH and AMPH on responding for a CR in DAT

overexpressing mice 21

Page 6: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

vi

7.1 Effect of MPH 21

7.2 Effect of AMPH 22

Chapter 4: Discussion 23

1 Characterization of the acquisition and expression of conditioned reinforcement

in mice 23

2 Strain differences in conditioned reinforcement 24

3 Operant response type in responding for a CR 24

4 Modulation of responding for a CR by MPH administration in normal mice 25

5 Incentive salience attribution and motivation in DAT upregulated mice 26

6 Effect of MPH and AMPH on responding for a CR in DAT upregulated mice 27

Chapter 5: Conclusions 29

Chapter 6: References 30

Chapter 7: Figure Captions 37

Chapter 8: Figures 40

Page 7: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

vii

List of Figures

Figure 1: Pavlovian Conditioning in C57Bl/6 and CD1 Mice

Figure 2: Effect of MPH on Responding for a CR

Figure 3: Stability of Selective CR Responding Over Time

Figure 4: Second Test of MPH on Responding for a CR

Figure 5: Responding for a CR with Free Water Access

Figure 6: Extinction of CR Responding

Figure 7: Locomotor Activation by MPH

Figure 8: Explicitly Unpaired Pavlovian Conditioning

Figure 9: Test of Responding for an Unpaired CS

Figure 10: Pavlovian Conditioning in DAT-Tg and WT Mice

Figure 11: CR Responding in DAT-Tg and WT Mice

Figure 12: Effect of MPH on Responding in DAT-Tg and WT Mice

Figure 13: Effect of AMPH on Responding in DAT-Tg and WT Mice

Page 8: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

1

Chapter 1 Introduction

1 Incentive motivation and salience attribution

Incentive motivation promotes goal directed behaviour towards stimuli such as food, sex, and

water, which have intrinsic value in promoting an individual’s fitness. Rewarding stimuli act as

incentives which are viewed as beneficial and elicit or enhance the motivated behaviour required

in attaining them (Bindra, 1968). Evolution has favoured the development of neural circuitry

which amplifies motivation by making stimuli associated with incentives sought after (Robinson

and Berridge, 1993; Schultz, 1998). Through repeated association with a rewarding

unconditioned stimulus (US), Pavlovian learning mechanisms transform benign stimuli into

conditioned stimuli (CS). Congruency in the presence of the CS and US gives an increasingly

accurate prediction of US availability based on the presence of the CS.

The availability of a reward may be signalled by cues associated with it, and without these cues

incentive motivation would be hit or miss; to obtain a valuable reward an animal would simply

have to run into it. Such reward-associated cues require a relevant, up to date account of their

predictive value which is performed by Pavlovian associative memory mechanisms (Robinson

and Berridge, 1993; Berridge and Robinson, 1998). These stimuli do not, however, remain

passively associated with a reward; to guide goal directed behaviour, stimuli must have the

ability to activate motivational neurocircuitry which then initiates behaviour directed at obtaining

the reward. These cues are therefore imbued with incentive salience making them demand

attention and ‘wanted’ in their own right (Robinson and Berridge, 1993; Berridge, 2007). In this

way, the presence of a reward-predictive cue will always be noticed and shift goal directed

behaviour towards it.

2 Role of the dopamine neurons in system in incentive motivation

The mesocorticolimbic dopamine (DA) system constitutes a major component of the circuitry

involved in reward information processing and appears to modulate the motivational significance

of both primary rewards and associated environmental stimuli (Beninger, 1983; Robinson and

Page 9: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

2

Berridge, 1993; Berridge and Robinson, 1998; Schultz, 1998). Specifically, this DA system

appears to be responsible for the attribution of incentive salience to stimuli as opposed to either

processing their hedonic impact or mediating their association with primary rewards (Berridge et

al., 1989; Berridge and Robinson, 1998; Wyvell and Berridge, 2000). This means that the

involvement of DA in incentive motivation is to mediate the drive to obtain a reward, not

necessarily the perception of ‘liking’ a reward.

The mesocorticolimbic DA system originates in the ventral tegmental area (VTA) of the

midbrain where dopaminergic neurons project to the nucleus accumbens (NAc), amygdala,

hippocampus and medial prefrontal cortex (Ungerstedt, 1971; Kalivas and Volkow, 2005). These

regions receive a pattern of phasic DA output from the VTA as a result of experiencing

motivationally relevant stimuli (White, 1989; Schultz, 1998; McClure et al., 2003; Berridge,

2007; Tsai et al., 2009). The mesocorticolimbic DA system greatly differs from the nigrostriatal

DA system in its projections, originating locus and function. The nigrostriatal system connects

the substantia nigra with the striatum and appears to be largely involved in motor planning and

execution as part of the basal ganglia motor loop (Graybiel et al., 1994). The major source of DA

within the nigrostriatal system is the pars compacta subregion of the substantia nigra, and this

region has particularly dense projections to the dorsal striatum (Haber et al., 2000; Björklund and

Dunnett, 2007).

3 Dopamine and conditioned reinforcement

The attribution of incentive salience to a previously neutral, reward associated stimulus is

thought to be mediated by a DA-dependent mechanism (Robinson and Berridge, 1993). VTA

neurons release phasic bursts of DA to forebrain regions, particularly the NAc, in response to a

stimulus associated with a reward or event that denotes motivational significance (White, 1989;

Schultz, 1998; McClure et al., 2003; Berridge, 2007; Tsai et al., 2009). Although at first robust,

the DA output a reward elicits seems to progressively decrease as it is repeatedly experienced

(Schultz, 1998). However, DA output as a result of experiencing reward-predictive stimuli

remains potentiated, maintaining the salience of those stimuli (Schultz et al., 1993; Phillips et al.,

2003). When these stimuli are presented alone, their incentive salience drives the behavioural

output of an animal to obtain them as if they were the reward itself (Robinson and Berridge,

Page 10: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

3

1993). Such high motivational value attributed to a reward-predictive CS gives it incentive

properties, so that it may become a reward in its own right. In this way the initially-neutral CS

becomes a conditioned reinforcer (CR) capable of reinforcing new learning and strengthening

operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984).

Considering this CR was previously a neutral stimulus which acquired motivational significance

based on its association with a primary reward, responding for this CR is an example of pure

incentive motivation.

Evidence suggests that responding for a CR is mediated predominantly by a DA dependent

mechanism. Administration of substances which amplify DA transmission such as

psychostimulants (e.g. amphetamine, cocaine, methylphenidate and pipradrol) can increase

responding for a CR, enhancing the control a CR exerts over behaviour (Hill, 1970; Robbins,

1978; Beninger and Phillips, 1980). Conversely, the DA receptor antagonist α-flupenthixol

attenuates responding for a CR and prevents the enhancement of responding as a result of

psychostimulants (Robbins et al., 1983; Fletcher and Higgins, 1997). Therefore, the DA system

appears to play a large role in the regulation of responding for a salient, reward associated CR.

4 Drugs of abuse and conditioned reinforcers Rewarding stimuli are highly variable in the environment. The mesocorticolimbic DA system

deals with this in its dynamic nature, allowing fluctuations in reward processing and valuation

(Schultz, 1998). This renders the mesocorticolimbic DA system susceptible to changes in its

physiology which in turn can lead to pathological incentive motivation. Many drugs of abuse act

directly or indirectly on the DA system resulting in elevated DA release (Di Chiara, 1995;

Berridge and Robinson, 1998; Koob and Le Moal, 2001). With repeated drug administration, the

mesocorticolimbic DA system can become sensitized to the behavioural activating and

neurochemical effects of the drug (Koob and Le Moal, 1997; Pierce and Kalivas, 1997). This

sensitization of the DA system can result in permanent neurochemical and neuroanatomical

alterations. Behaviourally, this may manifest in perseveration of drug seeking and drug taking

behaviours due to the amplification of salience attributed to drugs and drug-associated stimuli

(Robinson and Berridge, 1993). As the DA system is progressively sensitized to repeated drug

administration, so is the salience of the drug and drug-associated stimuli; the perceived

Page 11: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

4

importance of the drug and its associated stimuli is progressively increased and “stamped in” to

the mesocorticolimbic DA system (Robinson and Berridge, 1993).

5 The nucleus accumbens and conditioned reinforcement

The NAc appears to act as a limbic-motor interface where goal-directed behaviour is processed

based on learned motivational significance of stimuli (Mogenson et al., 1980). In terms of

conditioned reinforcement, the NAc has been shown to be a candidate region in mediating not

only the establishment of responding for a CR but also the response-enhancing effect of

stimulants (Parkinson et al., 1999). Intra-NAc infusion of amphetamine (AMPH) potentiates CR

responding (Taylor and Robbins, 1984), while NAc DA depletion attenuates AMPH-induced

elevated CR responding (Robbins and Everitt, 1982). However, the NAc has two functionally

and neuroanatomically distinct subregions which may differentially contribute to incentive

motivation (Voorn et al., 1989; Pontieri et al., 1995): the NAc shell and NAc core. The NAc

shell contains projections to the ventral pallidum and the VTA, while the NAc core projects to

basal ganglia regions such as the globus pallidus and substantia nigra (Meredith et al., 1992). The

NAc shell appears to be involved in modulating the amplified behavioural control of CR

responding observed upon administration of psychostimulant drugs (Parkinson et al., 1999; Ito et

al., 2004). On the other hand, the NAc core appears to play a large role in controlling the basal,

Pavlovian attributed incentive salience value of CRs (and therefore reward-related stimuli in

general) which directs instrumental behaviour (Cardinal et al., 2002). This conclusion is

supported by evidence that NAc core lesions impair the incentive motivation towards CR

responding for both drug- and food-associated CRs (Parkinson et al., 1999; Ito et al., 2004).

6 The basolateral amygdala and conditioned reinforcement

Control over CR responding exerted by the NAc is tightly connected with the basolateral

amygdala (BLA). The BLA is innervated by the VTA and sends glutamatergic projections to the

entire ventral striatum including both the NAc core and shell (Kelley et al., 1982). The BLA is

critical for the formation and retrieval of associations of discrete cues with rewards based on

their incentive properties (Gaffan and Harrison, 1987; Everitt et al., 2003; Ito et al., 2006).

Page 12: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

5

Lesions of the BLA result in the inability of a CR to facilitate the acquisition of a new response,

a stringent criterion of a CR’s incentive salience (Mackintosh, 1975; Cador et al., 1989). In other

words, animals with damage to the BLA fail to respond for a CR. Reduced CR responding as a

result of BLA lesions has been shown for CRs associated with natural rewards, such as water

(Cador et al., 1989), sucrose (Burns et al., 1993), and sex (Everitt et al., 1989), as well as

psychostimulants (Whitelaw et al., 1996), strongly implicating the BLA in incentive salience

attribution to many types of reward-related stimuli. Such lesions also reduce the ability of

elevated DA transmission in the NAc as a result of psychostimulant administration to potentiate

CR responding (Cador et al., 1989; Burns et al., 1993). This suggests that the incentive salience

of a CR, as measured by responding to obtain it, and its enhancement by psychostimulants

involve an interaction between VTA-NAc and VTA-BLA mediated DAergic mechanisms (Ito

and Canseliet, 2010; Ito and Hayen, 2011). Therefore, the BLA appears to be crucial for learning

about the significance of reward-associated stimuli, as well as for allowing CRs to support and

guide instrumental behaviours.

7 The dorsal striatum and conditioned reinforcement

The powerful control that a CR exerts over instrumental behaviour is relatively resistant to

extinction (Weiss et al., 2001; Vanderschuren et al., 2005). This persistence may be mediated by

the dorsal striatum (DStri) (Everitt and Robbins, 2005), which is involved in the formation of

habit learning and has been implicated in CR control over drug seeking and taking behaviour

exhibited by both rats and humans (Ito et al., 2002; Volkow et al., 2006). Merely experiencing a

CR associated with a drug can lead to a large efflux of DA to the DStri (Ito et al., 2002).

Inactivation of the DStri also affects the ability of CRs to both maintain persistent responding

and reactivate responding following extinction (Di Ciano et al., 2008). Such findings implicate

the DStri in a transition from initial goal-directed behaviour towards CRs to a more habitual,

automatic response as a result of conditioning and time. Thus the DStri may mediate the switch

from action-outcome to stimulus-response control by CRs over behaviour (Everitt et al., 2001;

Vanderschuren et al., 2005).

Page 13: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

6

8 The role of dopamine receptors in conditioned reinforcement

Dopamine is central to the attribution and maintenance of incentive salience to reward-related

stimuli, particularly psychostimulant drugs of abuse, and regions involved in the mediation of

CR responding are sensitive to changes in DA innervation. G-protein coupled dopamine

receptors in these regions are of the D1-like family (D1 and D5), which stimulate adenylyl

cyclase and subsequently the intracellular cAMP signalling cascade, or the D2-like family (D2,

D3 and D4) which inhibit adenylyl cyclase (Jaber et al., 1996; Anderson and Pierce, 2005). Both

D1 and D2 receptors within the NAc have been shown to mediate the control of responding for

CRs: D1 and D2 receptor antagonists diminish CR responding and potentiated responding due to

psychostimulants, while direct receptor agonists may amplify CR responding (Beninger et al.,

1991; Wolterink et al., 1993).

Interestingly, intra-accumbal D3 receptor antagonism has been shown to disrupt the ability of a

drug associated CR but not a sucrose associated CR to control responding (Di Ciano et al.,

2003). A similar result emerges after D3 receptor antagonism within the BLA for cocaine-

associated CRs (Di Ciano, 2008). Reinstatement of cocaine seeking by CRs following extinction

is also diminished as a result of D3 receptor blockade, but has no effect on sucrose-conditioned

place preference (Vorel et al., 2002). Such evidence of a DA receptor specific to drugs of abuse

as opposed to natural rewards has led to the D3 as a therapeutic target for the treatment of

addiction (Heidbreder et al., 2005).

9 The dopamine transporter and its role in incentive motivation

Appropriate termination of neurotransmission is integral to incentive motivation and neural

communication as a whole. The principle mechanism for this termination in monoamine neurons

is reuptake of neurotransmitter back into the presynaptic neuron by Na+/Cl

—dependent

neurotransmitter transporters (Torres and Amara, 2007). The DA transporter (DAT) exerts

powerful regulatory control over dopaminergic tone by removing DA from the synapse

following bursts of activity. DAT is located most prevalently in the mesocorticolimbic DA

system with the highest density observed in the VTA and SNc DA projection neurons, and

Page 14: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

7

concurrently their respective ventral and dorsal striatal target regions (Ciliax et al., 1995, 1999).

To maintain temporal precision in neural communication DA is cleared from the synapse by

DAT within 1 second following a phasic burst of activity. If this reuptake transporter is not

present, removal of DA from the synapse is increased 100-fold with kinetics reaching that of

passive diffusion, along with proportional compensatory decreases in DA release (Giros et al.,

1996).

DAT is also the site of action for many drugs of abuse. Cocaine and MPH bind to DAT and

block the reuptake of DA into the presynaptic terminals following phasic burst firing, while

AMPH destabilizes presynaptic DA containing vesicles and reverses DAT, resulting in an

outflow of DA (Giros et al., 1996; Salahpour et al., 2008). Considering that drugs which enhance

responding for a CR such as MPH and AMPH also alter DA kinetics at DAT, this reuptake

transporter may play a large role in the regulation of responding for a CR by controlling the

amount of DA active within the synapse. If higher levels of DAT are present in the synapse,

burst firing of DA neurons signalling motivational incentive salience may be toned down as a

result of more rapid removal from the synapse. This would potentially result in a suppressed

motivational signal manifested in the conditioned reinforcement paradigm as reduced responding

for a CR. The present experiment used mice which overexpressed DAT to examine whether this

form of decreased DAergic tone manifests in decreased motivation to obtain a CR. These mice

exhibit a three-fold increase in DAT which resulted in a 40% decrease in striatal DA while

showing no basic impairments in behaviour. However, DAT overexpressing mice may have a

reduction in basal incentive motivation in that they do not work as hard as wild-type mice to

obtain a milk reward (Salahpour et al., 2008).

10 Experimental framework and hypotheses

The acquisition of a new response task by conditioned reinforcement can be used to understand

the motivational significance of reward-associated stimuli (Mackintosh, 1975). Unlike second-

order schedules of reinforcement, this task can examine the ability of a CR to drive pure

incentive motivation towards obtaining it, and the ability of a drug to enhance this responding

with mutual exclusivity (Sutton and Beninger, 1999). Although this behavioural paradigm has

been used extensively in rats, surprisingly little attention has been given to the study of mice in

Page 15: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

8

tests of conditioned reinforcement. The use of mice provides not only the possibilities of

pharmacological and physical manipulations of incentive motivation but also behavioural genetic

techniques. Transgenic mice could be used to specifically examine the role of individual receptor

subtypes and neurotransmitter reuptake transporters involved in incentive motivation. Therefore,

extensively characterizing tests of acquisition of, and responding for, conditioned reinforcement

in mice will set the stage for such research and ultimately enhance the understanding of such

behavioural phenomena.

Very few published papers have examined responding for conditioned reinforcement, and the

effects of DAT-acting psychostimulants on this response, in mice. Experiments that have

examined this behaviour have included a relatively limited characterization of this behaviour

which is well established in rats. We felt it important to examine multiple factors to ensure that

this was not a species- or operant-specific behaviour in mice, as other studies have only

examined one strain and one operant response type. For example, Mead and colleagues (2004)

examined lever press responding for a CR using one inbred C57BL/6 strain of mice, and

subsequently examined the ability of AMPH to enhance responding for the CR (Mead et al.,

2004). Therefore work in this field is very limited.

10.1 Five major goals of these experiments

1) Goal one was to fully characterize the ability of mice to learn approach behaviour to a reward

associated CS and to respond for that CS when it acts as a CR. The ability of mice to acquire

and express responding for a CR was also compared to a secondary control group of mice

which, during Pavlovian association training, presentation of the CS is explicitly unpaired

with the US. This allowed us to determine that the apparent incentive salience of the CS

acting as a CR was, in fact, derived from its association with a primary reward. We also

examined the time course of extinction of responding for a CR to ensure this behaviour had

not become completely habitual.

2) Goal two was to examine whether strain influenced responding for CR. Thus, two strains of

mice were compared: the inbred C57Bl/6 and the outbred CD1 strains. The C75BL/6 strain

has previously been used in this behavioural paradigm (Mead et al., 2004), and are the

common background strain used in transgenic mouse studies.

Page 16: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

9

3) Goal three was to examine whether the type of operant response was important in

determining acquisition and expression of responding for CR. Thus, two operant responses

resulting in CR presentation were compared: lever press and nosepoke. This allowed

comparison of a well established operant (lever press) with an operant that fits within the

behavioural repertoire of mice (nosepoke) (Crawley, 2007).

4) Goal four was to determine whether responding for CR is potentiated following systemic

administration of the psychomotor stimulant methylphenidate (MPH) which acts as a DA

reuptake inhibitor. Early work in rats suggested that DA reuptake blockers provide more

reliable enhancement of CR responding than AMPH when injected systemically (Hill, 1970;

Robbins, 1978; Beninger et al., 1981). This may relate to DAT blocker effects depending on

the initial burst firing of DA neurons as a result of stimulus salience as opposed to AMPH

which can cause DA release into the synapse by reversing DAT regardless of

neurotransmission.

5) Goal five was to utilize genetic techniques to examine the ability of mice exhibiting whole-

brain upregulation of DAT to acquire and express responding for a CR. This genetic

alteration produces a neurochemical profile of less DA being present in the synapse

following excitation, particularly in the striatum. The ability of both MPH and AMPH to alter

responding for a conditioned reinforcer in DAT upregulated mice was also examined. The

distinct mechanisms of action of MPH and AMPH permitted two different approaches to

altering incentive motivation towards the CR.

10.2 Hypotheses

1) Both C57Bl/6 and CD1 strains of mice will learn Pavlovian approach behaviour during

training, express selective responding for a CR on both lever press and nosepoke operants,

and extinguish conditioned reinforcement when the CR is no longer presented upon

responding.

2) Mice receiving explicitly unpaired presentations of CS and US during Pavlovian association

training will not respond for the CS acting as a CR.

Page 17: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

10

3) The psychomotor stimulant MPH will enhance CR responding of both inbred and outbred

mice strains.

4) C57Bl/6 mice overexpressing DAT will effectively learn Pavlovian approach behaviour, as

associative memory mechanisms will remain relatively unaffected, while exhibiting slightly

reduced responding for a CR on a lever press operant. This is based on the upregulation of

DAT resulting in a 40% decrease in striatal DAergic tone, which has been suggested to

suppress incentive motivation, while leaving glutamatergic associative memory mechanisms

largely unaffected.

5) Administration of MPH to DAT upregulated mice will result in enhanced responding for a

CR to the same levels as wild-type controls, while AMPH will enhance responding for a CR

in both groups, but to a higher level in DAT upregulated mice. MPH overwhelmingly blocks

DAT resulting in enhanced DA in the synapse; regardless of the quantitative difference in

DAT, essentially all transporters will be saturated and blocked by MPH. The amount of DAT

present would presumably play a larger role in the effects of AMPH. Mice with upregulated

DAT would essentially exhibit more potential pores in the presynaptic neuron permitting DA

into the synapse at a higher rate in the presence of AMPH, effectively amplifying this effect

compared to mice with normal levels of DAT

Page 18: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

11

Chapter 2 Methods

1 Subjects Thirty-six C57Bl/6 and 36 CD-1 mice from Jackson Labs (Maine, USA) were used in this

experiment, along with twelve male and eight female DAT upregulated C57Bl/6 mice of varying

ages obtained from the Salahpour lab at U of T (Salahpour et al., 2008). Mice were pair-housed

in a temperature and humidity controlled room on a 12 hour light dark cycle with lights on at 7

am. All training and testing occurred a minimum of 2 hours after lights on with food available ad

libitum. Water access was restricted for 22 hours for the majority of the study, except where

noted. During water restriction, all mice maintained adequate health and body weight (within

±20% of their average body weight). Mice received habituation to injections with two saline

injections on different days prior to testing with MPH. All procedures were approved by the

Centre for Addiction and Mental Heath Animal Care Committee and adhered to Canadian

Tricouncil guidelines for the humane treatment of experimental animals.

2 Apparatus

2.1 Conditioned reinforcement

Training and testing for responding for a CR was conducted in twelve operant conditioning

boxes (Med Associates, St Albans, VT) measuring 33 by 31 by 29 cm3. The rear stainless-steel

wall of the chamber is curved and contained an array of 5 2.5cm circular apertures located 2.5cm

above the floor and 2.5cm apart. During training, these ports were closed in all boxes using

circular stainless-steel plugs. Following training, these ports were open in the six of twelve boxes

using a nosepoke operant response, and were closed in the other six boxes using a lever press

operant response. The stainless-steel front wall of the chamber contained a horizontally centered

5cm2 reinforcer magazine 2.5cm above the floor. The magazine contained an infrared

photodetector at the entrance and a light mounted on the roof of the magazine. The wall also

contained two retractable levers which were removed during the training phase and inserted

during the testing phase into the six boxes using a lever press operant response. Positioned above

each lever was a yellow stimulus light. A motor-driven dipper was raised to deliver 0.06ml of

Page 19: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

12

liquid through a hole in the floor of the magazine. Each operant box was illuminated by a

houselight, and was enclosed in a sound-attenuating chamber equipped with a ventilation fan.

The boxes were controlled by an IBM-compatible computer running Med-PC for Windows.

2.2 Locomotor activity

A custom-built activity system containing 16 clear polycarbonate activity chambers was used.

Fourteen infrared photocells spaced 7.5 cm apart and 2 cm above the cage floor lined the cages

along their length.

3 General behavioural procedures

3.1 Pre-training

Prior to any training, mice received at least a week and a half of acclimatization to water

restriction which continued through the entire experiment, unless otherwise specified. This was

done to make the saccharin reward more palatable during Pavlovian association training. During

this period, all mice remained healthy and within 20% of their average bodyweight. It had been

suggested that 22 and 20 hours of water deprivation produce few adverse physiological or

behavioural effects on mice, and that mice adapt more effectively to water rather than food

deprivation (Tucci et al., 2006; Rowland, 2007). Mice also received multiple periods of

saccharin availability to reduce neophobia to saccharin during training.

3.2 Phase 1: Pavlovian conditioning (training) phase

Prior to the training, mice received one day of dipper training which involved up to 60

presentations of a 0.06ml 0.2% saccharin solution to a magazine illuminated upon each

presentation. The next day, water deprived mice were placed into operant boxes where a CS

complex was presented just prior to the delivery of saccharin solution in the dipper 30 times on

an RI60 schedule. The average session length was 40 minutes. The CS was a compound stimulus

consisting of the immediate extinguishing of the houselight, a 5s illumination of both left and

right yellow stimulus lights followed by the sound of the motorized dipper being raised to an

accessible level for 8s to allow adequate approach and consumption time. Both stimulus lights

remained on during the 8s saccharin availability, and were extinguished at the end of this period

Page 20: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

13

when the motorized dipper descended and the houselight was turned back on. The main

dependent variables recorded were the total number of head entries into the US magazine during

the 5s CS period and the number of entries during the 5s immediately prior to each CS period.

Trials on which mice failed to pick up the US were recorded as “Misses”’; the number of entries

while the dipper was elevated were also recorded (US responses). Following 30 of these CS-US

presentations, mice were promptly removed and placed back into their homecages and given 2

hours of water access after a short delay.

3.3 Phase 2: Operant conditioning (testing) phase

One day prior to testing, mice were placed in the operant chambers with levers or nosepoke ports

presented. Mice were allotted 20 minutes to make 10 responses on the active lever or nosepoke

port which resulted in CS presentation on a random ratio 2 schedule. The CS during the testing

phases was a shortened version of that in the training phase, in that the stimulus lights were

active for 2 seconds and the presentation of the dipper (devoid of saccharin) was 1 second long.

The dipper was only elevated for this amount of time to produce the sound of the dipper

elevating without providing long access to the empty dipper receptacle. Mice were subsequently

removed and placed back into their homecages where they received 2 hours of water access. The

purpose of this day was to habituate the mice to the response lever or nosepoke ports to reduce

confounding novelty on the following test day (Fletcher, 1995).

Following this habituation, responding for a CR was assessed. In the boxes animals received

training in, two levers or nosepoke ports were presented to mice: the CR lever or port and the no

conditioned reinforcer (NCR) lever or port. Responses on the CR lever or port resulted in the

presentation of the motivationally significant light-dipper complex CS, now termed a CR, on a

random ratio 2 schedule. Response on the NCR lever or port had no programmed consequences

and served as a control to examine the specificity of motivationally directed behaviour towards

the CR. After 40 minutes of conditioned reinforcement testing, mice were promptly removed

from the operant chambers and placed into their homecages where they received two hours of

water access after a short delay.

Page 21: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

14

4 Experiment 1a: Acquisition, expression, and amplification of responding for a conditioned reinforcer

This experiment assessed the ability of mice to acquire and express conditioned responding for a

saccharin-associated CR in two different strains and across two different operant responses. The

inbred C57Bl/6 mouse strain and outbred CD-1 strain were used to examine cross-strain

generalizability of conditioned reinforcement behaviour and enhancement of responding for a

CR due to MPH administration.

Twenty-four C57Bl/6 and 24 CD-1 mice under 22 hour water restriction received 14 days of

Pavlovian association training between the saccharin US and CS complex. Following the operant

habituation session responding for a CR was examined concurrently with the ability of the DA

reuptake blocker MPH to amplify this behaviour in both strains of mice and on both operant

response types (nosepoke or lever press). Prior to testing, mice received and injection of vehicle,

2.5, 3.5 or 5 mg/kg MPH HCl i.p. (Medisca, Montreal, QC, Canada) according to a Latin square

design to ensure counterbalanced allocation of MPH administration over four test days separated

by at least 72 hours without dosage overlap.

5 Experiment 1b: Temporal stability and subsequent amplification of responding for a conditioned reinforcer

Ten days after the final test day of experiment 1a, the persistence of responding for a CR in the

absence of drug was examined. Each day for 13 days, water restricted mice underwent identical

testing procedures as for experiment 1a without any injections. Following the 13th

day of

baseline testing, mice again received vehicle, 3.5 or 5 mg/kg MPH i.p. just prior to testing

following a Latin square design over three test days separated by 72 hours. The purpose of this

experiment was to ensure responding for a CR was a stable and reliable measure of incentive

motivation in mice, and to assess the effects of MPH following stabilization of this response.

Page 22: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

15

6 Experiment 1c: Extinction of responding for a conditioned reinforcer

6.1 Removal of water restriction as a motivating factor

An initial step in response extinction was to remove mice from water restriction and effectively

eliminate the motivating factor of thirst in responding for a saccharin-associated CR. All mice

were given free access to water and drug-free responding for a CR was assessed over a six day

period.

6.2 Extinction training - CS not presented upon CR responding

We then tested for extinction of responding for a CR with continued free access to water. During

a nine day period of extinction training, all procedures remained the same with the exception that

responding on the CR lever or nosepoke port no longer resulted in CR presentation.

7 Experiment 2: Locomotor activation by 3.5 and 5 mg/kg doses of MPH

To confirm that doses of MPH were behaviourally active in both mouse strains this experiment

measured the locomotor response to MPH. Sixteen C57Bl/6 and 16 CD-1 mice chosen at random

from the 48 in the previous experiments were used to assess the locomotor stimulating property

of the MPH doses used in the experiment. Mice were habituated to the locomotor activity boxes

for 1 hour per day over four days. Subsequently, over the course of three test days separated by

72 hours each, mice were placed in locomotor chambers for one hour to stabilize baseline

activity, after which each mouse received vehicle, 3.5 or 5 mg/kg MPH i.p. following a Latin

square design and had their activity monitored for another hour.

8 Experiment 3: Assessment of explicitly unpaired CS and US presentations and responding for a non-conditioned stimulus in mice

Twelve new C57Bl/6 and 12 CD-1 mice under 22 hours of water restriction received 14 days of

training similar to mice in experiment 1, but with an explicitly unpaired CS and US. In this case,

Page 23: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

16

extinguishing of the houselight and 5 second illumination of both stimulus lights was never

contingent with the elevation of the dipper and presentation of saccharin.

9 Experiment 4a: Effect of whole-brain DA transporter overexpression on responding for a conditioned reinforcer

Transgenic mice overexpressing DAT (DAT-Tg) were used in this experiment in an attempt to

examine the control DAT exerts over incentive motivation. Water restricted male (wild-type

(mWT), n = 6; mDAT-Tg, n = 5) and female (fWT, n = 4; fDAT-Tg, n = 4) C57Bl/6 mice of

varying ages obtained from the Salahpour lab at U of T underwent the same Pavlovian training

as mice in Experiment 1. Following training, one day of habituation to the lever press operant

was received to eliminate novelty driven responding during testing. Responding for a CR in a

drug-free state was then examined in all mice for three days.

10 Experiment 4b: Effect of MPH and AMPH on responding for a CR in DAT overexpressing mice

Following three baseline tests of responding for a CR, the ability of MPH and AMPH to alter

responding in DAT overexpressing mice was examined. Over the course of two days separated

by 72 hours, mice received either saline or 5 mg/kg MPH in a random order just prior to

assessment of responding for a conditioned CR. Following testing with MPH, the same mice

(with the exception of one fWT mouse; fWT, n = 3) received i.p. administration of saline, 0.2, or

1 mg/kg AMPH sulphate (U.S. Pharmacopeia, Rockville, MD) prior to assessment of responding

for a CR in a random order over the course of three days each separated by 72 hours.

11 Statistical Analyses

SPSS version 15.0 was used to perform three-way repeated measures ANOVAs; paired samples

t-tests were used for post-hoc comparisons between means. Within-subjects variables used in

ANOVAs included: Lever or Port, distinguishing operant response type and CR or NCR operant

responses; Day of training and testing; and Dose of MPH or AMPH. Between-subjects variables

used in ANOVAs included: Strain (C57 or CD1) and Genotype (WT or DAT overexpressing

mice).

Page 24: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

17

Chapter 3 Results

1 Experiment 1a: Acquisition, expression, and amplification of responding for a conditioned reinforcer

1.1 Training

Figures 1A and 1C show that over the course of Pavlovian conditioning, both C57Bl/6 and CD-1

mice learned to approach the location of a saccharin reward during CS presentations. Thus, both

strains showed increased responding during CS periods compared to pre-CS periods (CS vs

PreCS: F(1, 45) = 113.213, p < 0.001). This discrimination in responding emerged over time (Day

x CS vs PreCS: F(13, 585) = 64.607, p < 0.001). The lack of a significant main effect of strain (F(1,

45) = 0.007, ns) confirmed that the two strains were not different in their response patterns.

Figures 1B and 1D illustrate that, while early in training mice failed to respond for the US, as

training progressed the number of “missed” trials fell (Day: F(13, 598) = 107.94, p < 0.001), and the

number of responses made during the US presentation increased (Day: F(13, 585) = 68.60, p <

0.001). These measures did not differ between strain (Misses, Strain x Day: F(13, 598) = 0.729, ns;

US, responses Strain x Day: F(13, 585) = 0.194, ns).

1.2 Test of responding for a CR

Figures 2A and 2C show the effect of MPH on lever press responding for a CR in CD1 and

C57Bl/6 strains. Significant main effects of Lever (F(1, 22) = 21.54, p < 0.001) and Strain (F(1, 22) =

6.31, p < 0.05) were found, suggesting that responding on the CR lever was higher than on the

NCR lever, and that CD1 mice made more responses than C57Bl/6 mice. Although the main

effect of Dose was not significant (F(3, 66) = 2.426, p = 0.07), a significant interaction between

Lever and Dose was found (F(3, 66) = 3.66, p < 0.05), which suggested that MPH increased

responding on the CR lever. Selective post hoc tests determined that 3.5 mg/kg of MPH

enhanced lever pressing for a CR only in CD1 mice.

Page 25: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

18

Figures 2B and 2D show the effects of MPH on nosepoke responding for a CR in both strains of

mice. A significant main effect of Port (F(1, 22) = 20.12, p < 0.001) indicated that responding on

the CR port was higher than on the NCR port. The total number of responses was similar

between strains (Strain: F(1, 22) = 1.087, ns), and MPH had no effect on responding at any dose

tested (Dose: F(3, 66) = 0.796, ns)

2 Experiment 1b: Temporal stability and subsequent amplification of responding for a conditioned reinforcer

2.1 Extended CR responding

Figures 3A and 3C show lever pressing for a CR in both mouse strains over 13 days of testing. A

significant main effect of Lever (F(1, 22) = 36.42, p < 0.001), and Strain (F(1, 22) = 7.03, p < 0.05)

suggested that responding on the CR lever was higher than on the NCR lever, and that

responding was higher in CD1 than in C57Bl/6 mice. For both strains responding on the CR and

NCR levers remained stable over the 13 days of testing (Day: F(12, 264) = 3.74, ns; Day x Strain:

F(12, 264) = 3.74, ns)

Figures 3B and 3D illustrate nosepoking for a CR in both mouse strains over 13 days of testing.

A significant main effect of Port (F(1, 22) = 56.11, p < 0.001) reflected that responding was higher

on the CR nosepoke port compared to the NCR port. The ANOVA also showed a significant

three way interaction between Port, Test day and Strain (F(12, 264) = 4.91, p < 0.01), which implies

that the interaction between Port and Test day varied between strains. This is most likely

explained by the variability in responding specifically by CD1 mice within the first seven days.

However, over each test day, responding on the CR port was always higher than the NCR port.

2.2 MPH test 2

Figures 4A and 4C show the effects of MPH on lever pressing for CR in both mouse strains in a

second drug test. The ANOVA found significant main effects of Lever (F(1, 22) = 77.70, p <

0.001), Strain (F(1, 22) = 9.62, p < 0.01), and Dose (F(2, 44) = 3.34, p < 0.05). These main effects

reflect the fact that responding was higher on the CR lever compared to the NCR lever, that

MPH increased responding, and the CD1 mice responded at higher levels than C57Bl/6 mice.

The Lever x Dose interaction was significant (F(2, 44) = 6.04, p < 0.01), indicating that MPH

Page 26: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

19

increased responding on the CR lever. The overall three-way interaction was not significant;

however post-hoc tests showed that 3.5 and 5 mg/kg MPH increased responding on the CR lever

only in CD1 mice.

Figures 4B and 4D show the effects of MPH on nosepoking for a CR in both mouse strains.

Responses producing the CR were higher than NCR responses (main effect of Port, F(1, 22) =

54.60, p < 0.001), and the total number of responses made was generally higher in CD1

compared to C57Bl/6 mice (main effect of Strain, F(1, 22) = 5.87, p < 0.05). However, MPH

appeared to have no effect on responding at any doses tested (no main effect of dose, F(2, 44) =

0.267, p = ns). None of the interaction terms were significant.

3 Experiment 1c: Extinction of responding for a conditioned reinforcer

3.1 Removal from water restriction

Figures 5A and 5C illustrate lever press responding for a CR in both strains of mice following

removal from water restriction. Responding on the CR lever remained higher than on the NCR

lever in general (Lever: F(1, 22) = 18.62, p < 0.001), and CD1 mice made more responses than

C57Bl/6 mice (Strain: F(1, 22) = 8.61, p < 0.01). However, a Strain x Day two-way interaction (F(5,

110) = 3.6, p < 0.01) suggested the stability of responding differed between strains, likely a result

of the variability seen in CD1 mice during this period. The three way interaction was not

significant.

Figures 5B and 5D illustrate nosepoke responding for a CR in both strains of mice following

removal from water restriction. Responding on the CR nosepoke port remained higher than on

the NCR port (Port: F(1, 22) = 44.62, p < 0.00), and CD1 mice made more responses than C57Bl/6

mice (Strain: F(1, 22) = 7.49, p < 0.05). Although the magnitude of responding did not remain

stable over test days (Day: F(5, 110) = 8.6, p < 0.01), both strains followed a similar pattern of

responding (Strain x Day: F(5, 110) = 1.71, ns). Relative responses on the CR and NCR ports

changed over the course of testing (Port x Day: F(5, 110) = 8.28, p < 0.001), but responding on the

CR port was always higher than the NCR port. The three-way interaction was not significant.

Page 27: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

20

3.2 Operant responding during extinction conditions

Figures 6A and 6C show lever pressing during extinction conditions in both strains of mice. No

significant main effect of Lever was observed (F(1, 22) = 1.75, ns), suggesting that responding on

the CR lever did not differ from responding on the NCR lever.

Figures 6B and 6D show nosepoke responding during extinction conditions in both strains of

mice. Responding on the CR nosepoke port greatly decreased over extinction training days (Port

x Day: F(8, 176) = 12.64, p < 0.001).

4 Experiment 2: Locomotor activation by 3.5 and 5 mg/kg doses of MPH

Figure 7 depicts the effect of MPH to enhance locomotor activity as measured by total beam

breaks. CD1 mice were more active than C57Bl/6 mice in general (Strain: F(1, 30) = 14.70, p <

0.01), and MPH enhanced locomotor activity in both strains (Dose: F(2, 60) = 45.43, p < 0.001;

Dose x Strain: F(2, 60) = 2.38, ns). Paired samples t-tests determined that 3.5 and 5 mg/kg MPH

significantly enhanced locomotor activity in both strains (t(15)-values > 5.14, all p-values < 0.05).

5 Experiment 3: Assessment of explicitly unpaired CS and US presentations and responding for a non-conditioned stimulus in mice

5.1 Training

Figures 8A and 8B illustrate that neither C57Bl/6 nor CD1 mice increased their approach to the

reward magazine during CS presentation; in fact, magazine responses during the CS period were

significantly lower than those during the 5s pre-CS period in general (main effect of CS vs

PreCS, F(1, 22) = 82.22, p < 0.001).

Figure 8C and 8D illustrate that as training progressed the number of “missed” saccharin

presentations decreased (Day: F(13, 286) = 11.25, p < 0.001). However this occurred at different

rates between strains in that CD1 mice decreased in number of misses more slowly than C57Bl/6

mice (Strain: F(13, 286) = 4.5, p < 0.05; Strain x Day: F(13, 286) = 2.83, p < 0.01).

Page 28: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

21

5.2 Test of responding for a CR

Figures 9A and 9B show lever press responding for a CS that was never paired with the

saccharin US in both strains of mice. No difference was observed between responses made on

the CR and NCR levers (Lever: F(1, 22) = 0.03, ns).

6 Experiment 4a: Effect of whole-brain DA transporter overexpression on responding for a conditioned reinforcer

6.1 Training

Data were pooled between sexes in experiment 4a and 4b as no sex differences were apparent in

either genetic background (data not shown). Figures 10A and 10B show that over the course of

Pavlovian conditioning, both DAT-Tg and WT mice learned to approach the location of a

saccharin reward during CS presentations. DAT-Tg and WT mice did not differ in their response

patterns (Strain: F(1, 18) = 2.55, ns). Thus, both strains showed increased responding during CS

periods compared to pre-CS periods (CS vs PreCS: F(1, 18) = 122.26, p < 0.001). This

discrimination emerged over time (Day x CS vs PreCS: F(13, 234) = 18.48, p < 0.001).

6.2 Test of responding for a CR

Figures 11A and 11B show lever press responding for a CR in a drug-free state in both DAT-Tg

and WT genotypes of mice. Responding was selective for the CR lever relative to the NCR lever

(Lever: F(1, 17) = 75.72, p < 0.001), and no difference in magnitude of CR responding between

DAT-Tg and WT mice was apparent (Genotype: F(1, 17) = 0.44, ns; Genotype x Lever: F(1, 17) =

0.45, ns).

7 Experiment 4b: Effect of MPH and AMPH on responding for a CR in DAT overexpressing mice

7.1 Effect of MPH

Figure 12 illustrates the effect of MPH on lever press responding for a CR in both DAT-Tg and

WT mice. Responding on the CR lever was significantly higher than on the NCR lever (Lever:

F(1, 16) = 193.39, p < 0.001), but this did not differ between DAT-Tg and WT mice (Genotype x

Page 29: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

22

Lever: F(1, 16) = 1.53, ns). Administration of MPH had no effect on responding for a CR (Dose:

F(1, 16) = 0.12, ns; Dose x Lever: F(1, 16) = 1.83, ns) in both DAT-Tg and WT mice (Genotype x

Dose X Lever: ns).

7.2 Effect of AMPH

Figure 13 shows the effect of AMPH on responding for a CR in both DAT-Tg and WT mice.

Responding on the CR lever was significantly higher than on the NCR lever (main effect of

Lever, F(1, 16) = 147.10, p < 0.001). A significant main effect of AMPH was also found (F(2, 32) =

19.70, p < 0.001), with AMPH reducing responding at the highest dose. The main effect of

genotype was not significant. The interaction between lever and dose was significant (F(2, 32) =

8.54, p < 0.01), with amphetamine reducing CR responding only. This effect was seen in both

DAT-Tg and WT mice (Genotype x Dose X Lever: ns).

Page 30: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

23

Chapter 4 Discussion

This series of experiments provides a detailed characterization of the acquisition of a new

response by conditioned reinforcement in mice, and the augmentation of this behaviour by

changes in DAergic activity in the brain. The results presented here show that mice learn to

approach a CS associated with a reward, and that this CS acquires incentive salience which

drives behaviour towards obtaining it when acting as a CR. Furthermore, selective responding

for a CR is stable over multiple test days, is modulated by MPH administration, and extinguishes

when responding no longer results in CR presentation. Finally, mice chronically overexpressing

DAT exhibit no deficits in learning approach towards a reward associated CS, salience

attribution to this CS, and responding for this CS acting as a CR.

1 Characterization of the acquisition and expression of conditioned reinforcement in mice

The first goal of these experiments was to assess the ability of mice to learn approach behaviour

towards a reward predictive CS and selectively respond to obtain that CS when it acts as a

reward. Both inbred C57Bl/6 and outbred CD1 mouse strains display these facets of conditioned

reinforcement, suggesting that this behaviour is reliable in mice and therefore could be used as

an effective tool in examining brain mechanisms involved in incentive motivation. It is also clear

that the association with a primary reward is critical for imbuing the CS with incentive salience,

subsequently allowing it to act as a CR. Responding for a CR is also strikingly stable over

multiple test days in the two strains and on two types of operant response. With such extended

testing and repetition of this operant behaviour, it is possible that responding may have become

habitual and no longer driven by incentive motivation directed at the CR. However, although not

driven by motivational state (i.e. thirst), responding for the CR during this extended testing

period is still driven by its incentive motivational properties. When the CR was omitted

following lever and nosepoke responses that normally produced it, selective responding on those

levers and ports was abolished within three days of testing. This latter finding shows that

responding directed towards the CR was not habitual, and did indeed depend on the outcome of

the behavioural response. Overall, these results suggest that mice effectively attribute a reward

Page 31: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

24

associated CS with incentive salience enabling it to act as a CR, and this salience driven response

pattern is stable over an extended period of testing.

2 Strain differences in conditioned reinforcement

Both inbred C57Bl/6 and outbred CD1 mice show reliable responding for a CR which is resistant

to decrement as a result of repeated testing, but extinguishes when the operant response no

longer produces the CR. Although some variability in response patterns were present such as

CD1 mice making more responses on average, both strains exhibited significantly higher

responding on the manipulandum delivering the CR as opposed to the one with no programmed

consequence (NCR).

The higher and more variable responding exhibited by CD1 mice may stem from a higher basal

activity level within the DA system and differential susceptibility to DAergic drugs between

strains. CD1 mice showed higher CR responding throughout the study compared to C57Bl/6

mice, and MPH administration resulting in a more pronounced effect on responding for a CR in

CD1 mice. Although few studies have directly compared C57Bl/6 and CD1 strains of mice in

terms of DA system activity, one study suggests that the inbred C57Bl/6 mice may have mild

decreases in DA system function relative to outbred CD1 mice. It was also suggested that CD1

mice exhibit decreases in DA system activity but with more variability, presumably based on

their genetic diversity (Prasad and Richfield, 2008). These present experiments do, however,

suggest that while both strains of mice learn about, and respond for, conditioned reinforcement,

pharmacological manipulations, at least of the DAergic system, produce a stronger effect in CD1

relative to C57Bl/6 mice. Therefore, CD1 mice may be a better choice of mouse strain in which

to demonstrate drug effects on the incentive motivational properties of reward-related cues in

mice.

3 Operant response type in responding for a CR

Two operant response types were also examined in these experiments: lever pressing and

nosepoking for a CR. While lever pressing is a well established type of response in this and other

reinforcement paradigms, nosepoking has been suggested to be a more natural behaviour in mice

(Crawley, 2007). Therefore, mice may have showed more responding for a CR following a

Page 32: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

25

nosepoke response rather than a lever press; however, responding in fact appeared to be strongest

for lever pressing. To understand this effect, it is important to note that the nosepoke operant in

experiment 1 was not merely a substitution of levers for nosepoke ports; the operant response

elements were in different physical locations.

Nosepoke ports were located at the opposite end of the chamber from the reward magazine and

stimulus lights, while levers flanked the reward magazine. Therefore, mice engaging in either

nosepoking or lever pressing likely differed in several ways including head direction, visibility of

the visual component of the CR during the actual operant behaviour (head inside a hole versus

facing the CS), and distance from the sound of the dipper. In the lever press condition, mice are

oriented towards two major aspects of the CR that nosepoking mice are not: the stimulus lights

and the general direction of the sound that the dipper mechanism originates from. In the future, a

more thorough assessment of nosepoking relative to lever pressing for a CR should involve

substituting levers with nosepoke ports in the exact same location.

Both nosepoking and lever pressing served as effective operant behaviours for assessing

responding with conditioned reinforcement. Although the nosepoke response seemed more

variable than lever pressing over the course of experiment 1 (e.g. figure 3, particularly CD1, and

figure 5), responding on the CR port was always higher than responding on the NCR port.

However, this variability prompted the use of the more stable lever press operant in further

experiments. Thus, these experiments show that lever press responding for a CR is an effective

tool for examining cue-controlled incentive motivation in mice.

4 Modulation of responding for a CR by MPH administration in normal mice

The psychomotor stimulant MPH failed to increase responding in Experiment 1a. This result was

surprising given that this class of drug reliably increases CR responding in rats (Robbins, 1976;

Beninger and Phillips, 1980; Taylor and Robbins, 1984; Ito et al., 2000; Fletcher et al

unpublished observations). However, MPH has previously been shown to not significantly

enhance responding for a CR unlike other psychostimulants (Robbins, 1978). In the present

experiment mice were administered saline or MPH in a Latin square design over four days. This

meant that many mice had MPH “on board” during, and experienced injections just prior to, their

Page 33: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

26

very first assessment of this new behaviour. This may have resulted in a higher variability in

behaviour over these test days not reflective of true levels of responding for a CR. However, a

second assessment of the effect of MPH on responding for a CR following a period of

stabilization occurred in experiment 1b where MPH greatly enhanced lever pressing for a CR in

CD1 mice but not C57Bl/6 mice. As described in previously, this may relate to differences in DA

system of CD1 mice compared to C57Bl/6 mice. Nosepoke responding for a CR, however, did

not appear to be affected by any dose of MPH. Based on the confounding physical differences

between lever press and nosepoke responding in these experiments it is difficult to specify the

exact reason for this.

5 Incentive salience attribution and motivation in DAT upregulated mice

One reason for characterizing the conditioned reinforcement paradigm in mice was the potential

utility of using behavioural genetic techniques to examine underlying mechanisms of incentive

motivation and salience attribution. Given that MPH, which elevates extracellular DA (Schweri

et al., 1985), increased responding for CR we hypothesized that reduced DA activity would

impair this response. We tested this hypothesis using a transgenic mouse that has a three-fold

increase in DAT and consequent 40% decrease in DAergic tone in the striatum. These mice have

previously shown an increase in susceptibility to AMPH reward and possible reductions in basal

motivation to obtain a reward relative to wild-type animals (Salahpour et al., 2008). However, no

detriments were expected in forming a CS-US association as measured by approach to the CS

during US presentation, as this conditioned approach does not appear to require phasic DA

output (Berridge, 2007; Parker et al., 2010). However, lower DAergic tone was expected to

result in decreased incentive salience of this CS, which would suppress the motivation to respond

for it when acting as a CR. For example DA receptor antagonists which functionally block DA

activity reduce responding for CR (Beninger et al., 1980; Ito et al., 2000). Therefore, DAT

upregulated mice were expected to exhibit less responding for a CR compared to wild-type

control mice.

C57Bl/6 mice overexpressing DAT showed no impairments in learning conditioned approach to

a reward associated CS, which is consistent with reports in rats showing that impaired DA

Page 34: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

27

function does not block approach to a reward-related CS (Taylor and Robbins, 1984; Parker et

al., 2010). Unexpectedly, however, these mice despite their hypodopaminergic state showed no

impairments in responding for that CS acting as a CR. In comparison to the previous study by

Salahpour and colleagues (2008), these results suggest that mice chronically expressing whole-

brain upregulation of DAT do not exhibit major impairments in incentive salience attribution or

incentive motivation towards a cue associated with a natural reward.

One reason for this lack of effect could be a compensatory presynaptic mechanism of

upregulating the amount of DA released from vesicles to counteract the more rapid uptake of DA

which has developed as a result of chronic DAT overexpression. DAT knockout mice exhibit a

similar, but opposite form of compensatory mechanism, in that vesicular DA release is

downregulated 73% in homozygous and 34% in heterozygous mice (Giros et al., 1996). A

second possibility is a postsynaptic compensatory mechanism such as D1 receptor upregulation,

which may work in concert with presynaptic compensatory mechanisms to alleviate deleterious

effects of DAT upregulation. Phasic DAergic neurotransmission activates D1 receptors in the

NAc and DAT knockout mice exhibit a 55% downregulation in D1 receptor mRNA in the basal

ganglia (Giros et al., 1996; Goto and Grace, 2005). Therefore, these compensatory mechanisms

may facilitate signalling to normal levels in DAT upregulated mice by competing with the

enhanced uptake of DA back into presynaptic terminals, ultimately rescuing the effects of DAT

upregulation.

6 Effect of MPH and AMPH on responding for a CR in DAT upregulated mice

In an attempt to highlight the role of DAT in modulating DAergic tone and therefore the

incentive salience of a CR, MPH and AMPH were administered to DAT upregulated mice. These

two psychostimulants have two disparate mechanisms of action at the DAT. AMPH reverses

DAT following breakdown of presynaptic DA containing vesicles, making the AMPH effect

dependent on DAT quantity and not necessarily dependent on phasic DA output, while MPH

blocks reuptake of DA following burst firing (Schweri et al., 1985; Giros et al., 1996). DAT

upregulated mice have previously shown major increases in locomotor activity and a leftward

Page 35: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

28

shift in the dose required for induction of conditioned place preference compared to wild-type

mice following administration of AMPH, but not following MPH (Salahpour et al., 2008).

Administration of 5 mg/kg MPH to DAT upregulated mice found only a slight, albeit not

significant increase in responding for a CR. However, administration of this dose of MPH also

showed no effect compared to saline in this set of wild type mice, which was unexpected based

on the results obtained from experiment 1b. The sample size of each group was very small in this

experiment and variability within these groups in terms of mouse age and sex was high; perhaps

with a larger sample size in each condition, the true effect of MPH administration may be more

apparent.

Administration of AMPH had very interesting effects on all mice. A relatively high 1 mg/kg dose

of AMPH abolished general operant responding in both DAT upregulated and wild-type mice, in

that both CR and NCR lever responding dropped greatly compared to saline. Interestingly, this

decrease in responding on both the CR lever NCR lever was proportional between DAT

upregulated and wild-type mice. This may reflect an AMPH induced shift of salience away from

performing the operant task in general which would decrease responding for a CR. However, a

low 0.2 mg/kg dose of AMPH appeared to have no major effect on responding for a CR or on the

NCR lever, besides a slight decrease in responding on the CR lever in DAT upregulated mice

which did not reach significance.

Therefore, administration of either MPH or AMPH appeared to have no differential effect on

mice with upregulated DAT compared to their WT controls. This may be a result of pre- or

postsynaptic compensatory mechanisms in DAT upregulated mice which rescue basal DAergic

tone in these mice and prevents psychostimulants from having any major effect on operant

responding relative to wild-type mice. However, an important limitation of this pilot experiment

was a small sample size in each group and variability in the mice used in terms of age and sex.

Utilizing a larger sample size of these transgenic mice and a limited age group may provide more

reliable results in the future.

Page 36: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

29

Chapter 5 Conclusions

Although Mead and colleagues (2004), among others, have used the conditioned reinforcement

paradigm in mice previously, a thorough inspection of its viability has not been performed. The

present set of experiments has shown that using mice in this behavioural task is, in fact, reliable

and valid. Inbred and outbred strains of mice: 1) learn conditioned approach to a stimulus

predicting reward, 2) will perform different operant responses for the CS when it acts as a CR, 3)

show remarkable stability in responding for this CR over multiple weeks of testing, 4) exhibit at

least slight enhancements in responding for a CR as a result of MPH induced enhancements in

DAergic tone, and 5) extinguish responding for the CR when the operant response no longer

results in its presentation. This experiment also shows that the motivation to obtain the CR is

driven by its incentive salience as a result of being associated with a natural reward. It is

important to note that the data presented suggest that strains of mice may differ in their

magnitude of CR responding and responsivity to DAergic drugs. Therefore the strain of mouse

used in this paradigm should be carefully selected based on the manipulation of responding for a

CR.

The primary reason for examining the conditioned reinforcement paradigm in mice was to utilize

transgenic techniques in analyzing the underlying mechanisms of incentive motivation and

salience attribution. Mice chronically expressing a three-fold increase in DAT and a consequent

40% decrease in striatal DA showed no differences from wild-type animals in either conditioned

approach or conditioned reinforcement. This suggests that, perhaps by some long-term

compensatory mechanisms, these mice do not exhibit impairments in basic incentive motivation.

Administration of 5 mg/kg MPH, or 0.2 or 1 mg/kg AMPH did not significantly alter responding

relative to wild-type animals. However, the true effects of psychostimulant administration may

not have been captured due to the variability in sex and age of mice used in this experiment.

Therefore, acquisition of a new response with conditioned reinforcement is a reliable behavioural

paradigm in mice, and can be used in concert with transgenic techniques to further investigate

underlying mechanisms of incentive motivation.

Page 37: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

30

Chapter 6 References

Anderson SM, Pierce RC (2005) Cocaine-induced alterations in dopamine receptor signaling:

implications for reinforcement and reinstatement. Pharmacol Ther 106:389–403.

Beninger RJ (1983) The role of dopamine in locomotor activity and learning. Brain research

287:173–196.

Beninger RJ, Hanson DR, Phillips AG (1980) The effects of pipradrol on the acquisition of

responding with conditioned reinforcement: a role for sensory preconditioning.

Psychopharmacology 69:235–242.

Beninger RJ, Mazurski EJ, Hoffman DC (1991) Receptor subtype-specific dopaminergic agents

and unconditioned behavior. Pol J Pharmacol Pharm 43:507–528.

Beninger RJ, Phillips AG (1980) The effect of pimozide on the establishment of conditioned

reinforcement. Psychopharmacology (Berl) 68:147–153.

Berridge KC (2007) The debate over dopamine’s role in reward: the case for incentive salience.

Psychopharmacology 191:391–431.

Berridge KC, Robinson TE (1998) What is the role of dopamine in reward: hedonic impact,

reward learning, or incentive salience? Brain Res Brain Res Rev 28:309–369.

Berridge KC, Venier IL, Robinson TE (1989) Taste reactivity analysis of 6-hydroxydopamine-

induced aphagia: implications for arousal and anhedonia hypotheses of dopamine function.

Behav Neurosci 103:36–45.

Bindra D (1968) Neuropsychological interpretation of the effects of drive and incentive-

motivation on general activity and instrumental behavior. Psychological Review 75:1–22.

Björklund A, Dunnett SB (2007) Dopamine neuron systems in the brain: an update. Trends in

neurosciences 30:194–202.

Bugelski R (1938) Extinction with and without sub-goal reinforcement. Journal of Comparative

Psychology 26:121–134.

Burns LH, Robbins TW, Everitt BJ (1993) Differential effects of excitotoxic lesions of the

basolateral amygdala, ventral subiculum and medial prefrontal cortex on responding with

conditioned reinforcement and locomotor activity potentiated by intra-accumbens infusions

of D-amphetamine. Behav Brain Res 55:167–183.

Cador M, Robbins TW, Everitt BJ (1989) Involvement of the amygdala in stimulus-reward

associations: interaction with the ventral striatum. Neuroscience 30:77–86.

Page 38: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

31

Cardinal RN, Parkinson JA, Hall J, Everitt BJ (2002) Emotion and motivation: the role of the

amygdala, ventral striatum, and prefrontal cortex. Neurosci Biobehav Rev 26:321–352.

Di Chiara G (1995) The role of dopamine in drug abuse viewed from the perspective of its role

in motivation. Drug Alcohol Depend 38:95–137.

Di Ciano P, Robbins TW, Everitt BJ (2008) Differential effects of nucleus accumbens core,

shell, or dorsal striatal inactivations on the persistence, reacquisition, or reinstatement of

responding for a drug-paired conditioned reinforcer. Neuropsychopharmacology 33:1413–

1425.

Di Ciano P, Underwood RJ, Hagan JJ, Everitt BJ (2003) Attenuation of cue-controlled cocaine-

seeking by a selective D3 dopamine receptor antagonist SB-277011-A.

Neuropsychopharmacology 28:329–338.

Ciliax BJ, Drash GW, Staley JK, Haber S, Mobley CJ, Miller GW, Mufson EJ, Mash DC, Levey

AI (1999) Immunocytochemical localization of the dopamine transporter in human brain.

The Journal of comparative neurology 409:38–56.

Ciliax BJ, Heilman C, Demchyshyn LL, Pristupa ZB, Ince E, Hersch SM, Niznik HB, Levey AI

(1995) The dopamine transporter: immunochemical characterization and localization in

brain. The Journal of neuroscience 15:1714–1723.

Crawley JN (2007) What’s Wrong With My Mouse: Behavioral Phenotyping of Transgenic and

Knockout Mice, 2nd ed. John Wiley & Sons.

Everitt BJ, Cador M, Robbins TW (1989) Interactions between the amygdala and ventral

striatum in stimulus-reward associations: studies using a second-order schedule of sexual

reinforcement. Neuroscience 30:63–75.

Everitt BJ, Cardinal RN, Parkinson JA, Robbins TW (2003) Appetitive behavior: impact of

amygdala-dependent mechanisms of emotional learning. Ann N Y Acad Sci 985:233–250.

Everitt BJ, Dickinson A, Robbins TW (2001) The neuropsychological basis of addictive

behaviour. Brain Res Brain Res Rev 36:129–138.

Everitt BJ, Robbins TW (2005) Neural systems of reinforcement for drug addiction: from actions

to habits to compulsion. Nat Neurosci 8:1481–1489.

Fletcher PJ (1995) Effects of d-fenfluramine and metergoline on responding for conditioned

reward and the response potentiating effect of nucleus accumbens d-amphetamine.

Psychopharmacology (Berl) 118:155–163.

Fletcher PJ, Higgins GA (1997) Differential effects of ondansetron and alpha-flupenthixol on

responding for conditioned reward. Psychopharmacology 134:64–72.

Page 39: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

32

Gaffan D, Harrison S (1987) Amygdalectomy and disconnection in visual learning for auditory

secondary reinforcement by monkeys. J Neurosci 7:2285–2292.

Giros B, Jaber M, Jones SR, Wightman RM, Caron MG (1996) Hyperlocomotion and

indifference to cocaine and amphetamine in mice lacking the dopamine transporter. Nature

379:606–612.

Graybiel AM, Aosaki T, Flaherty AW, Kimura M (1994) The basal ganglia and adaptive motor

control. Science 265:1826–1831.

Haber SN, Fudge JL, McFarland NR (2000) Striatonigrostriatal pathways in primates form an

ascending spiral from the shell to the dorsolateral striatum. The Journal of neuroscience

20:2369–2382.

Heidbreder CA, Gardner EL, Xi ZX, Thanos PK, Mugnaini M, Hagan JJ, Ashby Jr. CR (2005)

The role of central dopamine D3 receptors in drug addiction: a review of pharmacological

evidence. Brain Res Brain Res Rev 49:77–105.

Hill RT (1970) Facilitation of conditioned reinforcement as a mechanism of psychomotor

stimulation. . In: Amphetamine and Related Compounds (Costa E, Garattini S, eds),

pp.781–795. New York: Raven.

Ito R, Canseliet M (2010) Amphetamine exposure selectively enhances hippocampus-dependent

spatial learning and attenuates amygdala-dependent cue learning.

Neuropsychopharmacology 35:1440–1452.

Ito R, Dalley JW, Howes SR, Robbins TW, Everitt BJ (2000) Dissociation in conditioned

dopamine release in the nucleus accumbens core and shell in response to cocaine cues and

during cocaine-seeking behavior in rats. The Journal of neuroscience 20:7489–7495.

Ito R, Dalley JW, Robbins TW, Everitt BJ (2002) Dopamine release in the dorsal striatum during

cocaine-seeking behavior under the control of a drug-associated cue. J Neurosci 22:6247–

6253.

Ito R, Hayen A (2011) Opposing roles of nucleus accumbens core and shell dopamine in the

modulation of limbic information processing. The Journal of neuroscience 31:6001–6007.

Ito R, Robbins TW, Everitt BJ (2004) Differential control over cocaine-seeking behavior by

nucleus accumbens core and shell. Nat Neurosci 7:389–397.

Ito R, Robbins TW, McNaughton BL, Everitt BJ (2006) Selective excitotoxic lesions of the

hippocampus and basolateral amygdala have dissociable effects on appetitive cue and place

conditioning based on path integration in a novel Y-maze procedure. The European journal

of neuroscience 23:3071–3080.

Page 40: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

33

Jaber M, Robinson SW, Missale C, Caron MG (1996) Dopamine receptors and brain function.

Neuropharmacology 35:1503–1519.

Kalivas PW, Volkow ND (2005) The neural basis of addiction: a pathology of motivation and

choice. Am J Psychiatry 162:1403–1413.

Koob GF, Le Moal M (1997) Drug abuse: hedonic homeostatic dysregulation. Science 278:52–

58.

Koob GF, Le Moal M (2001) Drug addiction, dysregulation of reward, and allostasis.

Neuropsychopharmacology 24:97–129.

Mackintosh NJ (1975) The Psychology of Animal Learning .

McClure SM, Daw ND, Montague PR (2003) A computational substrate for incentive salience.

Trends Neurosci 26:423–428.

Mead AN, Crombag HS, Rocha BA (2004) Sensitization of psychomotor stimulation and

conditioned reward in mice: differential modulation by contextual learning.

Neuropsychopharmacology 29:249–258.

Meredith GE, Agolia R, Arts MP, Groenewegen HJ, Zahm DS (1992) Morphological differences

between projection neurons of the core and shell in the nucleus accumbens of the rat.

Neuroscience 50:149–162.

Mogenson GJ, Jones DL, Yim CY (1980) From motivation to action: functional interface

between the limbic system and the motor system. Progress in neurobiology 14:69–97.

Parker JG, Zweifel LS, Clark JJ, Evans SB, Phillips PEM, Palmiter RD (2010) Absence of

NMDA receptors in dopamine neurons attenuates dopamine release but not conditioned

approach during Pavlovian conditioning. Proceedings of the National Academy of Sciences

107:13491–13496.

Parkinson JA, Olmstead MC, Burns LH, Robbins TW, Everitt BJ (1999) Dissociation in effects

of lesions of the nucleus accumbens core and shell on appetitive pavlovian approach

behavior and the potentiation of conditioned reinforcement and locomotor activity by D-

amphetamine. J Neurosci 19:2401–2411.

Phillips PE, Stuber GD, Heien ML, Wightman RM, Carelli RM (2003) Subsecond dopamine

release promotes cocaine seeking. Nature 422:614–618.

Pierce RC, Kalivas PW (1997) A circuitry model of the expression of behavioral sensitization to

amphetamine-like psychostimulants. Brain Res Brain Res Rev 25:192–216.

Page 41: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

34

Pontieri FE, Tanda G, Di Chiara G (1995) Intravenous cocaine, morphine, and amphetamine

preferentially increase extracellular dopamine in the “shell” as compared with the “core” of

the rat nucleus accumbens. Proc Natl Acad Sci U S A 92:12304–12308.

Prasad K, Richfield EK (2008) Sporadic midbrain dopamine neuron abnormalities in laboratory

mice. Neurobiology of disease 32:262–272.

Robbins TW (1976) Relationship between reward-enhancing and stereotypical effects of

psychomotor stimulant drugs. Nature 264:57–59.

Robbins TW (1978) The acquisition of responding with conditioned reinforcement: effects of

pipradrol, methylphenidate, d-amphetamine, and nomifensine. Psychopharmacology 58:79–

87.

Robbins TW, Everitt BJ (1982) Functional studies of the central catecholamines. Int Rev

Neurobiol 23:303–365.

Robbins TW, Watson BA, Gaskin M, Ennis C (1983) Contrasting interactions of pipradrol, d-

amphetamine, cocaine, cocaine analogues, apomorphine and other drugs with conditioned

reinforcement. Psychopharmacology 80:113–119.

Robinson TE, Berridge KC (1993) The neural basis of drug craving: an incentive-sensitization

theory of addiction. Brain Res Brain Res Rev 18:247–291.

Rowland NE (2007) Food or fluid restriction in common laboratory animals: balancing welfare

considerations with scientific inquiry. Comp Med 57:149–160.

Salahpour A, Ramsey AJ, Medvedev IO, Kile B, Sotnikova TD, Holmstrand E, Ghisi V,

Nicholls PJ, Wong L, Murphy K, Sesack SR, Wightman RM, Gainetdinov RR, Caron MG

(2008) Increased amphetamine-induced hyperactivity and reward in mice overexpressing

the dopamine transporter. Proceedings of the National Academy of Sciences 105:4405–

4410.

Schultz W (1998) Predictive reward signal of dopamine neurons. J Neurophysiol 80:1–27.

Schultz W, Apicella P, Ljungberg T (1993) Responses of monkey dopamine neurons to reward

and conditioned stimuli during successive steps of learning a delayed response task. J

Neurosci 13:900–913.

Schweri MM, Skolnick P, Rafferty MF, Rice KC, Janowsky AJ, Paul SM (1985) [3H]Threo-(+/-

)-methylphenidate binding to 3,4-dihydroxyphenylethylamine uptake sites in corpus

striatum: correlation with the stimulant properties of ritalinic acid esters. Journal of

neurochemistry 45:1062–1070.

Sutton MA, Beninger RJ (1999) Psychopharmacology of conditioned reward: evidence for a

rewarding signal at D1-like dopamine receptors. Psychopharmacology (Berl) 144:95–110.

Page 42: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

35

Taylor JR, Robbins TW (1984) Enhanced behavioural control by conditioned reinforcers

following microinjections of d-amphetamine into the nucleus accumbens.

Psychopharmacology (Berl) 84:405–412.

Torres GE, Amara SG (2007) Glutamate and monoamine transporters: new visions of form and

function. Current opinion in neurobiology 17:304–312.

Tsai H-C, Zhang F, Adamantidis A, Stuber GD, Bonci A, de Lecea L, Deisseroth K (2009)

Phasic firing in dopaminergic neurons is sufficient for behavioral conditioning. Science

(New York, NY) 324:1080–1084.

Tucci V, Hardy A, Nolan PM (2006) A comparison of physiological and behavioural parameters

in C57BL/6J mice undergoing food or water restriction regimes. Behav Brain Res 173:22–

29.

Ungerstedt U (1971) Stereotaxic mapping of the monoamine pathways in the rat brain. Acta

physiologica Scandinavica Supplementum 367:1–48.

Vanderschuren LJ, Di Ciano P, Everitt BJ (2005) Involvement of the dorsal striatum in cue-

controlled cocaine seeking. J Neurosci 25:8665–8670.

Volkow ND, Wang GJ, Telang F, Fowler JS, Logan J, Childress AR, Jayne M, Ma Y, Wong C

(2006) Cocaine cues and dopamine in dorsal striatum: mechanism of craving in cocaine

addiction. J Neurosci 26:6583–6588.

Voorn P, Gerfen CR, Groenewegen HJ (1989) Compartmental organization of the ventral

striatum of the rat: immunohistochemical distribution of enkephalin, substance P,

dopamine, and calcium-binding protein. J Comp Neurol 289:189–201.

Vorel SR, Ashby Jr. CR, Paul M, Liu X, Hayes R, Hagan JJ, Middlemiss DN, Stemp G, Gardner

EL (2002) Dopamine D3 receptor antagonism inhibits cocaine-seeking and cocaine-

enhanced brain reward in rats. J Neurosci 22:9595–9603.

Weiss F, Martin-Fardon R, Ciccocioppo R, Kerr TM, Smith DL, Ben-Shahar O (2001) Enduring

resistance to extinction of cocaine-seeking behavior induced by drug-related cues.

Neuropsychopharmacology 25:361–372.

White NM (1989) Reward or reinforcement: what’s the difference? Neurosci Biobehav Rev

13:181–186.

Whitelaw RB, Markou A, Robbins TW, Everitt BJ (1996) Excitotoxic lesions of the basolateral

amygdala impair the acquisition of cocaine-seeking behaviour under a second-order

schedule of reinforcement. Psychopharmacology (Berl) 127:213–224.

Page 43: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

36

Wolterink G, Phillips G, Cador M, Donselaar-Wolterink I, Robbins TW, Everitt BJ (1993)

Relative roles of ventral striatal D1 and D2 dopamine receptors in responding with

conditioned reinforcement. Psychopharmacology (Berl) 110:355–364.

Wyvell CL, Berridge KC (2000) Intra-accumbens amphetamine increases the conditioned

incentive salience of sucrose reward: enhancement of reward “wanting” without enhanced

“liking” or response reinforcement. J Neurosci 20:8122–8130.

Page 44: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

37

Chapter 7

Figure Captions

Figure 1: C57Bl/6 (n = 24) and CD1 (n = 24) mice both learn Pavlovian conditioned approach to a

reward location directed by a CS. A and C: Head entries made into the reward magazine upon CS

presentation increase over training compared to those made during a 5-second period just prior to CS

onset. B and D: Number of magazine entries upon saccharin presentation (US presentation, during the last

8s of the CS) increased while number of missed reward presentations decreased over training in both

strains.

Figure 2: Initial assessment of the effect of MPH on lever pressing and nosepoking for a CR in C57Bl/6

and CD1 mice. The number of CR and NCR responses was compared across saline, 2.5, 3.5 and 5 mg/kg

MPH. In both strains and operant response types, mice selectively responded for the CR. A and C: MPH

did not enhance lever pressing for a CR in C57Bl/6 mice (n = 12), while 3.5 mg/kg MPH enhanced CR

responding in CD1 mice (n = 12) relative to saline. B and D: No dose of MPH enhanced nosepoke

responding for a CR in either C57Bl/6 (n = 12) or CD1 (n = 12) mice. * Indicates p < 0.05 relative to

saline.

Figure 3: Extended assessment of responding for conditioned reinforcement in C57Bl/6 and CD1 mice

over 13 days of testing. A-D: In both strains, lever pressing and nosepoke (in each condition, n = 12)

responding for the CR remained significantly higher than NCR responding on all days.

Figure 4: Second assessment of the effect of MPH on lever pressing and nosepoking for a CR in C57Bl/6

and CD1 mice. The number of CR and NCR responses was compared across saline, 3.5 and 5 mg/kg

MPH. A and C: Both 3.5 and 5 mg/kg MPH slightly increased lever pressing for a CR in C57Bl/6 mice (n

= 12) and significantly enhanced CR responding in CD1 mice (n = 12). B and D: Neither dose of MPH

enhanced nosepoke responding for a CR in C57Bl/6 (n = 12) or CD1 (n = 12) mice. * Indicates p < 0.05

relative to saline.

Page 45: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

38

Figure 5: Phase 1 of response extinction: comparison of CR and NCR responding on lever press and

nosepoke operants over six days following removal from water restriction. A and C: Responding on the

CR lever remained significantly higher than on the NCR lever in C57Bl/6 mice (n = 12) following

removal from water restriction, while this appeared to result in more erratic responding by CD1 mice (n =

12). B and D: Nosepoke responding for a CR was resistant to removal from water restriction in both

C57Bl/6 (n = 12) and CD1 (n = 12) mice.

Figure 6: Phase 2 of response extinction: true extinction of conditioned reinforcement was achieved by

omitting CR presentations after responding. The number of CR and NCR responses were compared over

nine tests during which CR lever pressing or nosepoking had no programmed consequence. A and C: In

C57Bl/6 mice (n = 12), selective CR lever pressing decreased to NCR levels within three days of

extinction conditions, while CR lever pressing in CD1 mice (n = 12) dropped to NCR levels on the first

day of extinction. B and D: Nosepoking for a CR greatly decreased and approached NCR levels under

extinction conditions in both C57Bl/6 (n = 12) and CD1 (n = 12) strains, but remained significantly

higher during the nine-day period.

Figure 7: Locomotor stimulating effects of MPH. Locomotor activity was measured as the number of

beams broken in a 60-minute period after treatment with saline, 3.5, and 5 mg/kg MPH. Both C57Bl/6 (n

= 16) and CD1 (n = 16) mice exhibited significantly enhanced motor activation by 3.5 mg/kg of MPH,

which did not differ significantly from a 5 mg/kg dose. * Indicates p < 0.05 relative to saline.

Figure 8: Mice did not learn Pavlovian conditioned approach towards the CS used in experiment 1 when

it is explicitly unpaired with the saccharin reward. A and C: C57Bl/6 (n = 12) and CD1 (n = 12) mice did

not increase in number of head entries to a reward magazine during non-associated stimulus onset (when

reward is unavailable). B and D: Number of missed reward presentations declined in both strains of mice,

although at a less rapid pace in CD1 mice; mice increasingly obtain the rewards over training, the

presence of which are unsignalled.

Figure 9: In unpaired mice, responses on the CR lever were compared with the NCR lever over four days

of drug-free assessment of conditioned reinforcement. Neither C57Bl/6 (n = 12) nor CD1 (n = 12) showed

a preference for the lever that resulted in presentation of the non-associated stimulus.

Page 46: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

39

Figure 10: Mice overexpressing DAT exhibit normal Pavlovian conditioned approach to a reward-

predictive CS. Wild-type (WT, A; n = 10) and DAT overexpressing (DAT-Tg, B; n = 10) both increase

reward magazine entries during CS onset over the course of training relative to a 5s pre-CS period.

Figure 11: Mice overexpressing DAT exhibit normal responding for conditioned reinforcement.

Responding on the CR and NCR lever over three test days was compared between backgrounds of mice.

Both WT (A; n = 10) and DAT-Tg (B; n = 9) mice exhibited selective responding on the CR lever relative

to the NCR lever. The number of CR responses made on each day was also similar between WT and

DAT-Tg mice.

Figure 12: Effects of MPH on responding for a CR in DAT overexpressing mice. Responses made on the

CR and NCR levers were compared between WT and DAT-Tg mice following either saline or 5 mg/kg

MPH treatment. MPH did not significantly enhance responding for a CR in either WT (A; n = 9) or DAT-

Tg (B; n = 9) mice.

Figure 13: Effects of AMPH on responding for a CR in DAT overexpressing mice. Responses made on

the CR and NCR levers were compared between WT (A; n = 9) and DAT-Tg (B; n = 9) mice following

saline, 0.2, and 1 mg/kg AMPH. Administration of 1 mg/kg AMPH significantly attenuated responding

on both CR and NCR levers relative to saline and 0.2 mg/kg AMPH in both WT and DAT-Tg mice, while

0.2 mg/kg AMPH had no effect in either group of mice. * Indicates p < 0.05 relative to saline.

Page 47: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

40

Chapter 8

Figures

Figure 1

Page 48: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

41

Figure 2

Page 49: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

42

Figure 3

Page 50: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

43

Figure 4

Page 51: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

44

Figure 5

Page 52: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

45

Figure 6

Page 53: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

46

Figure 7

Page 54: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

47

Figure 8

Page 55: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

48

Figure 9

Page 56: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

49

Figure 10

Page 57: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

50

Figure 11

Page 58: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

51

Figure 12

Page 59: Acquisition and Responding for Conditioned …...operant responses which result in its presentation (Bugelski, 1938; Taylor and Robbins, 1984). Considering this CR was previously a

52

Figure 13