neural processes for learning and monitoring sequential
TRANSCRIPT
Neural processes for learning andmonitoring sequential regularities in
changeable environments.
A Dissertation
Presented to
The Faculty of the Graduate School of Arts and Sciences
Brandeis University
Psychology
Robert Sekuler, Department of Psychology, Advisor
In Partial Fulfillment
of the Requirements for the Degree
Doctor of Philosophy
by
Abigail LaBombard Noyce
February, 2014
The signed version of this form is on file in the Graduate School of Arts and Sciences.
This dissertation, directed and approved by Abigail LaBombard Noyce’s committee, has been
accepted and approved by the Graduate Faculty of Brandeis University in partial fulfillment of the
requirements for the degree of:
DOCTOR OF PHILOSOPHY
Malcolm Watson, Dean of Arts and Sciences
Dissertation Committee:
Robert Sekuler, Department of Psychology, Chair
Angela Gutchess, Department of Psychology
Paul DiZio, Department of Psychology
Marc Howard, Department of Psychology, Boston University
©Copyright by
Abigail LaBombard Noyce
2014
Acknowledgments
First, I need to thank my advisor, Robert Sekuler, for my education in psychophysics, cognitive
research, academic writing, and being a scientist. His curiosity, enthusiasm, and high standards
have shaped this dissertation and my entire graduate career. Without his guidance and support this
document would not have been possible.
My committee members, Paul DiZio, Angela Gutchess, and Marc Howard, have been generous
with their time, their expertise, and their enthusiasm for my work. I am grateful for all of these
things, particularly that last.
My vision lab colleagues, past and present, have celebrated victories and commiserated over
defeats. Lisa Payne, Sujala Maharjan, Yile Sun, Avigael Aizenman, Chad Dube, Nichola Cohen,
Sylvia Guillory, Heather Sternshein, Jessica Maryott, and Kristina Visscher were there when I
needed them. A particular thank you to my undergraduate RAs: Erika Bell, Stephanie Bond, Brian
Dorfman, Arielle Keller, Rebecca Meyer.
At the University of New Hampshire, William Stine and Jill McGaughy helped me discover
that I like research, and Anne Gilman treated me like a grown-up scientist before anybody else did.
More recently, my ITFF colleagues have provided cheerleading, wisdom, and hilarity.
I am blessed with the most excellent family, by birth, by marriage, and by choice. My parents
and siblings, my in-laws, and my friends have been understanding, interested, excited, and proud.
There are too many of them to list, but I am deeply grateful. Finally, I need to thank my beloved
iv
husband Daniel Noe, who has been picking up my slack for the last half-decade, and who has
always known I could do this.
v
AbstractNeural processes for learning and monitoring sequential regularities in
changeable environments.A dissertation presented to the Faculty of
the Graduate School of Arts and Sciences ofBrandeis University, Waltham, Massachusetts
by Abigail LaBombard Noyce
The world is largely stable and predictable. Humans and other organisms are sensitive to that
stability, and use it to support cognitive processes. This work consists of a series of studies that
explore how humans learn about such stability, use that information to generate predictions about
forthcoming sensory input, and detect when such predictions are inadequate. First, I present a
modeling study that quantified the distributions of errors that people make on a complex, visuomotor
sequence learning task, and examine the serial position dynamics of several parameters describing
short-term visual memory. Both precision and capacity for these sequences increases with familiarity,
and the worst-represented items show the largest increases. Next, I present an experiment that used
the same task to understand the effects of deviant items within familiar sequences. By measuring
ERPs to new, familiar, and deviant items, I dissociate the neural activity associated with detecting a
deviant from that associated with encoding task-relevant stimulus characteristics. Finally, I present
an experiment investigating the role of prediction in a task that is stochastic, rather than sequential,
and in which deviant events occurred among the distractors rather than among the task-relevant
stimuli. Unexpected events among the distractors seem to obligatorily attract attention, enhancing
or impairing performance. Further, I show that the individual differences in the neural response to
such unexpected events is predicted by temperament. Together, these studies illuminate how the
brain learns about predictability in a range of settings, and leverages such predictability to facilitate
cognition.
vi
Contents
Abstract vi
1 Introduction 11.1 Sequences in short-term memory . . . . . . . . . . . . . . . . . . . . . . . . . . . 21.2 Dissociating new and deviant events in a dynamic environment . . . . . . . . . . . 31.3 Neural and behavioral effects of implicit prediction monitoring . . . . . . . . . . . 6
2 Dynamic reallocation of VSTM resources 82.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82.2 Methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 112.3 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 202.4 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
3 P300 to violations of newly learned predictions 453.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 453.2 Methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 473.3 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 553.4 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61
4 Leaky ignoring in the flanker task 654.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 654.2 Methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 674.3 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 744.4 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84
5 Conclusion 875.1 Further evidence for human sensitivity to environmental structure and predictability 885.2 Future work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
References 93
vii
Chapter 1
Introduction
The nervous systems of humans (and many other organisms) are exquisitely tuned to patterns and
structure in sensory inputs. This tuning is generally adaptive , as the world contains a great deal of
sequential regularity—cases where one sensory event reliably predicts a following event. These
regularities extend from simple, predictable inputs, such as background noise or the sensation
produced by a surface on which one is sitting, all the way to elaborately structured regularities such
as the syntax of language, or the key of a piece of music.
The brain uses these regularities to increase the efficiency and efficacy of perceptual and
cognitive processes. Reports of visual stimuli held in short-term memory are biased towards the
average or most-common stimulus characteristics (Dube & Sekuler, 2012; Huang & Sekuler, 2010a,
2010b; Payne, Guillory, & Sekuler, 2013). Such bias occurs even when the characteristics of a
currently-visible stimulus must be reported (Chalk, Seitz, & Series, 2010). In tasks involving natural
scenes, perception is biased towards stimuli that are consistent with the surrounding context, as
seen by facilitated speed and accuracy of identifying such stimuli (e.g. Bar, 2004; Biederman,
Mezzanotte, & Rabinowitz, 1982; Kersten, Mamassian, & Yuille, 2004; Gronau, Neta, & Bar, 2008;
Mandler, 1980), and in tasks involving faces and attributes, people are faster and more accurate when
1
CHAPTER 1. INTRODUCTION
the physical features of the face are congruent with the trait being described (Cassidy, Zebrowitz,
& Gutchess, 2012). Cross-modally, visual input can impair or enhance speech perception, as in
the McGurk effect, or when auditory input is embedded in noise (Guttman, Gilroy, & Blake, 2005;
Zion Golumbic, Cogan, Schroeder, & Poeppel, 2013). Perception is also facilitated by occurring
at a predictable time. The brain appears to modulate the signal-to-noise gain of the visual system
when the timing of upcoming stimuli is known (Cravo, Rohenkohl, Wyart, & Nobre, 2013).
Regularities are also able to guide attention. It is well documented that attentional cueing can
facilitate detection and processing of cued stimuli (e.g Posner, 1980; Posner, Snyder, & Davidson,
1980; Huang & Sekuler, 2010a, 2010b). Such cueing depends on the expectation that cues are
predictive, which allows the subject to allocate attention appropriately (C. Summerfield & Egner,
2009). However, such attentional shifting does not require explicit cues. Structured scenes enable
the allocation of attention to appropriate objects (J. J. Summerfield, Lepsien, Gitelman, Mesulam,
& Nobre, 2006), and even when temporal structure is irrelevant, it triggers spatial and featural
attentional capture (Le Pelley, Vadillo, & Luque, 2013; Zhao, Al-Aidroos, & Turk-Browne, 2013).
Anticipation and predictions derived from such regularities also manifest in anticipatory eye
movements (Barnes, 2008; Burke & Barnes, 2007; Elsner, Falck-Ytter, & Gredeback, 2012; Kowler,
1989; Maryott, Noyce, & Sekuler, 2011; Rotman, Troje, Johansson, & Flanagan, 2006) and motor
control schema (Cohn, DiZio, & Lackner, 2000; Hu, 2010; Pigeon, Bortolami, DiZio, & Lackner,
2003). The studies that I report in this dissertation improve our understanding of the neural
mechanisms that underly anticipatory and predictive effects on cognition.
1.1 Sequences in short-term memory
Cognitive skills ranging from language and navigation to visually guided manipulation of tools and
devices depend upon an ability to represent the sequential order of events. In particular, humans
2
CHAPTER 1. INTRODUCTION
frequently need to maintain a sequence of events in short-term memory for subsequent analysis
or reproduction. Previous work with serial recall tasks has demonstrated that sequences held in
memory are susceptible to strong primacy and recency effects (e.g. Agam, Bullock, & Sekuler,
2005; Kahana & Jacobs, 2000; Lashley, 1951; Maryott et al., 2011), and may depend on chunking
adjacent items within a sequence (Farrell, Hurlstone, & Lewandowsky, 2013; Hitch, Fastame,
& Flude, 2005; Maryott & Sekuler, 2009). Although much of the classic work on learning and
short-term memory for sequences has used lists of verbal stimuli (e.g. Ebbinghaus, 1913; Hebb,
1961; Hitch et al., 2005; Howard & Kahana, 1999; Kahana & Jacobs, 2000), studying short-term
memory for visual stimuli has some pronounced advantages. Verbal (and verbalizable) stimuli
can be tightly compressed in short-term memory; further, subjects can actively rehearse a verbal
representation, refreshing it over the course of retention with very little loss of fidelity (Ricker &
Cowan, 2010).
The graded fidelity with which subjects can reproduce previously-seen visual stimuli is ideal for
testing the relationships between precision and capacity limits in short-term memory for sequences
of visual stimuli. The study presented in Chapter 2 investigates how familiarity changes the precision
and effective capacity of short-term memory. To do this, I quantified the distributions of errors that
people make on a visuomotor sequence learning task.
1.2 Dissociating new and deviant events in a dynamic environ-
ment
As described above, humans use sequential regularities in the environment to support a wide range
of cognitive processes. These effects depend upon the brain’s ability to work in a predictive,
anticipatory way, using context, prior knowledge, and previous experiences to build expectations
3
CHAPTER 1. INTRODUCTION
about forthcoming sensory inputs. However, to navigate and manage a dynamic environment, people
cannot blindly follow those expectations. Otherwise, we may step onto a non-existent stair, or walk
into a previously-safe location only to be attacked by an unexpected foe. Monitoring mechanisms
are required to detect events that deviate from the previously established regularities, allowing
people to change planned behaviors and learn about the new state of the world.
Monitoring processes are evident in tasks with an explicitly sequential structure, such as the
serial reaction time task (Nissen & Bullemer, 1987). In this task, various stimuli on the screen
cue subjects to press a particular key, and the sequence of stimuli (and thus key presses) repeats
throughout the course of the experiment. When a stimulus that is not part of the usual sequence
occurs, people are slower and less accurate to respond to it (Ferdinand, Mecklinger, & Kray, 2008;
Gobel, Sanchez, & Reber, 2011; Russeler & Rosler, 2000). These tasks do not require explicit
memory and encoding for the sequence of stimuli, and moreover, the sequence remains stable over
the course of a relatively-long experimental block.
When a subject’s expectation is specifically about a reward value, the relationship between
expectation and actual reward is coded by dopaminergic cells in the ventral tegmental area (Schultz,
2006; Schultz, Dayan, & Montague, 1997). Rewards that are lower than the predicted reward
lead to a decrease in dopamine release; rewards that are greater than the predicted reward lead
to an increase (Schultz et al., 1997). This dopaminergic signal is generally thought to be a key
driver of learning from outcomes that are better or worse than expected (e.g. Schoenbaum, Esber,
& Iordanova, 2013; Takahashi et al., 2009, 2011), but it’s not known whether it is appropriate
to extrapolate such mechanisms to predictions that are not explicitly about reward. (See Kakade
and Dayan (2002) for an argument that novelty is intrinsically rewarding, leading to dopaminergic
spiking, and Laurent (2008) for a counter.)
One area involved in prediction monitoring regardless of the presence of reward is the anterior
cingulate cortex (Botvinick, Cohen, & Carter, 2004; Kennerley, Behrens, & Wallis, 2011). The
4
CHAPTER 1. INTRODUCTION
anterior cingulate has strong links to frontal and prefrontal cortical areas, to sensory cortices, and
to the limbic system (Ridderinkhof, Ullsperger, Crone, & Nieuwenhuis, 2004). Its activation to
new information in the environment is accompanied by the P3 ERP component, a positive-going
deflection over central electrodes from 300–500 ms after such information is presented (Crottaz-
Herbette & Menon, 2006; Linden, 2005). ACC activation (and the P3) have been observed in a
wide range of cognitive contexts, and are theorized to reflect a cognitive control process that is
responsible for developing and adjusting the working memory representation of the context, task set
or forthcoming actions (Goldstein, Spencer, & Donchin, 2002; Crottaz-Herbette & Menon, 2006;
Nieuwenhuis, Aston-Jones, & Cohen, 2005; Ridderinkhof et al., 2004). A similar theory is proposed
by Holroyd and Yeung (2012), who assert that the ACC’s primary role is to hold and monitor the
options available for behavior. The P3 has at least two subcomponents: the Novelty P3, elicited by
stimuli that violate expectations, and the P300, elicited by task-relevant stimuli (Goldstein et al.,
2002; Linden, 2005; Nieuwenhuis et al., 2005; Polich, 2007).
Prediction monitoring processes’ main role may be to guide learning (Maier & Steinhauser,
2013). Pearce and Hall (1980) argue that classical conditioning depends on the unconditioned
stimulus being unpredictable, while Wills, Lavric, Croft, and Hodgson (2007) demonstrate that
associative learning occurs most rapidly for cues that remain predictive, but whose outcome changes
and Ouden, Friston, Daw, McIntosh, and Stephan (2009) show that when auditory distractors predict
visual distractors, the neural response to such predicted events decreases across trials. Similarly, in
motor control tasks, subjects learn from the mismatch between the expected motor output and the
movement that actually occurs (Cohn et al., 2000; Lackner & DiZio, 2005), but not from explicit
knowledge about the state of the world (Hu, 2010).
The experiment reported in Chapter 3 investigates the neural processes that support (i) explicit
sequential encoding and (ii) monitoring such sequences for deviant events. Using the visuomotor
sequence learning task described in Chapter 2, I measured ERPs to new, familiar, and deviant
5
CHAPTER 1. INTRODUCTION
sequence items. By comparing ERPs to new and to deviant items, I dissociated the neural activity
related to triggering a prediction-monitoring mechanism from that related to encoding a new
stimulus exemplar. I hypothesized that both new and deviant items would enhance a P3 ERP, but
that new items would elicit only an enhanced P300, while deviant items would also elicit a Novelty
P3. These results show that the theoretical structure surrounding P3 generation is also applicable to
dynamic environments where governing regularities are frequently changing.
1.3 Neural and behavioral effects of implicit prediction moni-
toring
While the task in Chapters 2 and 3 has sequential structure that subjects are explicitly aware of,
predictive and monitoring processes occur even when subjects are not attending to the governing
regularity. In Chapter 4, I looked for behavioral and neural evidence that people are sensitive to
regularities in peripheral stimuli that they are actively attempting to ignore.
Previous work in associative-learning contexts demonstrated that unexpected outcomes trigger
the allocation of attention (Wills et al., 2007). Further, deviant events in a stream of unattended
auditory stimuli capture attention, leading to poorer detection of targets in a simultaneously-
presented stimulus stream (Schroger, 1996), and deviant tones (Escera, Alho, Winkler, & Naatanen,
1998; Nostl, Marsh, & Sorqvist, 2012; Parmentier, Elford, Escera, Andres, & San Miguel, 2008) and
vibro-tactile stimuli (Parmentier, Ljungberg, Elsley, & Lindkvist, 2011; Ljungberg & Parmentier,
2012) preceding a visual target delay target detection.
The mismatch negativity (MMN) is a negative-going deflection in the ERP elicited by a stimulus
that violates some governing regularity. While it is historically studied in the auditory domain, the
existence of a visual mismatch negativity (vMMN) has recently been established (Czigler, 2007;
6
CHAPTER 1. INTRODUCTION
Kimura, Schroger, & Czigler, 2011; Stefanics, Kimura, & Czigler, 2011). The MMN is believed
to be generated by interactions between a predictive signal from higher-order (likely prefrontal)
areas and the bottom-up input reaching sensory cortex (Garrido, Kilner, Stephan, & Friston, 2009;
Wacongne, Changeux, & Dehaene, 2012; Wacongne et al., 2011; Winkler, 2007).
To investigate the potential attentional capture of deviant events in a task that requires stimulus
discrimination rather than mere target detection, I drew upon the Eriksen flanker task, an experi-
mental paradigm that was designed to elicit interactions between multiple, conflicting sources of
visual information (Eriksen & Eriksen, 1974). A single target, which can generally be in one of
two states, is accompanied by flanking distractors, which can match or differ from the target. The
congruency between the distractors and the target influences the accuracy and reaction time with
which subjects can report the state of the target (e.g Eriksen & Eriksen, 1974; Ridderinkhof, Band,
& Logan, 1999).
The experiment presented in Chapter 4 examined how predictability of distractors influenced
cognitive processing in the flanker task, and the relationship between such influences and the ERPs
elicited by onset of standard and deviant distractors.
7
Chapter 2
Dynamic reallocation of VSTM resources
2.1 Introduction
Holding visual information in short-term memory is critical in many daily tasks. As researchers
have attempted to characterize the structure of visual short-term memory (VSTM), it has become
necessary to understand how resources are allocated among multiple items that are held simultane-
ously in memory. Here, I investigate how such allocation changes as people gain experience with
the items. Real-life use of visual short-term memory almost always interacts with learning and
long-term memory.
Recent work attempting to quantify the allocation of VSTM resources among items has drawn
on signal-detection approaches to understanding cognition. This approach assumes that the rep-
resentations of items in memory are noisy, or otherwise imperfect, and attempts to characterize
that imperfection. The most common method has been to ask subjects to reproduce from memory
some characteristic of an item, measure the degree to which their reproduction differs from the
initial stimulus, and use the resulting distribution of errors as a proxy for the noise of the memory
representation (e.g. Wilken & Ma, 2004; Zhang & Luck, 2008).
8
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
A family of related models has been proposed to characterize such error distributions. The
earliest had two parameters: one describing the probability that subjects responded randomly,
presumably due to guessing, rather than by reporting the probed item; the other describing the
spread of responses around the true value (Zhang & Luck, 2008, 2009, 2011). Another group
extended this basic model to include the possibility that subjects report the identity of a non-probed
item from the memory set, leading to a third parameter: the probability of a non-target report (Bays,
Catalao, & Husain, 2009; Bays, Gorgoraptis, Wee, Marshall, & Husain, 2011; Bays, Wu, & Husain,
2011; Gorgoraptis, Catalao, Bays, & Husain, 2011; Zokaei, Gorgoraptis, Bahrami, Bays, & Husain,
2011). Finally, recent work has moved towards fitting the error distributions with a model in which
the amount of resources allocated to each object may vary from trial to trial (van den Berg, Shin,
Chou, George, & Ma, 2012; Fougnie, Suchow, & Alvarez, 2012).
While the majority of these experiments have used “snapshot” displays, in which an array
of items are briefly presented and then one item is probed, a handful of notable exceptions have
controlled for the possibility that this approach may confound encoding bottlenecks, such as
perceptual crowding, with limits on VSTM capacity (Bays et al., 2009). For example, Emrich
and Ferber (2012) and Gorgoraptis et al. (2011) both showed that inter-item confusion (i.e. non-
target reports) increases when items are shown in succession, but precision and the rate of random
responses remain constant. However, both of these studies used arrays of static stimuli and compared
a “snapshot” display to a series of two or three displays each containing a subset of the items. The
static nature of their stimuli leave open questions about VSTM structure for dynamic stimuli.
Further, none of the previous work with this family of models has investigated the effects of
learning on VSTM representations of information. It is well-established that memory accuracy
improves as items become more familiar (e.g. Ebbinghaus, 1913; Hebb, 1961; Howard & Kahana,
1999; Maryott et al., 2011), presumably due to increased support from long-term memory. However,
it is unknown how such support changes the distribution of errors, or the probability that a non-target
9
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
is reported.
Sekuler, Siddiqui, Goyal, and Rajan (2003) and Agam et al. (2005) established a VSTM memory
task with the dynamic properties that have been lacking in the previous modeling work. In their task,
subjects observed and reproduced motion trajectories enacted by a disk on a computer screen. The
stimuli themselves comprised position changes over time, and each segment of a motion trajectory
was linked with its successor. Agam et al. put forth a single, scalar measure of reproduction accuracy
that has thus far been used to describe memory performance in this task. Several following papers
have used that measure to characterize the effects of learning on such performance (Agam, Galperin,
Gold, & Sekuler, 2007; Agam, Huang, & Sekuler, 2010; Maryott et al., 2011). However, the body
of VSTM modeling work described above has made it clear that memory errors may arise from
multiple sources: An item may be successfully retrieved from memory, but with added noise; an
item may fail to be encoded into memory, or fail to be retrieved; or an item may be mis-bound with
its location in the stimulus set, leading to non-target reports. The measure used in these previous
studies does not distinguish between these sources of recall errors.
Here, I characterize familiarity-induced changes in VSTM representations by fitting four related
VSTM models to responses generated on the visuomotor sequence learning task introduced by
Agam et al. (2005). I fit each model to errors generated at each level of increasing familiarity in
order to assess how model parameters changed as subjects learned a sequence. Using the resulting
serial position dynamics, I then characterized the effects of learning on VSTM characteristics such
as precision and number of items stored. The results presented in this chapter show that VSTM has
a remarkable ability to reallocate memory resources over the course of learning.
10
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
2.2 Methods
To investigate the effects of learning on visual short-term memory, I fit several models to data from
a motion trajectory learning task. These data were previously collected by myself and others to
characterize the effects of learning on memory for non-verbal sequences, using a single measure
of accuracy (mean absolute error; Agam et al., 2005), and to investigate neural correlates of such
learning (see Agam et al., 2007; Maryott et al., 2011; Noyce & Sekuler, in revisions). All subjects
(N = 64) were young adults recruited from the Brandeis University community. Re-analyzing this
data with multiple parameters, as I have done here, leads to new insights into the effects of learning
on memory for these sequences.
2.2.1 Experimental task and accuracy measures
The data were collected from a task in which subjects observed and reproduced psuedo-random
motion trajectories. Each trajectory was presented multiple times in succession, and each subject
saw approximately 75 such trajectories. The details of the task varied slightly between experiments;
Table 2.1 summarizes the relevant details for each dataset.
Figure 2.1 illustrates the sequence of events within one presentation of a trajectory. On each
presentation, a small disk traversed a path comprising either five or six connected linear motion
segments. After each segment, the disk paused briefly before resuming its motion in a changed
direction. The disk then disappeared from view. After a retention interval, a second disk appeared
at a random point within the central portion of the display, cueing the subject to move a handheld
stylus over the surface of a graphics tablet in order to reproduce from memory the sequence of
disk motions that had just been seen. During the reproduction, the disk’s motion was yoked to the
movement of the stylus’ tip on the graphics tablet. No other feedback was provided. Note that
neither the stimulus nor the reproduction disk left a visible trail while moving across the computer
11
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
Stimulus Cue Stimulus Trajectory Retention Interval Reproduction Cue Reproduction
0.60 s 4.50 s 3.75 s
Figure 2.1: One presentation of a motion sequence stimulus. At the start of each presentation, adisk appeared at the center of the display before beginning to move in a series of either five or sixconnected linear segments without leaving any visible trail. After enacting the movements, the diskdisappeared from view. After a retention interval, a second disk appeared, signaling the subjectto begin reproducing from memory the path that had been traveled by the first disk. Each trialconsisted of either four or five such presentations.
display. To obtain a complete representation of the disk’s path, subjects would have to extract it
from the disk’s motions, and maintain it in short-term memory until reproduction was required
(Geisler, Albrecht, Crane, & Stern, 2001; Jancke, 2000).
Each trial’s quasi-random sequence of motion segments was generated by the algorithm de-
scribed by Agam et al. (2005). The direction of a sequence’s initial motion was chosen randomly,
and the magnitude of the direction change at each “corner” of the trajectory was between 30 and
150. Changes in direction could be clockwise or counter-clockwise, with equal probability. The
motions comprising a sequence were constrained by several additional rules: Motion segments were
not permitted to intersect, could not come within one-half a segment’s length of intersecting, nor
could they extend beyond the boundaries of the visible display area.
The fidelity of each reproduction was quantified offline by means of an algorithm that used
the subject’s pauses and direction changes to divide the reproduction into segments (Agam et al.,
2005; Maryott & Sekuler, 2009; Maryott et al., 2011). After segmentation, the algorithm estimated
the direction of each such segment by fitting a line to its beginning and end points. Reproduction
accuracy was then quantified by directional error: the angular difference between the direction of a
12
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
Table 2.1: Details and sources of the datasets included in these analyses.
Subjects Trials Published in
5 segments and4 presentations
12 76 Agam et al. (2007)
12 88 Noyce and Sekuler (in revisions); presented in Chapter 3
9 88 Noyce and Sekuler (unpublished)
6 segments and5 presentations
12 76 Agam et al. (2007)
8 72 Maryott et al. (2011)
11 72 Maryott et al. (2011)
motion segment in the reproduction and the direction of the corresponding segment in the stimulus
model. (See Agam et al. (2005) for a comparison of scalar measures for accuracy on this task.)
On every trial, a unique trajectory was presented either four or five times (see Table 2.1),
with subjects reproducing the sequence after each such presentation. This allowed me to measure
familiarity-induced changes in the quality of the reproduction.
2.2.2 Datasets
Table 2.1 identifies the sources and details of the behavioral data used to fit a family of four related
models. Four of the datasets were previously published using mean absolute direction error to
characterize learning. Each dataset is from an experiment with between 8 and 12 subjects and
approximately 75 trials per subject. Three datasets are from experiments using five-segment motion
trajectories with each trajectory repeated four times; three are from experiments using six-segment
trajectories with each trajectory repeated five times.
The precise size and timing of stimuli varied somewhat across the six studies. Each segment
of a trajectory took 500-525 ms; pauses at the corners of segments lasted 225-450 ms. Stimuli
were thus 3400-5400 ms in length. During one experiment, subjects were instructed to track the
13
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
stimulus disc with their eyes (Maryott et al., 2011, Experiment 1), in two others they were instructed
to maintain fixation at the center of the screen (Noyce & Sekuler, in revisions; Noyce & Sekuler,
unpublished), and in the remaining three they were given no instructions about eye movements
(Agam et al., 2007, Experiments 1 and 2; Maryott et al., 2011, Experiment 2). Depending on
the eye movement instructions, the size of the stimulus varied, such that each segment traversed
4.5(tracking), 1(fixation), or 1.5(free eye movements) visual angle.
2.2.3 Models
I fit four members of a family of VSTM models to data collected from a visual sequence-learning
task, and assessed the ways that the models’ parameters varied with serial position and repeated
presentation.
Model 0 : Slots Mixture Model
The simplest class of models I considered was based on the one described by Wilken and Ma (2004)
and Zhang and Luck (2008). Here, the probability that subjects make a response error x is given by:
P (x) = (1− g)×M(0,SD)(x)︸ ︷︷ ︸responses from memory
+ g × U(x)︸ ︷︷ ︸guess responses
where M(µ,σ)(x) is the probability density function of the von Mises distribution with mean µ and
standard deviation σ, as evaluated at x and U(x) is the probability density function of a uniform
distribution, evaluated at x.
This model predicts a distribution of response errors that comprises the sum of a von Mises
(circular Gaussian) distribution with standard deviation SD and mean zero, and a uniform dis-
tribution. g specifies the proportion of the distribution that is from the uniform component; the
14
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
remainder is from the von Mises component. In other words, SD relates to the precision of an item’s
representation in working memory, and g relates to the probability that the item is in memory at all.
Model 0 required each of these parameters to be fixed across serial positions, to allow testing
of the hypothesis that VSTM for items does not vary with their serial position. Zhang and Luck
(2008, 2009, 2011)’s conceptualization of VSTM as a set of two to four slots specifies that memory
resources are allocated in fixed quanta, and cannot accommodate effects of serial position on
memory accuracy.
Model 1: Standard Mixture Model
Model 1 allows both SD and g to vary across serial positions. The probability of a response error x
is given by the same equation as in Model 0, except that this distribution is computed independently
for each serial position on each repeated presentation. Previous analyses of these data have shown
strong serial-position effects on accuracy, and so this and all subsequent models allow parameters
to vary with serial position.
Model 2: Standard Mixture Model With Swaps
The third model in the family was described by Bays et al. (2009). This model extends the Standard
Mixture Model to account for the possibility that subjects report non-target items from a stimulus
display, rather than making random errors. Remember that “random” here means that subjects
selected a response without being affected by the item they’re supposed to be reproducing, nor by
any other items in the stimulus set. In Model 2, the probability that subjects make a response error
15
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
x is given by:
P (x) = (1− g −B)×M(0,SD)(x)︸ ︷︷ ︸responses from memory
+ g × U(x)︸ ︷︷ ︸guess responses
+D∑i=1
(B
D×M(di,sd)(x)
)︸ ︷︷ ︸
inter-item confusion responses
with distribution terms as above; di refers to the ith other segment in the trajectory, of D total other
segments. (These are equivalent to “distractors” in the snapshot displays.)
This model incorporates information about the other segments in the trajectory, and produces
a distribution of response errors that comprises the sum of a uniform distribution with several
Gaussian distributions. With probability B, responses are drawn from a von Mises distribution
centered on one of the other segments in the display (each other segment is equiprobable); with
probability g, responses are drawn from the uniform distribution; with probability (1 − g − B),
responses are drawn from the von Mises distribution centered on the target.
Note that my implementation of this model does not restrict to “true swaps” among items. That
is, I don’t restrict to cases where item A is reported as item B and item B is reported as item A. The
swaps are estimated for each serial position independently. This allows the possibility that subjects
report one item at more than one serial position.
Model 3: Variable Precision Model
The final of my family of models is based on those described by Fougnie et al. (2012) and van
den Berg et al. (2012). Instead of fixing a precision parameter, this model allows it to vary across
trials. It is plausible that the allocation of memory resources among items, and even the amount of
memory resources available to encode the set, may vary across trials. Thus, the probability of a
16
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
response error x on the ith trial is given by:
P (x) = (1− g)×M(0,sdi)(x)︸ ︷︷ ︸responses from memory
+ g × U(x)︸ ︷︷ ︸guess responses
where sdi is drawn from the Gamma distribution with scale:
θ =2sdSD2
modeSD +√
modeSD2 + 4sdSD2
and shape:
k = 1 + modeSD +1
θ
The parameters that are fitted here are g, the probability that responses are from the uniform
distribution, and modeSD and sdSD which characterize the Gamma distribution from which
precision values are drawn on any given trial. High modeSD values characterize a distribution
“centered” around less-precise representations, while high sdSD values characterize a distribution
with a lot of variability in precision values.
This model produces a distribution of responses that comprises the sum of a uniform distribution
and many von Mises distributions with mean 0 and varying SDs. The resulting distribution is
“peakier” than that of the Standard Mixture Model.
2.2.4 Model fitting
Models were specified and fitted using the MemToolbox (Suchow, Brady, Fougnie, & Alvarez,
2013) for Matlab (The MathWorks, In., Natick, MA). Within each dataset, I fitted each model
to the distribution of responses produced at each serial position on each repeated presentation.
For example, for the five-segment datasets, I fit each model independently to five segment serial
positions on each of four repeated presentations. The model fitting protocol used an iterative
17
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
maximum likelihood procedure, which started three searches at different locations within the
parameter space, adjusted the parameters until a local maximum was reached, and ran until the three
searches converged. Further, I took advantage of MemToolbox’s hierarchical fitting abilities, which
constrain the parameters by requiring that each fitted parameter be normally distributed across
the subjects in each dataset. This helps keep the fitting procedure from converging on extremely
unlikely values, a trait that was particularly important because of the noisiness of these data, and
because of the correlations among model parameters.
2.2.5 Measures
After fitted model parameters were calculated, I used two measures to summarize familiarity’s
influence on the representation of items in VSTM. First, to characterize the overall trend in each
parameter, I took its mean fitted value across serial positions, on each repeated presentation.
Second, to characterize the magnitude of serial position effects, I took the standard deviation of the
parameter’s values at each serial position.
To test the significance of these summary measures’ changes across presentations, I used a
permutation test. On each of 10,000 iterations, each subject’s fitted parameter values were randomly
allocated among segments and presentations. The two summary measures described above were
calculated subjectwise for each simulated presentation. The slope of the best-fitting line over serial
positions was calculated, giving a distribution of 10,000 such slopes for each measure and parameter.
Familiarity-induced changes in parameter values were then tested against these distributions.
I used two measures to compare models. First, the Akaike information criterion (AIC), given by
AIC = 2k − 2 ln(L)
where k is the number of parameters in the model, and L is the likelihood of the fitted model.
18
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
Because the models were fitted using maximum likelihood estimation, AIC is maximized in the
fitting process. I therefore also compared models using a statistic that will behave somewhat
differently, the Kuiper statistic (Kuiper, 1960). Unlike measures such as likelihood and root
mean squared error, the Kuiper statistic is intended for comparing continuous, rather than discrete,
distributions. It is related to the better-known Kolmogorov-Smirnov (K-S) measure, but is more
sensitive at the tails of the distribution and is cyclicly invariant (and so better suited for assessing
probability distributions in circular space). The Kuiper statistic consists of the empirically-obtained
cumulative distribution function’s maximum distance above the model cumulative distribution
function, added to the empirical cumulative distribution function’s maximum distance below the
model cumulative distribution function.
Kuiper = max(cdfmodel − cdfdata) + max(cdfdata − cdfmodel)
For each pairwise comparison between models, I bootstrapped a distribution of mean difference
in Kuiper statistics and mean difference in AIC. For each subject, I averaged the probability density
functions of the two models being compared, and used the resulting distribution to generate two
samples of trials. I calculated the Kuiper statistic and AIC for each sample, took the differences,
and averaged those differences across my 64 subjects. I thus derived a distribution of 500 such
differences that arose when the null hypothesis was true and the samples were drawn from the
same distribution. Comparing my empirically-obtained Kuiper statistics and AIC differences to this
distribution allowed me to obtain p-values.
19
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
Table 2.2: Results of repeated-measures ANOVAs on mean absolute directional error, with factorspresentation and serial position.
five-segment data six-segment data
parameter source df1, df2 F partial η2 df1, df2 F partial η2
accuracy
presentation 3, 96 77.435∗∗∗ .708 4, 120 156.237∗∗∗ .839
serial position 4, 128 44.036∗∗∗ .579 5, 150 57.457∗∗∗ .657
pres. × serial pos. 12, 384 5.655∗∗∗ .150 20, 600 15.708∗∗∗ .344
∼p < .1. ∗p < .05. ∗∗p < .01. ∗∗∗p < .001.
2.3 Results
In this section, I first present the actual accuracy data, then fitted parameters from each model, and
finally comparisons between the four models I tested.
2.3.1 Accuracy
Figures 2.2 and 2.3 show the distributions of errors that subjects make while performing this task, for
the five-segment and six-segment data. Subjects’ mean absolute directional error on each segment
was 21.66 (SD = 6.46).
Separate two-way repeated-measures ANOVAs on the five-segment and six-segment data
confirmed a significant main effect of presentation (p < .001), a significant main effect of serial
position (p < .001), and a significant interaction between the two (p < .001). The full results are
shown in Table 2.2.
20
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
Segment serial position
Presentation 40.2
0
Presentation 10.2
0
0.2
0
Presentation 30.2
0
0 180 0 180 0 180 0 180 0
1 2 3 4 5
Presentation 2
Prop
ortio
n of
erro
rs
Figure 2.2: Histograms showing the distributions of errors generated by 33 subjects on the five-segment task. Each row corresponds to a presentation; each column corresponds to a serial position.Distributions generated on early presentations are short, squat, and kurtotic, especially at thepenultimate serial position; distributions generated on late presentations are tall and narrow.
21
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
Segment serial position
Presentation 40.2
0
Presentation 10.2
0
0.2
0
Presentation 30.2
0
0 180 0 180 0 180 0 180 0
1 2 3 4 5
Presentation 2
Prop
ortio
n of
erro
rs
6
0.2
0
Presentation 5
180 0
Figure 2.3: Histograms showing the distributions of errors generated by 31 subjects on the six-segment task. Each row corresponds to a presentation; each column corresponds to a serial position.Distributions generated on early presentations on short, squat, and kurtotic, especially at thepenultimate serial position; distributions generated on later presentations are tall and narrow.
22
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
0
0.1
0.2
Fitte
d g
valu
es
10
20
30
40
Fitte
d SD
val
ue
-4 -3 -2 last-1 -4 -3 -2 last-1-5
Segment serial position
-4 -3 -2 last-1 -4 -3 -2 last-1-5
5 segments over 4 presentations 6 segments over 5 presentations
12
3
4
12
3
4
Standard0
1 2 3
4 5
Figure 2.4: Model 0: Standard Mixture Model with no serial position variability. Fitted SD(top) and g (bottom) values from the standard mixture model, by segment serial position andrepeated presentation. Note that the fitted g values for 5-segment sequences are nearly identical onpresentations three and four, making the plotted lines overlap to the point where presentation fourappears to be missing. Error bars are repeated-measures standard error (Loftus & Masson, 1994;Cousineau, 2005; Morey, 2008).
23
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCESPr
opor
tion
of T
rials
0
0.3
0
0.3
0
0.3
0
0.3
Segment Serial Position
1 2 3 4 5
Directional error
0 0 0 0 0180 180 180 180 180-180 -180 -180 -180 -180
Presentation 1
Presentation 2
Presentation 3
Presentation 4
Figure 2.5: Model 0: Error distributions and fitted model predictions for a representative subject.Panels are arranged in rows by presentation and in columns by serial position.
2.3.2 Model 0: Standard Mixture Model with no serial position variability
Figure 2.4 shows the fitted SD and g values from Model 0 (Standard Mixture Model with no
serial position variability), across segment serial position and repeated presentation. In each panel,
five-segment sequences are shown on the left and six-segment sequences are shown on the right.
Note that for both parameters and both sequence lengths, the fitted values are higher (reflecting
worse memory performance) on the first presentation than on subsequent presentations.
Figure 2.5 shows the error distributions generated by a representative subject who was perform-
24
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
Table 2.3: Results of repeated-measures ANOVAs on each parameter of Model 0, with factorpresentation.
five-segment data six-segment data
parameter source df1, df2 F partial η2 df1, df2 F partial η2
SD presentation 3, 96 10.270∗∗∗ .243 4, 120 12.684∗∗∗ .297
g presentation 3, 96 5.132∗∗ .138 4,120 15.395∗∗∗ .339
∗p < .05. ∗∗p < .01. ∗∗∗p < .001.
ing the five-segment version of the task. Overlain on the histograms are the distributions predicted
by this subject’s fitted Model 0 parameters.
Separate one-way repeated-measures ANOVAs on the five-segment and six-segment data
confirmed a significant main effect of presentation (p < .003 for both SD and g). The full results
are shown in Table 2.3.
Figure 2.6 summarizes the change in each parameter across repeated presentations. On average,
SD decreases across presentation (p < .001), suggesting that subjects represent each item more
precisely as items become more familiar. Similarly, on average, g decreases across presentations
(p < .001), suggesting that increased familiarity supports subjects in successfully forming a
representation of each item.
2.3.3 Model 1: Standard Mixture Model
Figure 2.7 shows the fitted SD and g values from Model 1 (Standard Mixture Model), across
segment serial position and repeated presentation. In each panel, five-segment sequences are shown
on the left and six-segment sequences are shown on the right. Note that for both parameters and
both sequence lengths, fitted values are higher (reflecting worse memory performance) on the first
presentation than on subsequent presentations. Further, the shapes of the first presentation curves
25
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
-1.6
-0.7
Mea
n fit
ted
g (lo
g sc
ale)
5 segments6 segments
1.0
1.5
Fitte
d SD
val
ue (l
og s
cale
)
1 2 3 4 5
1 2 3 4 5
Repeated presentation
Standard0
30
20
10
0.20
0.10
0.05
5 segments6 segments
Figure 2.6: Effects of repeated presentations on the fitted parameters of Model 0 (Standard MixtureModel with no serial position variability). (top) Mean SD significantly decreases over repeatedpresentations for 5-segment (blue) and 6-segment (green) stimuli (p < .001). (bottom) Mean gsignificantly decreases over repeated presentations (p < .001).
for SD differ substantially from the those for g, with SD showing a strong primacy effect through
the penultimate item, while g shows only a 1-item primacy effect.
Figure 2.8 shows the error distributions generated by a representative subject who was perform-
ing the five-segment version of the task. Overlain on the histograms are the distributions predicted
by this subject’s fitted Model 1 parameters.
Separate two-way repeated measures ANOVAs on the five-segment and six-segment data
separately confirmed a significant main effect of presentation (p < .001) and a significant main
effect of serial position (p < .001) for both parameters, and a significant presentation by serial
26
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
10
20
30
40
Fitte
d sd
val
ue
-4 -3 -2 last-1 -4 -3 -2 last-1-5
Segment serial position
0
0.1
0.2
Fitte
d g
valu
e
-4 -3 -2 last-1 -4 -3 -2 last-1-5
5 segments over 4 presentations 6 segments over 5 presentations
12
3
4
12
3
4
12
3
4
5
12
3
4
5
Standard
Figure 2.7: Model 1: Standard Mixture Model. Fitted SD (top) and g (bottom) values from thestandard mixture model, by segment serial position and repeated presentation.
position interaction for SD (p < .01). The full results are shown in Table 2.4.
Figure 2.9 summarizes the change in each parameter across repeated presentations. On average,
SD decreases across presentations (p < .001), suggesting that subjects represent each item more
precisely as items become more familiar. Further, the inter-item variation in SD also decreases as
items become more familiar (p < .001), suggesting that the learning process enables subjects to
more evenly allocate resources among items in a sequence. Similarly, on average, g decreases across
27
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCESPr
opor
tion
of T
rials
0
0.3
0
0.3
0
0.3
0
0.3
Segment Serial Position
1 2 3 4 5
Directional error
0 0 0 0 0180 180 180 180 180-180 -180 -180 -180 -180
Presentation 1
Presentation 2
Presentation 3
Presentation 4
Figure 2.8: Model 1: Error distributions and fitted model predictions for a representative subject.Panels are arranged in rows by presentation and in columns by serial position.
28
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
Table 2.4: Results of repeated-measures ANOVAs on each parameter of Model 1, with factorspresentation and serial position.
five-segment data six-segment data
parameter source df1, df2 F partial η2 df1, df2 F partial η2
SD
presentation 3, 96 38.131∗∗∗ .544 4, 120 64.438∗∗∗ .682
serial position 4, 128 21.169∗∗∗ .398 5, 150 24.223∗∗∗ .447
pres. × serial pos. 12, 384 2.833∗∗ .081 20, 600 6.456∗∗∗ .177
g
presentation 3,96 27.084∗∗∗ .458 4,120 44.223∗∗∗ .600
serial position 4,128 15.484∗∗∗ .587 5, 150 13.946∗∗∗ .317
pres. × serial pos. 12, 384 1.457 .044 20, 600 2.475∗∗ .076
∼p < .1. ∗p < .05. ∗∗p < .01. ∗∗∗p < .001.
presentations (p < .001), suggesting that increased familiarity supports subjects in successfully
forming a representation of each item, and the inter-item variation in g also decreases (p < .001),
further supporting the hypothesis that learning enables subjects to allocate resources more evenly.
2.3.4 Model 2: Standard Mixture Model With Swaps
Figure 2.10 shows the fitted SD, g, and B values from Model 2 (the model with inter-item swaps),
across segment serial position and repeated presentation. Five-segment sequences are shown on
the left of each panel; six-segment sequences are shown on the right. The fitted values of SD and
g are higher (reflecting worse memory performance) on the first presentation than on subsequent
presentations, and the curves for these parameters are extremely similar to those from Model 1.
B (probability of inter-item confusions) accounts for less than 1% of all responses, and shows no
consistent patterns across serial position or repeated presentation.
Figure 2.11 shows the error distributions generated by a representative subject who was perform-
ing the five-segment version of the task. Overlain on the histograms are the distributions predicted
29
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
-1.6
-0.9
Seria
l pos
ition
effe
cts
on g
(log
sca
le)
-1.6
-0.8
Mea
n fit
ted
g (lo
g sc
ale)
0.3
1.0
Seria
l pos
ition
effe
cts
on s
d (lo
g sc
ale)
1.2
1.5
Mea
n fit
ted
sd (l
og s
cale
)
5 segments6 segments
5 segments6 segments
5 segments6 segments
5 segments6 segments
1 2 3 4 5 1 2 3 4 5
1 2 3 4 5 1 2 3 4 5
Repeated presentation Repeated presentation
Standard
20
25
3010
7
5
2
0.15
0.10
0.05
0.12
0.10
0.07
0.05
Friday, October 4, 13
Figure 2.9: Effects of repeated presentations on the fitted parameters of Model 1 (Standard MixtureModel). (A) Mean SD decreases over repeated presentations for 5-segment (blue) and 6-segment(green) stimuli. (B) Variability in SD across serial positions decreases over repeated presentations.(C) Mean g decreases over repeated presentations. (D) Variability in g across serial positionsdecreases over repeated presentations.
by this subject’s fitted Model 2 parameters. These distributions appear virtually identical to those
predicted by Model 1 because the contribution of B is very small.
Separate two-way repeated measures ANOVAs on the five-segment and six-segment data
confirmed a significant main effect of presentation (p < .001) and a significant main effect of serial
position (p < .001) for both SD and g, and a significant presentation by serial position interaction
for SD (p < .001). Neither main effect, nor the interaction, was significant for B. The full results
30
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
0
0.1
0.2
Fitte
d B
valu
es
0
0.1
0.2
Fitte
d g
valu
e
10
20
30
40
Fitte
d sd
val
ue
-4 -3 -2 last-1 -4 -3 -2 last-1-5
Segment serial position
-4 -3 -2 last-1 -4 -3 -2 last-1-5
5 segments over 4 presentations 6 segments over 5 presentations
12
3
4
12
3
4
12
3
4
5
12
3
4
5
Swap
-4 -3 -2 last-1-5-4 -3 -2 last-1
12
3
4
12
3
4
5
Figure 2.10: Model 2: Fitted SD (top), g (middle), and B (bottom) values from the model withinter-item confusions, by segment serial position and repeated presentation.
31
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCESPr
opor
tion
of T
rials
0
0.3
0
0.3
0
0.3
0
0.3
Segment Serial Position
1 2 3 4 5
Directional error
0 0 0 0 0180 180 180 180 180-180 -180 -180 -180 -180
Presentation 1
Presentation 2
Presentation 3
Presentation 4
Figure 2.11: Model 2: Error distributions and fitted model predictions for a representative subject.Panels are arranged in rows by presentation and in columns by serial position.
32
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
Table 2.5: Results of repeated-measures ANOVAs on each parameter of Model 2, with factors serialposition and presentation.
five-segment data six-segment data
parameter source df1, df2 F partial η2 df1, df2 F partial η2
SD
presentation 3, 96 37.866∗∗∗ .542 4, 120 59.317∗∗∗ .664
serial position 4, 128 21.869∗∗∗ .406 5, 150 24.439∗∗∗ .449
pres. × serial pos. 12, 384 2.948∗∗∗ .084 20, 600 5.823∗∗∗ .162
g
presentation 3,96 26.498∗∗∗ .435 4,120 39.566∗∗∗ .569
serial position 4,128 16.169∗∗∗ .336 5, 150 12.844∗∗∗ .300
pres. × serial pos. 12, 384 1.349 .041 20, 600 3.120∗∗∗ .094
B
presentation 3,96 .104 .003 4,120 1.619 .051
serial position 4,128 1.498 .045 5, 150 1.698 .054
pres. × serial pos. 12, 384 1.198 .036 20, 600 .743 .024
∼p < .1. ∗p < .05. ∗∗p < .01. ∗∗∗p < .001.
are shown in Table 2.4.
Figure 2.12 shows the change in each parameter across repeated presentations. SD and g show
essentially the same patterns as in Model 1 (means and variability decrease across presentations, all
ps < .001); B does not reliably change with presentation, nor does its serial position variability (all
ps > .1).
2.3.5 Model 3: Variable Precision Model
Figure 2.13 shows the fitted modeSD, sdSD, and g values from Model 3 (variable precision model
with guessing). Five-segment sequences are shown on the left of each panel, six-segment sequences
are shown on the right. The fitted values of modeSD, sdSD, and g are all higher (reflecting worse
memory performance) on the first presentation than on subsequent presentations.
33
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
0.3
1.0
Seria
l pos
ition
effe
cts
on s
d (lo
g sc
ale)
-2.2
-1.5
Seria
l pos
ition
effe
cts
on B
(log
sca
le)
-2.4
-1.8
Mea
n fit
ted
B (lo
g sc
ale)
1.2
1.5
Mea
n fit
ted
sd (l
og s
cale
)
-1.8
-0.8
Seria
l pos
ition
effe
cts
on g
(log
sca
le)
-1.8
-0.8
Mea
n fit
ted
g (lo
g sc
ale)
5 segments6 segments
5 segments6 segments
5 segments6 segments
5 segments6 segments
5 segments6 segments
1 2 3 4 5 1 2 3 4 5
1 2 3 4 5 1 2 3 4 5
5 segments6 segments
Swap
1 2 3 4 5 1 2 3 4 5
Repeated presentation Repeated presentation
0.15
0.10
0.05
0.02
0.15
0.10
0.05
0.02
20
25
3010
7
5
0.015
0.010
0.005
0.03
0.02
0.01
2
Figure 2.12: Effects of repeated presentations on the fitted parameters of Model 2 (model withinter-item confusions). (A, C) Mean SD and g decrease over repeated presentations for 5-segment(blue) and 6-segment (green) stimuli. (B, D) Variability in SD and g across serial positions decreasesover repeated presentations. (E) B does not vary over repeated presentations. (F) Variability in Bacross serial positions does not vary with repeated presentations.
34
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
0
5
10
Fitte
d sd
SD v
alue
10
20
30
40
Fitte
d m
odeS
D va
lue
0
0.1
0.2
Fitte
d g
valu
e
-4 -3 -2 last-1 -4 -3 -2 last-1-5
Segment serial position
-4 -3 -2 last-1 -4 -3 -2 last-1-5
5 segments over 4 presentations 6 segments over 5 presentations
12
3
4
12
3
4
VariP gamma
-4 -3 -2 last-1-5-4 -3 -2 last-1
12
3
4
12
3
4
5
1 2 3
4 5
12
3
4
5
Friday, October 4, 13
Figure 2.13: Model 3: Fitted modeSD (top), sdSD (middle), and g (bottom) values from thevariable-precision model with guessing, by segment serial position and repeated presentation.
35
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCESPr
opor
tion
of T
rials
0
0.3
0
0.3
0
0.3
0
0.3
Segment Serial Position
1 2 3 4 5
Directional error
0 0 0 0 0180 180 180 180 180-180 -180 -180 -180 -180
Presentation 1
Presentation 2
Presentation 3
Presentation 4
Figure 2.14: Model 3: Error distributions and fitted model predictions for a representative subject.Panels are arranged in rows by presentation and in columns by serial position.
36
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
Table 2.6: Results of repeated-measures ANOVAs on each parameter of Model 3, with factorspresentation and serial position.
five-segment data six-segment data
parameter source df1, df2 F partial η2 df1, df2 F partial η2
modeSD
presentation 3, 96 6.615∗∗∗ .171 4, 120 15.299∗∗∗ .338
serial position 4, 128 6.645∗∗∗ .172 5, 150 6.003∗∗∗ .167
pres. × serial pos. 12, 384 0.936 .028 20, 600 3.288∗∗∗ .099
sdSD
presentation 3, 96 5.414∗∗ .145 4,120 6.856∗∗∗ .186
serial position 4, 128 3.021∗ .086 5, 150 2.840∗ .087
pres. × serial pos. 12, 384 0.941 .029 20, 600 1.037 .033
g
presentation 3, 96 19.409∗∗∗ .378 4,120 52.522∗∗∗ .637
serial position 4, 128 11.425∗∗∗ .263 5, 150 10.533∗∗∗ .260
pres. × serial pos. 12, 384 1.583 .047 20, 600 2.095∗∗ .065
∗p < .05. ∗∗p < .01. ∗∗∗p < .001.
Figure 2.14 shows the error distributions generated by a representative subject who was perform-
ing the five-segment version of the task. Overlain on the histograms are the distributions predicted
by this subject’s fitted Model 2 parameters. The markedly-different shapes of these distributions is
caused by the combination of multiple von Mises distributions, due to the assumption of variable
precision across trials
Separate two-way repeated measures ANOVAs on the five-segment and six-segment data
separately confirmed a significant main effect of presentation (p < .001) and a significant main
effect of serial position (p < .001) for both modeSD and g, and a significant presentation by serial
position interaction for both parameters on the six-segment data (p < .05). Neither main effect, nor
the interaction, was significant for sdSD. The full results are shown in Table 2.6.
Figure 2.15 summarizes the change in each parameter across repeated presentations. On average,
modeSD decreases across presentations (p < .001 for both 5-segment and 6-segment stimuli),
37
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
-0.3
0.7
Seria
l pos
ition
effe
cts
on s
dSD
(log
scal
e)
-0.4
0.5
Mea
n fit
ted
sdSD
(log
sca
le)
0.4
1.0
Seria
l pos
ition
effe
cts
on m
odeS
D (lo
g sc
ale)
1.1
1.5
Mea
n fit
ted
mod
eSD
(log
scal
e)
5 segments6 segments
-1.6
-0.9
Seria
l pos
ition
effe
cts
on g
(log
sca
le)
-1.5
-0.7
Mea
n fit
ted
g (lo
g sc
ale)
1 2 3 4 5 1 2 3 4 5
1 2 3 4 5 1 2 3 4 5
VariP gamma
1 2 3 4 5 1 2 3 4 5
Repeated presentation Repeated presentation
5 segments6 segments
0.15
0.10
0.05
0.12
0.10
0.07
0.05
30
25
20
15
10
7
5
5 segments6 segments
5 segments6 segments
2.0
1.0
1.5
2.5
5 segments6 segments
5.0
2.0
1.05 segments6 segments
Friday, October 4, 13
Figure 2.15: Effects of repeated presentations on the fitted parameters of Model 3 (the variable-precision model with guessing). (A) Mean modeSD decreases over repeated presentations for5-segment (blue) and 6-segment (green) stimuli. (B) Variability in modeSD across serial positionsdecreases over repeated presentations. (C) Mean sdSD does not reliably change over repeatedpresentations. (D) Variability in sdSD across serial positions does not reliably change over repeatedpresentations. (E) Mean g decreases over repeated presentations. (F) Variability in g across serialpositions decreases over repeated presentations.38
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
implying that subjects are drawing from a range of precisions centered around increasingly precise
representations. The inter-item variability in modeSD also decreases with repeated presentations.
However, while both the mean values of sdSD and the serial position effects on sdSD had negative
slopes, their decrease was less reliable (.020 < p < .071). Finally, as in Models 1 and 2 both the
average fitted values of g and the serial position variability of g decrease with repeated presentations
(p < .001), similarly to the behavior of g in Models 1 and 2.
One point which is not evident from any of the stats displayed here is that a majority of subject-
segment-presentation combinations converge at the minimum sdSD value, making them effectively
equivalent to the Model 1 (standard mixture model) fits. On average, 90.5% of fits to the five-
segment data and 87.0% of fits to the six-segment data converged on the minimum sdSD value.
To test whether such convergence depended on the presentation and segment serial position, I ran
two-way repeated-measures ANOVAs on the five-segment and six-segment data separately. There
was no significant effect of presentation, serial position, or the interaction between them (all ps
> .09). This high rate of convergence to the minimum sdSD value makes me somewhat skeptical
of Model 3’s apparent improvements.
2.3.6 Assessing model fits
I used the Akaike information criterion (AIC) and Kuiper statistic to assess the fit of each model.
After computing the difference in these measures between pairs of models, I bootstrapped a
distribution of differences that occurred when data were drawn from the same generative model.
This allowed me to compute p-values for measures whose distributions are not known a priori, such
as AIC and Kuiper.
Table 2.7 shows the pairwise differences in mean Akaike information criterion (AIC) between
each of the models, and the associated p-values. As expected, the models with increasing complexity
39
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
Table 2.7: Pairwise AIC comparison between models. Positive numbers imply a better fit (lowerAIC) for the the model in column B; negative numbers would imply a better fit for the model incolumn A.
A B difference p-value
Model 0 vs. Model 1 5.102 .500
Model 2 9.217 .224
Model 3 19.005 .032
Model 1 vs. Model 2 3.779 .596
Model 3 17.531 .024
Model 2 vs. Model 3 13.752 .076
Table 2.8: Pairwise Kuiper statistic comparison between models. Positive numbers imply a betterfit (lower Kuiper statistic) for the the model in column B; negative numbers imply a better fit forthe model in column A.
A B difference p-value
Model 0 vs. Model 1 −0.008 .524
Model 2 −0.006 .596
Model 3 0.006 .636
Model 1 vs. Model 2 0.002 .776
Model 3 0.014 .036
Model 2 vs. Model 3 0.013 .232
40
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
do show an AIC benefit, although the only significant differences are between Model 3, the variable
precision model, and Models 0 and 1 (mean AIC is 19.005 lower in Model 3 than Model 0,
p = .032; AIC is 17.531 lower in Model 3 than Model 1, p = .024). Table 2.8 shows the pairwise
differences in mean Kuiper statistic between each of these models. Effects here are almost entirely
non-significant, with Model 0 outperforming Models 1 and 2 (although with p > .52), and Models 2
and 3 outperforming Model 1. However, the only significant effect is between Model 3, the Variable
Precision Model, and Model 1, the Standard Mixture Model (mean Kuiper statistic is 0.014 lower in
Model 3, p = .036).
2.4 Discussion
I investigated the effects of learning on visual short-term memory by using data collected from a
visuomotor sequence-learning task. Subjects viewed and reproduced sequences of motion segments
several successive times, and the accuracy with which they did so was recorded. I then fit four
related models to the distributions of response errors, and assessed how the models’ parameters
changed with increased familiarity with each sequence. Unlike previous work on the role of learning
in this task (Agam et al., 2007, 2010; Maryott et al., 2011), my model-fitting can specify the degree
to which different kinds of memory failures contribute to subjects’ performance.
I found that no matter which model I fit, the value of the parameter or parameters describing the
spread of subjects’ responses decreased over repeated presentations, suggesting that the learning
process utilizes long-term memory to increase the precision of representation in memory. Further,
this decrease was not linear; rather, the decrease from the first presentation to the second was
larger than any of the following changes. Familiarity produced a noticeable increase in VSTM
precision after only a single exposure. The magnitude of the serial position effect on this parameter
also decreased across repetitions, again with large decreases from the first to second presentation,
41
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
followed by smaller changes thereafter.
Similarly, when models included a parameter describing the probability that subjects responded
randomly rather than giving a response based on the item being probed, that parameter’s value
decreased over repeated presentations. This suggests that long-term memory resources not only
enable subjects to reproduce items more precisely, but also enable them to maintain more items in
memory. Again, the magnitude of the serial position effect decreased across repeated presentations,
and, again, both of these decreases were largest from the first to second presentation, with smaller
changes thereafter.
Previous work on this task assessed the likelihood of transpositions, operationalizing them
as cases where swapping two segments in the reproduction led to lower absolute error on both
(Agam et al., 2005). Here, I am using a very different approach to assess the frequency of swaps.
Simulations suggest that, even if each memory representation is centered on the correct segment,
the imperfections of such representations can lead to cases where swapping segments would reduce
absolute error. The analysis here only counts swaps that occur above the levels that are attributable
to imperfect representations. Unlike Agam et al. (2005)’s result, my Model 2 found very low levels
of inter-item confusions, with no systematic effects of serial position and presentations. Apparent
transpositions can result from noisy representations alone.
My results make clear that the learning induced by repeated encounters with each trajectory
changes two separate aspects of the VSTM representation of the segments in that trajectory. The
representation of each segment becomes more precise, and the probability that each segment is
successfully stored and retrieved increases. In short, familiarity is not just inducing a tradeoff
between parameters, but actually increasing the amount of information that subjects are able to
hold in VSTM and retrieve on demand. Further, these simultaneous changes are seen after even
one prior presentation – the mechanism underlying this increased capacity is capable of one-shot
learning. I should note that I am assuming that long-term memory supports increased retention
42
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
ability, but it is also possible that the support is occurring instead (or as well) at encoding or at
retrieval. These studies do not allow me to determine at which memory stage the improvements are
occurring. This is counter to other results that propose a trade-off between precision and capacity in
VSTM (Roggeman, Klingberg, Feenstra, Compte, & Almeida, 2013). However, those authors did
not look at the effects of learning or familiarity; their subjects only saw each display one time.
One possible neural mechanism by which long-term memory processes are able to support
VSTM after previous exposures to a stimulus is chunking, in which subjects group segments into
larger items (Miller, 1956; Agam & Sekuler, 2008; Gobet & Simon, 1998). When an item is
composed of a few chunks rather than of many segments, each chunk can have more memory
resources allocated to it, possibly leading to improved precision and capacity. Maryott and Sekuler
(2009) showed that young adults perform better on trajectories whose segments are easily grouped
than they do on those whose segments are not, suggesting that a grouping or chunking process may
underly performance on this task.
Another possible mechanism underlying these changes with familiarity is that as the brain
learns about a given stimulus, it develops a sparser representation of the elements of that sequence
(Barlow, 1961; Agam et al., 2010; Karlsson & Frank, 2008). That is, the number of neurons
required to represent each element is decreased, likely through changes in neuronal tuning. As
the representation becomes sparser, there may be less overlap among the representations of the
elements of a given trajectory, so that it is easier for the brain to maintain separate representations.
This could allow both more precise representations of a direction of motion, and the maintenance of
a greater number of such representations simultaneously.
Note that familiarity-induced changes in both precision and guess rate converge on a flat serial
position curve. On successive presentations of the same stimulus sequence, the VSTM system is
reallocating memory resources such that errors are distributed more evenly among items. In other
words, the presentation-over-presentation changes in precision and guess rate are larger for items
43
CHAPTER 2. DYNAMIC REALLOCATION OF VSTM RESOURCES
that had high amounts of error on the first presentation, and lower (or even zero) for those that had
low amounts of error. Such dynamic reallocation suggests a strong preference within the system
for representing all items imperfectly, rather than trading off high precision on one item for poor
precision on another.
It also demonstrates that the VSTM system has access to a great deal of metacognitive informa-
tion about the quality of the representations in long-term memory. This information may be directly
accessible from long-term memory, or it may be accessed more indirectly. For example, the system
may attempt to predict the direction of the disk on the subsequent presentation, and use the degree
of error on each segment as a cue to allocate more memory resources to that segment. Future work
should attempt to distinguish between these possibilities.
Further, my results are inconsistent with a slots conceptualization of VSTM resource allocation
(as proposed by Zhang & Luck, 2008, 2009, 2011), in which short-term memory resources are
allocated in quanta of fixed size. Such a model is insufficient to account for the variation in precision
and guess rate that I see in my fitted model parameters. The effects of segment serial position and of
repeated presentation, along with the model-fitting comparison’s support for the variable-precision
model, implies greater flexibility in the system, and integration with long-term memory processes,
than has previously been demonstrated. In particular, the reliable familiarity-induced decrease in
guessing cannot be accounted for by a slot model. VSTM resources must be flexibly allocated to
produce the results presented in this chapter.
44
Chapter 3
P300 to violations of newly learned
predictions
3.1 Introduction
The human brain frequently operates in feedforward mode, exploiting previously-experienced
regularities to build expectations for future events. This proactive operation facilitates perceptual
processing (Bar, 2009) and makes it possible for appropriate behaviors to be prepared and executed
in a timely fashion (e.g. Kowler, 1989; Maryott et al., 2011). Among the richest regularities
available to the brain are ones entailed not in single isolated events, but in event sequences. In fact,
the brain constructs and continuously updates its representation of such sequential regularities in an
obligatory and effortless manner (Johnson & Donchin, 1982; Kimura, Kondo, Ohira, & Schroger,
2011; Sternberg & McClelland, 2012).
In order to benefit fully from the advantages of feedforward operation, the brain must have
a mechanism to detect events that violate its expectations, and to thereby trigger an appropriate
response (Winkler, 2007). Such responses might include heightening attention to the unexpected
45
CHAPTER 3. P300 TO VIOLATIONS OF NEWLY LEARNED PREDICTIONS
event, modifying or delaying a prepared behavior, or updating the brain’s representation of the
regularity at hand.
Event-related brain potentials (ERPs) provide a direct measure of neural activity that is time-
locked to specific events (Luck, 2005). Their temporal precision makes ERPs a useful tool for
studying the neural reaction to individual events within a sequential structure. The P3 is one ERP
component that has been hypothesized to reflect the brain’s response to a novel or surprising stimulus
(Linden, 2005; Polich, 2007; K. C. Squires, Wickens, Squires, & Donchin, 1976). While the P3
has historically been studied in oddball paradigms (e.g Courchesne, Hillyard, & Galambos, 1975;
N. K. Squires, Squires, & Hillyard, 1975; Zuijen, Simoens, Paavilainen, Naatanen, & Tervaniemi,
2006), where a stream of stimuli contains both frequent and infrequent stimulus exemplars, it has
also been studied in sequence-learning tasks. Stimuli that deviate from an established sequential
structure elicit a larger P3 response than stimuli that conform to the sequential structure (Ferdinand et
al., 2008; Russeler & Rosler, 2000; Russeler, Hennighausen, Munte, & Rosler, 2003; Schlaghecken,
Sturmer, & Eimer, 2000). However, the studies demonstrating this P3 enhancement have required
subjects to learn only a single sequence per block of trials, allowing them to encode and maintain
one sequential regularity that remained valid for an extended period of time.
I wanted to investigate the neural mechanisms underlying sequence learning and prediction
monitoring in a more ecologically valid setting, where the governing regularities were short-lived.
To do so, I adopted the task used by Maryott et al. (2011) to investigate the behavioral response to
deviant events embedded in complex sequential structures that frequently changed. In that study,
subjects had to remember and reproduce short sequences of movements, each approximately five
seconds long. Each sequence was seen and reproduced several times, but occasionally a deviant,
unexpected item was inserted into a well-learned sequence. Subjects successfully incorporated the
deviant items into their representation of the sequence, and even showed a slight benefit for deviant
items. Note that Maryott et al.’s task differs from traditional sequence-learning paradigms such as
46
CHAPTER 3. P300 TO VIOLATIONS OF NEWLY LEARNED PREDICTIONS
the serial reaction time task (Nissen & Bullemer, 1987) in that it required that the brain frequently
update its representation of the governing sequential structure, as a new sequence began every 60-90
seconds.
To investigate whether this dynamic, changing context would alter the brain’s response to a
deviant sequence item, I recorded EEG signals from subjects who were performing a variant of the
task used by Maryott et al., and measured ERPs to new, familiar, and deviant sequence items. I
hypothesized that I would find a P3-like component in response to both new and deviant sequence
items, but that the timing and distribution would vary somewhat between the two conditions,
capturing the distinctions that have previously been found between the Novelty P3 and the task-
relevance P300 (Polich, 2007; Goldstein et al., 2002; Linden, 2005).
Here, I show that both new and deviant sequence items evoke a larger P3 than do familiar
sequence items. Deviant sequence items elicit both a P300 and a Novelty P3 relative to familiar
items, while new sequence items elicit only a P300 (and that of slightly lower amplitude). My results
show that the neural response to deviant sequence elements in a frequently updated environment
confirms that to deviant items in more stable settings.
3.2 Methods
3.2.1 Subjects
Twelve young adults (seven female, ages 19–28) participated in this experiment. All were naıve to
the task; all were right-handed.
47
CHAPTER 3. P300 TO VIOLATIONS OF NEWLY LEARNED PREDICTIONS
3.2.2 Experimental task
To induce and measure sequence learning, I asked participants to observe and reproduce pseudo-
random motion trajectories. Each trajectory was presented four successive times, and each partici-
pant saw 128 different trajectories.
On each presentation of a trajectory, a small disk traversed a path comprising five connected
linear motion segments. Figure 2.1 illustrates the sequence of events within one such presentation.
Each segment of the trajectory was 1 cm (approximately 1 visual angle) in length, and the disk
moved at a constant speed of 2 cm per second, taking 500 ms to travel the length of each segment.
After each segment, the disk paused for 400 ms before resuming its motion in a changed direction.
The yellow disk then disappeared from view. After a retention interval of 3750 ms, a second disk
appeared, cueing the subject to move a handheld stylus over the surface of a graphics tablet (31×
24 cm; Intuos 3, Wacom, Vancouver, WA) in order to reproduce from memory the sequence of
disk motions that had just been seen. During the reproduction, the disk’s motion was yoked to the
movement of the stylus’ tip on the graphics tablet. No other feedback was provided. Note that
neither the stimulus nor the reproduction disk left a visible trail while moving across the computer
display. Subjects viewed the stimuli from a distance of approximately 57 cm, and were instructed to
maintain fixation on a central cross, and refrain from blinking, while the stimulus disk was on the
screen.
Each trial’s quasi-random sequence of five motion segments was generated by the algorithm
described by Agam et al. (2005). The direction of a sequence’s initial motion was chosen randomly,
and the direction change at each corner of the trajectory was between 30 and 150. These changes in
direction could be clockwise or counter-clockwise, with equal probability. The motions comprising
any sequence were constrained by several additional rules: Motion segments were not permitted to
intersect, could not come within one-half a segment’s length of intersecting, nor could they extend
48
CHAPTER 3. P300 TO VIOLATIONS OF NEWLY LEARNED PREDICTIONS
beyond the boundaries of the display area.
3.2.3 Design and procedure
On every trial, that trial’s unique trajectory was presented four times, with participants reproducing
the sequence after each such presentation. Each set of four presentations constituted either a
Congruent trial, in which all four presentations of a sequence were identical to one another, or
a Flip trial, in which one motion segment changed direction on the sequence’s fourth (and final)
presentation. On the last (fourth) presentation of a Flip trial, the established trajectory’s final
segment was replaced by a segment in which the disk’s motion was 180 opposite. Figure 3.1
illustrates these two conditions.
I operationalized three types of events: new, familiar, and deviant. New events were motion
segments on their first presentation; familiar events were segments on their fourth presentation;
deviant events were the “flipped” final segment of the final presentation on Flip trials. See Section
3.2.6 (page 52) for further descriptions of these operationalizations. Note that new events are
congruent with the task context, and thus very similar to other events within the experiment.
Each subject completed four 45-minute experimental sessions, in which he or she observed
and reproduced 32 different trajectories four times each. In each session, the first two trials were
Congruent trials, followed by 20 Congruent and 10 Flip trials, block-randomized so that Flip
trials were more evenly distributed. Forty trials (approximately 31%) were Flip trials; the remainder
(88 trials) were Congruent trials. This approximates the ratio of Flip to Congruent trials used by
Maryott et al. (2011), in which subjects did not anticipate the flips (as demonstrated by subjects’
anticipatory eye movements). Subjects were never informed that some trials would be Flip trials.
49
CHAPTER 3. P300 TO VIOLATIONS OF NEWLY LEARNED PREDICTIONS
Presentation
1 2 3 4
1Flip
Congruent
flip
2 3 4
Figure 3.1: The sequence of presentations that made up a Congruent trial or Flip trial. In myanalyses I consider the first presentation of a stimulus to be new, the fourth presentation of aCongruent trial to be familiar, and the “flipped” segment at the end of a Flip trial to be deviant.
3.2.4 Behavioral data analysis
The fidelity of each reproduction was quantified offline by means of a two-step algorithm (Agam et
al., 2005; Maryott et al., 2011) that used pauses and direction changes to divide the reproduction
into segments. The algorithm then estimated the direction of each such segment by fitting a line
to its beginning and end points. Reproduction accuracy was quantified by directional error: the
absolute angular difference between the direction of a motion segment in the reproduction and the
direction of the corresponding segment in the stimulus model.
Note that this segmentation algorithm fails if it divides a reproduced trajectory into a number
of segments that differs from the number in the model trajectory. To increase the likelihood that
reproductions would be successfully divided into five motion segments, I instructed subjects to
try to produce the same number of segments that had been in the stimulus (five) and to, insofar as
possible, draw straight lines with corners between them. These instructions allowed the segmentation
algorithm to successfully divide over 90% of trials.
50
CHAPTER 3. P300 TO VIOLATIONS OF NEWLY LEARNED PREDICTIONS
3.2.5 Electrophysiological recording
A high-density EEG system (Electrical Geodesics, Inc., Eugene, OR) with 129 electrodes sampled
scalp electroencephalographic signals at 250 Hz using a high-impedance amplifier. Signals were
recorded for later, off-line analysis. All channels were adjusted for scalp impedance below 50 kΩ;
after 12 trials, channels were again adjusted for impedance below 50 kΩ scalp impedance before
the subject completed the final 20 trials of the session.
3.2.6 EEG analysis
EEG data were cleaned and analyzed in the EEGLAB (Delorme & Makeig, 2004) and FieldTrip
(Oostenveld, Fries, Maris, & Schoffelen, 2011) toolboxes for Matlab (The Mathworks, Inc., Natick,
MA). Data were re-referenced to the average voltage, bandpass filtered to between 0.25 and 75 Hz,
and divided into epochs for each segment and each presentation. Every such epoch extended from
200 ms before that segment’s disk motion onset to 600 ms after. Data were visually inspected for
muscle artifacts, eye movements, and bad channels; epochs containing such artifacts were rejected.
Independent components analysis was used to isolate eye blink activity, which was subtracted from
the data. Finally, data were again visually inspected for artifacts not corrected by the previous two
processes. After cleaning, data were averaged across trials and sessions to create a subject average
ERP for each combination of condition (Congruent or Flip), segment (one, two, three, four, or
five), and presentation (one, two, three, or four,).
For each of the following investigations, I used a data-driven, non-parametric clustering approach
(Maris & Oostenveld, 2007) to select time windows and electrodes for analysis. The FieldTrip
toolbox includes software implementing this approach. It first calculates Student’s t for each
electrode and time point, and identifies clusters of time- and/or space-adjacent electrodes with
|t| > tcritical. Critical t-values were selected by the experimenters according to several criteria,
51
CHAPTER 3. P300 TO VIOLATIONS OF NEWLY LEARNED PREDICTIONS
including the degrees of freedom of the comparison, the magnitude of the difference between the
conditions, and the degree of spatial and temporal specificity desired.
For each cluster, the t-scores of the member electrodes and time-points were summed, giving a
cluster score that reflected both the extent of the cluster (in space and time) and the magnitude of
the difference between the conditions. A reference distribution of test statistics was generated by
randomly permuting the data across the two conditions being compared, computing such scores
for each resulting cluster, and taking the largest such cluster score on each of 1,000 permutations.
Where cluster-wise p-values are reported, they are derived by comparing the empirically-obtained
cluster score to such a reference distribution.
To confirm that experimenter selection of tcritical is appropriate, I compared the clusters generated
by the algorithm at three different values of tcritical. Results are shown in Figure 3.2. In each panel,
blue circles denote the electrodes that are included in a statistically-significant (cluster-wise p < .05)
positive cluster; red circles denote electrodes included in a statistically significant negative cluster.
Below each electrode map is a timeline with a gray box showing the time window over which the
cluster is significant. With the least-stringent value, tcritical = 2.093, the algorithm identifies a
positive central cluster composed of 42 electrodes over a time window from 384–526 ms after disk
motion onset (Figure 3.2, upper left panel). At a moderate value, tcritical = 2.861, the algorithm
identifies a positive central cluster composed of 18 electrodes over a time window from 396–448
ms after disk motion onset (Figure 3.2, upper right panel). Finally, at tcritical = 3.883, the algorithm
identifies a positive central cluster composed of 5 electrodes over a time window of 404–412 ms
after disk motion onset (Figure 3.2, lower left panel). Adjusting the value of tcritical expands or
contracts the electrodes and time windows included in a cluster without dramatically distorting the
cluster’s position or timing.
Grand average ERPs for each comparison of interest were created by averaging across subjects
and across the electrodes identified as part of the cluster. I did three such comparisons.
52
CHAPTER 3. P300 TO VIOLATIONS OF NEWLY LEARNED PREDICTIONS
HydroCel Geodesic Sensor Net128 Channel Map
Version 1.0
For questions or additional assistance please refer to the EGI Sensor Net Technical Manual or contact us at:
Electrical Geodesics, Inc.1600 Millrace Drive, Suite 307Eugene, Oregon USA 97403Phone: (541) 687-7962 Fax: (541) 687-7963Email: [email protected] or [email protected] N-PRT-LAM-4300-001
12
3
4
5
6
7
89
10
12
13
14
15
16
17
18
19
20
21
22Fp1
23
24F3
43
44
45T3
4647
38
39
41
42
32
34
35
37
25
26
27
28
29
30
31
48
49
50 51 52P3
5354
55
56 57LM
58T5
61
COM
6364 65
66
67
68
71
72
73
74
75Oz
76
77
78
79
80
81
82
84
86
87
88
89
90
92P4
93
S/N
94
95
96T6
97
98
99
100RM
101
102
103
105106
107
108T4
110111
112
113
115
116117118
114
119120
121
122F8
124F4
125
126127
1289 99Fp2
12333F7
36C3
40
5960
69
70O1
83O2
8591
104C4
REFCz
109
11Fz
62Pz
5
HydroCel Geodesic Sensor Net128 Channel Map
Version 1.0
For questions or additional assistance please refer to the EGI Sensor Net Technical Manual or contact us at:
Electrical Geodesics, Inc.1600 Millrace Drive, Suite 307Eugene, Oregon USA 97403Phone: (541) 687-7962 Fax: (541) 687-7963Email: [email protected] or [email protected] N-PRT-LAM-4300-001
12
3
4
5
6
7
89
10
12
13
14
15
16
17
18
19
20
21
22Fp1
23
24F3
43
44
45T3
4647
38
39
41
42
32
34
35
37
25
26
27
28
29
30
31
48
49
50 51 52P3
5354
55
56 57LM
58T5
61
COM
6364 65
66
67
68
71
72
73
74
75Oz
76
77
78
79
80
81
82
84
86
87
88
89
90
92P4
93
S/N
94
95
96T6
97
98
99
100RM
101
102
103
105106
107
108T4
110111
112
113
115
116117118
114
119120
121
122F8
124F4
125
126127
1289 99Fp2
12333F7
36C3
40
5960
69
70O1
83O2
8591
104C4
REFCz
109
11Fz
62Pz
5
tcritical = 2.093
HydroCel Geodesic Sensor Net128 Channel Map
Version 1.0
For questions or additional assistance please refer to the EGI Sensor Net Technical Manual or contact us at:
Electrical Geodesics, Inc.1600 Millrace Drive, Suite 307Eugene, Oregon USA 97403Phone: (541) 687-7962 Fax: (541) 687-7963Email: [email protected] or [email protected] N-PRT-LAM-4300-001
12
3
4
5
6
7
89
10
12
13
14
15
16
17
18
19
20
21
22Fp1
23
24F3
43
44
45T3
4647
38
39
41
42
32
34
35
37
25
26
27
28
29
30
31
48
49
50 51 52P3
5354
55
56 57LM
58T5
61
COM
6364 65
66
67
68
71
72
73
74
75Oz
76
77
78
79
80
81
82
84
86
87
88
89
90
92P4
93
S/N
94
95
96T6
97
98
99
100RM
101
102
103
105106
107
108T4
110111
112
113
115
116117118
114
119120
121
122F8
124F4
125
126127
1289 99Fp2
12333F7
36C3
40
5960
69
70O1
83O2
8591
104C4
REFCz
109
11Fz
62Pz
5
tcritical = 2.861
tcritical = 3.883
HydroCel Geodesic Sensor Net128 Channel Map
Version 1.0
For questions or additional assistance please refer to the EGI Sensor Net Technical Manual or contact us at:
Electrical Geodesics, Inc.1600 Millrace Drive, Suite 307Eugene, Oregon USA 97403Phone: (541) 687-7962 Fax: (541) 687-7963Email: [email protected] or [email protected] N-PRT-LAM-4300-001
12
3
4
5
6
7
89
10
12
13
14
15
16
17
18
19
20
21
22Fp1
23
24F3
43
44
45T3
4647
38
39
41
42
32
34
35
37
25
26
27
28
29
30
31
48
49
50 51 52P3
5354
55
56 57LM
58T5
61
COM
6364 65
66
67
68
71
72
73
74
75Oz
76
77
78
79
80
81
82
84
86
87
88
89
90
92P4
93
S/N
94
95
96T6
97
98
99
100RM
101
102
103
105106
107
108T4
110111
112
113
115
116117118
114
119120
121
122F8
124F4
125
126127
1289 99Fp2
12333F7
36C3
40
5960
69
70O1
83O2
8591
104C4
REFCz
109
11Fz
62Pz
5
cluster criterion: α = .05
HydroCel Geodesic Sensor Net128 Channel Map
Version 1.0
For questions or additional assistance please refer to the EGI Sensor Net Technical Manual or contact us at:
Electrical Geodesics, Inc.1600 Millrace Drive, Suite 307Eugene, Oregon USA 97403Phone: (541) 687-7962 Fax: (541) 687-7963Email: [email protected] or [email protected] N-PRT-LAM-4300-001
12
3
4
5
6
7
89
10
12
13
14
15
16
17
18
19
20
21
22Fp1
23
24F3
43
44
45T3
4647
38
39
41
42
32
34
35
37
25
26
27
28
29
30
31
48
49
50 51 52P3
5354
55
56 57LM
58T5
61
COM
6364 65
66
67
68
71
72
73
74
75Oz
76
77
78
79
80
81
82
84
86
87
88
89
90
92P4
93
S/N
94
95
96T6
97
98
99
100RM
101
102
103
105106
107
108T4
110111
112
113
115
116117118
114
119120
121
122F8
124F4
125
126127
1289 99Fp2
12333F7
36C3
40
5960
69
70O1
83O2
8591
104C4
REFCz
109
11Fz
62Pz
5
HydroCel Geodesic Sensor Net128 Channel Map
Version 1.0
For questions or additional assistance please refer to the EGI Sensor Net Technical Manual or contact us at:
Electrical Geodesics, Inc.1600 Millrace Drive, Suite 307Eugene, Oregon USA 97403Phone: (541) 687-7962 Fax: (541) 687-7963Email: [email protected] or [email protected] N-PRT-LAM-4300-001
12
3
4
5
6
7
89
10
12
13
14
15
16
17
18
19
20
21
22Fp1
23
24F3
43
44
45T3
4647
38
39
41
42
32
34
35
37
25
26
27
28
29
30
31
48
49
50 51 52P3
5354
55
56 57LM
58T5
61
COM
6364 65
66
67
68
71
72
73
74
75Oz
76
77
78
79
80
81
82
84
86
87
88
89
90
92P4
93
S/N
94
95
96T6
97
98
99
100RM
101
102
103
105106
107
108T4
110111
112
113
115
116117118
114
119120
121
122F8
124F4
125
126127
1289 99Fp2
12333F7
36C3
40
5960
69
70O1
83O2
8591
104C4
REFCz
109
11Fz
62Pz
5
Member of a negative cluster
Member of a positive cluster
384-516 ms
cluster criterion: α = .01
396-448 msdisk motion onset
404-412 ms
cluster criterion: α = .001
disk motion onset
disk motion onset
time
HydroCel Geodesic Sensor Net128 Channel Map
Version 1.0
For questions or additional assistance please refer to the EGI Sensor Net Technical Manual or contact us at:
Electrical Geodesics, Inc.1600 Millrace Drive, Suite 307Eugene, Oregon USA 97403Phone: (541) 687-7962 Fax: (541) 687-7963Email: [email protected] or [email protected] N-PRT-LAM-4300-001
12
3
4
5
6
7
89
10
12
13
14
15
16
17
18
19
20
21
22Fp1
23
24F3
43
44
45T3
4647
38
39
41
42
32
34
35
37
25
26
27
28
29
30
31
48
49
50 51 52P3
5354
55
56 57LM
58T5
61
COM
6364 65
66
67
68
71
72
73
74
75Oz
76
77
78
79
80
81
82
84
86
87
88
89
90
92P4
93
S/N
94
95
96T6
97
98
99
100RM
101
102
103
105106
107
108T4
110111
112
113
115
116117118
114
119120
121
122F8
124F4
125
126127
1289 99Fp2
12333F7
36C3
40
5960
69
70O1
83O2
8591
104C4
REFCz
109
11Fz
62Pz
5
cluster criterion: α = .05
HydroCel Geodesic Sensor Net128 Channel Map
Version 1.0
For questions or additional assistance please refer to the EGI Sensor Net Technical Manual or contact us at:
Electrical Geodesics, Inc.1600 Millrace Drive, Suite 307Eugene, Oregon USA 97403Phone: (541) 687-7962 Fax: (541) 687-7963Email: [email protected] or [email protected] N-PRT-LAM-4300-001
12
3
4
5
6
7
89
10
12
13
14
15
16
17
18
19
20
21
22Fp1
23
24F3
43
44
45T3
4647
38
39
41
42
32
34
35
37
25
26
27
28
29
30
31
48
49
50 51 52P3
5354
55
56 57LM
58T5
61
COM
6364 65
66
67
68
71
72
73
74
75Oz
76
77
78
79
80
81
82
84
86
87
88
89
90
92P4
93
S/N
94
95
96T6
97
98
99
100RM
101
102
103
105106
107
108T4
110111
112
113
115
116117118
114
119120
121
122F8
124F4
125
126127
1289 99Fp2
12333F7
36C3
40
5960
69
70O1
83O2
8591
104C4
REFCz
109
11Fz
62Pz
5
HydroCel Geodesic Sensor Net128 Channel Map
Version 1.0
For questions or additional assistance please refer to the EGI Sensor Net Technical Manual or contact us at:
Electrical Geodesics, Inc.1600 Millrace Drive, Suite 307Eugene, Oregon USA 97403Phone: (541) 687-7962 Fax: (541) 687-7963Email: [email protected] or [email protected] N-PRT-LAM-4300-001
12
3
4
5
6
7
89
10
12
13
14
15
16
17
18
19
20
21
22Fp1
23
24F3
43
44
45T3
4647
38
39
41
42
32
34
35
37
25
26
27
28
29
30
31
48
49
50 51 52P3
5354
55
56 57LM
58T5
61
COM
6364 65
66
67
68
71
72
73
74
75Oz
76
77
78
79
80
81
82
84
86
87
88
89
90
92P4
93
S/N
94
95
96T6
97
98
99
100RM
101
102
103
105106
107
108T4
110111
112
113
115
116117118
114
119120
121
122F8
124F4
125
126127
1289 99Fp2
12333F7
36C3
40
5960
69
70O1
83O2
8591
104C4
REFCz
109
11Fz
62Pz
5
Member of a negative cluster
Member of a positive cluster
384-516 ms
cluster criterion: α = .01
396-448 msdisk motion onset
404-412 ms
cluster criterion: α = .001
disk motion onset
disk motion onset
time
HydroCel Geodesic Sensor Net128 Channel Map
Version 1.0
For questions or additional assistance please refer to the EGI Sensor Net Technical Manual or contact us at:
Electrical Geodesics, Inc.1600 Millrace Drive, Suite 307Eugene, Oregon USA 97403Phone: (541) 687-7962 Fax: (541) 687-7963Email: [email protected] or [email protected] N-PRT-LAM-4300-001
12
3
4
5
6
7
89
10
12
13
14
15
16
17
18
19
20
21
22Fp1
23
24F3
43
44
45T3
4647
38
39
41
42
32
34
35
37
25
26
27
28
29
30
31
48
49
50 51 52P3
5354
55
56 57LM
58T5
61
COM
6364 65
66
67
68
71
72
73
74
75Oz
76
77
78
79
80
81
82
84
86
87
88
89
90
92P4
93
S/N
94
95
96T6
97
98
99
100RM
101
102
103
105106
107
108T4
110111
112
113
115
116117118
114
119120
121
122F8
124F4
125
126127
1289 99Fp2
12333F7
36C3
40
5960
69
70O1
83O2
8591
104C4
REFCz
109
11Fz
62Pz
5
cluster criterion: α = .05
HydroCel Geodesic Sensor Net128 Channel Map
Version 1.0
For questions or additional assistance please refer to the EGI Sensor Net Technical Manual or contact us at:
Electrical Geodesics, Inc.1600 Millrace Drive, Suite 307Eugene, Oregon USA 97403Phone: (541) 687-7962 Fax: (541) 687-7963Email: [email protected] or [email protected] N-PRT-LAM-4300-001
12
3
4
5
6
7
89
10
12
13
14
15
16
17
18
19
20
21
22Fp1
23
24F3
43
44
45T3
4647
38
39
41
42
32
34
35
37
25
26
27
28
29
30
31
48
49
50 51 52P3
5354
55
56 57LM
58T5
61
COM
6364 65
66
67
68
71
72
73
74
75Oz
76
77
78
79
80
81
82
84
86
87
88
89
90
92P4
93
S/N
94
95
96T6
97
98
99
100RM
101
102
103
105106
107
108T4
110111
112
113
115
116117118
114
119120
121
122F8
124F4
125
126127
1289 99Fp2
12333F7
36C3
40
5960
69
70O1
83O2
8591
104C4
REFCz
109
11Fz
62Pz
5
HydroCel Geodesic Sensor Net128 Channel Map
Version 1.0
For questions or additional assistance please refer to the EGI Sensor Net Technical Manual or contact us at:
Electrical Geodesics, Inc.1600 Millrace Drive, Suite 307Eugene, Oregon USA 97403Phone: (541) 687-7962 Fax: (541) 687-7963Email: [email protected] or [email protected] N-PRT-LAM-4300-001
12
3
4
5
6
7
89
10
12
13
14
15
16
17
18
19
20
21
22Fp1
23
24F3
43
44
45T3
4647
38
39
41
42
32
34
35
37
25
26
27
28
29
30
31
48
49
50 51 52P3
5354
55
56 57LM
58T5
61
COM
6364 65
66
67
68
71
72
73
74
75Oz
76
77
78
79
80
81
82
84
86
87
88
89
90
92P4
93
S/N
94
95
96T6
97
98
99
100RM
101
102
103
105106
107
108T4
110111
112
113
115
116117118
114
119120
121
122F8
124F4
125
126127
1289 99Fp2
12333F7
36C3
40
5960
69
70O1
83O2
8591
104C4
REFCz
109
11Fz
62Pz
5
Member of a negative cluster
Member of a positive cluster
384-516 ms
cluster criterion: α = .01
396-448 msdisk motion onset
404-412 ms
cluster criterion: α = .001
disk motion onset
disk motion onset
HydroCel Geodesic Sensor Net128 Channel Map
Version 1.0
For questions or additional assistance please refer to the EGI Sensor Net Technical Manual or contact us at:
Electrical Geodesics, Inc.1600 Millrace Drive, Suite 307Eugene, Oregon USA 97403Phone: (541) 687-7962 Fax: (541) 687-7963Email: [email protected] or [email protected] N-PRT-LAM-4300-001
12
3
4
5
6
7
89
10
12
13
14
15
16
17
18
19
20
21
22Fp1
23
24F3
43
44
45T3
4647
38
39
41
42
32
34
35
37
25
26
27
28
29
30
31
48
49
50 51 52P3
5354
55
56 57LM
58T5
61
COM
6364 65
66
67
68
71
72
73
74
75Oz
76
77
78
79
80
81
82
84
86
87
88
89
90
92P4
93
S/N
94
95
96T6
97
98
99
100RM
101
102
103
105106
107
108T4
110111
112
113
115
116117118
114
119120
121
122F8
124F4
125
126127
1289 99Fp2
12333F7
36C3
40
5960
69
70O1
83O2
8591
104C4
REFCz
109
11Fz
62Pz
5
cluster criterion: α = .05
HydroCel Geodesic Sensor Net128 Channel Map
Version 1.0
For questions or additional assistance please refer to the EGI Sensor Net Technical Manual or contact us at:
Electrical Geodesics, Inc.1600 Millrace Drive, Suite 307Eugene, Oregon USA 97403Phone: (541) 687-7962 Fax: (541) 687-7963Email: [email protected] or [email protected] N-PRT-LAM-4300-001
12
3
4
5
6
7
89
10
12
13
14
15
16
17
18
19
20
21
22Fp1
23
24F3
43
44
45T3
4647
38
39
41
42
32
34
35
37
25
26
27
28
29
30
31
48
49
50 51 52P3
5354
55
56 57LM
58T5
61
COM
6364 65
66
67
68
71
72
73
74
75Oz
76
77
78
79
80
81
82
84
86
87
88
89
90
92P4
93
S/N
94
95
96T6
97
98
99
100RM
101
102
103
105106
107
108T4
110111
112
113
115
116117118
114
119120
121
122F8
124F4
125
126127
1289 99Fp2
12333F7
36C3
40
5960
69
70O1
83O2
8591
104C4
REFCz
109
11Fz
62Pz
5
HydroCel Geodesic Sensor Net128 Channel Map
Version 1.0
For questions or additional assistance please refer to the EGI Sensor Net Technical Manual or contact us at:
Electrical Geodesics, Inc.1600 Millrace Drive, Suite 307Eugene, Oregon USA 97403Phone: (541) 687-7962 Fax: (541) 687-7963Email: [email protected] or [email protected] N-PRT-LAM-4300-001
12
3
4
5
6
7
89
10
12
13
14
15
16
17
18
19
20
21
22Fp1
23
24F3
43
44
45T3
4647
38
39
41
42
32
34
35
37
25
26
27
28
29
30
31
48
49
50 51 52P3
5354
55
56 57LM
58T5
61
COM
6364 65
66
67
68
71
72
73
74
75Oz
76
77
78
79
80
81
82
84
86
87
88
89
90
92P4
93
S/N
94
95
96T6
97
98
99
100RM
101
102
103
105106
107
108T4
110111
112
113
115
116117118
114
119120
121
122F8
124F4
125
126127
1289 99Fp2
12333F7
36C3
40
5960
69
70O1
83O2
8591
104C4
REFCz
109
11Fz
62Pz
5
Member of a negative cluster
Member of a positive cluster
384-516 ms
cluster criterion: α = .01
396-448 msdisk motion onset
404-412 ms
cluster criterion: α = .001
disk motion onset
disk motion onset
time
Figure 3.2: The electrodes identified by my clustering algorithm at three different values of tcritical.In each panel, blue circles denote the electrodes that are included in a statistically-significant(p < .05) positive cluster, while red circles denote electrodes included in a statistically-significantnegative cluster. The gray boxes on each timeline show the time window of each cluster. As clustersize decreases with higher criterion values, the position of the cluster is stable in both space andtime.
53
CHAPTER 3. P300 TO VIOLATIONS OF NEWLY LEARNED PREDICTIONS
First, to assess the changes in neural responses as a new sequence item becomes increasingly
familiar, I collapsed across segments two, three, four, and five1 of Congruent trials and computed
subject-average ERPs for each presentation. I will refer to these four presentations as pres1,
pres2, pres3, and pres4. Because the clustering approach requires pairwise comparisons, I used the
difference between pres1 and pres2 to identify the electrodes and time window for this comparison.
As seen in Chapter 2, the encoding and maintenance of repeated presentations of a given sequence
item are supported by long-term memory. Therefore, the neural response to pres1 should reflect the
increased demands of encoding an item that has not been previously seen and thus cannot draw on
such long-term memory support.
Next, to measure the differences between neural responses to deviant sequence items and those
to familiar sequence items, I calculated ERPs to segment five of presentation four on Congruent
and Flip trials. Note that this is the segment and presentation on which the “flip” occurs on Flip
trials. I will use Con p4s5 and Flip p4s5 to denote these two segments. Here, the response to
Flip p4s5 segments should reflect both the increased encoding demands also seen in pres1 and the
prediction-monitoring processes that trigger such new encoding.
Finally, because both the comparison between new and familiar sequence items and the com-
parison between deviant and familiar sequence items incorporate an increased encoding load for
one condition (new and deviant), and a reliance on expectation for the other (familiar), I directly
compared the neural responses to new and deviant sequence items to allow me to dissociate the
ERP correlates of unexpectedness from those of mere novelty. I computed ERPs to segment five of
Flip trials on both presentation one and presentation four. I will refer to these two segments as
Flip p1s5 and Flip p4s5. On presentation one, subjects cannot predict the segment’s direction of
motion, nor do they expect to be able to predict it, while on presentation four, subjects do have
1The sudden onset of disk motion at the start of the first segment evokes a neural response that is very differentfrom that to the other segments (Agam & Sekuler, 2007).
54
CHAPTER 3. P300 TO VIOLATIONS OF NEWLY LEARNED PREDICTIONS
predictions about the disk’s motion, which are violated by the deviant stimulus.
3.3 Results
3.3.1 Behavioral accuracy
Figure 3.3A shows subjects’ reproduction accuracy on Congruent trials. I ran a 5×4 repeated-
measures ANOVA with factors segment (one, two, three, four, or five) and presentation (one,
two, three, or four). There was a main effect of segment (F(4,44) = 8.955, p = .003, partial
η2 = .449), and a segment by presentation interaction (F(12,132) = 6.310, p = .048, partial η2 =
.364). There was also a main effect of presentation (F3,33) = 38.866, p < .001, partial η2 = .779).
Follow-up analyses showed that the improvement in reproduction accuracy from presentation
one to presentation two was significant (F(1,11) = 62.654, p < .001, partial η2 = .850), but the
changes from presentation two to three and from three to four were not.
I also compared mean directional error on Flip and Congruent trials on presentation four
(that is, the presentation on which these two trial types differed). I found no significant main effect
of condition (Flip vs. Congruent, F(1,11) = 2.554, p = .138, partial η2 = .189), as shown in Figure
3.3B. These results replicate a central finding of Maryott et al. (2011) and confirm that subjects
can successfully incorporate unexpected events into their planning and execution of a reproduced
motion sequence.
3.3.2 ERPs
Figure 3.4 illustrates the changes in neural activity that accompany learning, as a stimulus goes
from being new to being familiar over the course of four successive presentations. A data-driven
clustering and permutation algorithm identified the cluster of electrodes that best captured (p = .098)
55
CHAPTER 3. P300 TO VIOLATIONS OF NEWLY LEARNED PREDICTIONS
10
20
30
40
1 2 3 4 5
Erro
r in
repr
oduc
tion
(°)
Segment serial position
1st
2nd
3rd
4th
Presentation
10
20
30
40
1 2 3 4 5
Erro
r in
repr
oduc
tion
(°)
Segment serial position
Flip
Congruent
A
B
Figure 3.3: A: Mean directional error on Congruent trials across the five segments in a motionsequence. Data are displayed separately for each of the four presentations of a sequence. Subjects’accuracy improved over repeated presentations. B: Mean directional error on Flip and Congruenttrials for the fourth (final) presentation. The two conditions do not differ significantly. Error barsare repeated-measures standard error (Morey, 2008).
the change in scalp voltage topographies between pres1 and pres2. This cluster was derived using
tcritical = 2.201, the critical t-value at α = .05, df = 11. The cluster consisted of nine electrodes
that were more positive-going on pres1 than pres2 during an interval from 200–280 ms after disk
motion onset. The inset in Figure 3.4 depicts the distribution of t-scores between ERPs to the two
presentations during that time window and the locations of the electrodes comprising this cluster;
56
CHAPTER 3. P300 TO VIOLATIONS OF NEWLY LEARNED PREDICTIONS
−200 ms −196 ms
−8
−4
0
4
8
t−scores
1 μV
100 ms
8
-8t200-280 ms
disk motion onset 200-280 ms
pres1pres2
pres3pres4
Figure 3.4: ERPs, collapsed across segments two through five, to pres1, pres2, pres3, and pres4 onCongruent trials, at a cluster of central electrodes. The inset topographical plot shows the locationsof the nine electrodes making up the significant cluster, and the distribution across the scalp of tvalues at 200–280 ms after disk motion onset. The traces show ERPs at the cluster, timelocked todisk motion onset of each segment. Error ribbons are within-subject standard error (Morey, 2008),and thus do not reflect the process or results of the permutation significance test.
57
CHAPTER 3. P300 TO VIOLATIONS OF NEWLY LEARNED PREDICTIONS
−200 ms −196 ms
−8
−4
0
4
8
t−scores
1 μV
100 ms
Flip_p4s5(deviant)
Con_p4s5(familiar)
8
-8t396-448 ms
disk motion onset 396-448 ms
Figure 3.5: ERPs to Con p4s5 (familiar) and Flip p4s5 (deviant) at a cluster of central electrodes.The inset topographical plot shows the locations of the 18 electrodes making up the significantcluster, and the distribution across the scalp of t values at 396–448 ms after disk motion onset. Thetraces show ERPs at the cluster, timelocked to disk motion onset of each segment.
the traces in the main body of the figure show ERPs at this cluster, timelocked to each segment’s
disk motion onset, for pres1, pres2, pres3, and pres4.
Figure 3.5 illustrates the differences between neural activity that accompanies familiar sequence
item and that accompanying deviant items. I used the same approach as above to identify the cluster
of electrodes that best captures (p < .001) the difference between Flip p4s5 and Con p4s5 (i.e.,
deviant and familiar sequence items). This cluster was derived using tcritical = 3.106, the critical
t-value at α = .01, df = 11. The resulting cluster consisted of 18 electrodes that were more
positive-going to the deviant item than to the familiar item from 396–448 ms after disk motion
onset. The inset in Figure 3.5 depicts the distribution of t-scores between ERPs to Flip p4s5 and to
58
CHAPTER 3. P300 TO VIOLATIONS OF NEWLY LEARNED PREDICTIONS
Con p4s5 segments during that time window and the locations of the electrodes comprising this
cluster; the traces in the main body of the figure depict ERPs at the cluster, timelocked to disk
motion onset of segment five, for Flip p4s5 (deviant) and Con p4s5 (familiar).
Figure 3.6 highlights the differences between neural response to new and to deviant sequence
items. Using the clustering algorithm described above, I identified two clusters of electrodes that
captured differences between a new item, Flip p1s5, and a deviant item in the same sequential
position, Flip p4s5. These clusters were derived using tcritical = 2.201, the critical t-value at
α = .05, df = 11. Figure 3.6A depicts the first (by time) resulting cluster (p = .025), which
consisted of 16 electrodes that were more negative-going to the deviant sequence item than to the
new item from 244–344 ms after disk motion onset for the segment. The inset of Figure 3.6A shows
the distribution of t-scores between ERPs to the two segments during that time window (negative
values imply Flip p1s5 is more positive) and the locations of the electrodes comprising this cluster;
the traces in the main body of Figure 3.6A depict ERPs at that cluster, time locked to disk motion
onset. I included traces of the ERP to the equivalent familiar segment, Con p4s5, for comparison.
The second cluster (p = .002) consisted of 26 electrodes that were more positive-going to deviant
items than to new items from 376–468 ms after disk motion onset for the segment. The inset of
Figure 3.6B depicts the distribution of t-scores between the two conditions during this time window
and the locations of the electrodes comprising this cluster. The traces in the main body of Figure
3.6B show ERPs at the cluster, time locked to disk motion onset.
Note that the two clusters depicted in Figure 3.6 likely capture two different ERP sub-components.
The enhanced positive-going response to new motion segments (Flip p1s5) relative to familiar
motion segments (Con p4s5) is only evident at the centro-parietal electrodes of the cluster shown in
Figure 3.6A. Further, it onsets earlier than the response to deviant segments (Flip p4s5), although
the response to deviant segments peaks substantially higher than that to new segments. In contrast,
the enhanced positive-going response to deviant segments is evident at both clusters. Over fronto-
59
CHAPTER 3. P300 TO VIOLATIONS OF NEWLY LEARNED PREDICTIONS
−200 ms −196 ms
−8
−4
0
4
8
t−scores
−200 ms −196 ms
−8
−4
0
4
8
t−scores
1 μV
100 ms
244-344 ms
disk motion onset
376-468 ms
244-344 ms
A 8
-8t
B
disk motion onset 376-468 ms
8
-8t
1 μV
100 ms
Flip_p1s5 (new)Flip_p4s5 (deviant)
Con_p4s5 (familiar)
Figure 3.6: ERPs to Flip p1s5 (new), Flip p4s5 (deviant), and Con p4s5 (familiar) at two clusters.The inset in each panel shows the locations of the electrodes making up each cluster and itsdistribution of t-scores; the traces show ERPs at the cluster to each segment. A: A slightly right-lateralized centro-parietal cluster composed of 16 electrodes that were more negative-going onFlip p4s5 (deviant) than on Flip p1s5 (new) from 244–344 ms after disk motion onset for thesegment. B: A fronto-central cluster composed of 26 electrodes that were more positive-going onFlip p4s5 (deviant) than on Flip p1s5 (new) from 376–468 ms after disk motion onset.
60
CHAPTER 3. P300 TO VIOLATIONS OF NEWLY LEARNED PREDICTIONS
central electrodes (Figure 3.6B), its amplitude is substantially larger than that of the ERP to new or
familiar items, although it onsets at the same time.
3.4 Discussion
I recorded high-density scalp EEG while subjects performed a visuomotor sequence-learning task,
with occasional deviant elements inserted into familiar sequences. Subjects successfully reproduced
the sequences, including the deviant items. Because the sequence changed every 60-90 seconds,
subjects had to dynamically update their representation of the relevant sequential structure. In
order to characterize the neural mechanisms that respond to unpredictability within newly-learned
sequences, I measured ERPs to novel, familiar, and deviant sequence items. As compared to familiar,
well-predicted sequence items, both new and deviant items evoked an enhanced centralized P3, as
identified by a data-driven, bottom-up analysis approach. For new elements, this P3 was consistently
timed and over fronto-central electrodes, while the deviant elements evoked a P3 over fronto-central
electrodes as well as a slightly later positivity over centro-parietal electrodes.
These results fill an important gap in the previous work on the neural response to sequence
deviants, most of which has been done using a serial reaction time task (SRTT). In the SRTT,
subjects make key presses to a stream of letters or other stimuli (Nissen & Bullemer, 1987). When
a repeating sequence of letters is embedded within the stream, subjects respond more quickly
to those items, and some subjects develop explicit knowledge of the sequence’s presence and
structure. Subjects who have such explicit knowledge show an enhanced P3 to letters that violate
the established sequence, but subjects without such knowledge do not (Ferdinand et al., 2008;
Schlaghecken et al., 2000). Similarly, subjects who are instructed that an underlying sequence exists
show an enhanced P3 (Russeler et al., 2003).
Russeler and Rosler (2000) created an SRTT variant in which multiple stimuli mapped onto each
61
CHAPTER 3. P300 TO VIOLATIONS OF NEWLY LEARNED PREDICTIONS
response key, so that they could separate the neural response to perceptual deviants (wrong letter
but same response) from that to perceptual + motor deviants (wrong letter AND different keypress).
They found an enhanced P3 only to the latter category of sequence deviants, and concluded that
the P3 effect reflects the need to change or update a response rather than the mere detection of
an unexpected stimulus. In my task, participants are explicitly aware of the sequential structure,
as their task is to memorize and reproduce it. Thus, deviant sequence items in my study require
participants to change a planned motor output as well as to encode information about the new
event. My findings of P3 enhancement to deviant sequence items is consistent with Russeler and
Rosler’s hypothesis. Interestingly, de Bruijn, Schubotz, and Ullsperger (2007) showed participants
slideshows of everyday actions, with or without execution errors, and found an enhanced P300 for
observed errors relative to correct executions. In that task, subjects are presumably drawing on their
previous knowledge about the sequential structure of the familiar, everyday actions; observed errors
deviate from this structure but do not require a response from the participant.
Two distinct subcomponents with very similar timing have ben identified in the P300 literature
(Polich, 2007; Linden, 2005; Goldstein et al., 2002; Spencer, Dien, & Donchin, 2001; N. K. Squires
et al., 1975). The slightly earlier subcomponent, the Novelty P3, is centered over fronto-central
electrodes, and is described as elicited by deviant stimuli that are salient, low-probability, and
do not require a response (Goldstein et al., 2002). The second subcomponent, the P300, is more
posterior, centered over centro-parietal electrodes. The P300 is elicited by task-relevant stimuli such
as targets. Deviant sequence elements in the SRTT enhance the P300 (also sometimes called P3b)
subcomponent (Ferdinand et al., 2008; Russeler et al., 2003; Schlaghecken et al., 2000).
My results support the presence of separable P3 subcomponents using a clustering approach
to separate ERPs to novel from deviant items. One, a centro-parietal ERP peaking around 400 ms
after disk motion onset, is enhanced by new and deviant items in comparison to familiar ones. This
response appears to be parallel to the P300, as both new and deviant sequence items are task-relevant.
62
CHAPTER 3. P300 TO VIOLATIONS OF NEWLY LEARNED PREDICTIONS
This activity most likely reflects the perception and encoding of a task-relevant stimulus (which
subjects must do for both new and deviant items). The second subcomponent is a fronto-central
ERP, also peaking about 400 ms after disk motion onset. At these electrodes, the ERP to novel and
familiar sequence items are indistinguishable, but the P3-like peak to deviant sequence items is
substantially enhanced. This response likely parallels the P3a or Novelty P3 subcomponent and
suggests that “novel” may be better characterized as “expectation-violating” (Vossel, Weidner, Thiel,
& Fink, 2009; Yu & Dayan, 2005). This ERP may specifically reflect inhibition or rejection of a
perceptual representation and motor plan that are actively stored in working memory.
Differences between the neural responses to new and deviant stimuli highlight the importance
of prediction-monitoring in cognition. On the surface, new and deviant items are quite similar:
both require the subject to perceive and encode a direction of motion that has not previously been
observed, in order to reproduce it from memory. The difference is in the context: When viewing a
deviant sequence item, participants have strong predictions about the disk’s direction of motion;
when viewing a new item, they have no such predictions. The differences in neural response between
these two conditions therefore reflects the effects of these predictions on perceiving and encoding
the element.
Although my results contribute to the understanding of the relationship between sequential
structure, unpredictability, and the P3, the present study has two important limitations. First, because
participants were explicitly aware of both the sequential structure of the task and the points at
which new trials (and thus new sequences) began, these findings cannot be extended to explain the
mechanisms by which people come to identify sequential structure in continuous streams of sensory
input, nor to explain the processes by which people identify changes in that structure. Second,
because the deviant items in this paradigm always differed from the familiar items by 180, I cannot
say anything about the effect of the magnitude of prediction violations. Both of these questions
will need to be investigated before the relationship between expected unpredictability, unexpected
63
CHAPTER 3. P300 TO VIOLATIONS OF NEWLY LEARNED PREDICTIONS
unpredictability, and the subcomponents of the P3 can be fully described.
In summary, I have shown that, when learning motion sequences, people show distinct neural
responses to new stimuli, familiar stimuli, and stimuli that deviate from the governing sequence.
The neural responses to a deviant sequence item are different than those to a new sequence item,
further supporting the hypothesis that identifying prediction errors is a cognitive process. Finally,
my results extend previous work on monitoring sequential regularities and show that the neural
mechanisms involved are similar when the sequential structure is dynamic and when it is stable over
time.
64
Chapter 4
Leaky ignoring in the flanker task
4.1 Introduction
The human brain is sensitive to events that violate its explicit and implicit predictions about
forthcoming sensory inputs (Ferdinand et al., 2008; Nassar et al., 2012; Ouden et al., 2009; Schultz
et al., 1997). As I showed in Chapter 3, humans detect such prediction-violating events and
incorporate them into working memory representations in a task that requires explicit reproduction
of visuomotor sequences. In this chapter, I directly investigate the effect of such events on attention.
In associative learning or other explicitly predictive tasks, prediction-violating events trigger the
allocation of attention (Wills et al., 2007). Further, as is familiar from everyday experience, when
an unattended stimulus stream contains an event that is unexpected, that event becomes distracting,
impairing performance on the attended task (e.g. Escera et al., 1998; Parmentier et al., 2008;
Schroger, 1996).
The Eriksen flanker task is frequently used to investigate the cognitive processes involved in
response conflict and spatial attention (Eriksen & Eriksen, 1974; Ridderinkhof et al., 1999). In
this paradigm, a central target is surrounded by flanking distractors which may be congruent with
65
CHAPTER 4. LEAKY IGNORING IN THE FLANKER TASK
(i.e., match) the target or be incongruent with it. The classic finding from this task is that, despite
subjects’ attempts to ignore the flanking distractors, flankers that are incongruent with the central
target interfere with processing, leading to reduced speed and accuracy on those trials.
I modified the flanker task to incorporate unexpected events among the unattended flankers.
I hypothesized that the behavioral effect of the flankers should be magnified on trials with those
unexpected flanker events, due to unexpected events’ obligatory attraction of attentional resources.
To characterize the effects of unexpected flankers, I compared ERPs to the onset of stimuli
containing expected and unexpected flankers. The visual mismatch negativity (vMMN) is an early,
negative-going deflection in the ERP that occurs in response to occasional deviant elements within
a sequence of visual stimuli that obey some regularity (Czigler, 2007; Pazo-Alvarez, Cadaveira,
& Amenedo, 2003). It is parallel to the well-established auditory MMN, which is theorized to
be generated in auditory cortex when a predictive signal from prefrontal areas is disconfirmed by
incoming sensory information (Garrido et al., 2009; Wacongne et al., 2011, 2012). The auditory
and visual MMNs do not depend on attention, and arise even when subjects are attending to other
stimulus streams, such as a listening task in one ear with a sequence of tones in the other, or a
visuomotor task w/ irrelevant stimuli in the periphery (Naatanen, Paavilainen, Titinen, Jiang, &
Alho, 1993; Stefanics et al., 2011; Winkler, Teder-Salejarvi, Horvath, Naatanen, & Sussman, 2003).
There is some evidence that the MMN reflects characteristics of stimulus processing (Kimura,
Kondo, et al., 2011; Naatanen, Paavilainen, Rinne, & Alho, 2007; Tales, Troscianko, Wilcock,
Newton, & Butler, 2002). I measured the vMMN elicited by unexpected flankers in the Eriksen
flanker task, and tested its relationship to behavioral evidence of oddball flanker interference.
I also measured individual differences in task performance and neural activity. The processes
involved in this task are likely influenced by individual differences in emotional, motor, and
attentional reactivity (Rothbart, 2007). I hypothesized that individual differences in temperament—
particularly the factors relating to perceptual sensitivity and attentional control—would predict
66
CHAPTER 4. LEAKY IGNORING IN THE FLANKER TASK
individual susceptibility to unexpected flankers.
Finally, I investigated the mechanisms linking the neural response to a prediction error with
changes in behavior. Error-induced behavior changes are theorized to rely on on the low-level
motivational systems that drive organisms to avoid aversive experiences (Hajcak & Foti, 2008;
Holroyd & Coles, 2002; Ridderinkhof, Ramautar, & Wijnen, 2009), and responses to prediction
errors may be similarly organized. A recent study found that the human startle response is slightly
potentiated after an incorrect response (Hajcak & Foti, 2008). I thus attempted to replicate their
finding, and investigated whether similar startle potentiation accompanies unexpected flanker events.
4.2 Methods
4.2.1 Subjects
Twenty members of the Brandeis University community (15 female, age range 18-21) participated
in this study. All were right-handed (mean score on the revised Edinburgh Handedness Inventory
89.49, SD = 12.85). Two other subjects completed one experimental session but did not return for
the second; their data are not reported here.
4.2.2 Experimental task
I designed a modified Eriksen flankers task using chevron stimuli (Eriksen & Eriksen, 1974;
Ridderinkhof et al., 1999). The basic trial structure is shown schematically in Figure 4.1. On each
trial, subjects were presented with an array of five chevrons that were displayed for 50 ms and were
not masked upon offset. They were instructed to report whether the central chevron was pointing
to the left or to the right. I will refer to this central chevron as the target, and the two chevrons on
each side of it as the flankers. The four flankers were always consistently oriented, and the central
67
CHAPTER 4. LEAKY IGNORING IN THE FLANKER TASK
> > < > >+
50 ms
Stimulus Response
< 2000 ms
A
A
> > < > >+
50 ms
Stimulus Response
“L” or “R”keypress
< 2000 ms
350 ms
Startle Probe
> > < > >+
50 ms
Stimulus Response
< 2000 ms
B
105 dB50 ms
Startle Probe
300 ms
“L” or “R”keypress
“L” or “R”keypress
105 dB50 ms
Figure 4.1: The sequence of events within a trial with no startle probe. After fixation, the flankerstimulus appeared for 50 ms, after which participants had two seconds to report the direction of thecenter chevron.
chevron’s orientation was equiprobably congruent or incongruent with its flankers. After a subject’s
response, a fixation cross was displayed for an inter-trial interval of 1000 ms before the next trial
display appeared.
Subjects viewed the display from a difference of approximately 57 cm. Each chevron subtended
approximately 1.40 visual angle, and the full array extended to an eccentricity of 4.67.
In order to maintain more-consistent error levels across subjects and conditions, subjects received
feedback about their performance after every thirty trials (after Hajcak & Foti, 2008). If the subject
had responded correctly on between 75% and 90% of those trials, the feedback was “You’re doing
great!” If their accuracy was lower than 75%, the feedback instructed them to increase their
accuracy; if it was above 90%, the feedback instructed them to respond more quickly.
Trial types were distributed among four conditions in a two-by-two design, as shown in Figure
4.2. The first factor governed the direction of the four flanker chevrons. On ninety percent of trials,
the flanker chevrons pointed in one direction (the Standard direction) and on ten percent of trials
they pointed the other (the Oddball direction). The second factor governed the relationship between
68
CHAPTER 4. LEAKY IGNORING IN THE FLANKER TASK
> > > > >< < < < <
< < > < < > > < > >
90% left flankers
Standard Oddball
Congruent
Incongruent
45%
45%
5%
5%
< < < < <
< < > < <> > < > >
> > > > >
90% right flankers
Standard Oddball
Congruent
Incongruent
45%
45%
5%
5%
Figure 4.2: Diagram of the two-by-two trial design. One factor, flanker direction, governedthe direction of the four flanker chevrons; the second, congruency, governed the relationshipbetween the central target and its flankers. Whether the Standard flanker direction was left orright was counterbalanced within subjects. 90% of trials incorporated the Standard flankers(45% Congruent and 45% Incongruent); 10% of trials incorporated the Oddball flankers (5%Congruent and 5% Incongruent).
the central target and the flankers. On half of trials, the target was Congruent with the flankers, and
on half it was Incongruent. Note that left-facing and right-facing target were equally frequent, and
the directions comprising Standard and Oddball flankers were counterbalanced within subjects.
4.2.3 Startle probes
In order to elicit startle responses, occasional trials included a 105 dB burst of white noise lasting
50 ms (Blumenthal et al., 2005; Hajcak & Foti, 2008). These startle probes were presented through
a single speaker resting on the display monitor. As one aim of this study was to compare response-
induced to stimulus-induced startle priming, participants completed four blocks of trials. Two of
these blocks included stimulus-aligned startle probes; the other two included response-aligned
69
CHAPTER 4. LEAKY IGNORING IN THE FLANKER TASK
> > < > >+
50 ms
Stimulus Response
< 2000 ms
A
A
> > < > >+
50 ms
Stimulus Response
“L” or “R”keypress
< 2000 ms
350 ms
Startle Probe
> > < > >+
50 ms
Stimulus Response
< 2000 ms
B
105 dB50 ms
Startle Probe
300 ms
“L” or “R”keypress
“L” or “R”keypress
105 dB50 ms
Figure 4.3: A: The sequence of events within one trial with a stimulus-aligned startle probe. The50 ms noise burst played 350 ms after stimulus onset. See page 71 for a description of the rulesgoverning whether startle probes occurred. B: The sequence of events within one trial with aresponse-aligned startle probe. Subjects had to respond within two seconds of stimulus onset; thenoise burst played 300 ms after their response.
70
CHAPTER 4. LEAKY IGNORING IN THE FLANKER TASK
startle probes. Block order was counter-balanced across participants using all possible orders.
In the stimulus-aligned condition, startle probes occurred 350 ms after stimulus onset and were
contingent on the flanker direction of the stimulus displayed on that trial (Figure 4.3A). In these
blocks, startle probes occurred on 50% of Oddball trials, 50% of Standard trials immediately
following an Oddball trial, and 4% of all other Standard trials. No explicit constraint on was placed
on startle probes on successive trials. The two separate rules for Standard trials control for the
predictability of the Oddball-contingent startle probes. On average, blocks with stimulus-aligned
startle probes featured 64 probes per 510-trial block.
In the response-aligned condition, startle probes occurred 300 ms after the subject’s response,
and were contingent on whether the subject responded correctly (Figure 4.3B). In these blocks,
startle probes occurred on 50% of error trials, on 50% of correct trials following an error trial, and
on 4% of all other correct trials. On average, blocks with response-aligned startle probes featured
73 probes per 510-trial block.
To assess whether unexpected stimuli and unexpected responses similarly recruit defensive
alerting mechanisms, I measured the eye blink response to startle probes following Oddball
and Standard stimuli, and the response to probes following Error and Correct responses. The
magnitude of subjects’ eye-blink responses was measured via electromyography (EMG), using a
subset of the electrodes on the EEG cap. Page 74 describes details of the recording and analysis.
4.2.4 Procedures and analyses
Each recording block comprised 510 trials, with the first thirty discarded as practice. Each subject
completed two blocks with stimulus-aligned startle probes (one with left-facing flankers as the
Standard direction, the other with right-facing flankers) and two blocks with response-aligned
startle probes. These four blocks were completed in two separate recording sessions; the order of
71
CHAPTER 4. LEAKY IGNORING IN THE FLANKER TASK
blocks was counterbalanced across subjects using all possible orders. By the end of the experiment,
each subject had completed 2040 trials, 1920 of which were included for analysis.
Subjects filled out an anonymous questionnaire after each recording session, confirming that
they got reasonable amounts of sleep, were not under the influence of any psychoactive substances,
and had no medical history, such as a head injury or neurological diagnosis, that should lead me to
exclude their data.
Behavioral measures
Subjects’ reaction times and responses were recorded from each trial and analyzed. I computed
accuracy for each of Congruent Standard, Incongruent Standard, Congruent Oddball, and
Incongruent Oddball conditions. I also computed decile reaction times and accuracy for all
four conditions, computed ex-Gaussian fits to reaction time distributions (Balota, Yap, Cortese, &
Watson, 2008), and measured post-error slowing—the difference in reaction time between the trial
following an error and the trial preceding an error (Dutilh et al., 2012).
After the end of their final experimental session, subjects completed the Adult Temperament
Questionnaire Short Form (ATQ). This instrument has 77 items which form several self-report
scales describing temperament factors (Evans & Rothbart, 2007). I selected two factors a priori for
testing: Attentional Control and Orienting Sensitivity. Attentional Control refers to the capacity to
focus attention, and to shift attention as desired. “It’s often hard for me to alternate between two
different tasks,” is an example of a reverse-scored Attentional Control item. Orienting Sensitivity
refers to awareness of low-intensity environmental and self-generated stimuli and experiences. “I
often notice visual details in the environment,” is an example of an Orienting Sensitivity item.
I hypothesized that the Attentional Control would account for some variability in people’s task
performance, while Orienting Sensitivity would account for variability in startle responses.
72
CHAPTER 4. LEAKY IGNORING IN THE FLANKER TASK
EEG recording and measures
A high-density EEG system (Electrical Geodesics, Inc., Eugene, OR) with 129 electrodes sampled
scalp electroencephalographic signals at 250 Hz using a high-impedance amplifier. Signals were
recorded for later, off-line analysis. All channels were adjusted for scalp impedance below 50 kΩ
impedance; after one experimental block, channels were returned to at most 50 kΩ scalp impedance
before the subject completed the session.
After recording, EEG data were preprocessed using the EEGLAB Matlab toolbox (Delorme &
Makeig, 2004). Continuous EEG signals were bandpass filtered to between 0.25 and 100 Hz using
a first-order Butterworth filter. A 60 Hz notch filter was also applied to the continuous data, to
reduce line electrical noise. Then, data were broken into epochs time-locked to stimulus onset and
subject responses. Stimulus-locked epochs comprised a window from 200 ms preceding stimulus
onset to 500 ms ms after; response-locked epochs comprised 200 ms preceding keypress responses
to 500 ms after. Finally, data were cleaned using the process described in Section 3.2.6 (page 51).
Using the clustering analysis described in Section 3.2.6 (page 52), I ran two comparisons. First,
I identified time windows and electrodes that captured the differences between Standard and
Oddball stimuli, and computed the ERPs to stimulus onset at those electrodes. The mean difference
between the two ERPs during the significant time window was my measure of vMMN magnitude.
To verify that the clustering algorithm identifies valid ERP components, I also compared the
ERPs elicited by Correct and Incorrect responses. The error response negativity (ERN) is a
negative-going deflection in the ERP that occurs very shortly after incorrect responses. I identified
time windows and electrodes that captured the differences between Correct and Error responses,
and computed ERPs to subject responses at those electrodes. The mean difference between the two
ERPs during the significant time window was my measure of ERN magnitude.
73
CHAPTER 4. LEAKY IGNORING IN THE FLANKER TASK
EMG recording and measures
EMG data were recorded simultaneously with the EEG data, using electrodes 14, 21, 126, and
127 of the sensor net. These electrodes are positioned directly above and below the eyes, and
are designed to capture the vertical/blink component of eye-movement EMG. Using a 4th-order
Butterworth filter, the data were bandpass filtered to 28–100 Hz and broken into epochs from 200
ms preceding each startle probe to 350 ms after its onset. Epoched data were rectified, low-pass
filtered at 50 Hz and visually inspected for non-blink artifacts (Blumenthal et al., 2005; Hajcak
& Foti, 2008). Artifact-contaminated epochs were removed from the processing stream. Cleaned
data were baseline-corrected using the 200 ms immediately preceding each startle probe, and blink
magnitude was operationalized as the maximum voltage within a time window from 20–150 ms
after a startle probe.
4.3 Results
4.3.1 Behavioral measures
Figure 4.4 shows the effects of congruency and expectedness on subjects’ speed and accuracy
in the flankers task. On trials in which the flankers were in the Standard condition (presented
in blue in Figure 4.4), subjects were faster and more accurate on Congruent trials (proportion
correct 0.93, SD = 0.04; median correct reaction time 356 ms, SD = 33) than on Incongruent
trials (proportion correct 0.86, SD = 0.05; median correct reaction time 378 ms, SD = 37), with the
largest accuracy differences occurring on trials where subjects responded quickly. This effect was
exaggerated when the flankers were in the Oddball condition (presented in green in Figure 4.4).
On Oddball Congruent trials, subjects were faster and more accurate (proportion correct 0.95, SD
= 0.04; median correct reaction time 343 ms, SD = 31) than on Standard Congruent trials, and on
74
CHAPTER 4. LEAKY IGNORING IN THE FLANKER TASK
0.25
0.50
0.75
1.00
0.1 0.2 0.3 0.4 0.5 0.6 0.7
Prop
ortio
n co
rrect
Reaction time (s)
Oddball Congruent
Standard Congruent
Standard Incongruent
Oddball Incongruent
0.50
0.75
1.00
Congruent Incongruent
Prop
ortio
n co
rrect
StandardsOddballs
0.30
0.35
0.40
0.45
Congruent Incongruent
Med
ian
reac
tion
time
(s)
Figure 4.4: (A) Reaction time and proportion correct for each decile of Standard (blue) and Oddball(green) trials, when the flankers are Congruent (solid lines) and Incongruent (dashed lines). (B)Proportion correct (left) and median reaction times (right) for each condition, summarizing theabove. Error bars are repeated-measures standard error.
75
CHAPTER 4. LEAKY IGNORING IN THE FLANKER TASK
Table 4.1: ANOVA results for ex-Gaussian fits to RT distributions.
parameter source df1, df2 F partial η2
µ
Flanker 1,19 10.976∗∗ .366
Congruency 1,19 0.015 .015
Flanker × Congruency 1,19 2.202 .104
σ
Flanker 1,19 0.848 .043
Congruency 1,19 25.323∗∗∗ .571
Flanker × Congruency 1,19 3.343 .150
τ
Flanker 1, 19 11.425∗∗ .376
Congruency 1,19 0.094 .005
Flanker × Congruency 1,19 0.004 .000
∗p < .05. ∗∗p < .01. ∗∗∗p < .001.
Oddball Incongruent trials, subjects were slowest and least accurate (proportion correct 0.59, SD
= 0.16; median correct reaction time 427 ms, SD = 47).
These results were confirmed by two-way repeated measures ANOVAs with factors congruency
and flanker direction. There was a main effect of congruency on proportion correct (F(1,19) =
94.211, p < .001), a main effect of flanker direction on proportion correct (F(1,19) = 80.019,
p < .001), and a congruency × flanker direction interaction (F(1,19) = 101.809, p < .001).
There was a significant main effect of congruency on median correct reaction time (F(1,19) =
92.244, p < .001), no main effect of flanker direction on median correct reaction time (F(1,19) =
0.026, p = .875), but a significant congruency × flanker direction interaction (F(1,19) = 30.917,
p < .001).
To further characterize the reaction times, I fit ex-Gaussian curves to the reaction time distribu-
tion generated by each subject on each condition. These curves have three parameters: τ , which
characterizes the positive tail of the distribution (the exponential component); µ, the mean of the
76
CHAPTER 4. LEAKY IGNORING IN THE FLANKER TASK
0
0.2
0.4
Congruent Incongruent
τ (s)
0
0.5
1.0
Congruent Incongruent
µ (s
)
0
0.1
Congruent Incongruent
σ (s
)
StandardsOddballs
Figure 4.5: Ex-Gaussian fits to reaction time distributions for each stimulus condition. A: Oddballflanker directions increased µ relative to Standard. B: Incongruent trials had higher σ thanCongruent trials. C: Standard flanker directions increased τ relative to Oddball trials.
77
CHAPTER 4. LEAKY IGNORING IN THE FLANKER TASK
Gaussian component, and σ, the standard deviation of the Gaussian component. Figure 4.5 shows
the results. Two-way repeated-measures ANOVAs on each parameter confirmed the effects. There
was a main effect of flanker on µ (p = .037), with unexpected flanker directions shifting the entire
reaction time distribution slower relative to expected flanker directions, and on τ , with unexpected
flanker directions actually decreasing the skewness of the RT distribution. (While this effect seems
paradoxical at first, it is due to the two-second timeout for subjects’ responses. With such a limit, as
the distribution shifts right, it must become less skewed.) There was a main effect of congruency on
σ, suggesting that incongruent stimuli increased the variability of subjects’ reaction times relative
to congruent stimuli. Complete ANOVA results are shown in Table 4.1.
There was substantial post-error slowing in this experiment. On average, subjects responded
30 ms more slowly (SD = 24 ms) on the trial following an error than on the trial preceding one. I
ran a three-way repeated-measures ANOVA on this post-error slowing with factors congruency,
flanker direction, and startle probe timing. There was a marginal main effect of startle probe
timing, with subjects exhibiting less post-error slowing on blocks with with stimulus-contingent
startle (26 ms, SD = 18 ms) than on blocks with response-contingent-startle (42 ms, SD = 38 ms;
F(1,19) = 4.031, p = .059). There was no main effect of congruency or flanker direction (p > .5)
and no significant interactions (all ps > .08).
On average, subjects scored near the center of the temperament self-report range from the ATQ.
The mean Perceptual Sensitivity score was 5.00 (SD = 0.86), and the mean Attentional Control
score was 4.03 (SD = 0.91). There was a non-significant negative correlation between Attentional
Control and proportion correct (r = −0.291, p = .213),
78
CHAPTER 4. LEAKY IGNORING IN THE FLANKER TASK−200 ms 0 ms
−8
−4
0
4
8
t−scores
-2
0
2
-200 -100 0 100 200 300 400 500
ampl
itude
(μV)
Time (ms)
StandardOddball
-0.5
0
0.5
-200 -100 0 100 200 300 400 500
Diffe
renc
e (µ
V)
Time (ms)
t
-8
8
0
A
B
C
Figure 4.6: ERPs to Standard and Oddball flanker directions at a cluster of posterior electrodes.A: Topographical plot showing the locations of the 16 electrodes making up the cluster, and thedistribution across the scalp of t-values at 180–320 ms after stimulus onset. B: Traces show ERPstime locked to stimulus onset for Standard (shown in blue) and Oddball (green) flankers, at thecluster of electrodes shown in panel A. C: Trace depicts the difference between the two traces shownin panel B. Negative values occurred when ERPs to Oddball flankers were more negative-goingthan those to Standard flankers. Error ribbons are within-subject standard error, and thus do notreflect the process or the results of the permutation significance test.
79
CHAPTER 4. LEAKY IGNORING IN THE FLANKER TASK
4.3.2 ERPs
Standard vs. Oddball
Figure 4.6 illustrates the neural activity that accompanied the presentation of Standard and Oddball
flanker directions. A data-driven clustering and permutation analysis (Maris & Oostenveld, 2007;
see page 52) identified the cluster of electrodes that best captured (p < .001) the difference in
scalp voltage topographies between Standard and Oddball flankers. This cluster was derived with
tcritical = 3.883, the critical t-value at α = .001 and df = 19. The cluster was composed of 16
posterior electrodes that were more negative-going on Oddball than on Standard trials during an
interval from 180–320 ms after stimulus onset. Figure 4.6A depicts the locations of the electrodes
making up the cluster and the distribution of t-scores between ERPs to the two flanker directions
during that time window. The traces in Figure 4.6B show ERPs at the cluster, time locked to
stimulus onset, to Standard (blue) and Oddball (green) flankers. Figure 4.6C shows the difference
between the two traces in Figure 4.6B. Negative values reflect Oddball-evoked ERPs that were
more negative-going than Standard-evoked ERPs.
I tested whether vMMN magnitude reflected subjects’ susceptibility to interference from un-
expected flanker directions. To account for differences in neural function and anatomy that are
unrelated to such susceptibility, I partialled out each subjects’ N2 magnitude to Standard flankers.
N2 magnitude was operationalized as each subject’s mean amplitude of the ERP from 180–220 ms
after stimulus onset on Standard trials. The residual vMMN values were marginally correlated
with Attentional Control score (r = −0.439, p = .053), suggesting that subjects who scored
higher on Attentional Control tended to have larger (negative) MMNs. They were also correlated
with Orienting Sensitivity score (r = 0.464, p = .039), such that subjects who scored higher on
Orienting Sensitivity tended to have smaller vMMNs. Figure 4.7 illustrates the relationship between
these three measures. Combining Attentional Control and Orienting Sensitivity in a multiple linear
80
CHAPTER 4. LEAKY IGNORING IN THE FLANKER TASK
-0.3
-0.2
-0.1
0
0.1
0.2
0.3
0.4
3 4 5 6 7
N2-c
orre
cted
vM
MN
Orienting Sensitivity Score
Bottom quartile
Top quartile
Attentional Control Score
r2 = .35p = .026
Figure 4.7: A multiple linear regression on Attentional Control and Orienting Sensitivity scorespredicts inter-individual differences in N2-corrected vMMN magnitude. Data points are plottedaccording to Orienting Sensitivity score and vMMN magnitude; note that because vMMN isnegative, lower values here correspond to larger vMMNs. The two fit lines show the relationshipbetween Orienting Sensitivity and vMMN magnitude at different ranges on the Attentional Controlfactor.
regression significantly predicted N2-corrected vMMN magnitude (R2 = 0.35, p = .026).
Residual MMN values were also correlated with proportion correct on Oddball Congruent
trials (r = −0.448, p = .048), such that those subjects who were most accurate on these trials had
larger (negative) MMNs. However, residual MMN magnitude did not correlate significantly with
proportion correct on Oddball Incongruent trials (r = 0.194, p = .412), nor did it correlate with
any reaction time measures (all ps > .08).
Correct vs. Error
Figure 4.8 depicts the neural activity that occurs following Correct and Error responses. The same
approach as above was used to identify the cluster of electrodes that best captured (p < .001) the
difference in scalp voltage topographies following Correct and Error responses. The cluster was
81
CHAPTER 4. LEAKY IGNORING IN THE FLANKER TASK−200 ms 0 ms
−8
−4
0
4
8
t−scores
-4
0
4
-200 -100 0 100 200 300 400 500
Diffe
renc
e (µ
V)
Time (ms)
-2
0
2
-200 -100 0 100 200 300 400 500
ampl
itude
(µV)
Time (ms)
t
-8
8
0
A
B
C
CorrectError
Figure 4.8: ERPs to Correct and Error responses at a cluster of central electrodes. A: Topographicalplot showing the locations of the 33 electrodes making up the cluster, and the distribution acrossthe scalp of t-values from 100 ms before subject responses to 80 ms after. B: Traces show ERPstime locked to responses for Correct (shown in yellow) and Error (red) responses, at the cluster ofelectrodes shown in panel A. C: Trace depicts the difference between the two traces shown in panelB. Negative values occurred when ERPs to Error responses were more negative-going than thoseto Correct responses.
82
CHAPTER 4. LEAKY IGNORING IN THE FLANKER TASK
0
5
10
Star
tle b
link
mag
nitud
e (µ
V)
CorrectUnpredictable
CorrectPredictable
ErrorStandardUnpredictable
StandardPredictable
Oddball0
5
10
Star
tle b
link
mag
nitud
e (µ
V)
Integral over 40 ms around peak
Figure 4.9: Magnitude of the blink EMG response to startle probes following Standard (blue)and Oddball (green) flankers, and to startle probes following Correct (yellow) and Error (red)responses.
derived using tcritical = 5.949, the critical t-value at α = .00001 and df = 19. This cluster was
composed of 33 central electrodes that were more negative-going on Error than on Correct trials,
during a window from 100 ms before participants’ responses to 80 ms after. Figure 4.8A illustrates
the distribution of t-scores between ERPs to Correct and Error responses during that time window,
and the locations of the electrodes making up this cluster. The traces in Figure 4.8B show ERPs at
the cluster, time locked to subjects’ responses, for Correct (yellow) and Error (red) trials. Figure
4.8C shows the difference between the two traces in Figure 4.8B. Negative values reflect Error
ERPs that are more negative-going than Correct ERPs.
4.3.3 Startle blink
On average, subjects’ mean startle blink magnitude was 7.676 µV (SD = 6.556). Figure 4.9 shows
blink magnitudes for each startle probe condition. Because there was no significant difference
between the two Standard conditions, nor between the two Correct conditions (p > .06), I
collapsed across Standard Unpredictable and Predictable, and across Correct Unpredictable
83
CHAPTER 4. LEAKY IGNORING IN THE FLANKER TASK
and Predictable. Startle blinks were of almost identical magnitude following Oddball flankers
(7.19 µV, SD = 6.84) than following Standard flankers (7.19 µV, SD = 6.80). This difference was
not significant (paired t(19) = 0.010, p = .993). Startle blinks following Error responses (7.95 µV,
SD = 7.47) were very slightly smaller than those following Correct responses ( 8.26 µV, SD =
6.32). This difference was not significant (paired t(19) = 0.496, p = .626).
As I hypothesized, average startle blink magnitude was significantly positively correlated with
subjects’ Orienting Sensitivity score on the ATQ (r = 0.539, p = .010). However, startle blink
magnitude and startle priming were not correlated with any of the ERP measures (all ps > .1).
Interestingly, subjects’ Orienting Sensitivity scores were also significantly correlated with the effects
of Oddball flankers on exGaussian fits to reaction times. There was a significant positive correlation
between Orienting Sensitivity score and the flanker direction × congruency interaction on µ (the
mean of the Gaussian component; r = 0.482, p = .031) and a significant negative correlation
between Orienting Sensitivity score and the interaction on τ (the exponent component; r = −0.491,
p = .028). This suggests that Orienting Sensitivity captures subjects susceptibility to the Oddball
flankers, as well as their overall sensitivity to the startle probes.
4.4 Discussion
I modified the Eriksen flanker task by manipulating flanker frequency to create Standard and
Oddball flanker directions. As expected, I replicated the usual flanker congruency effect: subjects
were faster and more accurate when flankers were congruent with the target than when the flankers
were incongruent. This effect was markedly larger for Oddball flankers than for Standard flankers.
On trials with Oddball flankers that were congruent with the target, subjects were fastest and most
accurate, while on trials with Oddball flankers that were incongruent with the target, subjects were
worst.
84
CHAPTER 4. LEAKY IGNORING IN THE FLANKER TASK
Several groups of researchers have proposed competing models of the interference between
flankers and targets. Yu, Dayan, and Cohen (2009) proposed a Bayesian account where subjects’
priors incorporated a bias for congruency among spatially proximate regions. White et al. (2011,
2012) proposed a diffusion model in which the spotlight of spatial attention changes size over time.
The sudden onset of the stimulus requires attention to be spread across the entire display; over
time, it contracts to encompass only the target. Neither of these models is capable of describing
the differences in my data between Standard and Oddball flankers without adding a parameter
describing subjects’ expectations about flanker directions. Such expectations facilitate suppression
of incongruent flankers when they are as predicted, but not when they are deviant. The behavioral
evidence from this experiment thus supports the hypothesis that unexpected events obligatorily
capture attention (Parmentier, Elsley, Andres, & Barcelo, 2011; Schroger, 1996), enhancing the
flanker congruency effect.
This same hypothesis was further supported by the visual mismatch negativity (vMMN) elicited
by the Oddball flankers. This ERP component is thought to reflect a potentiation of the neural
response to stimuli that do not match a predictive feedback signal sent from higher cortical areas to
the sensory cortices (Garrido et al., 2009; Wacongne et al., 2012, 2011). The presence of a vMMN
to the Oddball flankers confirm that subjects’ brains are sensitive to the regularity governing flanker
direction, despite the fact that most subjects did not express awareness of that regularity.
I assessed whether subjects’ self-reported temperament factors could predict the effects of
Oddball flankers. Two factors of temperament—Attentional Control and Orienting Sensitivity—
together predicted about a third of the variance in vMMN magnitude. It appears that high Attentional
Control ability facilitates predictive coding, leading to larger differences between Standards and
Oddballs. Similarly, high Orienting Sensitivity impairs ignoring of the Standards, reducing such
differences.
Future work should explore whether temperament predicts the relative strength of bottom-up
85
CHAPTER 4. LEAKY IGNORING IN THE FLANKER TASK
and top-down neural processes in other settings. Another area for future work is to identify traits
that predict individual differences in behavior in response to the Oddball flanker directions, as my
results only predict the magnitude of the neural responses. Finally, future work should investigate
the role that alpha-band EEG oscillations play in performing this modified flanker task. Such
oscillations are thought to gate incoming sensory information, reducing interference from incoming
sensory signals (Jones et al., 2010; Payne et al., 2013; Romei, Gross, & Thut, 2010).
My attempts to demonstrate startle potentiation to errors and to Oddball flankers did not result
in any measurable effect. It is possible that my strategy of measuring EMG using the oculomotor
channels of the EEG cap resulted in electrode placement that was not reliably precise, or that the
high-impedance amplifier was not able to capture subtle EMG effects. Further, the effect I was
attempting to replicate is quite small (Hajcak & Foti, 2008), and it may be too small to be reliably
identified.
In summary, my novel modification of the Eriksen flanker task has produced a very strong
behavioral effect, showing that Oddball flankers have an exaggerated effect on subjects responses. I
also demonstrated that the vMMN produced by such flankers varies across subjects, and that about
a third of such variation can be explained by individual differences in temperament. The human
brain is sensitive to stochastic as well as sequentially-structured regularities, and can use them to
facilitate ignoring of incongruent distractors.
86
Chapter 5
Conclusion
The studies presented and discussed in the preceding chapters expand our understanding of how
humans learn to represent sequences and structure, how they use those representations to support
cognition, and how they monitor whether those representations actually predict sensory input.
In Chapter 2 I used cognitive modeling to fit the error distributions that subjects generated on a
visuomotor sequence learning task. The best-fitting parameters make it clear that subjects’ improved
performance on familiar sequences is due to increases in both the precision with which items are
represented and the probability that all items have actually entered short-term memory, as seen in
Figures 2.7, 2.10, and 2.13. I also found that serial position curves flatten as sequences become
familiar, suggesting that items that are poorly-represented on the first presentation are allocated
more VSTM resources when the sequence is presented a subsequent time (Figures 2.9, 2.12, and
2.15).
In Chapter 3 I measured neural responses to new, familiar, and deviant sequence items, and found
that the neural responses to both new and deviant items differ from those to familiar items (Figures
3.4 and 3.5). Further, one P3 subcomponent is enhanced for both new and deviant items, while
another is enhanced only for deviant items (Figure 3.6), dissociating the processes of identifying
87
CHAPTER 5. CONCLUSION
and encoding task-relevant information from the processes detecting deviance.
Chapter 4 showed that unexpected events interspersed within a stream of actively-ignored stimuli
interfere with attention and action planning (Figure 4.4), and that individual differences in the neural
responses to such events are related to temperament (Figure 4.7).
5.1 Further evidence for human sensitivity to environmental
structure and predictability
Regardless of task or setting, sequences, predictions, and prediction monitoring are greatly important
to human cognition. For example, it appears that contextual cueing for items occurs even if the
items themselves are not present. When contexts are predictive of their accompanying items,
re-presentation of the context without the items strengthens subjects’ recollection of the items on a
later memory test (Smith, Hasinski, & Sederberg, 2013). This finding extends previous work on the
role of context in item processing and recognition (Biederman et al., 1982; Kersten et al., 2004;
Gronau et al., 2008; Mandler, 1980), and shows that context alone is capable of activating an object
representation to such a degree that memory is facilitated. Note that this only occurs when contexts
are reliably predictive, showing that the system has access to information about which cues should
be attended to and which should not.
Even more interesting are cases where people fail to use prediction information to facilitate
some task. For example, in a series of studies where people could choose how to allocate time
spent studying, a substantial portion of subjects made suboptimal decisions, even with information
available about their level of mastery and what would be tested (Ariel, 2013). Further, people’s
belief about the causal structure of the world influences their predictions within a series of events.
Subjects who believe they caused an outcome assume non-temporally-dependent structure, and make
88
CHAPTER 5. CONCLUSION
optimal predictions; subjects who believe outcomes are computer-generated search for patterns,
and probability-match (Green, Benson, Kersten, & Schrater, 2010). Subjects also assume positive
recency effects for short runs of events, and negative recency effects for long runs—the classic
Gambler’s Fallacy (Scheibehenne & Studer, 2013). Finally, despite Green et al. (2010)’s results
suggesting that people seek sequential patterns and structures, non-verbalizable sequences that are
periodic are quite difficult for subjects to learn (Kalish, 2013).
5.2 Future work
Several future lines of research could extend the projects presented in the preceding pages.
5.2.1 Modeling projects
Multi-parameter models for describing distributions are a rich and valuable tool. Both the VSTM
mixture models described in Chapter 2 and the exGaussian models described in Chapter 4 enrich
our understanding of exactly how task manipulations influence people’s processing and perfor-
mance. Future work with the VSTM mixture models should explicitly test the dynamic reallocation
hypotheses by fitting parameters to errors that are produced when a segment is poorly-represented
on the first presentation and to those that are produced when a segment is well-represented. The
hypothesis predicts smaller presentation-over-presentation changes on the remaining segments of
the display when one segment is poorly represented, due to memory resources being allocated to
that segment.
Another possible line of investigation is to follow up on the apparent encoding-limited nature
of this VSTM task. Repeated presentations lead to improved performance, suggesting that more
information is encoded each time. This contradicts the claims of Vogel, Woodman, and Luck (2006),
89
CHAPTER 5. CONCLUSION
who found that visual short-term memory is equally effective whether subjects are exposed to a
stimulus for 200 ms or 500 ms. They argue that this shows that VSTM cannot be encoding-limited
at moderate display lengths, which my task has. Vogel et al. (2006) did not, however, test the effect
of multiple short presentations. Future work could compare memory performance for single short
displays, single long displays, and multiple short displays, and determine whether there is a benefit
of repeated exposure.
5.2.2 Types of predictions
Chapters 3 and 4 test two very different paradigms within the space of predictive cognition. In
Chapter 3, the predictions involved are explicitly known, are derived from a fixed sequence, and
the task is to represent the sequence itself. In Chapter 4, the predictions are not explicitly known
(by most subjects), they are stochastic rather than sequential, and they are incidental to the actual
task subjects are performing. This dissertation does not speak to the great range of situations that
are in the middle. People have varying degrees of access to information about regularities in the
world; those regularities can perfectly sequential, somewhat sequential, or entirely stochastic; and
the predictability can be central or incidental to the task people are trying to perform. One can
conceptualize a three-dimensional space defined by strictness of temporal regularity on one axis,
relevance of that regularity on a second axis, and people’s access to information about the regularity
on a third. An area of rich further exploration would be to develop a task and a set of regularities
which can be used to systematically explore the space from sequential to stochastic regularities,
applying to elements which are task-relevant, irrelevant, or distractors.
Other work has been done that measures the influence of predictability at different spaces
in this domain. The serial reaction time task (SRTT; Nissen & Bullemer, 1987) measures the
accuracy and reaction time with which subjects can make keypress responses. Various stimuli on
90
CHAPTER 5. CONCLUSION
the screen cue subjects to press particular keys, and the sequence of stimuli (and thus keypresses)
repeats throughout an experimental block. Here, the prediction is sequential, not stochastic, and and
central to the task. Subjects develop varying degrees of awareness of the sequential structure; such
awareness does not affect the facilitation of performance for trials that obey the sequence, but does
predict neural responses to trials that deviate from it (Ferdinand et al., 2008; Russeler et al., 2003;
Russeler & Rosler, 2000; Schlaghecken et al., 2000). In short, these results are in the same area of
this three-dimensional conceptual space as my results from Chapter 3.
In a very different domain, Vo and Wolfe (2012) measured ERPs to stimuli that violated scene
context rules, which is in between the explicitly sequential and the completely stochastic structures
that I created for Chapters 3 and 4. Th predictability was central to the task (identifying objects
in naturalistic scenes), and subjects were generally aware when the context rules were violated.
Their ERP results are very different from mine, but due to the differences in the tasks and stimuli,
those differences are hard to interpret. One possible explanation is that predictions derived from
temporal regularities are qualitatively different from predictions derived from scene structure and
co-occurrence rules, and that a different system is involved in monitoring in this case. Further work
should more carefully follow up on this question.
5.2.3 Neuroimaging
My work has focused on the neural processes that are easily measured by scalp EEG: the anterior
cingulate and frontal areas, as measured by the P3, and sensory cortices, as measured by the MMN.
Other brain areas are of course involved in detecting and responding to deviant events. In particular,
the hippocampus has been implicated in binding events to their spatial and temporal contexts, and in
detecting associative novelty when an event occurs outside its usual context (Kumaran & Maguire,
2006, 2007; Manns, Howard, & Eichenbaum, 2007). Investigating hippocampal contributions
91
CHAPTER 5. CONCLUSION
will require the use of fMRI or MEG, as those imaging methods are more capable of measuring
the activity of subcortical structures. A question of particular interest is whether and how the
hippocampus and the anterior cingulate cortex are connected. It is not known whether hippocampal
detection of associative novelty lead to anterior cingulate activation, or vice versa.
In summary, this dissertation describes three projects that further our understanding of the pro-
cesses by which people learn sequences and regularities, use predictions based on such regularities
to support cognition, and monitor when the predictions are validated by actual events.
92
References
ReferencesAgam, Y., Bullock, D., & Sekuler, R. (2005). Imitating unfamiliar sequences of connected linear
motions. Journal of Neurophysiology, 94, 2832-2843.Agam, Y., Galperin, H., Gold, B. J., & Sekuler, R. (2007). Learning to imitate novel motion
sequences. Journal of Vision, 7(5:1), 1–17.Agam, Y., Huang, J., & Sekuler, R. (2010). Neural correlates of sequence encoding in visuomotor
learning. Journal of Neurophysiology, 103, 1418-1424.Agam, Y., & Sekuler, R. (2007). Interactions between working memory and visual perception: An
ERP/EEG study. Neuroimage, 36, 933–942.Agam, Y., & Sekuler, R. (2008). Geometric structure and chunking in reproduction of motion
sequences. Journal of Vision, 8(1:11), 1–12.Ariel, R. (2013). Learning what to learn: The effects of task experience on strategy shifts in
the allocation of study time. Journal of Experimental Psychology: Learning, Memory, andCognition.
Balota, D. A., Yap, M. J., Cortese, M. J., & Watson, J. M. (2008). Beyond mean response latency:Response time distributional analyses of semantic priming. Journal of Memory and Language,59, 495-523.
Bar, M. (2004). Visual objects in context. Nature Reviews Neuroscience, 5, 617-629.Bar, M. (2009). The proactive brain: Memory for predictions. Philosophical Transactions of the
Royal Society B: Biological Sciences, 364, 1235-1243.Barlow, H. B. (1961). Possible principles underlying the transformations of sensory messages. In
W. Rosenblith (Ed.), Sensory communication (pp. 217–234). MIT Press.Barnes, G. R. (2008). Cognitive processes involved in smooth pursuit eye movements. Brain and
Cognition, 68, 309-26.Bays, P. M., Catalao, R. F. G., & Husain, M. (2009). The precision of visual working memory is
set by allocation of a shared resource. Journal of Vision, 9(10:7), 1-11.Bays, P. M., Gorgoraptis, N., Wee, N., Marshall, L., & Husain, M. (2011). Temporal dynamics of
encoding, storage, and reallocation of visual working memory. Journal of Vision, 11(10:6),1-15.
Bays, P. M., Wu, E. Y., & Husain, M. (2011). Storage and binding of object features in visualworking memory. Neuropsychologia, 49, 1622-31.
Biederman, I., Mezzanotte, R. J., & Rabinowitz, J. C. (1982). Scene perception: Detecting andjudging objects undergoing relational violations. Cognitive Psychology, 14, 143–177.
Blumenthal, T. D., Cuthbert, B. N., Filion, D. L., Hackley, S., Lipp, O. V., & Boxtel, A. van.(2005). Committee report: Guidelines for human startle eyeblink electromyographic studies.Psychophysiology, 42, 1-15.
Botvinick, M. M., Cohen, J. D., & Carter, C. S. (2004). Conflict monitoring and anterior cingulatecortex: An update. Trends in Cognitive Sciences, 8, 539–546.
Burke, M. R., & Barnes, G. R. (2007). Sequence learning in two-dimensional smooth pursuit eye
93
References
movements in humans. Journal of Vision, 7(1:5), 1–12.Cassidy, B. S., Zebrowitz, L. A., & Gutchess, A. H. (2012). Appearance-based inferences bias
source memory. Memory & Cognition, 40, 1214-1224.Chalk, M., Seitz, A. R., & Series, P. (2010). Rapidly learned stimulus expectations alter perception
of motion. Journal of Vision, 10(8:2), 1-18.Cohn, J. V., DiZio, P., & Lackner, J. R. (2000). Reaching during virtual rotation: Context specific
compensations for expected Coriolis forces. Journal of Neurophysiology, 83, 3230-3240.Courchesne, E., Hillyard, S. A., & Galambos, R. (1975). Stimulus novelty, task relevance and the
visual evoked potential in man. Electroencephalography and Clinical Neurophysiology, 39,131–143.
Cousineau, D. (2005). Confidence intervals in within-subject designs: A simpler solution to Loftusand Masson’s method. Tutorial in Quantitative Methods for Psychology, 1, 42–45.
Cravo, A. M., Rohenkohl, G., Wyart, V., & Nobre, A. C. (2013). Temporal expectation enhancescontrast sensitivity by phase entrainment of low-frequency oscillations in visual cortex.Journal of Neuroscience, 33, 4002-4010.
Crottaz-Herbette, S., & Menon, V. (2006). Where and when the anterior cingulate cortex modulatesattentional response: combined fMRI and ERP evidence. Journal of Cognitive Neuroscience,18, 766–780.
Czigler, I. (2007). Visual mismatch negativity. Journal of Psychophysiology, 21, 224-230.de Bruijn, E. R. A., Schubotz, R. I., & Ullsperger, M. (2007). An event-related potential study on the
observation of erroneous everyday actions. Cognitive Affective & Behavioral Neuroscience,7, 278–285.
Delorme, A., & Makeig, S. (2004). EEGLAB: An open source toolbox for analysis of single-trialEEG dynamics including independent component analysis. Journal of Neuroscience Methods,134, 9-21.
Dube, C., & Sekuler, R. (2012). On the nature of prototype effects in visual working memory formotion. Poster presented at the annual meeting of the Vision Sciences Society, Naples, FL.
Dutilh, G., Ravenzwaaij, D. van, Nieuwenhuis, S., Maas, H. L. van der, Forstmann, B. U., &Wagenmakers, E.-J. . J. (2012). How to measure post-error slowing: A confound and a simplesolution. Journal of Mathematical Psychology, 56, 208-216.
Ebbinghaus, H. (1913). Memory: A contribution to experimental psychology. New York, NY:Teachers College, Columbia University.
Elsner, C., Falck-Ytter, T., & Gredeback, G. (2012). Humans anticipate the goal of other people’spoint-light actions. Frontiers in Psychology, 3, 120:1–7.
Emrich, S. M., & Ferber, S. (2012). Competition increases binding errors in visual working memory.Journal of Vision, 12(4:12), 1-16.
Eriksen, B. A., & Eriksen, C. W. (1974). Effects of noise letters upon the identification of a targetletter in a nonsearch task. Perception & Psychophysics, 16, 143-149.
Escera, C., Alho, K., Winkler, I., & Naatanen, R. (1998). Neural mechanisms of involuntaryattention to acoustic novelty and change. Journal of Cognitive Neuroscience, 10, 590–604.
94
References
Evans, D. E., & Rothbart, M. K. (2007). Developing a model for adult temperament. Journal ofResearch in Personality, 41, 868-888.
Farrell, S., Hurlstone, M. J., & Lewandowsky, S. (2013). Sequential dependencies in recall ofsequences: Filling in the blanks. Memory & Cognition.
Ferdinand, N. K., Mecklinger, A., & Kray, J. (2008). Error and deviance processing in implicit andexplicit sequence learning. Journal of Cognitive Neuroscience, 20, 629-642.
Fougnie, D., Suchow, J. W., & Alvarez, G. A. (2012). Variability in the quality of visual workingmemory. Nature Communications, 3, 1229:1–8.
Garrido, M. I., Kilner, J. M., Stephan, K. E., & Friston, K. J. (2009). The mismatch negativity: Areview of underlying mechanisms. Clinical Neurophysiology, 120, 453-463.
Geisler, W. S., Albrecht, D. G., Crane, A. M., & Stern, L. (2001). Motion direction signals in theprimary visual cortex of cat and monkey. Visual Neuroscience, 18, 501–516.
Gobel, E. W., Sanchez, D. J., & Reber, P. J. (2011). Integration of temporal and ordinal informationduring serial interception sequence learning. Journal of Experimental Psychology: Learning,Memory, and Cognition, 37, 994-1000.
Gobet, F., & Simon, H. (1998). Expert chess memory: revisiting the chunking hypothesis. Memory,6, 225–255.
Goldstein, A., Spencer, K. M., & Donchin, E. (2002). The influence of stimulus deviance andnovelty on the P300 and novelty P3. Psychophysiology, 39, 781–790.
Gorgoraptis, N., Catalao, R. F. G., Bays, P. M., & Husain, M. (2011). Dynamic updating of workingmemory resources for visual objects. Journal of Neuroscience, 31, 8502-8511.
Green, C. S., Benson, C., Kersten, D., & Schrater, P. (2010). Alterations in choice behavior bymanipulations of world model. Proceedings of the National Academy of Sciences of theUnited States of America, 107, 16401-16406.
Gronau, N., Neta, M., & Bar, M. (2008). Integrated contextual representation for objects’ identitiesand their locations. Journal of Cognitive Neuroscience, 20, 371-388.
Guttman, S. E., Gilroy, L. A., & Blake, R. (2005). Hearing what the eyes see: Auditory encodingof visual temporal sequences. Psychological Science, 16, 228-235.
Hajcak, G., & Foti, D. (2008). Errors are aversive: Defensive motivation and the error-relatednegativity. Psychological Science, 19, 103-108.
Hebb, D. O. (1961). Distinctive features of learning in the higher animal. In A. Fessard, R. Gerard,& J. Konorski (Eds.), Brain mechanisms and learning (pp. 37–46). Oxford, UK: BlackwellScientific Publications.
Hitch, G. J., Fastame, M. C., & Flude, B. (2005). How is the serial order of a verbal sequencecoded? Some comparisons between models. Memory, 13, 247-258.
Holroyd, C. B., & Coles, M. G. H. (2002). The neural basis of human error processing: Rein-forcement learning, dopamine, and the error-related negativity. Psychological Review, 109,679-709.
Holroyd, C. B., & Yeung, N. (2012). Motivation of extended behaviors by anterior cingulate cortex.Trends in Cognitive Sciences, 16, 122–128.
95
References
Howard, M. W., & Kahana, M. J. (1999). Contextual variability and serial position effects in freerecall. Journal of Experimental Psychology: Learning, Memory, and Cognition, 25, 923-941.
Hu, E. (2010). Context-specificity of internal model formation. Unpublished master’s thesis,Brandeis University.
Huang, J., & Sekuler, R. (2010a). Attention protects the fidelity of visual memory: Behavioral andelectrophysiological evidence. Journal of Neuroscience, 30, 13461-13471.
Huang, J., & Sekuler, R. (2010b). Distortions in recall from visual memory: Two classes ofattractors at work. Journal of Vision, 10(2:24), 1-27.
Jancke, D. (2000). Orientation formed by a spot’s trajectory: a two-dimensional populationapproach in primary visual cortex. Journal of Neuroscience, 20, RC86:1–6.
Johnson, R., & Donchin, E. (1982). Sequential expectancies and decision making in a changingenvironment: An electrophysiological approach. Psychophysiology, 19, 183–200.
Jones, S. R., Kerr, C. E., Wan, Q., Pritchett, D. L., Hamalainen, M., & Moore, C. I. (2010).Cued spatial attention drives functionally relevant modulation of the mu rhythm in primarysomatosensory cortex. Journal of Neuroscience, 30, 13760–13765.
Kahana, M. J., & Jacobs, J. (2000). Interresponse times in serial recall: Effects of intraserialrepetition. Journal of Experimental Psychology: Learning, Memory, and Cognition, 26,1188-1197.
Kakade, S., & Dayan, P. (2002). Dopamine: Generalization and bonuses. Neural Networks, 15,549-559.
Kalish, M. L. (2013). Learning and extrapolating a periodic function. Memory & Cognition, 41,886–896.
Karlsson, M. P., & Frank, L. M. (2008). Network dynamics underlying the formation of sparse,informative representations in the hippocampus. Journal of Neuroscience(52), 14271–14281.
Kennerley, S. W., Behrens, T. E. J., & Wallis, J. D. (2011). Double dissociation of value computa-tions in orbitofrontal and anterior cingulate neurons. Nature Neuroscience, 14, 1581–1589.
Kersten, D., Mamassian, P., & Yuille, A. (2004). Object perception as Bayesian inference. AnnualReview of Psychology, 55, 271-304.
Kimura, M., Kondo, H., Ohira, H., & Schroger, E. (2011). Unintentional temporal context-based prediction of emotional faces: An electrophysiological study. Cerebral Cortex, 22,1774–1785.
Kimura, M., Schroger, E., & Czigler, I. (2011). Visual mismatch negativity and its importance invisual cognitive sciences. NeuroReport, 22, 669–673.
Kowler, E. (1989). Cognitive expectations, not habits, control anticipatory smooth oculomotorpursuit. Vision Research, 29, 1049-1057.
Kuiper, N. H. (1960). Tests concerning random points on a circle. Proceedings of the KoninklijkeNederlandse Akademie van Wetenschappen, Series A, 63, 38–47.
Kumaran, D., & Maguire, E. A. (2006). An unexpected sequence of events: Mismatch detection inthe human hippocampus. PLoS Biology, 4, 2372-2382.
Kumaran, D., & Maguire, E. A. (2007). Match-mismatch processes underlie human hippocampal
96
References
responses to associative novelty. Journal of Neuroscience, 27, 8517–8524.Lackner, J. R., & DiZio, P. (2005). Motor control and learning in altered dynamic environments.
Current Opinion in Neurobiology, 15, 653-659.Lashley, K. S. (1951). The problem of serial order in behavior. In Hixon Symposium, 1948
(p. 112-135).Laurent, P. A. (2008). The emergence of saliency and novelty responses from reinforcement
learning principles. Neural Networks, 21, 1493–1499.Le Pelley, M. E., Vadillo, M., & Luque, D. (2013). Learned predictiveness influences rapid
attentional capture: Evidence from the dot probe task. Journal of Experimental Psychology:Learning, Memory, and Cognition, 39, 1888–1900.
Linden, D. E. J. (2005). The P300: Where in the brain is it produced and what does it tell us?Neuroscientist, 11, 563–576.
Ljungberg, J. K., & Parmentier, F. B. R. (2012). Cross-modal distraction by deviance: Functionalsimilarities between the auditory and tactile modalities. Experimental Psychology, 59, 355–363.
Loftus, G. R., & Masson, M. E. J. (1994). Using confidence intervals in within-subject designs.Trends in Cognitive Sciences, 1, 476–490.
Luck, S. J. (2005). An Introduction to the Event-Related Potential Technique. Cambridge, MA:MIT Press.
Maier, M. E., & Steinhauser, M. (2013). Updating expected action outcome in the medial frontalcortex involves an evaluation of error type. Journal of Neuroscience, 33, 15705-15709.
Mandler, G. (1980). Recognizing: The judgment of previous occurrence. Psychological review, 87,252-271.
Manns, J. R., Howard, M. W., & Eichenbaum, H. (2007, 11). Gradual changes in hippocampalactivity support remembering the order of events. Neuron, 56(3), 530-40.
Maris, E., & Oostenveld, R. (2007). Nonparametric statistical testing of EEG- and MEG-data.Journal of Neuroscience Methods, 164, 177–190.
Maryott, J., Noyce, A., & Sekuler, R. (2011). Eye movements and imitation learning: Intentionaldisruption of expectation. Journal of Vision, 11(1:7), 1-16.
Maryott, J., & Sekuler, R. (2009). Age-related changes in imitating sequences of observedmovements. Psychological Aging, 24, 476-486.
Miller, G. A. (1956). The magical number seven, plus or minus two: Some limits on our capacityfor processing information. Psychological Review, 63, 81-97.
Morey, R. D. (2008). Confidence intervals from normalized data: A correction to Cousineau (2005).Tutorial in Quantitative Methods for Psychology, 4, 61–64.
Naatanen, R., Paavilainen, P., Rinne, T., & Alho, K. (2007). The mismatch negativity (MMN)in basic research of central auditory processing: A review. Clinical Neurophysiology, 118,2544–2590.
Naatanen, R., Paavilainen, P., Titinen, H., Jiang, D., & Alho, K. (1993). Attention and mismatchnegativity. Psychophysiology, 30, 436-450.
97
References
Nassar, M. R., Rumsey, K. M., Wilson, R. C., Parikh, K., Heasly, B., & Gold, J. I. (2012). Rationalregulation of learning dynamics by pupil-linked arousal systems. Nature Neuroscience, 15,1040-1046.
Nieuwenhuis, S., Aston-Jones, G., & Cohen, J. D. (2005). Decision making, the P3, and the locuscoeruleus-norepinephrine system. Psychological Bulletin, 131, 510–532.
Nissen, M. J., & Bullemer, P. (1987). Attentional requirements of learning: Evidence fromperformance measures. Cognitive Psychology, 19, 1-32.
Nostl, A., Marsh, J. E., & Sorqvist, P. (2012). Expectations modulate the magnitude of attentionalcapture by auditory events. PLoS One, 7(11), e48569.
Noyce, A., & Sekuler, R. (in revisions). Two P3 components elicited by violations of newly-learnedpredictions.
Oostenveld, R., Fries, P., Maris, E., & Schoffelen, J.-M. M. (2011). FieldTrip: Open source softwarefor advanced analysis of MEG, EEG, and invasive electrophysiological data. ComputationalIntelligence and Neuroscience, 2011, 156869:1–9.
Ouden, H. E. M. den, Friston, K. J., Daw, N. D., McIntosh, A. R., & Stephan, K. E. (2009). A dualrole for prediction error in associative learning. Cerebral Cortex, 19(5), 1175-1185.
Parmentier, F. B. R., Elford, G., Escera, C., Andres, P., & San Miguel, I. (2008). The cognitivelocus of distraction by acoustic novelty in the cross-modal oddball task. Cognition, 106,408–432.
Parmentier, F. B. R., Elsley, J. V., Andres, P., & Barcelo, F. (2011). Why are auditory novelsdistracting? Contrasting the roles of novelty, violation of expectation and stimulus change.Cognition, 119(3), 374–380.
Parmentier, F. B. R., Ljungberg, J. K., Elsley, J. V., & Lindkvist, M. (2011). A behavioral study ofdistraction by vibrotactile novelty. Journal of Experimental Psychology: Human Perceptionand Performance, 37, 1134–1139.
Payne, L., Guillory, S., & Sekuler, R. (2013). Attention-modulated alpha-band oscillations protectagainst intrusion of irrelevant information. Journal of Cognitive Neuroscience, 25, 1463–1476.
Pazo-Alvarez, P., Cadaveira, F., & Amenedo, E. (2003). MMN in the visual modality: A review.Biological Psychology, 63, 199-236.
Pearce, J. M., & Hall, G. (1980). A model for Pavlovian learning: Variations in the effectiveness ofconditioned but not of unconditioned stimuli. Psychological Review, 87, 532-552.
Pigeon, P., Bortolami, S. B., DiZio, P., & Lackner, J. R. (2003). Coordinated turn-and-reachmovements. I. Anticipatory compensation for self-generated Coriolis and interaction torques.Journal of Neurophysiology, 89, 276-289.
Polich, J. (2007). Updating P300: an integrative theory of P3a and P3b. Clinical Neurophysiology,118, 2128-2148.
Posner, M. I. (1980). Orienting of attention. The Quarterly Journal of Experimental Psychology,32, 3-25.
Posner, M. I., Snyder, C. R., & Davidson, B. J. (1980). Attention and the detection of signals.
98
References
Journal of Experimental Psychology: General, 109, 160-174.Ricker, T. J., & Cowan, N. (2010). Loss of visual working memory within seconds: The com-
bined use of refreshable and non-refreshable features. Journal of Experimental Psychology:Learning, Memory, and Cognition, 36, 1355-1368.
Ridderinkhof, K. R., Band, G. P., & Logan, G. D. (1999). A study of adaptive behavior: Effects ofage and irrelevant information on the ability to inhibit one’s actions. Acta Psychologica, 101,315-337.
Ridderinkhof, K. R., Ramautar, J. R., & Wijnen, J. G. (2009). To P(E) or not to P(E): A P3-like ERPcomponent reflecting the processing of response errors. Psychophysiology, 46, 531–538.
Ridderinkhof, K. R., Ullsperger, M., Crone, E. A., & Nieuwenhuis, S. (2004). The role of themedial frontal cortex in cognitive control. Science, 306, 443-447.
Roggeman, C., Klingberg, T., Feenstra, H. E. M., Compte, A., & Almeida, R. (2013). Trade-off between capacity and precision in visuospatial working memory. Journal of CognitiveNeuroscience.
Romei, V., Gross, J., & Thut, G. (2010). On the role of prestimulus alpha rhythms over occipito-parietal areas in visual input regulation: correlation or causation? Journal of Neuroscience,30, 8692–8697.
Rothbart, M. K. (2007). Temperament, development, and personality. Current Directions inPsychological Science, 16, 207–212.
Rotman, G., Troje, N. F., Johansson, R. S., & Flanagan, J. R. (2006). Eye movements whenobserving predictable and unpredictable actions. Journal of Neurophysiology, 96, 1358-1369.
Russeler, J., Hennighausen, E., Munte, T. F., & Rosler, F. (2003). Differences in incidental andintentional learning of sensorimotor sequences as revealed by event-related brain potentials.Cognitive Brain Research, 15, 116–126.
Russeler, J., & Rosler, F. (2000). Implicit and explicit learning of event sequences: Evidence fordistinct coding of perceptual and motor representations. Acta Psychologica, 104, 45-67.
Scheibehenne, B., & Studer, B. (2013). A hierarchical Bayesian model of the influence of runlength on sequential predictions. Psychonomic Bulletin & Review, epub ahead of print.
Schlaghecken, F., Sturmer, B., & Eimer, M. (2000). Chunking processes in the learning of eventsequences: Electrophysiological indicators. Memory & Cognition, 28, 821-831.
Schoenbaum, G., Esber, G. R., & Iordanova, M. D. (2013). Dopamine signals mimic rewardprediction errors. Nature Neuroscience, 16, 777–779.
Schroger, E. (1996). A neural mechanism for involuntary attention shifts to changes in auditorystimulation. Journal of Cognitive Neuroscience, 8, 527–539.
Schultz, W. (2006). Behavioral theories and the neurophysiology of reward. Annual Review ofPsychology, 57, 87-115.
Schultz, W., Dayan, P., & Montague, P. R. (1997). A neural substrate of prediction and reward.Science, 275, 1593-1599.
Sekuler, R., Siddiqui, A., Goyal, N., & Rajan, R. (2003). Reproduction of seen actions: Stimulus-selective learning. Perception, 32, 839-854.
99
References
Smith, T. A., Hasinski, A. E., & Sederberg, P. B. (2013). The context repetition effect: Predictedevents are remembered better, even when they don’t happen. Journal of ExperimentalPsychology: General, 142, 1298–1308.
Spencer, K. M., Dien, J., & Donchin, E. (2001). Spatiotemporal analysis of the late ERP responsesto deviant stimuli. Psychophysiology, 38, 343–358.
Squires, K. C., Wickens, C., Squires, N. K., & Donchin, E. (1976). The effect of stimulus sequenceon the waveform of the cortical event-related potential. Science, 193, 1142-1146.
Squires, N. K., Squires, K. C., & Hillyard, S. A. (1975). Two varieties of long-latency positivewaves evoked by unpredictable auditory stimuli in man. Electroencephalography and ClinicalNeurophysiology, 38, 387-401.
Stefanics, G., Kimura, M., & Czigler, I. (2011). Visual mismatch negativity reveals automaticdetection of sequential regularity violation. Frontiers in Human Neuroscience, 5, 46:1-9.
Sternberg, D. A., & McClelland, J. L. (2012). Two mechanisms of human contingency learning.Psychological Science, 23, 59–68.
Suchow, J. W., Brady, T. F., Fougnie, D., & Alvarez, G. A. (2013). Modeling visual workingmemory with the MemToolbox. Journal of Vision, 13(10:9), 1–8.
Summerfield, C., & Egner, T. (2009). Expectation (and attention) in visual cognition. Trends inCognitive Sciences, 13, 403-409.
Summerfield, J. J., Lepsien, J., Gitelman, D. R., Mesulam, M. M., & Nobre, A. C. (2006). Orientingattention based on long-term memory experience. Neuron(6), 905–916.
Takahashi, Y. K., Roesch, M. R., Stalnaker, T. A., Haney, R. Z., Calu, D. J., Taylor, A. R., et al.(2009). The orbitofrontal cortex and ventral tegmental area are necessary for learning fromunexpected outcomes. Neuron, 62, 269-280.
Takahashi, Y. K., Roesch, M. R., Wilson, R. C., Toreson, K., O’Donnell, P., Niv, Y., et al. (2011).Expectancy-related changes in firing of dopamine neurons depend on orbitofrontal cortex.Nature Neuroscience, 14, 1590-1597.
Tales, A., Troscianko, T., Wilcock, G. K., Newton, P., & Butler, S. R. (2002). Age-related changesin the preattentional detection of visual change. NeuroReport, 13, 969-972.
van den Berg, R., Shin, H., Chou, W.-C. C., George, R., & Ma, W. J. (2012). Variability in encodingprecision accounts for visual short-term memory limitations. Proceedings of the NationalAcademy of Sciences of the United States of America, 109, 8780-8785.
Vo, M., & Wolfe, J. (2012). Semantic and syntactic inconsistencies in scenes elicit differential ERPsignatures. Presented at the annual meeting of the Vision Sciences Society, Naples, FL.
Vogel, E. K., Woodman, G. F., & Luck, S. J. (2006). The time course of consolidation invisual working memory. Journal of Experimental Psychology: Human Perception andPerformance(6), 1436–1451.
Vossel, S., Weidner, R., Thiel, C. M., & Fink, G. R. (2009). What is odd in Posner’s location-cueingparadigm? Neural responses to unexpected location and feature changes compared. Journalof Cognitive Neuroscience, 21, 30–41.
Wacongne, C., Changeux, J.-P. P., & Dehaene, S. (2012). A neuronal model of predictive coding
100
References
accounting for the mismatch negativity. Journal of Neuroscience, 32, 3665-3678.Wacongne, C., Labyt, E., Wassenhove, V. van, Bekinschtein, T., Naccache, L., & Dehaene, S. (2011).
Evidence for a hierarchy of predictions and prediction errors in human cortex. Proceedingsof the National Academy of Sciences of the United States of America, 108, 20754–20759.
White, C. N., Brown, S., & Ratcliff, R. (2012). A test of Bayesian observer models of processingin the Eriksen flanker task. Journal of Experimental Psychology: Human Perception andPerformance, 38, 489–497.
White, C. N., Ratcliff, R., & Starns, J. J. (2011). Diffusion models of the flanker task: Discreteversus gradual attentional selection. Cognitive Psychology, 63, 210–238.
Wilken, P., & Ma, W. J. (2004). A detection theory account of change detection. Journal of Vision,4, 1120-1135.
Wills, A. J., Lavric, A., Croft, G. S., & Hodgson, T. L. (2007). Predictive learning, predictionerrors, and attention: Evidence from event-related potentials and eye tracking. Journal ofCognitive Neuroscience, 19, 843-854.
Winkler, I. (2007). Interpreting the mismatch negativity. Journal of Psychophysiology, 21, 147–163.Winkler, I., Teder-Salejarvi, W. A., Horvath, J., Naatanen, R., & Sussman, E. (2003). Human
auditory cortex tracks task-irrelevant sound sources. NeuroReport, 14, 2053-2056.Yu, A. J., & Dayan, P. (2005). Uncertainty, neuromodulation, and attention. Neuron, 46, 681-92.Yu, A. J., Dayan, P., & Cohen, J. D. (2009). Dynamics of attentional selection under conflict:
Toward a rational Bayesian account. Journal of Experimental Psychology: Human Perceptionand Performance, 35, 700–717.
Zhang, W., & Luck, S. J. (2008). Discrete fixed-resolution representations in visual workingmemory. Nature, 453, 233-235.
Zhang, W., & Luck, S. J. (2009). Sudden death and gradual decay in visual working memory.Psychological Science, 20, 423–428.
Zhang, W., & Luck, S. J. (2011). The number and quality of representations in working memory.Psychological Science, 22, 1434-1441.
Zhao, J., Al-Aidroos, N., & Turk-Browne, N. B. (2013). Attention is spontaneously biased towardregularities. Psychological Science, 24, 667–677.
Zion Golumbic, E., Cogan, G. B., Schroeder, C. E., & Poeppel, D. (2013). Visual input enhancesselective speech envelope tracking in auditory cortex at a “cocktail party”. Journal ofNeuroscience, 33, 1417-1426.
Zokaei, N., Gorgoraptis, N., Bahrami, B., Bays, P. M., & Husain, M. (2011). Precision of workingmemory for visual motion sequences and transparent motion surfaces. Journal of Vision,11(14:2), 1-18.
Zuijen, T. L. van, Simoens, V. L., Paavilainen, P., Naatanen, R., & Tervaniemi, M. (2006). Implicit,intuitive, and explicit knowledge of abstract regularities in a sound sequence: an event-relatedbrain potential study. Journal of Cognitive Neuroscience, 18, 1292–1303.
101