bib koening et al. 2005. the neural basis for novel semantic categorization

www.elsevier.com/locate/ynimg

NeuroImage 24 (2005) 369–383

The neural basis for novel semantic categorization$

Phyllis Koenig,a,* Edward E. Smith,b Guila Glosser,a Chris DeVita,a Peachie Moore,a

Corey McMillan,a Jim Gee,c and Murray Grossmana

aDepartment of Neurology, University of Pennsylvania, USAbDepartment of Psychology, University of Michigan, USAcDepartment of Radiology, University of Pennsylvania, 19104-4283, USA

Received 5 April 2004; revised 23 July 2004; accepted 30 August 2004

Available online 10 December 2004

We monitored regional cerebral activity with BOLD fMRI during

acquisition of a novel semantic category and subsequent categorization

of test stimuli by a rule-based strategy or a similarity-based strategy.

We observed different patterns of activation in direct comparisons of

rule- and similarity-based categorization. During rule-based category

acquisition, subjects recruited anterior cingulate, thalamic, and

parietal regions to support selective attention to perceptual features,

and left inferior frontal cortex to helps maintain rules in working

memory. Subsequent rule-based categorization revealed anterior

cingulate and parietal activation while judging stimuli whose con-

formity with the rules was readily apparent, and left inferior frontal

recruitment during judgments of stimuli whose conformity was less

apparent. By comparison, similarity-based category acquisition showed

recruitment of anterior prefrontal and posterior cingulate regions,

presumably to support successful retrieval of previously encountered

exemplars from long-term memory, and bilateral temporal-parietal

activation for perceptual feature integration. Subsequent similarity-

based categorization revealed temporal–parietal, posterior cingulate,

and anterior prefrontal activation. These findings suggest that large-

scale networks support relatively distinct categorization processes

during the acquisition and judgment of semantic category knowledge.

D 2004 Elsevier Inc. All rights reserved.

Keywords: BOLD fMRI; Rule-based category; Similarity-based category

Introduction

Semantic memory involves semantic content and categorization

processes that act on that content (Tulving, 1972). Categorization is

1053-8119/$ - see front matter D 2004 Elsevier Inc. All rights reserved.

doi:10.1016/j.neuroimage.2004.08.045

$ Portions of this work were presented at the annual meeting of the

Cognitive Neuroscience Society in San Francisco, April, 2002. This work

was supported in part by grants from the US Public Health Service

(AG15116, AG17586, and NS35867).

* Corresponding author. Department of Neurology, University of

Pennsylvania, 3 West Gates, 3400 Spruce Street, Philadelphia, PA 19104-

4283. Fax: +1 215 349 8464.

E-mail address: [email protected] (P. Koenig).

Available online on ScienceDirect (www.sciencedirect.com).

essential to organizing semantic content in a meaningful way

(Smith, 1995). At least two broad forms of categorization processes

have been proposed. In similarity-based categorization, category

membership is determined by overall similarity to established

exemplars or to a prototype (Medin and Schaffer, 1978; Medin

et al., 1993; Smith and Medin, 1981). This process appears to

entail exemplar or prototype retrieval, as well as organization of

perceptual features contributing to an object’s overall appearance

(Ashby et al., 1991). Rule-based categorization is employed when

specific features must be considered (Bruner et al., 1956; Smith

et al., 1998a,b). This process is resource demanding, since it

presumably involves executive processes such as selective attention

(to focus on requisite features), inhibitory control (to suppress

irrelevant features), and working memory (to retain assessments of

various features). Behavioral studies show that healthy subjects are

capable of implementing either rule-based or similarity-based

semantic categorization processes, even for the same stimulus set

(Allen and Brooks, 1991; Grossman et al., 2003b; Koenig et al.,

2002; Rips, 1989; Smith and Sloman, 1994). Qualitative distinc-

tions between these two categorization processes suggest that they

depend on partially distinct neural substrates. The present study

used BOLD fMRI in an event-related design to examine cortical

recruitment patterns in healthy young subjects during the acquis-

ition and categorization of novel, meaningful stimuli by rule-based

and by similarity-based strategies.

Behavioral studies of patients with central nervous system

disease provide some evidence for neurologically separable

categorization systems. In two recent studies, patients with

Alzheimer’s disease and patients with frontotemporal dementia

showed impaired rule-based categorization of object descriptions

(Grossman et al., 2003b) and of novel visual items (Koenig et al.,

2004, submitted for publication), although they were able to

categorize by similarity-based processing in a relatively normal

manner. This is consistent with the executive resource limitations

observed in both Alzheimer’s disease (Fisher et al., 1990; LaFleche

and Albert, 1995; Perry and Hodges, 1999) and frontotemporal

dementia (Elfgren et al., 1993; Pasquier et al., 1999).

Functional neuroimaging work also implicates somewhat dis-

tinct brain regions for the processes supporting rule-based and

P. Koenig et al. / NeuroImage 24 (2005) 369–383370

similarity-based categorization. Neuroimaging studies of clinical

tasks involving rule-based processes like Wisconsin Card Sorting

and Raven’s Progressive Matrices emphasize activation of anterior

cingulate and dorsolateral prefrontal cortices (Elliott et al., 1999;

Rao et al., 1997; Savage et al., 2001). Some investigators relate

executive processes such as selective attention and inhibitory control

(i.e., processes characteristic of categorizing by rules) to activation

of frontal regions (Petrides et al., 1993; Vendrell et al., 1995;Wagner

et al., 2001). Switching processes that govern alternating between

attributes of an object, objects in a category, and processes applied to

an object or a category have been related to dorsolateral prefrontal

cortex (D’Esposito et al., 1995; Garavan et al., 2000; Monchi et al.,

2001; Savage et al., 2001; Seger et al., 2000a,b).

By comparison, similarity-based categorization, as purportedly

used in classifying dot patterns or learning artificial grammars,

appears to implicate parietal as well as frontal-striatal brain regions

(Aizenstein et al., 2000; Poldrack and Gabrieli, 2001; Reber et al.,

1998a; Seger et al., 2000a,b). The inferior parietal component may

support feature integration necessary for global comparisons

(Wilkinson et al., 2002). Neuroimaging work also suggests the

contribution of posterior cingulate cortex, which may support

retrieval of previously encountered exemplars (Rugg et al., 1998;

Tulving et al., 1996) and anterior prefrontal cortex (Buckner et al.,

1998; Lepage et al., 2002) for the successful retrieval of

information from long-term memory.

Very few imaging studies have explicitly compared rule-based

and similarity-based processing. A recent fMRI study (Grossman

et al., 2002a,b) examined cortical activation during rule-based and

similarity-based category assignments of written descriptions of

objects that bore equal resemblance to either of two choice

categories, but violated the rule-like specifications of one. For

example, a bround object 3 in. in diameterQ equally resembled a

bpizzaQ or a bquarter,Q but, by the rule governing a quarter’s

invariable size, could only unequivocally belong to the category

bpizza.Q A direct contrast of rule-based minus similarity-based

categorization decisions showed left dorsolateral prefrontal activa-

tion, consistent with support for executive resources during rule-

based categorization.

Closer in structure to the present study is a recent PET study

examining activation during categorization of novel visual stimuli

(cartoon-like animals) by rule- and similarity-based processes

(Patalano et al., 2001). Both categorization conditions showed

activation in visual cortex, while only the rule-based categorization

showed bilateral superior parietal, right dorsolateral prefrontal, and

bilateral supplementary motor cortex activation. The authors

argued that these regions support selective attention to spatial

position, shifting attentional focus, and working memory.

The present study departs from the previous ones in several

ways. We used a visual category of novel animals, although unlike

the stimuli in the Patalano et al. study, the animals were relatively

naturalistic. We imaged category acquisition as well as category

use, whereas Patalano et al. imaged only the latter. Hence, the

present study assesses cortical activation during learning by rule-

and similarity-based processes as well as decision-making by those

processes subsequent to learning. We designed the study so that we

could directly compare the two categorization processes: Unlike

Patalano et al., we employed a single category, structured such that

subjects would generally endorse the identical items as members

across both the rule-based and similarity-based categorization

conditions. Finally, we employed empirical assessments of the

relative contribution of particular stimulus features to inter-

stimulus resemblance. This allowed us to construct a non-arbitrary

category; that is, our category members were subjectively

perceived as similar, and our rules of category membership

reflected this category coherence. Our intention was to make our

category easily learnable, so that patterns of activation would

reflect the two categorization processes rather than subjects’ efforts

to master a difficult task.

With these materials, we expected to observe distinct neural

networks for each semantic categorization process: For rule-based

categorization, we anticipated a network centering around inferior

frontal and anterior cingulate regions to sustain working memory

(e.g., for retaining the rules) and executive resources such as

selective attention (e.g., for identifying relevant features). For

similarity-based categorization, we anticipated a network including

lateral temporal-parietal cortex supporting perceptual feature inte-

gration, and posterior cingulate and anterior prefrontal activation

supporting successful retrieval of previously encountered exemplars.

Methods

Subjects

Twenty-eight healthy young volunteers participated, including

11 males and 17 females with a mean (FSD) age of 22.2 (F3.3)

years and mean education of 15.2 (F1.7) years. All were right-

handed native speakers of English. We were specifically interested

in examining cortical activation supporting rule-based and sim-

ilarity-based categorization, rather than participants’ spontaneous

choice of categorization strategy. Hence, we excluded data from

participants whose behavioral results suggested that they had not

complied with our instructions. These included participants who

seemed to focus on a single feature (i.e., they apparently applied a

simple rule) in the Similarity condition. These also included

participants who apparently did not cooperate, that is, made grossly

inaccurate or random judgments. Eleven (6 males, 5 females, mean

age of 22.0 (F2.6) years, mean education of 15.1 (F1.2) years)

participated in the Rule condition: Fourteen participants were

originally run in this condition, but three were excluded because

their behavioral responses at test were less accurate than the group

mean by at least two standard deviations. Nine (2 males, 7 females,

mean age of 22.9 (F4.7) years, mean education of 15.3 (F2.5)

years) participated in the Similarity condition: Fourteen were

originally run, but data from 1 participant were excluded because

responses at test did not differ from chance. Four additional

participants (2 males, 2 females, mean age of 21.3 (F2.6) years,

mean education of 15.3 (F1.9) years) were excluded from the

Similarity condition because they appeared to spontaneously

employ a consistent rule-like strategy at test. These four exhibited

clearly systematic patterns in their judgments, that is, focusing

exclusively on a single feature to determine category membership,

or granting membership in exact proportion to how many features

each item shared with the prototype. We do not report the results of

these participants here since the group may be too small to

demonstrate a reliable imaging profile, but the results are available

from the authors.

Stimuli

Stimuli were taken from a set of 64 realistic images of

biologically plausible novel animals. Examples can be seen in

P. Koenig et al. / NeuroImage 24 (2005) 369–383 371

the training stimuli shown in Fig. 1. The stimulus set, and the

method of obtaining empirical rankings of the features’ contribu-

tion to judged resemblance (briefly described below), are described

in detail elsewhere (Koenig et al., 2002). The set represented all

possible combinations of six dichotomous features, that is, color

(red-brown or yellow-green), legs (long or short), neck (low or

raised), snout (long or short), tail (straight or curled), and teeth

(fangs or tusks). All instances of a feature were identical, for

example, all straight-tailed animals had tails of identical dimen-

sions. All other features (e.g., texture, torso shape, facial details)

were consistent throughout the set. Prior to this study, a cohort of

healthy young subjects rated all pair-wise combinations in the set

for resemblance. We then used multidimensional scaling to obtain

a ranking of the six features’ contribution to resemblance

judgments.

We identified the four highest-ranking features from the multi-

dimensional scaling procedure, which served as bcontributingQfeatures. These were snout, legs, color, and neck type. One animal

was then chosen at random to serve as a prototype of the category.

We defined category MEMBERS as those animals that matched the

Fig. 1. Illustration of screens used during the training session. Panel A: rule-

based training, showing the four criterial features in the upper half of the

screen and two crutters in the lower half (MEMBER on left). Panel B:

Similarity-based training, showing a prototype in the upper half of the

screen and two crutters in the lower half (MEMBER on left).

prototype in at least three of the four contributing features, that is, the

same type of legs, snout, color, and/or neck. LOW DISTORTION

items matched the prototype in two of the contributing features, for

example, the same type of legs and snout but a different color and

neck type. HIGH DISTORTION items matched the prototype on no

more than one contributing feature. The two lowest-ranked features

(i.e., teeth and tail) served as bdistractorQ features that were irrelevantto category membership and were equally distributed across

MEMBERS, LOWDISTORTION items, and HIGHDISTORTION

items. Thus, our category structure was in accordance with

perceived resemblance: membership reflected the presence of salient

features shared with the prototype, rather than simply the number of

features in common.

It has been suggested that the nature of features, particularly

integral versus non-integral, differentially contribute to how

conducive a category is to similarity- or rule-based learning

(Kemler and Smith, 1979). We intended our category to be

learnable by either process. Pilot studies suggest that the individual

features are readily discernible (e.g., non-integral), hence making

the category accessible by rule-based learning. In addition, pilot

studies suggest that subjects are generally not explicitly aware of

the bases for their resemblance-based judgments, even when they

appear confident of their decision. This is characteristic of

categories with integral features; hence, the category is accessible

by similarity-based learning.

Training items were 40 unique pairs consisting of a

MEMBER and a HIGH DISTORTION item, created from

recombinations of eight items of each type. The MEMBER in

each pair had three contributing features and at least one

distractor feature in common with the prototype. Each contribu-

ting feature in common with the prototype (e.g., a long snout)

was present in six of the eight of the training set MEMBERS.

The HIGH DISTORTION items had only one contributing feature

and not more than one distractor feature in common with the

prototype. Each contributing feature differing from the prototype

(e.g., a short snout) was present in six of the eight training set

HIGH DISTORTION items. The eight MEMBERS and eight

HIGH DISTORTION items were recombined to create five

different sets of eight pairs. Within each set, pairs were randomly

ordered with the stipulation that each of the two occurrences of a

particular combination of three contributing features appeared

once among the first four pairs and once among the last four.

This distribution of feature configurations was designed to make

it highly unlikely that participants could demonstrate apparent

success during the training phases (see below) by adopting

simplified response strategies such as focusing on a single

feature, as such strategies would lead to misjudgments at frequent

intervals. For instance, a subject who consistently focused on a

particular feature, such as endorsing all yellow items, would

choose correctly no more than 75% of the time for each

consecutive set of four training trials, as one of the four training

MEMBERS would be red. The subject’s accuracy could be

further decreased by the presence of the preferred feature in one

out of every four consecutive training HIGH DISTORTION

items. Similarly, subjects who focused on a different individual

feature for each trial had maximally a 75% chance of choosing

correctly each time. In addition, subjects who employed a feature-

focus strategy would show a consistent level of accuracy

throughout training, rather than exhibiting learning over time.

Across sets, MEMBERS were always paired with different HIGH

DISTORTION items. The sets were presented continuously and


in sequence. Stimuli were video-projected onto a screen, which

subjects viewed via a system of mirrors while in the magnet bore.

Presentation was controlled by a computer using PsyScope

software (Cohen et al., 1993), which recorded responses.

We presented these MEMBER/HIGH DISTORTION pairs in

each of two training conditions, described below.

Rule training procedure

Participants were told that an animal called a bcrutterQ had to

have at least three of four particular features, and that they would

be shown pairs of animals, one of which would be a crutter and one

of which would not. Their task was to decide which animal in each

pair was a crutter, based on the bat-least-three-of-four featuresQrule. The training session in the scanner followed. The 40-item

sequence of MEMBER/HIGH DISTORTION training pairs was

presented. Each pair was arrayed horizontally in the lower half of

the projected image, and was accompanied by brief written

descriptions and outlines of the requisite features (e.g., the words

blong snoutQ with a sketch of a long snout) in the upper half. The

description of the color feature (i.e., byellow colorQ) was illustratedby a yellow rectangle rather than a sketch. The sketches were black

outlines of the contours of the features, intended to clarify the

verbal descriptions by clearly showing which feature variants they

referred to. At the same time, the sketches’ lack of visual detail

ensured that subjects could not rely on a perceptual identity match

when identifying the features in the stimulus items. The

MEMBERS and HIGH DISTORTION items were equally dis-

tributed in the left and right positions. The sentence bWhich has 3

features?Q appeared at the top of the screen. Fig. 1 panel A shows a

sample Rule training stimulus. Stimuli appeared every 12 s.

Participants’ responses were indicated by a button press on a hand-

held box, with the buttons corresponding spatially to the location

(left or right) of the chosen item. The stimulus remained until the

subject responded, after which the screen became blank; response

times never exceeded 6 s. Nine seconds after a stimulus item first

appeared, the item reappeared for 2 s with a black rectangle

surrounding the correct choice (i.e., the category member), as a

means of providing feedback. We analyzed only items to which

participants responded (five participants each missed one training

trial).

Prior to entering the scanner, participants received a practice

session at a laptop computer. Although we expected the training

tasks to be readily comprehendible, we wished to ensure that

subjects’ efforts during the first few trials would be directed

towards performing the training task rather than acclimating

themselves to the procedure. Hence, the practice stimuli were

designed to familiarize subjects with the training procedure without

exposing them to the animal stimuli. They consisted of simple

abstract geometric figures with three variable features: large or

small size, circular or triangular appendage, and striped or dotted

pattern. None of these features coincided with features of the

animal stimuli. The practice stimuli were presented in a series of

images that were configured analogously to the training stimuli;

that is, written descriptions and sketches of the three features

appeared above a pair of items, one of which contained two of the

features and one of which contained one. Participants were told

that an object called a bdortQ had to have at least two of the

features, and they were asked to indicate which item in each pair

was a dort. Feedback was analogous to the actual training trials,

that is, a black rectangle appearing around the correct choice.

Participants were exposed to three practice trials, and all of them

indicated understanding of the procedure.

Similarity training procedure

Participants were told that they would see pictures of an animal

called a bcrutterQ along with pairs of pictured animals, one of which

would also be a crutter and one of which would not. Their task was

to decide which animal in each pair was a crutter, based on which

animal was bmore similarQ to the sample crutter. Subjects were

urged to make a quick decision based on their initial impressions.

The training session in the scanner followed. A sequence of stimuli

was presented consisting of the prototype in the upper half of the

projected image and the same horizontally arrayed MEMBER/

HIGH DISTORTION pairs as in the Rule training procedure in the

lower half. The caption bcrutterQ appeared directly below the

prototype and the sentence bWhich is a crutter?Q appeared at the

top of the screen. The stimulus displays for the Similarity and Rule

training conditions thus each contained pictorial and verbal

elements (including sentences and captions), and were designed

to minimize display discrepancies between these inherently differ-

ent training processes. Fig. 1 panel B shows a sample Similarity

training stimulus. All other aspects (e.g., timing, mode of response,

feedback, and data collection) were the same as in the Rule training

session. We analyzed only trials to which participants responded

(three participants missed a total of seven trials).

Participants practiced at a laptop computer prior to this

training procedure as well. The practice items were the same as

those used prior to the Rule training session. The stimuli

consisted of a prototype geometric configuration in which the

three requisite features were represented; this appearing above the

same pairs used in the Rule practice. Participants were told that

the prototype was a bdortQ and were asked to indicate which item

in each pair was also a dort based on greater similarity to the

sample dort. All participants indicated understanding of this

training task as well.

Testing procedure

Both training conditions were followed by the identical test.

Participants saw a sequential presentation of individual animals in

a fixed semi-random order, and judged whether each was a crutter.

They were instructed to judge in accordance with how they had

been trained; that is, participants in the Rule condition were to

judge category membership by an item’s adherence to the bthree-out-of-four featuresQ rule, and participants in the Similarity

condition were to judge category membership by an item’s overall

resemblance to the training example. This sequence consisted of a

50-item subset of the complete set of 64 animals. We used a

reduced set so that the test session would not exceed an imaging

run that is a maximum of 10 min in duration. The subset contained

20 MEMBERS, 14 LOW DISTORTION items, and 16 HIGH

DISTORTION items. Items of a particular type (e.g., LOW

DISTORTION) appeared no more than three times sequentially.

Stimuli appeared every 12 s, and remained visible until the subject

responded. Participants indicated their choice by button press, with

the right and left buttons corresponding, respectively, to byesQ andbno.Q No feedback was provided. Responses and reaction times

were recorded. Analyses included only trials in which subjects

responded (only one Rule participant did not respond to one LOW

DISTORTION stimulus).


Imaging data acquisition and statistical analysis

The experiment was carried out at 4T on a GE prototype

Echospeed scanner capable of ultrafast imaging. Firm foam

padding was used to restrict head motion. Each imaging protocol

began with a 10–15 min acquisition of 5 mm thick adjacent slices

for determining regional anatomy, including sagittal localizer

images (TR = 500 ms, TE = 10 ms, 192 � 256 matrix), T2-

weighted axial images (FSE, TR = 2000 ms, TEeff = 85 ms), and

T1-weighted axial images of slices used for fMRI anatomic

localization (TR = 600 ms, TE = 14 ms, 192 � 256 matrix).

Gradient echo echoplanar images were acquired for detection of

alterations of blood oxygenation accompanying increased mental

activity. All images were acquired with fat saturation, a rectangular

FOV of 20 � 15 cm, flip angle of 908, 5 mm slice thickness, an

effective TE of 50 ms, and a 64 � 40 matrix, resulting in a voxel

size of 3.75 � 3.75 � 5 mm. The echoplanar acquisitions consisted

of 18 contiguous slices covering the entire brain every 2 s. To

minimize susceptibility artifact, a separate acquisition lasting 1–2

min was needed for phase maps to correct for distortion in

echoplanar images (Alsop, 1995). We also inspected raw data of

individual subjects to check for artifacts. Raw data were stored by

the MRI computer on DAT tape and then processed off-line.

Initial data processing was carried out with Interactive Data

Language (Research Systems) on a Sun (Santa Clara, CA) Ultra 60

workstation. Raw image data were reconstructed using a 2D FFT

with a procedure that minimized artifact due to magnetic field

inhomogeneities. Individual subject data were then prepared for

statistical parametric mapping (SPM 99) developed by the Well-

come Department of Cognitive Neurology (Frackowiak et al.,

1997). This system, operating on a MatLab platform, combines

raw images from subjects into a statistical t map. Briefly, the

images in each subject’s time series were registered to the initial

image in the series. The images were then aligned to a standard

coordinate system (Talairach and Tournoux, 1988). The data were

spatially smoothed with a 12-mm Gaussian kernel to account for

small variations in the location of activation and sulcal anatomy

across subjects. Low-pass temporal filtering was implemented to

control auto-correlation with a first-order auto-regressive method.

We used an event-related approach to data acquisition. Each

event was 12 s in length, thus capturing most of the hemodynamic

response function associated with each stimulus. Hence, the initial

images acquired for each 12-s trial were used in the analyses. The

exclusive use of the initial image per trial also insured that

activation results would not be affected by between-condition

differences in time until the button-press response. Event-related

data acquisition allowed us to group and compare images

corresponding to specific stimuli. For the training phase, we

assessed condition-specific cerebral activation. We also examined

changes in activation with increased exposure to stimuli during the

course of training by analyzing activation for the first eight and last

eight of the 40 training trials (i.e., the first and fifth quintiles).

Thus, we used a condition (rule-based, similarity-based) � learning

phase (first quintile, last quintile) factorial design. For the test

phase, we grouped MEMBERS and HIGH-DISTORTION stimuli.

This was because their membership status was relatively trans-

parent, according to similarity-based as well as by rule-based

processing. Hence, even though the two categorization conditions

elicited different decision-making processes, MEMBERS should

generally be endorsed to comparable extents across conditions, and

HIGH-DISTORTION should stimuli should generally be rejected

to comparable extents across conditions. To help ensure that

patterns of activation reflected subjects’ employing the catego-

rization processes of interest, we analyzed activation only for

accurate responses to these stimuli. As shown below, this included

the vast majority of MEMBER and HIGH-DISTORTION stimuli

in both conditions. We included all LOW-DISTORTION stimuli in

our analyses, regardless of subjects’ category judgments, since

membership or non-membership status was equally plausible by

similarity-based criteria. Moreover, as shown below, proportions of

member/nonmember judgments for LOW DISTORTION items

differed depending on the categorization condition, and we did not

want our imaging analyses to be biased by the inclusion of

different numbers of a particular type of stimulus across conditions.

Test data were analyzed with a condition (rule-based, similarity-

based) � stimulus (MEMBER/HIGH-DISTORTION, LOW-DIS-

TORTION) factorial design. The resulting statistical contrasts were

converted to z-scores for each compared voxel. We used a

statistical threshold of P b 0.001 uncorrected for multiple

comparisons. The extent criterion was set to a 20 voxel threshold.

Results

Training phase

Behavioral results

Participants were accurate at judgments MEMBERS under both

training conditions. Accuracy in the Rule training condition was

96% (F6%), which differed from chance [t(10) = 28.09; P b

0.001]; accuracy in the Similarity training condition was 85%

(F9%), likewise differing from chance [(8) = 11.08; P b 0.001].

The higher accuracy for identifying MEMBERS in the Rule

condition relative to the Similarity condition reached significance

[t(17) = 3.0, P = 0.009]. Moreover, participants took significantly

longer to respond in the Rule condition [3946 ms F 914] than in

the Similarity condition [2886 ms F 1003; t(18) = 2.5, P = 0.02].

These results are consistent with the qualitative differences

between the two categorization processes: Rule-based training,

which depends on working memory and executive processes such

as selective attention, appears to be more cognitively demanding,

and yields judgments with a higher rate of identifying MEMBERS.

We also compared judgment accuracy during the first and fifth

quintiles for each condition. For the Rule condition, accuracy was

89% during the first quintile and 100% during the fifth. First

quintile accuracy differed from 100% in a one-sample t test, t(10) =

2.83, P b 0.02. (We could not use paired t tests to directly compare

first versus fifth quintile accuracy in the Rule condition because the

100% accuracy in the fifth quintile had no variance.) For the

Similarity condition, accuracy was 81% during the first quintile

and 94% during the fifth. These differed significantly by paired t

test, t(8) = 3.62, P = 0.007. Thus, participants became more

successful at correctly identifying the MEMBER by the latter part

of the session in both training conditions.

Imaging results

We analyzed training data with a condition (Rule, Similarity) �learning phase (early, late) factorial analysis. Table 1 summarizes

the activation peaks using Talairach coordinates, the associated

Brodmann areas, and the z scores of the heights for each activation

peak during training. We found significant differences for the main

effect of condition, as illustrated in Fig. 2. Fig. 2 Panel A shows

Table 1

Distribution of activation during the training phase of category acquisition

Anatomic area Brodmann area Talairach coordinates # Voxels Z score

x y z

MAIN EFFECTS

RULE minus SIMILARITY

L inferior frontal 6/44 �56 0 36 17 3.47

L anterior cingulate 24/32 �8 8 40 69 4.11

L inferior parietal 40 �44 �48 40 26 3.43

L parietal/occipital 19/7 �8 �76 32 236a 4.23

R superior parietal 7 28 �64 52 236a 4.64

L thalamus �8 �20 4 39 3.52

SIMILARITY minus RULE

L temporal–parietal 39 �40 �72 12 25 3.59

R temporal–parietal 39 56 �60 12 36 3.32

R anterior prefrontal 8 20 28 44 12 3.30

L anterior prefrontal 10 �16 56 8 23 3.20

EARLY minus LATE

R superior parietal 7 4 �72 48 30 3.28

R superior temporal 22 60 �28 4 13 3.13

LATE minus EARLY

L temporal–parietal 40/22 �44 �20 20 241 4.09

L posterior cingulate 23/29 �8 �36 20 55 3.43

R superior temporal/insula 42 28 �20 16 91 3.31

L thalamus �20 �24 4 30 3.13

INTERACTION EFFECTS

RULE: EARLY minus LATE

R superior parietal 7 8 �72 48 40 3.65

L parietal–occipital 19 �32 �88 16 15 3.48

L thalamus �4 �8 8 85 3.27

RULE: LATE minus EARLY

L inferior frontal 6/44/45 �44 0 4 82 3.06

SIMILARITY: LATE minus EARLY

L temporal–parietal 40/22 �52 �20 24 279 3.99

R temporal–parietal 40/22 56 �16 20 29 3.40

L posterior cingulate 29/23 �8 �36 20 103 4.26

R striatum 20 0 �4 49 3.83

a These regions were part of the same cluster.


that the Rule condition, relative to the Similarity condition,

activated left inferior frontal, left anterior cingulate, left inferior

parietal, bilateral parietal/occipital, and left thalamus regions. By

comparison, Fig. 2 Panel B shows that the Similarity condition,

relative to the Rule condition, activated bilateral temporal–parietal

and bilateral anterior prefrontal regions. These findings suggest

different patterns of activation depending on the training condition.

We also found that activation for each training condition varied

depending on the early versus later phase of training. Thus, the

factorial analysis showed significant main effects for the early

versus late portions of the learning phase, that is, changes over the

course of acquisition collapsed across conditions. As summarized

in Table 1, the early portion, relative to the late portion of the

learning phase, activated right superior parietal and right superior

temporal areas. The late portion of training, relative to the early

portion, activated left temporal–parietal, left posterior cingulate,

right temporal/insula, and left caudate regions. The overall

interaction effects in the factorial analysis involved bilateral

striatum including right putamen and left caudate (x = 20, y = 4,

z = �4; z score = 4.02), and left inferior frontal cortex in a manner

that approached significance (x = �48, y = 4, z = �4; z score =

2.88). More detailed evaluation of the interaction effect is

illustrated in Fig. 2. As shown in Fig. 2 Panel C, greater activation

later in the Rule training phase, relative to earlier, was seen in left

inferior frontal cortex. For the Similarity condition, Fig. 2 Panel D

shows relatively greater activation later in training compared to the

early in bilateral temporal–parietal, left posterior cingulate, and

right striatal regions. Thus, distinct activation patterns appeared to

become consolidated over time in part in different brain regions,

depending on the specific training condition. These patterns

suggest that left inferior frontal cortex supports increased use of

executive resources such as working memory as rule-based training

progressed. In contrast, later activation of temporal–parietal,

posterior cingulate, and striatal regions during similarity-based

training suggests an increasing tendency towards feature config-

uration and integration along with successful recall of previously

seen training items.

Test phase

Behavioral results

We calculated b% yesQ scores, that is, how often participants

granted membership to each item type (i.e., MEMBER, LOW

DISTORTION, and HIGH DISTORTION items) at test. Perform-

ance during the test phase is summarized in Fig. 3. As can be

seen, patterns of responses differed across conditions and types of

Fig. 2. Distribution of activation during training phase. Panel A: Main effect: Rule-based training minus Similarity-based training. Panel B: Main effect:

Similarity-based training minus Rule-based training. Panel C: Interaction effect: Rule-based training Late minus Early. Panel D: Interaction effect: Similarity-

based training Late minus Early.


items in a manner consistent with the two qualitatively distinct

categorization processes. The Rule condition thus showed a sharp

category boundary, reflecting the requirement that only items

meeting the three-out-four feature criteria be endorsed as

members; the Similarity condition showed graded responses

across types of items with declining membership endorsement

as the number contributing of features shared with the prototype

declined. A repeated measures ANOVA, with a condition (Rule,

Similarity) � stimulus type (MEMBER, LOW DISTORTION,

HIGH DISTORTION) design, revealed a main effect of stimulus

Fig. 3. Category membership judgments during the test phase following

Rule-based training or Similarity-based training, illustrating the percent of

time that subjects accepted MEMBERS, LOW DISTORTION stimuli, and

HIGH DISTORTION stimuli as category members.

type [F(2,36) = 371.5, P b 0.001] and a significant condition x

stimulus type interaction [F(2,36) = 22.0, P b 0.001]. There was

no main effect of condition [F(1,18) = 2.26, ns]. The t tests

revealed that judgments differ significantly across conditions for

all three stimulus types: participants in the Rule condition

responded byesQ more often to MEMBERS [t(18) = 4.0, P =

0.001] and less often to LOW DISTORTION items [t(18) = 3.1,

P = 0.006] and HIGH DISTORTION items [t(18) = 2.5, P =

0.02] than participants in the Similarity condition. Linear trend

analyses confirmed the different shapes of the performance

curves: We saw blinearityQ in both the Rule condition [F(1,30) =

777.8, P b 0.01] and the Similarity condition [F(1,24) = 78.2, P b

0.01]. However, bdeviation from linearityQ was seen in the Rule

condition [F(1,30) = 157.3, P b 0.01] but not the Similarity

condition [F(1,24) = 3.2, ns]. These findings confirmed that

participants respond differently across conditions for the different

types of items, reflecting the sharp categorical effect of rule-based

categorization and the graded responses of similarity-based

categorization.

There was a trend towards longer reaction times in the Rule

condition (2453 ms) relative to the similarity condition (2144 ms)

that did not reach significance, t(18) = 0.79, ns. Reaction times did

not differ among item types in the Similarity condition [F(2,8) =

0.86, ns]. However, reaction times did differ among different item

types in the Rule condition [F(2,10) = 10.1, P = 0.001. While

reaction times for MEMBERS and HIGH DISTORTION items

were equivalent [t(10) = 1.71, ns], participants responded more

slowly to LOW DISTORTION items than to either MEMBERS

[ t(10) = 2.31, P b 0.05,] or to HIGH DISTORTION items [ t(10) =

7.09, P b 0.001.] This finding suggests that judging MEMBERS


and HIGH DISTORTION items may have been facilitated by prior

exposure to such items during training.

We compared judgment accuracy for old versus new items in

both test conditions; that is, MEMBERS that had been seen five

times during training with MEMBERS seen for the first time at

test, and similarly, HIGH DISTORTION items seen during training

with those newly seen at test. To ensure that the old and new

stimulus items were comparable, we included as new items only

those with exactly three contributing features appropriately

matching or not matching the prototype; that is, we did not include

test MEMBERS matching the prototype on all four features or test

HIGH DISTORTION items matching it on none. Accuracy for old

versus new items was equivalent for each item type in each

condition: Rule MEMBERS, t(10) = 1.00, P = 0.34; HIGH

DISTORTION, t(10) = 1.18, P = 0.86; Similarity MEMBERS,

t(8) = .67, P = 0.52; HIGH DISTORTION, t(8) = 0.33, P = 0.75.

This suggests that participants were using categorization processes

at test for all items, rather than relying on recognition of training

items.

Imaging results

We analyzed categorization data with a condition (rule-based,

similarity-based) � stimulus type (MEMBERS+HIGH DISTOR-

TION, LOW DISTORTION) factorial analysis. Table 2 summa-

rizes the activation peaks using Talairach coordinates, the

Table 2

Distribution of activation during the test phase of categorization

Anatomic area Brodman

MAIN EFFE

RULE minus SIMILARITY

L inferior frontal 6/44

L anterior cingulate 24/32

SIMILARITY minus RULE

L temporal–parietal 39/22

R posterior temporal 21

L anterior prefrontal 10

R anterior prefrontal 10

R posterior cingulate 23/29/30

MEMBER + HIGH DISTORTION minus LOW DISTORTION

No significant contrasts –

LOW DISTORTION minus MEMBER + HIGH DISTORTION

L inferior frontal 6

R inferior frontal 6

B anterior cingulate 24/32

L dorsolateral prefrontal 46

L striatum

R striatum

INTERACTION

RULE minus SIMILARITY: MEMBER + HIGH DISTORTION

L inferior parietal 40

L parietal–occipital 19

L anterior cingulate 24/32

SIMILARITY minus RULE: MEMBER + HIGH DISTORTION

L temporal–parietal 39/22

R temporal–parietal 39/22

RULE minus SIMILARITY: LOW DISTORTION

L inferior frontal 6/44

SIMILARITY minus RULE: LOW DISTORTION

L temporal–parietal 39/21/19

a These regions were part of the same cluster.

associated Brodmann areas, and the z scores of the heights for

each activation peak during the test phase. The activations

associated with the main effects are illustrated in Fig. 4.

Resembling the activation patterns seen during training, the two

conditions differed in activated areas during the test phase of the

study. Panel A reveals activation during the Rule condition,

relative to the Similarity condition, in left inferior frontal and left

anterior cingulate regions. Panel B shows activation for the

Similarity condition, relative to the Rule condition, in bilateral

temporal–parietal regions that is more prominent on the left, in

bilateral anterior prefrontal regions, and in right posterior cingulate

cortex.

We also found that these condition-specific activations differed

somewhat, depending on the types of items being judged. Thus,

the main effect for item type revealed activation for LOW

DISTORTION items, compared to MEMBERS+HIGH DISTOR-

TION items, in left dorsolateral prefrontal, bilateral anterior

cingulate, and bilateral striatal areas, as well as bilateral inferior

frontal areas that is marginal on the left. The overall interaction

effect for the factorial analysis showed left inferior frontal (x =

�40, y = 28, z = 0; z score = 2.82) and right dorsolateral

prefrontal (x = 32, y = 40, z = 20; z score = 2.90) recruitment. We

examined interaction effects more closely, as summarized in Table

2 and illustrated in Fig. 4. For MEMBERS+HIGH DISTORTION

items, Panel C shows that the Rule condition, relative to the

n area Talairach coordinates # Voxels Z score

x y z

CTS

�60 4 28 59 3.84

�4 8 44 61 3.50

�32 �64 8 261 5.08

40 �44 4 44 3.08

�24 56 20 20 3.39

16 44 �4 112 3.08

8 �48 8 24 3.33

– – – – –

�32 �12 48 25 2.90

40 �16 40 186a 3.34

0 4 48 186a 3.28

�52 24 24 54 3.20

�16 0 0 45 3.32

28 20 0 39 3.24

EFFECTS

�52 �36 40 49 3.14

�32 �76 40 23 3.40

�4 8 40 24 2.99

�32 �64 8 1411a 4.75

32 �60 8 1411a 4.46

�64 0 20 91 3.19

�44 �68 8 108 4.12

Fig. 4. Distribution of activation during the test phase. Panel A: Main effect: Rule-based categorization minus Similarity-based categorization. Panel B: Main

effect: Similarity-base categorization minus Rule-based categorization. Panel C: Interaction effect: Rule-based minus Similarity-based categorization for

MEMBER + HIGH DISTORTION items. Panel D: Interaction effect: Rule-based minus Similarity-based categorization for LOW DISTORTION items. Panel

E: Interaction effect: Similarity-based minus Rule-based categorization for MEMBER + HIGH DISTORTION items. Panel F: Interaction effect: Similarity-

based minus Rule-based categorization for LOW DISTORTION items.


Similarity condition, activated left inferior parietal, left parietal–

occipital, and left anterior cingulate regions. For LOW DISTOR-

TION items, Panel D shows activation for the Rule condition

minus the Similarity condition in left inferior frontal cortex. Panel

E shows that the Similarity condition, compared to the Rule

condition, activated bilateral temporal–parietal cortex for MEM-

BERS+HIGH DISTORTION items. Panel F shows activation in

left temporal–parietal cortex for LOW DISTORTION items in the

Similarity condition minus the Rule condition. These findings

suggest that relatively distinct patterns of activation were

associated with each categorization condition, consistent with

support for executive resources for rule-based categorization and

perceptual feature integration for similarity-based categorization.

In addition, the condition-specific patterns vary somewhat

depending on the type of stimulus item being judged, suggesting

that, for each condition, different strategic shifts are required for


judging items whose membership status is readily apparent versus

judging equivocal items.

We expected our participants to make category judgments at

test in accordance with how they had been trained. Overlap in areas

of activation within condition from training to test would reflect

this continuity. We looked for such overlap by comparing the

Talairach coordinates of peak activation between Rule minus

Similarity, and between Similarity minus Rule, for training and for

test. In both cases, two areas were observed for which all three

respective coordinates differed by less than 12 mm, our unit of

spatial smoothing, between the corresponding training and test

phases. As can be seen in Tables 1 and 2, Rule-based processing,

relative to Similarity-based, recruited left inferior frontal (BA 6/44)

and left anterior cingulate (BA 24/32) regions, suggesting that

executive processing and monitoring of competing responses were

common to both rule-based learning and subsequent test. The high

levels of accuracy in category membership judgments in this

resource-demanding task are consistent with a constant level of

response monitoring. Similarity-based processing, relative to Rule-

based, recruited left temporal–parietal (BA 39) and left anterior

prefrontal (BA 10) regions, suggesting that overall feature

configuration and recall of previously encountered exemplars were

common to both similarity-based learning and subsequent test. Our

experimental design was not conducive to subtractions between the

training and test phases. The stimulus displays differed; for

instance, subjects saw multiple items along with written material

during training, but only single items at test. The tasks also

differed; for instance, subjects chose between two repeatedly

presented items during training but evaluated unique individual

items at test. In addition, training trials were consistent, while test

trials involved evaluating different item types in a random pattern.

These confounds would make it impossible to reliably interpret the

results of such subtractions, and hence we cannot draw conclusions

concerning the non-overlapping areas of activation between

training and test.

Discussion

We taught two groups of participants a novel semantic category

using two different categorization strategies. Participants’ beha-

vioral responses were consistent with the qualitative distinctions

between rule-based and similarity-based categorization processes

that have been demonstrated elsewhere (Allen and Brooks, 1991;

Grossman et al., 2003b; Koenig et al., 2002; Rips, 1989; Smith and

Sloman, 1994): Categorization judgments following rule-based

training thus resulted in a clearly defined all-or-none category,

while categorization judgments following similarity-based training

resulted in graded membership that reflected an item’s resemblance

to the prototype. We monitored regional cerebral activity with

fMRI while participants first learned a novel semantic category

under rule-based or similarity-based conditions, and while they

subsequently categorized these items in accordance with their

training. We found distinct condition-specific activation profiles

during training, and unique patterns of activation during the

subsequent test phase that overlapped with the corresponding

training activation profiles. These findings are consistent with the

hypothesis that semantic categorization can be performed by at

least two qualitatively different processes, and that these processes

rely on distinct large-scale neural networks. We discuss each of

these processes and the associated brain activation patterns below.

Activation patterns during rule-based semantic categorization

Rule-based acquisition

There is considerable evidence that brain regions associated

with working memory and executive processes such as selective

attention are recruited during rule-based categorization. The rule

condition thus activated dorsal portions of left inferior frontal

cortex (BA 6 and 44). This region shows activation during many

tasks involving verbal working memory (Smith and Jonides,

1999; Smith et al., 1998a,b), and appears to be associated in

particular with maintaining information in an active state in

working memory (Awh et al., 1996; Smith and Jonides, 1999).

Recruitment may serve to maintain in verbal working memory

the features needed to diagnose category membership in

accordance with rules. Involvement of the prefrontal cortex

has been found in single-cell recordings of nonhuman primates

(i.e., monkeys), apparently specific to the category membership

status of items in recently learned all-or-none categories (Miller

et al., 2002). The important contribution of working memory is

emphasized by the significantly increasing left inferior frontal

activation during the course of rule-based training. This is in

keeping with the increased accuracy of judgments observed in

the latter part of the training phase, and suggests that the need

for working memory to employ the rules continued despite

subjects’ increasing familiarity with the visual appearance of the

stimuli. Brain regions associated with working memory are often

lateralized depending on the verbal or visual perceptual-spatial

nature of the material, with right-sided dominance for spatial

working memory and left-sided activation for verbal working

memory (Smith, 1997; Smith et al., 1996). An example of such

lateralization is apparent in the left-sided activation seen during

rule-based decision-making with verbally presented stimuli

(Grossman et al., 2002a,b), and in the right-sided activation

during rule-based decision-making with images (Patalano et al.,

2001). In the present study, although the novel category was

visual, left-lateralized activation may reflect the presentation of

the features relevant to the rule as verbal descriptions.

We also observed left anterior cingulate activation during rule-

based training. Some studies associate this brain region with

components of attention important for selecting and maintaining

sensitivity to specific features (Benedict et al., 1998; Corbetta

et al., 1993; Coull et al., 1996), compatible with Posner’s

executive component of attention (Posner and Petersen, 1990).

Thus, anterior cingulate may be important for selectively attending

to specific, critical features that determine membership in the new

category. Another line of work emphasizes the importance of the

anterior cingulate for governing the response to competing

candidate choices during task performance (Braver et al., 2001;

Carter et al., 1998; MacDonald et al., 2000), although competition

between response alternatives was also present during similarity-

based training. Regardless of the specific basis for anterior

cingulate recruitment, activation of this area and inferior frontal

cortex emphasizes the important role of executive processes during

rule-based categorization.

We also found left thalamic activation during rule-based

training. This region also appears to play an important role in

selective (Frith and Friston, 1996; LaBerge, 1995; Shulman et al.,

1997), a necessary component in the feature-selection process

inherent in rule-based, but not similarity-based, categorization. The

connectivity pattern of the thalamus is consistent with this structure

serving an attentional gating role, since it appears to modulate the


projection of sensory-motor information to the cerebral cortex and

cortical–cortical connectivity.

Parietal activation in BA 7 was observed during rule-based

learning, particularly during the early phase of training. Recruit-

ment of this parietal area has been associated with perception of the

spatial relationships among the feature elements of the stimuli

(Grady et al., 1994; Horwitz et al., 1992; Wilkinson et al., 2002),

and with visual selective attention (Bushnell et al., 1981). This

activation may diminish over time as subjects accumulate familiar-

ity with the spatial configuration of the stimuli, similar to

observations of diminishing cortical activation during learning in

studies of repetition priming (Buckner and Koutsaal, 1998;

Buckner et al., 1998; Raichle et al., 1994). It is noteworthy that

diminution during the course of learning was also found in the

thalamus. This finding suggests that the attention-gating role

played by the thalamus may be linked to diminishing cortical

activation during learning. The link between diminished activation

in these areas and learning over the course of training is consistent

with the increase in judgment accuracy that participants showed

during the final training trials.

Rule-based categorization

The pattern of activation in the test phase for rule-based

categorization judgments appears to represent recruitment of a

large-scale neural network that partially incorporates that which

was recruited during training. We observed recruitment of dorsal

portions of left inferior frontal cortex during the rule condition.

Inferior frontal activation suggests that participants maintained the

rules guiding categorization decisions in verbal working memory

(Awh et al., 1996; Smith and Jonides, 1999). Inferior frontal

activation was more evident during categorization of LOW

DISTORTION than MEMBER + HIGH DISTORTION items.

These items, which had two of the four requisite features, may

depend more on an exhaustive analysis of the rules maintained in

working memory because endorsing them as members requires

inspection of all four relevant features, in contrast with items which

can be accepted when three defining features have been identified.

It is also possible that assessing MEMBER and HIGH DISTOR-

TION items was facilitated because of their resemblance to items

that were repeatedly seen during training, in contrast to LOW

DISTORTION items. Hence, subjects may have had to rely more

exclusively on the application of rules in classifying the unfamiliar

LOW DISTORTION items. The greater reaction times for these

items are consistent both of these interpretations.

Anterior cingulate, also recruited during rule-based acqui-

sition, was again recruited in the left hemisphere during rule-

based categorization relative to similarity-based categorization.

Previous studies involving rule-based categorization also dem-

onstrated anterior cingulate activation (Elliott et al., 1999; Rao

et al., 1997; Savage et al., 2001). Anterior cingulate activation

was more strongly associated with rule-based categorization of

MEMBER + HIGH DISTORTION than with LOW DISTOR-

TION items. Together with the left parietal activation that was

also was seen during rule-based decisions about MEMBER +

HIGH DISTORTION items, these observations begin to empha-

size the contribution of selective attention to the diagnostic value

of specific perceptual features during rule-based processing of

objects.

Our findings thus suggest a pivotal role for executive resources

during rule-based categorization. Evidence for this first emerges

during acquisition. An anterior cingulate-thalamic-parietal network

appears to support selective attention to specific perceptual

features, and inferior frontal cortex is recruited increasingly during

rule-based learning to maintain rules in an activated state in

working memory. During subsequent categorization, anterior

cingulate and parietal cortex continue to play a particularly

important role in selectively attending to perceptual features of

stimuli, and inferior frontal cortex to maintain rules in working

memory.

Activation patterns during similarity-based semantic

categorization

Similarity-based acquisition

Similarity-based training was associated with bilateral anterior

prefrontal activation. This region is recruited in retrieval from long-

term memory (Buckner and Koutsaal, 1998; Buckner et al., 1998;

Lepage et al., 2002). We found activation in a posterior cingulate

distribution during similarity-based training as well. As with

anterior prefrontal cortex, activation of this area has been

associated with retrieval from episodic memory, particularly when

retrieval has been successful (Kapur et al., 1995; Rugg et al., 1998;

Tulving et al., 1996). It is noteworthy that posterior cingulate

recruitment increased over the course of learning, as did the

accuracy of participants’ judgments. Together with anterior

prefrontal activation, posterior cingulate recruitment during si-

milarity-based training, but not rule-based training, is consistent

with subjects’ increasingly depending on the recall of specific

exemplars during similarity-based training; that is, unlike rule-

based training in which each trial imposes the same rules on

successive training items, similarity-based training involves the

gradual formation of the category with exposure to a variety of

members.

We also observed bilateral temporal–parietal activation. Some

studies have emphasized the role of this area in the overall

configuration of features that may be important for similarity-

based judgments (Wilkinson et al., 2002). Relatedly, other work

associates activation of this region with object meaning

(Grossman et al., 2003a; Martin et al., 1995; Owen et al.,

1996a,b). Temporal–parietal cortex is a heteromodal association

region with a connectivity pattern that involves reciprocal

projections with modality-specific association cortices (Mesulam

et al., 1977; Pandya and Kuypers, 1969; Seltzer and Pandya,

1978). Activation in this brain region thus appears to be well suited

to integrating multiple features of a complex stimulus such as a

novel category for the purpose of deriving object meaning. The

increasing activation of these temporal–parietal regions over the

course of similarity-based training appears to emphasize the

importance of this integrative function in similarity-based acqui-

sition strategies.

Similarity-based categorization of a semantic category

The brain regions associated with similarity-based categoriza-

tion during the test phase overlapped the areas activated by

subjects during similarity-based training. Posterior cingulate and

anterior prefrontal activation thus were also evident during

categorization. This may continue to reflect dependence on

successful retrieval of previously encountered exemplars, that is,

the training items and the presented prototype, from memory

(Braver et al., 2001; Buckner and Koutsaal, 1998; Lepage et al.,

2002; Rugg et al., 1998; Tulving et al., 1996). The similarity

condition was also associated with bilateral temporal–parietal


recruitment. This region appears to play a continuing role in

integrating multiple features crucial for overall similarity-based

comparisons (Grossman et al., 2002a,b, 2003a).

Our observations thus suggest that similarity-based semantic

categorization activates brain regions important for feature

integration and comparisons with previously encountered exem-

plars. During both similarity-based acquisition and subsequent

categorization, this network was seen to include anterior prefrontal,

posterior cingulate, and temporal–parietal regions.

Activation patterns common to rule- and similarity-based

categorization

Several other brain regions associated with executive resources

were recruited during judgments of LOW DISTORTION items

relative to MEMBER + HIGH DISTORTION items across both the

rule-based and similarity-based conditions at test. These include

left dorsolateral prefrontal cortex and bilateral striatum. MacDo-

nald et al. (2000) have suggested that left dorsolateral prefrontal

cortex selectively attends to information relevant to task demands,

and that such activation can be increased by monitoring of

response conflict by anterior cingulate cortex. We suggested above

that, for rule-based categorization, LOW DISTORTION items may

put greater demands on maintenance of the specific rules in

working memory because all four requisite features must be

inspected. In the similarity-based condition, subjects may have

become sensitive to the conflicting status of LOW DISTORTION

test items that equally resemble MEMBER and HIGH DISTOR-

TION items, and subsequently resolved the category membership

status of the LOW DISTORTION items in a manner mediated in

part by dorsolateral prefrontal cortex. These bequivocalQ stimulus

items, then, may have placed greater demands on executive

processes in both categorization conditions, albeit for different

decision-making strategies.

Some researchers have associated striatal activation with rule-

based categorization (Ashby and Ell, 2001; Ashby et al., 1991).

For example, caudate activation was seen in a study of rule-based

card sorting (Rao et al., 1997). To the extent that judging stimuli

typical of neither members nor nonmembers is inherently resource-

demanding, LOW DISTORTION stimuli may require activation of

the striatum regardless of the rule-based or similarity-based nature

of categorization that we elicited.

Comparison with previous rule and similarity imaging studies

There are few neuroimaging studies directly comparing rule-

and similarity-based categorization. Most studies that examine one

or both of these processes differ critically from the present study,

making informative between-study comparisons difficult. For

instance, Tracy et al. (2003) compared categorization by either

family resemblance (i.e., similarity) or criterion attribute (i.e., an

exhaustive rule). They observed medial parietal, inferior frontal,

and anterior temporal activation in their rule-based task, and

extrastriate and cerebellar activation in their similarity-based task.

However, their stimuli were letter strings; the categories were

structured differently for each condition; subjects had to deduce the

category structure; and finally, the same subjects participated in

both conditions, in alternate blocks. In another example, Reber

et al. (1998b) found decreased posterior occipital activity

associated with classifying members in a resemblance-based

categorization task. However, this was in comparison with a

recognition memory task, rather than a rule-based task. In addition,

the task used semantically meaningless dot patterns which subjects

learned to categorize by an implicit, rather than an explicit,

process. Hence, studies such as these seem unlikely to draw on the

same neural mechanisms as the present study.

The fMRI study by Grossman et al. (2002a,b), described in

Introduction, compared rule-based and similarity-based categori-

zation of minimally described real world objects. Thus, that study

differs from the current study in many ways, including its use of

verbal stimuli, familiar categories (and hence no acquisition

phase), and subject-derived rather than experimenter-imposed rules

(e.g., participants must rely on their prior knowledge to realize that

bquarters must be of a certain size,Q and then reason accordingly).

These differences would seem to account for activation discre-

pancies across the two studies. For instance, left dorsolateral

prefrontal, left anterior cingulate, and left inferior parietal

activation observed in both processing conditions in the Grossman

et al. (2002a,b) study could reflect processing requirements that are

common to both conditions in that study, but not in the present one.

These could include resource-demanding reasoning, responses that

must be monitored in the similarity condition because they are

equally plausible and in the rule condition because they are

potentially error-prone, and feature configuration necessary to

make judgments about a verbally presented category. Nonetheless,

the Rule condition, relative to the Similarity condition, activated

left frontal areas, albeit more dorsal than those activated in the

present study.

The PET imaging study by Patalano et al. (2001) seems

closer in design to the present study, and also shows left inferior

frontal activation during rule-based categorization that is not

observed during similarity-based categorization. The discrepan-

cies between other observed patterns of activation across the two

studies could reflect the many methodological differences: The

current study is event-related, and therefore activation patterns

reported for one condition are relative to those in the other. The

Patalano et al. study, which was PET rather than fMRI, contrasts

activation with within-condition baseline tasks. That study found

visual association cortex activity in both conditions; in the

present study, any such activation common to both conditions

would have been subtracted out. The Patalano et al. study’s

similarity-based condition employed exemplars that were highly

similar to training items (which consisted of two categories),

with a large number of features in common, and no equivocal

items. This may account for the lack of temporal–parietal

activation in that study, which we suggest reflects the processes

of feature configuration necessary to forming the new category

in the current study. The strong item similarity, coupled with the

several day delay between training and test in the Patalano et al.

study, may account for the lack of posterior cingulate activity,

which, in the present study, we attribute to the successful recall

of previous exemplars: Memory for specific exemplars in the

Patalano et al. study may not have been preserved over the

lengthy delay, while such memories were presumably available

in the present study. The Patalano et al. study’s rule condition

consisted of items that either closely resembled or looked greatly

unlike category members; these latter items presumably required

that participants inhibit perceptual dissimilarity in order to

follow the rules. In addition, participants had 3 s, in contrast

to the current study’s 12 s, to make a decision at test. Rule-

condition participants’ relatively low accuracy rates (78%,

compared with 98% in the current study) suggest that they


were unsuccessful in applying rules at least 20% percent of the

time, and it presumably cannot be known whether they

attempted to apply the rules during those times, responded to

perceptual similarity, or simply became inattentive. The current

study includes only successful trials. This may account for the

anterior cingulate activation observed in the current study, which

we attribute to response monitoring.

Conclusion

The observations of the present study extend previous findings

(Grossman et al., 2002a,b; Patalano et al., 2001) of distinct cerebral

activation patterns for rule-based and similarity-based categoriza-

tion to the acquisition and utilization of a new semantic category.

Our training stimuli were reasonably matched across conditions for

verbal and visual content, and contained the identical training

items; our test stimulus displays were completely identical across

conditions. Hence, qualitatively different responses across con-

ditions, particularly in the test phase, argue in favor of multiple

categorization processes. Our overall observation of distinct

patterns of decision-making paired with distinct patterns of cortical

activation thus is difficult to reconcile with approaches to semantic

categorization advocating a single mechanism for all forms of

categorization, with differing responses being attributed to varying

difficulty of task demands (Nosofsky and Johansen, 2000;

Nosofsky and Palmeri, 1998).

We found neural networks unique to each categorization

process. It appears that the acquisition of a new category by a

rule-based strategy is associated with an anterior cingulate–

thalamic–parietal network important for selectively attending to

specific perceptual features, and inferior frontal cortex that

maintains rules in working memory during categorization. During

subsequent categorization of stimuli following rule-based learning,

selective attention to perceptual features, supported by anterior

cingulate and parietal cortex, apparently plays a particularly

important role in the categorization of items with readily apparent

category membership. By comparison, categorization of LOW

DISTORTION stimuli is associated more closely with left inferior

frontal activation, presumably to maintain the rule-based criteria

and evaluated features in an active state in working memory.

Similarity-based categorization seems to consistently activate

temporal–parietal regions, supporting the integration of features

necessary for making overall comparisons, and an anterior

prefrontal–posterior cingulate network to support successful

retrieval from memory of previously encountered exemplars which

determine the structure of the category.

References

Aizenstein, H.J., MacDonald, A.W., Stenger, V.A., et al., 2000. Comple-

mentary category learning systems identified using event-related

functional MRI. J. Cognit. Neurosci. 12, 977–987.

Allen, S.W., Brooks, L.R., 1991. Specializing the operation of an explicit

rule. J. Exp. Psychol. Gen. 12, 3–17.

lsop, D., 1995. Correction of ghost artifacts and distortion in

echoplanar MR with an iterative reconstruction technique. Radiology

194, 338.

Ashby, F.G., Ell, S.W., 2001. The neurobiology of human category

learning. Trends Cognit. Sci. 5, 204–210.

Ashby, F.G., Alfonso-Reese, L.A., Turken, U., Waldron, E.M., 1991. A

neuropsychological theory of multiple systems in category learning.

Psychol. Rev. 105, 442–481.

Awh, E., Jonides, J., Smith, E.E., Schumacher, E.H., Koeppe, R.A., Katz, S.,

1996. PET evidence for a dissociation between the storage and rehearsal

components of verbal working memory. Psychol. Sci. 7, 25–31.

Benedict, R.H., Lockwood, A.H., Shucard, J.L., Shucard, D.W., Wack,

D.S., Murphy, B.W., 1998. Functional neuroimaging of attention in the

auditory modality. NeuroReport 9, 121–126.

Braver, T.S., Barch, D.M., Gray, J.A., Molfese, D.L., Snyder, A., 2001.

Anterior cingulate cortex and response conflict: effects of frequency,

inhibition, and errors. Cereb. Cortex 1, 825–836.

Bruner, J., Goodnow, J., Austin, G., 1956. A Study of Thinking. Wiley,

New York.

Buckner, R.L., Koutsaal, W., 1998. Functional neuroimaging studies of

encoding, priming, and explicit memory retrieval. Proc. Natl. Acad. Sci.

U. S. A. 95, 891–898.

Buckner, R.L., Goodman, J., Burock, M.A., et al., 1998. Functional–

anatomic correlates of object priming in humans revealed by rapid

presentation event-related fMRI. Neuron 20, 285–296.

Bushnell, M.C., Goldberg, M.E., Robinson, D.L., 1981. Behavioral

evidence of visual response in monkey cerebtral cortex: I. Modulation

in posterior parietal cortex related to selective visual attention.

J. Neurophysiol. 46, 755–772.

Carter, C.S., Braver, T.S., Barch, D.M., Botvinick, M.M., Noll, D., Cohen,

J.D., 1998. Anterior cingulate cortex, error detection, and the online

monitoring of performance. Science 280, 747–749.

Cohen, J.D., MacWhinney, B., Flatt, M.R., Provost, J., 1993. PsyScope: a

new graphic interactive environment for designing psychology experi-

ments. Behav. Res. Meth. Instrum. Comput. 25, 101–113.

Corbetta, M., Miezin, F.M., Shulman, G.L., Petersen, S.E., 1993. A PET

study of visuospatial attention. J. Neurosci. 13, 1202–1226.

Coull, J.T., Frith, C.D., Frackowiak, R.S.J., Grasby, P.M., 1996. A

frontal–parietal network for rapid visual information processing: a

PET study of sustained attention and working memory. Neuro-

psychologia 34, 1085–1095.

D’Esposito, M., Detre, J.A., Alsop, D.C., Shin, R.K., Atlas, S.W.,

Grossman, M., 1995. The neural basis of the central executive system

of working memory. Nature 378, 279–281.

Elfgren, C., Passant, U., Riberg, J., 1993. Neuropsychological findings in

frontal lobe dementia. Dementia 4, 214–219.

Elliott, R., Rees, G., Dolan, R.J., 1999. Ventromedial prefrontal cortex

mediates guessing. Neuropsychologia 37, 403–411.

Fisher, L.M., Freed, D.M., Corkin, S., 1990. Stroop color-word test

performance in patients with Alzheimer’s disease. J. Clin. Exp.

Neuropsychol. 12, 745–758.

Frackowiak, R.S.J., Friston, K.J., Frith, C.D., Dolan, R.J., Mazziotta, J.C.,

1997. Human Brain Function. Academic Press, San Diego.

Frith, C.D., Friston, K.J., 1996. The role of the thalamus in btop-downQmodulation of attention to sound. NeuroImage 4, 210–215.

Garavan, H., Ross, T.J., Li, S.-J., Stein, E.A., 2000. A parametric

manipulation of central executive functioning. Cereb. Cortex 10,

585–592.

Grady, C.L., Maisog, J., Horwitz, B., et al., 1994. Age-related changes in

cortical blood flow activation during visual processing of faces and

location. J. Neurosci. 14, 1450–1462.

Grossman, M., Koenig, P., DeVita, C., et al., 2002a. The neural basis

for category-specific knowledge: an fMRI study. NeuroImage 15,

936–948.

Grossman, M., Smith, E.E., Koenig, P., et al., 2002b. The neural basis for

categorization in semantic memory. NeuroImage 15, 1549–1561.

Grossman, M., Koenig, P., Glosser, G., et al., 2003a. Neural basis for

semantic memory difficulty in Alzheimer’s disease: an fMRI study.

Brain 126, 292–311.

Grossman, M., Smith, E.E., Koenig, P., Glosser, G., Rhee, J., Dennis, K.,

2003b. Categorization of object descriptions in Alzheimer’s disease and

frontotemporal dementia. Cognit. Affect. Behav. Neurosci. 3, 120–132.

Horwitz, B., Grady, C.L., Haxby, J.V., Schapiro, M.B., Ungerleider, L.G.,


Mishkin, M., 1992. Functional associations among human posterior

extrastriate brain regions during object and spatial vision. J. Cognit.

Neurosci. 4, 311–322.

Kapur, S., Craik, F.I.M., Jones, C., Brown, G.M., 1995. Functional role of

the prefrontal cortex in retrieval of memories: a PET study. Neuro-

Report 6, 1880–1884.

Kemler, D., Smith, L., 1979. Accessing similarity and dimensional

relations: effects of integrality and separability on the discovery of

complex concepts. J. Exp. Psychol. Gen. 108, 133–150.

Koenig, P., Smith, E.E., Moore, P., Dennis, K., Grossman, M., 2002.

Category acquisition in Alzheimer’s disease, frontotemporal dementia,

and corticobasal degeneration. Neurology 58, A353.

LaBerge, D., 1995. Attentional Processing. Harvard Univ. Press, Cam-

bridge, MA.

LaFleche, D., Albert, M.S., 1995. Executive function deficits in mild

Alzheimer’s disease. Neuropsychology 9, 313–320.

Lepage, M., Ghaffar, O., Nyberg, L., Tulving, E., 2002. Prefrontal cortex

and episodic retrieval mode. Proc. Natl. Acad. Sci. U. S. A. 97,

506–511.

MacDonald, A.W., Cohen, J.D., Stenger, V.A., Carter, C.S., 2000.

Dissociating the role of the dorsolateral prefrontal and anterior cingulate

cortex in cognitive control. Science 288, 1835–1838.

Martin, A., Haxby, J.V., Lalonde, F.M., Wiggs, C.L., Ungerleider, L.G.,

1995. Discrete cortical regions associated with knowledge of color and

knowledge of action. Science 270, 102–105.

Medin, D.L., Schaffer, M.M., 1978. A context theory of classification

learning. Psychol. Rev. 85, 207–238.

Medin, D.L., Goldstone, R.L., Gentner, D., 1993. Respects for similarity.

Psychol. Rev. 100, 254–278.

Mesulam, M.-M., van Hoesen, G.W., Pandya, D.N., Geschwind, N., 1977.

Limbic and sensory connections of the inferior parietal lobule (Area

PG) in the rhesus monkey: a study with a new method of horseradish

peroxidase histochemistry. Brain Res. 136, 393–414.

Miller, E.K., Freedman, D.J., Wallis, J.D., 2002. The prefrontal cortex:

categories, concepts, and cognition. Philos. Trans. R. Soc. Lond., Ser. B

357, 1123–1136.

Monchi, O., Petrides, M., Petre, V., Worsley, K.J., Dagher, A., 2001.

Wisconsin card sorting revisited: distinct neural circuits participating in

different stages of the task identified by event-related functional

magnetic resonance imaging. J. Neurosci. 21, 7733–7741.

Nosofsky, R.M., Palmeri, T.J., 1998. A rule-plus-exception model for

classifying objects in continuous-dimension spaces. Psychon. Bull. Rev.

5, 345–369.

Nosofsky, R.M., Johansen, M.K., 2000. Exemplar-based accounts of

bmultiple-systemQ phenomena in perceptual categorization. Psychon.

Bull. Rev. 7, 375–402.

Owen, A.M., Doyon, J., Petrides, M., Evans, A., 1996a. Planning and

spatial working memory: a positron emission tomography study in

humans. Eur. J. Neurosci. 8, 353–364.

Owen, A.M., Milner, B., Petrides, M., Evans, A.C., 1996b. Memory for

object features vs. memory for object location: a positron-emission

tomography study of encoding and retrieval processes. Proc. Natl.

Acad. Sci. U. S. A. 93, 9212–9217.

Pandya, D.N., Kuypers, H.G.J.M., 1969. Cortico-cortical connections in the

rhesus monkey. Brain Res. 13, 13–36.

Pasquier, F., Lebert, F., Lavenu, I., Guillaume, B., 1999. The clinical picture

of frontotemporal dementia: diagnosis and follow-up. Dement. Geriatr.

Cogn. Disord. 10, 10–14.

Patalano, A., Smith, E.E., Jonides, J., Koeppe, R., 2001. PET evidence for

multiple strategies of categorization. Cogn. Affect. Behav. Neurosci. 1,

360–370.

Perry, R.J., Hodges, J.R., 1999. Attention and executive deficits in

Alzheimer’s disease: a critical review. Brain 122, 383–404.

Petrides, M., Alivisatos, B., Evans, A.C., Meyer, E., 1993. Dissoci-

ation of human mid-dorsolateral from posterior dorsolateral frontal

cortex in memory processing. Proc. Natl. Acad. Sci. U. S. A. 90,

873–877.

Poldrack, R.A., Gabrieli, J.D.E., 2001. Characterizing the neural mecha-

nisms of skill learning and repetition priming: Evidence from mirror

reading. Brain 124, 67–82.

Posner, M., Petersen, S., 1990. The attention system of the human brain.

Annu. Rev. Neurosci. 13, 25–42.

Raichle, M.E., Fiez, J.A., Videen, T.O., et al., 1994. Practice-related

changes in human brain functional anatomy during nonmotor learning.

Cereb. Cortex 4, 8–26.

Rao, S.M., Bobholz, J.A., Hammeke, T.A., et al., 1997. Functional MRI

evidence for subcortical participation in conceptual reasoning skills.

NeuroReport 8, 1987–1993.

Reber, P.J., Stark, C.E.L., Squire, L.R., 1998a. Cortical areas supporting

category learning identified using functional MRI. Proc. Natl. Acad.

Sci. U. S. A. 95, 747–750.

Reber, P.J., Stark, C.E.L., Squire, L.R., 1998b. Contrasting cortical activity

associated with category memory and recognition memory. Learn.

Mem. 5, 420–428.

Rips, L.J., 1989. Similarity, typicality, and categorization. In: Vosniadou, S.,

Ortony, A. (Eds.), Similarity and Analogical Reasoning. Cambridge

Univ. Press, Cambridge, pp. 21–59.

Rugg, M.D., Fletcher, P.C., Allan, K., Frith, C.D., Frackowiak, R.S.J.,

Dolan, R.J., 1998. Neural correlates of memory retrieval during

recognition memory and cued recall. NeuroImage 8, 262–273.

Savage, C.R., Deckersbach, T., Heckers, S., et al., 2001. Prefrontal regions

supporting spontaneous and directed application of verbal learning

strategies: evidence from PET. Brain 124, 219–231.

Seger, C.A., Poldrack, R.A., Prabhakaran, V., Zhao, M., Glover, G.H.,

Gabrieli, J.D.E., 2000a. Hemispheric asymmetries and individual

differences in visual concept learning as measured by functional MRI.

Neuropsychologia 38, 1316–1324.

Seger, C.A., Prabhakaran, V., Poldrack, R.A., Gabrieli, J.D.E., 2000b.

Neural activity differs between explicit and implicit learning of artificial

grammar strings: an fMRI study. Psychobiology 2, 283–292.

Seltzer, B., Pandya, D.N., 1978. Afferent cortical connections and

architectonics of the superior temporal sulcus and surrounding cortex

in the rhesus monkey. Brain Res. 149, 1–24.

Shulman, G.L., Corbetta, M., Buckner, R.L., et al., 1997. Common blood

flow changes across visual tasks: I. Increases in subcortical structures

and cerebellum but not in nonvisual cortex. J. Cognit. Neurosci. 9,

624–647.

Smith, E.E., 1995. Concepts and categorization. In: Osherson, D., Smith,

E.E. (Eds.), Invitation to Cognitive Science, Thinking, vol. 2. MIT

Press, Cambridge, pp. 1–25.

Smith, E.E., 1997. Working Memory: A View from Neuroimaging. Cogn.

Psychol. 3, 5–42.

Smith, E.E., Jonides, J., 1999. Storage and executive processes in the

frontal lobes. Science 283, 1657–1661.

Smith, E.E., Medin, D.L., 1981. Categories and Concepts. Harvard Univ.

Press, Cambridge.

Smith, E.E., Sloman, S.A., 1994. Similarity-versus rule-based categoriza-

tion. Mem. Cogn. 2, 377–386.

Smith, E.E., Jonides, J., Koeppe, R., 1996. Dissociating verbal and spatial

working memory using PET. Cereb. Cortex 6, 11–20.

Smith, E.E., Jonides, J., Marshuetz, C., Koeppe, R.A., 1998a. Components

of verbal working memory: evidence from neuroimaging. Proc. Natl.

Acad. Sci. U. S. A. 95, 876–882.

Smith, E.E., Patalano, A.L., Jonides, J., 1998b. Alternative strategies for

categorization. Cognition 65, 167–196.

Talairach, J., Tournoux, P., 1988. Co-Planar Stereotaxic Atlas of the Human

Brain. Thieme Medical Publishing Company, New York.

Tracy, J.I., Mohamed, F., Faro, S., Pinus, A., Tiver, R., Harvan, J., Bloomer,

C., Pyrros, A., 2003. Differential brain responses when applying

criterion attribute versus family resemblance rule learning. Brain Cogn.

51, 276–286.

Tulving, E., 1972. Episodic and semantic memory. In: Tulving, E.,

Donaldson, W. (Eds.), Organization of Memory. Academic Press,

New York.


Tulving, E., Markowitsch, H.J., Craik, F.I.M, Habib, R., Houle, S., 1996.

Novelty and familiarity activations in PET studies of memory encoding

and retrieval. Cereb. Cortex 6, 71–79.

Vendrell, P., Junque, C., Pujol, J., Jurado, M.A., Molet, J., Grafman, J.,

1995. The role of the prefrontal regions in the Stroop task. Neuro-

psychologia 33, 341–352.

Wagner, A.D., Maril, A., Bjork, R.A., Schacter, D.L., 2001. Prefrontal

contributions to executive control: fMRI evidence for functional dis-

tinctions within lateral prefrontal cortex. NeuroImage 14, 1337–1347.

Wilkinson, D.T., Halligan, P.W., Henson, R.N.A, Dolan, R.J., 2002. The

effects of interdistracter similarity on search processes in the superior

parietal cortex. NeuroImage 1, 611–619.

bib koening et al. 2005. the neural basis for novel semantic categorization

Documents