marshall hm poster_vra2015

A Comparative Study of Cataloger- and User-assigned Subject TermsHannah Marie Marshall, Metadata Librarian for Image Collections

Cornell University Library

63%

31% 31%

25%

17% 22%

0%

5%2%

12%

47% 45%

Cataloger Control Group Variable Group

Boy Athlete

Primary Terms Secondary Terms

Tertiary Terms Non-Subject Terms

83%

14%5%

0%

2%

2%

17%

4%

5%

0%

80%88%


Chain Ornament And Coin



40% 36%27%

0% 7%17%

60%

14% 7%

0%

43% 49%


Female Figurine



50% 51%41%

0% 3%

7%

50%

24% 36%

0%

22% 16%


Articulated Madí



72%

36% 35%

14%

26% 23%

14%

17%17%

0%

21% 25%


Adoration Of The Kings



0%

19%9%

0%

1%4%

33%

40%

31%

67%

40%56%


Large Black Olla With Flat Shoulder



100%

62% 57%

0%

5% 10%

0%

11% 17%

0%

22% 16%


Neapolitan Night With Tarantella On The Sea Shore



71%56%

40%

0%

5%29%

29%

24% 14%

0%15% 17%


Portrait Of Lorenzo The Magnificent



38%46%

32%

63%

20%

28%

0%

15%18%

0%

19% 22%


The Expulsion Of Quetzalcoatl



78%

49%40%

0%

3%11%

22%

27% 30%

0%

21% 19%


Verbena De Atocha



DesignData was collected using two Qualtrics surveys – one for the control group and one

for the variable group. Each survey consisted of the same 10 images (below) drawn

from the Cornell University Library’s images for teaching collection, the Knight

Visual Resources Collection. Beneath each image were 9 blank fields into which

participants were asked to enter subject terms describing the image above. The

variable group included the addition of the following text: “When deciding on

descriptive terms for each image, keep the following three questions in mind: 1.

What is the image of? 2. What is the image about? 3. What is the image a good

example of?”

Recruitment & ParticipantsThe surveys were distributed randomly by email to all undergraduate students

enrolled in a course in the departments of Art History and Classics. Undergraduate

students in the College of Arts and Sciences are the user population about whose

search behavior nothing was known. Students in Art History and Classics were

specifically targeted to examine common assumptions about the relationship

between subject expertise, research experience and information-seeking behavior.

Presumably, this is a population that has some subject expertise but lacks the

research experience of graduate and faculty users of the collections.

January 2014 Initial proposal to the Institute for Research

Design in Librarianship (IRDL)

• Literature review

• Methodology

• Schedule

June 16th-June 26th 2014 IRDL

July – October 2014

• Refining study design and proposal; preparing to

submit study for IRB Exemption; pre-testing

October 5th 2014 IRB Exemption granted

October 24th 2014 Data collection begins

December 8th 2014 Data collection ends

December 2014 – Present Analysis

Spring 2015 – Continued data collection

• Primary terms yield the greatest search utility and higher levels of successful

image retrieval.

The findings discussed in the analysis of non-subject terms indicate that many non-subject

terms were applied based on primary level analysis, suggesting that the rate at which

participants assigned primary terms does not fully reflect the degree to which they employ

primary level subject analysis. In conjunction with the findings in Figure 2 demonstrating that

the vast majority of corresponding literal terms were primary and the established understanding

that literal correspondence is the best predictor of successful retrieval, it is clear that subject

metadata addressing the primary level of meaning in images of artworks is most likely to have

the highest search utility and levels of successful image retrieval.

• High numbers of non-subject terms among the responses indicate that subject

metadata is not the strongest access point for visual material

• Priming participants using questions based on the different types of subject

meaning did not dramatically effect the nature and content of their responses.

Secondary level subject analysis was the most influenced by the priming questions

Control

Group

Variable

GroupAll Participants

Respondents 39 41 80

Completion rate 31.7% 35.9% 33.8%

Average #of terms

assigned per image2.33 2.57 2.45

Research Question # 1What is the search utility of our images to our users?

What is the level of correspondence between between

cataloger- and user-assigned subject terms for

images?

Based on evidence that description is a strong indicator

of search behavior, this will lend insight into our users’

search behaviors and the search utility of our image

collections

This was accomplished by comparing the literal terms

assigned by each group and quantifying the frequency

with which each group assigned the same term to the

same image

The level of literal correspondence between the existing

subject terms and those assigned by the respondents

was only 8.5%

Of that 8.5%, 74% of the terms addressed the primary

level of subject analysis, 3% were secondary terms,

16% were tertiary, and 6% were non-subject terms

Research Question #3By asking users to contemplate the primary,

secondary and tertiary levels of subject meaning, can

we improve the accessibility of our image collections?

Does priming users with a set of questions about the

images change the nature and content of their

responses when asked to perform descriptive tasks?

• This drove the design of a variable group who were

asked to contemplate the following three questions as

they performed the descriptive tasks in the survey:

• What is the image of?

• What is the image about?

• What is the image a good example of?

Methodology

Literal Matches,

8.5%

Non-Matches, 91.5%

Literal correspondence between responses and existing metadata

Literal Matches Non-matches

Primary Terms, 74%

Secondary Terms, 3%

Tertiary Terms, 16%

Non-Subject Terms, 6%

Corresponding literal terms broken down by type



Conclusions

64%

34%39%

30%

12%

13%9%

15%

19%

16%18%

16%

5%

37% 34%39%

Cataloger Respondents (all) Control Group Variable Group

Comparison of all types of terms assigned by each participant group

Primary Terms Secondary Terms Tertiary Terms Non-Subject Terms

Research Question # 2What types of terms does each group assign, and in

what ratios do they assign them?

What is the level of correspondence in the type of

subject analysis being performed by each group?

When the terms assigned by the respondents differed

from the existing metadata (91.5% of the time), do they

differ because each group is performing fundamentally

different kinds of analysis or exercising dramatically

different levels of interpretation?

The subject analysis of images has traditionally been

understood as a graduated scale that identifies different

levels of interpretation and analysis. To what degree

does each group use them?

The definitions for each category were arrived at

through a review of the literature on the subject analysis

of images

Figure 5 presents the ratios of primary, secondary,

tertiary, and non-subject terms assigned by each group

Figure 8 presents the same data adjusted to exclude

non-subject terms

Figure 9 breaks the findings down by the type of image

– images of 2D works of art and images of 3D works

39%

30%

9%

15%

18%

16%

34%

39%

Control Group

Variable Group

Comparison of types of terms assigned by the control group and variable

group



TimelineResearch Question #1

In order to address Research Question #1, the literal terms provided by the

respondents were compared to the existing metadata. When a respondent assigned a

term that was a literal match to one of the existing terms, this correspondence was

noted, and the overall rates of correspondence have been compared in Figure 1. The

levels of literal correspondence were extremely low – 8.5%. The universe of

corresponding literal terms was then analyzed by type (Figure 2), revealing that a

significant majority, 75%, of corresponding literal terms are terms addressing primary

subject matter, suggesting that the search utility of primary subject terms in image

collections is extremely high.

Research Question #2

To address Research Question #2, each term in the existing metadata and the survey

responses was coded according to the different types of subject meaning defined

above (primary, secondary, tertiary, and non-subject). This allowed for comparison

between groups, as shown in Figure 5, which illustrates little difference between the

control and variable groups, but significant differences between the cataloger-

assigned terms and those assigned by the respondents. The existing metadata includes

almost no non-subject terms and roughly twice the percentage of primary terms as the

respondents’ metadata which, given the high number of primary terms among the

literal matches discussed in the findings for Research Question #1, contributes

positively to search utility.

Analysis & Findings

Primary – perception of the work’s pure form

• “What is the image of?” / “What does the image include?”

• Identifies figures and gestures

ie. “man”, “pointing”, “clasped hands”, “inscription”

Secondary – incorporates cultural and iconographic

knowledge

• “What is the image about?”

• Interprets figures and gestures

ie. “Christ”, “banishing”, “prayer”

Tertiary – demonstrates an awareness of the work as a

document of cultural activity that reflects a time and place

• “What is the image a good example of?” / “How does the

image communicate?”

• Identifies devices

ie. “symbolism, “abstraction”, “chiaroscuro”

Non-Subject terms – descriptive terms addressing aspects

of a work that are not related to its subject

ie. worktype, creator, style/period, culture,

materials/techniques, etc.

Types of Subject Terms

Research Question #3

The literal and coded responses for the control and variable groups were

compared, as seen in Figures 3 & 4. Little or no meaningful difference is

detectable between the groups, though the variable group assigned 10% more

secondary subject terms than the control group.

Non-Subject Terms

Non-subject terms were further broken down by type, as seen in Figure 7 which

reveals that, in most cases, participants assigned non-subject terms describing the

worktype of a piece and the materials and techniques used in its creation. Figure 9

reflects the finding that non-subject terms were assigned to images of 3D works at

twice the rate of 2D works, while primary terms were assigned to images of 2D

works at nearly twice the rate for images of 3D works. This suggests that

respondents were applying non-subject terms to images of 3D works in the same

way that they applied primary terms to images of 2D works, thus revealing a

failure to distinguish between the subject matter depicted in a 2D work and a 3D

work depicted in a digital image. This suggests a poor grasp of the “work: image”

relationship among the respondents. Viewed through the lens of this finding, non-

subject terms can, in many cases, be functionally viewed as primary terms,

indicating that the rates of primary level subject analysis are higher than reflected

here.

10%

7%

90%

93%

Control Group

Variable Group

Literal correspondence between control/variable groups and existing

metadata

Literal Matches Non-matches

Works and Images

The responses were broken down by image, and the literal and coded values

were compared. Comparisons of the coded values for each image appear

below. It is noteworthy that there does not seem to be a standard ratio

applied to all images by any group – including the existing cataloger-

supplied metadata. Correspondence between the control and variable groups

is consistently high, while correspondence between these two groups and

the existing metadata is generally low.

The responses were also broken into those assigned to images of 2D works

and images of 3D works to examine differences in the types of terms

assigned to each group. The findings, shown in Figure 9. Reveal a high

percentage of non-subject terms assigned to images of 3D works,

suggesting that participants especially struggle in the subject analysis of 3D

works, while subject analysis of 2D images comes more easily. The rate of

secondary terms assigned to images of 2D works was double that for 3D

works, and the rates of primary and non-subject terms for images of 2D

works are approximately inverse to those applied to images of 3D works.

This relationship suggests a poor grasp of the work/image relationship, as

discussed in the previous section.

Figure 1

Figure 2

Figures 3

Figure 5

Figure 6

0% 20% 40% 60% 80% 100%

Cataloger

Respondents (all)

Control Group

Variable Group

Percentages of subject and non-subject terms assigned by

each participant group

Subject Terms Non-Subject Terms

0% 20% 40% 60%

Condition

Culture

Materials/Techniques

Value

Worktype

Types of non-subject terms assigned by respondents

Figure 7 Figure 8

72%

54%60%

50%

9%

20%14%

25%

19%26% 26% 25%

Cataloger Respondents (all) Control Group Variable Group

Comparison of subject terms assigned by each participant group

Primary Terms Secondary Terms Tertiary Terms

71.70%

49.80%40.80% 45.30%

0

47.20%

30.20%22.60% 26.40%

15.30%

11.80%20.20%

16%

0

5%

6%10.40%

8.20%

13%

18.80% 19.20% 19%

0

32%

17.40%16.20% 16.80%

0%

19.60% 19.80% 19.70%

0

15.80%

46.40% 50.80% 48.60%

Types of terms assigned to images of 2D and 3D works by each participant group

2d 3d

Primary Terms Secondary Terms Tertiary Terms Non-Subject Terms

Figure 9

Figure 4

marshall hm poster_vra2015

Documents

descriptive terms

subject terms design

subject expertise

comparative study of

undergraduate students

user population

departments of art history

teaching collection