data-driven computational pragmatics 1.pdfschedule 1:10 –1:20 introductions 1:20 –1:30 structure...

166
Data-Driven Computational Pragmatics Shlomo Argamon and Jonathan Dunn

Upload: others

Post on 12-Aug-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Data-Driven Computational Pragmatics

Shlomo Argamon and Jonathan Dunn

Page 2: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Schedule

1:10 – 1:20 Introductions

1:20 – 1:30 Structure of Class

1:30 – 2:10 Intro to Data-Driven Computational Pragmatics

2:10 – 2:50 Co-Reference Resolution

2:50 – 3:00 Conclusion / Flex Time

Page 3: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Schedule

1:10 – 1:20 Introductions

1:20 – 1:30 Structure of Class

1:30 – 2:10 Intro to Data-Driven Computational Pragmatics

2:10 – 2:50 Co-Reference Resolution

2:50 – 3:00 Conclusion / Flex Time

Page 4: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Structure of Class

• Class 1:

• Overview of Data-Driven Computational Pragmatics

• Co-Reference Resolution

Page 5: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Structure of Class

• Class 1:

• Overview of Data-Driven Computational Pragmatics

• Co-Reference Resolution

• Class 2:

• Hypothesis Testing with Data-Driven Models

• Inferences and Reasoning

Page 6: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Structure of Class

• Class 3:

• Metaphor and Figurative Language

Page 7: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Structure of Class

• Class 3:

• Metaphor and Figurative Language

• Class 4:

• Social Meaning from Stylistics and Linguistic Variations

Page 8: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Structure of Class

• Readings and Notes Available Online: www.jdunn.name

Page 9: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Structure of Class

• Readings and Notes Available Online: www.jdunn.name

• Assignments:

• Reading Questions (60%)

Page 10: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Structure of Class

• Readings and Notes Available Online: www.jdunn.name

• Assignments:

• Reading Questions (60%)

• Short Paper (500 - 1,000 words; 40%)

• “Consider how you could use these sorts of computational models to address or provide evidence for a hypothesis from linguistic theory.”

• Due M, 7/20

Page 11: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Schedule

1:10 – 1:20 Introductions

1:20 – 1:30 Structure of Class

1:30 – 2:10 Intro to Data-Driven Computational Pragmatics

2:10 – 2:50 Co-Reference Resolution

2:50 – 3:00 Conclusion / Flex Time

Page 12: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview

• Pragmatics

• Meaning-in-Language: Cognitive, Contextual, Social• (c.f. Linguistic vs. Non-Linguistic Meaning)

Page 13: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview

• Pragmatics

• Meaning-in-Language: Cognitive, Contextual, Social• (c.f. Linguistic vs. Non-Linguistic Meaning)

• Subjective vs. Objective Phenomena• Epistemic Objectivity

• Ontological Objectivity

Page 14: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview

• Pragmatics

• Meaning-in-Language: Cognitive, Contextual, Social• (c.f. Linguistic vs. Non-Linguistic Meaning)

• Subjective vs. Objective Phenomena• Epistemic Objectivity

• Ontological Objectivity

• Meaning-in-Language: Epistemically Objective• Can be modeled

• Can make testable predictions

Page 15: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview (2)

• Computational Pragmatics

• Models do not have direct access to introspection / intuition

Page 16: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview (2)

• Computational Pragmatics

• Models do not have direct access to introspection / intuition

• Models entirely syntactic in nature (e.g., symbol manipulation)

Page 17: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview (2)

• Computational Pragmatics

• Models do not have direct access to introspection / intuition

• Models entirely syntactic in nature (e.g., symbol manipulation)

• Data-Driven

• Learn models directly from linguistic data

Page 18: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview (2)

• Computational Pragmatics

• Models do not have direct access to introspection / intuition

• Models entirely syntactic in nature (e.g., symbol manipulation)

• Data-Driven

• Learn models directly from linguistic data

• Limited rule-based heuristics

Page 19: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview (2)

• Computational Pragmatics

• Models do not have direct access to introspection / intuition

• Models entirely syntactic in nature (e.g., symbol manipulation)

• Data-Driven

• Learn models directly from linguistic data

• Limited rule-based heuristics

• Corpus-based view of language: Language as a produced, observable phenomenon

Page 20: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview (3)

Computational Pragmatics: A Spectrum

Computers Doing Pragmatics(e.g., Siri in the future)

AI as an Engineering Task

Page 21: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview (3)

Computational Pragmatics: A Spectrum

Computers Doing Pragmatics(e.g., Siri in the future)

AI as an Engineering Task

Computational Modelling for Pragmatics(e.g., computer-assisted corpus linguistics)

AI as Cognitive Science

Page 22: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Sources of Evidence

Introspection

Page 23: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Sources of Evidence

Introspection

Hand-crafted Datasets

Page 24: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Sources of Evidence

Introspection

Hand-crafted Datasets

Huge “Natural”Datasets

Page 25: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Sources of Evidence

Introspection

Hand-crafted Datasets

Huge “Natural”Datasets

But also depend on introspection:Gold-Standard Annotations

Page 26: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview (5)

Computational pragmatics is tricky

SYNTAX

Page 27: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview (5)

Computational pragmatics is tricky

SYNTAX

SEMANTICS

Page 28: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview (5)

Computational pragmatics is tricky

SYNTAX

SEMANTICS

PRAGMATICS

Page 29: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview (5)

Data-Driven Computational Pragmatics

Page 30: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview (5)

Data-Driven Computational Pragmatics

Data

(Corpora)

Page 31: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview (5)

Data-Driven Computational Pragmatics

Data

(Corpora)

Features

(Representations)

Page 32: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview (5)

Data-Driven Computational Pragmatics

Data

(Corpora)

Features

(Representations)Algorithms

(Model Building)

Page 33: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview (7)

Data-Driven =?= Model-Independent

Page 34: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview (10)

• Sample features for the co-reference task, for a given pair of candidate entities:

Page 35: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview (10)

• Sample features for the co-reference task, for a given pair of candidate entities:

• Binary Feature: Do these co-reference candidates share number information?

Page 36: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview (10)

• Sample features for the co-reference task, for a given pair of candidate entities:

• Binary Feature: Do these co-reference candidates share number information?

• Categorical Feature: What semantic role does entity A have? Entity B?

Page 37: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview (10)

• Sample features for the co-reference task, for a given pair of candidate entities:

• Binary Feature: Do these co-reference candidates share number information?

• Categorical Feature: What semantic role does entity A have? Entity B?

• Frequency Feature: Frequency of represented verb-argument pair in reference corpus(Could model verb selection preferences: does the verb accept the candidate?)

Bob bought a new car. Then he drove it for hours.

Page 38: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview (10)

• Sample features for the co-reference task, for a given pair of candidate entities:

• Binary Feature: Do these co-reference candidates share number information?

• Categorical Feature: What semantic role does entity A have? Entity B?

• Frequency Feature: Frequency of represented verb-argument pair in reference corpus(Could model verb selection preferences: does the verb accept the candidate?)

• Ratio Feature: Minimum Edit Distance for Candidates / Total String Length

Page 39: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Overview (10)

• Sample features for the co-reference task, for a given pair of candidate entities:

• Binary Feature: Do these co-reference candidates share number information?

• Categorical Feature: What semantic role does entity A have? Entity B?

• Frequency Feature: Frequency of represented verb-argument pair in reference corpus(Could model verb selection preferences: does the verb accept the candidate?)

• Ratio Feature: Minimum Edit Distance for Candidates / Total String Length

• Measurement Feature: Difference between abstractness ratings for candidate and its verb(Could model verb selection preferences)

Page 40: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Schedule

1:10 – 1:20 Introductions

1:20 – 1:30 Structure of Class

1:30 – 2:10 Intro to Data-Driven Computational Pragmatics

2:10 – 2:50 Co-Reference Resolution

2:50 – 3:00 Conclusion / Flex Time

Page 41: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (3)

• Co-Reference Resolution:

• What words refer to the same entity?

Page 42: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (3)

• Co-Reference Resolution:

• What words refer to the same entity?

North Korea opened its doors to the U.S. today, welcoming

Secretary of State Madeline Albright. She says her visit is a

good start. The U.S. remains concerned about North Korea’s

missile development program and its exports of missiles to

Iran.

Fernandes, et al. (2014). “Latent Trees for Coreference Resolution.” Computational Linguistics, 40(4)

Page 43: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (3)

• Co-Reference Resolution:

• What words refer to the same entity?

North Korea opened its doors to the U.S. today, welcoming

Secretary of State Madeline Albright. She says her visit is a

good start. The U.S. remains concerned about North Korea’s

missile development program and its exports of missiles to

Iran.

Fernandes, et al. (2014). “Latent Trees for Coreference Resolution.” Computational Linguistics, 40(4)

Page 44: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (3)

• Co-Reference Resolution:

• What words refer to the same entity?

North Korea opened its doors to the U.S. today, welcoming

Secretary of State Madeline Albright. She says her visit is a

good start. The U.S. remains concerned about North Korea’s

missile development program and its exports of missiles to

Iran.

Fernandes, et al. (2014). “Latent Trees for Coreference Resolution.” Computational Linguistics, 40(4)

Page 45: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (3)

• Co-Reference Resolution:

• What words refer to the same entity?

North Korea opened its doors to the U.S. today, welcoming

Secretary of State Madeline Albright. She says her visit is a

good start. The U.S. remains concerned about North Korea’s

missile development program and its exports of missiles to

Iran.

Fernandes, et al. (2014). “Latent Trees for Coreference Resolution.” Computational Linguistics, 40(4)

Page 46: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (3)

• Co-Reference Resolution:

• What words refer to the same entity?

North Korea opened its doors to the U.S. today, welcoming

Secretary of State Madeline Albright. She says her visit is a

good start. The U.S. remains concerned about North Korea’s

missile development program and its exports of missiles to

Iran.

Fernandes, et al. (2014). “Latent Trees for Coreference Resolution.” Computational Linguistics, 40(4)

Page 47: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (2)

• Many different kinds of linguistic phenomena:

• Proper names (“George”)

Page 48: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (2)

• Many different kinds of linguistic phenomena:

• Proper names (“George”)

• Aliases (“LSI”)

Page 49: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (2)

• Many different kinds of linguistic phenomena:

• Proper names (“George”)

• Aliases (“LSI”)

• Definite NPs (“the Linguistic Summer Institute”)

Page 50: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (2)

• Many different kinds of linguistic phenomena:

• Proper names (“George”)

• Aliases (“LSI”)

• Definite NPs (“the Linguistic Summer Institute”)

• Pronouns (“it”, “they”)

Page 51: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (2)

• Many different kinds of linguistic phenomena:

• Proper names (“George”)

• Aliases (“LSI”)

• Definite NPs (“the Linguistic Summer Institute”)

• Pronouns (“it”, “they”)

• Appositives (“the first institute to be...”)

Page 52: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (2)

• Many different kinds of linguistic phenomena:

• Proper names (“George”)

• Aliases (“LSI”)

• Definite NPs (“the Linguistic Summer Institute”)

• Pronouns (“it”, “they”)

• Appositives (“the first institute to be...”)

• Bridging References (“the cabinet was wood, but the top granite”)

Page 53: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution Data

• Annotated corpora

Page 54: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution Data

• Annotated corpora

• Each mention annotated with an ID of the unique entity it refers to

Page 55: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution Data

• Annotated corpora

• Each mention annotated with an ID of the unique entity it refers to

• Can extract pairwise relations between mentions

Page 56: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution Data

• Annotated corpora

• Each mention annotated with an ID of the unique entity it refers to

• Can extract pairwise relations between mentions

• Genre of the text kinds of co-reference phenomena

Page 57: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution Features

• Agreement: number, person, case, etc.

Page 58: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution Features

• Agreement: number, person, case, etc.

• Syntactic restrictions

Page 59: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution Features

• Agreement: number, person, case, etc.

• Syntactic restrictions

• Semantic selectional preferences

Page 60: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution Features

• Agreement: number, person, case, etc.

• Syntactic restrictions

• Semantic selectional preferences

• Syntactic/semantic role preferences

Page 61: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution Features

• Agreement: number, person, case, etc.

• Syntactic restrictions

• Semantic selectional preferences

• Syntactic/semantic role preferences

• Saliency: recency, repetition

Page 62: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution Features

• Agreement: number, person, case, etc.

• Syntactic restrictions

• Semantic selectional preferences

• Syntactic/semantic role preferences

• Saliency: recency, repetition

• Causal coherence

Page 63: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution Algorithms

• Three basic dichotomies:

• Relationship between linguistic units or between entities

Page 64: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution Algorithms

• Three basic dichotomies:

• Relationship between linguistic units or between entities

• Pairwise relationships or larger clusters

Page 65: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution Algorithms

• Three basic dichotomies:

• Relationship between linguistic units or between entities

• Pairwise relationships or larger clusters

• Whole text at once or processing sequentially item by item

Page 66: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (5)

• Co-Reference Resolution: The Mention-Pair Model

North Korea opened its doors to the U.S. today,

welcoming Secretary of State Madeline Albright.

She says her visit is a good start. The U.S. remains

concerned about North Korea’s missile

development program and its exports of missiles

to Iran.

1. Find all entities (assumed)

Page 67: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (5)

• Co-Reference Resolution: The Mention-Pair Model

North Korea opened its doors to the U.S. today,

welcoming Secretary of State Madeline Albright.

She says her visit is a good start. The U.S. remains

concerned about North Korea’s missile

development program and its exports of missiles

to Iran.

1. Find all entities (assumed)

2. Classify each possible pair (within defined window)

Page 68: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (5)

• Co-Reference Resolution: The Mention-Pair Model

North Korea opened its doors to the U.S. today,

welcoming Secretary of State Madeline Albright.

She says her visit is a good start. The U.S. remains

concerned about North Korea’s missile

development program and its exports of missiles

to Iran.

1. Find all entities (assumed)

2. Classify each possible pair (within defined window)

3. Cluster identified co-referencing pairs into chains

Page 69: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (6)

• Co-Reference Resolution: The Mention-Pair Model

North Korea opened its doors to the U.S. today,

welcoming Secretary of State Madeline Albright.

She says her visit is a good start. The U.S. remains

concerned about North Korea’s missile

development program and its exports of missiles

to Iran.

Classifier: Are Two NPs Co-Referencing? (Binary Category: Yes/No)

Page 70: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (6)

• Co-Reference Resolution: The Mention-Pair Model

North Korea opened its doors to the U.S. today,

welcoming Secretary of State Madeline Albright.

She says her visit is a good start. The U.S. remains

concerned about North Korea’s missile

development program and its exports of missiles

to Iran.

Classifier: Are Two NPs Co-Referencing? (Binary Category: Yes/No)

Pairs To Classify:

Madeline Albright

Iran

North Korea

Her

its

North Korea

its

Page 71: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (6)

• Co-Reference Resolution: The Mention-Pair Model

North Korea opened its doors to the U.S. today,

welcoming Secretary of State Madeline Albright.

She says her visit is a good start. The U.S. remains

concerned about North Korea’s missile

development program and its exports of missiles

to Iran.

Classifier: Are Two NPs Co-Referencing? (Binary Category: Yes/No)

Pairs To Classify:

North Korea

its

U.S.

She

Her

its

Iran

Madeline Albright

Iran

North Korea

Her

its

North Korea

its

Page 72: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (6)

• Co-Reference Resolution: The Mention-Pair Model

North Korea opened its doors to the U.S. today,

welcoming Secretary of State Madeline Albright.

She says her visit is a good start. The U.S. remains

concerned about North Korea’s missile

development program and its exports of missiles

to Iran.

Classifier: Are Two NPs Co-Referencing? (Binary Category: Yes/No)

Pairs To Classify:

North Korea

its

U.S.

She

Her

its

Iran

Madeline Albright

Iran

North Korea

Her

its

North Korea

its

No

No

No

Yes

No

Yes

No

Page 73: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (7)

• Co-Reference Resolution: The Mention-Pair Model

North Korea opened its doors to the U.S. today,

welcoming Secretary of State Madeline Albright.

She says her visit is a good start. The U.S. remains

concerned about North Korea’s missile

development program and its exports of missiles

to Iran.

Clustering: Which Co-Referencing NPs Belong to the Same Chain?

Page 74: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (7)

• Co-Reference Resolution: The Mention-Pair Model

North Korea opened its doors to the U.S. today,

welcoming Secretary of State Madeline Albright.

She says her visit is a good start. The U.S. remains

concerned about North Korea’s missile

development program and its exports of missiles

to Iran.

Clustering: Which Co-Referencing NPs Belong to the Same Chain?

• If “Madeline Albright” and “she” co-reference,

Page 75: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (7)

• Co-Reference Resolution: The Mention-Pair Model

North Korea opened its doors to the U.S. today,

welcoming Secretary of State Madeline Albright.

She says her visit is a good start. The U.S. remains

concerned about North Korea’s missile

development program and its exports of missiles

to Iran.

Clustering: Which Co-Referencing NPs Belong to the Same Chain?

• If “Madeline Albright” and “she” co-reference,

• And if “she” and “her” co-reference,

Page 76: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (7)

• Co-Reference Resolution: The Mention-Pair Model

North Korea opened its doors to the U.S. today,

welcoming Secretary of State Madeline Albright.

She says her visit is a good start. The U.S. remains

concerned about North Korea’s missile

development program and its exports of missiles

to Iran.

Clustering: Which Co-Referencing NPs Belong to the Same Chain?

• If “Madeline Albright” and “she” co-reference,

• And if “she” and “her” co-reference,

• Then “Madeline Albright” and “her” must also co-reference

Page 77: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (7)

• Co-Reference Resolution: The Mention-Pair Model

North Korea opened its doors to the U.S. today,

welcoming Secretary of State Madeline Albright.

She says her visit is a good start. The U.S. remains

concerned about North Korea’s missile

development program and its exports of missiles

to Iran.

Clustering: Which Co-Referencing NPs Belong to the Same Chain?

• If “Madeline Albright” and “she” co-reference,

• And if “she” and “her” co-reference,

• Then “Madeline Albright” and “her” must also co-reference

Works great, but only if there are no classifier errors.

Page 78: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (8)

• Co-Reference Resolution: The Entity-Mention Model

North Korea opened its doors to the U.S. today,

welcoming Secretary of State Madeline Albright.

She says her visit is a good start. The U.S. remains

concerned about North Korea’s missile

development program and its exports of missiles

to Iran.

Classifier: Are an NP and a Preceding Cluster Co-Referencing?

Page 79: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (8)

• Co-Reference Resolution: The Entity-Mention Model

North Korea opened its doors to the U.S. today,

welcoming Secretary of State Madeline Albright.

She says her visit is a good start. The U.S. remains

concerned about North Korea’s missile

development program and its exports of missiles

to Iran.

Classifier: Are an NP and a Preceding Cluster Co-Referencing?

Pairs To Classify:

[North Korea + its]

[North Korea + its]

[Madeline Albright + She]

[Madeline Albright + She]

[She + her]

[North Korea + its]

[North Korea’s + its]

Page 80: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (8)

• Co-Reference Resolution: The Entity-Mention Model

North Korea opened its doors to the U.S. today,

welcoming Secretary of State Madeline Albright.

She says her visit is a good start. The U.S. remains

concerned about North Korea’s missile

development program and its exports of missiles

to Iran.

Classifier: Are an NP and a Preceding Cluster Co-Referencing?

Pairs To Classify:

[North Korea + its]

[North Korea + its]

[Madeline Albright + She]

[Madeline Albright + She]

[She + her]

[North Korea + its]

[North Korea’s + its]

Madeline Albright

She

its

her

its

North Korea’s

Iran

Page 81: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (8)

• Co-Reference Resolution: The Entity-Mention Model

North Korea opened its doors to the U.S. today,

welcoming Secretary of State Madeline Albright.

She says her visit is a good start. The U.S. remains

concerned about North Korea’s missile

development program and its exports of missiles

to Iran.

Classifier: Are an NP and a Preceding Cluster Co-Referencing?

Pairs To Classify:

[North Korea + its]

[North Korea + its]

[Madeline Albright + She]

[Madeline Albright + She]

[She + her]

[North Korea + its]

[North Korea’s + its]

Madeline Albright

She

its

her

its

North Korea’s

Iran

No

No

No

Yes

No

Yes

No

Page 82: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (9)

• Co-Reference Resolution: The Entity-Mention Model

North Korea opened its doors to the U.S. today,

welcoming Secretary of State Madeline Albright.

She says her visit is a good start. The U.S. remains

concerned about North Korea’s missile

development program and its exports of missiles

to Iran.

Classifier: Are an NP and a Preceding Cluster Co-Referencing?

• Allows cluster-level features

• Maintains consistency within large clusters

Page 83: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (10)

• Co-Reference Resolution: Remaining Problems

North Korea opened its doors to the U.S. today,

welcoming Secretary of State Madeline Albright.

She says her visit is a good start. The U.S. remains

concerned about North Korea’s missile

development program and its exports of missiles

to Iran.

• Often very many candidates• Ranking models help choose best

Page 84: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (10)

• Co-Reference Resolution: Remaining Problems

North Korea opened its doors to the U.S. today,

welcoming Secretary of State Madeline Albright.

She says her visit is a good start. The U.S. remains

concerned about North Korea’s missile

development program and its exports of missiles

to Iran.

• Often very many candidates• Ranking models help choose best

• Many entities are not anaphoric • “Singletons”• More than choice of cluster

Page 85: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (10)

• Co-Reference Resolution: Remaining Problems

North Korea opened its doors to the U.S. today,

welcoming Secretary of State Madeline Albright.

She says her visit is a good start. The U.S. remains

concerned about North Korea’s missile

development program and its exports of missiles

to Iran.

• Often very many candidates• Ranking models help choose best

• Many entities are not anaphoric • “Singletons”• More than choice of cluster

• Co-reference can depend on lexical relations• “Corsica is …. The island is …..”

Page 86: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (11)

• Features

• First, what are the cues for / properties of co-referenced words?

Page 87: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (11)

• Features

• First, what are the cues for / properties of co-referenced words?

• Second, how can we model those cues?

Page 88: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (11)

• Features

• First, what are the cues for / properties of co-referenced words?

• Second, how can we model those cues?

• e.g., Automatically extract features representing those cues

Page 89: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (11)

• Features

• First, what are the cues for / properties of co-referenced words?

• Second, how can we model those cues?

• e.g., Automatically extract features representing those cues

• (1) How to represent the language (e.g., WordNet synsets, parse tree)

Page 90: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (11)

• Features

• First, what are the cues for / properties of co-referenced words?

• Second, how can we model those cues?

• e.g., Automatically extract features representing those cues

• (1) How to represent the language (e.g., WordNet synsets, parse tree)

• (2) How to measure the property in that representation

Page 91: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (12)

• Hard Constraints

• Number agreement

• “John has an Acura. It is red.”

Page 92: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (12)

• Hard Constraints

• Number agreement

• “John has an Acura. It is red.”

• Person and case agreement

• “*John and Mary have Acuras. We love them.” (where We=John and Mary)

Page 93: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (12)

• Hard Constraints

• Number agreement

• “John has an Acura. It is red.”

• Person and case agreement

• “*John and Mary have Acuras. We love them.” (where We=John and Mary)

• Gender agreement

• “John has an Acura. He/it/she is attractive.”

Page 94: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (12)

• Hard Constraints

• Number agreement

• “John has an Acura. It is red.”

• Person and case agreement

• “*John and Mary have Acuras. We love them.” (where We=John and Mary)

• Gender agreement

• “John has an Acura. He/it/she is attractive.”

Binary Features:For each candidate pair, is the constraint satisfied?

(Yes/No)

Page 95: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (13)

• Hard Constraints

• Number agreement

• “John has an Acura. It is red.”

• Person and case agreement

• “*John and Mary have Acuras. We love them.” (where We=John and Mary)

• Gender agreement

• “John has an Acura. He/it/she is attractive.”

Categorical Features:A feature for each candidates number, person,

gender, etc.

Page 96: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (14)

• Syntactic constraints

• “John bought himself a new Acura.” (himself=John)

• “John bought him a new Acura.” (him = not John)

Page 97: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (14)

• Syntactic constraints

• “John bought himself a new Acura.” (himself=John)

• “John bought him a new Acura.” (him = not John)

• Required representation: parse tree / dependency relations

• Binary Feature: Does the necessary relation obtain?

Page 98: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (15)

• Selectional Restrictions

• “John parked his Acura in the garage. He had driven it around for hours.”

• “it” must refer to something that can be driven.

Page 99: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (15)

• Selectional Restrictions

• “John parked his Acura in the garage. He had driven it around for hours.”

• “it” must refer to something that can be driven.

• Knowledge-based Approach:

• VerbNet for verb properties

• WordNet for synset membership

Page 100: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (16)

• Selectional Restrictions

• “John parked his Acura in the garage. He had driven it around for hours.”

• “it” must refer to something that can be driven.

• Distributional Approach:

• Cluster nouns and verbs into classes

• What is the probability noun A will occur with verb B?

Page 101: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (17)

• Recency / Salience

• “John has an Integra. Bill has a Legend. Mary likes to drive it.”

Page 102: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (17)

• Recency / Salience

• “John has an Integra. Bill has a Legend. Mary likes to drive it.”

• Syntactic Operationalization

• How far removed in the parse tree are A and B?

Page 103: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (17)

• Recency / Salience

• “John has an Integra. Bill has a Legend. Mary likes to drive it.”

• Syntactic Operationalization

• How far removed in the parse tree are A and B?

• Semantic Operationalization

• How prominent is the semantic role of A (e.g., agent vs. patient)

Page 104: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (18)

• Grammatical Role: Subject preference

• “John went to the Acura dealership with Bill. He bought an Integra.”

• “Bill went to the Acura dealership with John. He bought an Integra.”

• “(?) John and Bill went to the Acura dealership. He bought an Integra.”

Page 105: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (18)

• Grammatical Role: Subject preference

• “John went to the Acura dealership with Bill. He bought an Integra.”

• “Bill went to the Acura dealership with John. He bought an Integra.”

• “(?) John and Bill went to the Acura dealership. He bought an Integra.”

• Categorical Feature:

• Grammatical Role or Semantic Role of each candidate as features

Page 106: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (19)

• Repeated Mentions Preference

• “John needed a car to get to his new job. He decided that he wanted something sporty. Bill went to the Acura dealership with him. He bought an Integra.”

Page 107: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (19)

• Repeated Mentions Preference

• “John needed a car to get to his new job. He decided that he wanted something sporty. Bill went to the Acura dealership with him. He bought an Integra.”

• Not as relevant in pairwise classification

• In entity-mention model, feature of candidate cluster size to favor larger clusters

Page 108: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (20)

• Verb Semantics Preferences

• “John telephoned Bill. He lost the pamphlet on Acuras.”

• “John criticized Bill. He lost the pamphlet on Acuras.”

Page 109: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (20)

• Verb Semantics Preferences

• “John telephoned Bill. He lost the pamphlet on Acuras.”

• “John criticized Bill. He lost the pamphlet on Acuras.”

• Implicit causality

• Implicit cause of criticizing is object.

• Implicit cause of telephoning is subject.

Page 110: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (21)

• Co-Reference Resolution: Features

North Korea opened its doors to the U.S. today,

welcoming Secretary of State Madeline Albright.

She says her visit is a good start. The U.S. remains

concerned about North Korea’s missile

development program and its exports of missiles

to Iran.

String-Matching:

“U.S.” and “U.S.” = Identical

“North Korea” and “North Korea’s” = Similar

Page 111: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (21)

• Co-Reference Resolution: Features

North Korea opened its doors to the U.S. today,

welcoming Secretary of State Madeline Albright.

She says her visit is a good start. The U.S. remains

concerned about North Korea’s missile

development program and its exports of missiles

to Iran.

String-Matching:

“U.S.” and “U.S.” = Identical

“North Korea” and “North Korea’s” = Similar

Minimum Edit Distance

How many operations are required to convert the first string into the second?

Page 112: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (21)

• Co-Reference Resolution: Features

North Korea opened its doors to the U.S. today,

welcoming Secretary of State Madeline Albright.

She says her visit is a good start. The U.S. remains

concerned about North Korea’s missile

development program and its exports of missiles

to Iran.

String-Matching:

“U.S.” and “U.S.” = Identical

“North Korea” and “North Korea’s” = Similar

Minimum Edit Distance

How many operations are required to convert the first string into the second?

“U.S.” and “U.S.” = 0“North Korea” and “North Korea’s” = 2

Page 113: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (22)

• Assume we have all the features for a candidate pair:

“North Korea” and “its”

• The next step is to use the features to predict whether they co-reference

Page 114: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (22)

• Assume we have all the features for a candidate pair:

“North Korea” and “its”

• The next step is to use the features to predict whether they co-reference

Sample Approach 1:

Lappin & Leass. (1994). “An Algorithm for Pronominal Anaphora Resolution.” Computational Linguistics.

Page 115: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (22)

• Assume we have all the features for a candidate pair:

“North Korea” and “its”

• The next step is to use the features to predict whether they co-reference

Sample Approach 1:

Lappin & Leass. (1994). “An Algorithm for Pronominal Anaphora Resolution.” Computational Linguistics.

Sample Approach 2:

McCarthy & Lehnert. (1995). “Using Decision Trees for Coreference Resolution.” Proceedings of IJCAI.

Page 116: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (23)

• Lappin and Leass (1994): Given he/she/it, assign antecedent

Page 117: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (23)

• Lappin and Leass (1994): Given he/she/it, assign antecedent

• (1) Discourse model update

• When a new noun phrase is encountered:

Page 118: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (23)

• Lappin and Leass (1994): Given he/she/it, assign antecedent

• (1) Discourse model update

• When a new noun phrase is encountered:

(a) Add a representation to discourse model with a salience value

Page 119: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (23)

• Lappin and Leass (1994): Given he/she/it, assign antecedent

• (1) Discourse model update

• When a new noun phrase is encountered:

(a) Add a representation to discourse model with a salience value(b) Modify saliences

Page 120: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (23)

• Lappin and Leass (1994): Given he/she/it, assign antecedent

• (1) Discourse model update

• When a new noun phrase is encountered:

(a) Add a representation to discourse model with a salience value(b) Modify saliences

• (2) Pronoun resolution

• (a) Choose the most salient antecedent

Page 121: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (24)

• Pre-defined Weights:

80Head noun emphasis

50Non-adverbial emphasis

40Ind. Obj and oblique emphasis

50Accusative (direct object) emphasis

70Existential emphasis

80Subject emphasis

100Subject recency

Page 122: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (24)

• Pre-defined Weights:

• Weights are cut in half after eachsentence is processed

80Head noun emphasis

50Non-adverbial emphasis

40Ind. Obj and oblique emphasis

50Accusative (direct object) emphasis

70Existential emphasis

80Subject emphasis

100Subject recency

Page 123: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (24)

• Pre-defined Weights:

• Weights are cut in half after eachsentence is processed

• This, and a sentence recencyweight (100 for new sentences,cut in half each time), capturesthe recency preferences

80Head noun emphasis

50Non-adverbial emphasis

40Ind. Obj and oblique emphasis

50Accusative (direct object) emphasis

70Existential emphasis

80Subject emphasis

100Subject recency

Page 124: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (25)

• Algorithm:

• (1) Collect the potential referents (up to 4 sentences back)

Page 125: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (25)

• Algorithm:

• (1) Collect the potential referents (up to 4 sentences back)

• (2) Remove potential referents that do not agree in number or gender with the pronoun

Page 126: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (25)

• Algorithm:

• (1) Collect the potential referents (up to 4 sentences back)

• (2) Remove potential referents that do not agree in number or gender with the pronoun

• (3) Remove potential references that do not pass syntactic coreference constraints

Page 127: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (25)

• Algorithm:

• (1) Collect the potential referents (up to 4 sentences back)

• (2) Remove potential referents that do not agree in number or gender with the pronoun

• (3) Remove potential references that do not pass syntactic coreference constraints

• (4) Compute total salience value of referent from all factors

Page 128: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (25)

• Algorithm:

• (1) Collect the potential referents (up to 4 sentences back)

• (2) Remove potential referents that do not agree in number or gender with the pronoun

• (3) Remove potential references that do not pass syntactic coreference constraints

• (4) Compute total salience value of referent from all factors

• (5) Select referent with highest salience value. In case of tie, select closest.

Page 129: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (26)

• Problems:

• Limited features

Page 130: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (26)

• Problems:

• Limited features

• Feature weight assumed in advance

Page 131: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (26)

• Problems:

• Limited features

• Feature weight assumed in advance

• Hard constraints mixed with imperfect feature extraction

Page 132: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (26)

• Problems:

• Limited features

• Feature weight assumed in advance

• Hard constraints mixed with imperfect feature extraction

• Limited coverage (e.g., pronouns)

Page 133: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (26)

• Problems:

• Limited features

• Feature weight assumed in advance

• Hard constraints mixed with imperfect feature extraction

• Limited coverage (e.g., pronouns)

• Hand-crafted rules are very language dependent

Page 134: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (27)

• McCarthy & Lehnert (1995): Given two entities, should they be linked?

Page 135: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (27)

• McCarthy & Lehnert (1995): Given two entities, should they be linked?

• (1) Create training data by manually annotating gold-standard links

Page 136: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (27)

• McCarthy & Lehnert (1995): Given two entities, should they be linked?

• (1) Create training data by manually annotating gold-standard links

• Every possible pair is a training example

• Positive: Co-Referenced pairs (very small number)

• Negative: Not Co-Referenced pairs (majority of examples)

Page 137: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (27)

• McCarthy & Lehnert (1995): Given two entities, should they be linked?

• (1) Create training data by manually annotating gold-standard links

• Every possible pair is a training example

• Positive: Co-Referenced pairs (very small number)

• Negative: Not Co-Referenced pairs (majority of examples)

• (2) Extract features for each possible pair

Page 138: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (27)

• McCarthy & Lehnert (1995): Given two entities, should they be linked?

• (1) Create training data by manually annotating gold-standard links

• Every possible pair is a training example

• Positive: Co-Referenced pairs (very small number)

• Negative: Not Co-Referenced pairs (majority of examples)

• (2) Extract features for each possible pair

• (3) Use learning algorithm to assign weights to features

Page 139: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (27)

• Note: Different Uses of Introspection

• (1) To create rules and feature weights in advance

• (2) To annotate gold-standard training set

• But introspection is involved in both methods

Page 140: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (28)

Features (one for each candidate entity in pair)(features were manually annotated)

Page 141: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (28)

Features (one for each candidate entity in pair)(features were manually annotated)

NAME-{1,2}: Does reference include a name?

Page 142: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (28)

Features (one for each candidate entity in pair)(features were manually annotated)

NAME-{1,2}: Does reference include a name?

JV-CHILD-{1,2}: Does reference refer to part of a joint venture?

Page 143: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (28)

Features (one for each candidate entity in pair)(features were manually annotated)

NAME-{1,2}: Does reference include a name?

JV-CHILD-{1,2}: Does reference refer to part of a joint venture?

ALIAS: Does one reference contain an alias for the other?

Page 144: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (28)

Features (one for each candidate entity in pair)(features were manually annotated)

NAME-{1,2}: Does reference include a name?

JV-CHILD-{1,2}: Does reference refer to part of a joint venture?

ALIAS: Does one reference contain an alias for the other?

BOTH-JV-CHILD: Do both refer to part of a joint venture?

Page 145: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (28)

Features (one for each candidate entity in pair)(features were manually annotated)

NAME-{1,2}: Does reference include a name?

JV-CHILD-{1,2}: Does reference refer to part of a joint venture?

ALIAS: Does one reference contain an alias for the other?

BOTH-JV-CHILD: Do both refer to part of a joint venture?

COMMON-NP: Do both contain a common NP?

Page 146: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (28)

Features (one for each candidate entity in pair)(features were manually annotated)

NAME-{1,2}: Does reference include a name?

JV-CHILD-{1,2}: Does reference refer to part of a joint venture?

ALIAS: Does one reference contain an alias for the other?

BOTH-JV-CHILD: Do both refer to part of a joint venture?

COMMON-NP: Do both contain a common NP?

SAME-SENTENCE: Are both in the same sentence?

Page 147: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (29)

Algorithm: C4.5 (Builds a Decision Tree Using Training Data)

Page 148: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (29)

Algorithm: C4.5

(1) Incrementally build decision-tree from labeled training examples

Page 149: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (29)

Algorithm: C4.5

(1) Incrementally build decision-tree from labeled training examples

(2) At each stage choose “best” attribute to split dataset

e.g., use info-gain to compare features

Page 150: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (29)

Algorithm: C4.5

(1) Incrementally build decision-tree from labeled training examples

(2) At each stage choose “best” attribute to split dataset

e.g., use info-gain to compare features

(3) After building complete tree, prune the leaves to prevent overfitting

e.g., remove branches based on useless features

Page 151: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (30)

Algorithm: Building Co-Reference Chains

(1) If A and B co-reference

Page 152: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (30)

Algorithm: Building Co-Reference Chains

(1) If A and B co-reference

(2) And if B and C co-reference

Page 153: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (30)

Algorithm: Building Co-Reference Chains

(1) If A and B co-reference

(2) And if B and C co-reference

(3) A-B-C all co-reference

Page 154: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (30)

Algorithm: Building Co-Reference Chains

(1) If A and B co-reference

(2) And if B and C co-reference

(3) A-B-C all co-reference

The Mention-Pair model

Page 155: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (31)

• Problems:

• Limited features (and manually annotated)

• Features are domain and possibly language-dependent

Page 156: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (31)

• Problems:

• Limited features (and manually annotated)

• Features are domain and possibly language-dependent

• Advantages:

• Features weights not assumed, so less language-dependent

Page 157: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (31)

• Problems:

• Limited features (and manually annotated)

• Features are domain and possibly language-dependent

• Advantages:

• Features weights not assumed, so less language-dependent

• Learning algorithm can give less weight to more inaccurate features

Page 158: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Co-Reference Resolution (31)

• Problems:

• Limited features (and manually annotated)

• Features are domain and possibly language-dependent

• Advantages:

• Features weights not assumed, so less language-dependent

• Learning algorithm can give less weight to more inaccurate features

• Negative as well as Positive information

Page 159: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Schedule

1:10 – 1:20 Introductions

1:20 – 1:30 Structure of Class

1:30 – 2:10 Intro to Data-Driven Computational Pragmatics

2:10 – 2:50 Co-Reference Resolution

2:50 – 3:00 Conclusion / Flex Time

Page 160: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Concluding

• Referencing Relations vs. Lexical Relations

• Co-Referencing: Relations between referents

Page 161: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Concluding

• Referencing Relations vs. Lexical Relations

• Co-Referencing: Relations between referents

• Lexical Relations (e.g., Entailment): Relations between senses

Page 162: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Concluding (2)

Relations Between Referents

Mary brought her bike.

The U.S. is a large country.

The judges heard it and they are angry.

He is certainly the man I saw.

Page 163: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Concluding (2)

Relations Between Referents

Mary brought her bike.

The U.S. is a large country.

The judges heard it and they are angry.

He is certainly the man I saw.

Specific to a particular use (Local)

Page 164: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Concluding (2)

Relations Between Referents Relations Between Senses

Mary brought her bike.

The U.S. is a large country.

The judges heard it and they are angry.

He is certainly the man I saw.

House ENTAILS => Building

Run ENTAILS => Move

Car ENTAILS => Automobile

Corsica ENTAILS => Island

Specific to a particular use (Local)

Page 165: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Concluding (2)

Relations Between Referents Relations Between Senses

Mary brought her bike.

The U.S. is a large country.

The judges heard it and they are angry.

He is certainly the man I saw.

House ENTAILS => Building

Run ENTAILS => Move

Car ENTAILS => Automobile

Corsica ENTAILS => Island

Specific to a particular use (Local) Specific to senses of words (Global)

Page 166: Data-Driven Computational Pragmatics 1.pdfSchedule 1:10 –1:20 Introductions 1:20 –1:30 Structure of Class 1:30 –2:10 Intro to Data-Driven Computational Pragmatics 2:10 –2:50

Concluding (3)

• Reasoning and Inferences

• Requires Co-Referencing

• Requires Lexical Relations

• Thursday’s Focus: Learning Lexical Relations for Reasoning