rhetorical structure theory - massachusetts institute of ...€¦ · domain-dependent rhetorical...

27
Rhetorical Structure Theory Regina Barzilay EECS Department MIT November 2, 2004

Upload: others

Post on 24-Apr-2020

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Rhetorical Structure Theory

Regina Barzilay

EECS Department

MIT

November 2, 2004

Page 2: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Domain-Dependent Content Models

• Capture topics and their distribution

• Are based on pattern matching techniques

– Motifs of semantic units

– Distributional model

• Useful in generation and summarization

Rhetorical Structure Theory 1/26

Page 3: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Domain-Dependent Rhetorical Model

Domain: Scientific Articles

• Humans exhibit high agreement on the annotation scheme

• The scheme covers only a small fraction of discourse relations

Rhetorical Structure Theory 2/26

Page 4: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Domain-Independent Rhetorical Model

Model elements: •

– Binary Relations

– Compositionality Principle

• Requirements:

– Stability and Reproducibility of an Annotation Scheme

– Expressive Power of a Model

Rhetorical Structure Theory 3/26

Page 5: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Informational Structure

• How many different coherence relations are there?

Are different taxonomies of coherence relations•

compatible with each other?

• Some real-time evidence for validity of some coherence relations: pronoun experiments (difference cause-effect/resemblance)

Rhetorical Structure Theory 4/26

Page 6: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Coherence Relations: Historic Perspective

Aristotle Boccaccio Hume

(4th cent. BC) (14th cent.) (18th cent.)

Rhetorical Structure Theory 5/26

Page 7: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Example of Coherence Relation (1)

Causal relations: Cause-Effect

effect cause

John is dishonest because he is a politician.

Rhetorical Structure Theory 6/26

Page 8: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Example of Coherence Relation (2)

Causal relations: Violated-Expectations

John is honest although he is a politician.

John is dishonest

Rhetorical Structure Theory 7/26

Page 9: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Example of Coherence Relation (3)

Causal relations: Condition

If someone is a politician he is dishonest

Rhetorical Structure Theory 8/26

Page 10: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Example of Coherence Relation (4)

Resemblance relations: Pa rallel

John organized rallies for Gore,

and Fred distributed pamphlets for him.

Rhetorical Structure Theory 9/26

Page 11: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Example of Coherence Relation (5)

Resemblance relations: Contrast

John

and Fred cheered for Bush.

supported Gore,

Rhetorical Structure Theory 10/26

Page 12: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Example of Coherence Relation (6)

Elaborations relations:

John

and Fred cheered for Bush.

supported Gore,

Rhetorical Structure Theory 11/26

Page 13: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

How many coherence relations?

• Some accounts of coherence assume 2, other more than 400 coherence relations

• Hovy&Maier 1995: taxonomies with more relations represent subtypes of taxonomies with fewer relations

– cause-effect volitional, non-volitional →

Rhetorical Structure Theory 12/26

Page 14: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Problem: Ambiguity

Rhetorical Structure Theory 13/26

To see this image, go to http://images.google.com/images?q=yolady.gif

Page 15: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Find Coherence RelationsConsider this extract from “The Kreutzer Sonata“ by L. Tolstoy

(A) It is amazing how complete is the delusion that beauty is goodness .

(B) A handsome woman talks nonsense , you listen and hear not nonsense but cleverness .

(C) She says and does horrid things , and you see only charm .

(D) And if a handsome woman does not say stupid or horrid things , you at once persuade yourself that she is wonderfully clever and moral .

Rhetorical Structure Theory 14/26

Page 16: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Rhetorical Structure Theory

(Mann&Thompson:1988, Matthessen&Thompson:1988)

• Developed in the framework of natural language generation

• Aims to describe “building blocks” of text structure

– Nucleus vs Satellites

– Binary Relations between Discourse Units

• Compositionality principle defines how to build a tree from binary relations

Rhetorical Structure Theory 15/26

Page 17: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Example

[ No matter how much one wants to stay a non-smoker, A

], [ the truth is that the pressure to smoke in junior high is Bgreater than it will be any other time of one’s life. ] . [ We

know that 3,000 teens start smoking each day, C ] [ although it is a fact that 90% of them once thought that smoking was

Dsomething that they’ll never do. ]

Rhetorical Structure Theory 16/26

Page 18: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Binary Relations

• (JUSTIFICATION, A, B)

• (JUSTIFICATION, D, B)

• (EVIDENCE, C, B)

• (CONCESSION, C, D)

• (RESTATEMENT, D, A)

Rhetorical Structure Theory 17/26

Page 19: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

RST tree

JUSTIFICATION

A B C D

JUSTIFICATION CONCESSION

Rhetorical Structure Theory 18/26

Page 20: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Relations

Relation Nucleus Satellite

Background text whose understanding is being facilitated

text whose understanding is being facilitated

Elaboration basic information additional information

Preparation text to be presented text which prepares the reader to expect and in­terpret the text to be pre­sented

Rhetorical Structure Theory 19/26

Page 21: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Compositionality

Whenever two large text spans are connected through a rhetorical relation, that rhetorical relation holds between the most important parts of the constituent spans.

Marcu (1997): used constraint-satisfaction approach to build discourse trees given a set of binary relations Wolf (2004): tree structure is not an adequate representation of discourse structure

Rhetorical Structure Theory 20/26

Page 22: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Automatic Computation of RST Relations

(Marcu, 1997; Marcu&Echihabi, 2002) Surface cues for discourse relations:

I like vegetables, but I hate tomatoes.

Rhetorical Structure Theory 21/26

Page 23: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Automatic Computation of RST Relations

(Marcu, 1997)

• Aggregate discourse relations to a few stable

groups: (contrast, elaboration, condition,cause-explanation-evidence)

• Establish deterministic correspondence between cue phrases and discourse relations:

– { But, However } → Contrast

– { In addition, Moreover } → Elaboration

Rhetorical Structure Theory 22/26

Page 24: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Accuracy

• Compared against manually constructed trees

• Tested against human-constructed trees

• Automatically constructed trees exhibit high similarity with human-constructed trees

• However, see (Marcu&Echihabi, 2002) CONTRAST vs ELABORATION: only 61 from 238 have a discourse marker (26%)

Rhetorical Structure Theory 23/26

Page 25: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Other Words Also Count!

(Marcu&Echihabi, 2002)

Surface cues for discourse relations:

I like vegetables, but I hate tomatoes.

Rhetorical Structure Theory 24/26

Page 26: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Method

• Assume that certain markers unambiguously predict discourse relations

• Create Cartesian product of words located on two sides of a discourse marker

• For each pair of words, compute its likelihood to predict a discourse relation

argmaxrk P (rk|(s1, s2)) = argmaxrk P ((s1, s2) rk)∗P (rk)|

where si is a discourse clause, wi is a word and rk is a discourse relation P ((s1, s2)|rk) =

�i,j∈s1,s2

P ((wi, wj) rk)|

Rhetorical Structure Theory 25/26

Page 27: Rhetorical Structure Theory - Massachusetts Institute of ...€¦ · Domain-Dependent Rhetorical Model Domain: Scientific Articles • Humans exhibit high agreement on the annotation

Evaluation

• Training data:

– Raw 1 billion words corpus (41,147,805 sents)

– BLIPP parsed corpus (1,796,386 sents)

• The system can compute accurately some relations (see handout)

• The size and the quality of the training data matters a lot

Rhetorical Structure Theory 26/26