kees van deemter matthew stone formal issues in natural language generation lecture 5 stone,...

44
Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Upload: blake-howard

Post on 28-Mar-2015

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Kees van DeemterMatthew Stone

Formal Issuesin

Natural Language Generation

Lecture 5Stone, Doran,Webber, Bleam &

Palmer

Page 2: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

GRE and surface realization

Arguably, GRE uses a grammar.– Parameters such as the preference order on

properties reflect knowledge of how to communicate effectively.

– Decisions about usefulness or completeness of a referring expression reflect beliefs about utterance interpretation.

Maybe this is a good idea for NLG generally.

Page 3: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

GRE and surface realization

But we’ve thought GRE outputs semantics:

referent: furniture886

type: desk

status: definite

color: brown

origin: sweden

Page 4: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

GRE and surface realization

We also need to link this up with surface form:

the brown Swedish desk

Note: not

?the Swedish brown desk

Page 5: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Today’s initial observations

It’s hard to do realization on its ownmapping from semantics to surface

structure.

It’s easy to combine GRE and realizationbecause GRE is grammatical reasoning!if you have a good representation for

syntax.

Page 6: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Why it’s hard to do realizationA pathological grammar of adjective

order:

NP the N(w).N(w) w N(w) if w is an adjective and

wRw.N(w) w if w is a noun.

Page 7: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Syntax with this grammar

Derivation of example:

the brown Swedish desk

NP

N(brown)

N(Swedish)

N(desk)

Requires: brown R Swedish, Swedish R desk

Page 8: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Realization, formally

You start with k properties.Each property can be realized lexically.

assume: one noun, many adjectives(not that it’s easy to enforce this)

Realization solution:NP which realizes each property exactly

once.

Page 9: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Quick formal analysis

View problem graph-theoretically:k words, corresponding to vertices in a graphR is a graph on the k wordsSurface structure is a Hamiltonian path

(which visits each vertex exactly once)through R.

This is a famous NP complete problemSo surface realization itself is intractable!

Page 10: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Moral of the example

Semantics underdetermines syntactic relations.Here, semantics underdetermines syntactic

relations of adjectives to one another and to the head.

Searching for the correspondence is hard.See also Brew 92, Koller and Striegnitz 02.

Page 11: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Today’s initial observations

It’s hard to do realization on its ownmapping from semantics to surface

structure.

It’s easy to combine GRE and realizationbecause GRE is grammatical reasoning!if you have a good representation for

syntax.

Page 12: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Syntactic processing for GRE

LexicalizationSteps of grammatical derivation correspond to meaningful choices in NLG.

E.g., steps of grammar are synched with steps of adding a property to a description.

Page 13: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Syntactic processing for GRE

Key ideas: lexicalization, plusFlat dependency structure (adjs modify

noun)Hierarchical representation of word-order

NP

N(color)

N(origin)

N(size)

N(material)the desk

Page 14: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Syntactic processing for GRE

Other syntactic lexical entries

Adj

N(origin)

Swedish

N(color)

Adj

brown

Page 15: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Describing syntactic combinationOperation of combination 1: Substitution

NP + =NP

N(color)

N(origin)

N(size)

N(material)the desk

NP

N(color)

N(origin)

N(size)

N(material)the desk

Page 16: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Describing syntactic combinationOperation of combination 2: Sister adjunction

+ =NP

N(color)

N(origin)

N(size)

N(material)the desk

NP

N(color)

N(origin)

N(size)

N(material)the desk

N(color)

Adj

brownAdj

brown

Page 17: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Abstracting syntax

Tree rewriting:Each lexical item is associated with a structure.You have a starting structure.You have ways of combining two structures together.

Page 18: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Abstracting syntax

Derivation treerecords elements and how they are combined

the desk

brown(s.a. @ color)

Swedish(s.a. @ origin)

Page 19: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

An extended incremental algorithm• r = individual to be described• P = lexicon of entries, in preference

orderP is an individual entrysem(P) is a property or set of entries from

the contextsyn(P) is a syntactic element

• L = surface syntax of description

Page 20: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Extended incremental algorithm

L := NPC := DomainFor each P P do:

If r sem(P) & C sem(P)Then do

L := add(syn(P), L)C := C sem(P)If C = {r} then return L

Return failure

Page 21: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Observations

Why use tree-rewriting - not,e.g. CFG derivation?

NP the N(w).N(w) w N(w) if w is an adjective and

wRw.N(w) w if w is a noun.

CFG derivation forces you to select properties in the surface word-order.

Page 22: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Observations

Tree-rewriting frees word-order from choice-order.

NP

N(color)

N(origin)

N(size)

N(material)

the

desk

NP

N(color)

N(origin)

N(size)

N(material)

the

desk

Adj

brown

NP

N(color)

N(origin)

N(size)

N(material)

the

desk

Adj

brown

Adj

Swedish

Page 23: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Observations

Tree-rewriting frees word-order from choice-order.

NP

N(color)

N(origin)

N(size)

N(material)

the

desk

NP

N(color)

N(size)the

NP

N(color)

N(origin)

N(size)

N(material)

the

desk

Adj

brown

Adj

Swedish

N(origin)

N(material)

desk

Adj

Swedish

Page 24: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

This is reflected in derivation treeDerivation tree

records elements and how they are combined

the desk

brown(s.a. @ color)

Swedish(s.a. @ origin)

Page 25: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Formal results

Logical completeness.If there’s a flat derivation tree for an NP that identifies referent r, Then the incremental algorithm finds it.

ButSensible combinations of properties may not yield surface NPs.Hierarchical derivation trees may require lookahead in usefulness check.

Page 26: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Formal results

Computational complexityNothing changes – we just add properties, one after another…

Page 27: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Now, though, we’re choosing specific lexical entries

NP

N(departure)

N(destination)

N(stops)

the

express

Adj

3:35

N

Trenton

vsNP

N(departure)

N(destination)

N(stops)

the

express

Adj

15:35

N

Trenton

maybe these lexical items express the same property…

Page 28: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

• Use

in 12-hour time context

• Use

in 24-hour time context

What motivates these choices?

N(departure)

Adj

3:35

N(departure)

Adj

15:35

Page 29: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

• P = lexicon of entries, in preference orderP is an individual entrysem(P) is a property or set of entries from

the contextsyn(P) is a syntactic elementprags(P) is a test which the context must

satisfy for the entry to be appropriate

Need to extend grammar again

Page 30: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

For example:

syn:

sem: departure(x, 1535)prags: twentyfourhourtime

Need to extend grammar again

N(departure)

Adj

15:35

Page 31: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Extended incremental algorithm

L := NPC := DomainFor each P P do:

If r sem(P) & C sem(P) & prags(P) is trueThen do

L := add(syn(P), L)C := C sem(P)If C = {r} then return L

Return failure

Page 32: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Discussion:What does this entry do?

syn:

sem: thing(x)prags: in-focus(x)

NP

it

Page 33: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Suggestion: find best value

Given: – A set of entries that combine syntactically with

L in the same way– Related by semantic generality and pragmatic

specificity.– Current distractors

Take entries that remove the most distractorsOf those, take the most semantically generalOf those, take the most pragmatically specific

Page 34: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Extended incremental algorithm

L := NP C := DomainRepeat

Choices := { P : add(syn(P), L) at next node & r sem(P) & prags(P) is true }

P := find best value(Choices)L := add(syn(P), L)C := C sem(P)If C = {r} then return L

Return failure

Page 35: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

What is generation anyway?

Generation is intentional (or rational) actionthat’s why Grice’s maxims apply, for

example.

You have a goalYou build a plan to achieve it

(& achieve it economically in a recognizable way)

You carry out the plan

Page 36: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

In GRE…

The goal is for hearer to know the identity of r(in general g)

The plan will be to utter some NP Usuch that the interpretation of U identifies { r }(in general c u cg)

Carrying out the plan means realizing this utterance.

Page 37: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

In other words

GRE amounts to a process of deliberation.

Adding a property to L incrementally is like committing to an action.These commitments are called intentions.Incrementality is characteristic of intentions –

though in general intentions are open to revision.

Note: this connects with belief-desire-intention models of bounded rationality.

Page 38: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

GRE as (BDI) rational agency

L := NP // Initial plan C := Domain // Interpretationwhile (P := FindBest(P, C, L)) { //

DeliberationL := add(syn(P), L) // Adopt new intentionC := C sem(P) // Update interpretationif C = { r } return L // Goal satisfied

}fail

Page 39: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

NLG as (BDI) rational agency

L := X C := Initial Interpretationwhile (P := FindBest(P, C, L)) {

L := AddSyntax(syn(P), L)C := AddInterpretation(sem(P), C)if GoalSatisfied(C) return L

}fail

Page 40: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Conclusionsfor NLG researchers

It’s worth asking (and answering) formal questions about NLG.

Questions of logical completeness – can a generator express everything it ought to?Questions of computational complexity – is the cost of a generation algorithm worth the results?

Page 41: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Conclusionsfor linguists

NLG offers a precise perspective on questions of language use.

For example, what’s the best way of communicating some message?

NLG – as opposed to other perspectives – gives more complete, smaller-scale models.

Page 42: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Conclusionsfor AI in general

NLG does force us to characterize and implement representations & inference for practical interactive systems

Good motivation for computational semantics.Meaty problems like logical form equivalence.

Many connections and possibilities for implementation (graphs, CSPs, circuit optimization, data mining,…)

Page 43: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Open Problems

• Sets and salience in REs.• Generating parallel REs.• Theoretical and empirical measures of

quality/utility for REs.• Avoiding ambiguity in REs.

Any problem in RE generalizes to one in NLG.

Page 44: Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 5 Stone, Doran,Webber, Bleam & Palmer

Followup information

Course web page:http://www.itri.brighton.ac.uk/home/Kees.van.Deemter/esslli-notes.html

– downloadable papers– final lecture notes– papers we’ve talked about– links (recent/upcoming events, siggen,

sigsem)

by Monday August 26.