daniel gildea giorgio satta - university of rochestergildea/pubs/gildea-satta-cl16-slides.pdf ·...

Post on 02-Sep-2020

4 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Synchronous Context-Free Grammars

and Optimal Linear Parsing Strategies

Daniel Gildea Giorgio SattaUniversity of Rochester Università di Padova

Synchronous CFG

Context-free Grammar:

X → A B

Synchronous Context-free Grammar (SCFG)

X →A1

B2

C3

D4 , C

3A

1D

4B

2

C →Powell, 鲍威尔

Synchronous CFG

• Synchronous parsing: find tree from two strings

– used to learn grammar from parallel text

• This talk: parsing strategies for long rules

• Results also apply to translation with n-gram

language model

Context-Free Grammar

A → B C

B

C

A

Binary SCFG

A → B1

C2 , C

2B

1

B

C

A

SCFG with 4 nonterminals

A → B1

C2

D3

E4 , C

2E

4B

1D

3

E

D

C

B

A

Fan-Out

Number of spans in nonterminal.

CFG: fan-out 1 B

C

A

SCFG: fan-out 2 E

D

C

B

A

ϕ(G) = maxN∈G

ϕ(N) (Rambow & Satta, 1999)

Rank

Number of nonterminals on righthand side of rule.

CFG: rank 2 B

C

A

SCFG: rank r E

D

C

B

A

ρ(G) = maxP∈G

ρ(P)

Parsing Strategies

Reduce rankE

D

C

B

A

A → B C D E

C

B

X

D

X

Y

E

Y

A

X → B C Y → X D A → Y E

Parsing Strategies

Reduce rank, may increase fan-out

E

D

C

B

A

C

B

X

Rule Length in Synchronous CFG

• Binary grammar (ITG): parsing is O(n6) (Wu, 1997)

– Works in real MT (Zhang et al. 2006)

• Many rules cannot be binarized without

increasing fan-out (Aho and Ullman, 1972)

• Fan-out affects space and time complexity

Parsing Complexity

Space complexity: O(n2ϕ(A))

Time complexity: O(nϕ(A)+ϕ(B)+ϕ(C))

B

C

A

B

C

A

O(n2) space O(n4) space

O(n3) time O(n6) time

(Seki et al. 1991)

SCFG Parsing Strategies

E

D

C

B

A

C

B

X

naïve strategy: O(n2r+2) time

best strategy: Ω(ncr ) for some c

(Gildea and Štefankovic 2007)

This Talk

• Finding optimal space complexity is

NP-complete

• Finding optimal time complexity ⇒ better algs

for treewidth

Example Rule

B8

B7

B6

B5

B4

B3

B2

B1

A

Optimal Parsing Strategy

n7

n5

B1

n3

B2

n1

B3

B4

n6

B5

n4

B6

n2

B7

B8

B4

B3

n1

Carving Width

2 3 4

1

G

1 2 3 4

tree layout of G

Carving width: max number edges of G routed

through tree layout

Cyclic Permutation Multigraph

B1

B2

B3

B4

B5

B6

B7

B8A

A → B1B

2B

3B

4B

5B

6B

7B

8 ,

B5B

7B

3B

1B

8B

6B

2B

4

Carving Width = Space Complexity

A

n7

n5

n3

n1

n6

n4

n2

B1

B2

B3

B4

B5

B6

B7

B8

Our Reduction

• Carving width instance: (G, k )

• Construct permutation multigraph G′, integer k ′

• Carving width of G ⇔ Carving width of G′⇔

optimal parsing for SCFG

Our Construction

2 3 4

1

G

1 2 3 4

tree layout of G

X1

G1

X2

G2

X3

G3

X4

G4

G1

X1

G2

X2

G3

X3

G4

X4

Space Complexity

Theorem 1: Finding the parsing strategy with optimal

space complexity for an SCFG rule is NP-complete

Treewidth

A C E G I K M

B D F H J L

N

P

R

O

Q

S

CDE DEF EFG FGH GHI HIJ IJK

BCD GHN JK L

ABC HNO K LM

NOP

OPQ PQR QRS

Dependency Graph

x0 x1 x2 x3 x4

y0 y1 y2 y3 y4

x0 x1 x2 x3 x4

A → B C D E S → A1B

2C

3D

4 , B2

D4

A1

C3

Treewidth = Time Complexity

x0 x1 x2 x3 x4

x0x1x2 x0x2x3 x0x3x4

A → B C D E

C

B

X

D

X

Y

E

Y

A

X → B C Y → X D A → Y E

Our Reduction

• Treewidth instance: (G, k )

• Construct dependency graph G′, integer k ′

• Approx of treewidth of G ⇔ Treewidth of G′⇔

optimal time complexity for SCFG

Dependency Graph Construction

Approximation Algorithm for Treewidth

SOL < 8∆(G)(OPT + 1) .

SOL: solution using SCFG parsing strategy

OPT : optimal treewidth of input graph G

∆(G) = degree (max num edges touching one vertex)

Time Complexity

Theorem 2: Finding the parsing strategy with optimal

time complexity for an SCFG rule implies a

∆(G)-factor approximation algorithm for treewidth.

Time Complexity

Theorem 3: If finding the parsing strategy with

optimal time complexity for an SCFG rule is

NP-complete, then treewidth for graphs of degree 6 is

NP-complete.

Conclusion

• Finding parsing strategy with best space

complexity is NP-hard.

• P-time alg for finding parsing strategy with best

time complexity implies better approximation

algs for treewidth

• NP-hardness for time complexity implies

NP-hardness for treewidth of graphs of degree

six

top related