example-based machine translation using structural translation examples

25
Example-based Machine Translation using Structural Translation Examples Eiji Aramaki* Sadao Kurohashi* * University of Tokyo

Upload: maxwell-franklin

Post on 03-Jan-2016

41 views

Category:

Documents


5 download

DESCRIPTION

Example-based Machine Translation using Structural Translation Examples. Eiji Aramaki* Sadao Kurohashi* * University of Tokyo. Proposed System. Proposed System. Parses an Input Sentence. Selects Structural Translation Examples. Combines them to generate an output tree. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Example-based Machine Translation using Structural Translation Examples

Example-based Machine Translation using Structural

Translation Examples

Eiji Aramaki*

Sadao Kurohashi*

* University of Tokyo

Page 2: Example-based Machine Translation using Structural Translation Examples

Proposed System

Page 3: Example-based Machine Translation using Structural Translation Examples

Proposed System

SelectsStructural Translation Examples

Combines them to generate an output tree

Decides the word-order

Parses an Input Sentence

Page 4: Example-based Machine Translation using Structural Translation Examples

Structural Translation Examples

• The Advantage of High-Usability

• BUT: It requires many technologies– Parsing & Tree Alignment (are still being developed)

→ A naive method without such technologies may be efficient in a limited domain

新聞を(news paper)

Give

me

newspaper下さい(give)

Japanese

日本語の(Japanese)

obj

pos

Page 5: Example-based Machine Translation using Structural Translation Examples

Outline

• Algorithm– Alignment Module– Translation Module

• Experimental Results

• Conclusion

Page 6: Example-based Machine Translation using Structural Translation Examples

System Frame Work

• Alignment module– Builds Translation Examples from Bilingual Corpus

Input

Alignmentmodule

Output

Translationmodule

BilingualCorpus

TranslationMemory

• Translation module– Selects Translation Examples– Combines them into a Translation

Page 7: Example-based Machine Translation using Structural Translation Examples

Alignment Module ( 1/2)• A sentence pair is analyzed by parsers

[Kurohashi1994][Charniak2000]

• Correspondences are estimated by Dictionary-based Alignment method [Aramaki 2001]

新聞を(news paper)

Give

me

newspaper下さい(give)

Japanese

日本語の(Japanese)

obj

pos

(S (VP (VBP Give) (NP (PRP me)) (NP (DT a) (JJ Japanese) (NN newspaper.))))

Page 8: Example-based Machine Translation using Structural Translation Examples

• Translation example= A combinations of correspondences which are connected to each

other

– With Surrounding phrases (= the parent and children phrases of correspondences)

• for Selection of Translation Examples

Alignment Module ( 2/2)

Page 9: Example-based Machine Translation using Structural Translation Examples

System Frame Work

• Alignment module– Builds Translation Examples from Bilingual Corpus

Input

Alignmentmodule

Output

Translationmodule

BilingualCorpus

TranslationMemory

• Translation module– Selects of Translation Examples– Combines them into a Translation

Page 10: Example-based Machine Translation using Structural Translation Examples

Translation Module(1/2)

• Equality : The number of equal phrases

新聞を(news paper)

Give

me

newspaper下さい(give)

obj新聞を(news paper)

下さい(give)

中国語の(Chinese)

obj

pos

• Context Similarity:– calculated with a Japanese thesaurus

• Alignment Confidence:

– the ratio of content words which can be found in dictionaries

INPUT TRANSLATION EXAMPE

Page 11: Example-based Machine Translation using Structural Translation Examples

Translation Module(1/2)

新聞を(news paper)

Give

me

newspaper下さい(give)

obj新聞を(news paper)

下さい(give)

中国語の(Chinese)

obj

pos

INPUT TRANSLATION EXAMPE

• Equality : The number of equal phrases

• Context Similarity:– calculated with a Japanese thesaurus

• Alignment Confidence:

– the ratio of content words which can be found in dictionaries

Page 12: Example-based Machine Translation using Structural Translation Examples

新聞を(news paper)

Give

me

newspaper下さい(give)

Japanese

日本語の(Japanese)

obj

pos

新聞を(news paper)

下さい(give)

中国語の(Chinese)

obj

pos

INPUT TRANSLATION EXAMPE

• Equality : The number of equal phrases

• Context Similarity:– calculated with a Japanese thesaurus

• Alignment Confidence:

– the ratio of content words which can be found in dictionaries

Context = surrounding phrasesContext = surrounding phrases

Page 13: Example-based Machine Translation using Structural Translation Examples

Translation Module(1/2)

新聞を(news paper)

Give

me

newspaper下さい(give)

Japanese

日本語の(Japanese)

obj

pos

新聞を(news paper)

下さい(give)

中国語の(Chinese)

obj

pos

INPUT TRANSLATION EXAMPE

• Equality : The number of equal phrases

• Context Similarity:– calculated with a Japanese thesaurus

• Alignment Confidence:

– the ratio of content words which can be found in dictionaries

Page 14: Example-based Machine Translation using Structural Translation Examples

Translation Module(2/2)• Selection

– Score:= ( Equality + Similarity ) x (λ+ Confidence)

• Combine

Give

me

newspaper

Japanese

Give

me

newspaper

Japanese+ =

– The dependency relations & the word order between the translation examples are decided by heuristic rules

– The dependency relations & the word order in the translation examples are preserved

Page 15: Example-based Machine Translation using Structural Translation Examples

Exception: ShortcutInput

Alignmentmodule

Output

Translationmodule

TranslationMemory

If a Translation Example is almost equal to the input

⇒ the system outputs its target parts as it is.

• Almost equal= Character-based DP Matching Similarity > 90%

BilingualCorpus

Page 16: Example-based Machine Translation using Structural Translation Examples

Outline

• Algorism– Alignment Module– Translation Module

• Experimental Results

• Conclusion

Page 17: Example-based Machine Translation using Structural Translation Examples

Experiments

• We built Translation Examples from training-set

(only given in IWSLT)

bleu nist wer per gtm

Dev-set 0.38 7.86 0.52 0.45 0.66

Test-set 0.39 7.89 0.49 0.42 0.67

• Dev-set & Test-set score are similar← the system has no tuning metrics for the dev-set.

Auto. Eval. Result

Page 18: Example-based Machine Translation using Structural Translation Examples

Corpus size & Performance

0

0.1

0.2

0.3

0.4

0.5

0 5000 10000 15000 20000

corpus size

bleu

The system without a corpus can generate translations using only the translation dictionaries.

The score is not saturated ⇒the system will achieve a higher performance if we obtain more corpora.

Page 19: Example-based Machine Translation using Structural Translation Examples

Subjective Evaluation

• Subjective Evaluation Result

• Error Analysis– Most of the errors are classified into the following

three problems:(1) Function Words(2) Word Order(3) Zero-pronoun

Fluency 3.650

Adequacy 3.316

5: "Flawless English" 4: "Good English" 3: "Non-native English" 2: "Disfluent English" 1: "Incomprehensible"

Page 20: Example-based Machine Translation using Structural Translation Examples

Problem1: Function words

• The system selects translation examples using mainly content words

⇒ it sometimes generates un-natural function words– Determiners, prepositions

OUTPUT i 'd like to contact my Japanese embassy

Translation

ExampleI 'd like to contact my bank

Page 21: Example-based Machine Translation using Structural Translation Examples

Problem 2: Word Order

• The word order between translation examples is decided by the heuristic rules.

• The lack of rules leads to the wrong word order.

OUTPUT is there anything a like local cuisine?

OUTPUT has a bad headache.

Problem 3: Zero-pronoun

• The input includes zero-pronoun. ⇒ outputs without a pronoun.

Page 22: Example-based Machine Translation using Structural Translation Examples

Outline

• Algorism– Alignment Module– Translation Module

• Experimental Results

• Conclusion

Page 23: Example-based Machine Translation using Structural Translation Examples

Conclusions

• We described an EBMT system which handles Structural translation examples

• The experimental results shows the basic feasibility of this approach

• In the future, as the amount of corpora increases, the system will achieve a higher performance

Page 24: Example-based Machine Translation using Structural Translation Examples
Page 25: Example-based Machine Translation using Structural Translation Examples

Alignment Module ( 1/2)

新聞を

Give

me

newspaper下さい

新聞を

Give

me

newspaper下さいJapanese

日本語の

新聞を Give

newspaper下さいJapanese

日本語の

新聞を

Give

me

newspaper下さいJapanese

日本語の

新聞を

newspaper

Japanese

日本語の

新聞を

Give

newspaper下さいJapanese

日本語の