Modern MT Systems and the Myth of Human Translation: Real World Status Quo


Upload: brenda-keith

Post on 01-Jan-2016


TRANSCRIPT

Page 1: Modern MT Systems and the Myth of Human Translation: Real World Status Quo

Modern MT Systems and the Myth of Human Translation:

Real World Status Quo

● Intro

● MT & HT Definitions

● Comparison MT vs. HT

● Evaluation Methods

● FAE Framework

● Conclusion

● Discussion

Page 2: Modern MT Systems and the Myth of Human Translation: Real World Status Quo

Is This for Me?

This talk may be of benefit to:

● (Freelance) translators and agencies

● Developers and vendors of MT systems

● People concerned with MT evaluation

● People concerned with HT evaluation

Not for interpreters or for speech and other non-text-based issues

Page 3: Modern MT Systems and the Myth of Human Translation: Real World Status Quo

Introduction

● What is Machine Translation (MT)?

"MT is the automatic translation of human language by computers."

● What is [Human] Translation (HT)?

"The process of transforming text from one language into another language."

"A written communication in a second language having the same meaning as the written communication in a first language."

Page 4: Modern MT Systems and the Myth of Human Translation: Real World Status Quo

Introduction II

● Is there such a thing as HT?
• "Pure Human Translation"
• "Machine Aided Human Translation"
• "Human Aided Machine Translation"

● Is HT equal to HT?
• "Native Speaker"
• "Speaks Language X"
• "[Trained] Professional"
• "Trained Prof. specialized in X"

Page 5: Modern MT Systems and the Myth of Human Translation: Real World Status Quo

HT/MT Examples & Quizshow

Original: Einzigartiger Freizeitpark für Groß und Klein

T1: Singular recreational park for large and small
T2: Unique leisure time park for largely and small
T3: Ein Fantastische DinoPark ferrcoitung
T4: Unique Freizeitpark at big and little
T5: Unique amusement park for great and Klein
T6: Unique leisure park for big and little

T1: Babelfish/SYSTRAN
T2: SDL FreeTranslation.com
T3: Human
T4: InterTran
T5: Linguatex eTranslation
T6: PetaMem LangSuite MT

Page 6: Modern MT Systems and the Myth of Human Translation: Real World Status Quo
Page 7: Modern MT Systems and the Myth of Human Translation: Real World Status Quo
Page 8: Modern MT Systems and the Myth of Human Translation: Real World Status Quo
Page 9: Modern MT Systems and the Myth of Human Translation: Real World Status Quo
Page 10: Modern MT Systems and the Myth of Human Translation: Real World Status Quo
Page 11: Modern MT Systems and the Myth of Human Translation: Real World Status Quo

Summary HT Quality

● Not all HTs are equal

● Significant amount done by untrained people

● Better performance of good(!) MT systems on these examples suggests rising MT competitiveness

Page 12: Modern MT Systems and the Myth of Human Translation: Real World Status Quo

Issues with MT & HT Evaluation

● Evaluation vs. Similarity
• Ngram does work? Why?

● Reference Translations:
• Cost & Availability
• Multiples – which
• "Axiomatic Truth"

● Judging
• Expensive
• Questionable results

● Using MT-eval methods: limitations just mentioned
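The "Evaluation vs. Similarity" point can be made concrete with a small sketch of n-gram overlap scoring. This is only an illustration, not the method used in the talk: the function names, the bigram setting, the reference sentence, and the paraphrase are all assumptions chosen for the example. T5 is taken from the quiz slide.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def ngram_precision(candidate, reference, n=2):
    """Share of candidate n-grams that also occur in the reference.

    Counts are clipped, so an n-gram repeated in the candidate is
    credited at most as often as it appears in the reference.
    """
    cand = Counter(ngrams(candidate.lower().split(), n))
    ref = Counter(ngrams(reference.lower().split(), n))
    total = sum(cand.values())
    if total == 0:
        return 0.0
    matched = sum(min(count, ref[gram]) for gram, count in cand.items())
    return matched / total


# Hypothetical reference translation (an assumption, not from the slides).
reference = "Unique amusement park for young and old"

# T5 from the quiz slide; it leaves the German word "Klein" untranslated.
t5 = "Unique amusement park for great and Klein"

# A hypothetical free paraphrase that a human reader might well accept.
paraphrase = "A one-of-a-kind theme park for all ages"

score_t5 = ngram_precision(t5, reference)
score_paraphrase = ngram_precision(paraphrase, reference)
```

Under these assumptions the flawed T5 scores higher than the acceptable paraphrase, because the score measures closeness to one particular reference rather than translation quality. Choosing a different reference would change the ranking, which is the "Axiomatic Truth" problem listed above.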

Page 13: Modern MT Systems and the Myth of Human Translation: Real World Status Quo

Mission Impossible?

● Fully automatic evaluation method for both MT & HT, with no human intervention?

● Purpose: Automatic QA of translations – at least safe rejection of bad results

● Part of an iterative process (with faith in the translator)

Page 14: Modern MT Systems and the Myth of Human Translation: Real World Status Quo

We need it – should we give up?

Page 15: Modern MT Systems and the Myth of Human Translation: Real World Status Quo

Let's Try Anyway!

Extract Information

● Text Metrics
• Length
• Word/Sentence/Paragraph count

● Statistics
• Character/Word occurrence
• Ngram
• Collocations

● Translator Parameters

Reference Data

● Monolingual Corpora for SL & TL
• Statistical reference

● Dictionaries & Thesauri
• Adequacy check
• Translation distance
• Sentence Alignment

● Parallel Corpora
• Translation Length Ratio
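Two of the listed ingredients, simple text metrics and the translation length ratio, can be sketched as follows. This is a minimal illustration under stated assumptions, not the framework itself: the function names and the ratio bounds `low` and `high` are hypothetical, and in practice the bounds would presumably be estimated from parallel corpora for the given language pair.

```python
import re


def text_metrics(text):
    """Basic text metrics: character, word, sentence and paragraph counts."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    paragraphs = [p for p in text.split("\n\n") if p.strip()]
    return {
        "chars": len(text),
        "words": len(text.split()),
        "sentences": len(sentences),
        "paragraphs": len(paragraphs),
    }


def length_ratio_ok(source, target, low=0.7, high=1.5):
    """Check that target/source character length lies within [low, high].

    The default bounds are placeholders (an assumption), not measured values.
    """
    if len(source) == 0:
        return False
    ratio = len(target) / len(source)
    return low <= ratio <= high


source = "Einzigartiger Freizeitpark für Groß und Klein"
t6 = "Unique leisure park for big and little"  # T6 from the quiz slide
truncated = "Unique"                           # hypothetical broken output

accept_t6 = length_ratio_ok(source, t6)
accept_truncated = length_ratio_ok(source, truncated)
```

A check like this cannot confirm that a translation is good; it can only flag output whose length is implausible for the source, which matches the goal of safe rejection of bad results.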

Page 16: Modern MT Systems and the Myth of Human Translation: Real World Status Quo

Workflow

Page 17: Modern MT Systems and the Myth of Human Translation: Real World Status Quo

Conclusion

● Translation results of the best contemporary MT systems can be considered on par with the average HT

● The presented evaluation framework is just the beginning of an automatic evaluation method for both MT & HT

● It is a robust and reliable validation method with safe rejection of invalid/bad translations

● In production Q1/2005

Page 18: Modern MT Systems and the Myth of Human Translation: Real World Status Quo

Thanks!

Q & A