Google’s Multilingual Neural Machine Translation System:
Enabling Zero-Shot Translation

Hanock Kwak
2016-11-22 (Tue.), BI Lab
Seoul National University
Abstract
• A simple, elegant solution for translating between multiple languages.
• Introduces an artificial token at the beginning of the input sentence to specify the required target language.
• The rest of the model is shared across all languages.
• A single multilingual model surpasses state-of-the-art results on the WMT’14 and WMT’15 benchmarks.
• Transfer learning and zero-shot translation are possible.
Johnson, Melvin, et al. "Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation." arXiv preprint arXiv:1611.04558 (2016).
Key Features
• Simplicity
  • The model is the same for all languages.
  • Any new data is simply added.
• Low-resource language improvements
  • All parameters are implicitly shared by all the language pairs.
  • This forces the model to generalize across language boundaries during training.
• Zero-shot translation
  • The model implicitly learns to translate between language pairs it has never seen.
  • e.g., train Portuguese→English and English→Spanish; the model can then generate Portuguese→Spanish (see the sketch after this list).
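The entire trick lives in a preprocessing step. Below is a minimal sketch of it; the `<2xx>` token format follows the paper's convention, but the corpus, the sentences, and the `add_target_token` helper are illustrative inventions.

```python
# A minimal sketch of the artificial target-language token, assuming a
# hypothetical parallel corpus of (source, target-language, target) triples.
# The only change to the data is the prepended token; the model is unchanged.

def add_target_token(source_sentence: str, target_lang: str) -> str:
    """Prepend an artificial token telling the model which language to emit."""
    return f"<2{target_lang}> {source_sentence}"

# Training examples from different language pairs are simply mixed together:
corpus = [
    ("Hello, how are you?", "es", "Hola, ¿cómo estás?"),
    ("Olá, como vai você?", "en", "Hello, how are you?"),
]
for src, tgt_lang, tgt in corpus:
    print(add_target_token(src, tgt_lang), "->", tgt)
# <2es> Hello, how are you? -> Hola, ¿cómo estás?
# <2en> Olá, como vai você? -> Hello, how are you?

# Zero-shot: nothing stops us from requesting a pair never seen in training,
# e.g. Portuguese -> Spanish:
print(add_target_token("Olá, como vai você?", "es"))
```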
Evolution of Neural Machine Translation
• We'll start with a traditional encoder-decoder machine translation model and keep evolving it until it matches GNMT.
http://smerity.com/articles/2016/google_nmt_arch.html
V1: Encoder-decoder
• The encoder spits out a hidden state.
• This hidden state is then supplied to the decoder, which generates the sentence in language B (a sketch follows).
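A minimal PyTorch sketch of this V1 idea, not GNMT itself; the `EncoderDecoder` class, vocabulary sizes, and dimensions are all illustrative.

```python
# V1 sketch: the encoder compresses the whole source sentence into a single
# hidden state, which seeds the decoder.
import torch
import torch.nn as nn

class EncoderDecoder(nn.Module):
    def __init__(self, src_vocab=1000, tgt_vocab=1000, dim=256):
        super().__init__()
        self.src_emb = nn.Embedding(src_vocab, dim)
        self.tgt_emb = nn.Embedding(tgt_vocab, dim)
        self.encoder = nn.LSTM(dim, dim, batch_first=True)
        self.decoder = nn.LSTM(dim, dim, batch_first=True)
        self.out = nn.Linear(dim, tgt_vocab)

    def forward(self, src_ids, tgt_ids):
        # Encode: only the final (hidden, cell) state is kept.
        _, state = self.encoder(self.src_emb(src_ids))
        # Decode: this single fixed-size state is the decoder's entire view
        # of the source sentence -- the bottleneck that attention removes.
        dec_out, _ = self.decoder(self.tgt_emb(tgt_ids), state)
        return self.out(dec_out)  # logits over the target vocabulary

model = EncoderDecoder()
src = torch.randint(0, 1000, (2, 7))  # batch of 2 source sentences
tgt = torch.randint(0, 1000, (2, 5))  # teacher-forced target prefixes
print(model(src, tgt).shape)          # torch.Size([2, 5, 1000])
```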
V2: Attention based encoder-decoder
• The decoder queries each encoder output, asking how relevant it is to the current computation on the decoder side (sketched below).
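A sketch of that query step, using dot-product scoring for brevity (the original attention of Bahdanau et al. uses a small additive network, but the mechanics are the same); the `attend` helper is a hypothetical name.

```python
# Attention sketch: score every encoder output against the current decoder
# state, then take a relevance-weighted average as the context vector.
import torch
import torch.nn.functional as F

def attend(decoder_state, encoder_outputs):
    """decoder_state: (batch, dim); encoder_outputs: (batch, src_len, dim)."""
    scores = torch.bmm(encoder_outputs, decoder_state.unsqueeze(2))  # (b, src_len, 1)
    weights = F.softmax(scores, dim=1)        # relevance of each source position
    context = (weights * encoder_outputs).sum(dim=1)                 # (b, dim)
    return context, weights.squeeze(2)

enc = torch.randn(2, 7, 256)   # 7 source positions
dec = torch.randn(2, 256)      # current decoder state
context, weights = attend(dec, enc)
print(context.shape, weights.shape)  # torch.Size([2, 256]) torch.Size([2, 7])
```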
V3: Bi-directional encoder layer
• We would like the annotation of each word to summarize not only the preceding words, but also the following words (sketched below).
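A one-layer sketch: running the recurrence in both directions and concatenating the two passes gives each position an annotation covering its full context. The shapes here are illustrative.

```python
# Bidirectional encoder sketch: each position's annotation concatenates a
# forward pass (preceding words) and a backward pass (following words).
import torch
import torch.nn as nn

emb = torch.randn(2, 7, 256)  # embedded source batch: 2 sentences, 7 words
bi_encoder = nn.LSTM(256, 256, batch_first=True, bidirectional=True)
annotations, _ = bi_encoder(emb)
print(annotations.shape)  # torch.Size([2, 7, 512]) -- forward + backward halves
```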
V4: "The deep is for deep learning"
V5: Parallelization
• To begin computation at a node, all of the nodes pointing to it must already have been computed.
• Since the cell at layer 𝑖 + 1 and time step 𝑡 depends only on layer 𝑖's output at step 𝑡 and its own state at step 𝑡 − 1, layer 𝑖 + 1 can start its computation before layer 𝑖 is fully finished (see the schedule sketch below).
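A small sketch of the resulting schedule, assuming each cell takes one time unit: the start times form a diagonal wavefront, so a layer begins one step after the layer below it rather than after the whole sentence. The `earliest_start` helper is a hypothetical illustration.

```python
# Dependency sketch: cell (layer l, time t) needs only (l-1, t) and (l, t-1),
# so with unit-time cells its earliest start is l + t -- a diagonal wavefront.
LAYERS, STEPS = 4, 6

def earliest_start(layer: int, t: int) -> int:
    # Each cell can fire one tick after its latest prerequisite finishes.
    return layer + t

for l in range(LAYERS):
    print(f"layer {l}:", [earliest_start(l, t) for t in range(STEPS)])
# layer 0: [0, 1, 2, 3, 4, 5]
# layer 1: [1, 2, 3, 4, 5, 6]  <- starts at tick 1, long before layer 0 ends
# layer 2: [2, 3, 4, 5, 6, 7]
# layer 3: [3, 4, 5, 6, 7, 8]
```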
V6: Residuals are the new hotness
• One solution for vanishing gradients is residual networks.
• The idea: the layer's input is added to its output, so the layer only has to learn a correction to the identity function, and gradients gain a direct path to earlier layers (sketched below).
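A minimal sketch of a residual recurrent layer of the kind stacked in deep NMT encoders/decoders; the `ResidualLSTM` wrapper and its dimensions are illustrative.

```python
# Residual sketch: an identity shortcut is added around the LSTM layer, so
# the layer learns a residual on top of the identity function.
import torch
import torch.nn as nn

class ResidualLSTM(nn.Module):
    def __init__(self, dim=256):
        super().__init__()
        self.lstm = nn.LSTM(dim, dim, batch_first=True)

    def forward(self, x):
        out, _ = self.lstm(x)
        return out + x  # identity shortcut around the layer

x = torch.randn(2, 7, 256)
print(ResidualLSTM()(x).shape)  # torch.Size([2, 7, 256])
```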
Visualization
• A t-SNE projection of the embeddings of 74 semantically identical sentences translated across all 6 possible directions (a sketch of how such a projection is computed follows).
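As a hedged sketch of how such a projection can be produced: it assumes one context vector per (sentence, translation direction) extracted from the trained model; random vectors stand in for them here, and the embedding dimension is a guess.

```python
# t-SNE sketch: project high-dimensional sentence embeddings to 2-D so that
# semantically identical sentences can be inspected for clustering.
import numpy as np
from sklearn.manifold import TSNE

rng = np.random.default_rng(0)
embeddings = rng.normal(size=(74 * 6, 1024))  # 74 sentences x 6 directions
points = TSNE(n_components=2, perplexity=30, random_state=0).fit_transform(embeddings)
print(points.shape)  # (444, 2) -- one 2-D point per translated sentence
```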
Source Language Code-Switching
• Mixing Japanese and Korean in the source in many cases still produces correct English translations.
Weighted Target Language Selection
• We test what happens when we mix target languages by feeding a weighted combination of two target-language tokens, e.g. shifting weight from <2ja> toward <2ko> (sketched below).
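A sketch of one way to implement that mixing, assuming access to the model's embedding table: interpolate the embeddings of the two artificial target tokens and feed the result in place of a single token's embedding. The token ids, dimensions, and the `mixed_target_token` helper are hypothetical.

```python
# Weighted target-language sketch: linearly interpolate the embeddings of
# two target-language tokens instead of picking one.
import torch
import torch.nn as nn

embedding = nn.Embedding(1000, 256)  # stands in for the model's embedding table
JA, KO = 7, 8                        # hypothetical ids of <2ja> and <2ko>

def mixed_target_token(w: float) -> torch.Tensor:
    """w=0 -> pure <2ja>; w=1 -> pure <2ko>; values in between mix the two."""
    e_ja, e_ko = embedding(torch.tensor([JA, KO]))
    return (1 - w) * e_ja + w * e_ko

print(mixed_target_token(0.4).shape)  # torch.Size([256])
```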
Big Picture