Speech Translation: Past, Present, and Future, by Mark Seligman, Spoken Translation, Inc.

Post on 19-Jan-2017

Slide 1

Speech Translation: Past, Present, and Future

Mark Seligman, Spoken Translation, Inc.

Past

In the beginning

NEC demo, ITU Telecom World (1983)
Chat: Uni-verse, Amikai, Compuserve, Global-Link; (later) Ortsbo
C-STAR(-2) consortium (1991?)
  ATR trilateral demo (1993)
  English, French, Japanese, Italian, Korean, German, Chinese
First unrestricted SLT (1997, 1998)
  GETA group (Seligman)/Compuserve Information Transcript Project (1995, 1996)
  MIT
  Biennale d'Art Contemporain, Lyon
  IBM Voice Type
  Global-Link translation software
  Mac text-to-speech
  CU-SeeMe, video, audio
Robert Palmquist
  Mobile device (1993)
  English-Spanish, Office of Naval Research (1997)
  SpeechGear launch (2001)
  Mobile device for Japanese-English (2003)
  Interpreter mobile phone (2004)
  Compadre: Interact 4.0, English and 35 other languages (2009)
Talk&Translate

(Still) In the beginning

Verbmobil (Wolfgang Wahlster)
  German Federal Ministry of Research and Technology (1993-2000)
Linguatec (>Lingenio)
  First boxed product (1998)
More Japanese
  NEC mobile device with Japanese-English (2006)
  ATR-Trek, Shabette Honyaku, mobile phone (2007)
Apps
  Google Translate (2006-2007)
  Jibbigo, first offline (2009)
Healthcare system pilots
  Spoken Translation, Inc., Converser for Healthcare 3.0 (2011)
  Sehda (>Fluential) (2009?)
Phone-based
  SpeechTrans (multi-lingual conferences) (2014)
  Lexifone (2014)

Converser for Healthcare 3.0: Kaiser Permanente Pilot

Three departments at San Francisco Medical Center
Pharmacy:
  Consulting or Drop-off use case
    Shortcuts: Consultation: Typhoid Vaccine
  Pickup use case
  Greeter use case
Inpatient Nursing
  Shortcuts: IV, External Catheter, Pain Assessment
Eye Care
  Shortcuts: Informed Consent for Cataract Surgery

Present

Google, Skype, Waibel

Alex Waibel's Simultaneous Translation

Google Translate's Dynamic OCR Feature

Future

More Platforms
  Watches
    Apple Watch
    SpeechTrans
  Glasses
  Augmented reality
    In 3 years, not 300!
Issues:
  Cloud vs. on-device
  Must be everyday device
  Must be cool!
  Watch Apple!

The Way Forward

Improve statistical MT
  User feedback + machine learning
  More, better data
Knowledge source integration
  Discourse
  Domain
  Prosody
The Return of Semantics
  Interlingua/ontologies
  True (grounded) semantics
New tech
  Deep neural networks
  Other AI

Improve Statistical MT

User feedback (over) + machine learning
More, better data
Parsing > hybrid MT

User Feedback

Verification and correction of translation
  Reliable Retranslation
  Meaning Cues

Knowledge Source Integration

Think: IBM's Watson or Verbmobil (till 2000)
Nine Issues in Speech Translation
Discourse
  Speech acts
  Topic tracking
Domain
Prosody
  Pauses
  Pitch, stress
Translation mismatches
System architecture, data structures

The Return of Semantics: Interlingua/Ontologies

[Figure: a "Syntactic structure" panel (nodes NP, VP, PP, V, N) beside a "Semantic structure" panel linking "drive", "person", and "car" through mod, agt, and obj relations]

[Figure: real-time voice/text machine translation between English voice and text and French voice and text. Example: "Do you come here often?" / "Est-ce que tu viens souvent ici?"]

Google Knowledge Graph

IBM Watson's Knowledge Graph

The Return of Semantics: Grounded Semantics

New Tech

MT via deep neural networks
  Google DeepMind
  Kyunghyun Cho (NYU)
Other AI
  Kurzweil's brain project
  Microsoft Research
Issue: Is black box sufficient?

Contact

Mark Seligman

Sendoff