Pre:D: Say What You Mean

Post on 14-Apr-2017

Pre:D stands for “predisambiguation” – finding the exact meanings of terms in the original language, before translation, to select the right vocabulary in other languages.

disambiguation: figuring out the right sense of a term that has multiple meanings

When you say “seal”, do you mean:

or

Nicole: We got a letter from Champittet!

Tati (Daddy): Did you read that word, or did you recognize the seal?

Nicole: Tati, that’s not a seal, it’s a dolphin!

While reading a website in French, I came across a word I didn’t know.

I clicked the button for Google to translate.

Because my Google was set to Swahili, this is the result I saw.

I was mystified, because “plagi” is an electrical plug or outlet.

This picture from Wiktionary shows the idea that translates to Swahili as “plagi”.

Why was Google telling me that Innovaud will help Kamusi establish totally new electrical sockets?

Switching the translation language to English, you can see what happened.

The French word was translated as “outlet”.

“Outlet” is a fine suggestion.

However, “outlet” has a lot of different meanings in English.

Those different meanings translate to a lot of different terms in French.

Those different meanings also translate to a lot of different terms in Swahili.

And the different meanings of “outlet” will be translated to a variety of different terms within each other language.

Look up “outlet” in an English-Italian dictionary: “presa” is the top result.

That makes sense. “Electrical outlet” is statistically your most likely choice.

“Commercial outlet” is at the bottom of the list.

Here is Google saying in Italian that Innovaud will help establish completely new electrical sockets.

But I didn’t go to Innovaud.ch because we have a new electrical socket.

I went because Kamusi has a completely new approach to inter-language communication, which will revolutionize the translation industry when we bring it to market.

To translate something, a source document must be analyzed to find the possible meanings of a term in its original language.

The problem with current Machine Translation (MT) is that meaning is selected based on estimated frequency across millions of known uses.

If “outlet” is electrical 70% of the time in the literature, that sense is often a good bet.
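The frequency-based selection described here can be illustrated with a minimal sketch. It is not how any particular MT engine works; it only shows the "always pick the most common sense" idea. The sense labels and the counts other than the 70% electrical share are invented for illustration.

```python
from collections import Counter

# Hypothetical sense counts for "outlet" in some corpus. Only the 70% share
# for the electrical sense echoes the text; the other figures are made up.
SENSE_COUNTS = Counter({
    "electrical socket": 70,
    "retail store": 20,
    "means of release": 10,
})


def most_frequent_sense(counts: Counter) -> str:
    """Return the sense with the highest count, ignoring all context."""
    sense, _ = counts.most_common(1)[0]
    return sense


def expected_accuracy(counts: Counter) -> float:
    """Share of occurrences for which the most frequent sense is correct."""
    return max(counts.values()) / sum(counts.values())


guess = most_frequent_sense(SENSE_COUNTS)
accuracy = expected_accuracy(SENSE_COUNTS)
print(guess, accuracy)
```

Under these assumed counts the guess is a good bet overall, yet it is wrong for every sentence about a shop, which is exactly what happened with the Innovaud page.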

Look at Google’s prediction graphs for their translation of “the spring in her step”. You can see how good bets end badly.

To the right you can see Google’s estimated likelihoods for “spring” and “step” in French.

Below, you can see those probabilities in action.

In state-of-the-art statistical translation, the machine makes its best guess.

It waves a magic wand, and…

Presto: PRESA, PLAGI, electrical outlet.

At Kamusi, we know what the possible meanings are, for “outlet” and 100,000 other English words.

We have separated out the translation options for over 200,000 specific meanings, with more than 10 million links among 1.2 million terms in twenty languages, and counting.

Pre:D will show you all the definitions that match the word in your source language, sentence by sentence. We’ll rank your options statistically, but you’ll be the final judge.

You choose your meaning.

Even if the expression is multiple words, like “spring in her step”. Even if a word is a grammatical form, like “drove” and “driving”. Even if the expression has multiple words and grammatical forms, separated by many other words, like “drives [everybody she meets] crazy”.
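As a rough sketch of what matching such expressions involves, the code below reduces inflected forms to lemmas and then looks for the parts of an expression in order, allowing other words in between. The lemma table, the `find_expression` helper, and the `max_gap` parameter are all hypothetical; this is not Kamusi's matcher.

```python
import re

# Tiny hand-written lemma table, purely illustrative; a real system would use
# a full morphological analyzer for each language.
LEMMAS = {"drives": "drive", "drove": "drive", "driving": "drive", "driven": "drive"}


def lemmatize(sentence: str) -> list[str]:
    """Lowercase, split into word tokens, and map known forms to lemmas."""
    tokens = re.findall(r"[a-z']+", sentence.lower())
    return [LEMMAS.get(tok, tok) for tok in tokens]


def find_expression(sentence: str, expression: list[str], max_gap: int = 5):
    """Return token indices matching the expression's lemmas in order.

    Up to max_gap unrelated tokens may sit between consecutive parts.
    Returns None when the expression is not found.
    """
    lemmas = lemmatize(sentence)
    for start, lemma in enumerate(lemmas):
        if lemma != expression[0]:
            continue
        positions = [start]
        for part in expression[1:]:
            stop = min(len(lemmas), positions[-1] + 2 + max_gap)
            window = range(positions[-1] + 1, stop)
            hit = next((i for i in window if lemmas[i] == part), None)
            if hit is None:
                break
            positions.append(hit)
        else:
            # Every part of the expression was found in order.
            return positions
    return None


match = find_expression("She drives everybody she meets crazy", ["drive", "crazy"])
print(match)
```

The search is greedy and takes the first hit for each part without backtracking, which is enough to show the idea but would miss some matches in longer sentences.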

We give you the matching concept in your translation language. English, French, Swahili, Finnish, Thai… any language we’ve got, and we’re adding languages all the time.

For example:

◎ early (for a meeting)

◉ early (in the day)

◎ early (in formative stages)

Kamusi gives you translations that have already been mapped by human intelligence.
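The workflow of ranking senses, letting the user choose, and then looking up human-mapped translations for the chosen concept can be sketched as follows, using the earlier “outlet” case. The `Sense` class, the concept identifiers, the frequencies, and the translation table are assumptions made for illustration, not Kamusi's data model; only “presa” and “plagi” come from the text above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Sense:
    concept_id: str   # hypothetical identifier, not a real Kamusi key
    gloss: str        # short definition shown to the user
    frequency: float  # illustrative share of uses, for ranking only


# Illustrative senses of "outlet"; the frequencies are invented.
OUTLET_SENSES = [
    Sense("outlet.retail", "shop that sells goods", 0.2),
    Sense("outlet.electrical", "wall socket for a plug", 0.7),
]

# Placeholder human-mapped translations keyed by (concept, language code).
TRANSLATIONS = {
    ("outlet.electrical", "it"): "presa",
    ("outlet.electrical", "sw"): "plagi",
    ("outlet.retail", "it"): "punto vendita",
    ("outlet.retail", "fr"): "point de vente",
}


def ranked_options(senses):
    """Order senses from most to least frequent; the user still decides."""
    return sorted(senses, key=lambda s: s.frequency, reverse=True)


def translate(sense, languages):
    """Look up the chosen concept in each language that has a mapping."""
    return {
        lang: TRANSLATIONS[(sense.concept_id, lang)]
        for lang in languages
        if (sense.concept_id, lang) in TRANSLATIONS
    }


options = ranked_options(OUTLET_SENSES)
chosen = options[1]  # the user overrides the top-ranked electrical sense
result = translate(chosen, ["it", "fr", "sw"])
print(result)
```

Because the lookup is keyed by concept and not by the English word, the statistical ranking only affects the order of the options, never the translation that follows from the user's choice.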

Pre:D can then integrate with MT software to put those words into well-structured sentences, in one translation language or one hundred.

And if we do not already have the terms you need in the languages you need, we have innovative systems to quickly learn your words from native speakers.

Imagine a company that needs to translate one document to 80 languages.

With Pre:D, they predisambiguate once, and all their translators know the intended meanings of every word in the source document. And each translation team member sees the recommended vocabulary for their target language.

Imagine someone in Lausanne chatting in real time with a business client in Shanghai.

With Pre:D, they select their meaning in French, and they are certain that the client understands their meaning in Chinese.

The Language Services Market:

• $35 billion globally (Common Sense Advisory, 2014)

• 5% annual growth

• Current industry serves major language pairs, particularly with English. Pre:D will offer unparalleled vocabulary accuracy for these translators

• Kamusi technology will also enable thousands of valuable new non-English translation pairs (theoretical maximum for all 7000 world languages is 24,496,500 pairs – the chart on the left shows early parallel-word figures from our Big Data Beta)
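The theoretical maximum quoted in the last bullet is the number of unordered pairs among 7000 languages, n(n-1)/2. A short check, with `unordered_pairs` as a helper name chosen here:

```python
from math import comb


def unordered_pairs(n_languages: int) -> int:
    """Number of distinct language pairs, ignoring direction: n(n-1)/2."""
    return comb(n_languages, 2)


world_pairs = unordered_pairs(7000)
print(f"{world_pairs:,}")
```

The figure counts each pair once; counting French-to-Swahili and Swahili-to-French separately would double it.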

telephone 1916 telephone 2016

translation 2016 translation 2026

Expected Users:

• Multinational organizations – need to translate the same document to many languages

  • One person makes sense of the document

  • Translators make sense in their target languages

• Professional translators – homing in on precise vocabulary, including terminology and idioms

• Language-aware civilians – much more control than Google Translate

• Non-English pairs – thousands of new combinations, eliminates most noise from English pivot

• English <-> underserved languages – Kamusi is gathering new parallel data for 100s of languages

Kamusi Pre:D

No guessing

No magic wands

In one language after another, just one thing that will disrupt the language industry:

The right translation
