corpus based study of relative clauses in hindi & telugu transfer grammar rules for relative...
TRANSCRIPT
Corpus Based study of Relative clauses inHindi & Telugu
Transfer grammar rules for relative clauses
Viswanatha Naidu YIIIT-Hyderabad
SCONLI-3
20-02-2009 SCONLI-3, India 2
OUTLINE
Introduction Hindi Relative clause overview Telugu Relative clause overview Transfer grammar & Rules Examples Conclusion
20-02-2009 SCONLI-3, India 3
Relative clause overview
Restricts or qualify the meaning of the noun in NP
Languages like English explicitly indicate with relative pronouns
Relative pronouns occur in the initial or part of PP or NP
20-02-2009 SCONLI-3, India 4
Types of Relative clause
There are two types of relative clauses
1.Restrictive clause
2.Non-restrictive clause
20-02-2009 SCONLI-3, India 5
Restrictive Non-Restrictive
Helps the hearer/ reader to identify the referent of the noun phrase.
Rel. pronoun can be dropped
• Serves to give hearer an added piece of information identified entity
• Provides additional info.
• Use of Proper nouns
Very often the distinction expressed intonationally,but also orthographically with punctuation marker.
How does he/she know the distinction?
20-02-2009 SCONLI-3, India 6
Restrictive clauseThe young linguist whom I saw in the conferencelives in Hyderabad.
Non-Restrictive clauseChomsky, who arrived for the conferencelives in Cambridge, Massachusetts.
20-02-2009 SCONLI-3, India 7
The accessibility hierarchy defined (comrie, 1977)as
SubjectDirect objectNon-direct object Possessive
Easy to relativize subjects than it is to relativize any of the other positions
20-02-2009 SCONLI-3, India 8
Hindi also has the Restrictive – Non-RestrictiveDistinction.
Restrictive clauseE.g. jo billi mere gara mem hai vaha cUhe se DartA hai Rel. cat.3.sg my house.nom in is that. Rat Abl afraid is
‘The cat in my house is afraid of Rat.’
Non-restrictive clause
Chandrababu Naidu, jo Andhra Pradesh kA mukhya mantrI REL GEN chief.M minister.M
thA Ajkal yahAM hai.Was.sg now a days here is
Chandrababu Naidu who was the CM of AP is here now a days.
20-02-2009 SCONLI-3, India 9
Hindi Relative clauses overview
Two types of relative clauses
1.Correlative clause
2.Participle clause
20-02-2009 SCONLI-3, India 10
Correlative clause | Participle clause
• Have an explicit relative markers.
These markers will be preceded by the nouns
Appear initial position of the clauses
• No explicit markers
• Two types of participle relatives,
1. Present Participle
2. Past Participle
• Modify head noun as relative clause do.
20-02-2009 SCONLI-3, India 11
Correlative clauses
In correlatives any NP position can be relativized.
Instrumental Relativization
jis cAku se maine murgI ko kAtA vaha bahut tej thAREL knife.M with I.nom hen.M. Accu cut that very sharp was.M
The knife with which I cut the hen was very sharp
Like Subjects, objects, Indirect, Oblique including Instruments, locatives etc.
20-02-2009 SCONLI-3, India 12
Present participle & Past participle
• All verbs yield present participle forms,
• which have two functions.1. Adjectival2. Adverbial• Inflect for Person,
Number, Gender.• Appear in the form
(verb +-tA +huA).
• Only a restricted set of verbs yield past participle forms,
• which indicate achievement.• which have two functions.1. Adjectival2. Adverbial• Inflect for Person, Number,
Gender.• Appear in the form (verb +
-A + huA).
20-02-2009 SCONLI-3, India 13
Hindi does not allow to relativize the Instrument, locative, etc. It allows only subjects and objects.
Instrumental Relativization*mere dwArA kelA kAtA huA cAkU le jAo I by Banana cut did. knife.M take away
Take away the knife with which I cut the Banana
Locative Relativization*mere dwArA baitI huI kurcI bahut mAngI hai. I by sit.F did. Chsir very costly. is
The chair in which I sat is costly.
Subject RelativizationdauDA huA laDkA acAnak ruk gayA running was boy suddenly stop. Came.
‘The boy who was running suddenly came to stop’
20-02-2009 SCONLI-3, India 14
Telugu Relative clause overview
Telugu also has two types of relative clauses 1. Correlative clauses2. Participle clauses• But, correlatives are not normal in Telugu
(Bh.Krishnamurthy) used in more formal speech.• My experience of working in corpus gave the same
experience as I hardly found the correlatives eppudu Akalaite appudu tinAli.
whenever. Hungry that+time eat.
More naturally it can be expressed using participles
20-02-2009 SCONLI-3, India 15
Participle Relative clauses
E.g. ADutunna abbAyi bAwunnAdu.play.v.adj.conti boy.Nom good.3.sg.. M
Accessibility hierarchy of Participle Relativizationin Telugu is more frequent, Higher, (oblique forms can also be relativized).
Participles form by means of non-finite construction Do not inflect person, number, gender
The boy, who is playing is good.
20-02-2009 SCONLI-3, India 16
MT ARCHITECTURE
ANALYSIS
GENERATOR
TRANSFER
POS TAGGAERCHUNKER MORPH ANALYZERTOKENIZERPARSERNERWSDEtc.
TRANSFER GRAMMAR1.Lexical grammar2.Structural grammarTRANSILITERATION
Default featuresTL specific featuresWord generatorEtc.
20-02-2009 SCONLI-3, India 17
Transfer grammar
One of the approaches to Machine Translation
Captures ‘structural differences’ between SL & TL
Aim to develop the MT system
20-02-2009 SCONLI-3, India 18
input
output
Removing specific features of grammar in SLBridging the gap between SL & TL
Lexical &
Structural TransferRules
NounsVerbsAdjectivesAdverbsPrepositionsConjunctionsMiscellaneous
Final adjustment by the generator according to theTL features
ORGANIZATION OF TRANSFER GRAMMAR
20-02-2009 SCONLI-3, India 19
Relative clause rules
जो� आँ�धी� कल आँयी� थी� वो� बहुत नु�कसा�नु कर गयी�
నిన్న� వచ్చి�న్న తుఫాన్న చాలా న్నష్ట�ము చేసిం�ది
jo AmdhI kal AyI thI vah bahut nuksAn kar gayIRel. storm.F yesterday come.Pef.F.sg Past.F.sg that much damage.M do go.PerfF.sg
ninna vaccina tufAnu cAlA nasTam cesindi.Yesterday.Nom come.past.verbal.adj storm.Nom much damage did.Non-Mascu.3.sg
‘The storm that raged yesterday did a great deal of damage’
20-02-2009 SCONLI-3, India 20
वो�
बहुत
नु�कसा�नु
कर गयी�
Null
చాలా
న్నష్ట�ము
చేసిం�ది
Till now mapping between main clause-main clause there are no problems except the correlative marker
vo
bahut
nuksAn
kar_gayI
Null
cAlA
nasTam
cesindi
Let’s see the sub-ordinate clause mapping
20-02-2009 SCONLI-3, India 21
जो�
आँ�धी�
कल
आँयी�-थी�
Null
నిన్న�
తుఫాన్న
వచ్చి��ది
Here we can clearly see, there is a gap betweenSL and TL in three aspects.
1. No relative marker in TL2. SL sub-ordinate +Tense into Non-finite form3. Word order change
jo
AndhI
kal
AyI-thI
Null
ninna
tufAnu
vaccindi
20-02-2009 SCONLI-3, India 22
RULES
1. Change of word order according to TL
2. Deleting the relative makers (jo & vo)
3. Converting the finite into non-finite
20-02-2009 SCONLI-3, India 23
जो� आँन्दी�
कल
आँयी� थी�
वो�
बहुत
नु�कसा�नु
करगयी�
Null
నిన్న�
వచ్చి�న్న
తుఫాన్న
Null
చాలా
న్నష్ట�ము
చేసిం�ది
20-02-2009 SCONLI-3, India 24
jo AndhI
kal
AyI_thI
vo
bahut
nuksAn
kargayI
Null
ninna
vaccina
tufAnu
Null
cAlA
nasTam
cesindi
Change of word order according to TL
20-02-2009 SCONLI-3, India 25
jo kal
AyI_thI
AndhI
vo
bahut
nuksAn
kargayI
Null
ninna
vaccina
tufAnu
Null
cAlA
nasTam
cesindi
20-02-2009 SCONLI-3, India 26
जो� कल
आँयी� थी�
आँ�धी�
वो�
बहुत
नु�कसा�नु
करगयी�
Null
నిన్న�
వచ్చి�న్న
తుఫాన్న
Null
చాలా
న్నష్ట�ము
చేసిం�ది
Deleting the relative makers (jo & vo)
20-02-2009 SCONLI-3, India 27
kal
AyI_thI
AndhI
bahut
nuksAn
kargayI
ninna
vaccina
tufAnu
cAlA
nasTam
cesindi
20-02-2009 SCONLI-3, India 28
कल
आँयी� थी�
आँ�धी�
बहुत
नु�कसा�नु
करगयी�
నిన్న�
వచ్చి�న్న
తుఫాన్న
చాలా
న్నష్ట�ము
చేసిం�ది
Converting the finite into non-finite
20-02-2009 SCONLI-3, India 29
kal
Aya+(Past.V.adj)
AndhI
bahut
nuksAn
kargayI
ninna
vaccina
tufAnu
cAlA
nasTam
cesindi
20-02-2009 SCONLI-3, India 30
कल
आँयी�(Past.v.adj)
आँ�धी�
बहुत
नु�कसा�नु
करगयी�
నిన్న�
వచ్చి�న్న
తుఫాన్న
చాలా
న్నష్ట�ము
చేసిం�ది
20-02-2009 SCONLI-3, India 31
REQUIRED RESOURCES
LARGE AMNOUNT OF CORPUS, REPRESENTS ALL DOMAINS BROAD COVERAGE MORPH ANALYZER, GENERATOR
>BROAD COVERAGE E-BILINGUAL LEXICONS
LARGE AMOUNT OF POS & PARSED TREE BANK CORPUS
LARGE AMOUNT OF PARALLEL CORPORA
Apart from, there is an high necessity of well trained LINGUIST for modeling the language (s), and a Computer Scientist for implementing the model (s).