running head: gradient symbols in code mixing matthew
TRANSCRIPT
![Page 1: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/1.jpg)
1
RunningHead:Gradientsymbolsincodemixing
Title:Coactivationinbilingualgrammars:Acomputationalaccountofcodemixing*
MatthewGoldrick1,MichaelPutnam2,&LaraSchwarz21NorthwesternUniversity2PennsylvaniaStateUniversity
*WegratefullyacknowledgeMattCarlson,MaríadelCarmenParfaitaCouto,BrianHok-
ShingChan,MargaretDeuchar,JaneGrimshaw,GéraldineLegendre,JohnLipski,Akira
Omaki,ShanaPoplack,LilianaSánchez,PaulSmolensky,ColinWilson,andMasayaYoshida
forhelpfulcommentsanddiscussion.ThisresearchwassupportedbyNSFgrant
BCS1344269.
Addressforcorrespondence:
MatthewGoldrick
DepartmentofLinguistics
NorthwesternUniversity
2016SheridanRd.
Evanston,IL60208USA
Keywords:Codemixing,GradientSymbolicComputation,doublingconstructions
![Page 2: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/2.jpg)
2
Abstract
Alargebodyofresearchintobilingualismhasrevealedthatlanguageprocessingis
fundamentallynon-selective;thereissimultaneous,gradedco-activationofmental
representationsfrombothofthespeakers’languages.Anequallydeeptraditionofresearch
intocodeswitching/mixinghasrevealedtheimportantrolethatgrammaticalprinciples
playindeterminingthenatureofbilingualspeech.Weproposetointegratethesetwo
traditionswithintheformalismofGradientSymbolicComputation.Thisallowsusto
formalizetheintegrationofgrammaticalprincipleswithgradientmentalrepresentations.
Weapplythisframeworktocodemixingconstructionswhereanelementofanintended
utteranceappearsinbothlanguageswithinasingleutteranceanddiscussthedirectionsit
suggestsforfutureresearch.
![Page 3: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/3.jpg)
3
CoactivationinBilingualGrammars:
AComputationalAccountofCodeMixing
Oneofthemoreamazingfeatsofbilinguallanguageproductionisthefluent
integrationoftwolanguageswithinasingleutterance.Werefertothisphenomenonas
codemixingtoemphasizetheintegrationoftwolinguisticsystems,usingthis
synonymouslywithtermssuchasintra-sententialcodeswitching.Anextensivebodyof
researchhasidentifiedimportantrolesforthegrammaticalprinciplesofthesource
languagesinconstrainingcodemixing(see,e.g.,Deuchar,2005;Muysken,1995;Myers-
Scotton&Jake,1995;Poplack,1980,forreviews).Paralleltothislineofresearch,several
decadesofresearchhasprovidedawealthofevidencesuggestingthatbilinguals
simultaneouslyco-activateelementsfromeachlanguageduringproduction.Forexample,
whenintendingtonameapictureofadog,aSpanish-Englishbilingualwillsimultaneously
activate,tovaryingdegrees,representationscorrespondingtoEnglish(DOG)andSpanish
(PERRO)forms(seee.g.,Kroll&Gollan,2014,forareview).Thissuggeststhatmental
representationsinbilingualspeakersincorporateblendsofstructuresfromeachlanguage
(i.e.,notDOGorPERRO,butarepresentationthatisbothDOGandPERRO).
Giventhestrengthoftheevidenceforgrammaticalprinciplesaswellasblend
representations,wearguebelowthatanadequatetheoryofbilinguallinguisticcognition
mustbeabletoincorporatebothoftheseelements.Discretegrammaticalprinciplesmust
beintegratedwithgradientblendrepresentations;currently,noexistingframeworkdoes
so.Inthiswork,weproposesuchanintegrationusingtheGradientSymbolicComputation
framework(GSC;Smolensky,Goldrick,&Mathis,2014).Thisgrammar-basedformalism
incorporatessymbolicrepresentations,whoseelementsareassociatedwithcontinuous
![Page 4: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/4.jpg)
4
activationvalues.WeshowhowaGradientSymbolicapproachtocodemixingcanallowus
toaccountforgrammaticalconstraintsonblendedrepresentationsthatemergeincode
mixing.
Webeginbyreviewingtheevidenceforblendrepresentationsinbilingualsacrossa
varietyofprocessingcontexts.Tohighlighttheinteractionofblendrepresentationsand
grammaticalprinciples,wethenexamineindetailcodemixingproductionswhereblended
elementsareovertlyproduced—anelementoftheutteranceisdoubled,appearinginboth
languageswithinasingleutterance.Withtheseempiricaldatainmind,wedevelopaGSC
accountofcodemixing.Wedemonstratehowitaccountsforempiricallyobserved
restrictionsondoubling,anddiscussthefutureresearchdirectionsitsuggests.
Whileourfocusisontheinteractionofgrammaticalprinciplesandthegradient
representationalstructure,itisimportanttonotethatmanyotherfactorscontributeto
codemixing.Inparticular,sociolinguisticfactorsplayanimportantroleinlanguagechoice
andbilingualidentity(foranoverview,seeGardner-Chloros,2009).Whiletheseare
outsidethescopeofthiscurrentwork,theydefineanimportantavenueforfuture
developmentofourapproach.
BlendRepresentationsinContextswithoutCodeMixing
Blendrepresentations
Manypsycholinguistictheoriesareframedwithinaspreading-activationor
connectionistperspective(Rumelhart,Hinton,&McClelland,1986;seeGoldrick,2012,fora
recentreview).Insuchtheories,mentalrepresentationsaregraded,distributedpatternsof
activation,anumericalquantityassociatedwithsimpleprocessingunits.Thisallowsfor
![Page 5: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/5.jpg)
5
blends:representationalstatesinwhichmultiplerepresentationalelementsoccupy(to
varyingdegrees)asinglepositionwithinalinguisticstructure.
Forexample,supposeanativeSpanishspeakerisproducingasentenceinEnglish:
“YesterdayIwenttotheparktowalkmydog.”Whileplanningthisutterance—in
particular,whileretrievingtheappropriatefinalnounfrommemory—many
psycholinguistictheoriesofbilingualismassumethatthespeaker’sproductionsystem
entersthestateshowninFigure1(see,e.g.,Kroll&Gollan,2014,forareviewofsuch
proposals).Inthisnetwork,therearethreetypesofrepresentationalunits.Theinputtothe
systemconsistsofsemanticfeaturesalongwitharepresentationoftheintendedlanguage
ofresponse.Activationspreadsfromtheseunitstoasetofunitscorrespondingtolexical
items(e.g.,‘lemmas’).Figure1Ashowstheflowofactivationthroughtheconnectionist
network;Figure1Bprovidesanalternativeviewofthedistributionofactivationoverthe
lexicalitems.
![Page 6: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/6.jpg)
6
Figure1.A.DepictionofpsycholinguisticprocessingmodelduringproductionofDOG.
Thicknessofcircledenotesrelativeactivationofunit.B.Alternativedepictionofthestate
ofthissystem,focusingongradientactivationatthelexicallevel.
Insucharepresentation,theintentiontoproduceasinglelexicalitem(asinglenoun
inthephrase‘my___’)resultsinthesimultaneousco-activationofmultiplemental
representations.Lexicalselectionprocessessimultaneouslyconsiderthetarget(DOG),
semanticallyrelatedwordswithinthesamelanguage(CAT),andnon-targetlanguage
words(PERRO).ThiscanbeseeninFigure1A+B,wheremultiplerepresentational
elementshavevaryingnon-zeroactivation.Thestateofprocessingasinglewordisthusa
blendofmultiplelinguisticrepresentations.Thisrepresentationalhypothesisisoften
referredtoasco-activationorparallelactivation.Weusethetermblendtoemphasizethat
themultipleelementsarenotsimplysimultaneouslyactivated;theyareco-presentwithin
!
!
!
!
!
!
!
!
!
A.#
!
!
!
!
!
!
!
B.#
#
#
#
#
#
<furry> <pet> <canine> [ENG]
MESA GATO PERRO DOG CAT TABLE
[SPAN]
Det 0.90 MY 0.10 MI
N 0.90 DOG 0.50 CAT 0.30 PERRO 0.25 GATO 0.10 TABLE 0.05 MESA
NP
![Page 7: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/7.jpg)
7
asinglepositioninthelinguisticrepresentation(e.g.,headofaparticularnounphrase;this
ishighlightedinthedepictioninFigure1B).
Empiricalevidenceforblendrepresentations
Activationbasedrepresentationsdonotrequireblends(onecouldassign0toall
non-targetrepresentationsinFigure1,activatingonlyDOG).Itisalsonotimmediately
clearwhatfunctionalmotivationwouldrequireblendstates.InapurelyEnglishutterance,
whyshouldoneconsiderSpanishwords?Thismakesitallthemorestrikingthata
substantialbodyofevidencesupportssuchblendrepresentations.KrollandGollan(2014)
provideanextensivereviewofevidencefrommultilingualspeakers(seeMelinger,
Branigan,&Pickering,2014,forareviewofevidencefrommonolinguals).Here,we
emphasizeafewkeyrecentstudiesthatprovideevidenceofsuchrepresentationsduring
productionofphrasesandsentences.
Akeypredictionofblendrepresentationsisthatthespreadofactivationwillleadto
thepartialactivationofnon-targetrepresentationsatotherlevelsofprocessing.Following
theexampleabove,whenproducingtargetDOG,thepartialactivationofthelexical
representationPERROispredictedtoleadtopartialactivationofrepresentationsofthe
/p/sound.Incontrast,whenproducingCAT,thePERROrepresentationshouldbeless
active,resultinginlessactivationofthe/p/sound.Consistentwiththisprediction,many
studieshavedemonstratedthatproductionisfacilitatedwhenthereisaphonological
relationshipbetweenthetargetutteranceandnon-targettranslationequivalents.
Spalek,Hoshino,Wu,Damian,andThierry(2014)examinedGerman-English
bilingualsproducingadjective-nounphrases.Forbothbehavioralandelectrophysiological
measures,theyfoundthatsecond-languageEnglishproductionwasfacilitatedwhenthe
![Page 8: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/8.jpg)
8
Englishadjectivesharedphonologicalstructurewiththenoun’sGermantranslation
equivalent(e.g.,“blueflower:”bluesharestheinitialsoundsoftheGermantranslation
equivalentBlume;contrastwith“greenskirt:”greensharesnosoundswiththeGerman
translationequivalentRock).Whilesignificant,effectsofnon-dominantL2Englishon
productionofdominantL1Germanphrasesweresmaller,limitedtoelectrophysiological
measures.Theseresultsareconsistentwiththepresenceofblendrepresentations,but
suggestthatthedegreetowhichnon-targetrepresentationsarepresentinblendsis
modulatedbytherelativestrengthofeachlanguage(suchthatBlumeismoreactiveduring
processingofflowerthanviceversa).
Anothersetofstudiesarguingforco-activationofmultiplerepresentationshas
comparedtheproductionoftargetsthatsharephonologicalstructurewiththeirtranslation
equivalent(e.g.,EnglishANCHOR—DutchANKER)tothosewithnooverlap(e.g.,BOTTLE
—FLES).Theformerareoftenreferredtoas‘cognates’inthepsycholinguisticliterature;
however,nohistoricalconnectionbetweenthetranslationequivalentsisrequired.The
logicisthatsimultaneousactivationoflexicalrepresentationsinthetwolanguagesshould
facilitateprocessingofanysharedphonologicalstructure,producingacognatefacilitation
effect(Costa,Caramazza,&Sebastián-Gallés,2000).Forexample,simultaneouslyactivating
ANCHORandANKERwillservetofacilitateretrieval/planningofsharedsegments/ŋ/,/k/,
/ə˞/.Starreveld,DeGroot,Rossmark,andVanHell(2014)recentlydocumentedcognate
facilitationduringsentenceplanning.Dutch-Englishbilingualsreadaloudsentenceswith
anembeddedpicture(e.g.,apictureofaanchorappearedinthepositionoftheblankinthe
sentence“Inthemiddleofthesquarewasan____withathickchainattachedtoit.”).When
producingpicturenamesinL2English,participantsshowedcognatefacilitation.Following
![Page 9: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/9.jpg)
9
thestudyreviewedabove(Spaleketal.,2014)aswellasmanyotherresults,theseeffects
weremuchstrongerinL2thanL1production.Furthermore,cognateeffectswere
modulatedbythesentencecontext.Whenthesentenceplacedgreaterconstraintsonthe
wordthatcouldfitinthespaceoccupiedbythepicture(e.g.,“Popeyethesailormanhasa
tattooofan____onhisarm.”),cognateeffectswerediminished.Aswiththepreceding
study,theseresultssuggestthatwhileblendsarepartoflanguageproduction,thedegree
towhichnon-targetrepresentationsarepresentismodulatednotonlybytherelative
strengthofeachlanguage,butalsobythedegreetowhichcontextsupportstheretrievalof
aspecifictargetword.
Finally,someofthestrongestevidenceforblendrepresentationshascomefrom
studiesthathavedocumentedtheliteralco-productionofmultiplerepresentations.The
simultaneousco-presenceofmultiplelinguisticrepresentationsduringplanningleadsto
thesimultaneousproductionofactionsassociatedwiththeserepresentations.Pyersand
Emmorey(2008)examinedtheoralandmanualproductionsofbimodalbilinguals:native
speakersofaspokenlanguage(English)andamanuallanguage(AmericanSignLanguage;
ASL).Duringconversationswithnon-signers—wherethebimodalbilingualsintendto
speakasingle(oral)language—theysimultaneouslyproducedASLandEnglish
grammaticalmarkers.Atratesmuchhigherthannon-signers(butlowerthanintheirASL
productions),thebimodalbilingualsfurrowedtheirbrowswhileproducingwh-questions
(e.g.,“Howmanysiblingsdoesshehave?”).Thisoccurredinspiteofthefactthatspoken
Englishexplicitlymarkswh-questions(makingdouble-markingunnecessarytoexpressthe
intendedmessage).NotethatthisgestureispragmaticallydispreferredinspokenEnglish,
whereitconveysnegativeaffect.PyersandEmmoreyarguedthatthismodulatedtherate
![Page 10: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/10.jpg)
10
ofco-productions,asco-productionsweremuchhigherforconditionals(e.g.,“Ifitrains,
classwillbecanceled;”associatedwithraisedbrows).Thisprovidesfurtherevidencefor
constraintsonthedegreeofactivationofnon-targetrepresentationsinblends.
Inthebimodalbilingualcase,thetwolanguagesarenotcompetingforexpression
onthesamecommunicationchannel.Moresubtleco-productionscanbefoundduring
productionoftwoorallanguages.Whileco-activationenhancesretrievalofshared
phonologicalstructure,theheightenedactivationofnon-targetlanguagerepresentations
shouldincreasecross-languagephoneticinterference—theintrusionofnon-target
languagephoneticpropertiesintobilingualproductions.Forexample,whileSpanishand
Englishshareacommonsetofvoicingcontrastsininitialstops(e.g.,/b/vs./p/),the
phoneticrealizationofthiscontrastisdistinctineachlanguage(pre-voicedvs.short-lag
voiceonsettimeinSpanish;shortvs.long-laginEnglish).Thisconflictleadsnon-native
speakerstoproducethesesoundswithphoneticpropertiesintermediatebetweenthetwo
languages(Flege,1991).Amengual(2012)showedthatthiscross-languagephonetic
interferenceisenhancedforcognates.Whenreadingsentencesaloud,Spanish-English
bilingualsproducedinitialstopsinSpanishwithmoreEnglish-likepropertiesincognates
vs.non-cognates.NosuchdifferencewasfoundintheproductionsofSpanish-Catalan
bilinguals(wherethetwolanguageshavesimilarphoneticrealizationsofthiscontrast).
Otherresultssuggestthatthesecognateeffectsarenotsimplyword-specificphonetic
patternsinbilingualspeech,butratherreflectdynamicpropertiesofbilingualproduction.
Olson(2013)andGoldrick,Runnqvist,andCosta(2014)foundthatphoneticinterference
wasincreasedwhenparticipantswererequiredtounexpectedlycode-switchduring
picturenaming(vs.trialswhereparticipantsdidnotswitchlanguages).Forvoicelessstops,
![Page 11: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/11.jpg)
11
Goldricketal.foundthatthiscontext-specificphoneticinterferenceeffectisenhanced
duringproductionofsinglecognatevs.non-cognatewords—suggestingthatthecognate
effectreflectsthecontext-specificactivationoftargetandnon-targetlanguage
representations.
Summary:Blendrepresentationsinbilingualproduction
Evenwhenintendingtoproduceasingleforminasinglelanguage,bilinguals
simultaneouslyactivateformsinbothlanguages.Thedegreeofco-presenceinsuchblend
representations,andourabilitytoobservetheeffectsofthisco-presence,isclearly
constrained.Tosomedegreethislikelyreflectsphysicalconstraints.Itisimpossibleto
placeasinglesetoforalarticulatorsintwocontradictorypositions.Inthesecases,the
productionsystemislimitedtoblendedarticulations,reflectingapartialcompromise
betweencontradictoryactions.However,manyoftheotherconstraintsonblendsclearly
reflectabstract,cognitiveprinciples.Evenwhenfreedfromphysicalconstraintsonco-
production,thepropertiesofbimodalbilinguals’blendsaremodulatedby
affective/pragmaticconstraints.Thepropertiesofunimodalbilinguals’blendsreflectthe
relativestrengthofthetwolanguagesandthecontextinwhichatargetwordisbeing
produced.
BlendRepresentationsinCodeMixing
Integrationofgrammaticalprinciplesincodemixing
Giventheevidenceforco-activationofthetwolanguagesincontextswhere
speakersintendtoproduceonlyonelanguage,itisunsurprisingthatco-activationisa
fundamentalpropertyofcodemixing.Critically,speakersarenotonlyutteringlexicalitems
![Page 12: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/12.jpg)
12
frombothlanguagesbutarealsointegratinggrammaticalprinciplesfromeachlinguistic
system.
Theintegrationcanbeseenincross-linguisticsyntacticpriming,whereexposureto
astructureinonelanguageincreasestheprobabilitythatspeakerswilluseasimilar
structureinanotherlanguage(seePickering&Ferreira,2008,forareview).Forexample,
Hartsuiker,Pickering,andVeltkamp(2004)foundthatwhenSpanish-Englishbilinguals
heardapassiveconstructioninSpanish,itincreasedthelikelihoodthattheywould
produceapassivevs.activeconstructioninEnglishonasubsequenttrial.Suchpriming
doesnotonlyaltertheprobabilityofattestedstructures.Incertaincontexts,itcanallow
forthetransferofgrammaticalpatternsfromonelanguagetoanother,reflectingthe
integrationofknowledgeofeachlanguage.Forexample,inmanycontextsSpanishdoesnot
allowforthewordorderadjective-noun,thetypicalwordorderpatternobservedin
English.Hsin,Legendre,andOmaki(2013)foundthatinSpanish-Englishbilingualchildren
primingcouldallowfortransferofthiswordorderfromEnglishtoSpanish.
Suchintegrationalsooccursinthecontextofintra-sententialcodemixing.For
example,Kootstra,vanHell,andDijkstra(2010)elicitedcodemixedutterancesfrom
Dutch-Englishbilinguals.ParticipantsdescribedpicturesbycompletingaDutchsentence
fragmentthatbiasedspeakerstoproduceoneofseveralwordorderspossibleinDutch
(Subject-Verb-Object[SVO],SOV,orVSO).Whencuedtoproduceamixedstructure(i.e.,
usingatleastoneEnglishwordtocompletetheDutchfragment),participantspreferredto
usethewordordercommontobothgrammars(SVO).Asimilarpreferenceforcongruent
grammaticalpatternshasbeenfoundinspontaneousmixingcorpora(forreviewsand
![Page 13: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/13.jpg)
13
discussion,seeDeuchar,2005;Muysken,1995;Myers-Scotton&Jake,1995;Poplack,
1980).
Blendsandco-productionincodemixing
Whenspeakersintendtomixlexicalitemsandgrammaticalprinciplesfromtwo
languages,wealsoobserveblends.Someofthemostdramaticexamplescomefrom
bimodalbilingualcodemixing.ForASL-Englishbilingualsthepredominanttypeofcode
mixingiscodeblending:co-productionoforalandmanualelements(Bishop,2010;
Emmorey,Borinstein,Thompson,&Gollan,2008).Thesecross-modalproductionsare
typicallysemanticallyequivalentandsynchronizedintime.Inexamples(1)and(2),the
Englishglossofthesignproductionisshowninitalicsbeneaththepointinthesentence
wherethesignroughlyoccurred;underliningindicatesthespeechthatco-occurredwith
thatsign.
(1) Andthere’sthebird.(Emmoreyetal.,2008:48)bird
(2) NowIrecentlywentback.(Emmoreyetal.,2008:48)nowIrecentlygo-to
Inunimodalbilingualsthereissubtleevidenceofco-activationinarticulation.
Analyzingaspontaneouscodemixingcorpus,BalukasandKoops(inpress)foundthat
phoneticinterferenceeffectsinSpanish-Englishbilingualsincreaseatpointsclosertocode
switches.Thissuggeststhatco-productionsarenotuniquetobimodalbilinguals.
Blendswithoutco-production:Doublingconstructions
Blendingrepresentationsfromtwodifferentlanguagescanalsoyieldnon-
simultaneousarticulations.Languagesdifferinwordorder,mappingelementstodifferent
positionsinthesurfacestring.Thisraisesthepossibilitythatthegrammaticalprinciples
![Page 14: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/14.jpg)
14
fromeachlanguage—bothactiveduringcodemixing—couldbothbesatisfiedwithout
yieldingsimultaneousarticulation.Forexample,abilingual’sL1hasthewordorderverb-
objectandL2object-verb.Thestringverb(L1)-object-verb(L2)satisfiesthewordorder
constraintsofbothlanguages;theL1verbprecedestheobject,whiletheL2verbfollows
theobject.Althoughsuchstringsmightviolatestructuralconstraintsonlinguistic
representations,theywouldnotsufferfromarticulatoryincompatibility1.
Patternsofattesteddoublingconstructions:Areview
Althoughsuchconstructionsarecommonlydiscussed,onlyafewdetailed
referencesaredevotedexclusivelytothem(Chan,2009;Hicks,2010,2012;Muysken,
2000:104-6).Theymanifestinavarietyofconstituents,althoughpreferenceseemstobe
giventothedoublingoffunctionalelements(syntacticelementsexpressinggrammatical
relationships;e.g.,complementizers,determiners,prepositions,andauxiliaryverbs)over
lexicalitem(e.g.,nouns).Thefollowingexamples2provideanoverviewoftherangeof
doubledstructures.Thedoubledelementsareunderlinedineachexample.
(3) Complementizers:English-Japanese(Azuma1993:199)ifitgoesthreeroundsdattaraneifitgoesthreeroundswasifTAG‘Ifitgoesthreerounds.’
Notethatin(3),ifislocatedinitscanonicalplaceinEnglish(appearingatthestart
ofthedependentclause),andtheJapaneserainitsexpectedlocation(weretheutterance
fullyJapanese,rawouldappearattheendofthedependentclause).Examples(4)-(7)
illustratesimilardoublingforvariousotherelements,respectingthecontrastingword
orders.
![Page 15: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/15.jpg)
15
(4) Adpositions:English-Finnish(Poplack,Wheeler,&Westwood,1989:405)muttaseolikidney-statoaorta-anbutitwaskidney-fromtoaorta-to‘Butitwasfromthekidneytotheaorta.’
(5) Adverbials:English-Tamil(Sankoff,Poplack,&Vannianiarajan,1990:92)
AccordingtotheschedulepaDioNNutaanirukkaNum. accordingtooneonlybemust‘Accordingtotheschedule,theremustbeonlyone.’
(6) Coordinating:conjunctionsSpanish-Aymara(Stolz,1996:146,citingPorterie-
Guierrez,1988:355)perosorro-stiwaliastuturi-tajna...butfox-COOverykeen-3.SG.PRT.EVI‘Butthefoxwasverykeen.’
(7) VerbsEnglish-Tamil(Sankoffetal.,1990:93)
theygavemearesearchgrantkoɖutaatheygavemearesearchgrantgave.3.PL.PAST‘Theygavemearesearchgrant.’
Multi-wordchunkscanbedoubled,asshownin(8)(verb+adverb)and(9)(verb+
complementizer).
(8) Verb+AdverbEnglish-Japanese(Nishimura,1986:139)WeboughtabouttwopoundsguraikattekitanoWeboughtabouttwopoundsaboutboughtTAG‘Weboughtabouttwopounds.’
(9) Verb+ComplementizerEnglish-Korean(Chan,2008:800)
everybodythinkthatnay-kayenge-lulcalhanta-kosayngkakhayyoeverybodythinkCI-NOMEnglish-ACCwelldo-Cthink‘EverybodythinksthatI’magoodEnglishspeaker.’
Whilethesetypesofcodemixingutteranceshavebeenconsistentlydocumentedin
corpora,theyareclearlymarked;ingeneral,thesestructuresarelargelyavoided.Poplack
etal.(1988:405)reportthattheseblendsare“exceedinglyrare,”citingthattheyonly
found2intheirentirecorpus;Furukawa(2008)found7examplesin5hoursof
sociolinguisticinterviewdata.However,Nishumura(1986)andBackus(1992)suggestthat
theseblendsoccurinroughly3-5%oftheircorpusmaterials.Therarityofsuch
![Page 16: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/16.jpg)
16
productionsisunsurprising.Asnotedabove,theintegrationofgrammaticalprinciples
frombothlanguagesyieldsapreferenceforcongruentgrammaticalpatternsincode
mixing.Theexamplesaboveviolatethisprinciple;theyinvolvedoublingofelementsthat
aresubjecttoconflictinggrammaticalpatterns(e.g.,verb-objectvs.object-verbword
order).Furthermore,researchonnon-codemixedproductionssuggeststherearestrong
limitationsonblends;thedegreeofco-presencewithinablendishighlylimited.Ifdoubling
constructionsreflectblendrepresentations,wealsoexpectthemtobestrongly
dispreferred.
Surveyingthereportedinstancesoftheseconstructions,Hicks(2010)identifies
severalcross-linguisticgeneralizations.First,asnotedabove,doubledelementslocally
respectthewordorderofthesourcegrammars.Second,thedoubledelementsaretypically
heads;syntacticelementsthatdefinethesyntacticpropertiesofthephrasetowhichthey
belong.Thedoubledheadsshareanon-doubledcomplement;theothersyntacticelements
thatbelongtothephrase.Forexample,averbphrasecanbecomposedofaverb(thehead)
andanobject(thecomplement).InthedoublingconstructionVL1OL1VL2,doubledverbs
shareanon-doubledobjectcomplement(seealsoFurukawa,2008).Thus,strictlylocal
doubling(e.g.,VL1OL1OL2VL2)istypicallynotobserved.Finally,whilesomelanguages
exhibitdoublinginmonolingualcontexts(discussedfurtherbelow),Hicksnotesthat
doublingofelementsfromthesamelanguage(e.g.,analogousto(7),*gavegrantgave,
*gavegavegrant)arenotobservedduringcodemixing.
Insum,whiledoublingisrare,itisconsistentlyobservedacrossvarioussources;a
varietyofelementsparticipateinintrasententialcodemixingblends.Doublingisnotmere
repetitionofelements;itsoccurrenceisconstrainedbygrammaticalprinciples;doubled
![Page 17: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/17.jpg)
17
headssharenon-doubledcomplements;andequivalenceforgrammaticalfeatures.This
suggeststhatdoublingconstructionsarenotan“adhocproductionstrategy”(Sankoffetal.,
1990:92),butarerathercoherent,syntacticobjectsthataregovernedbygrammatical
principles.
BlendsinGrammaticalTheories:ApplicationtoDoubling
Blendrepresentationsclearlyplayaroleinbilinguallanguageprocessing.In
doublingconstructions,weseethattheseblendrepresentationsinteractwithgrammatical
principles.Howcanthisbeformallyspecified?Inthissection,wedevelopagrammatical
approachtocodemixingthatincorporatesblendedrepresentations.Weapplythisto
doublingconstructions,showinghowitaccountsfortheoccurrenceofdoublingaswellas
theempiricallyattestedconstraintsonthisphenomenon.
Overviewoftheproposal
Ouraccountisbasedaround3generalprinciples.Wefirstprovideanoverviewof
theseandthenexamineinsomedetailofhowtheycanappliedtotheempiricalpatternsof
doublingconstructions.
Principle1:Probabilisticgrammarswithweightedconstraints
Languageuse—inmono-ormulti-linguals—isdefinedinpartbyregularstructural
patterns(e.g.,EnglishrequiresSVO,whileDutchallowsflexibilitybetweenSVO,SOV,and
VSO).Grammarsallowustopreciselyspecifythestructureofthemappingbetweenform
andmeaningthatyieldsthesepatterns.Theformalismweusespecifiesgrammarsthrough
interactionofconstraintsonlinguisticstructure.Forexample,aconstraintonwordorder
mightpreferthatcertainlexicalcategoriesappearattheleftedgeofasyntacticphrase.
Constraintsareassociatedwithnumericalweightsthatdeterminetheirrelative
![Page 18: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/18.jpg)
18
importance;cross-linguisticvariation(e.g.,ifalanguagecategoricallyprefersSVOvs.SOV)
isspecifiedbychangesintherelativeweightingofconstraints.Ourgrammaticalformalism
alsoallowsustospecifynotjustcategoricalpreferences,butalsorelativeprobabilitiesof
differentstructures;thisallowsustocapturevariationinthemappingbetweenmeaning
andformwithinaspeaker(e.g.,variablewordorderinginaDutchspeaker’sproductions,
orvariablestructuresobservedincodeswitching).
Principle2:Gradientblendsofgrammars
Bilingualsspeakershavevaryingdegreesofcompetenceinmultiplegrammars,
allowingthemtoproducedistinctstructuresineachlanguage.Inourformalism,thisis
reflectedbyassociatingeachlanguagewithadistinctweightingofconstraints.These
language-specificweightingscontributetothegrammar,independentlyinfluencingthe
probabilityofdifferentstructures.However,asdiscussedabove,thetwolinguisticsystems
ofbilingualsinteract.Wemodelthisbyalsoincorporatingintothegrammaraweightingof
constraintsthatblendsthelanguage-specificweightings.Thedegreetowhicheach
languagecontributestothisblendreflectstherelativeactivationofthatlinguisticsystem.
Principle3:Gradientblendsinlinguisticrepresentations
Buildingontheconnectionistformalismsthatserveasthefoundationofmany
psycholinguistictheories,weassumethatthereissimultaneouscoactivationof
representationalelementsinboththeinputandoutputofthegrammar.Thisallowsfor
representationsthatblendelementsfrommultiplelanguages.
Inthesectionsbelow,weelaboratethedetailsofthisgrammaticalproposal.Itis
importanttonotethatgrammarsdefinecognitiveprocessesatahighlevelofabstraction—
intermsofmappingbetweeninputsandoutputs.Thisiskeytodevelopingaclearand
![Page 19: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/19.jpg)
19
rigorousspecificationofwhatpreciselyacognitiveprocessdoes;whatarethetypesof
structuresthatarepredictedto(probabilistically)emergebyourtheoryoflanguage
structure?Weaimtodevelopsuchaframeworkforunderstandingthestructureofcode
mixing.However,itisimportanttonotethatunderstandingthecognitiveandultimately
neuralprocessesthatcomputetheseinput-outputmappingsiskeytodevelopinga
completetheoryoflanguageprocessing(fordiscussion,seeGoldrick,2011;Smolensky,
2006b).Grammaristhefoundationalcomponentatthebeginningofdevelopingacomplete
theory,butisbynomeansthefinalstep.
Relationshiptootherformalapproachestocodemixing
Generativetheoriesofcodemixing—suchastheonewefurtherdevelophere—can
bedividedintotwotypes.Onesetspecifiesgrammarsspecifictocodemixing.Rulesor
constraintsreferspecificallytocodemixedstructures,explicitlystatingpreferencesfor
distincttypesofcodemixing(Belazi,Rubin,&Toribio,1994;Bhatt,1997;DiSciullo,
Muysken,&Singh,1986;Joshi,1985;Legendre&Schindler,2010;Muysken,2013;Myers-
Scotton,1993;Poplack,1980;a.o.).AclassicexampleisPoplack’s(1980:586)Equivalence
Constraint:“Code-switcheswilltendtooccuratpointsindiscoursewherejuxtapositionof
LlandL2elementsdoesnotviolateasyntacticruleofeitherlanguage.”Here,the
grammaticalprinciplerefersdirectlytocodemixing,distinctfrom(butrelatedto)syntactic
patternsinnon-codemixedcontexts.Similarly,fromanOptimality-Theoreticperspective,
Muysken(2013:715)makesuseofconstraintssuchas“*CSL=Don’tswitchbetween
separatelanguages,eitherintheirlexiconorintheirgrammar.”
Analternativeapproachassumesthegrammaticalprinciplesoftwolanguagesare
integratedduringcodemixing,andthatthisintegrationyieldsthepatterns(Chan,2003,
![Page 20: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/20.jpg)
20
2008,2009;Lohndal,2013;MacSwan,1999,2000;Mahootian,1993;Woolford,1983;a.o.).
SinceMahootian(1993),thispositioniscommonlyreferredtoasthe“nulltheory”ofcode
mixing,accordingtowhichmonolingualandbilingualgrammarsshouldbesubjectto
identicalrepresentational/grammaticalconstraintsandpsychologicalprinciples.
Ourapproachincorporateselementsofbothperspectives.Followingthenulltheory
perspective,weassumethatthefeaturesofcodemixingreflectgeneralprinciplesof
syntacticknowledgeandsentenceprocessing.Blendrepresentationsareageneralfeature
ofgrammaticalknowledgeandprocessing;theemergenceofdoublingconstructionsin
bilingualsisaconsequenceoftheprinciplesunderlyingthesegrammars.However,in
contrasttothestrongestversionofthenulltheory(e.g.,MacSwan,1999),weassumethat
grammaticalprinciplescanrefertolanguagemembership(e.g.,distinguishingthewell-
formednessoflexicalitemsinEnglishvs.Tamilbasednotonsyntacticfeaturesbutonthe
languagefromwhichtheitemoriginates).Thisiscriticaltounderstandingattested
doublingpatterns.Insuchconstructions,thedoubledelementshave(nearly)equivalent
grammaticalfeatures(e.g.,theymatchinagreementfeatures(tense,aspect,case)andshare
argumentstructurerequirements),yetsurfaceinpositionsappropriatetotheelement’s
sourcelanguage.Ifthegrammardoesnotmakereferencetothesourcelanguage,thereis
nomeansofcapturingthisrestriction.
Weightedconstraintinteractioninstochasticgenerativegrammars
OurtheoryutilizestheGradientSymbolicComputationformalism(GSC;Smolensky
etal.,2014).GSCisaconstraint-basedapproachedtogenerativegrammar,buildingon
workinOptimalityTheory(Legendre,2001;Legendre,Putnam,deSwart,&Zaroukian,in
press-a;Prince&Smolensky,1993/2004)andHarmonicGrammar(Legendre,Miyata,&
![Page 21: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/21.jpg)
21
Smolensky,1990,2006;Pater,2009).Likeothergenerativegrammars,GSCdefinesa
functionthatmapsinputstructures(e.g.,logicalforms)tooutputstructures(e.g.,syntactic
structures).InGSC(andHarmonicGrammar),thegrammarisdefinedviaasetofweighted
violableconstraintsthatassignanumericalwell-formednessvalue(harmony)toeachof
thecandidateoutputsforagiveninput.GSCgrammarsarestochastic,generatinga
probabilitydistributionoveroutputforms(reflectingtherelativeharmonyofthe
candidates).
Tobuildupourtheory,webeginbymodelingmonolingualgrammars.Considera
simpleinputconsistingofasubjectandverb;asshowninFigure2,thiscanbelinearized
usingatleasttwosurfacesyntacticstructures.
Figure2.Twoalternativesurfacesyntacticstructurescorrespondingtotheinputgoes
(John).Thetextbeloweachprovidesabracketnotationcorrespondingtothetree,withthe
subscriptoneachopenbracketdenotingthecategoryoftheconstituent.
ThesesurfacesyntacticstructuresreflecttheassumptionsofX-bartheory(see
Carnie2010:Chapter7,forahistoricaloverview).Briefly,X-bartheoryplacesrestrictions
ontraditionalphrasestructuregrammars.Abstractingawayfrommorecomplex
phenomena,basicX-bartheoryassumesthatthebasicstructureofanextendedprojection
consistsoftwosyntacticphrases:XP,consistingofaspecifierandanX'phrase;andX',
!
! VP! ! ! ! !!!!!!!!VP! 3 3 John! !!!!!!!!!V'!! ! ! V'! !!!!!!John!! 3 3 !!!!!!!!!!goes!! !!!!Ø! !!!!!! goes!!!!!!!!!!!!!Ø!! ![VP!John![V'!goes]]! !! [VP![V'!goes]!John]!
!! !
![Page 22: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/22.jpg)
22
consistingoftheheadelementX0andacomplementizer.Asillustratedhere,theextended
projectionoftheverbconsistsofaverbphrase(VP)withspecifierJohn,andaV'phrase
consistingoftheheadgoes(andanullcomplement).Thissimplifiednotationissufficient
forcapturingthebasicfactsaboutsyntacticconstituencyandallowsustospecifythe
patternsinwordordervariationunderlyingthedoublingexamplesweconsider.We
believethattheinsightsofthisanalysiswouldgeneralizetomorerecentrepresentational
frameworks(e.g.,BarePhraseStructure),whichretainedmanyoftheinsightsofthisbasic
system(seee.g.,Chametzky2000).
Tocharacterizethedifferencebetweenlanguagesthatpreferthelinearorder
subject-verbvs.verb-subject,webuildonGrimshaw’s(1997,2001)analysis.Grimshaw
proposedconstraintsonthealignmentofspecifiers,heads,andcomplementstoedgesof
extendedprojectionsinX-bartheory3.Therelativeweightingoftheseconstraintsderives
differentwordorderingpreferences;i.e.determiningwhetheraparticulargrammar
prefersanSVO-vs.anSOV-orderingofarguments.WeadaptthesetodevelopaGSC
analysis,usingconstraintsonstructuralwell-formedness(markednessconstraints).A
subsetoftheseisshownbelow:
(10) HEADLEFT:“EveryX0isleftmostinX-max.”ForeachX0incandidateC,decreaseC’sharmonyby1foreachterminalnodeinterveningbetweentheX0andtheleftedgeofitsXP.
(11) SPECLEFT:“EveryspecifierisleftmostinX-max.”
ForeachspecifierincandidateC,decreaseC’sharmonyby1foreachterminalnodeinterveningbetweenthespecifierandtheleftedgeofitsXP.
Apseudo-Englishweightingofthesetwoconstraintsisshownbelow.Thecolumns
showtheconstraints.Cellsineachcolumnshowtheconstraint’scontributiontothe
harmonyofeachcandidate(scaledbytheweightoftheconstraint).Here,sinceSPECLEFT
![Page 23: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/23.jpg)
23
hasastrongerweightingthanHEADLEFT,thesubject-verbcandidatehasahigherharmony
value.Thefinalcolumngivestheprobabilityofeachcandidate.AsinMaximumEntropy
grammars(Goldwater&Johnson,2003;Hayes&Wilson,2008),theprobabilityisan
exponentialfunctionofitsharmonyrelativetotheothercandidates4.Inthisexample,the
harmonyofthesubject-verborderissomuchhigherthanverb-subjectthatitsprobability
isextremelycloseto1.0.Whilethesecondcandidatetechnicallyhasnon-zeroprobability,
itisextremelysmall;lessthan1x10–8.Thus,thegrammarisessentiallycategorical.
Table1.GrammarfragmentforEnglishwordorderInput:goes(John) SPECLEFT HEADLEFT
–20 –1 Harmony Probability[VPJohn[V'goes]] –1 –1 ≈1.0[VP[V'goes]John] –20 –20 ≈0
Iftherankingoftheconstraintsshifts,theprobabilityofdifferentcandidateswill
alsoshift.Thiscanspecifycross-linguisticvariation;considerthegrammarfragmentin
Table2.Here,HEADLEFThasamuchstrongerweightingthanSPECLEFT.Thisyieldsa
languagewithpost-verbalsubjects.
Table2.Grammarfragmentforverb-subjectwordorder
Input:goes(John) SPECLEFT HEADLEFT
–1 –20 Harmony Probability[VPJohn[V'goes]] –20 –20 ≈0[VP[V'goes]John] –1 –1 ≈1.0
Intheabovecases,thedifferencesinharmonyarequitelarge.However,as
differencesinharmonyofcandidatesgrowsmaller,variationcanresult:
![Page 24: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/24.jpg)
24
Table3.Grammarfragmentforvariablewordorder
Input:goes(John) SPECLEFT HEADLEFT –12 –10 Harmony Probability
[VPJohn[V'goes]] –10 –10 ≈0.88[VP[V'goes]John] –12 –12 ≈0.12
Intheseexamplefragments,theweightingofconstraintshasbeenarbitrarily
decided.Ourassumptionisthatsuchweightingsareacquiredbylearnersbasedonthe
probabilitydistributionofformsintheirlinguisticexperience(Goldwater&Johnson,2003;
Hayes&Wilson,2008).Forexample,anEnglishlearnerwouldacquiremanyexamples
withthewordordersubject-verb;allelsebeingequal,thiswouldleadhertofavor
constraintweightingssimilartoTable1overthoseinTable2orTable3.Asthisdiscussion
isfocusedonexploringthebasicprinciplesofthetheory,wedonotundertakeadetailed
studyofthisacquisitionprocess.Theconclusionswedrawbelowwillnotbedependenton
theparticularweightvaluesusedtoillustrateouranalysis.
Toprovideconcreteweightvaluesforthepurposeofillustration,weutilized
GoldwaterandJohnson’s(2003)learningalgorithm(asimplementedintheMaxEnt
GrammarTool;Hayes,2009;weightswereroundedtoyieldintegervaluesforeaseof
exposition).Aweakuniformpriorwasusedforeachconstraint(µ=0; σ=107).Theprior
influenceshowconstraintweightsareupdatedduringlearning.Thispriorspecifiesatarget
valueforeachconstraintweight(here,zero,sothatconstraintweightsareassmallas
possible),alongwithapenaltyfordeviatingfromthattargetvalue(here,theveryhigh
varianceimpliesanextremelysmallpenalty).Thisreducesourexamplestoasinglefree
arbitraryparameter:thevarianceontheprior.Usingthistrainingalgorithm,this
![Page 25: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/25.jpg)
25
parameter(combinedwiththetrainingdataexemplifyingagivenwordorder)completely
determinestheconstraintweightsbelow.
Head-complementwordordervariation
Forillustrationpurposes,thegrammarfragmentsabovearequitesimple,
consideringonly2possiblecandidateoutputsand2constraints.Inthissection,we
considerasomewhatmoreextendedexample,includingtwoadditionalconstraintsandan
explicitlydefinedspaceofpossibleoutputstructures.Thisallowsustospecifymonolingual
grammarsthatcontrastinwordorder—specifically,subject-verb-objectvs.subject-object-
verb(againbuildingonGrimshaw,1997,2001).
Modelingconstructionsincludingcomplementsrequiresanadditionalmarkedness
constraint,paralleltothoseproposedabove:
(12) COMPLEFT:“EverycomplementisleftmostinX-max.”ForeachcomplementincandidateC,decreaseC’sharmonyby1foreachterminalnodeinterveningbetweenthecomplementandtheleftedgeofitsXP.
Extendingthesetofcandidates,weconsidernotonlythosethatvaryinwordorder
butalsothosethatomitelementsoflexicalconceptualstructure.Theseavoidviolationsof
theconstraintsabovebysimplyleavingoutelements(acandidatewithnocomplements
cannotviolateCOMPLEFT).Toinsurethatsuchcandidatesaredispreferred,weusea
faithfulnessconstraintthatassignswell-formednessbasedontherelationshipbetween
syntacticandsemanticstructure(afterLegendre,Wilson,Smolensky,Homer,&Raymond,
1995):
(13) PARSE:“Lexicalconceptualstructureisparsed.”DecreasecandidateC’sharmonyby1foreachelementoflexicalconceptualstructurethatdoesnothaveacorrespondingelementinC’ssurfacesyntacticstructure.
![Page 26: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/26.jpg)
26
Forthisdiscussion,weassumecandidateoutputsarelimitedtothoseincludingX-
bartreesthatparseallelementsoflexicalconceptualstructureoranypossiblesubsetof
elements.Wefurtherassumethatelementsoflexicalconceptualstructureareparsedinto
theappropriatesyntacticpositions(e.g.,subjectisparsedintoSpec).Foraninputwitha
verb,subject,andobject,thisyieldsthecandidatesetshowninTable4.Constraintweights
weredeterminedbytrainingondatareflectingthecontext-neutralEnglishwordorderfor
thisparticularinput:100%subject-verb-object(e.g.,“Theygaveagrant”;Berk,1999).Note
thatanEnglishlanguagelearnermighthavesomewhatdifferentweightingsforthese
constraints,asshewouldbeexposedtodifferentinputs(e.g.,inputswithnoobject
complement)andwouldhaveamuchlargersetofconstraints.
Table4.Grammarfragment:Englishsubject-verb-objectwordorder.Blankcellsindicatethecandidatedoesnotviolatetheconstraint.NotethatsinceCOMPLEFThasaweightingof0,violations of the constraint do not decrease harmony. Probabilities are rounded; those lessthan1x10-4arerepresentedas0.Input:gave(they,grant)
SPECLEFT HEADLEFT COMPLEFT PARSE
–13 –12 0 –25 H Pr[VPthey[V'gavegrant]] –12 0 –12 1[VPthey[V'grantgave]] –24 0 –24 0[VP[V'gavegrant]they] –26 0 –26 0[VP[V'grantgave]they] –26 –12 –38 0[VPthey[V'gave]] –12 –25 –37 0[VP[V'gave]they] –13 –25 –38 0[VPthey[V'grant]] 0 –25 –25 0[VP[V'grant]they] –13 –25 –38 0[V'grantgave] –12 –25 –37 0[V'gavegrant] 0 –25 –25 0[VPthey] –50 –50 0[V'gave] –50 –50 0[V’grant] –50 –50 0ø –75 –75 0
![Page 27: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/27.jpg)
27
Thistrainingprocedureyieldsastrongweightingtoourfaithfulnessconstraint.
Althoughdeletingelementsoflexicalconceptualstructureallowsmanycandidateoutputs
inTable4toavoidviolationsofthemarkednessconstraints,theyincurone,two,orthree
penaltiesfromthefaithfulnessconstraint,substantiallyloweringtheirharmony.The
relativerankingofthemarkednessconstraintsdetermineswhichofthefirstfourfully
faithfulcandidatesareselected.Thesecondmost-highlyweightedconstraintprefers
specifiersoccurtotheleftoftheV'projection,rulingoutthethirdandfourthcandidates.
Thethirdconstraint,preferringthatheadsbeleftmost,thenrulesoutthesecond,yielding
thesubject-verb-objectwordorder.
LanguageslikeTamilexhibitacontrastingcontext-neutralwordorderpattern,
subject-object-verb(Sarma,1999;Schiffman,1999).Trainingonthesedataforthesame
inputyieldsacontrastingweighting:
![Page 28: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/28.jpg)
28
Table5.Grammarfragment:Tamilsubject–object-verbwordorder.Blankcellsindicatethecandidatedoesnotviolatetheconstraint.NotethatsinceHEADLEFThasaweightingof0,violations of the constraint do not decrease harmony. Probabilities are rounded; those lessthan1x10-4arerepresentedas0.
Input:gave(they,grant)
SPECLEFT HEADLEFT COMPLEFT PARSE
–13 0 –12 –25 H Pr[VPthey[V'gavegrant]] 0 –24 –24 0[VPthey[V'grantgave]] 0 –12 –12 1[VP[V'gavegrant]they] –26 –12 –38 0[VP[V'grantgave]they] –26 –26 0[VPthey[V'gave]] 0 –25 –25 0[VP[V'gave]they] –13 –25 –38 0[VPthey[V'grant]] –12 –25 –37 0[VP[V'grant]they] –13 –25 –38 0[V'grantgave] 0 –25 –25 0[V'gavegrant] –12 –25 –37 0[VPthey] –50 –50 0[V'gave] –50 –50 0[V'grant] –50 –50 0ø –75 –75 0
ThesolechangetotheweightingistherelativestrengthofHEADLEFTandCOMPLEFT.
Nowthatthelatterhasahigherweighting,thereisareversaloftherelativeharmonyofthe
firsttwocandidates;objectcomplements,notverbalheads,areleftmostinV'.Thisyields
theappropriatesubject-object-verbwordorder.
Codemixinginconstraint-basedgrammars
Havingdemonstratedthatourformalismcanrepresentcross-linguisticdifferences
inwordorder,weconsiderthegrammarsutilizedbybilinguals(e.g.,anEnglish-Tamil
bilingual).Asreviewedabove,inintra-sententialcodemixingbilingualsintegrate
grammaticalprinciplesfromeachlinguisticsystem.InGSC,grammarsaredefinedbythe
weightingofconstraints.Wethereforeformalizethisintegrationbyhavingtheweightsof
constraintsinthegrammarunderlyingcodemixingreflectbothlinguisticsystems.
![Page 29: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/29.jpg)
29
Weproposetoassociateeachlinguisticsystempresentinacodemixedutterance
(L1,L2)withanactivationvalue(αL1,αL2;thesumofthesevaluesmustbe1).Thisscales
theamounteachlinguisticsystemcontributestothemodulationofeachconstraint’s
violations.Specifically,violationsofeachconstraintCarescaledbythesumoftheC’s
rankingineachlinguisticsystem,weightedbytheactivationofthatsystem.Thisallowsfor
interactionsbetweenthetwolinguisticsystems(aseachcontributestoharmonyforevery
element).Thisscalingvalueisadditionallyincreasedbytheactivationofalinguisticsystem
iftheconstraintreferstoanelementinthatsystem.Thislatterfactorencodesa(violable)
preferenceforlinguisticelementstoobeythepropertiesofthesourcelanguage.
Forexample,supposeHEADLEFThasweighting–10inL1and–5inL2;theactivation
ofL1is0.75andL20.25.AnL1headthathas1positioninterveningbetweenitandthe
edgeofXPwillincuraharmonypenaltyof1*[–10*(0.75+0.75)+1*–5*(0.25)]=–16.25.
AnL2headthathas1positioninterveningbetweenitandtheedgeofXPwillincura
harmonypenaltyof1*[–10*(0.75)+1*–5*(0.25+.25)]=–10.0.TheL1headincursa
slightlygreaterpenaltybecauseofthestrongerweightingofthisconstraintwithinthe
sourcegrammar.
Blendsingrammaticalrepresentations
WeightedconstraintsarenotanovelclaimofGSC;theseoverlapwithexisting
formalismsincludingHarmonicGrammarandMaximumEntropymodels.Anovelfeature
ofGSCistheincorporationofblends.Specifically,GSCproposesthatelementsofsymbolic
grammaticalrepresentationsareassociatedwithactivationvalues.Thisincludesall
elementsofsyntacticrepresentations:thenodesofthetree(bothterminalandnon-
terminalelements)aswellasthelinksbetweennodes.Thisallowsforthespecificationof
![Page 30: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/30.jpg)
30
blends;multiplerepresentationalelementsthatwouldoccupyasinglepositionorroleina
discretesymbolicrepresentationcanbeco-present,tovaryingdegrees.Inpreviouswork,
wehaveexaminedtherolethatblendsplayinmonolinguallanguageprocessing.For
example,inphonologicalspeecherrors(mispronouncingbataspat),thereisevidencethat
targetanderrorsoundrepresentationsareco-activated(theonsetoftheerrorsyllablepat
isablendsimultaneouslycontainingelementsofboth/b/and/p/;Goldrick&Chu,2014;
Smolenskyetal.,2014).Here,weextendthisverygeneralrepresentationalprincipletothe
domainofbilingualism,focusingonblendsinvolvingelementsfromtwodistinctsource
languages.
Forourinitialdiscussion,wefocusoncaseswheremultipleelementsareco-present
tothesamedegree;thissufficestoillustratethegeneralanalysis.Weillustratethiswith
example(7),repeatedbelowforconvenience.
(14) Doubling:VerbsEnglish-Tamil(Sankoffetal.,1990:93)theygavemearesearchgrantkoɖutaatheygavemearesearchgrantgave.3.PL.PAST‘Theygavemearesearchgrant.’
Weanalyzetheinputtothegrammarincodemixingcontextsasconsistingofblends
ofsemanticelements.Fortheexampleabove,weanalyzetheinputastheco-presenceof
twoverbalelements—drawnfromtwodistinctlanguages—whichsharemultiple
arguments(shownin(15)below).Thisrepresentationinstantiatesacoreclaimofthe
blendanalysisofdoublingconstructions:thesimultaneouspresence,intheinputtothe
grammar,ofthesemanticrepresentationunderlyingthedoubledelements.
(15) (3rd plural, grant)!"ɖ!"##!"#$
![Page 31: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/31.jpg)
31
Blendinginsyntacticrepresentationsisakeypartofouranalysisofdoubling
constructions.Ingeneral,weanalyzesuchconstructionsinvolvingtwoV'phrases,with
distinctheads,thatoccupythesamepositioninthetree.TheseV'-projectionssharetwo
complements:anindirect(me)andadirectobject(grant),withtheseelements
simultaneouslyassociatedtobothV'.Figure3illustratestherepresentationhypothesized
for(15).Notethatweadoptaternary-branchingstructureforthedoubleobject
construction.Ouranalysisdoesnothingeonthisassumption;abinarybranchingVP-shell
structure(Larson,1988)couldalsoincludeblends,andwouldyieldsimilarresultshere.
Figure3.HypothesizedblendstructureforEnglish-Tamildoublingconstructiontheygave
me(aresearch)grantkoɖutaa.Thetextbeloweachprovidesabracketnotation
correspondingtothetree.Dashedlineshighlightblendedcomponentsofthe
representation.
Inthisexample,therearetwoV'(headedbygavevs.koɖutaa),sharingthe
complementsmeandgrant.ThesetwoV'simultaneouslyserveastheheadofVP(ablend
ofnodesinthesamepositioninthetree).Thisisakeypartofouranalysis,asitplaceseach
ofthedoubledelementsinthesamerolewithinthesyntacticstructure.Thissetsupthe
!
! !! !!!!!!!!!VP! ! ! ! !!!!!!!!They! !!!!!V'V'!! !!!!! !! !!!!!gave!! !!!!!!!!!!!!koɖutaa!!! ! !!!!!me!a!grant!!! !! ![VP!They![V'!gave![V'!me!a!grant]!koɖutaa!]!]!
!! !
![Page 32: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/32.jpg)
32
structuralrelationshipsthatensurethedoubledelementsexhibitparalleltense,aspect,
agreement,andcasefeatures.
Thusfar,ourdiscussionofblendinghasfocusedontheblendingofelementswithin
astructuralposition(e.g.,twoV'projectingfromtheheadposition;twoverbsoccupying
thesameroleinsemanticstructure).However,aseachelementofarepresentationcanbe
associatedwithanactivationvalue,blendingispredictedtoextendtotherolesthemselves.
ThisisdepictedinFigure3,wherethesameargumentsaresharedacrossthetwophrases.
Thisrepresentsablendingofpositionsorsyntacticrelationships;onenodeinthetree(e.g.,
theindirectobject)hasthesametypeoflinktotwodistinctnodes(e.g.,V'headedbyan
EnglishverbandtheV'headedbyaTamilverb).Inouranalysis,thissharingofthe
complementsiscritical,asitallowstheblendedstructuretobelinearized.Iftwolexical
itemssimultaneouslyoccupytheheadofVP,thereisnowaytodeterminewhichoneof
themshouldoccurfirstinthesurfacestring;simultaneitymeansthereisnoprecedence
relationshipbetweenthephrases.However,becausetheprecedencerelationshipswithin
eachphraseshareacommonelement(gaveprecedesmeandgrant,meandgrantprecedes
koɖutaa),acompleteorderingoftheterminalscanbedetermined(bytransitivity,gave
mustprecedekoɖutaa).
Asreviewedinprecedingsections,thereisampleempiricalevidencethatblendsare
subjecttobothphysicalconstraints(e.g.,unimodalbilingualscannotplacearticulatorsin
contradictorypositions)aswellascognitiveconstraints(e.g.,affective/pragmatic
constraints,relativestrengthofthetwolanguages,grammaticalcontext).GSCsuggestsa
cleartheoreticalmotivationforsomeofthesecognitiveconstraints(Smolenskyetal.,
2014).Inmanycases,purelygrammaticalconstraintswillpreferblendsthatdonotreflect
![Page 33: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/33.jpg)
33
thestructuralprinciplesofthesourcegrammars.Forexample,ifconstraintspreferthatall
elementsbeattheleftedge(i.e.,specifier)ofXP,whynotplaceallelementsinthatposition
simultaneously?Thiscandidatewouldsatisfyalltheconstraintsabove;intheabsenceof
otherprinciples,itwouldhavethehighestharmony.Byallowingthegrammartoavoid
makingchoicesbetweenstructuresexhibitingdifferentwordorders,thisblend
representationwouldpreventGSCfromcapturingkeypropertiesofcross-linguistic
variation.GSC,unlikemanypreviousconnectionistproposals,thereforeincorporatesan
explicitdispreferenceforblendrepresentations(Smolenskyetal.,2014).Forthepurposes
ofthisdiscussion(focusedonsimultaneouspresenceofequallyactiveelements),we
representthisasaconstraintthatsimplyreferstothepresencevs.absenceofblended
elements5:
(16) QUANTIZATION:“Candidatesmustbediscretesymbolicstructures.”ForeachblendedstructuralelementincandidateC,decreaseC’sharmonyby1.
ItisimportanttoemphasizethatQUANTIZATIONplaysakeyroleinmonolingual
grammars.Althoughwehaveemphasizedtheirroleinbilinguallanguageprocessing,blend
representationsareaubiquitousfeatureofmonolingualprocessingaswell(Melingeretal.,
2014).Allgrammaticalcomputations—notonlythoseinvolvedincodemixing—must
thereforeevaluaterepresentationswheremultipleelementsoccupythesamestructural
role(Smolensky,etal.2014).
Withrespecttothecurrentdiscussion,itisimportanttonotethatQUANTIZATIONis
violable;otherconstraintscancompelthepresenceofblends.Thenextsectionexaminesa
situationinwhichthiscanoccur.
![Page 34: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/34.jpg)
34
Analysisofdoublingconstructions
Combiningtheresultsoftheprevioussections,weexamineconditionsunderwhich
doublingconstructionscanbeproduced.Welimitourselvestocandidatesthatparseall
elementsoflexicalconceptualstructureoranypossiblesubsetofelements(inthiscode
mixingcase,thisincludeselementsfrommultiplelanguages).Parsedelementsare
associatedwithappropriatesyntacticpositions.Weextendthiscandidatesettoincludethe
blendstructuredepictedinFigure3,simplifyingtheexamplebyomittingtheindirect
object.Followingthesectiononcodemixing,theconstraintrankingsinTable4[English]
andTable5[Tamil]arecombinedtodeterminethegrammarusedtoevaluatecode-mixed
constructions.WeincludetheQUANTIZATIONconstraintasalanguage-independent
constraint,examininghowitsrelativeweightingaffectstheprobabilityofblend
constructions.
ThetableauinTable6illustratesonerankingthatqualitativelyapproximatesthe
empiricaldistributionofdoublingconstructions—non-zero,butrelativelysmall
probability.
Table 6. Grammar fragment: Doubling construction, Tamil-English code mixing (seeappendix for full set of candidates). Blank cells indicate the candidate does not violate theconstraint.Probabilitiesarerounded;thoselessthan1x10-4arerepresentedas0.Input:
(3rd plural, grant)!"ɖ!"##!"#$
SPECLEFT HEADLEFT COMPLEFT PARSE QUANT
English:0.5activation –6.5 –6 0 –12.5 Tamil:0.5activation –6.5 0 –6 –12.5 Combinedweighting –13 –6 –6 –25 –8 H Pr
a.[XPthey[X'gave[X'grant]koɖutaa]] –30 –12 –16 –58 0.039b.[XPthey[X'koɖutaa[X'grant]gave]] –42 –12 –16 –70 0c.[XPthey[X'grantkoɖutaa]] –12 –6 –37.5 –55.5 0.48
![Page 35: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/35.jpg)
35
Theobserveddoublingconstruction(candidatea)violatesseveralmarkedness
constraints:
• WithrespecttoHEADLEFT,itreceivesatotalharmonypenaltyof30dueto:
o Languagegeneralconstraints:–24=–6*4violations(3forkoɖutaa
and1forgave)
o English-specificconstraints:–6=–6*1violation(forgave)
o Tamil-specificconstraints:0=0*3violations(forkoɖutaa)
• ForCOMPLEFTthepenaltyis–12:
o Languagegeneralconstraints:–12=–6*2violations(forgrant)
o English-specificconstraints:0=0*2violations(forgrant)
• TwoviolationsoftheQUANTIZATIONconstraintyieldapenaltyof–16(forthe
twoV'simultaneouslyprojectingfromtheheadofVP,aswellasgrant
occurringasacomplementinbothV').
However,unlikecandidatesthatdeleteEnglishorTamilverbs(e.g.,candidatec),
theobserveddoublingconstructionsavoidviolationsofthefaithfulnessconstraintPARSE.
Theprobabilityofthedoublingconstructionrelativetonon-doubledcandidateslike(c)is
thereforerelatedtotheweightingoffaithfulnessrelativetothemarkednessconstraints
above.
Assumingfaithfulnesshasastrongenoughweighttocompelthepresenceof
doubling,theattestedcandidate(a)willbepreferredtounattestedcandidate(b)duetothe
influenceoflanguage-specificconstraints.Thetwocandidatesincurequalviolationsofthe
languagegeneralconstraints,but(b)incursextraviolationsoflanguage-specific
constraints(2additionalEnglish-specificviolationsforgave).Solongastheselanguage-
![Page 36: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/36.jpg)
36
specificconstraintshaveanon-zeroweighting,thegrammarwillassignhigherprobability
totheattestedform.
Predictedlimitationsondoublingconstructions
Ouranalysisabovefocusedonanexamplethat,followingtheempiricalpatternsof
doubling,consistedoftwoheadswithasharedcomplement.Weassumethatdoubled
complements—specifically,argumentsofverbs—areunattestedbecausesuchstructures
wouldviolateChomsky’sThetaCriterion,whichstatesthat“eachargumentbearsoneand
onlyonetheta-role,andeachtheta-roleisassignedtooneandonlyoneargument”(1981:
35;seealsoChan,2003,2008,2009).Forexample,contrasttheattestedtheygaveEnglish
grantgaveTamilwiththeunattestedtheygrantTamilgavegrantEnglish.Inthelatter,theTheta
CriterionisviolatedasbothgrantTamilandgrantEnglishsharethesamerole(theme).In
contrast,intheattestedexamplethethemeisoccupiedbyasingleentity(grantEnglish,
sharedacrossthetwoverbs)sothereisnoviolation.Weassumethatviolationofthis
constrainteithercausesthegrammartocategoricallyruleoutsuchstructuresor,
alternatively,greatlyreducestheirprobability(iftheThetaCriterionisrealizedviaa
stronglyweightedconstraint).
Anovelpredictionofouraccountisthatdifferentdistributionsofdoublingshould
beobservedforexpletivevs.non-expletiveelements.Expletiveelementsarethosethat
appearsolelyforstructuralconsiderationsandaresemanticallyvacuous(e.g.,inIt’s
raining;thepronounit doesnotactuallyrefertoaspecificagent).Ouranalysisattributes
thepresenceofdoublingtotheco-presenceofmultipleelementsintheinputtothe
grammar.Thisisreflectedbythecrucialroleoffaithfulness;PARSEprovidesanadvantage
fordoublingconstructions,inspiteoftheirincreasedviolationsofalignmentconstraints
![Page 37: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/37.jpg)
37
andQUANTIZATION.Thismakesanovelprediction:weshouldnotobservedoublingof
expletiveelements(e.g.,Englishdo)alone.FollowingGrimshaw(1997,2001,2013,a.o.),
theoccurrenceofsuchelementscanbeattributedtostructural(i.e.,markedness)
constraintsratherthantothepresenceofexpletivesintheinputtothegrammar.Asthey
onlyappeartosatisfythestructuralrequirementsofotherelements,ouraccountpredicts
thatexpletivesshouldnotbedoubledinisolation.
Tomakethisconcrete,consideracasewheredoublingcouldbepredicted.English
andKoreanbothutilizedo-supportinnegatives(Grimshaw,2013),butexhibitcontrasting
wordorder.Liketheverb,Koreannegatives(anddo)appearfollowingtheobject,the
oppositeofEnglish:
(17) Chelswu-kappang-ulmek-cianiha-ess-ta(Hagstrom,1996:169)Chelswu-NOMbook-ACCread-CINEGdo-PAST-DECL‘Chelswudidnotreadthebook.’
Anaccountthatattributedthepresenceofdoublingtocontrastingsurfaceword
orderswouldpredictthatdoublingofeithertheverb,negativemorpheme,ordoalone
couldoccurinKorean-Englishcodemixing.Incontrast,ouranalysispredictsthatthe
doublingofdoaloneshouldnotoccur.Thepresenceofdoherereflectsstructuralwell-
formednessconstraints,triggeredbythepresenceofnegation(Grimshaw,1997,2013).
Thereisnoindependentmotivationtoincludethisexpletiveasidefromthis.Thus,
doublingofdoalonewouldviolateconstraintssuchasQUANTIZATIONwhileprovidingno
benefitwithrespecttoconstraintssuchasPARSE.(Infact,insertionofexpletiveelements
notpresentintheinputmayviolatefaithfulnessconstraintssuchasGrimshaw’sFULLINT.)
Bilingualdoublingcannotbeanalyzedasmovement
![Page 38: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/38.jpg)
38
Havingoutlinedourproposal,webrieflyconsiderwhetherexistinganalysesof
doublingcouldprovideanalternativetoouranalysis.Doublingofelementsinmonolingual
grammarshasbeenafocusofrecentgenerativeresearch(see,e.g.,thecontributionsin
Barbiers,Koeneman,Lekakou,&vanderHam,2008).Thisistypicallyanalyzedasresulting
fromphonologicalrealizationofmultiplelinksinarepresentationalstructurelinkinga
syntacticelementfromitslocationinthesurfacesyntacticstructuretootherdistal
locationsinthesyntactictree(e.g.,derivationalchains;Jónsson,2008;Nunes,2004).Such
ananalysisdoesnotappeartobetenablefortheattestedexamplesofbilingualdoubling.
ConsidertheTamil-Englishexampleanalyzedabove,wherethereisdoublingoftheverb
gave.ThereisnoclearmotivationforsuchmovementinthegrammarofeitherEnglishor
Tamil.Evenifweweretoentertainsuchananalysis,itwouldviolateabasicprincipleofthe
localityofheadmovement—theheadofaprojection(here,theverb)cannotundergo
movementwithinthatprojection(Abels,2003).
Gradientco-activationanddirectionsforfutureresearch
Gradientactivationinblends—thekeytoaccountingfortherangeof
psycholinguisticdatareviewedintheintroductiontothispaper—isclearlyoutsidethe
scopeofanytraditionalgrammaticaltheory.IntheGSCframework,suchrepresentations
arepossibleinputsandoutputstothegrammar,andareassignedHarmonyvaluesby
constraints.Specifically,theviolationofeachconstraintreflectstheactivationofthe
constituentsreferredtobytheconstraint.Forexample(c.f.(13),emphasisaddedtoshow
contrastindefinitions):
(18) PARSE:“Lexicalconceptualstructureisparsed.”DecreasecandidateC’sharmonybytheactivationofeachelementoflexicalconceptualstructurethatdoesnothaveacorrespondingelementinC’ssurfacesyntacticstructure.
![Page 39: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/39.jpg)
39
Theincorporationofactivationstemsfromprinciplesofconnectionistcomputation
(Legendreetal.,1990;Smolensky&Legendre,2006).GSC-representationsarerealizedby
real-valuedactivationvectorsoversimpleprocessingunits.Overthecourseof
computation,activationspreadsamongtheseunitsviaweightsthatimplement
grammaticalconstraints(Smolenskyetal.,2014).Critically,theactivationvaluesare
continuouslyupdated;thenetworkdoesnotsimply‘jump’fromonegradientsymbolic
representationalstatetoanother.Inordertoinsurethatthiscontinuousupdaterespects
thewell-formednessconditionsspecifiedbythegrammar,constraintsmustassignwell-
formednessvaluestothefullrangeofintermediate,gradientrepresentationalstates.
Harmonythereforevarieswiththeactivationofeachrepresentationalconstituent.
Inthecontextofcodemixing,gradientactivationofelementsintheinputwillalter
therelativeprobabilityoftheseelementsappearingintheoutput.Thisisbecause
violationsoffaithfulnessconstraintslikePARSEwillbescaledbyactivation.Foragiven
weightingofPARSE,theharmonypenaltyincurredbydeletinganelementwillbelessifthe
elementhasalowervs.higheractivationvalue.Lessactiveelementswillthereforebemore
likelytobedeleted.ThetableauxinTable7andTable8illustratethisforasimplesubject-
verbsentence(here,weassumeQUANTIZATIONisstronglyweighted,blockingtheappearance
ofoutputblends).
![Page 40: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/40.jpg)
40
Table7.Effectsofvariationininputactivation:StrongbiastowardsEnglishvs.Tamil.Notethatcompetitorsinvolvingadditionaldeletionofinputelementshavebeenomitted(duetoviolationsofPARSE,theyhaveverylowHarmonyandthusoutputprobabilitynear0).
Input:
(3rd plural)!"ɖ!"## .!!"#$ .!
SPECLEFT HEADLEFT PARSE
English:0.5activation –6.5 –6 –12.5 Tamil:0.5activation –6.5 0 –12.5 Combinedweighting –13 –6 –25 H Pr[XPthey[X'gave]] –12 –15 –27 0.82[XPthey[X'koɖutaa]] –6 –22.5 –28.5 0.18Table8.Effectsofvariationininputactivation:WeakerbiastowardsEnglishvs.Tamil.Notethatcompetitorsinvolvingadditionaldeletionofinputelementshavebeenomitted(duetoviolationsofPARSE,theyhaveverylowHarmonyandthusoutputprobabilitynear0).Input:
(3rd plural)!"ɖ!"## .!!!"#$ .!"
SPECLEFT HEADLEFT PARSE
English:0.5activation –6.5 –6 –12.5 Tamil:0.5activation –6.5 0 –12.5 Combinedweighting –13 –6 –25 H Pr[XPthey[X'gave]] –12 –16.5 –28.5 0.18[XPthey[X'koɖutaa]] –6 –21 –27 0.82
Inthesetableaux,violationsofPARSEarescaledbytheactivationoftheinput
element.Forexample,inTable7,thefirstcandidatedeleteskoɖutaa.Languagegeneral
constraintsassignaviolationof–10=0.4*–25andTamilspecificconstraintsassigna
violationof–5=0.4*–12.5.ComparethistothefirstcandidateinTable8.Here,the
violationoflanguagegeneralconstraintsincreasesto–11=0.44*–25andTamil-specific
constraintsto–5.5=0.44*–12.5.Astheactivationofaninputelementincreases,thecost
ofdeletingitalsoincreases—itbecomesmorecriticaltopreservetheelement(reflectedin
theshiftinoutputprobabilitiesacrossthetwotableaux).
![Page 41: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/41.jpg)
41
Gradientactivationofelementsintheoutputprovidesamechanismformodeling
thedatareviewedinthefirstsectionsofthispaper;gradientblendsobservedin
phonologicalandarticulatoryprocessinginspokenandsignedlanguages(e.g.,theco-
activationof<DOG>and<PERRO>duringSpanish-Englishbilingualproduction,depictedin
Figure1).Clearly,gradientsymbolstructureshavetheexpressivecapabilitytorepresent
suchstructures;throughoutthisdiscussion,wehaveassumedgradedactivationof
elementsoftheinputtothegrammar.Ourclaimisthatthedegreeofblendingpresentin
theoutputreflectsgrammaticalcomputations.
Outsideofnumericalsimulations,thefinalblendstatesofourfirstimplementation
oftheQUANTIZATIONconstraint(Smolenskyetal.,2014)cannotbedetermined.Inmore
recentwork(TupperandSmolensky,inprogress)wehavethereforedevelopednew
realizationsofthisconstraintthataremoreamenabletoanalysis.Usingthesemethods,we
cancalculatetheoptimalblendstatepredictedbythegrammar.
Toillustratethisapproach,weconsideredthescenarioshowninFigure1—the
coactivationoftwonounsintheheadofanNPconsistingofadeterminerandnoun.For
thiscomputation,wesimplifiedourgrammar,focusingonlyonQUANTIZATIONand
FAITHFULNESS(asSpanishandEnglishagreeonwordorderfornounsinthesephrases).
FollowingthescenariodepictedinFigure1,weassumedthatEnglishhasgreateractivation
thanSpanish.AsshowninFigure4,thissetofconstraintweightings6assignedhighest
Harmonytoablendstatethatisclosestto<DOG>(reflectingthehigheractivationof
Englishvs.Spanish),yetcontainssomepartialactivationofthetranslationequivalent
<PERRO>(reflectingtherelativeweightingofQUANTIZATION).Critically,thisdegreeof
blendingisnotassumed,butisratherderivedfromtheconstraintsofthegrammar.
![Page 42: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/42.jpg)
42
Figure4.RelativeHarmony(!!)ofvariousblendsof<DOG>(activationshownonXaxis)
and<PERRO>(activationshownonYaxis);lightercolorindicateshigherHarmony.The
optimalblend(0.81<DOG>,0.15<PERRO>)ismarkedwithanx.
Giveninputactivationsandrelativeconstraintweightings,aGSCtheorywillmake
predictionsaboutmultiplefacetsofcodemixedproductions:discrete(e.g.,output
probabilitiesofvariousstructures)aswellasgradient(e.g.,coactivationoftranslation
equivalents).Todevelopthisaccount,itisimportantthatwegainamoreprecise
understandingofthefactorsthatfacilitate(andinhibit)theactivationofrepresentations
withineachofabilingual’slanguagesduringsentenceprocessingaswellashowbilinguals
learntherelativeweightingsofgrammaticalconstraints.Critically,GSCprovidesuswitha
frameworkthatcanintegratethesevariousinfluencesoncodemixing—allowingusto
developaunifiedaccountofdiscreteandgradientpropertiesofbilinguallinguistic
knowledgeandprocessing.
![Page 43: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/43.jpg)
43
Conclusions
Wehavesoughttobringtogethertwotraditionsinbilingualresearch.Studiesofon-
linebehaviorhaveestablishedthatblendrepresentations—wheremultipleelementsare
co-presentwithinasinglestructuralposition—playakeyroleinbilinguallanguage
processingatalllevelsoflinguisticstructure.Studiesofcodemixinghaveemphasizedthe
rolethatgrammaticalknowledgeplaysinconstrainingbilingualsentenceproduction.We
usedthephenomenonofdoublingtohighlighttheconnectionbetweenthesetwolinesof
research:theintegrationofblendrepresentationsandgrammar.Toformallylinkthesetwo
aspectsofbilingualcognition,weintroducedanaccountofcodemixingbasedinthe
GradientSymbolicComputation(GSC)formalism.Usingviolable,rankedconstraints,we
characterizedtheprobabilisticgrammarsunderlyingcodemixing.Therankingofsuch
constraintsreflectstheweightedsumofrankingsineachlanguageinvolvedinacode
mixedutterancealongwithacontributionfromthesourcelanguageofeachelement.
Crucially,blendrepresentationsarepartoftheinputandoutputofthegrammar.This
providesapredictiveaccountofdoublingconstructions;specifically,wepredict
restrictionsontheinsertionofexpletiveelementsinblendedstructures.Finally,our
approachcanbeextendedtoaccountforgradedblendrepresentationsinbilingual
languageprocessing.
Theprinciplesofouraccountofcodemixing—blendrepresentations;probabilistic
grammarswithweightedconstraints—comefromgeneralprinciplesofGSC.Theyarenot
postulatedtoaccountforbilinguallanguageprocessingspecifically,butratherreflect
principlesofthecognitivesystemthatholdforallspeakers.Similarly,thegrammatical
![Page 44: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/44.jpg)
44
principlesweusetoaccountforcodemixingarethesameprinciplesthatunderlienon-
codemixedutterances.Ouraccountthereforedoesnotassumethatbilingualismingeneral
orcodemixingspecificallyrepresentsatypical,exceptionalcircumstances.Thatsaid,these
twoaspectsoflinguisticcognitionprovideakeytestcasefordiscoveringtheprinciples
thatunderliethecognitivearchitectureoflanguageprocessing.Codemixingisan
“experiment”inthenatural“laboratory”ofbilingualism,revealingtheinteractionofblend
representationsandgrammarthatisattheheartofGradientSymbolicComputation.
![Page 45: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/45.jpg)
45
References
Abels,K.(2003)Successivecyclicity,anti-locality,andadpositionstranding.Doctoral
dissertation,UniversityofConnecticut-Storrs.
Amengual,M.(2012).Interlingualinfluenceinbilingualspeech:Cognatestatuseffectina
continuumofbilingualism.Bilingualism:LanguageandCognition,15,517–530.
Azuma,S.(1993).Wordordervs.wordclass:portmanteausentencesinbilinguals.InP.M.
Clancy(Ed.),Japanese/Koreanlinguistics2(pp.193-204).Stanford:CSLI.
Backus,A.(1992).Patternsoflanguagemixing:astudyofTurkish-Dutchbilingualism.
Wiesbaden:Harrassowitz.
Balukas,C.,&Koops,C.(inpress).Spanish-Englishbilingualvoiceonsettimein
spontaneouscode-switching.InternationalJournalofBilingualism.
Barbiers,S.,Koeneman,O.,Lekakou,M.,&vanderHam,M.(Eds.)(2008).Syntaxand
semantics36:Microvariationinsyntacticdoubling.Bingley:Emerald.
Belazi,H.M.,Rubin,E.J.&Toribio,A.J.(1994).CodeswitchingandX-Bartheory:the
Functionalheadconstraint.LinguisticInquiry,25,221-237.
Bentahila,A.,&Davis,E.(1983).ThesyntaxofArabic-Frenchcode-switching.Lingua,59,
301-330.
Berk,L.M.(1999)Englishsyntax:Fromwordtodiscourse.Oxford:OxfordUniversityPress.
Bhatt,R.(1997).�Code-switching,constraints,andoptimalgrammars.Lingua,102,223-251.
Bishop,M.(2010).Happencan'thear:Ananalysisofcode-blendsinhearing,nativesigners
ofAmericanSignLanguage.SignLanguageStudies,11,205-240.
Bokamba,E.(1988).Code-mixing,languagevariation,andlinguistictheory:Evidencefrom
Bantulanguages.Lingua,76,21-62.
![Page 46: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/46.jpg)
46
Bresnan,J.(2000)Optimalsyntax.InJ.Dekkers,F.vanderLeeuw,&J.vandeWeijer(Eds.)
Optimalitytheory:phonology,syntaxandacquisition(pp.334-385).Oxford:Oxford
UniversityPress.
Broekhuis,H.,&Vogel,R.(Eds.)(2013)Linguisticderivationsandfiltering:Minimalismand
optimalitytheory.London:Equinox.
Carnie,A.(2010).Constituentstructure.Oxford:OxfordUniversityPress.
Chametzky,R.(2000).Phrasestructure:FromGBtominimalism.Malden,MA:Blackwell-
Wiley.
Chan,B.H.-S.(2003).Aspectsofthesyntax,thepragmatics,andtheproductionofcode-
switching,withspecialreferencetoCantonese-English.NewYork:PeterLang.
Chan,B.H.-S.(2008).Code-switching,wordorderandthelexical/functionalcategory
distinction.Lingua,118,777-809.
Chan,B.H.-S.(2009).Code-switchingbetweentypologicallydistinctlanguages.In:B.
Bullock&A.J.Toribio(Eds.),TheCambridgehandbookoflinguisticcode-switching
(pp.182-198).Cambridge:CambridgeUniversityPress.
Chomsky,N.(1981).Lecturesofgovernmentandbinding.Dordrecht:Foris.
CoppockE.(2010).Parallelgrammaticalencodinginsentenceproduction:evidencefrom
syntacticblends.LanguageandCognitiveProcesses,25,38–49.
Costa,A.,Caramazza,A.,&Sebastián-Gallés,N.(2000).Thecognatefacilitationeffect:
implicationsformodelsoflexicalaccess.JournalofExperimentalPsychology:
Learning,MemoryandCognition,26,1283-1296.
Deuchar,M.(2005).CongruenceandWelsh-Englishcodeswitching.Bilingualism:Language
andCognition,8,255-269.
![Page 47: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/47.jpg)
47
DiSciullo,A.-M.,Muysken,P.,&Singh,R.(1986).Governmentandcode-mixing.Journalof
Linguistics,22,1-24.
Emmorey,K.,Borinstein,H.B.,Thompson,R.,&Gollan,T.H.(2008).Bimodalbilingualism.
Bilingualism:LanguageandCognition,11,43-61.
Flege,J.E.(1991).Ageoflearningaffectstheauthenticityofvoice-onsettime(VOT)instop
consonantsproducedinasecondlanguage.JournaloftheAcousticalSocietyof
America,89,395–411.
Furukawa,T.(2008).Head/complementrelations:Portmanteaucode-switchingbetween
JapaneseandEnglish.LanguageandInformationScience6(pp.283-292).Tokyo:
UniversityofTokyo,InstituteforIntegratedCulturalStudies,DivisionofLanguage
InformationScience.
Gardner-Chloros,P.(2009).Sociolinguisticfactorsincode-switching.InB.E.Bullock&A.J.
Toribio(Eds.)The Cambridge handbook of linguistic code-switching(pp.97-
113).Cambridge,UK:CambridgeUniversityPress.
Goldrick,M.(2011).Utilizingpsychologicalrealismtoadvancephonologicaltheory.InJ.
Goldsmith,J.Riggle,&A.Yu(Eds.)Thehandbookofphonologicaltheory,2ndedition
(pp.631-660).Oxford:Wiley-Blackwell.
Goldrick,M.(2012).Neuralnetworkmodelsofspeechproduction.InM.Faust(Ed.)
Handbookoftheneuropsychologyoflanguage(vol.1,Languageprocessinginthe
brain:Basicscience,pp.125-145).Chichester,UK:Wiley-Blackwell.
Goldrick,M.,&Chu,K.(2014).Gradientco-activationandspeecherrorarticulation:
CommentonPouplierandGoldstein(2010).Language,CognitionandNeuroscience,
29,452-458.
![Page 48: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/48.jpg)
48
Goldrick,M.,Runnqvist,E.,Costa,A.(2014).Languageswitchingmakespronunciationless
native-like.PsychologicalScience,25,1031-1036.
Goldwater,S.,&Johnson,M.(2003).LearningOTconstraintrankingsusingamaximum
entropymodel.InJ.Spenader,A.Eriksson,Ö.Dahl(Eds.),Proceedingsofthe
workshoponvariationwithinOptimalityTheory(pp.111–120).Stockholm,England:
StockholmUniversity.
Grimshaw,J.(1997)Projection,headsandoptimality.LinguisticInquiry,28,373-422.
Grimshaw,J.(2001)EconomyofstructureinOT.ROA-434,RutgersOptimalityArchive,
http://roa.rutgers.edu.
Grimshaw,J.(2013).Lastresorts:Atypologyofdo-support.InH.Broekhuis&R.Vogel
(Eds.)Linguisticderivationsandfiltering:MinimalismandOptimalityTheory(pp.
267-295).UnitedKingdom:EquinoxPublishing.
Hagstrom,P.(1996).Do-supportinKorean:Evidenceforaninterpretivemorphology.InH.-
D.Ahn,M.-Y.Kang,Y.-S.Kim,&S.Lee(Eds.)Morphosyntaxingenerativegrammar:
Proceedingsofthe1996Seoulinternationalconferenceongenerativegrammar(pp.
169-180).Seoul,SouthKorea:HankukPublishingCo.
Hartsuiker,R.J.,Pickering,M.J.,&Veltkamp,E.(2004).Issyntaxseparateorshared
betweenlanguages?Cross-linguisticsyntacticpriminginSpanish-Englishbilinguals.
PsychologicalScience,15,409-414.
Hayes,B.(2009).MaxEntGrammarTool[Software].Retrievedfrom
http://www.linguistics.ucla.edu/people/hayes/MaxentGrammarTool.
Hayes,B.,&Wilson,C.(2008).Amaximumentropymodelofphonotacticsandphonotactic
learning.LinguisticInquiry,39,379-440.
![Page 49: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/49.jpg)
49
Hicks,C.(2010).Morphosyntacticdoublingincodeswitching.MAthesis,UniversityofNorth
Carolina-ChapelHill.
Hicks,C.(2012).Adual-structureanalysisofmorphosyntacticdoublingincodeswitching.
InD.Ross(Ed.)Studiesinthelinguisticsciences:Illinoisworkingpapers2012(pp.44-
57).http://hdl.handle.net/2142/35295.
Hsin,L.,Legendre,G.,&Omaki,A.(2013).Primingcross-linguisticinterferenceinSpanish-
Englishbilingualchildren.InBaiz,S.,Goldman,N.,&Hawkes,R.(Eds.)Proceedingsof
the37thannualBostonuniversityconferenceonlanguagedevelopment(pp.165-77).
Somerville,MA:CascadillaPress.
Jónsson,J.G.(2008).PrepositionreduplicationinIcelandic.InBarbiers,S.,Koeneman,O.,
Lekakou,M.,&vanderHam,M.(Eds.)Syntaxandsemantics36:Microvariationin
syntacticdoubling(pp.403-417).Bingley:Emerald.
Joshi,A.(1985).Processingofsentenceswithintrasententialcodeswitching.InD.R.
Dowty,L.Karttunen,&A.M.Zwicky(Eds.)Naturallanguageparsing:Psychological,
computationalandtheoreticalperspectives(pp.190-205).Cambridge:Cambridge
UniversityPress.
Kootstra,G.,vanHell,J.,&Dijkstra,T.(2010).Syntacticalignmentandsharedwordorderin
code-switchedsentenceproduction:Evidencefrombilingualmonologueand
dialogue.JournalofMemoryandLanguage,63,210-231.
Kroll,J.F.,&Gollan,H.(2014).Speechplanningintwolanguages:Whatbilingualstellus
aboutlanguageproduction.InM.Goldrick,V.Ferreira,&M.Miozzo(Eds.)The
Oxfordhandbookoflanguageproduction(pp.165-181).Oxford:OxfordUniversity
Press.
![Page 50: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/50.jpg)
50
Kuhn,J.(2003).Optimality-theoreticsyntax-Adeclarativeapproach.Stanford:CSLI.
Larson,R.(1988).Onthedoubleobjectconstruction.LinguisticInquiry,19,335-391.
Legendre,G.(2001).Anintroductiontooptimalitytheoryinsyntax.InG.Legendre,J.
Grimshaw,&S.Vikner(Eds.).Optimality-theoreticsyntax(pp.1-27).Cambridge,MA:
MITPress.
Legendre,G.,Grimshaw,J.,&Vikner,S.(Eds.)(2001).Optimality-theoreticsyntax.
Cambridge,MA:MITPress.
Legendre,G.,Miyata,Y.,&Smolensky,P.(1990).HarmonicGrammar—Aformalmulti-level
connectionisttheoryoflinguisticwell-formedness:Theoreticalfoundations.In
Proceedingsofthetwelfthannualconferenceofthecognitivesciencesociety(pp.388–
395).Hillsdale,NJ:LawrenceErlbaum.
Legendre,G.,Miyata,Y.,&Smolensky,P.(2006).Theinteractionofsyntaxandsemantics:A
HarmonicGrammaraccountofsplitintransitivity.InP.Smolensky&G.Legendre
(Eds.),Theharmonicmind:FromneuralcomputationtoOptimality-Theoretic
grammar.Vol.1:Cognitivearchitecture(pp.417–452).Cambridge,MA:MITPress.
Legendre,G.,Putnam,M.,deSwart,H.&Zaroukian,E.(inpress-a).Introduction.InG.
Legendre,M.Putnam,H.DeSwart,&E.Zaroukian(Eds.),Optimality-theoreticsyntax,
semantics,andpragmatics:Fromuni-tobidirectionaloptimization.Oxford:Oxford
UniversityPress.
Legendre,G.,Putnam,M.,deSwart,H.&Zaroukian,E.(Eds.)(inpress-b).Optimality-
theoreticsyntax,semantics,andpragmatics:Fromuni-tobidirectionaloptimization.
Oxford:OxfordUniversityPress.
![Page 51: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/51.jpg)
51
Legendre,G.,&Schindler,M.(2010).CodeswitchinginUrbanWolof:Acaseforviolable
constraintsinsyntax.RevistaVirtualdeEstudosdaLinguagem-ReVEL,8,47-75.
Legendre,G.,Wilson,C.,Smolensky,P.,Homer,K.,&Raymond,W.(1995).Optimalityand
wh-extraction.InJ.Beckman,S.Urbanczyck,&L.Walsh(Eds.)PapersinOptimality
Theory(UniversityofMassachusettsOccasionalPapers18,pp.607-636).Amherst,
MA:GraduateLinguisticsStudentAssociation.
Lohndal,T.(2013).Generativegrammarandlanguagemixing.TheoreticalLinguistics,39,
215-224.
MacSwan,J.(1999).Aminimalistapproachtointrasententialcodeswitching:Spanish-
NahuatlbilingualisminCentralMexico.London/NewYork:Routledge.
MacSwan,J.(2000).Thearchitectureofthebilinguallanguagefaculty:Evidencefrom
intrasententialcodeswitching.Bilingualism:LanguageandCognition,3,37-54.
Mahootian,S.(1993).Anulltheoryofcode-switching.PhDdissertation,Northwestern
University.
Melinger,A.,Branigan,H.P.,&Pickering,M.J.(2014).Parallelprocessinginlanguage
production.Language,CognitionandNeuroscience,29,663-683.
Menn,L.,&Duffield,C.J.(2013).Aphasiasandtheoriesoflinguisticrepresentation:
representingfrequency,hierarchy,constructions,andsequentialstructure.Wiley
InterdisciplinaryReviews:CognitiveScience,4,651-663.
Muysken,P.(2000).Bilingualspeech:Atypologyofcode-mixing.Cambridge:Cambridge
UniversityPress.
Muysken,P.(2013).Languagecontactoutcomesastheresultofbilingualoptimization
strategies.Bilingualism:LanguageandCognition,16,709-730.
![Page 52: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/52.jpg)
52
Myers-Scotton,C.(1993).Duelinglanguages:Grammaticalstructureincode-switching.
Oxford:ClarendonPress.
Myers-Scotton,C.,&Jake,J.J.(1995).Non�niteverbsandnegotiatingbilingualismin
codeswitching:Implicationsforalanguageproductionmodel.Bilingualism:
LanguageandCognition,17,511-525.
Nishimura,M.(1986).Intrasententialcode-switching:Thecaseoflanguageassignment.In
J.Vaid(Ed.),Languageprocessinginbilinguals:Psycholinguisticand
neuropsychologicalperspectives(pp.123-143).Hillsdale:LawrenceErlbaum
Associates.
Nunes,J.(2004).Linearizationofchainsandsidewardmovement.Cambridge,MA:MIT
Press.
Olson,D.J.(2013).Bilinguallanguageswitchingandselectionatthephoneticlevel:
AsymmetricaltransferinVOTproduction.JournalofPhonetics,41,407–420.
Pater,J.(2009).Weightedconstraintsingenerativelinguistics.CognitiveScience,33,999–
1035.
Pickering,M.J.,&Ferreira,V.S.(2008).Structuralpriming:Acriticalreview.Psychological
Bulletin,134,427-459.
Poplack,S.(1980).SometimesI’llstartasentenceinSpanishyterminoenespañol:Toward
atypologyofcode-switching.Linguistics,18,581-618.
Poplack,S.,S.Wheeler,&A.Westwood.(1989).Distinguishinglanguage-contact
phenomena:evidencefromFinnish-Englishbilingualism.WorldEnglishes,8,389-
406.
![Page 53: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/53.jpg)
53
Porterie-Gutierrez,L.(1988)Étudelinguistiquedel’aymaraseptentrional(Pérou-Bolivie)(=
ThèseAmerindia).Paris:A.E.A.
Prince,A.,&Smolensky,P.(1993/2004).Optimalitytheory:Constraintinteractionin
generativegrammar.TechnicalreportTR-2,RutgersCenterforCognitiveScience,
RutgersUniversity,NewBrunswick,NJ.TechnicalreportCU-CS-696-93,Department
ofComputerScience,UniversityofColorado,Boulder.Revisedversion,2002:ROA-
537,RutgersOptimalityArchive,http://roa.rutgers.edu.Published2004,Oxford:
Blackwell.
Pyers,J.E.,&Emmorey,K.(2008).Thefaceofbimodalbilingualismgrammaticalmarkers
inAmericanSignLanguageareproducedwhenbilingualsspeaktoEnglish
monolinguals.PsychologicalScience,19,531-535.
Rumelhart,D.E.,Hinton,G.E.,&McClelland,J.L.(1986).Ageneralframeworkforparallel
distributedprocessing.InD.E.Rumelhart,J.L.McClelland,&thePDPResearch
Group(Eds.),Paralleldistributedprocessing:Explorationsinthemicrostructureof
cognition.Vol.1:Foundations(pp.110–146).Cambridge,MA:MITPress.
Sankoff,D.(1998).Theproductionofcode-mixeddiscourse.InProceedingsofthe36th
AnnualMeetingoftheAssociationforComputationalLinguisticsand17th
InternationalConferenceonComputationalLinguistics(vol.I,pp.8-21).Association
forComputationalLinguistics.
Sankoff,D.,Poplack,S.,&Vannianiarajan,S.(1990).ThecaseofthenonceloaninTamil.
LanguageVariationandChange,2,71-101.
Sarma,V.M.(1999).Case,agreementandwordorder:Issuesinthesyntaxandacquisitionof
Tamil.Doctoraldissertation,MassachusettsInstituteofTechnology,Cambridge,MA.
![Page 54: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/54.jpg)
54
Schiffman,H.F.(1999).AreferencegrammarofspokenTamil.Cambridge:Cambridge
UniversityPress.
Sells,P.(2001a).Structure,alignmentandoptimalityinSwedish.Stanford:CSLI.
Sells,P.(Ed.)(2001b).Formalandempiricalissuesinoptimalitytheoreticsyntax.Stanford:
CSLI.
Smolensky,P.(2006a).Optimizationinneuralnetworks:Harmonymaximization.InP.
Smolensky&G.Legendre(Eds.),Theharmonicmind:Fromneuralcomputationto
Optimality-Theoreticgrammar.Vol.1:Cognitivearchitecture(pp.345–392).
Cambridge,MA:MITPress.
Smolensky,P.(2006b).Computationallevelsandintegratedconnectionist/symbolic
explanation.InP.Smolensky&G.Legendre(Eds.),Theharmonicmind:Fromneural
computationtoOptimality-Theoreticgrammar.Vol.2:,Linguisticandphilosophical
implications(pp.503-592).Cambridge,MA:MITPress.
Smolensky,P.,&Legendre,G.(2006).FormalizingtheprinciplesII:Optimizationand
grammar.InP.Smolensky&G.Legendre(Eds.),Theharmonicmind:Fromneural
computationtoOptimality-Theoreticgrammar.Vol.1:Cognitivearchitecture(pp.
207–234).Cambridge,MA:MITPress.
Smolensky,P.,Goldrick,M.,&Mathis,D.(2014).Optimizationandquantizationingradient
symbolsystems:Aframeworkforintegratingthecontinuousandthediscretein
cognition.CognitiveScience,38,1102-1138.
Spalek,K.,Hoshino,N.,Wu,Y.J.,Damian,M.,&Thierry,G.(2014).Speakingtwolanguages
atonce:Unconsciousnativewordformaccessinsecondlanguageproduction.
Cognition,133,226-231.
![Page 55: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/55.jpg)
55
Starreveld,P.A.,DeGroot,A.M.B.,Rossmark,B.M.M.,&VanHell,J.G.(2014).Parallel
languageactivationduringwordprocessinginbilinguals:Evidencefromword
productioninsentencecontext.Bilingualism:LanguageandCognition,17,258-276.
Stolz,T.(1996).GrammaticalHispanismsinAmerindianandAustronesianlanguages:The
otherkindoftranspacificisoglosses.Amerindia,21,137-160.
Woolford,E.(1983).Bilingualcode-switchingandsyntactictheory.LinguisticInquiry,14,
520-536.
![Page 56: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/56.jpg)
56
Footnotes
1Consistentwithananalysiswheredoublingcanariseduetotheco-presenceofmultiple
elementsintheinputtothegrammar,elementsfrommultiplealternativeformulationsof
anintendedmessagearesometimesco-presentinmonolingualspeecherrors(Coppock,
2Doublingofinflectionalelementshasalsobeenreported,bothwheninflectionalelements
occurindistinctpositions(e.g.,prefixationvs.suffixation;Bokamba,1988;Myers-Scotton,
1993)andwhentheyoccurinthesameposition(Backus,1992).Thislattertypehasnot
beenreportedwithnon-inflectionalelements,whichisthefocusoftheanalysishere.
3Thisgeneralapproachisconsistentwithderivational/Minimalistapproachestogrammar
(seee.g.,Broekhuis&Vogel2013;Legendre,Grimshaw,&Vikner.2001;Legendre,Putnam,
deSwart,&Zaroukian,inpress-b)aswellasconstraint-basedmodelssuchasLexical
FunctionalGrammar(seee.g.,Bresnan2000;Kuhn2003;Sells2001a,b).
4GSCassumesastochasticoptimizationalgorithmthatconvergestoadistributioninwhich
theprobabilityofcandidatecis: !!(!)!!!(!)!!.Here,H(c)istheharmonyofcandidatec,xranges
overthesetofallpossibleoutputcandidates,andTisaparameteroftheoptimization
algorithm.HereweassumethatThasalowerlimitof1.
5SeeSmolenskyetal.,2014,fordetailsonthestochasticoptimizationprocessesthat
generalizethisideatovaryinglevelsofactivationofelements(whichresultsinnon-linear
changestorelativeharmonyofdifferentrepresentationalstates).
6QUANTIZATION’scontributiontoharmonyisbasedontheactivationofeach
representationalelementeinagivenstructuralposition: !!! 1− !! ! + !!!! − 1 !! ;
![Page 57: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/57.jpg)
57
thisisweightedby–10.ThePARSEconstraint,weightedat+1,isdefinedas !!! !!! ,where
!!! istheactivationofeachelementintheinput(<DOG>=+2,<PERRO>=+1).Finally,
followingotherHarmonynetworks(Smolensky,2006a),thereisacontributionfromunit
Harmony(atermensuringtheharmonymaximumisafinitevalue): !! !! − !
!!
! ,
weightedat–11.
![Page 58: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/58.jpg)
58
Appendix
Table A1. Grammar fragment: Doubling construction, Tamil-English codemixing, showingfull set of candidates. Blank cells indicate the candidate does not violate the constraint.Probabilitiesarerounded;thoselessthan1x10-4arerepresentedas0.
Input:(3rd plural, grant)!"ɖ!"##
!"#$ SPECLEFT HEADLEFT COMPLEFT PARSE QUANT
English:0.5activation –6.5 –6 0 –12.5 Tamil:0.5activation –6.5 0 –6 –12.5 Combinedweighting –13 –6 –6 –25 –8 H Pr
[VPthey[V'gave[V'grant]koɖutaa]]
–30 –12 –16 –58 0.039
[VPthey[V'koɖutaa[V'grant]gave]]
–42 –12 –16 –70 0
[VPthey[V'gavegrant]] –12 –12 –37.5 –61.5 0.001[VPthey[V'koɖutaagrant]] –6 –12 –37.5 –55.5 0.48[VPthey[V'grantgave]] –24 –6 –37.5 –67.5 0[VPthey[V'grantkoɖutaa]] –12 –6 –37.5 –55.5 0.48[VP[V'grantgave]they] –39 –12 –37.5 –88.5 0[VP[V'grantkoɖutaa]they] –39 –6 –37.5 –82.5 0[VP[V'gavegrant]they] –39 –6 –37.5 –82.5 0[VP[V'koɖutaagrant]they] –39 –6 –37.5 –82.5 0[VPthey[V'gave]] –12 –75 –87 0[VPthey[V'koɖutaa]] –6 –75 –81 0[VP[V'gave]they] –19.5 –75 –94.5 0[VP[V'koɖutaa]they] –19.5 –75 –94.5 0[V'gavegrant] –6 –75 –81 0[V'koɖutaagrant] –6 –75 –81 0[V'grantgave] –12 –75 –87 0[V'koɖutaagrant] –6 –75 –81 0[VPthey[V'grant]] –6 –75 –81 0[VP[V'grant]they] –19.5 –75 –94.5 0[VPthey] –112.5 –112.5 0[V'gave] –112.5 –112.5 0[V'koɖutaa] –112.5 –112.5 0[V'grant] 0 0 –112.5 –112.5 0Ø 0 0 –150 –150 0
![Page 59: Running Head: Gradient symbols in code mixing Matthew](https://reader031.vdocuments.site/reader031/viewer/2022012915/61c5b165c0dc8d51ed26c5c1/html5/thumbnails/59.jpg)
59
Notethatthisgrammarfragmentpredictsthatintheabsenceofdoublingthemost
probablecodemixedproductionsareonesthatinserttheTamilverb,eitherintheTamilor
Englishwordorder.Notethatbothtypesofconstructionsareempiricallyattestedincode
mixing(Bhatt,1997).Whydoesthisoccurinthisspecificanalysis?TheHEADLEFT
constrainthasaweightingof0intheTamillinguisticsystemvs.–6intheEnglishsystem.If
onlyoneverbisretained(resultinginaviolationoffaithfulness),itisthereforemore
harmonictoretaintheTamilverb—itincursfewerviolationsofalignmentconstraints.In
thisfragmentwehavealsoincludedharmonicallyboundedcandidatestoillustrateall
possiblerepresentationaloutputforms.