hierarchical statistical inference and lexical diffusion of sound change

20
Hierarchical statistical inference and lexical diffusion of sound change Vsevolod Kapatsinski University of Oregon

Upload: felcia

Post on 25-Feb-2016

31 views

Category:

Documents


0 download

DESCRIPTION

Hierarchical statistical inference and lexical diffusion of sound change . Vsevolod Kapatsinski University of Oregon. Two kinds of change in Usage-based Phonology ( Bybee 1976, 2001 , 2002), Phillips (1984, 2001). Articulatorily -motivated sound change - PowerPoint PPT Presentation

TRANSCRIPT

Analogy and lexical diffusion of reductive sound change

Hierarchical statistical inference and lexical diffusion of sound change Vsevolod KapatsinskiUniversity of Oregon

Two kinds of change in Usage-based Phonology(Bybee 1976, 2001, 2002), Phillips (1984, 2001)

Articulatorily-motivated sound changeDriven by automatization of production (Browman & Goldstein 1992, Bybee 2001, 2002, Kapatsinski 2010, Mowrey & Pagliuca 1995)Tempered by avoidance of misperception (Lindblom 1990)Starts in high-frequency wordsE.g., word-final t/d deletion (Bybee 2002), memory/mammary (Hooper 1976)But what about analogy to reduced words?

Analogical changeDriven by pressure to be like similar items (analogy) and imperfect learning Starts with low-frequency wordse.g., irregular past tenses are all in high-frequency verbs

Implementing the opposing mechanismsReduction in use: Every time a word is used, it reducesWords are assumed to be units of articulatory planning and execution (Bybee 2001, 2002, Kapatsinski 2010)Or at least used in a reduction-favoring context (Bybee 2002, Raymond & Brown 2012)

Learning: Words are associated with typical rates of reduction (Bybee 2001, Erker & Guy 2012, Pierrehumbert 2001, 2002)word-specific phonetics

Is sound change in the phoneme or in the word? Its in both. Phonemes and words can be associated with rates of reduction (cf. Phillips 2001). Ascribing blame for reduction to phoneme vs. word is a process of hierarchical statistical inference.We implement this using lme4 in R (Baayen et al. 2008)Predicting reductionp(reduction) ~ 0 + 1*word

0 = overall probability of reduction (for this gesture/phoneme)1 = adjustment associated with individual wordA learning problem: Zipf (1949)Zipfs law:

In a corpus of any size, most words occur only once (Baayen 2001)For most words, we cannot estimate a word-specific reduction coefficient with any certainty

Erker & Guy (2012)

The standard solution (mixed-effects): Partial pooling / Word as a random effect(Gelman & Hill 2007: 252-259)The child as a mixed effects modelRecall the theory:Every time you use a word, its reduction probability is incrementedWhen children acquire language, they acquire word-specific and phoneme/gesture-specific phoneticsThey do not try to recover the coefficient associated with word frequencyThis is equivalent to saying the child is modeling reduction as an overall rate for a phoneme with a random deflection for each word

Prior evidence: Erker & Guy (2012)Pronoun use with Spanish verbsGrammatical effects are augmented in high-frequency wordsAs you would expect if within-word coefficients for grammatical predictors are calculated by the learner

Partial pooling and word frequencyPrediction: At late stages of an articulatorily-driven change, exceptionally conservative words are likely to have intermediate frequency of use. When high-frequency words come to be associated with reduced variant of the phoneme, low-frequency words will be pulled in.

The U-shaped frequency effect

Generation 1

Generation 10Possible case: Flappingt/d become flaps / V_V[-stress]

Change is affecting a particular sublexical unit

Lexical diffusion

Far gone in American English

Still variable

ExperimentReading sentences withWords found mostly in colloquial speech (N=15)I found the bullshitter.Words found mostly in formal speech (N=15)I found the emitter.Non-words (N=15)I found the lenitter.Formality estimated using BNC: informal: conv, drama, interview_oral, spch w script & not; formal: parliament, academic, broadcast news, courtroom, public debateBritishisms eliminated from colloquial set by comparing SUBTLEX-US & SUBTLEX-UK frequenciesPrediction: nonwords should be in-between

I would prefer a letter. I would prefer the latter. I would prefer a gatter.

She is looking for the butter. She is looking at the jitter. She is looking at The Witter.

I found the bullshitter. I found the emitter. I found the lenitter.

She is going to get even madder. She is going to find the highest bidder. She is going to get even gadder!

The presenter is great at shutting up the audience. The presenter is great at stating the obvious. The presenter is great at spating the audience.

That girl is so pretty! The world needs this treaty! The murl feeds the dretty.

He always tells dirty jokes! He always puts duty first! He always bicks puty off.

He bumped into his daddy. He's read about that study. He stood by the Gaddy.

How is everybody doing today? How is the antibody doing that? How is Don Abrimody doing now?

He found somebody online. He came to embody this principle. He came to Plembody today.

He is really into flirting with Jennifer. He is really into rating the stimuli. He is really into brating the blick.

She is really into her knitting.She is really into the setting. She is really into her mitting.

He is getting fatter.They just have to scatter. They are getting snatter.

TriplesLots of fillersThey were asked to withdraw from Crimea. North Korea is changing. Who amongst you have lived in Veneta? He had a lot of raw ability but too little patience. The store is closed today. Your visa is only valid until June. His visa does not allow him to stay. The Space Needle is Seattle's major landmark. David went to the opera in Seattle. They won't change the law. The law is changing fast. Jamaica pays close attention to Cuba's actions. One might observe that Iowa too is next to Illinois. Mike's idea is excellent but we do not have the resources. South Asia has limited space for its population. This government sees Canada as a tolerant multilingual society. Don't touch the chainsaw when it is in operation. China is occupying a strategic location on the east coast of Asia. China faces a huge task. Austin got a diploma in education. Who invented the telephone? Mold is found in the basement. In his will, Bill will leave the wheel to Will. Mood affects sentences we make. Mussels taste well with white wine sauce. Some people find diamonds in the bushes. Researchers at the University of Oregon find that saying sentences is healthy. Ben baked cookies. Each day he wakes up and says a sentence. I have recently read a paper claiming this. James washed his hands. Hugh could choose the hue of the hoop. You need to buy the cat some diamonds. My cat talks funny. Why would one want the weasels to win? Even the biggest elephant is not as big as a whale. Thanksgiving was fun. Blue benches buzz loudly. Mashed potatoes can be eaten with chili sauce. Ken did not find the gold. Cats like shiny things. The big bamboo stick is Mike's. Dogs can catch mice and small marsupials. Can Ken can cantaloupes? The floor can't wash itself!

Participants and measures40 (22 so far) adult native speakers of AmEng

Closure duration (rel. to preceding V)Minimum intensity (rel. to max in next V)Presence/absence of voicingPresence/absence of burstAnalysislmer(log(ClosureDur)~ spelling + underlyingPhoneme + Condition+ (1+Condition|Triple)+(1+Condition|Subject),data=data[ClosureDur>10 & ClosureDur