dispersion (and lack thereof) in consonant inventories · 2015. 8. 9. · introductionpart 1: stop...

Introduction Part 1: Stop Place of Articulation Part 2: Voice Onset Time Conclusions Appendix

Dispersion (and lack thereof) in consonantinventories

Ivy Hauser

2nd Year ConferenceMay 4, 2015

0


Introduction

Dispersion Theory (DT) in vowel inventories is robust(Liljencrantz and Lindblom 1972, Schwartz et al. 1997).General idea: vowel inventories tend towardsconfigurations which maximize perceptual distinctiveness.

1


Are consonants inventories also dispersed?

Preliminary: What is dispersion?Any discussion of dispersion assumes a phonetic space.For vowels: first and second formants.

2



Preliminary: What is dispersion?Any discussion of dispersion assumes a phonetic space.For vowels: first and second formants.

Spectrogram showing frequency over time.Darkness = intensity

Formants change with articulatory changes:

2



What is the right phonetic space for consonants?More perceptual cues available.Different types of consonants might behave differently.

This project: Given some phonetic space for consonants -How should dispersion be measured?Do we observe the predictions of DT?

3


Analysis of two spaces

Part 1: Stop place of articulation (POA)Schwartz et al. (2012): dispersion predicts [d-g-P] as theoptimal inventory.Does this hold up under a better metric for measuringdispersion?

Part 2: Stop voicing (VOT)DT prediction: bigger inventory = larger phonetic space,less variationDoes this hold with consonants?Test case with voicing contrasts in Hindi (4 contrasts perPOA) and English (2 contrasts per POA)

4


Part 1: Background

Dispersion prediction: Common inventories should bedispersed in the appropriate space.

Most common three stop inventory: [labial-coronal-velar]([b-d-g] [p-t-k])Schwartz et al. (2012): Vocal tract model data shows[coronal-velar-pharyngeal] is the most dispersed.Does this claim hold up under a better metric fordispersion?

This project: Re-analysis of vocal tract model data andproposal of new dispersion metric.

5


The vocal tract model

Data from vocal tract model based on Maeda (1990), usedby Schwartz et al. (2012).50,000 stop tokens generated along the vocal tract andclassified according to POA.Formants measured at onset of following vowel, firstappearance of formant structure after release.

Formant transitions from /d/

6


Formant transitions as a space for stops

Are formant transitions into the following vowel a reasonablespace?

Yes - experimental evidence shows formants are primarycues.Walley and Carrell (1983): Adults and children use formanttransitions for identification when burst and transitionsconflict.Sussman et al. (1991): F2 transitions between consonantand vowel effectively categorize stops by place (locusequations).

7


The data: F1/F2 space

F1/F2 (kHz): lots of overlap, F1 difference for epi-pharyngeals only8


The data: F2/F3 space

F2/F3 (kHz): lots of overlap, categories different shapes and sizes9


What categories are dispersed in this space?

Claim: [coronal-velar-pharyngeal] is the most dispersedconfiguration. (Schwartz et al., 2012)

when

Dispersion = the mean-to-mean distance between categories.

Conclusion is entirely dependent on dispersion metric.

10


Mean to mean dispersion

POA1 POA2 POA3 Dispersion (area)1 coronal epi-pharyngeal velar 0.42 coronal uvular velar 0.33 epi-pharyngeal palatal velar 0.294 bilabial coronal epi-pharyngeal 0.235 bilabial coronal velar 0.236 coronal epi-pharyngeal uvular 0.217 bilabial epi-pharyngeal palatal 0.28 coronal epi-pharyngeal palatal 0.29 epi-pharyngeal palatal uvular 0.2

10 coronal palatal uvular 0.1911 palatal uvular velar 0.1912 bilabial coronal palatal 0.1613 bilabial palatal velar 0.1614 bilabial epi-pharyngeal velar 0.115 epi-pharyngeal uvular velar 0.116 coronal palatal velar 0.08717 bilabial palatal uvular 0.08518 bilabial coronal uvular 0.08219 bilabial uvular velar 0.06920 bilabial epi-pharyngeal uvular 0.043

11


F2/F3 space and mean to mean distance

12


F2/F3 space and mean to mean distance

13


The problem with mean-to-mean distance

Intuitively not equal to dispersion:Two distributions - same means but different variances.

We want the one on the left to be better dispersed.

14


Incorporating distribution shape into dispersion

Within category variance:Should have some effect on perception.Should be considered in a dispersion metric.

Jeffries-Matusita DistanceNot measuring between two points (mean to mean) butbetween two distributions.Incorporates covariance of the category distribution - themultidimensional analog of variance.A true distance metric (metric axioms: symmetry, triangleinequality)

15


Jeffries-Matusita distance

POA1 POA2 POA3 Mean JMdist Area1 bilabial coronal epi-pharyngeal 1.413 0.864542 bilabial palatal uvular 1.4124 0.863793 bilabial epi-pharyngeal palatal 1.4124 0.86384 bilabial coronal uvular 1.4111 0.862175 epi-pharyngeal palatal uvular 1.4058 0.855526 bilabial epi-pharyngeal uvular 1.4057 0.855467 epi-pharyngeal uvular velar 1.4056 0.855348 coronal epi-pharyngeal uvular 1.4038 0.853199 coronal epi-pharyngeal velar 1.4024 0.85131

10 coronal uvular velar 1.4004 0.8488411 bilabial epi-pharyngeal velar 1.3555 0.7871312 bilabial uvular velar 1.3553 0.7869713 coronal epi-pharyngeal palatal 1.3535 0.7842314 coronal palatal uvular 1.3516 0.7822415 bilabial coronal palatal 1.3505 0.7811816 bilabial coronal velar 1.3426 0.773517 epi-pharyngeal palatal velar 1.3365 0.7589118 palatal uvular velar 1.3364 0.7587719 bilabial palatal velar 1.2761 0.6921720 coronal palatal velar 1.2641 0.6823 16


Part 1 Conclusions

Dispersion does not make the correct predictions for typologicalfrequency in stops.

The [labial-coronal-velar] inventory will not be optimal inthis space even with a better metric.Pharyngeals and epiglottals - problematic places fordispersion because of F1 differences.Dispersion is not typologically predictive for consonants inthis space, cannot solely explain prevalence of [b-d-g].

17


Part 2: VOT space

Voice onset time durations as a different space for dispersion inconsonants.

DT prediction: Overall phonetic space size shouldincrease with number of segments, within categoryvariation should decrease (Liljencrantz and Lindblom,1972).Do these predictions extend to consonants?Test case: VOT in Hindi (4 contrasts per POA) andEnglish (2 contrasts per POA) as a test case.

18


The experiment

Participants: Two speakers each of Hindi and AmericanEnglish. All the speakers were female and between ages 20-35.

Stimuli: CVC words and non-words with vowels [i a u], recordedin carrier phrases (“Say X again” in English).

Goal for stimuli to be as similar as possible betweenlanguages.Words and non-words used for Hindi and English.English stimuli grouped further according to lexicalstatistics.

19


Segments under consideration

Labial Dental/Alveolar Retroflex VelarHindi p ph b bh t th d dh ú úh ã ãh k kh g gh

English p b t d k g

Four contrasts per place of ariculation in Hindi.Two contrasts per place of articulation in English.

20


Analysis

Voiceless stops: positive VOT measured from the burst to theonset of voicing.

21


Analysis

Prevoiced stops: negative VOT measured from the start of theocclusion to the burst.

22


English results

Speakers: No significant differences in VOTs for any segmentsin any vowel contexts.Data collapsed over speakers.

VOTs in / i/ context: 2 distinct modes, other contexts similar

23


Hindi results

Speakers: Differences in VOTs /p/, /bh/, /t/, /dh/, /k/.Data collapsed over speakers.

VOTs in / a/ context: 3 distinct modes, other contexts similar

24


Results: Hindi vs. English

VOTs in Hindi and English collapsed over contexts and speakers:3 modes in Hindi, 2 modes in English.Space doesn’t appear bigger in Hindi.

25


Results: Hindi vs. English

VOT distributions: Hindi space doesn’t look bigger.English boxes do look bigger - more variation.

26


Within category variance

Place English VOT (ms) Hindi VOT (ms) Levene’s Test pLabial-vcls 75.53 ± 21.52 85.11 ± 9.67 6.15e-05 ***Labial-vcd -105.1 ± 16.8 -91.56 ± 14.38 0.02 *

Coronal-vcls 93.45 ± 20.3 86.38 ± 12.25 0.002 **Coronal-vcd -101.3 ± 21.23 -89 ± 17.13 0.04 *Velar-vcls 100.8 ± 19.4 97.4 ± 18.6 0.9Velar-vcd -90 ± 17.2 -72 ± 16.4 0.7

Variance within VOT categoriesLevene’s Test: whether variances are equalEnglish categories have more variation except for velars.Variation prediction of DT: held up in VOT space for labialsand coronals.

27


Conclusions: VOT and dispersion predictions

Space-size prediction: not held up.→ But this is very context dependent.

In isolation English would not have negative VOTs and theHindi space would be bigger.Context is part of the phonetic space.

Variation prediction: holds for all segments except velars.Within category variances of English VOTs categories arebigger than Hindi.

28


Overall conclusions

Does dispersion theory apply to consonant inventories?

As a predictive typological model: NoDispersion doesn’t predict typological trends.Possible options: consonants are not dispersed, or wedon’t have the right phonetic space.

But perceptual distinctiveness effects are real inconsonants.

VOT variance data in Hindi and English give evidence forthis.Perceptual distinctiveness without the typologicalframework of dispersion theory.

29


Acknowledgements

Thanks to John Kingston, Kristine Yu, Sang-Im Lee Kim, LynFrazier, Sound Workshop and PRG audiences for advising andinput.Thanks to Sang-Im Lee Kim, Yangsook Park, Sakshi Bhatia,Jyoti Iyer, Coral Hughto, and Megan Somerday for voices.

This material is based upon work supported by the NationalScience Foundation Graduate Research Fellowship Programunder Grant Number 2014175439. Any opinions, findings, andconclusions or recommendations expressed in this material arethose of the author(s) and do not necessarily reflect the viewsof the National Science Foundation.

30


Stop system data

All data from the 628 phonological inventories of P-base.

Assumptions:d stands for both alveolar and dental place.Retroflexes not included because they are usuallycontrastive.

Languagesb d g 429b d - 35b - g 2- d g 5none 133

31


Stop system data

Languagesp t k 545p t - 3p - k 0- t k 59

none 15

Languagesm n N 327m n - 276m - N 1- n N 0none 10

32


Triangle areas

A geometric problem:

We want equilateral - Incorporating variance of sides.

33


English results by word type

34


Example stimuli

Example Stimuli:Language C1 vowel stimulus (IPA) type

Hindi b i bit wordHindi kh i khil wordHindi bh u bhut wordHindi d a dag word

English p i pis word-hiEnglish t u tun word-lowEnglish t i tiT word-hiEnglish b a bag word-low

35


VOT means: Hindi vs. EnglishEnglish

segment VOT- -ph 74.07- -th 92.23- -- -- -kh 98.99b -105.05- -d -101.25- -- -- -g -91.35- -

Hindi

segment VOTp 10.08ph 83.86t 12.47th 86.58T 11.25Th 82.11k 39.97kh 100.00b -99.03bh -83.69d -98.44dh -79.91D -95.75Dh -80.08g -77.83gh -80.00

36


Covariance

covx,y =∑N

i=1(xi−x)(yi−y)N−1

37

dispersion (and lack thereof) in consonant inventories · 2015. 8. 9. · introductionpart 1: stop...

Documents