focus contrast in web harvested data
DESCRIPTION
Focus Contrast in Web Harvested Data. Mats Rooth Linguistics and CIS Cornell University based on joint research with Jonathan Howell. Radio sites. Hundreds use Everyzing/Ramp technology Full ASR transcripts often available Time offset sometimes available - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/1.jpg)
Focus Contrast in Web Harvested Data
Mats RoothLinguistics and CISCornell University
based on joint research with Jonathan Howell
![Page 2: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/2.jpg)
![Page 3: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/3.jpg)
![Page 4: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/4.jpg)
![Page 5: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/5.jpg)
![Page 6: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/6.jpg)
![Page 7: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/7.jpg)
![Page 8: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/8.jpg)
![Page 9: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/9.jpg)
Radio sites
• Hundreds use Everyzing/Ramp technology• Full ASR transcripts often available• Time offset sometimes available• Either URL of audio or RSS feed almost always
available• Not not enough hits for one target on a single
site• A lot or repetitions of same audio• Seemingly less “spontaneous” speech than on
Everyzing
![Page 10: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/10.jpg)
Youtube
• Searchable closed captions, some obtained with ASR and some provided by video author
• Time offset available on hit page and in URL• Youtube player can seek to a time• Transcript of snippet available• Full transcript not available• Not enough data now• Can hope that a lot of indexed spontaneous
speech will become available
![Page 11: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/11.jpg)
Reuters Insider
• Searchable audio based on Everyzing/Ramp
• Full transcripts available
• Player seeks to timestamp
![Page 12: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/12.jpg)
![Page 13: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/13.jpg)
Goals
Assemble large, focused datasets of examples where intonation varies in a way that correlates with syntax, semantics, or pragmatics.
Study correlation between lexical/grammatical/pragmatic context and acoustic realization.
![Page 14: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/14.jpg)
he stayed longer than I did
-er [[ he he stayed x long]2
than [ IF stayed x long ]~2]
[ y stayed x-long ] antecedent clause
[ speaker stayed x-long ] scope of focus
![Page 15: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/15.jpg)
… I should have liked that song a lot more than I did.
[more
x[[should w[ I like that song x well in w]]
than [I like that song x well in w0]]]
![Page 16: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/16.jpg)
I understand even less than I did before
even less [[ I prs understand x much]2
than [I understood x much beforeF] ]~2]
![Page 17: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/17.jpg)
Alternative semantics for focus-er [[ he he stayed x long]2
than [ IF stayed x long ]~2][ y stayed x-long ] antecedent clause[ speaker stayed x-long ] scope of focusSemantics of focus is the set of alternative
propositions of the form ‘y stayed x long’.Licensing condition for focus The proposition
contributed by the antecedent is an element of the alternative set that is distinct from the proposition contributed by the scope.
![Page 18: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/18.jpg)
Givenness/Entailment semantics for focus
[ y stayed x-long ] antecedent clause
[ speaker stayed x-long ] scope of focus
Licensing condition for focus The antecedent entails the union of the alternative set (focus existential closure).
If he stayed d long, then someone stayed d long.
![Page 19: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/19.jpg)
Alternative semantics and givenness semantics are predictive theories of focus licensing, if the antecedent is stipulated.
Almost always, the antecedent for focus in the than-clause is the main clause.
With that hedge, grammar makes a prediction about where focus should go.
Try to correlate this with acoustic signal.
![Page 20: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/20.jpg)
Focus in comparative clauses
Coherent semantic theory about where focus should go
Possibilities are constrained, because the main clause is usually the antecedent for focus interpretation in the comparative clause
On a theoretical basis, we often think we know the correct grammatical analysis of comparative sentences people use, including the features that determine focus
Nice model system for studying contextual conditioning and phonetic realization of contrastive intonation
![Page 21: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/21.jpg)
![Page 22: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/22.jpg)
![Page 23: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/23.jpg)
![Page 24: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/24.jpg)
![Page 25: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/25.jpg)
Automatic harvest procedure
Replicates how a user would interact with website.
![Page 26: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/26.jpg)
![Page 27: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/27.jpg)
![Page 28: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/28.jpg)
curl retrieve information designated by URL
cutmp3 cut audio file given offsets
awk process html
awk, bash
make control
Time for one run retrieving 1000 hits is less than a day.
![Page 29: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/29.jpg)
116 a1135.g.akamai.net
110 hosted-media.podzinger.com
76 media.weei.podzinger.com
58 feeds.wnyc.org
54 media.libsyn.com
51 podcastdownload.npr.org
50 feeds.feedburner.com
39 library.kraftsportsgroup.com
33 www.whiterosesociety.org
24 www.kpbs.org
21 www.podtrac.com
21 media.wrko.podzinger.com
![Page 30: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/30.jpg)
Jonathan Howell
![Page 31: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/31.jpg)
22 081
397991
43 51 57
328
520
1118
154
3734
na
3750
3500
3250
1000
750
500
250
than he did he himself his own for one thing the one thing
Switchboard Everyzing (collected/verified) Everyzing (projected)
Jonathan Howell
![Page 32: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/32.jpg)
Classification experiment
He stayed longer than IF did. s classantecedent: He stayed x long
I should have liked that song a lot more than
I didF. ns class antecedent: I should have liked that song x
much
I understand even less than I did beforeF
I understand even x little ns class
![Page 33: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/33.jpg)
![Page 34: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/34.jpg)
SVM classifier in R statistical environement (e1071 package)
308 acoustic parameters extracted with Praat
91 tokens in cross-validated design
(Several hundred more tokens with similar results.)
![Page 35: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/35.jpg)
1. all parameters3. duration of “I” only4. duration of “I”, duration of “d” closure, formant
difference 40% into “I”
![Page 36: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/36.jpg)
Jonathan Howell
![Page 37: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/37.jpg)
Jonathan Howell
![Page 38: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/38.jpg)
Method suggested by comparatives experiment
Find common grammatical or lexical contexts that trigger representations with different prosodic realization, according to relatively well-understood and well-supported theory.
Correlate the semantic-grammatical categories directly with the speech signal using machine learning.
Don’t worry about phonemic/morphemic categories like the accent types H* and L+H*, or assume they be annotated on the basis of pitch contour.
![Page 39: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/39.jpg)
![Page 40: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/40.jpg)
Fery and Ishihara (2009) Journal of Linguistics 45.3
SOF: Prenuclear
Die meisten unserer Kollegen waren beim Betriebsausflug lässig angezogen. Nur Peter hat eine Krawatte getragen.
Nur Peter hat sogar einen Anzug getragen.
![Page 41: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/41.jpg)
He’s gotta pick someone who is younger than he is, and is definitely more conservative than he is.
[-er [ t is d young than he is d young]]2 and
more [[ t is is d conservativeF]3
than [ heF is d conservative ] ~3 ] ~2
![Page 42: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/42.jpg)
+Generic corpus of focused pronouns
The SVM classifier is good at detecting focused pronouns using local features on pronoun:
Duration of vowel “I” [ai]
Distance between f1 and f2 halfway into vowel “i” [ai]
![Page 43: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/43.jpg)
Method suggested by comparatives experiment
Find common grammatical or lexical contexts that trigger representations with different prosodic realization, according to relatively well-understood and well-supported theory.
Correlate the semantic-grammatical categories directly with the speech signal using machine learning.
Don’t worry about phonemic/morphemic categories like the accent types H* and L+H*, or assume they be annotated on the basis of pitch contour.
![Page 44: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/44.jpg)
Inherently contrastive phrases
in MY opinion ... admits that other things might be true in
other people’s opinionsNEXT Friday ... at end weekly Friday radio programon the TENOR saxophone ... in Jazz program where there is
frequent mention also of the Alto saxophone
![Page 45: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/45.jpg)
1162 of> my life1110 in> my life681 in> my mind377 in> my opinion276 in> my view231 in> my heart217 of> my career199 in> my career183 in> my head146 with> my life146 with> my family141 on> my way
140 of> my mind139 on> my part134 in> my lifetime125 in> my office115 of> my family108 with> my wife106 on> my face106 in> my house99 on> my mind96 over> my head96 in> my family91 for> my family90 in> my face
![Page 46: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/46.jpg)
![Page 47: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/47.jpg)
+ Does general SVM pronoun focus classifier work on SOF tokens?
+ How common is SOF?
![Page 48: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/48.jpg)
[you made a very small amount more than I did]2
[nowF I make muchF more than youF do] ~2
2 is of the form
required form of antecedent: at t speaker makes d-much more than hearer makes
actual: at t hearer makes d-much more than speaker makes
![Page 49: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/49.jpg)
two SOF tokens
You made a very small amount more than I did. Now I make muchF more than youF do.
![Page 50: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/50.jpg)
There is a correlation between the string context and prosody type?
+ Learn information-theoretically
-- two distributions of acoustic pronoun realizations
-- two distributions of trigram contexts that condition them
![Page 51: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/51.jpg)
P( in opinion) =def
P(type 1) P( 〈 in,opinion 〉 | type 1)
P( | type 1) +
P(type 2) P( 〈 in,opinion 〉 | type 2)
P( | type 2)
![Page 52: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/52.jpg)
What don’t we know about Focus realization? Accent type
Claim that English focal accents divide into• Topic (T), contrastive theme, L+H*• Focus (F), H*
What about Anna? Who did she come with?
AnnaT came with MannyF.What about Manny? Who came with him?
AnnaF came with MannyT.
![Page 53: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/53.jpg)
Attempt to make do pragmatically without a T/F distinction in alternative semantics
Michael Wagner (2008). A Compositional Theory of
Contrastive Topics. NELS 28.
Controversy whether there is a categorial phonetic distinction among H*, L*+H, L+H*.
![Page 54: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/54.jpg)
He’s gotta pick someone who is younger than he is, and is definitely more conservative than he is.
[-er [[t is d youngF]5
than [heF is d young] ~5 ]]2 ~4 and
more [[ t is is d conservativeF]3
than [ heF is d conservative ] ~3 ]4 ~2
![Page 55: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/55.jpg)
A. Nenkova, J. Brenier, A. Kothari, S. Calhoun, L. Whitton, D. Beaver, D. Jurafsky To memorize or predict: prominence labeling in conversational speech
Sasha Calhoun. Information Structure and the Prosodic Structure of English: a Probabilistic Relationship. PhD thesis, University of Edinburgh, 2006
Markup and prediction of accented words in Switchboard corpus
Try to do this for pronouns only
![Page 56: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/56.jpg)
Inherently contrastive phrases
in MY opinion ... admits that other things might be true in
other people’s opinionsNEXT Friday ... at end weekly Friday radio programon the TENOR saxophone ... in Jazz program where there is
frequent mention also of the Alto saxophone
![Page 57: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/57.jpg)
There is a correlation between the string context and prosody type?
+ Learn information-theoretically
-- two distributions of acoustic pronoun realizations
-- two distributions of trigram contexts that condition them
![Page 58: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/58.jpg)
There is a correlation between the string context and prosody type?
+ Learn information-theoretically
-- two distributions of acoustic pronoun realizations
-- two distributions of trigram contexts that condition them
![Page 59: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/59.jpg)
What don’t we know about Focus realization? Accent type
Claim that English focal accents divide into• Topic (T), contrastive theme, L+H*• Focus (F), H*
What about Anna? Who did she come with?
AnnaT came with MannyF.What about Manny? Who came with him?
AnnaF came with MannyT.
![Page 60: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/60.jpg)
What don’t we know about Focus realization? Non-anaphoric focus.
Fery and Samek-Lodovici (2007) Language 82.1
[(An AMERICANf farmer) (with a purple CHEVROLET) (was talking to a CANADIANf farmer) (with a purple Chevrolet)]f
![Page 61: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/61.jpg)
What don’t we know about Focus realization? Accent type
Claim that English focal accents divide into• Topic (T), contrastive theme, L+H*• Focus (F), H*
What about Anna? Who did she come with?
AnnaT came with MannyF.What about Manny? Who came with him?
AnnaF came with MannyT.
![Page 62: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/62.jpg)
two SOF tokens
You made a very small amount more than I did. Now I make muchF more than youF do.
![Page 63: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/63.jpg)
He’s gotta pick someone who is younger than he is, and is definitely more conservative than he is.
[-er [ t is d young than he is d young]]2 and
more [[ t is is d conservativeF]3
than [ heF is d conservative ] ~3 ] ~2
![Page 64: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/64.jpg)
![Page 65: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/65.jpg)
![Page 66: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/66.jpg)
![Page 67: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/67.jpg)
Distribution of datasets
Audio snippets can probably by distributed under fair use.
http://confluence.cornell.edu/display/prosody/Prosody+Datasets
![Page 68: Focus Contrast in Web Harvested Data](https://reader035.vdocuments.site/reader035/viewer/2022062217/568140de550346895daca893/html5/thumbnails/68.jpg)
• A lot of naturalistic data bearing on theories of
prosody can be found using search engines that index audio using ASR.
• Machine learning classification is a good methodology for prosody, because one can work with semantic-pragmatic categories that figure in formal theories.
• For focus, try to do build classifiers, not just find statistically significant correlations with acoustic parameters. Classifiers such as SVM can combine information from a lot of features.