diffusion of lexical innovations - investigating the spread of … › personen › wiss_ma ›...
TRANSCRIPT
![Page 1: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/1.jpg)
# 1/36
Diffusion of Lexical InnovationsInvestigating the Spread of English Neologisms on the Web and
on Twitter
Quirin Würschinger, LMU Munich
FJUEL 2018Munich
14 September, 2018
![Page 2: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/2.jpg)
# 2/36
Research questions
I Which new words enter the English language?I How do they diffuse?I Which factors affect how they diffuse?
![Page 3: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/3.jpg)
# 3/36
Which new words enter the English language?
![Page 4: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/4.jpg)
# 4/36
Urban Dictionary
![Page 5: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/5.jpg)
# 5/36
What is a ‘new word’?
I nonce-formations: used once, but have not diffusedI neologisms: have diffused to some degree, but are still
perceived to be ‘new’I conventional words: have successfully diffused and are
known to the majority of the speech community
![Page 6: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/6.jpg)
# 6/36
Which words enter the English lexicon?Morphological productivity
0,0%
5,0%
10,0%
15,0%
20,0%
25,0%
1950-1960 1960-1970 1970-1980 1980-1990 1990-2000 2000-2010
blends rel. initialisms rel. acronyms rel. clippings rel.
![Page 7: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/7.jpg)
# 7/36
Which words are entering the English language?
NeoCrawler: Discoverer module (Kerremans and Prokic 2018)I goal: investigating incipient diffusionI method:
I retrieve sample of web pagesI dictionary matchingI semi-manual selection of candidatesI store in database (≈ 1,000 lemmas)
![Page 8: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/8.jpg)
# 8/36
How do new words diffuse and become conventional?
![Page 9: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/9.jpg)
# 9/36
Previous work
I cultural innovation: S-curves (Rogers 1962; Rogers andShoemaker 1971), big data (Kim, McFarland and Leskovec2017)
I sociolinguistics and language change: mainly phonologyand syntax, diffusion, early and late adopters (Labov 1980;J. Milroy and L. Milroy 1985; Croft 2000)
I structural: lexicalization, institutionalization, establishment(Bauer 1983; Lipka 1992)
I corpus linguistics:I recent work: large-scale studies, bigger samples (Eisenstein
et al. 2014; Grieve, Nini and Guo 2016)I tools: NeoCrawler (Kerremans, Stegmayr and Schmid 2012),
Wortwarte (Lemnitzer 2018), Logoscope (Bernhard et al.2015), Neoveille (Cartier 2017)
![Page 10: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/10.jpg)
# 10/36
S-curve
Figure 1: Integration of Milroy’s and Rogers’ model of diffusion stages into an S-curve(Kerremans 2015, p. 65)
![Page 11: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/11.jpg)
# 11/36
How can we model diffusion?
The EC model (Schmid 2015) – a simplified account:I coining: first useI usualization: agreement over communicative functionI diffusion: spread to new usage contexts and speakersI normation: establishment of norms about how to use new
words
![Page 12: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/12.jpg)
# 12/36
Which factors influence diffusion?
lemma-inherent (type level)I form
I transparencyI productivity of
word-formation patternI formal appeal
I meaningI semantic domainI existing near-synonymsI nameworthiness
in usage (token level)I sociolinguistic
I density of social networkI speakers’ prestige
I cognitiveI formal salience in useI metalinguistic uses
I pragmaticI type of source
I emotive-affectiveI sentiment
![Page 13: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/13.jpg)
# 13/36
Dimensions of diffusion
new uses bring about . . .I spread across speakersI spread across usage contexts
low high
low hypostatization alt-left
high electron DNAspeakers
usage contexts
![Page 14: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/14.jpg)
# 14/36
How can we measure diffusion empirically?
I detecting candidates: DiscovererI investigating diffusion
I on the web: NeoCrawlerI on social media: Twitter
![Page 15: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/15.jpg)
# 15/36
How do new words spread on the web?
![Page 16: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/16.jpg)
# 16/36
NeoCrawler (Kerremans, Stegmayr and Schmid 2012)
I weekly Google Searches1 (about 1,000 lemmas)I download all html pages foundI pre- and post-processingI corpus compilation
1Google Custom Search API
![Page 17: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/17.jpg)
# 17/36
ResultsWord classes
DFG form 53.01– 03/18 page 7 of 20
Figure 2: Distribution in terms of word class and underlying word-formation patterns
It should be noted that our data are not collected by means of a systematic sampling method, but are based on the Discoverer’s capacity to detect new words on the web and on Twitter. To check whether the composition of our sample covers the spectrum of lexical innovations as investigated by previous lexicographic work, a systematic investigation of new words recorded in the OED was conducted by the PhD candidate working on the project. A quantitative analysis of all neologisms which have entered the OED since 1800 has been found to be largely in line with the composition of word classes and word-formation processes in our sample. 1.4.2 Discussion in the light of initial hypotheses regarding factors affecting diffusion 1.4.2.1 Productivity The distribution of our data in terms of word class and word formation are an indicator of the productivity of the different word-formation patterns and word classes on a macroscopic scale. They reflect speakers’ tendencies to coin new words and recruit word-formation patterns for lexical innovations. The distribution of word classes in our sample (see Figure 2.1) is dominated by a high percentage of nouns (79%), followed by lower percentages of adjectives (15%) and verbs (12%), adverbs (1%) and phrases (1%). This is in line with the expectation that new nouns are particularly useful for naming innovative products, concepts and practices which are salient in public discourse. Our quantitative study of OED data has shown that the distribution of word classes among neologisms that have entered the lexicon since 1800 has remained very stable over time. In the period between 1950 and 2010, a total of 14,796 new nouns (69%), adjectives (22%), verbs (8%) and adverbs (1%) have been entered. In comparison with these data, our sample features a slightly higher proportion of nouns. While our sample cannot claim full representativity, the numbers indicate that our database of neologisms at least reflects the distribution of new words in terms of word classes mirrored in the OED (with all due reservations regarding the OED’s text sampling, lemma inclusion policy, etc.). Regarding word-formation processes, compounding (37%), blending (31%) and derivation (24%) have given rise to the great majority of new words we detected (see Figure 2.2). The dominance in productivity of these three patterns is in accordance with previous studies (e.g. Bauer 1983). As with the evaluation of word classes, a quantitative comparison with OED data was drawn. While compounds (38%) and derivations (19%) account for a similarly large proportion of new words in the OED in the period between 1990 and 2010, blends (6%) are much rarer than in our data sample, even though this number has increased significantly in recent years. The rising productivity of blending in the formation of new words is in line with previous quantitative investigations which regard blends as increasingly productive formations typical of newspapers and language use on the web (Ayto 2003). The final thesis by Andrea Birkmüller (see 1.5) has shown that blending has increased in productivity over the past 20 years, both in terms of innovations as such and in terms of diffusion. The fact that blending is often assumed to produce more short-lived formations (Algeo 1998) might partly account for the higher numbers of blends in our data of incipient diffusion compared with lower numbers of blends entered as fairly conventional words in the OED. With regard to the relation between
![Page 18: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/18.jpg)
# 18/36
ResultsWord-formation processes
DFG form 53.01– 03/18 page 7 of 20
Figure 2: Distribution in terms of word class and underlying word-formation patterns
It should be noted that our data are not collected by means of a systematic sampling method, but are based on the Discoverer’s capacity to detect new words on the web and on Twitter. To check whether the composition of our sample covers the spectrum of lexical innovations as investigated by previous lexicographic work, a systematic investigation of new words recorded in the OED was conducted by the PhD candidate working on the project. A quantitative analysis of all neologisms which have entered the OED since 1800 has been found to be largely in line with the composition of word classes and word-formation processes in our sample. 1.4.2 Discussion in the light of initial hypotheses regarding factors affecting diffusion 1.4.2.1 Productivity The distribution of our data in terms of word class and word formation are an indicator of the productivity of the different word-formation patterns and word classes on a macroscopic scale. They reflect speakers’ tendencies to coin new words and recruit word-formation patterns for lexical innovations. The distribution of word classes in our sample (see Figure 2.1) is dominated by a high percentage of nouns (79%), followed by lower percentages of adjectives (15%) and verbs (12%), adverbs (1%) and phrases (1%). This is in line with the expectation that new nouns are particularly useful for naming innovative products, concepts and practices which are salient in public discourse. Our quantitative study of OED data has shown that the distribution of word classes among neologisms that have entered the lexicon since 1800 has remained very stable over time. In the period between 1950 and 2010, a total of 14,796 new nouns (69%), adjectives (22%), verbs (8%) and adverbs (1%) have been entered. In comparison with these data, our sample features a slightly higher proportion of nouns. While our sample cannot claim full representativity, the numbers indicate that our database of neologisms at least reflects the distribution of new words in terms of word classes mirrored in the OED (with all due reservations regarding the OED’s text sampling, lemma inclusion policy, etc.). Regarding word-formation processes, compounding (37%), blending (31%) and derivation (24%) have given rise to the great majority of new words we detected (see Figure 2.2). The dominance in productivity of these three patterns is in accordance with previous studies (e.g. Bauer 1983). As with the evaluation of word classes, a quantitative comparison with OED data was drawn. While compounds (38%) and derivations (19%) account for a similarly large proportion of new words in the OED in the period between 1990 and 2010, blends (6%) are much rarer than in our data sample, even though this number has increased significantly in recent years. The rising productivity of blending in the formation of new words is in line with previous quantitative investigations which regard blends as increasingly productive formations typical of newspapers and language use on the web (Ayto 2003). The final thesis by Andrea Birkmüller (see 1.5) has shown that blending has increased in productivity over the past 20 years, both in terms of innovations as such and in terms of diffusion. The fact that blending is often assumed to produce more short-lived formations (Algeo 1998) might partly account for the higher numbers of blends in our data of incipient diffusion compared with lower numbers of blends entered as fairly conventional words in the OED. With regard to the relation between
![Page 19: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/19.jpg)
# 19/36
ResultsDiffusion of all candidates
DFG form 53.01– 03/18 page 6 of 20
new pages, to 28 items which we have monitored for a longer period but which have not been used since April 2016. To give an idea of the words, we have selected the 25 top-ranking items (Figure 1.2), the 25 items in the middle of the frequency distribution, i.e. 12 above and below the median (Figure 1.3) and the 25 items at the end of the tail (Figure 1.4), omitting those which are not attested in the period. Note that these are cumulative counts which do reflect how long the words have existed. For example, the top-ranking item Trumpism has been in use over the whole period, while the compound fake news was coined later and has only been monitored for 61 weeks. Nevertheless, we regard cumulative counts as a suitable approximative indicator of diffusion in general, because they reflect the number of uses on the web and the number of occasions for the average Internet user to have come across these words on the web.
Figure 1: Cumulative number of new pages in the period between April 2016 and April 2018
On the basis of this rationale, it seems legitimate to argue that the 25 top-ranking items listed in Figure 1.2 are the neologisms which have caught on best, while those in the middle part of the frequency cline are less strongly conventionalized and those at the end of the tail are not at all conventionalized so far. This conclusion is confirmed by intuitive assessments of the items listed and by our questionnaire results, which suggest that frequency is a strong indicator of which words are generally more familiar to individual speakers. It should be noted that the data gloss over differences in the patterns of diffusion already identified by Kerremans (2015). These patterns are confirmed by the larger dataset we now have at our disposal. The main patterns are:
• fast and sustained diffusion, illustrated by most of the top-ranking items in Figure 1.2, although some, e.g. liveblog, show a less steep increase in the early stage after coinage;
• no diffusion, illustrated by the long tail of Figures 1.1 and 1.4; • topical diffusion after noteworthy events, with subsequent reduction of usage intensity or
sporadic topical peaks, e.g. Grexit, Catalexit, creepy clown; • cyclical changes in usage frequency, depending on seasons, repeated events like elections
or sports events, e.g. veganuary.
1.4.1.2 Distribution in terms of word class and underlying word-formation patterns Figure 2 provides a summary of the data with regard to their distribution across word classes (Figure 2.1) and underlying word-formation patterns (Figure 2.2), counting the topmost or final word-formation process in the word-internal hierarchy if several apply.
![Page 20: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/20.jpg)
# 20/36
ResultsTop 25 items
DFG form 53.01– 03/18 page 6 of 20
new pages, to 28 items which we have monitored for a longer period but which have not been used since April 2016. To give an idea of the words, we have selected the 25 top-ranking items (Figure 1.2), the 25 items in the middle of the frequency distribution, i.e. 12 above and below the median (Figure 1.3) and the 25 items at the end of the tail (Figure 1.4), omitting those which are not attested in the period. Note that these are cumulative counts which do reflect how long the words have existed. For example, the top-ranking item Trumpism has been in use over the whole period, while the compound fake news was coined later and has only been monitored for 61 weeks. Nevertheless, we regard cumulative counts as a suitable approximative indicator of diffusion in general, because they reflect the number of uses on the web and the number of occasions for the average Internet user to have come across these words on the web.
Figure 1: Cumulative number of new pages in the period between April 2016 and April 2018
On the basis of this rationale, it seems legitimate to argue that the 25 top-ranking items listed in Figure 1.2 are the neologisms which have caught on best, while those in the middle part of the frequency cline are less strongly conventionalized and those at the end of the tail are not at all conventionalized so far. This conclusion is confirmed by intuitive assessments of the items listed and by our questionnaire results, which suggest that frequency is a strong indicator of which words are generally more familiar to individual speakers. It should be noted that the data gloss over differences in the patterns of diffusion already identified by Kerremans (2015). These patterns are confirmed by the larger dataset we now have at our disposal. The main patterns are:
• fast and sustained diffusion, illustrated by most of the top-ranking items in Figure 1.2, although some, e.g. liveblog, show a less steep increase in the early stage after coinage;
• no diffusion, illustrated by the long tail of Figures 1.1 and 1.4; • topical diffusion after noteworthy events, with subsequent reduction of usage intensity or
sporadic topical peaks, e.g. Grexit, Catalexit, creepy clown; • cyclical changes in usage frequency, depending on seasons, repeated events like elections
or sports events, e.g. veganuary.
1.4.1.2 Distribution in terms of word class and underlying word-formation patterns Figure 2 provides a summary of the data with regard to their distribution across word classes (Figure 2.1) and underlying word-formation patterns (Figure 2.2), counting the topmost or final word-formation process in the word-internal hierarchy if several apply.
![Page 21: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/21.jpg)
# 21/36
ResultsItems around median
DFG form 53.01– 03/18 page 6 of 20
new pages, to 28 items which we have monitored for a longer period but which have not been used since April 2016. To give an idea of the words, we have selected the 25 top-ranking items (Figure 1.2), the 25 items in the middle of the frequency distribution, i.e. 12 above and below the median (Figure 1.3) and the 25 items at the end of the tail (Figure 1.4), omitting those which are not attested in the period. Note that these are cumulative counts which do reflect how long the words have existed. For example, the top-ranking item Trumpism has been in use over the whole period, while the compound fake news was coined later and has only been monitored for 61 weeks. Nevertheless, we regard cumulative counts as a suitable approximative indicator of diffusion in general, because they reflect the number of uses on the web and the number of occasions for the average Internet user to have come across these words on the web.
Figure 1: Cumulative number of new pages in the period between April 2016 and April 2018
On the basis of this rationale, it seems legitimate to argue that the 25 top-ranking items listed in Figure 1.2 are the neologisms which have caught on best, while those in the middle part of the frequency cline are less strongly conventionalized and those at the end of the tail are not at all conventionalized so far. This conclusion is confirmed by intuitive assessments of the items listed and by our questionnaire results, which suggest that frequency is a strong indicator of which words are generally more familiar to individual speakers. It should be noted that the data gloss over differences in the patterns of diffusion already identified by Kerremans (2015). These patterns are confirmed by the larger dataset we now have at our disposal. The main patterns are:
• fast and sustained diffusion, illustrated by most of the top-ranking items in Figure 1.2, although some, e.g. liveblog, show a less steep increase in the early stage after coinage;
• no diffusion, illustrated by the long tail of Figures 1.1 and 1.4; • topical diffusion after noteworthy events, with subsequent reduction of usage intensity or
sporadic topical peaks, e.g. Grexit, Catalexit, creepy clown; • cyclical changes in usage frequency, depending on seasons, repeated events like elections
or sports events, e.g. veganuary.
1.4.1.2 Distribution in terms of word class and underlying word-formation patterns Figure 2 provides a summary of the data with regard to their distribution across word classes (Figure 2.1) and underlying word-formation patterns (Figure 2.2), counting the topmost or final word-formation process in the word-internal hierarchy if several apply.
![Page 22: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/22.jpg)
# 22/36
ResultsBottom 25 items
DFG form 53.01– 03/18 page 6 of 20
new pages, to 28 items which we have monitored for a longer period but which have not been used since April 2016. To give an idea of the words, we have selected the 25 top-ranking items (Figure 1.2), the 25 items in the middle of the frequency distribution, i.e. 12 above and below the median (Figure 1.3) and the 25 items at the end of the tail (Figure 1.4), omitting those which are not attested in the period. Note that these are cumulative counts which do reflect how long the words have existed. For example, the top-ranking item Trumpism has been in use over the whole period, while the compound fake news was coined later and has only been monitored for 61 weeks. Nevertheless, we regard cumulative counts as a suitable approximative indicator of diffusion in general, because they reflect the number of uses on the web and the number of occasions for the average Internet user to have come across these words on the web.
Figure 1: Cumulative number of new pages in the period between April 2016 and April 2018
On the basis of this rationale, it seems legitimate to argue that the 25 top-ranking items listed in Figure 1.2 are the neologisms which have caught on best, while those in the middle part of the frequency cline are less strongly conventionalized and those at the end of the tail are not at all conventionalized so far. This conclusion is confirmed by intuitive assessments of the items listed and by our questionnaire results, which suggest that frequency is a strong indicator of which words are generally more familiar to individual speakers. It should be noted that the data gloss over differences in the patterns of diffusion already identified by Kerremans (2015). These patterns are confirmed by the larger dataset we now have at our disposal. The main patterns are:
• fast and sustained diffusion, illustrated by most of the top-ranking items in Figure 1.2, although some, e.g. liveblog, show a less steep increase in the early stage after coinage;
• no diffusion, illustrated by the long tail of Figures 1.1 and 1.4; • topical diffusion after noteworthy events, with subsequent reduction of usage intensity or
sporadic topical peaks, e.g. Grexit, Catalexit, creepy clown; • cyclical changes in usage frequency, depending on seasons, repeated events like elections
or sports events, e.g. veganuary.
1.4.1.2 Distribution in terms of word class and underlying word-formation patterns Figure 2 provides a summary of the data with regard to their distribution across word classes (Figure 2.1) and underlying word-formation patterns (Figure 2.2), counting the topmost or final word-formation process in the word-internal hierarchy if several apply.
![Page 23: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/23.jpg)
# 23/36
How do new words spread on Twitter?
![Page 24: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/24.jpg)
# 24/36
Methodology
I advantagesI going back in timeI high temporal resolutionI user metadata (social, geographic)I social network data
I toolsI ongoing Twitter mining: TAGSI web scraping: Twitter Scraper
![Page 25: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/25.jpg)
# 25/36
A Case study of alt-right and alt-leftBackground of alt-right
clipped form of earlier term Alternative Right, coined by WhiteSupremacist Richard Spencer
![Page 26: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/26.jpg)
# 26/36
A Case study of alt-right and alt-leftBackground of olt-left
formed in analogy (and opposition) to pre-existing alt-right
![Page 27: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/27.jpg)
# 27/36
Corpus examples
use of alt-left in 2016
![Page 28: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/28.jpg)
# 28/36
Corpus examplesuse of alt-left in 2017
![Page 29: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/29.jpg)
# 29/36
Zooming in on diffusion
0
50000
100000
150000
2008 2010 2012 2014 2016 2018
Twee
ts (
wee
kly)
alt−left
alt−right
![Page 30: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/30.jpg)
# 30/36
0
50000
100000
150000
2016−01 2016−07 2017−01 2017−07 2018−01
Twee
ts (
wee
kly)
alt−left
alt−right
![Page 31: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/31.jpg)
# 31/36
0
10000
20000
30000
Aug Sep Okt Nov Dez
Twee
ts (
daily
)
alt−left
alt−right
August 25, 2016: Hillary Clinton’s speech against alt-rightNovember 22, 2016: Trump publicly defends Steven Bannon
![Page 32: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/32.jpg)
# 32/36
0
10000
20000
30000
Jul 31 Aug 07 Aug 14 Aug 21 Aug 28
Twee
ts (
daily
)
alt−left
alt−right
August 12, 2017: Charlottesville RallyAugust 16, 2017: Trump attacking ‘alt-left’
![Page 33: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/33.jpg)
# 33/36
Zooming in on diffusion
the ‘social’ network
![Page 34: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/34.jpg)
# 34/36
Social network analysis
alt-left alt-right
number of tweets 295,968 1,760,777number of individual speakers 117,607 550,798avg. weighted degree 0.855 1.044modularity 0.937 0.877
→ alt-right shows a high degree of diffusion over an extendedtime window
→ alt-left shows some diffusion, but remains to be used bysmaller pockets of the speech community
![Page 35: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/35.jpg)
# 35/36
Implications
I S-curves not to be expected due to effects of topicality2
I differentiated view on diffusion: sub-communitiesI ’influencers’ drive innovationI social network characteristics influence diffusion
2and for other reasons that we could discuss . . .
![Page 36: Diffusion of Lexical Innovations - Investigating the Spread of … › personen › wiss_ma › ... · 2019-10-09 · studies (e.g. Bauer 1983). As with the evaluation of word classes,](https://reader033.vdocuments.site/reader033/viewer/2022060209/5f0444ee7e708231d40d24a6/html5/thumbnails/36.jpg)
# 36/36
Thanks!
3
3OED Word of the Year 2015