next generation translation tools swansea 19...
TRANSCRIPT
Next Generation Translation ToolsSwansea – 19 July
Monday, 25 July 2016 1
Dr Ana Frankenberg-Garcia
On my way to TaLC 12 in Giessen
Monday, 25 July 2016 2
Sorry I am not here
Monday, 25 July 2016 3
How I could contribute?
1. Corpus compilation
2. Applied uses of corpora in
• Bilingual lexicography
• Second language and translation pedagogy
• Translation studies
3. Corpus workshops for translators
4. Languages
Monday, 25 July 2016 4
Research background and experience
How I could contribute
• Principal investigator of COMPARA (1999-2008)
• 3-million-word parallel corpus of Portuguese-English, English-
Portuguese fiction
• Open access at www.linguateca.pt/COMPARA
• Contributed to the compilation of ptTenTen corpus
• 3 billion-word web-crawled corpus of contemporary European and
Brazilian Portuguese
• Access at www.sketchengine.co.uk
• Various ad hoc corpora for specialized translation and terminology extraction
Monday, 25 July 2016 5
1. Corpus compilation
Sketch Engine (Lexical Computing Ltd.)
• State-of-the-art technology
• Software used by major industry stakeholders (e.g. OUP, CUP,
Collins, Macmillan, Le Robert, etc.)
• Team of experts continuously developing software
• Close contact with users, willing to incorporate functionalities
requested by users
• Access to existing corpora in over 60 languages
• Handles monolingual and parallel corpora
• Handles web-crawling for ad hoc corpus creation
• One-step part-of-speech tagging
• Embedded terminology extraction tools
Monday, 25 July 2016 6
Preferred software
1. Corpus compilation
• Chief editor, Oxford Portuguese Dictionary, Oxford University Press (2015)
• Editor, Collins Gem Portuguese Dictionary (2016 update)
• Editor, Collins Pocket Portuguese Dictionary (2016 update)
Monday, 25 July 2016 7
In bilingual lexicography
2. Applied uses of corpora
Frankenberg-Garcia, A. (2016) “Corpora in ELT”. In Hall, G. (ed.) Routledge Handbook of
English Language Teaching. London: Routledge, 383-398.
Frankenberg-Garcia, A. (2015) “Training Translators to Use Corpora Hands-on:
challenges and reactions by a group of 13 students at a UK university”. Corpora.
10/2: 351-380.
Frankenberg-Garcia, A. (2015) “Dictionaries and Encoding Examples to Support
Language Production”. International Journal of Lexicography, 28/4: 490-512.
Frankenberg-Garcia, A. (2014) "How Language Learners Can Benefit from Corpora, or
Not". Recherches en Didatique des Langues et des Cultures, 11/1: 93-110.
Frankenberg-Garcia, A. (2014) "The Use of Corpus Examples for Language
Comprehension and Production”. Special Issue of ReCALL, Researching uses of
corpora for language teaching and learning, 128-146.
Frankenberg-Garcia, A. (2012) “Learners’ Use of Corpus Examples”. International Journal
of Lexicography, 25/3: 273-296.
Frankenberg-Garcia, A. (2012) “Integrating Corpora with Everyday Language Teaching”.
In Thomas, J. and Boulton, A. (eds.) Input, Process and Product: Developments in
Teaching and Language Corpora. Brno: Masaryk University Press, 36-53.
Frankenberg-Garcia, A. (2012) "Raising Teachers’ Awareness of Corpora". Language
Teaching, 45/4: 475-489.
Monday, 25 July 2016 8
In second language and translation pedagogy
2. Applied uses of corpora
Frankenberg-Garcia, A. (forthcoming) “A corpus study of loans in translated and non-
translated texts”. In G. Corpas Pastor and M. Seghiri (eds.) Corpus-based
Approaches to Translation and Interpreting: from theory to applications. Frankfurt:
Peter Lang.
Frankenberg-Garcia (2014) "Understanding Portuguese Translations with the Help of
Corpora". In T. Sardinha and T. Ferreira (eds.) Working with Portuguese Corpora.
London: Bloomsbury, 161-176.
Frankenberg-Garcia, A. (2009) "Compiling and Using a Parallel Corpus for Research in
Translation". International Journal of Translation, XXI/1: 57-71.
Frankenberg-Garcia, A. (2009) “Are Translations Longer than Source Texts? A Corpus-
Based Study of Explicitation”. In Beeby, A., Rodríguez, P. & Sánchez-Gijón, P. (eds.)
Corpus Use and Learning to Translate (CULT): An Introduction. Amsterdam and
Philadelphia: John Benjamins, 47-58
Frankenberg-Garcia (2008) "Suggesting Rather Special Facts. A Corpus-Based Study of
Distinctive Lexical Distributions in Translated Texts". Corpora, 3/2: 195-211.
Monday, 25 July 2016 9
In Translation Studies
2. Applied uses of corpora
• DG Translation: 2016
• Institute of Translators and Interpreters: 2014, 2015, 2016
• Teaching and Language Corpora conference: 2014, 2016
• Scotnet Workshop: 2016
Monday, 25 July 2016 10
Hands-on introduction to corpora in translation practice
3. Corpus workshops for translators
• Languages A
• Portuguese
• English
• Languages B
• French
• Spanish
Monday, 25 July 2016 11
4. Working languages
• I hope my background and experience can be of use to the project
• Do get in touch if collaboration makes sense
• Thanks to Andy for organizing this meeting
• Thanks to Joanna for presenting on my behalf
Monday, 25 July 2016 12
In conclusion
Investigating the ‘externalisation’ phenomenon -studying the patterns of reusing of the external data
in the translation process
‘Next Generation Translation Tools’ day workshop, Swansea
Joanna GoughCentre for Translation Studies
University of Surrey19 July 2016
- Survey on the adoption of Web 2.0 technologies by professional translators (MA Thesis)
- Empirical research into how translators interact with external resources (PhD Thesis)- Screen recordings with audio commentary- Questionnaires (pre- and post-task)
- Member of the ITI Professional Development Committee (potential for collaboration with freelance translators outside DGT)
- Languages: Polish, English
RELEVANT PROFESSIONAL AND RESEARCH BACKGROUND
The major theme of these new approaches is the embeddedness of cognitionin the body and in the world.
If one can, conceptually, distinguish the brain from the body and from the environment, a flood of dense and continuous information binds the three
together(Hardy-Vallée, 2008)
EMOBODIED COGNITIONthe brain is embedded in in the body
EXTENDED COGNITIONcognition not skull- or body-bound, but
extends into the world
SITUATED COGNITIONthe brain-body complex embedded in
an environment
DISTRIBUTED COGNITIONcognition distributed across individuals,
objects and tools in the environment
THEORETICAL FRAMEWORK - COGNITION BEYOND THE BRAIN
THEORETICAL FRAMEWORK - COGNITION BEYOND THE BRAIN
MY INTEREST:
EXTENDED MIND /ACTIVE EXTERNALISM
(Clarke and Chalmers, 1998)
Emphasises the active role of the environment support in driving/shaping cognitive processes
(in everyday life: pen & paper, digital notes, reminders, calendars, Internet, cameras, personal assistant such as Siri or Cortana etc.)
(in translation: translators’ notes, translation memories, machine translation, online resources etc.)
focus on TRANSLATION TECHNOLOGY as the active agent
in the translation process
Overreliance on external resources – users “will turn to the Internet to ask even the simplest
question. Just knowing that a piece of information is readily available […] anytime leads humans not to
memorise” (Duval, 2012, in: Zapata, 2015)
Overreliance on external resources -rapid, 2-3 sec consultations for everyday
words (Gough, 2016)
Research from Information Technology and Web and Studies:
Research from Translation Studies:
The Web is undermining our capacity for deep, sustained, linear thinking and
promotes superficial forms of information processing (Carr, 2010)
Shallow, economising research patterns; few participants showed explorative tendencies with deep, engaging patterns (Gough, 2016)
Deterioration in recall - users are more likely to remember the location of information (and how to retrieve it) than the actual
information (Sparrow, Liu & Wegner, 2011)
Less need for recall (T6: “incidentally, since I use Déjà Vu, I found that I am much less
motivated to even try to remember certain terms translations […] knowing that Déjà Vu will bring them up when needed”) (see Gough,
2016)
RELATED RESEARCH FINDINGS
From black box to tool box
Tool boxBlack box
Translation memory
CHANGE IN COGNITIVE SKILL SET AS A RESULT OF ACTIVE EXTERNALISM
Different set of skills required for translation:
From GENERATING ideasto SELECTING the appropriate solution
(Pym, 2013)
What implications does increasing reliance on technologyhave for the way we translate?
This becomes even more interesting with the predictive MT
So, my interest is in trying to empirically capture at least some aspects resulting from the externalisation/extension of the
translators’ mind:
HOW OPTIONS ARE SELECTED FROM VARIOUS TYPES OF INPUTHOW MUCH OF REUSED MATERIAL from TM, MT, TB, Autosuggest
makes its way into the target text
But how can we do it?
Since access to translator’s brain activities and how they use their internal resources is very difficult, we can observe the environment in which the
translator works with
Methods of observation are becoming more and more viable thanks to new technologies (e.g. MATECAT, CASMACAT, IOmegaT, TranslogII, HandyCAT)
MY RESEARCH AGENDA
Since the tool of choice is SDL Trados, it is proposed to use Qualitivity plugin (Hartnett, 2015). Why?
- Free (as of June 2016)- Works unobtrusively in SDL Trados Studio (2014 & 2015) - Offers a wide range of metrics:
• TM, MT, Autosuggest, Termbase suggestions that were used• Tracking of whether the translation was later adapted by the translator• Records additions, deletions or adaptations in a segment in a track-changes
mode• The time taken to translate the complete file and each segment• The key strokes typed by the translator/reviewer • The number of words (source) in the segments that were updated• Cascading comparison results, when a segment is modified more than once
- Produces visualisations or exports the metrics as XML or to Excel where data can be manipulated.
- Later this year SDL is to release their Adaptive MT feature and Qualitivity should be able to track this activity (TBC)
- Connects to TAUS Quality Dashboard (Dynamic Quality Framework)
QUALITIVITY PLUGIN
Source: TM – Pl(PL)-En(GB)
Source: MT – SDL Language Cloud
QUALITIVITY PLUGIN
QUALITIVITY PLUGIN
QUALITIVITY PLUGIN
CONSIDERATIONS FOR THE FUTURE
In order to uncover how technology influences/shapes cognitive processes during translation, we need to study translation process:
- across different technologies/platforms (e.g. differences between interactive translation vs post-editing)
- as an interaction with different modalities • voice• touch screens• e-pens etc.
- as human/machine systems
Duval, C. (2012). L’impact du Web en 4 questions. LaRecherche, 467, 46–50.
Carr, N. (2010). How the Internet is changing the way we think, read and remember. London: Atlantic Books.
Clarke, A & Chalmers, D.J. (1998). The Extended Mind, in Analysis 58:10-23, 1998.
Duval, C. (2012). L’impact du Web en 4 questions. LaRecherche, 467, 46–50.
Gough, J. (2016). The Patterns of Interaction between Professional Translators and Online Resources. Unpublished PhD Thesis. University of Surrey.
Hartnett, P (2015). Qualitivity. Available at: http://codificare.net/tools/qualitivity/
Hardy-Vallée, B. (2008). Embodied, situated and distributed cognition. Retrieved from http://naturalrationality.blogspot.co.uk/2008/01/embodied-situated-and-distributed.html
Pym, A. (2011). What technology does to translating. Translation and Interpreting, 3(1), 1–9.
Small, G. W., Moody, T. D., Siddarth, P., & Bookheimer, S. Y. (2009). Your Brain on Google: Patterns of Cerebral Activation during Internet Searching. American Journal of Geriatric Psychiatry, 17(2), 116–126.
Smart, P. (2013). Understanding the Cognitive Impact of Emerging Web Technologies: A Research Focus Area for Embodied, Extended and DistributedApproaches to Cognition. 1st International Web for Wellbeing & Human Performance Workshop, Paris, France. Retrieved July 11, 2016 from http://eprints.soton.ac.uk/350372/
Sparrow, B., Liu, J., & Wegner, D. (2011). Google effects on memory: Cognitive consequences of having information at our fingertips. Science, 3333(6043), 776–778.
Zapata, J. (2015). Investigating translator-information interaction: A case study on the use of the prototype biconcordancer tool integrated in CASMACAT. In M. Carl, S. Bangalore, & M. Schaeffer (Eds.), New Directions in Empirical Translation Process Research: Exploring the CRITT TPR-DB (pp. 139–155). Springer.
References
Thank you for [email protected]