foreign language learning mode preferences of japanese

16
Foreign Language Learning Mode Preferences of Japanese University Students Kunihiro KUSANAGI Kazumichi ENOKIDA Shusaku KIDA Tatsuya SAKAUE Fuyuko TAKITA Keiso TATSUKAWA Koji UENISHI Goro YAMAMOTO Lisa YOSHIKAWA Hiroshima University Abstract The present study aims to develop a practical and statistically sound inventory that helps in examining Japanese university students’ foreign language learning mode preferences (LMPs). LMP, like numerous other learners’ traits such as learning styles, beliefs, and motivation, is one of the behavioral and meta-cognitive variables that potentially moderate the variance of the learning outcome under a specific learning mode. The present study conducted a questionnaire survey, sampling university students who were learning English as a foreign language (N = 1,003) in a Japanese national university and developed a Foreign Language Learning Mode Preference Inventory (ver. 1), whose items (K = 12) were all written in Japanese. The inventory yielded three subscales with relatively higher reliability coefficient for each: (a) cooperative learning preference (α = .91), (b) e-learning preference (α = .90), and (c) face-to-face instruction preference (α = .70). The factorial validity of the inventory was tested using structure equation modeling. The results showed that the theoretical measurement model fit well to the observations. Furthermore, multiple sample analyses supported the measurement invariance with the constraints of equal loadings and equal intercepts across genders. Some practical applications of this inventory, specifically in the context of foreign language teaching in Japanese higher education, were discussed. 193

Upload: others

Post on 03-Dec-2021

7 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Foreign Language Learning Mode Preferences of Japanese

Kondo, T., & Shirahata, T. (2014, December). Explicit instruction on English verb structures in L2 classrooms. Oral presentation at The Sixth CLS International Conference (CLaSIC) 2014.Singapore.

Kondo, T., & Shirahata, T. (2015). The effects of explicit instruction on intransitive verb structures in L2 English classrooms. Annual Review of English Language Education in Japan, 26.93-108.

Lado, R. (1957). Language across cultures: Applied linguistics for language teachers. University of Michigan Press.

Matsunaga, K. (2005). Overgeneralization in second language acquisition of transitivity alternations. Second Language, 4, 75-110.

Montrul, S. (2000). Transitivity alternations in L2 acquisition: Toward a modular view of transfer. Studies in Second Language Acquisition, 22, 229-273. doi: 10.1017/S0272263100002047

Omaki, A., & Ariji, K. (2005). Testing and attesting the use of structural information in L2 sentence processing. In Proceedings of the 7th Generative Approaches to Second Language Acquisition Conference, 205-218.

Oshita, H. (2000). What is happened may not be what appears to be happening: A corpus study of ‘passive’ unaccusatives in L2 English. Second Language Research, 16, 293-324. doi: 10.1177/026765830001600401.

Scott, R. M., & Fisher, C. (2009). Two-year-olds use distributional cues to interpret transitivity-alternating verbs. Language and Cognitive Processes, 24, 777-803. doi: 10.1080/01690960802573236

Shirahata, T. (2015). Eigo shido ni okeru koukateki na ayamari teisei: Daini gengo shuutoku no kenchi kara [Who’s afraid of grammatical errors?]. Tokyo: Taisyukan Shoten.

Shirahata, T., Suda, K., Kondo, T., Yokota, H., Ogawa, M., & Ogawa, S. (2016, September). The interaction of animacy with the wh-extraction by Japanese learners of English. Oral presentation at The Pacific Second Language Research Forum 2016. Tokyo. Japan.

Silverstein, M. (1976). Hierarchy of features and ergativity. In M. Robert, & W. Dixon (Eds.), Grammatical Categories in Australian Languages (pp.112-171). Australian Institute of Aboriginal Studies .

Syndicate, U. C. L. E. (2001). Quick placement test. Oxford University Press.Tsunoda, T. (2009). Sekai no gengo to nihongo [Languages in the world and Japanese]. Tokyo:

Kuroshio Shuppan .White, L. (2003). Second language acquisition and universal grammar. Cambridge University

Press.Yusa, N. (2003). ‘Passive’ unaccusatives in L2 acquisition. In M. C. Patricia (Ed.),

Japanese/Korean Linguistics, 11, 246-259. CSLI Stanford University Press.

Foreign Language Learning Mode Preferences of

Japanese University Students

Kunihiro KUSANAGI Kazumichi ENOKIDA

Shusaku KIDA Tatsuya SAKAUE Fuyuko TAKITA

Keiso TATSUKAWA Koji UENISHI

Goro YAMAMOTO Lisa YOSHIKAWA Hiroshima University

Abstract

The present study aims to develop a practical and statistically sound inventory that helps in examining Japanese university students’ foreign language learning mode preferences (LMPs). LMP, like numerous other learners’ traits such as learning styles, beliefs, and motivation, is one of the behavioral and meta-cognitive variables that potentially moderate the variance of the learning outcome under a specific learning mode. The present study conducted a questionnaire survey, sampling university students who were learning English as a foreign language (N = 1,003) in a Japanese national university and developed a Foreign Language Learning Mode Preference Inventory (ver. 1), whose items (K = 12) were all written in Japanese. The inventory yielded three subscales with relatively higher reliability coefficient for each: (a) cooperative learning preference (α = .91), (b) e-learning preference (α = .90), and (c) face-to-face instruction preference (α = .70). The factorial validity of the inventory was tested using structure equation modeling. The results showed that the theoretical measurement model fit well to the observations. Furthermore, multiple sample analyses supported the measurement invariance with the constraints of equal loadings and equal intercepts across genders. Some practical applications of this inventory, specifically in the context of foreign language teaching in Japanese higher education, were discussed.

193

Page 2: Foreign Language Learning Mode Preferences of Japanese

1. Background

1.1 Diversified Learning Modes in Higher Education Nowadays, most foreign language learning curricula, programs, and classes in Japanese higher education consist of various learning modes or modalities, such as (a) pair and group works, (b) e-learning, and (c) face-to-face instruction. To give a concrete example, a compulsory English language learning program, which Hiroshima University’s Institute for Foreign Language Education and Research supplies, is systematically designed to cross multiple learning modes. A common student in this cross-mode program thus experiences face-to-face instruction and some communicative language activities in class and is required to engage in an e-learning program for basic vocabulary in another online course for the same academic term. Such a recent trend in diversification of learning modes in higher education is not simply a matter of everyday teaching practice. Rather, it has been complexly affected by various academic, social, and political movements, such as (a) a cooperative learning perspective in higher education research (e.g., Johnson & Johnson, 2008); (b) striking developments in information and communication technology (Zhao, 2003); (c) recent educational policies in Japan and other countries (e.g., active learning; Prince, 2004 for review); and possibly, (d) second language acquisition (SLA) research. These movements may interact complexly in educational decision-making at any level, and consequently, such interactions may form cross-mode language learning curricula, programs, and classes. This diversification trend will continue for a while. Blended learning and other similar terms may have captured the situation in an appropriate way. Blended learning research focuses on the learning outcome of the combinations of some learning modes rather than that of a single mode. As is easily expected, a couple of meta-analyses reported the relatively higher efficiency of blended learning (e.g., the combination of e-learning and face-to-face instruction) in comparison to single-mode learning, such as purely e-learning (e.g., Means, Toyama, Murphy, & Baki, 2013; Means, Toyama, Murphy, Bakia, & Jones, 2009). However, it should be noted that the results are far from decisive, and researchers should consider the inevitable effects of some moderator variables.

1.2 Learning Mode Preferences as a Moderator of Learning OutcomesOne can easily claim that there is no optimal combination for all the students and can also

assume the existence of some cognitive, psychological, and behavioral variables that statistically moderate one’s learning outcome, as the classical aptitudes-treatment interaction framework in educational psychology formulates (Snow, 1991). For example, one who favors a certain treatment will greatly benefit from the treatment and will not benefit from another treatment. Simply put, this also goes for the combinations of these treatments. Regardless of its historical success, there is no doubt that understanding and controlling some learners’ information concerning aptitudes, and other inter-learner variables that moderate

the learning outcome, is potentially beneficial for any type of educational decision-making, such as in (a) student placement, (b) classroom size setting, (c) curriculum design, (d) teaching material selection, and (e) optimization of everyday teaching practice for each student. Such applications are exactly what the present study views as a goal. In psychology, numerous researchers have paid greater attention to learning styles (e.g., Dunn, 1984; Dunn & Dunn, 1978; Kolb, 2015). Learning style research is basically theory oriented and is based on a cognitive architecture of learning in general, and thus it sometimes fails to explain the direct causality between learning styles and learning outcome in practice (see also criticisms to learning style research, such as Lynn, 1990). Various theoretical learning style models have emerged, such as Kolb’s model, the basis of which is experiential learning (Kolb, 2015) as well as related learning style inventories, such as Kolb’s Learning Style Inventory (Kolb, 2007). However, these models and inventories were not specifically designed to directly examine the moderator variables regarding the currently major learning modes, and they do not fit the diversification trend in higher education, specifically, in the context of foreign language teaching. Likewise, SLA research has voluminous literature on learner variables such as motivation, beliefs, and strategies (Winke, 2007, for overview). It cannot be denied that such extensive research on learner variables has historically brought us enormous scientific insights. However, most of the constructs in SLA research have not been fine-tuned to measure the moderator variable of the learning outcome, and some of them lack statistical appropriateness in their measurement. In this study, we take a different angle from learning style research and SLA research. Namely, we attach importance to more practical aspects. Unlike learning style research, which often focuses on genetic differences of cognitive architecture, we look at the closer causality behind learning. We assume that a behavioral and meta-cognitive trait, learning mode preference (LMP), exists and that this LMP is one of the causes of individuals’ learning engagement, and thus LMP moderates the learning outcome. We statistically model LMP as a latent variable under the reflective model (cf., Edwards & Bagozzi, 2000; Borsboom, Mellenbergh, & Heerden, 2003). This assumption is guided by a perspective on students’ own decision-making about learning activities rather than genetic individual differences in cognition or epistemological differences of one’s mental state such as integrative motivation vs. instrumental motivation. In the LMP framework, it can be said that one who shows higher corporative learning preference (see detail in the next section) can be meta-cognitively aware that he or she favors cooperative learning and initiatively decides how much he or she should be engaged in learning under the cooperative learning mode than in another mode. This type of simple causality can be more easily statistically modeled and behaviorally tested. For instance, compare the three causality scenarios here: (a) “those who have visual and audio learning styles tend to benefit more in e-learning,” (b) “those who have higher instrumental motivation tend to benefit more in e-learning,” and (c) “those who prefer e-learning tend to benefit more in e-learning.”

194

Page 3: Foreign Language Learning Mode Preferences of Japanese

1. Background

1.1 Diversified Learning Modes in Higher Education Nowadays, most foreign language learning curricula, programs, and classes in Japanese higher education consist of various learning modes or modalities, such as (a) pair and group works, (b) e-learning, and (c) face-to-face instruction. To give a concrete example, a compulsory English language learning program, which Hiroshima University’s Institute for Foreign Language Education and Research supplies, is systematically designed to cross multiple learning modes. A common student in this cross-mode program thus experiences face-to-face instruction and some communicative language activities in class and is required to engage in an e-learning program for basic vocabulary in another online course for the same academic term. Such a recent trend in diversification of learning modes in higher education is not simply a matter of everyday teaching practice. Rather, it has been complexly affected by various academic, social, and political movements, such as (a) a cooperative learning perspective in higher education research (e.g., Johnson & Johnson, 2008); (b) striking developments in information and communication technology (Zhao, 2003); (c) recent educational policies in Japan and other countries (e.g., active learning; Prince, 2004 for review); and possibly, (d) second language acquisition (SLA) research. These movements may interact complexly in educational decision-making at any level, and consequently, such interactions may form cross-mode language learning curricula, programs, and classes. This diversification trend will continue for a while. Blended learning and other similar terms may have captured the situation in an appropriate way. Blended learning research focuses on the learning outcome of the combinations of some learning modes rather than that of a single mode. As is easily expected, a couple of meta-analyses reported the relatively higher efficiency of blended learning (e.g., the combination of e-learning and face-to-face instruction) in comparison to single-mode learning, such as purely e-learning (e.g., Means, Toyama, Murphy, & Baki, 2013; Means, Toyama, Murphy, Bakia, & Jones, 2009). However, it should be noted that the results are far from decisive, and researchers should consider the inevitable effects of some moderator variables.

1.2 Learning Mode Preferences as a Moderator of Learning OutcomesOne can easily claim that there is no optimal combination for all the students and can also

assume the existence of some cognitive, psychological, and behavioral variables that statistically moderate one’s learning outcome, as the classical aptitudes-treatment interaction framework in educational psychology formulates (Snow, 1991). For example, one who favors a certain treatment will greatly benefit from the treatment and will not benefit from another treatment. Simply put, this also goes for the combinations of these treatments. Regardless of its historical success, there is no doubt that understanding and controlling some learners’ information concerning aptitudes, and other inter-learner variables that moderate

the learning outcome, is potentially beneficial for any type of educational decision-making, such as in (a) student placement, (b) classroom size setting, (c) curriculum design, (d) teaching material selection, and (e) optimization of everyday teaching practice for each student. Such applications are exactly what the present study views as a goal. In psychology, numerous researchers have paid greater attention to learning styles (e.g., Dunn, 1984; Dunn & Dunn, 1978; Kolb, 2015). Learning style research is basically theory oriented and is based on a cognitive architecture of learning in general, and thus it sometimes fails to explain the direct causality between learning styles and learning outcome in practice (see also criticisms to learning style research, such as Lynn, 1990). Various theoretical learning style models have emerged, such as Kolb’s model, the basis of which is experiential learning (Kolb, 2015) as well as related learning style inventories, such as Kolb’s Learning Style Inventory (Kolb, 2007). However, these models and inventories were not specifically designed to directly examine the moderator variables regarding the currently major learning modes, and they do not fit the diversification trend in higher education, specifically, in the context of foreign language teaching. Likewise, SLA research has voluminous literature on learner variables such as motivation, beliefs, and strategies (Winke, 2007, for overview). It cannot be denied that such extensive research on learner variables has historically brought us enormous scientific insights. However, most of the constructs in SLA research have not been fine-tuned to measure the moderator variable of the learning outcome, and some of them lack statistical appropriateness in their measurement. In this study, we take a different angle from learning style research and SLA research. Namely, we attach importance to more practical aspects. Unlike learning style research, which often focuses on genetic differences of cognitive architecture, we look at the closer causality behind learning. We assume that a behavioral and meta-cognitive trait, learning mode preference (LMP), exists and that this LMP is one of the causes of individuals’ learning engagement, and thus LMP moderates the learning outcome. We statistically model LMP as a latent variable under the reflective model (cf., Edwards & Bagozzi, 2000; Borsboom, Mellenbergh, & Heerden, 2003). This assumption is guided by a perspective on students’ own decision-making about learning activities rather than genetic individual differences in cognition or epistemological differences of one’s mental state such as integrative motivation vs. instrumental motivation. In the LMP framework, it can be said that one who shows higher corporative learning preference (see detail in the next section) can be meta-cognitively aware that he or she favors cooperative learning and initiatively decides how much he or she should be engaged in learning under the cooperative learning mode than in another mode. This type of simple causality can be more easily statistically modeled and behaviorally tested. For instance, compare the three causality scenarios here: (a) “those who have visual and audio learning styles tend to benefit more in e-learning,” (b) “those who have higher instrumental motivation tend to benefit more in e-learning,” and (c) “those who prefer e-learning tend to benefit more in e-learning.”

195

Page 4: Foreign Language Learning Mode Preferences of Japanese

Although we take a practical approach based on decision-making and discuss some applications of this approach, we never ignore the literature on learning styles and other learner variables. As shown in the path diagram in Figure 1, the present study presumes that the effect of learning styles and other possible factors is behind the latent variable, LMP.

Figure 1. A schematic path diagram representing the basic concept of LMP, where (a) LMP is measured as a reflective model, (b) learning styles and other variables are one of the possible causes of LMP, and (c) LMP potentially explains the variance on the outcome measure, via the decision-making function, which is not manifestly given.

1.3 The Measurement Model and Inventory Design Based on the conceptualization of LMP above, the present study attempts to develop a practical and statistically sound inventory that measures LMP. Now, LMP can be more formally defined as “a behavioral and meta-cognitive trait of learners, which predicts one’s engagement, and outcome of learning under a specific learning mode.” The present study assigns three currently major learning modes in Japanese higher education: (a) cooperative learning preference (CLP), (b) e-learning preference (ELP), and (c) face-to-face instruction preference (FTFIP). It should be assumed that the estimated values of the subscales are not orthogonal, and the present study does not consider any higher order structure of the subscales. These statistical conditions come down to a simple three-factor model. Moreover, regarding the practicality of the inventory, we designed a shorter one; we set the target item number of the final model at 12, which comes to 4 items for each subscale. The exact measurement model can be seen in Figure 2. CLP refers to the preference for cooperative learning. Cooperative learning perspectives have long debates on its definition (e.g., Johnson & Johnson, 2008; Kagan, 1989). Taking a relatively broader view, the present study expects that cooperative learning typically occurs in pair and group works only when (a) the activities are mutually beneficial to the members; (b) the members share their goal, efforts, thoughts, and information; and (c) the goal cannot be accomplished without cooperation (Johnson & Johnson, 2008). Therefore, the term cooperative learning takes a narrower domain than pair and group works; not all pair and group works are

LMP

LearningStyle Decision

Making Outcome・・・ i1 i2 ・・ in

Measurement Model

cooperative. Cooperative learning perspectives may have some similar characteristics as task-based language learning and the active learning concept. Some meta-analyses in higher education research have supported the effectiveness of cooperative learning (e.g., Johnson, Johnson, & Stanne, 2000; Johnson, Maruyama, Johnson, Nilson, & Skon, 1981).

Figure 2. The measurement model of the present study.

ELP is related to e-learning, web-based training, digital learning devices, multimedia, and social networking. Nowadays, this learning mode is applied to numerous areas such as occupational training, language learning, and other school subjects. Some meta-analyses have reported its efficacies (e.g., Bernard et al., 2009; Boghikian-Whitby & Mortagy, 2016, Means et al., 2009), but some studies have also reported the equivalent effect of e-learning with face-to-face instruction (e.g., Neuhauser, 2010). FTFIP is about so-called traditional teaching methods, such as grammar translation, language exercises, lectures, and other common classroom activities that are typically featured in one-directional communication between a teacher and students. Face-to-face instruction is frequently contrasted with e-learning, especially in distance education research (e.g., Neuhauser, 2010; Johnson, Argon, Shaik, & Palma-Rivas, 2000). Some studies showed a higher preference for this on average than for e-learning (e.g., Beard & Happer, 2002; Kishore, Tabrizi, Ozan, Aziz, & Wuensch, 2009). However, this does not imply that the same preferences apply in the context of Japanese higher education.

1.4 Other Related Surveys An applied linguistics researcher, Fushino, conducted a questionnaire survey relating to learner beliefs about second language group work (Fushino, 2010). The scale that Fushino developed is multi-factorial, unlike the present study, which put all its information into a single factor relating to cooperative learning. Furthermore, Fushino’s scale is about learner beliefs, which in the present study we treat as one of the indirect factors behind LMP.

CLP

i1 i2 i4i3

ELP

i5 i6 i8i7

FTFIP

i9 i10 i12i11

e1 e2 e3 e4 e5 e6 e7 e8 e9 e10 e11 e12

196

Page 5: Foreign Language Learning Mode Preferences of Japanese

Although we take a practical approach based on decision-making and discuss some applications of this approach, we never ignore the literature on learning styles and other learner variables. As shown in the path diagram in Figure 1, the present study presumes that the effect of learning styles and other possible factors is behind the latent variable, LMP.

Figure 1. A schematic path diagram representing the basic concept of LMP, where (a) LMP is measured as a reflective model, (b) learning styles and other variables are one of the possible causes of LMP, and (c) LMP potentially explains the variance on the outcome measure, via the decision-making function, which is not manifestly given.

1.3 The Measurement Model and Inventory Design Based on the conceptualization of LMP above, the present study attempts to develop a practical and statistically sound inventory that measures LMP. Now, LMP can be more formally defined as “a behavioral and meta-cognitive trait of learners, which predicts one’s engagement, and outcome of learning under a specific learning mode.” The present study assigns three currently major learning modes in Japanese higher education: (a) cooperative learning preference (CLP), (b) e-learning preference (ELP), and (c) face-to-face instruction preference (FTFIP). It should be assumed that the estimated values of the subscales are not orthogonal, and the present study does not consider any higher order structure of the subscales. These statistical conditions come down to a simple three-factor model. Moreover, regarding the practicality of the inventory, we designed a shorter one; we set the target item number of the final model at 12, which comes to 4 items for each subscale. The exact measurement model can be seen in Figure 2. CLP refers to the preference for cooperative learning. Cooperative learning perspectives have long debates on its definition (e.g., Johnson & Johnson, 2008; Kagan, 1989). Taking a relatively broader view, the present study expects that cooperative learning typically occurs in pair and group works only when (a) the activities are mutually beneficial to the members; (b) the members share their goal, efforts, thoughts, and information; and (c) the goal cannot be accomplished without cooperation (Johnson & Johnson, 2008). Therefore, the term cooperative learning takes a narrower domain than pair and group works; not all pair and group works are

LMP

LearningStyle Decision

Making Outcome・・・ i1 i2 ・・ in

Measurement Model

cooperative. Cooperative learning perspectives may have some similar characteristics as task-based language learning and the active learning concept. Some meta-analyses in higher education research have supported the effectiveness of cooperative learning (e.g., Johnson, Johnson, & Stanne, 2000; Johnson, Maruyama, Johnson, Nilson, & Skon, 1981).

Figure 2. The measurement model of the present study.

ELP is related to e-learning, web-based training, digital learning devices, multimedia, and social networking. Nowadays, this learning mode is applied to numerous areas such as occupational training, language learning, and other school subjects. Some meta-analyses have reported its efficacies (e.g., Bernard et al., 2009; Boghikian-Whitby & Mortagy, 2016, Means et al., 2009), but some studies have also reported the equivalent effect of e-learning with face-to-face instruction (e.g., Neuhauser, 2010). FTFIP is about so-called traditional teaching methods, such as grammar translation, language exercises, lectures, and other common classroom activities that are typically featured in one-directional communication between a teacher and students. Face-to-face instruction is frequently contrasted with e-learning, especially in distance education research (e.g., Neuhauser, 2010; Johnson, Argon, Shaik, & Palma-Rivas, 2000). Some studies showed a higher preference for this on average than for e-learning (e.g., Beard & Happer, 2002; Kishore, Tabrizi, Ozan, Aziz, & Wuensch, 2009). However, this does not imply that the same preferences apply in the context of Japanese higher education.

1.4 Other Related Surveys An applied linguistics researcher, Fushino, conducted a questionnaire survey relating to learner beliefs about second language group work (Fushino, 2010). The scale that Fushino developed is multi-factorial, unlike the present study, which put all its information into a single factor relating to cooperative learning. Furthermore, Fushino’s scale is about learner beliefs, which in the present study we treat as one of the indirect factors behind LMP.

CLP

i1 i2 i4i3

ELP

i5 i6 i8i7

FTFIP

i9 i10 i12i11

e1 e2 e3 e4 e5 e6 e7 e8 e9 e10 e11 e12

197

Page 6: Foreign Language Learning Mode Preferences of Japanese

In Japanese foreign language teaching research, Kawaguchi and Kusanagi (2016) developed a scale that examines computer-assisted language learning (CALL) attitudes. Kawaguchi and Kusanagi’s scale also yields multiple factors. As Fushino (2010) and Kawaguchi and Kusanagi (2016) had addressed conceptually similar constructs with the present study, a future study should examine the correlational evidence between the present study and these studies.

2. The Preliminary Study

2.1 Purpose In order to develop a statistically sound inventory, the present study followed a two-step procedure. First, we conducted a preliminary study, whose main purpose was to select the questionnaire items before the main study.

2.2 Participants In total, 94 university students who learned English as a foreign language participated in the preliminary study. All of the participants were sampled from one national university. Some of the demographic differences were (a) academic years (second-year students and above in the preliminary study and mostly freshmen in the main study); (b) gender distribution; and (c) English language proficiency. However, the present study presumed that the effects of the demographic differences were minuscule. In fact, the main study exactly reproduced the targeted factorial structure.

2.3 Materials The first author clarified the initial item pool (K = 32). The item writer picked up sets of keyword lists for each subscale widely from the literature (e.g., “pair and group” for CLP) and built up sentences that contained the keywords. Then, the item pool was checked by other authors through a consultation. All the items were written in Japanese and had meta-cognitive and self-reflective features. The present study administered the items through a seven-point Likert scale examining the degree of agreement with each sentence.

2.4 Procedure and Results The participants answered a questionnaire that consisted of the 32 items using pen and paper. It took almost 15 minutes on average for them to complete the survey. Then, a research assistant input the responses into computer readable media by hand with some times of validations. All the answers were valid without missing values. We first checked the distribution of each item. Through this procedure, 10 items that violated the normal distribution were excluded. The remaining items (k = 22) were submitted to

classical multidimensional scaling in order to visually select the items, confirming the covariance structure. We excluded 8 items through this process. Finally, by repeatedly simulating exploratory and confirmatory factor analyses, 2 items were excluded by referring to the simulated factor loads and goodness of fit indices. Then, the remaining 12 items were submitted to confirmatory factor analysis with a theoretical measurement model. The results showed relatively fine goodness of fit, CFI = .959, TLI = .930, RMSEA = .068, SRMR = .061 and acceptable reliability coefficient values for each item, CLP (k = 4, α = .86), ELP (k = 4, α = .83), FTFIP (k = 4, α = .70). The 12 items were selected for the main study (see the items in the Appendix).

3. The Main Study

3.1 Purpose The purpose of the main study was to establish the inventory. The present study performed structure equation modeling with the measurement model and multiple sample analyses to test the measurement invariance across genders.

3.2 Participants In total, 909 university students who learned English as a foreign language participated in the main study. All the participants were sampled from the same national university, and they were all freshmen with a couple of exceptions. Their departments were variously ranged: medicine (19.58%), dental (8.91%), pharmaceutical (5.28%), law (14.85%), education (11.55%), engineering (26.51%), biology (10.89%), and science (2.42%). The gender ratios were as follows: men (57.21%), women (42.13%), and transgender (0.66%). The responses of the transgendered participants were not submitted to multiple sample analyses. However, this does not imply that they were not of interest; this decision was based on purely statistical considerations. No participant attended both studies. Most of the participants (91.31%) reported their scores in the Test of English for International Communication (TOEIC), which they had taken during an academic semester. The distribution is shown in Figure 3. The data can be modeled by fitting the normal distribution at μ = 522.58, σ = 101.33, with the goodness of fit at AIC = 10025, or Gamma distribution at α = 26.27, β = 0.05, with the goodness of fit at AIC = 10015.

3.3 Materials The 12 items selected in the preliminary study were adopted. Each subscale had 4 items. Namely, items 1 to 4 referred to CLP, 4 to 8 to ELP, and 9 to 12 to FTFIP. The present study administered the items through a Likert scale, like in the preliminary study.

198

Page 7: Foreign Language Learning Mode Preferences of Japanese

In Japanese foreign language teaching research, Kawaguchi and Kusanagi (2016) developed a scale that examines computer-assisted language learning (CALL) attitudes. Kawaguchi and Kusanagi’s scale also yields multiple factors. As Fushino (2010) and Kawaguchi and Kusanagi (2016) had addressed conceptually similar constructs with the present study, a future study should examine the correlational evidence between the present study and these studies.

2. The Preliminary Study

2.1 Purpose In order to develop a statistically sound inventory, the present study followed a two-step procedure. First, we conducted a preliminary study, whose main purpose was to select the questionnaire items before the main study.

2.2 Participants In total, 94 university students who learned English as a foreign language participated in the preliminary study. All of the participants were sampled from one national university. Some of the demographic differences were (a) academic years (second-year students and above in the preliminary study and mostly freshmen in the main study); (b) gender distribution; and (c) English language proficiency. However, the present study presumed that the effects of the demographic differences were minuscule. In fact, the main study exactly reproduced the targeted factorial structure.

2.3 Materials The first author clarified the initial item pool (K = 32). The item writer picked up sets of keyword lists for each subscale widely from the literature (e.g., “pair and group” for CLP) and built up sentences that contained the keywords. Then, the item pool was checked by other authors through a consultation. All the items were written in Japanese and had meta-cognitive and self-reflective features. The present study administered the items through a seven-point Likert scale examining the degree of agreement with each sentence.

2.4 Procedure and Results The participants answered a questionnaire that consisted of the 32 items using pen and paper. It took almost 15 minutes on average for them to complete the survey. Then, a research assistant input the responses into computer readable media by hand with some times of validations. All the answers were valid without missing values. We first checked the distribution of each item. Through this procedure, 10 items that violated the normal distribution were excluded. The remaining items (k = 22) were submitted to

classical multidimensional scaling in order to visually select the items, confirming the covariance structure. We excluded 8 items through this process. Finally, by repeatedly simulating exploratory and confirmatory factor analyses, 2 items were excluded by referring to the simulated factor loads and goodness of fit indices. Then, the remaining 12 items were submitted to confirmatory factor analysis with a theoretical measurement model. The results showed relatively fine goodness of fit, CFI = .959, TLI = .930, RMSEA = .068, SRMR = .061 and acceptable reliability coefficient values for each item, CLP (k = 4, α = .86), ELP (k = 4, α = .83), FTFIP (k = 4, α = .70). The 12 items were selected for the main study (see the items in the Appendix).

3. The Main Study

3.1 Purpose The purpose of the main study was to establish the inventory. The present study performed structure equation modeling with the measurement model and multiple sample analyses to test the measurement invariance across genders.

3.2 Participants In total, 909 university students who learned English as a foreign language participated in the main study. All the participants were sampled from the same national university, and they were all freshmen with a couple of exceptions. Their departments were variously ranged: medicine (19.58%), dental (8.91%), pharmaceutical (5.28%), law (14.85%), education (11.55%), engineering (26.51%), biology (10.89%), and science (2.42%). The gender ratios were as follows: men (57.21%), women (42.13%), and transgender (0.66%). The responses of the transgendered participants were not submitted to multiple sample analyses. However, this does not imply that they were not of interest; this decision was based on purely statistical considerations. No participant attended both studies. Most of the participants (91.31%) reported their scores in the Test of English for International Communication (TOEIC), which they had taken during an academic semester. The distribution is shown in Figure 3. The data can be modeled by fitting the normal distribution at μ = 522.58, σ = 101.33, with the goodness of fit at AIC = 10025, or Gamma distribution at α = 26.27, β = 0.05, with the goodness of fit at AIC = 10015.

3.3 Materials The 12 items selected in the preliminary study were adopted. Each subscale had 4 items. Namely, items 1 to 4 referred to CLP, 4 to 8 to ELP, and 9 to 12 to FTFIP. The present study administered the items through a Likert scale, like in the preliminary study.

199

Page 8: Foreign Language Learning Mode Preferences of Japanese

Figure 3. The empirical distribution of self-reported TOEIC scores (n = 830).

3.4 Procedure The participants answered the items on an online form. There were no missing values. All the item responses were visualized using multiple histograms. Then, the moment statistics and variance and covariance matrices were reported. Next, we fit the measurement model to the observations using structure equation modeling. The standardized coefficients and goodness of fit under the models were reported. We also calculated reliability coefficients for each subscale. Then, multiple sample analyses were conducted. All the procedures were performed on R, statistical software (R Core Team, 2016) with the packages psych (Revelle, 2016), lavaan (Rosseel, 2012) and semTools (semTools contributors, 2016).

3.5 Results The empirical distribution of all the item responses are shown in Figure 4, and based on them, we can confirm the normality of each variable. Table 1 shows the moment statistics, and Table 2 shows the variance and covariance matrices.

Figure 4. Multiple histograms showing the distribution of the responses.

Self-Reported TOEIC Score

Frequency

200 400 600 800

050

100

150

1 2 3 4 5 6 7

Item 1

Frequency

0100

200

300

1 2 3 4 5 6 7

Item 2

Frequency

0100

200

300

1 2 3 4 5 6 7

Item 3

Frequency

0100

200

300

1 2 3 4 5 6 7

Item 4

Frequency

0100

200

300

1 2 3 4 5 6 7

Item 5

Frequency

0100

200

300

1 2 3 4 5 6 7

Item 6

Frequency

0100

200

300

1 2 3 4 5 6 7

Item 7

Frequency

0100

200

300

1 2 3 4 5 6 7

Item 8

Frequency

0100

200

300

1 2 3 4 5 6 7

Item 9

Frequency

0100

200

300

1 2 3 4 5 6 7

Item 10

Frequency

0100

200

300

1 2 3 4 5 6 7

Item 11

Frequency

0100

200

300

1 2 3 4 5 6 7

Item 12

Frequency

0100

200

300

Table 1. The Moment Statistics of the Item Responses

M SD Skewness KurtosisItem 1 4.18 1.69 -0.15 -0.79Item 2 3.93 1.54 0.00 -0.54Item 3 4.19 1.52 -0.12 -0.56Item 4 4.01 1.56 -0.04 -0.65Item 5 4.37 1.44 -0.36 -0.19Item 6 4.12 1.43 -0.13 -0.38Item 7 4.09 1.47 -0.16 -0.46Item 8 4.17 1.54 -0.17 -0.52Item 9 4.32 1.38 -0.32 -0.22Item 10 4.57 1.35 -0.38 0.05Item 11 4.99 1.27 -0.57 0.51Item 12 4.30 1.35 -0.28 -0.14

Table 2. The Variance and Covariance Matrices of the Item Responses

I1 I2 I3 I4 I5 I6 I7 I8 I9 I10 I11 I12I1 2.85 1.89 1.57 1.83 0.22 0.22 0.26 0.31 -0.07 0.18 0.36 0.32I2 1.89 2.37 1.62 1.89 0.20 0.34 0.34 0.33 -0.04 0.13 0.31 0.28I3 1.57 1.62 2.32 1.84 0.16 0.19 0.19 0.30 0.09 0.32 0.39 0.56I4 1.83 1.89 1.84 2.44 0.23 0.25 0.24 0.31 0.01 0.15 0.35 0.40I5 0.22 0.20 0.16 0.23 2.07 1.40 1.41 1.51 -0.10 0.12 0.25 0.28I6 0.22 0.34 0.19 0.25 1.40 2.06 1.70 1.53 0.16 0.20 0.28 0.33I7 0.26 0.34 0.19 0.24 1.41 1.70 2.16 1.59 0.11 0.23 0.30 0.35I8 0.31 0.33 0.30 0.31 1.51 1.53 1.59 2.38 -0.03 0.17 0.26 0.28I9 -0.07 -0.04 0.09 0.01 -0.10 0.16 0.11 -0.03 1.91 1.02 0.70 0.83I10 0.18 0.13 0.32 0.15 0.12 0.20 0.23 0.17 1.02 1.82 0.95 0.83I11 0.36 0.31 0.39 0.35 0.25 0.28 0.30 0.26 0.70 0.95 1.61 0.63I12 0.32 0.28 0.56 0.40 0.28 0.33 0.35 0.28 0.83 0.83 0.63 1.81Note. “I” represents Item. The bold figures signify the variance and covariance values among the items in the same subscale.

In order to examine the factorial validity, we conducted structure equation modeling. The estimation method was maximum likelihood estimation. Standard errors were calculated using nonparametric bootstrapping at B = 1,000. The results showed a sufficient goodness of fit, CFI

200

Page 9: Foreign Language Learning Mode Preferences of Japanese

Figure 3. The empirical distribution of self-reported TOEIC scores (n = 830).

3.4 Procedure The participants answered the items on an online form. There were no missing values. All the item responses were visualized using multiple histograms. Then, the moment statistics and variance and covariance matrices were reported. Next, we fit the measurement model to the observations using structure equation modeling. The standardized coefficients and goodness of fit under the models were reported. We also calculated reliability coefficients for each subscale. Then, multiple sample analyses were conducted. All the procedures were performed on R, statistical software (R Core Team, 2016) with the packages psych (Revelle, 2016), lavaan (Rosseel, 2012) and semTools (semTools contributors, 2016).

3.5 Results The empirical distribution of all the item responses are shown in Figure 4, and based on them, we can confirm the normality of each variable. Table 1 shows the moment statistics, and Table 2 shows the variance and covariance matrices.

Figure 4. Multiple histograms showing the distribution of the responses.

Self-Reported TOEIC Score

Frequency

200 400 600 800

050

100

150

1 2 3 4 5 6 7

Item 1

Frequency

0100

200

300

1 2 3 4 5 6 7

Item 2

Frequency

0100

200

300

1 2 3 4 5 6 7

Item 3

Frequency

0100

200

300

1 2 3 4 5 6 7

Item 4

Frequency

0100

200

300

1 2 3 4 5 6 7

Item 5

Frequency

0100

200

300

1 2 3 4 5 6 7

Item 6

Frequency

0100

200

300

1 2 3 4 5 6 7

Item 7

Frequency

0100

200

300

1 2 3 4 5 6 7

Item 8

Frequency

0100

200

300

1 2 3 4 5 6 7

Item 9

Frequency

0100

200

300

1 2 3 4 5 6 7

Item 10

Frequency

0100

200

300

1 2 3 4 5 6 7

Item 11

Frequency

0100

200

300

1 2 3 4 5 6 7

Item 12

Frequency

0100

200

300

Table 1. The Moment Statistics of the Item Responses

M SD Skewness KurtosisItem 1 4.18 1.69 -0.15 -0.79Item 2 3.93 1.54 0.00 -0.54Item 3 4.19 1.52 -0.12 -0.56Item 4 4.01 1.56 -0.04 -0.65Item 5 4.37 1.44 -0.36 -0.19Item 6 4.12 1.43 -0.13 -0.38Item 7 4.09 1.47 -0.16 -0.46Item 8 4.17 1.54 -0.17 -0.52Item 9 4.32 1.38 -0.32 -0.22Item 10 4.57 1.35 -0.38 0.05Item 11 4.99 1.27 -0.57 0.51Item 12 4.30 1.35 -0.28 -0.14

Table 2. The Variance and Covariance Matrices of the Item Responses

I1 I2 I3 I4 I5 I6 I7 I8 I9 I10 I11 I12I1 2.85 1.89 1.57 1.83 0.22 0.22 0.26 0.31 -0.07 0.18 0.36 0.32I2 1.89 2.37 1.62 1.89 0.20 0.34 0.34 0.33 -0.04 0.13 0.31 0.28I3 1.57 1.62 2.32 1.84 0.16 0.19 0.19 0.30 0.09 0.32 0.39 0.56I4 1.83 1.89 1.84 2.44 0.23 0.25 0.24 0.31 0.01 0.15 0.35 0.40I5 0.22 0.20 0.16 0.23 2.07 1.40 1.41 1.51 -0.10 0.12 0.25 0.28I6 0.22 0.34 0.19 0.25 1.40 2.06 1.70 1.53 0.16 0.20 0.28 0.33I7 0.26 0.34 0.19 0.24 1.41 1.70 2.16 1.59 0.11 0.23 0.30 0.35I8 0.31 0.33 0.30 0.31 1.51 1.53 1.59 2.38 -0.03 0.17 0.26 0.28I9 -0.07 -0.04 0.09 0.01 -0.10 0.16 0.11 -0.03 1.91 1.02 0.70 0.83I10 0.18 0.13 0.32 0.15 0.12 0.20 0.23 0.17 1.02 1.82 0.95 0.83I11 0.36 0.31 0.39 0.35 0.25 0.28 0.30 0.26 0.70 0.95 1.61 0.63I12 0.32 0.28 0.56 0.40 0.28 0.33 0.35 0.28 0.83 0.83 0.63 1.81Note. “I” represents Item. The bold figures signify the variance and covariance values among the items in the same subscale.

In order to examine the factorial validity, we conducted structure equation modeling. The estimation method was maximum likelihood estimation. Standard errors were calculated using nonparametric bootstrapping at B = 1,000. The results showed a sufficient goodness of fit, CFI

201

Page 10: Foreign Language Learning Mode Preferences of Japanese

= .959, TLI = .946, RMSEA = .074, with 90% CI [.066, .082], SRMR = .053, AIC = 33382, BIC = 33512. The estimated standardized coefficients can be seen in the path diagram in Figure 5. All the coefficients were statistically significant at α = .05. The covariance among the latent variables showed lower values. This means that this inventory is practically informative.

Figure 5. Path diagram showing the estimated parameters (n = 909).

In order to attain more practicality, the present study also examined a statistically simpler model, setting equality constraints on all of the factor loading values. This alternative model was submitted to structure equation modeling. The results showed a slightly degraded, but practically acceptable, goodness of fit, CFI = .947, TLI = .942, RMSEA = .077, with 90% CI [.069, .084], SRMR = .062, AIC = 33442, BIC = 33528. This statistical evidence can justify the use of summated scale scores instead of calculating factor scores. Furthermore, reliability coefficients for the subscales are summarized in Table 3. All the subscales signify sufficient reliability coefficients.

Table 3. Reliability Coefficients and their 95% Boundaries

k α 95% Lower Boundary 95% Upper BoundaryCLP 4 .91 .90 .92ELP 4 .90 .89 .91FTFIP 4 .77 .75 .78All 12 .80 .80 .82

CLP

I1 I2 I4I3

ELP

I5 I6 I8I7

FTFIP

I9 I10 I12I11

0.39 0.24 0.32 0.17 0.40 0.21 0.21 0.36 0.56 0.34 0.56 0.65

.78 .87 .82 .91 .77 .89 .89 .80 .66 .81 .66 .60

0.17

0.15 0.17

Using a series of multiple sample analyses, the present study tested four types of measurement invariance with their respective sets of equality constraints, as described below: (a) configural invariance; (b) equal loadings, or so-called metric invariance; (c) equal intercepts or scalar invariance; and (d) equal mean structure, or full uniqueness measurement invariance (see Milfont & Fischer, 2015, for a methodological introduction). Only when scalar invariance or full uniqueness measurement invariance is satisfied, the users of this inventory become capable of comparing the factor scores of the subscales. The results of multiple sample analyses are shown in Table 4. In the table, all of the models show similar values of goodness of fit indices, but the present study found that the scalar invariance was satisfied. This means that users can apply the same measurement model across genders and compare their scores.

Table 4. Summary of Multiple Sample Analyses (n = 903) Models df χ2 CFI RMSEA AIC BIC(a) Configural 101 385.68 .953 .078 33097 33472(b) Metric 111 398.66 .952 .076 33092 33424(c) Scalar 120 417.83 .951 .074 33093 33382(d) Full Uniqueness 123 441.82 .947 .076 33111 33385

4. Discussion

The empirical evidence reported so far is sufficient to establish a new, practical, and statistically sound inventory to measure LMS. We call this inventory a Foreign Language Learning Mode Preference Inventory (FLLMPI) Ver. 1. This inventory has some desirable statistical features: (a) it has higher reliability, (b) the factorial structure fits well to observations and is analytically steady, (c) the use of the summated scale scores is practically sufficient, (d) scalar invariance among genders is realized, (e) the inventory consists of a small number of items, and (f) the covariance among the subscales shows lower values. Although this new inventory is statistically sound and easy to handle in practice, there remain many conceptual and technical issues to be discussed. First, the participants in the present study were sampled from a single university. It is sure that the sample was fairly large and spanned across a variety of departments. However, this sampling will never be a guarantee of generalization to other types of university students. It is necessary to conduct a survey that investigates the reproducibility of the factorial structure. Additionally, most of the validity evidence has not been addressed yet. For instance, future studies should examine the direct causality between LMP and learners’ decision-making. Likewise, correlational evidence with some other variables such as learning styles (e.g., Kolb,

202

Page 11: Foreign Language Learning Mode Preferences of Japanese

= .959, TLI = .946, RMSEA = .074, with 90% CI [.066, .082], SRMR = .053, AIC = 33382, BIC = 33512. The estimated standardized coefficients can be seen in the path diagram in Figure 5. All the coefficients were statistically significant at α = .05. The covariance among the latent variables showed lower values. This means that this inventory is practically informative.

Figure 5. Path diagram showing the estimated parameters (n = 909).

In order to attain more practicality, the present study also examined a statistically simpler model, setting equality constraints on all of the factor loading values. This alternative model was submitted to structure equation modeling. The results showed a slightly degraded, but practically acceptable, goodness of fit, CFI = .947, TLI = .942, RMSEA = .077, with 90% CI [.069, .084], SRMR = .062, AIC = 33442, BIC = 33528. This statistical evidence can justify the use of summated scale scores instead of calculating factor scores. Furthermore, reliability coefficients for the subscales are summarized in Table 3. All the subscales signify sufficient reliability coefficients.

Table 3. Reliability Coefficients and their 95% Boundaries

k α 95% Lower Boundary 95% Upper BoundaryCLP 4 .91 .90 .92ELP 4 .90 .89 .91FTFIP 4 .77 .75 .78All 12 .80 .80 .82

CLP

I1 I2 I4I3

ELP

I5 I6 I8I7

FTFIP

I9 I10 I12I11

0.39 0.24 0.32 0.17 0.40 0.21 0.21 0.36 0.56 0.34 0.56 0.65

.78 .87 .82 .91 .77 .89 .89 .80 .66 .81 .66 .60

0.17

0.15 0.17

Using a series of multiple sample analyses, the present study tested four types of measurement invariance with their respective sets of equality constraints, as described below: (a) configural invariance; (b) equal loadings, or so-called metric invariance; (c) equal intercepts or scalar invariance; and (d) equal mean structure, or full uniqueness measurement invariance (see Milfont & Fischer, 2015, for a methodological introduction). Only when scalar invariance or full uniqueness measurement invariance is satisfied, the users of this inventory become capable of comparing the factor scores of the subscales. The results of multiple sample analyses are shown in Table 4. In the table, all of the models show similar values of goodness of fit indices, but the present study found that the scalar invariance was satisfied. This means that users can apply the same measurement model across genders and compare their scores.

Table 4. Summary of Multiple Sample Analyses (n = 903) Models df χ2 CFI RMSEA AIC BIC(a) Configural 101 385.68 .953 .078 33097 33472(b) Metric 111 398.66 .952 .076 33092 33424(c) Scalar 120 417.83 .951 .074 33093 33382(d) Full Uniqueness 123 441.82 .947 .076 33111 33385

4. Discussion

The empirical evidence reported so far is sufficient to establish a new, practical, and statistically sound inventory to measure LMS. We call this inventory a Foreign Language Learning Mode Preference Inventory (FLLMPI) Ver. 1. This inventory has some desirable statistical features: (a) it has higher reliability, (b) the factorial structure fits well to observations and is analytically steady, (c) the use of the summated scale scores is practically sufficient, (d) scalar invariance among genders is realized, (e) the inventory consists of a small number of items, and (f) the covariance among the subscales shows lower values. Although this new inventory is statistically sound and easy to handle in practice, there remain many conceptual and technical issues to be discussed. First, the participants in the present study were sampled from a single university. It is sure that the sample was fairly large and spanned across a variety of departments. However, this sampling will never be a guarantee of generalization to other types of university students. It is necessary to conduct a survey that investigates the reproducibility of the factorial structure. Additionally, most of the validity evidence has not been addressed yet. For instance, future studies should examine the direct causality between LMP and learners’ decision-making. Likewise, correlational evidence with some other variables such as learning styles (e.g., Kolb,

203

Page 12: Foreign Language Learning Mode Preferences of Japanese

2015), beliefs (Fushino, 2010), and attitudes (Kawaguchi & Kusanagi, 2016) should be accumulated to strengthen the LMS framework. The framework itself should be more theoretically sophisticated, despite its practical orientation. Regarding measurement invariance, scalar invariance among genders is only one of the numerous types of measurement invariance. Invariance across other groups, such as school levels, departments, academic year, and teachers, remained to be untested. Additionally, time-series stability of LMS can be an interest in future studies.

5. Conclusion and Pedagogical Significance

Although there are some issues that remain unsolved as discussed above, the FLLMPI developed in the present study will be a useful tool for teaching practitioners, specifically those in higher education. For instance, FLLMPI can be used as prior information for classroom placement. Currently, most classroom placements in practice are administrated using proficiency data, which are quite likely filled with larger errors. Through the use of FLLMPI, the teachers can refer to the additional information. Curriculum development and course design are also a promising area for the application of FLLMPI. Since learning modes have been diversified recently, as discussed in the Background section, FLLMPI can be an important resource to learn the needs of students. Thus, it can also be applied to prior classroom size determination. These applications will enable teaching practitioners to design more efficient learning environments. Of course, from a student point of view, it is informative to know one’s own LMP, since such meta-cognitive activity may promote learner autonomy. However, the most important point is that we should never miss the ethical issues behind the labeling and categorization of students by an arbitrary trait. Every single student has his or her own personality, aptitudes, needs, thoughts, and feelings. Researchers and teachers should not forget that LMS, or any other learner variable, is only a very small piece of students’ information. Treating students as statistical values greatly diverges from the purpose of education. The present study attached users’ practical guidelines for FLLMPI, which was written in Japanese, to the Appendix section. Additionally, URLs to the other supplementary data, including (a) the initial item pool, (b) R code, (c) other statistical data, and (d) a simulated data set that is parametrically equal to the data of the present study, are available. The present study fully agrees with the open science movement and reproducible research.

References

Beard, L. A., & Harper, C. (2002). Student perceptions of online versus on campus instruction. Education, 122, 658 –663.

Bernard, R. M., Abrami, P. C., Borokhovski, E., Wade, C. A., Tamim, R. M., Surkes, M. A., & Bethel, E. C. (2009). A meta-analysis of three types of interaction treatments in distance education. Review of Educational Research, 79, 1243 –1289.

Boghikian-Whitby, S., & Mortagy, Y. (2016). Student preferences and performance in online and face-to-face classes using Myers-Briggs Indicator: A longitudinal quasi-experimental study. Issues in Informing Science and Information Technology, 13, 89 –109.

Borsboom, D., Mellenbergh, G. J., & Van Heerden, J. (2003). The theoretical status of latent variables. Psychological Review, 110, 203 –219.

Dunn, R. (1984). Learning style: State of the science. Theory into Practice, 23, 10 –19. Dunn, R. S., & Dunn, K. J. (1978). Teaching students through their individual learning styles: A

practical approach. Upper Saddle River, New Jersey: Prentice Hall. Edwards, J. R., & Bagozzi, R. P. (2000). On the nature and direction of relationships between

constructs and measures. Psychological Methods, 5, 155–174. Fushino, K. (2010). Causal relationships between communication confidence, beliefs about group

work, and willingness to communicate in foreign language group work. TESOL quarterly, 44, 700–724.

Johnson, D. W., & Johnson, R. T. (2008). Cooperative learning. Oxford: Blackwell Publishing Ltd.

Johnson, D. W., Johnson, R. T., & Stanne, M. B. (2000). Cooperative learning methods: A meta-analysis. Retrieved from http://www.cooperation.org/pages/cl-methods.html

Johnson, D. W., Maruyama, G., Johnson, R., Nelson, D., & Skon, L. (1981). Effects of cooperative, competitive, and individualistic goal structures on achievement: A meta-analysis. Psychological Bulletin, 89, 47–62.

Johnson, S. D., Aragon, S. R., Shaik, N., & Palma-Rivas, N. (2000). Comparative analysis of learner satisfaction and learning outcomes in online and face-to-face learning environments. Journal of Interactive Learning Research, 11, 29–49.

Kagan, S. (1989). The structural approach to cooperative learning. Educational Leadership, 47(4), 12–15.

Kawaguchi, Y., & Kusanagi, K. (2015). Nihonjin eigogakusyusya o taisyou to shita atarashii konpyuta shien gogakugakusyuu taido no sakusei. [Developing a new scale for computer assisted language learning attitudes]. Language Education & Technology, 52, 257–277.

Kishore, M., Tabrizi, M. N., Ozan, E., Aziz, S., & Wuensch, K. L. (2009). Correlates of student preference for online instruction over face-to-face instruction. E-Learning and Digital Media, 6, 400–415.

204

Page 13: Foreign Language Learning Mode Preferences of Japanese

2015), beliefs (Fushino, 2010), and attitudes (Kawaguchi & Kusanagi, 2016) should be accumulated to strengthen the LMS framework. The framework itself should be more theoretically sophisticated, despite its practical orientation. Regarding measurement invariance, scalar invariance among genders is only one of the numerous types of measurement invariance. Invariance across other groups, such as school levels, departments, academic year, and teachers, remained to be untested. Additionally, time-series stability of LMS can be an interest in future studies.

5. Conclusion and Pedagogical Significance

Although there are some issues that remain unsolved as discussed above, the FLLMPI developed in the present study will be a useful tool for teaching practitioners, specifically those in higher education. For instance, FLLMPI can be used as prior information for classroom placement. Currently, most classroom placements in practice are administrated using proficiency data, which are quite likely filled with larger errors. Through the use of FLLMPI, the teachers can refer to the additional information. Curriculum development and course design are also a promising area for the application of FLLMPI. Since learning modes have been diversified recently, as discussed in the Background section, FLLMPI can be an important resource to learn the needs of students. Thus, it can also be applied to prior classroom size determination. These applications will enable teaching practitioners to design more efficient learning environments. Of course, from a student point of view, it is informative to know one’s own LMP, since such meta-cognitive activity may promote learner autonomy. However, the most important point is that we should never miss the ethical issues behind the labeling and categorization of students by an arbitrary trait. Every single student has his or her own personality, aptitudes, needs, thoughts, and feelings. Researchers and teachers should not forget that LMS, or any other learner variable, is only a very small piece of students’ information. Treating students as statistical values greatly diverges from the purpose of education. The present study attached users’ practical guidelines for FLLMPI, which was written in Japanese, to the Appendix section. Additionally, URLs to the other supplementary data, including (a) the initial item pool, (b) R code, (c) other statistical data, and (d) a simulated data set that is parametrically equal to the data of the present study, are available. The present study fully agrees with the open science movement and reproducible research.

References

Beard, L. A., & Harper, C. (2002). Student perceptions of online versus on campus instruction. Education, 122, 658 –663.

Bernard, R. M., Abrami, P. C., Borokhovski, E., Wade, C. A., Tamim, R. M., Surkes, M. A., & Bethel, E. C. (2009). A meta-analysis of three types of interaction treatments in distance education. Review of Educational Research, 79, 1243 –1289.

Boghikian-Whitby, S., & Mortagy, Y. (2016). Student preferences and performance in online and face-to-face classes using Myers-Briggs Indicator: A longitudinal quasi-experimental study. Issues in Informing Science and Information Technology, 13, 89 –109.

Borsboom, D., Mellenbergh, G. J., & Van Heerden, J. (2003). The theoretical status of latent variables. Psychological Review, 110, 203 –219.

Dunn, R. (1984). Learning style: State of the science. Theory into Practice, 23, 10 –19. Dunn, R. S., & Dunn, K. J. (1978). Teaching students through their individual learning styles: A

practical approach. Upper Saddle River, New Jersey: Prentice Hall. Edwards, J. R., & Bagozzi, R. P. (2000). On the nature and direction of relationships between

constructs and measures. Psychological Methods, 5, 155–174. Fushino, K. (2010). Causal relationships between communication confidence, beliefs about group

work, and willingness to communicate in foreign language group work. TESOL quarterly, 44, 700–724.

Johnson, D. W., & Johnson, R. T. (2008). Cooperative learning. Oxford: Blackwell Publishing Ltd.

Johnson, D. W., Johnson, R. T., & Stanne, M. B. (2000). Cooperative learning methods: A meta-analysis. Retrieved from http://www.cooperation.org/pages/cl-methods.html

Johnson, D. W., Maruyama, G., Johnson, R., Nelson, D., & Skon, L. (1981). Effects of cooperative, competitive, and individualistic goal structures on achievement: A meta-analysis. Psychological Bulletin, 89, 47–62.

Johnson, S. D., Aragon, S. R., Shaik, N., & Palma-Rivas, N. (2000). Comparative analysis of learner satisfaction and learning outcomes in online and face-to-face learning environments. Journal of Interactive Learning Research, 11, 29–49.

Kagan, S. (1989). The structural approach to cooperative learning. Educational Leadership, 47(4), 12–15.

Kawaguchi, Y., & Kusanagi, K. (2015). Nihonjin eigogakusyusya o taisyou to shita atarashii konpyuta shien gogakugakusyuu taido no sakusei. [Developing a new scale for computer assisted language learning attitudes]. Language Education & Technology, 52, 257–277.

Kishore, M., Tabrizi, M. N., Ozan, E., Aziz, S., & Wuensch, K. L. (2009). Correlates of student preference for online instruction over face-to-face instruction. E-Learning and Digital Media, 6, 400–415.

205

Page 14: Foreign Language Learning Mode Preferences of Japanese

Kolb, D. A. (2007). The Kolb Learning Style Inventory—Version 3.1: LSI workbook. Boston: Hay Learning Transformations.

Kolb, D. A. (2015). Experiential learning: experience as the source of learning and development(2nd ed.). Upper Saddle River, New Jersey: Pearson Education.

Lynn, C. (1990). A critique of the research on learning styles. Educational Leadership, 48(2), 50–56.

Means, B., Toyama, Y., Murphy, R., Bakia, M., & Jones, K. (2009). Evaluation of evidence-based practices in online learning: A meta-analysis and review of online learning studies. US Department of Education.

Means, B., Toyama, Y., Murphy, R., & Baki, M. (2013). The effectiveness of online and blended learning: A meta-analysis of the empirical literature. Teachers College Record, 115(3), 1–47.

Milfont, T. L., & Fischer, R. (2015). Testing measurement invariance across groups: Applications in cross-cultural research. International Journal of Psychological Research, 3, 111–130.

Neuhauser, C. (2010). Learning style and effectiveness of online and face-to-face instruction. The American Journal of Distance Education, 16, 99–113.

Prince, M. (2004). Does active learning work? A review of the research. Journal of Engineering Education, 93, 223–231.

R Core Team (2016). R: A language and environment for statistical computing. R Foundation for Statistical Computing. Vienna, Austria. Retrieved from https://www.R-project.org/.

Revelle, W. (2016). psych: Procedures for personality and psychological research, Northwestern University, Evanston, Illinois, USA.

http://CRAN.R-project.org/package=psych Version = 1.6.6. Rosseel, Y. (2012). lavaan: An R Package for Structural Equation Modeling. Journal of Statistical

Software, 48(2), 1–36. Retrieved from http://www.jstatsoft.org/v48/i02/ semTools Contributors. (2016). semTools: Useful tools for structural equation modeling. R

package version 0.4-13. Retrieved from http://cran.r-project.org/package=semTools Snow, R. E. (1991). Aptitude-treatment interaction as a framework for research on individual

differences in psychotherapy. Journal of Consulting and Clinical Psychology, 59(2), 205–216. Winke, P. M. (2007). The psychology of the language learner: Individual differences in second

language acquisition. Studies in Second Language Acquisition, 29, 143–144. Zhao, Y. (2003). Recent developments in technology and language learning: A literature review

and meta-analysis. CALICO journal, 21, 7–27.

Appendices

Appendix A. Supplementary materials are available online at https://sites.google.com/site/kusanagikuni/home/

Appendix B.

尺度使用の手引き

尺度名: (和文)外国語学習形態選好性尺度 ver. 1 (英文) Foreign Language Learning Mode Preference Inventory (FLLMPI)

適用母集団:日本の大学生のみを想定している。この母集団以外に適用した場合の因子構造の再

現は保証しない。また,日本の大学に在籍する学習者を対象としているため,学習者の国籍は基

本的に問わない。

測定概念:外国語の学習形態に関する選好性を測定する。三因子からなり,因子CLPは協同学習

に対する選好性を,ELPはe-ラーニングに関する選好性を,FTFIPは,対面形式の学習に関する選

好性を測定する。これら三因子は非常に弱い相関をもつ。

信頼性と妥当性:各因子にそれぞれ高い信頼性係数(およそ.70から.90程度)が見られ,十分な

内的一貫性があるといえる。ただし,再検査信頼性の検討はなされていない。測定モデルが高い

適合度を示したという点において一種の妥当性の証拠がある。

測定不変性:男女間において,配置不変性,因子負荷量と切片の不変性が成立しているため,男

女間で同一の採点方法を適用し,男女間の因子得点差について検討することができる。

運用と採点方法:7件法のリッカート尺度とし,各項目について「以下の文章について,自分に

どれだけよくあてはまるか選んでください」,またはこれに類する指示文で運用すること。簡易

的に各因子の合計尺度得点を求めて因子得点の代用をすることは,実務上十分である統計的証拠

がある。項目の改変をしないかぎり,質問紙自体を自作することは構わない。

尺度使用の許諾:項目を改変,翻訳する場合を除き,出典を示すことで使用を許可する。使用に

関する第一著者への連絡は特段要しない。

項目の改変および翻訳:項目の改変や翻訳を行う際,以下の連絡先にその旨を伝えること。また,

本尺度の項目を別の尺度の項目と合成し,新しい尺度として発表することについても,連絡を要

する。他言語への翻訳に関わる諸手続きについては,本研究の責任著者が協力を約束する。

出典と連絡先:本研究を引用すること。 以下が本尺度に関する照会先である。

草薙邦広(広島大学外国語教育研究センター)

Email: [email protected] The English version is available online.

206

Page 15: Foreign Language Learning Mode Preferences of Japanese

Kolb, D. A. (2007). The Kolb Learning Style Inventory—Version 3.1: LSI workbook. Boston: Hay Learning Transformations.

Kolb, D. A. (2015). Experiential learning: experience as the source of learning and development(2nd ed.). Upper Saddle River, New Jersey: Pearson Education.

Lynn, C. (1990). A critique of the research on learning styles. Educational Leadership, 48(2), 50–56.

Means, B., Toyama, Y., Murphy, R., Bakia, M., & Jones, K. (2009). Evaluation of evidence-based practices in online learning: A meta-analysis and review of online learning studies. US Department of Education.

Means, B., Toyama, Y., Murphy, R., & Baki, M. (2013). The effectiveness of online and blended learning: A meta-analysis of the empirical literature. Teachers College Record, 115(3), 1–47.

Milfont, T. L., & Fischer, R. (2015). Testing measurement invariance across groups: Applications in cross-cultural research. International Journal of Psychological Research, 3, 111–130.

Neuhauser, C. (2010). Learning style and effectiveness of online and face-to-face instruction. The American Journal of Distance Education, 16, 99–113.

Prince, M. (2004). Does active learning work? A review of the research. Journal of Engineering Education, 93, 223–231.

R Core Team (2016). R: A language and environment for statistical computing. R Foundation for Statistical Computing. Vienna, Austria. Retrieved from https://www.R-project.org/.

Revelle, W. (2016). psych: Procedures for personality and psychological research, Northwestern University, Evanston, Illinois, USA.

http://CRAN.R-project.org/package=psych Version = 1.6.6. Rosseel, Y. (2012). lavaan: An R Package for Structural Equation Modeling. Journal of Statistical

Software, 48(2), 1–36. Retrieved from http://www.jstatsoft.org/v48/i02/ semTools Contributors. (2016). semTools: Useful tools for structural equation modeling. R

package version 0.4-13. Retrieved from http://cran.r-project.org/package=semTools Snow, R. E. (1991). Aptitude-treatment interaction as a framework for research on individual

differences in psychotherapy. Journal of Consulting and Clinical Psychology, 59(2), 205–216. Winke, P. M. (2007). The psychology of the language learner: Individual differences in second

language acquisition. Studies in Second Language Acquisition, 29, 143–144. Zhao, Y. (2003). Recent developments in technology and language learning: A literature review

and meta-analysis. CALICO journal, 21, 7–27.

Appendices

Appendix A. Supplementary materials are available online at https://sites.google.com/site/kusanagikuni/home/

Appendix B.

尺度使用の手引き

尺度名: (和文)外国語学習形態選好性尺度 ver. 1 (英文) Foreign Language Learning Mode Preference Inventory (FLLMPI)

適用母集団:日本の大学生のみを想定している。この母集団以外に適用した場合の因子構造の再

現は保証しない。また,日本の大学に在籍する学習者を対象としているため,学習者の国籍は基

本的に問わない。

測定概念:外国語の学習形態に関する選好性を測定する。三因子からなり,因子CLPは協同学習

に対する選好性を,ELPはe-ラーニングに関する選好性を,FTFIPは,対面形式の学習に関する選

好性を測定する。これら三因子は非常に弱い相関をもつ。

信頼性と妥当性:各因子にそれぞれ高い信頼性係数(およそ.70から.90程度)が見られ,十分な

内的一貫性があるといえる。ただし,再検査信頼性の検討はなされていない。測定モデルが高い

適合度を示したという点において一種の妥当性の証拠がある。

測定不変性:男女間において,配置不変性,因子負荷量と切片の不変性が成立しているため,男

女間で同一の採点方法を適用し,男女間の因子得点差について検討することができる。

運用と採点方法:7件法のリッカート尺度とし,各項目について「以下の文章について,自分に

どれだけよくあてはまるか選んでください」,またはこれに類する指示文で運用すること。簡易

的に各因子の合計尺度得点を求めて因子得点の代用をすることは,実務上十分である統計的証拠

がある。項目の改変をしないかぎり,質問紙自体を自作することは構わない。

尺度使用の許諾:項目を改変,翻訳する場合を除き,出典を示すことで使用を許可する。使用に

関する第一著者への連絡は特段要しない。

項目の改変および翻訳:項目の改変や翻訳を行う際,以下の連絡先にその旨を伝えること。また,

本尺度の項目を別の尺度の項目と合成し,新しい尺度として発表することについても,連絡を要

する。他言語への翻訳に関わる諸手続きについては,本研究の責任著者が協力を約束する。

出典と連絡先:本研究を引用すること。 以下が本尺度に関する照会先である。

草薙邦広(広島大学外国語教育研究センター)

Email: [email protected] The English version is available online.

207

Page 16: Foreign Language Learning Mode Preferences of Japanese

Appendix C. 外国語学習形態選好性尺度

Foreign Language Learning Mode Preference Inventory Cooperative Learning Preference (CLP) I1 CLP 英語を学習するとき,ペアで会話したり,意見を交換したりする活動を好む。

I2 CLP 英語を学習するとき,グループで課題に取り組む形式の授業を好む。

I3 CLP 英語を学習するとき,クラスメートと一緒に課題に取り組む形式が効果的だと思う。

I4 CLP 英語を学習するとき,グループで力を合わせて課題に取り組むことが重要だと思う。

e-Learning Preference (ELP) I5 ELP 英語を学習するとき,コンピュータを使う勉強の仕方が自分に合っていると思う。

I6 ELP 英語を学習するとき,コンピュータ上で問題を解く活動が効果的だと思う。

I7 ELP 英語を学習するとき,コンピュータ上で行う演習を好む。

I8 ELP 英語を学習するとき,自分のコンピュータやタブレット・スマートフォンを使う勉強法を好む。

Face-to-Face Instruction Preference (FTFIP) I9 FTFIP 英語を学習するとき,黒板や資料を使った講義形式の授業を好む。

I10 FTFIP 英語を学習するとき,より深い知識が得られる講義形式の授業のほうが効果的だと思う。

I11 FTFIP 英語を学習するとき,先生が英語の知識を学生に伝えることが重要だと思う。

I12 FTFIP 英語を学習するとき,講義形式の授業が自分に合っていると思う。

How Do Japanese EFL Learners Elaborate Sentences

Complexly in L2 Writing? Focusing on Clause Types

Yoshito NISHIMURA

Graduate School, Nagoya University Yu TAMURA

Graduate School, Nagoya University / Japan Society for the Promotion of Science Kazuhisa HARA

Graduate School, Nagoya University

Abstract

Syntactic complexity has traditionally been measured by “macro-perspective measures,” which provide a paucity of angles from which to examine how learners actually elaborate a sentence. Mixing up a large variety of clauses with only “the number of clauses” or “subordination ratios” could lead to overlooking desired relationships between complexity and proficiency or task manipulation and linguistic performance. The current study attempted to capture the features of writing syntactically complex sentences through “micro-perspective measures,” such as clause types (main clauses, coordinate clauses, adverbial clauses, relative clauses, complement clauses, and non-finite clauses), and differences in learner proficiency levels. Participants were 28 Japanese EFL learners. Proficiency was operationalized via argumentative essay scores. To elicit syntactic knowledge, we offered the participants a specialized task that restricted the number of sentences in describing a plot consisting of six related illustrations. The results revealed that coordinate clauses, relative clauses, and non-finite clauses are more frequently produced in elaborating syntactic structures, irrespective of the writer’s proficiency level. Our findings also indicated that non-finite clauses are a more practical expedient for proficient learners than less proficient ones. Some pedagogical implications are also discussed.

1. Introduction

Inquiries into syntactic complexity (or syntactic development) have been of considerable interest in the field of second language (L2) writing in both longitudinal and cross-sectional studies. It is widely acknowledged that an L2 learner’s ability to arrange words into phrases or phrases into clauses demonstrates his or her capacity to manipulate the syntactic knowledge of a given target language (Crossley & McNamara, 2014; Ortega, 2003). Studies that have explored syntactic complexity have primarily focused on the syntactic variation and sophistication of the

208