NII-Electronic Library Service
Foreign
Language
Grammatical
CarefulnessScale:
Scale
Development
and
Its
Initial
Validation
Kunihiro
KUSANAGI
Junya
FUKUTA
Graduate
SbhooL
AidgqyaUitiversityJopanSbcietyfortheP)"omotion
ofStrience
Yusakti
KAWAGUCHI
Yu
TAMURA
Aki
GOTO
AI(ari
KURITA
Daisuke
MUROTA
GraduateSbhooL
Migaya
Uhiver:sityAbstract
Thisstudy aimed to
develop
and validate a scale tomeasure the GrarnmaticalCarefulness(GC)
of fbreignlanguagelearners.GC, by itsdefinition,
referstopsychological,
behavioral,and meta-cognitive traitsof alearner,
and itentails highlycontrolled, cautious, analytical, andtime-consuming
laiiguage
use. By conducting a sct ofquestionnairesurveys
targeting Japanese
jum'or
highschool, highschool, and university students(N
=
2,288),a ForeignLaiiguage GramrnaticalCarefulnessScale
(FLGCS)
with 14items,written inJapanese,was developedandtested
for
itsfactorialstmcture, reliability,convergent, content, and criterion validity. TheresultsdemonstratedthatFLGCS
yields
threefactors:(a)
phonological,
(b)
lexical-syntactic,and(c)
pragmaticcarefulness, with ahighreliability fbreach. The factorialvaliditywas also supported
by
using bothexploratory and confirmatory factoranalyses. Further,a set of analyses confirmed
various typesofvalidity. [[heevidence fbrthe validity isas foliows:
(a)
the1inguisticexperts(n
=
10)consistently
judged
thatalltheiternsproperlyreferred to each factorinan appropriate linguisticsense,
(b)
FLGCS showed correlations with learnerbeliefs,
consisteni withtheoretical
expectations, and
(c)
FLGCS correlated tothescores of aC-test,
and withthe
time tofinishthe C-test.Theapplical)ility ofFLGCSin
EFL teachngand research wi11also bediscussed,1.Background
largeUndoubtedly,
grammaticalperfbrrnanceof a secondlforeigri
language
shows a relativelyvariance among learnersincomparison with thatof theirfirstlanguage.Researchersin
Japan Society of English Language Education
NII-Electronic Library Service
JapanSociety ofEnglish Language Education
secondrforeign
language
Iearning
and teachinghavethusattempted fora longtimetodiscoverwhich
factors
explain thislarge
variance ingrammaticalperforrnanceamong individnals.Second languageacqpisition
(SLA)
theories,fbrinstance,haveoffered various frameworksrelating tothedevelopmentof
grammatical perfbrmance
(e.g.,
Segalowitz& Segalowitz,1993).Task-related factorsserve as another importantcontributor to the variance(e.g,,
Tarone,1985).Also,behavioraland
psychologicaltraitswithin individuals,such as aptitudes, attitudes, motivation,
beliefs,and anxiety are
yetmore importantfactors,It
is
selievident thatsuch factorsinteractina complex manner to
predictone's grammaticalperfbrmance, and theymay
jointly
affect theacquisition ofa second!fbreign lariguage.
Itisa consideral)le challenge tocover allof
these
factors
usingjust
a single framewotk, However,a concept which hasbeencommonly adopted inthefieldofcognitivepsychologyand
psychologicalmeasurement is possiblyone which captures the
inter-learner
variance ofgramrnatical
perfbrmance;
thatis,
Speed-Accuracy
Tradeoff(SAT),
SAT generallyproposesthat taskperfbrmance,with regard tovarious aspects, shows a very similarpattern,whereby faster
actions result inloweraccuracy, while slower actions
have
a higheraccuracy(e.g.,
Dennis&Evans, 1996;Goldhammer & Kroehne,2014;van derLinden,2007).Seeplot
(a)
in
Figure1,
which
graphicallyrepresents thisconcept. Taking an example in
the case of
gramrnatical perfbrmance,itcan besaid thatifataskisspeeded up ortimed,or
thetesttakerisinahurry,they
may exhibit reduced accuracy. On theother hand,ifthe
persondoing
the task can take enough
timeto accomplish thetask,helshecan
planand monitor histherlanguageuse deliberately.
'Ihis
basicallyleadstohigheraccuracy, Thistendencyof
SAT
may be common inmany aspects oflanguageuse and languageassessment.
Theoretically,thc SAT patterncxhibits functions
as in
(a)
of Figure 1.Howeyer, itis hypothesizedthat,inparalleltoski11development,thefunetionsfbrSAT
will also change, as in
(b)
inFigure1.[[hechanges ofthe functionsmay correspond tosome ofthe SLA theories,such asski11-acquisition theoryorautomatization
(e,g,,
Segalowitz
&
Segalowitz,1993).b98e
(a) {b)
'h,,'N,,,'1,`,,,...
bg8e
(c)
XL"-":::::.-::J
S-L
1 -LLu.--Ls--'--::
it9ge
hL;IL-hlRTittLlllxtLs-s
hsit;x:""----E)..
hxXL-"ei--
s--Speed Speed Speed
FVgurei,Schematic
plotsof theconcepts inSAT. Plot
(a)
shows thebasictradeoffpattem,Plot
(b)
explains thechanges of thefUnctionscaused by development.Plot(c)
shows the
inter-learner
variance ofthe compromisepoints.
NII-Electronic Library Service
Thereisyetanother importantviewpointinthisframework,Irrespectiveof development, irrter-learnervariance stillrernains, Thiscan becaptured
by
considering thecompromisepointof
SAT functionsof individuals.Assume thatone
personat leastina specific situation tends to
prioritize
accuracy, and another doesnot choose fbraccuracy, butspeed, Apartof such a variance
of compromise
pointsamong individnalscan be determinedby histherpsychologicaland
behaviora1traits.Thismay beresponsible fortherest ofthe variance,asin
(c)
inFigure1,Thepresentstudy calls
this
hypothetical
traitGrammaticalCarefulness(GC),
which we willviewas a constmct ina
psychometricsense.i The next section will introducetheconcqpts of ([}C
and review some of therelevarrt studies intheliterature.Thisarticle wi11report the
development
and
initial
validation ofa scaletomeasure thisnew constmct, GC,inthelattersections.2.
Grammatical
Carefulness
in
ForeignLanguage2.1Definitionof GrammaticalCarefulness
GC inforeignlaiiguagerefet;s toa behavioral,psychological,and rneta-cognitive traitof individualswhich
is
characterizedby
thefbllowing:
(a)
it
entailshighly
cautious, carefu1,deliberate,
intentional,
and analyticallanguage
use,fo)
it promotes relatively slow, time-consuming,and cogriitively demandinglanguageuse and leaming,which leadstoa higheraccuracy oflearners with some
grammaticaltasks,and
(c)
itcomplexly linkstoother inter-learnervarial)les, such as aptitudes, attitudes,motiyation,
beliefs,
and anxiety.The SAT
framewotk
regards GC as amoderator of thecompromisepoints,Inotherwerds,
itishypothesizedthatsomeone with a
higher
GC tendstoachieve higheraccuracy attheexpense of speed, and anotherpersonwitha lowerGC tendstoperfbrrnspeedily and lessaccurately. 2.2GrammaticalPerformanceand Inter-LearnerVariablesintheLiterature
A
couple of
previousstudies attempted toreveal therelationships
between
inter-learner
variables and
grammaticalper[Ebrmance.For instance,Krashen
(1978),
inhisearly theoreticalwotk, suggested thatthereare two
types
of second laiiguagelearnersi
monitoFunder-usens andmonitor-over-useng
(See
also Seliger,1980).Kormos(1999)
extended thisidea,and empiricallyinvestigatedtheeffects of thetwo
different
spealdng stylesofindividuals
(aecuracly-centered
andfluenay-centerec()
ontheir
selfcorrection behaviorsbyobserving Ll-HungarianEnglishlearners'speech
productionand
questionnaireanswers. Kormos lookedat the interplaysamong the
speaking styles and thefrequencyof selfcorrection behaJvior;theaceuracy-centered
participants
showed higherfrequenciesof selfcorrection behaviorthanthosewith afluency-centeredstyle.
One
other case isa recent classroom-based study condncted by Kartchavaand Ammar
(2014),
which investigatedthe effect of learnerbeliefsahoutcorrectivefeedl)ack
on noticingbehaviorsand leamingoutcomes. They reported that some beliefsmediated thefrequenciesof
noticing behaviors,butnot thelearningoutcomes.
Japan Society of English Language Education
NII-Electronic Library Service
JapanSociety ofEnglish Language Education
These studies focusedonly on very specific behaviorsand situations of learners'
perfbrmance;Kormos
(1999)
isconcerned with speaking, especially selfcorrection, whileKartchavaand Ammar
(2014)
are concerned with noticing,fbrinstance.The studiespickedup
very 1irnitedlearners'traits
(speakmg
styles and beliefsahoutcorrective feedbacki.More critically, some studiesdid
not consider themeasurement as constmcts. For instance,Kormos' study onlyused fivequestionnaireitemsfordeterminingthelearnersJspeaking styles,of which reliability and validity remained unclear, Kartchavaand Arnmar'sstudy, on theother hand,reported
the
reliabilityand
factorial
stmctures,but
ofcoursefurther
validation would bedesirable,Thepresentstudy takesa
broader
view,
wnh some methodological sophistication regardingtherelationships between
grammatical perfbrrrianceand inter-leainervariables. GC isa traitof
individualswhich
directly
linkstogrammaticalperformanceingeneral,unlike beliefsregarding
some specific
behaviors,
GC
alsohas
value with regard toits
relationshipswith
other typesofinter-learnervariables, such as beliefs.lnthelattersection, fbrinitialvalidation ofthe
developed
scale, we will report thatthe scales of GC are actually correlated to a
partof grammatical
perfbrmance,and we show thetheoreticallyplausiblerelationships with certajn
types
ofbeliefs.2.3Signhicanceof GC inTheoryand Practice
Estal)lishingGC as a
psychologicalconstmct and developingitsreliablemeasures would
be
a
promisingway to shed light
on various fieldsof study in
the
future.
Forinstance,
in
psycholinguisticexperiments, itcan
be
considered thatcontrolling and establishing thevariahlesofSAT-relatedinter-learnervariance such asGC will beboththeoreticallyand methodologically
lmportant,
Inclassroom-based studies,GC can beapplied tomeasure moderators of theoutcome of
students' leaming directly.Also,theimpactson GC of certain teaching metheds or treatments
would be
interesting
research topics.More specificallyinpractice,understating students' traits,
such asGC, can
providemuch informationtoteachersinthecontext ofcurriculum
design,
choice ofteaching materials,and everyday teachingpractice.A reliable,validated,and also easy-to-usepsychologicalscale ishencestrongly desired.
3.ScaleDevelopment
3.1The PreliminarySurvey
inorder to
develop
apsychologicalscaleto measure GC, we condncted two
questionnaire
surveys. The main
purposeof thefirst
tpreliminary)
survey was the initialitemselection forthescale. Intotal,169students intwo
privateuniversities participated,All
of the
participants
werefirst
yearstudents who took English classes. Theiracademic majors includedeconomics
and education. The survey was canied out at thebeginningof Apri1,2014,The
participants
answered thequestionnaireduringtheirEnglishclasses.
The
questionnaireconsisted of
<a)
a facesheet,(b)
NII-Electronic Library Service
GC itemswritten inJapanese
(k
=
40),as detailedbelow,and
(c)
anchor scales(k
=
14,Tanaka & Ellis,2003).Thesurveys were conducted inthestyle ofa Likertscale, fromone toseven
points.
The initialitempool
(k
=
40) was created
by
the authors. By referring to learners'retrospective
data
aboutlar)guage
use intheliterature,theauthors composed theitemsinconsultation with each other. Alltheitemsintheinitialitempoolare availahle online atthefirst
author's website
(see
Appendix),All the
data
were typed and were verified twice by theauthors. The response of oneparticipantwas excluded becauseofsome missing values;
the
number ofvalid responses was l68. Beforetheinitialitemselection byfactoranalysis, we excluded 18itemswhich obviously violatedthe
normaldistribution,
Since
the
goodnessof
fit
indicesoftheinitialexploratory factoranalysis(22
itemswere submitted), which extracted threefactors,were unfavorahle, 7itemswhich causeda misfitwere excluded, using a step-wise exploratory factoranalysis
(SEFA).
Hence,15items
out of40 were selected forthesecondary study. These15itemscan beseen intheAppendix,3.2The SecondarySurvey
The secondary
survey was undertaken fromMay toJune,2014,using theselected items
(k
=
15).Intotal2,288
participants
took
partin
the survey, and 2,098 answers, with no missing values or extraordinary responses, were analyzed, Theparticipantsconsisted ofjunior
high
schoolstudents sampled fromtwo
publicschools
(n
=-216),highschool students
(n
-1,078)fromtwo
normal
publicschools, and university students from11national, public,and privateuniversities
(n
=
804).Almostallof theuniversity students were first
yearstudents and hadvarious academic
majors. Juniorhighand highschool students,on theother hand,were sampled ina well-balanccd way intermsoftheir academic
years.
As inthepreliminarysurvey, the
participants
answered the
questionnaireintheirEnglish
classes. The
questionnaireconsisted of the facesheet and the 15 itemsrelated toGC. The
secondary survey used acomputer-readable
questionnaire,Ihe
data
were automaticallyprocessed
using scanners and computers. Then,theauthors yalidated theresponses twice byhand, Firstly,descriptivestatistics of all thevalid answers
(n
=
2,098)were calculated. Befbre
condncting factoranalyses, we confirrned thedistributionsof alltheresponses
(k
=
15),ItemNo. 7showed astrongly
biased
distribution,
which may negatively affectthe
factorial
stmcture.Hence,
theitemwas excluded. Then,we conducted an exploratory
factor
analysis todetermine
the
constmcts ofGC. Thisstudy also
perfbrmedconfirrnatory factoranalyses forthemodel inorder to
confirm itsfactorialvalidity.
The distributionsof theresponses are graphicallyrepresented inthemultiple histogramsin Figure2.Tal)le1summarizes correlation coeMcients and the variance!covariance matrix of the
itemresponses
(k
=
14,excludmg item7).
Japan Society of English Language Education
NII-Electronic Library Service
JapanSociety ofEnglishLanguage Education
[teml ltem2 1tem3 1tem4 1tem5
)h 7lt )h )h h
.O'g・liMlb.
:g!ma
S!IMk
S:IM]}
SEinlh..,
IS57 1357 1357 1357 1357
Rate Ftare Rate Hate Rate
ltem6 lteme 1tem9 ltemlO
h h )h }.
iElg:n..
Slma
iy'llMln-
lk.!l:[!b..
1357 135T 1357 1357
nste Mte date tete
1temM ltem12 ltem13 ltemd4 ltemrl5
S X )h pt X
SEI[!11)i
[1!lma
S':lua
Ig・!ima
i,-!in#I)
1357 la57 1357 1357 1357
fete Flate Rate Rate Rate
]Fligttre 2.Histogramsrepresenting thedistributionsof theresponses.
Table
1,
CbrrelationCbEz(i7cientsand P'Ziriancel(]ovarianceMatrix
ofthe
ftemResponses,ItemNo.1 2 345689101112131415
123・4568910111213l4152.01 .49 ,55 .50 .47 .43 .40 ・op.44 .36 .36 .40 .40 .35 1.062.35 .41 51 .32 .37 ,37 .33 .32 .39 .36 .39 .40 ,37 1,Ol,811,66,59 ,53 55 .43 .63 .63 .29 ,35 .44 .42 ,32 ,991,091.061,93 ,39 .40 .37 .48 .44 .26 ,34 .45 .41 .31 ,97 .731.00 .792.14 .37 .3S・" .48 .29 .36 .39 .37 .30 ,79.74.92.72.701.72.45.57.58.36.30.32.3528.83 .83 .81 .75 .75 .g62.13 .55 ,56 ,36 ,35 .38 ,42 .39 .82.671.07.87.85.991,051.73,71,33.34.40.40.36.82 .641.06 .80 ,93 .991,081221.72 .30 .37 .38 .39 ,31 .81 .95 ,59 .57 ,66 .74 ,82 ,68 .612.45 .63 .54 ,57 ,66 .79 .84 ,69 ,72 .81 .60 .78 .69 .741.502.32 .67 .68 .66 .Sl .87 ,81 ,90 .82 .61 .80 .76 .721211.462.06 .72 .61 .84 ,92 .81 .86 ,80 ,69 ,92 .79 .771.331,561,552,25 .61 .76 ,86 .62 .65 .67 .55 ,86 .72 ,631.591,541,341.402.32
7Vbte.Valueson theleftsiderepresent correlation coeencierrts,right
for
covariance.Table2
presents
thedescriptive
statistics,thesurnmary of thefhctoranalysis, and the
reliability foreach factor.The factoranalysis extracted three factors.The
goodnessof fitindices
were demonstratedtobenot favorable,
btrt
at an acceptahle level,x2(52),==285.44,p
< .Ol,TLI
=
,93,RMSEA =
.08,and with a 90 % confidence interval
(CI)
[.07,
.08].ItemsNo. 11to1582
NII-Electronic Library Service
showed
higher
loadings
fbrFactor1,and theseitemswere allrelated tothecarefulness towardsthe
phonologicalaspects ofgrammaticalperfbrmance.ItemsNo,6to1OloadedFactor2heavily,
and theseoorresponded tothelexical-syntacticaspects, Therest of theitems
(from
ItemNo. 1to5)
showed relevancefbr
thepragrnaticaspects.Hence,
thisstudy narned thefactorsphonolQgieal
carEzti(lnesty,
lexical-syntactic
carefuiness, andprtigmatic carE:tiiiness respectively,Table2.
Descriptive
Statistics
and theRegultsofthe
Ex/ploratoiy jFkectorAnalysis Descriptivestatistics Pattern!StructmematrixItem
M saSkewness KurtosisFactor 1Factor 2Factor 3Communality
131412151110968421353.573,823.844.083.103.263263.303,963.464,093.553203211.441,501,521,521,561.311.311,311,461.391,531.421.291,46O.24O,15O.18o.oo-O.02O.30O.30O,24O.06022-O.06O.27O.25O.40.O.40-O,53-O,59-O,63-O,78.O.09-O.11-O,16-O,45-O,31-O,65-O.32.O.08-O,28 .87/.80.86/.81.801.80.751.83.70/.75
-.OIL44.OIL49.04L41-.02f,44-.08L40-.05!.49-,10!.58-,11L44.12L49.22f,42 ,70.65.57,67,66
-.03f.44-.Ol!.46-,OIL40,18!.48 .92L86.82/.83.601.6857L65
-.071.59.02f.62,11!.55-,04L50 .74.69.47.44
-,07L44,16L48.04L47-.121.45,06L43-.04/.64-.131,42.09L55.421.75.29L56 .841.77.621.62.591.77.54L77.32L57 .60Al.48.68.37
FactorCorrelationsFactor
2Factor
3
,55,63
,73 Reliahilitya
coesucients
Averagecorrelationcoeencients
.90,64 .84,57 .82.48
Sums ofsquares ofloadings
Proportionofvariance Cumulative
proportionofvariance
3,28 .23 ,23
2.68 .19 .43
2,16
J5
.58
?Vbte.The
factor
analysis was conducted using maximum1ikelihood
estimation method, andPromaxrotation,with thenumber of factors,three,as suggested bytlie
parallelanalysis,and we
judged
thatthismodel was alsotheoreticallythemostplausible.
83
Japan Society of English Language Education
NII-Electronic Library Service
JapanSociety ofEnglish Language Education
The reliability of each
factor
was calculatedwith
Cronbach'sa coethcients and averageinter-correlationcoecacients, The reliabilitycoecacients of allthe
factors
were sucacient, as canbeseen inTable2.
Themodel was thensubmitted teconfirrnatory
factor
analyses. Allthepathstotheobserved
variables were statisticallysignificantatp < .O1
,
and
the
goodnessof
fit
indicesshowed acceptablelevels,lnorder toobserve thedifferencesinthisfactorialstmcture among thethree
groupsof
participants,confirmatory factoranalyses, using thesame model, were condncted by dividingthe
three
groups
(see
Figure3
fbritspath
diagi;am).
Table3 sumrnarizes thecomparison of thegoodness offit indicesamong the
groups.Theindicesshowed almost equal
goodness offitamong the
groups.Also,multiple sample stmcture equation modeling was used todetectthedifferences
arnong thegroups.We testedfourmodels:
(a)
"configural",
ofwhich the
pathsare equal among
thegroups,
fo)
"weak
measurement invariance",of which loadingsare invariant,
(c)
a"strong
measurement inyariance",of which loadingsand interceptsare invariant,and
(d)
another typeoftheformer,ofwhich loadings,intercepts,and means are invariant,As theresults demoilstrate,all of themodels showed a
goodnessof fit,as inTable 4.The fburthmodel, which was under the
strongest constraints,was the
best
model. Hence,itcanbe
safely statedthatat leastthefactorialstructureand itsloadingswere not invariantamong thegroups.Thissuggests thatthescalewhich
the
presentstudy developedcan measure theGC ofvarious levelsoflearners. Table3.
Cbmparison
ofGoociness
of]FVt
indicesamong the Groups(lroup n
f
of
p CFITLI RMSEA SRMR
A!1Junior high HighUniversity
2,098 1,121.27 216 206.98 1,078 567.61 804 537.29
74747474<.Ol<.
Ol<. Ol<. Ol
.94.93.94.92 .92.92.92.91 .08
[.07,
,08].09
[.08,
.11].08
[.07,
,09].09
[.08,
,11].05.05.05.06
Table4.
Sle(mmar;I?
ofMeasurement
invarianceamong theMuttipieSbnrplesModel
f
ctf"pCFI RMSEA BICConfiguralmodel
Weak measurement invariancemodel
(equal
loadings)Strongmeasurement invariancemodel
(equal
loadings+ intercepts)St!!Qng-gigqsuig!ugn!-igya!iaggg-!ugdg!
tr t dl
(equal
loadings+ intercqpts+ means)1,311.87
1,333.l8
1,345.09
-138045
222
2an
266
<.Ol .93
<.Ol .93
<,Ol ,93
Z2tZ2
<,Ol..9.3
.05
.05
.04
=04
90,127.45
89,980.58
89,824.28
.8.N...
981385
NII-Electronic Library Service
.3s/.s4l.s41(・si}-)[iiEiiilll'N
.so!.6s1.6s1(.7o)
.631.671.67/(.66}-!l!Ellll21-.61/.ss/.s7
.391.32!.31/(.33}p
m
.78/.82/.83・4i1・4g1.s4/{.so}-.Il!Eliilll.
・77/・7i/・6s
,63!,6S/,60
.6oi.s7/.64/(・6o)
-b[iiEiiil5]'
,451,52f,S4/(.51)p
a
.74/.S9/.68.391,561.61!{.55)p
a
.781.661.63,231,32f.31/{.3o}-[l!EiiiSl.
.88f.82/.83!
,91/.80/.84l
.161,3s/.3o!C3o) ltemlO
,4Sl,46f,48!C47) ltemll
,74/J3/.
.34!.33/.28/(.31) ltem12
.821.82/.
,331,311,37/(,33} ltem13 .s21.831,
.281,301.3s!C31) ltem14
・85/.84/.
.81/.77/.801(.78)
.351,411.36/{.39) ltemlS
.F7gure 3.Pathdiagramrepreseming themodel in
!
university1
(all)".
N=
2,098.
The results so farprovidedsufficierrt psychologicalscale,foreignLanguage
which
yields thethreefactors:phonological pragmaticcarefulness, Forreference, thedescriptive
Allthescores exhibited anormal distribution.
.65/.S61.53/(.S7)
(.64)
question,with standardized estimates. The
standardized estimates foreach
groupwere shown intheformof
`tiunior
highschool /highschool
empirical evidence forestablishing the new
Gvammatical
Ckerefulness
Sbate
(FLGCS,
hereafter),carefulness, Iexical-syntacticcarefulness, and
statistics of thesummated scale scores are
sumniarized inTal)le5.Phonologicalcarefulness exhibited relatively higherscores thantheothers.
Table5,
DescriptiveSlatistias
ofthe
ShrmmatedSbaleSZroregkMsw Skewness Kurtosis Phonologicalcarefulness
(item
No.1
1
to15)
Lexical-Syntacticcarefulness
(item
No.6to1O)Pragrnaticcarefulness
(item
No.1to5)All
54514
3.883.443.503.62
1.271.111.llO.98
O.14O.22O.14O.13
-O.35O.06-0.35-O.13Nbte,Thesummated scale scores herewere themean scores fortheresponses ofthe itemsfbr
each. Thefactorscores were notused here.n
=
2,098.
Japan Society of English Language Education
NII-Electronic Library Service
JapanSociety ofEnglish Language Education
4.InitialValidatien
For themitialvalidation
procedure,
thepresentstudy furtherexamined thecontentand criterion validity of FLGCS by condncting multiple fbllow-upanalyses. We testedthethree hypothesesbelow
(hypothesis
ItoIII),
H)/pothesis
I:Thecontents ofall theitemsinFLGCS match thetheoreticalconcepts ofeachfactor.Forinstance,itwas hypothesizedthattheitemsfbr
phonologicalcarefulness actually refeT
tothephonologicalaspects of
grarnmaticalperfbrmance inlinguisticterms.
thpothesis
IZIiEachtypeofgrammaticalcarefulness iscorrelated tolearnerbeliefswith a
medium levelof strength. More specifically,
GCs
are correlated to analyticbeliefs
(Tanaka
& Ellis,2003)more strongly thantoexperientialbeliefs.
Hilpothesis
I[l:GCs are correlated totheaccuracy ofa C-test,which issupposed tomeasuregenerallariguageperfbrmance, and thetime which test-takerstaketocomplete thetest.As discussedintheBackgroundsection of thisstudy, GC was considered as a typeof moderator of compromise
pointsinthe
SAT
framework;thus,itisassumed thatsemeone with a higherGC
should exhibithigheraccuracy and lowerspeed inthetask.
4.1
Content
Validity
Ten 1inguisticsexperts voluntarily
participatedinthispartof thestudy, Usingan online
version of
the
questionnaire,we asked theparticipantstoread thequestionnaireitems
(k
=
14)
carefu11y,thentoselectwhich typeof
grammaticalperformancetheitemrefers to,inlinguistic
temis,by choosing from one of fouralternatives:
(aj
phonological,th)
lexical-syntactic,(c)
pragmatic,and
(d)
none ofthem. Itwas not allowedtoskip an item.The result was thatallthe
participantsanswered thatthe itemsNo. 1to5 referred to the
pragrnaticaspects of
grammaticalperfbrrnance,6to1Othe
lexical
and syTitacticaspects,and 11to15thephonologicalaspects. This
providesus with empirical support forthecontent validity of
FLGCS on a certain
point.
4.2CriterionValidity
4.2.1RelationshipwithLearnerBeliefs
Inorder toconfirrn a
partof
the
criterionvalidity(especially
convergent anddiscriminate
validity) of FLGCS, thisstudy investigatedthecorrelation
patternsbetween FLGCS and two types,oflearner
beliefs,
analyticand experientialbeliefs(H/mpothesis
M.
Analyticand experientialbeliefs
(AB
and EB foreach) were established byTanakaand Ellis(2003).
Theft)rmertypereferstolearners'beliefswhich support analytical
types
of learningmethods and theirbenefits,andconsisted of 7 questionnaire
items
(e.g.,
I canlearn
wellby
writing clown evetything inno? notebooe, while theother typesupports experiential ones, with 7questionnaire
items(e.g.,
IcanNII-Electronic Library Service
learnwell Lly
spealdngwith otheKs inEnglish).Allofthe
items
in
the
Japanese-translatedversionare alsoavailable online
(see
Appendix).Theoretically,itcanbe
expected thatGC and beliefsshow some correlations, and GCs are related toanalytic beliefsmorc strongly thanexperiential ones
in
termsoftheirconceptual relevance.The
data
ef thissection was cempared with that of thepreliminarystudy, which included boththeGC questionnaireitemsand thelearnerbelief
items.
[[hus,allthe
participants
(168
answers were used) were
first
yearuniversity students.Firstly,the
descriptive
statisticsand
the
reliabilitycoethcientsfor
each of the'summatedscale scores were calculated
(see
Tahle6),The sarnple showed relatively higherexperientialbeliefs,and a lowerlevelof GCs than theresults ofthe secondary study. Itis
possibletoinferthai
thesubsample hadatendencytosupport experientialbeliefspreferahlyand
be
less
grammatically
carefu1. We
judged
thattherelial)ilityofeach score was sufficient(.73
to.91).Then, acorrelation analysis among thefivesummated scale scores was conducted. Figure4 graphicallysumrnarizes
the
correlation pattem and thedisuibutions
ofthe
scores, We also usedclassical mutti-dimensional scating
(Crvfl)S,
also known asprinciplecooTdinate ana4ysis;
see
Coxon,1982).
CMDS
is
a statisticalmethod tovisualizethesimilarityofvariables. Basedon thecorrelation coefficients matrix, CMDS can
placeeach variable on a two-dimensionalscale.Thus, itcan be interpretedthata
pairof closer variables intheplotmeans thattheyhavea higher
correlation, and more distantvarial)les show lowercorrelations. Figure5 shows theresults of
CMDS.
Table6.
DescriptiveSinttstiasand Reliability
ofthe
Sle{mmatedSbaleSlooresofE[LGC:S
andLearner Beli(:tSk
MsaSkewness
Kurtosis ctPhonologicalcarefulness
(PH)
Lexical-Syntacticcarefulness
(LS)
Pragmaticcarefulness
(P)
Analyticbeliefs
(AB)
Experiemialbeliefs(EB)
545 3.492.983,301.211.331.29O.19 O.54-O,57
O.04O.16O.67
.87,91,8977 4.154.59O.95O,97O,13-O,33 O.46O.52 ,73,74
IVlote.n ==
168.
Theresults ofthe correlationanalysis clearlysupported IlypothesisU;allofthe GSs showed
low tomiddlelevelsof correlation coecacients, butmore specifically theywere more strongly
relatedtoanalytical
beliefs,
PH: r= ,63,with
its
95% CIbeing
[,53,
.71], LS:r==
.51
[.39,
.61], P:r
= .61
[.51,
,70],than
toexperiential ones, PH: r=
,33,with
its
959'6CIbeing
[.19,
,46],LS:r= ,22
[.07,
.36],P:r= .39[.25,
.51].Also,as Figure5presents,allofthe GCs were locatedclosertoanalytic beliefsthantoexperiential ones. [rhislinks
perfectlytotheconceptual relevance among them.
Japan Society of English Language Education
NII-Electronic Library Service
JapanSociety ofEnglishLanguage Education
1 3 57 1 3 57
tn{]
ma
Ei]
tw
[iSl
l・
ieniilistl[IEI][l5i][l21]
'
ff
pm
kifim
[IIEi]
[EIIEI
l'
:-pa
MM
rm
EII
r
papas[iiijllimal
1 3 S 7 I S 5 7 1 3 5 7 jF igure4.Scatter
plotmatrix representing the
correlation coeracients on theupper side,the
histograrns
in
the
middle
columns, and thescatter
pletswith 1inearregressions on the
lowerside.
9-:
g
8
:
R
P,(i]ilg
-1
,O-O,5o,oO.51,O
Figure 5. Plotrepresenting the
distances
between the variables, based on their
correlationcoeMcients matrix.
4.2.2RelationshipwiththePerformanceofa C-test
This section wi11report the results of theexperiment which investigatedtherelationship
betweenGC and languageperfbrmance.We assumed
that
GC as amoderatorin
SAT
will show acorrelation with boththeaccuracy
(score)
and thespeed(time
tocomplete) ofalanguage
test.Thepresentstudy
focused
on theperfbrrrianceofaC-test.
Thenumber ofpanicipants was 77.Allofthe
panicipantswere
first
yearuniversitystudents,
The
participantsoverlapped withthesecondary study. Afterthesecondary study,
they
participated ina C-test(detailed
below)as apartofthe learningactivity of theirEnglishclasses,
in
June,2014,Allthe
participantswere women. We also used thedataabout theirGC, as determined
in
thesecondary study.
TIheC-testwas created
by
theauthors(also
avai1ahle on theauthors' webpage).The text
typewas narrative
(a
letter
toawriter'sfriend),
The lengthofthe textwas 249words, includingsome blankwords. The number of items
folanks)
was 17,which isequal toalmost 7% of thewhole text.The readability scores ofthe text,ignoringthe
blanks,
were 91atFleschReadmg Ease, 2.6atFlesch-KincaidGrade Level;theselevelsare usually regarded tobe easy in・fbreigri languagereading studies.The examiner askedthe
participantsto
fi11
in
theblanks
intheuntimedcondition
(there
was no time1imt),butalso asked theparticipantstoreportthe time when theyhad
completed answering. A digitalcount-up timer was displayedon the monitor of theclassrooms, and theparticipantscould note thetimewhen theyfinishedanswering, using this.
Table7presentsthedescriptive
statisticsand thereliabilityfbrthesmmated scale scores of
GCs,
thescores of theC-test,and thetimetocomplete thetest.Thissubsample may harreshownNII-Electronic Library Service
lowerGCs incornparison tothewhole dataof thesecondary study, The reliahilitycoethcients were acceptable, Figures6and 7summarize thecorrelation
patterns,as intheprevioussection.
Table7.
Description
of
theStimmatedSbaleSboresof
GCs,C-7layt
the
Slrows,
and the7TmetoCbmpletethek M swSkewness Kurtosisa
Phonologicalcarefulness
(PH)
Lexical-Syntacticcarefulness
(LS)
Pragmaticcarefulness
(P)
ScoreTime
tocomplete
(sec)
545 3.182.622.79O,97O.91O.87O.03O,20-O.08O.03-O.14-O.74.81.79.79
17n.a. 5,45491,342.35138.36O.58O.54 O.421.45,63n,a.
IVbte,n =
77.
135 04Slt
[ImuEi61ESIEillEIZI:
i
ilRili
rm
[il2i]
[!!l]
[III6]
F
tw
Eiiill]
[ii[iiN
[illiE]
[ilill
r
iew
ge
ge
[illl
[ilii]
e
!llllll
[kiii]
[iiiiE]
[liillll
IZilill
:
1SS lt34
ZIOEco jFVgure 6.Scatter
plotmatrix representing the distributionsand thecorrelations
between
thevariables fbr
H)/pothesis
ILILg・
g
g-8g8
dy
S re
wy
-O.6-O.4-O.2
O.O O,2 O.4 O,6
jFVgure 7.Plotgraphicallyrepresenting the
results of CmoS.
The results supported I]5/pothesisllI.GCs are correlated toboththescores, PH: r
= .35
[,21,
.48],LS: r= ,41[.28,
.53],P:r
-.38
[.24,
.50],and thetmes,PH: r-.27
[,12,
,41],LS:r= .36
[,22,
,49],P:r=
.31
[.17,
.44],with lowtomiddlelevels
for
the
coedicients. Theresults ofCMDS alsosuggest thatGCs havea correlation with thescore and thetime,with almost thesame magnitudes. Thismeans thatGC linksto
both
theaccuracy and speed of languageperforrnance,
exactly as theframewotkof
SAT
expected,4.3Summary of theInitialVafidation
Our initialvalidation
providedinfbrmatienregarding
both
thecontent and criterion validity of thescale. The sumrriary ofthe
results of our mitialvalidation, using a hypothesistestingprocedure,
is
shownin
Table8.Japan Society of English Language Education
NII-Electronic Library Service
JapanSociety ofEnglish Language Education
Table8.
71JieSIimmary
pfthe
Resultsofthe
initialVbliciationHypotheses Content Results Evidence
I ThecontentsofalltheitemsinFLGCSmatch Supported Properly
judged
by 10thetheoreticalconcepts of each factor. Iinguisticexperts
II GCs are correlated toanalytical beliefsmore SupportedShowed the correlation strongly thantoexperientialbeliefs. patternexactly asexpected
III GCs are correlated to boththescore ofa Supported Showed the correlation
C-testand thetimetofinishthetest, patternexactly asexpected
5.GeneralDiscussion
The results presentedal)oveleadus toconclude
thatthenew
psychologicalscale, FLGCS,
with
its
threefactors(k
=
14),isa statistically reliahle measure. Itsstmctural, content,and criterion validity were also supported by conducting multiple analyses. Funhermore,multiple sample stmctural equation modeling
demonstrated
thatFLGCS showed measurement mvariance amongthe
groups.However,FLGCS has
acouple ofpotential
limitations,
as noted below. Most importantly,regardless of itshighreliability,FLGCS covers only a small area of GC
as aconstmet. As inter-correlationand inter-factorcorrelation coefficientssigriified
quitestrongly, thequestionnaireitemsmay measure very close behaviorsand characteristicsof individuals.This phenomenon iscalled bandwidlrh:fidelity
dilemma.
However, since GC isa new concept, our
preliminaryaim was toestablishareliablescaleattheexpense ofits coverage, inorder toprovide
abasisforfurtherresearch. As intheBackground section and theliteraturereview of the
present
study, therationale forGC underlies
the
concepts of SAT, and thescale was mainlydesigned
tobe
applied inpsycholinguistic
studies,classroom-based studies, and teaching
practice.Needless
to say, lessreliable measures leadtoattenuation
problems,statistically,Hence,we
presumed to
judge
that
amore reliable scale was preferableinthiscase.Obviously,validation isnot a dichotomous
judgment
and fUrthervalidation isalways strongly
desired,
A
partof theevidence whichtheinitialvalidation providedmay cover only a very small range of validity.Futurcstudies should confirm thelinksbetweenGC and othertypes of individualdiffbrences,the developmentof ([}C,and relationships with other types of
grammatical perfbrrnance
(e.g.,
grammaticalityjudgment,
sentence verification,andimprornptu
speech), Additionally,whereas
the
presentstudy was a 1arge-scalesurvey,it
neverdenies
theexistence of sampling errors, Datawithmore varied and 1argersamples wi11also beneeded.
Itshould
be
noted thatthepresentstudy failedto
assess the feasibilityand theconsequential aspects of validity.Itwi11be importarrtto analyze
the
washback efft)cts on leaminglteaching behaviorsoflearnersfteachersin
practice.
NII-Electronic Library Service
6.
Conclusion
Thisstudy
developed
and validated apsychologicalscale tomeasure GC, which isrelated to
irrter-learner
variance on SAT. SAT, a sophisticated frameworkconceming human behaviors,may explain a large
partof languageperfbrmance,and
GC,
as anindividual's
trait,willbe
key
tocapturing
the
dynamics
of numerous varial)lesconcemed withlanguageperforrnance.However,the
importance
ofthisis
notlimited
totheories
ofsecondlfbreign languageacquisition and use.Inteachingpractice,FLGCS can also
provideteachersmnch infbrmationabout theirstudents.
FLGCS will enable ustounderstand students' traits.It
wi11
also
helpteachersspecify what kmd ofgrammaticalcarefulness
(phonological,
lexical-syntactic,andpragrnatic)of a panicularstudent is
(in)suthcient.
The infbrmationwi11contribnte totheeveryday teachingpracticeof Englishbyplayingvarious roles inthework ofteachers. Likewise,FLGCS willbeusefu1,even fbrlearnersto
understand
their
own traits,Thismaypromote
leamers'selfregulated learning. NotesLThe
terrnGC
has
numerous simi1ar terTnssuch as meta-lingtiisticawarenexy, languageawareness, languagesensitivity, and
grammatieal
sensitivity, However, these
generallyrefer
toone's
knowledge,
orcertaintypesoflanguage-related ski11s,which are mainly measured by languagetests.Weintended
torefer toGC only as apsychologicaland behavioraltrait,which
we consider tobefUndamentallyseparate fromlanguage
knowledge
or ski11s,However,
we alsoassumethat
theymaybe
correlatedtoeach other tosome extent.References
Coxon,A, P.M,
(1982).
77)euserlsguidetomultidimensional scaling: Mith
speeialrc!XZirence
to theMDS. London: Heinemann EducationalBooks.
Dennis,I.,& Evans,J.St.B,T.
(1996),
The speed-error trade-offprobleminpsychometric
testing.
British.loumal
ofRsychology,
87,105-129.Goldhammer,F.,& Kroehne,U.
(2014).
Controllingindividuals'time spent on task inspeededperfic}rrnancemeasures: Experimenta1tirne
limits,
posterior
time limits,and response time modeling.Al?plied
Rsycholqgi'calA4easurement,38,255-267,Kartehava,E.,& Ammar, A.
(2014).
Learners'beliefsas mediators ofwhat isnoticed and learnedinthelanguageclassroom. 1:ES()L
euarterly,
48,86-109.Krashen,S,D.
(1978),
Individnalvariation intheuse ofthe monitor. InW. Ritchie(Ed.),
Sticondlanguageacquisition research
(pp.
175-183).New Yotk,NY: AcademicPress,Kormos, J,
(1999).
The effect of speaker variables on theselfcorrectionbehaviour
ofL2
learners.
Srgtem,
27,207-221,Japan Society of English Language Education
NII-Electronic Library Service
Japar ユ Society of English Language Education
Segalowiセ
, N ,
,
& Segalowitz
, S.」.
(1993), Ski11ed
pe曲 mance
,
practice, and
the
differentiation
of speed −up 丘om automa 重ization effbcts :Evidence倉om second language word recognition . ノ
勿 1ガθ
4
勾 6乃o〜inguistics,/4,369−
385
,Seliger
, H.
(1980),
Utterance
planning
and correction behavior:Its ft ction l皿 the
grammar construction
process
ibr second language leamers. H.W . Dechert
,
M . Raupach (Eds.), Tovvarcts
a
cro∬ 伽
g istic a∬essment (
抑 θε
吻 roぬo伽
(pp、
87
−
99
). Frank顛 , Germany: Lang.
Tanaka
, K.
,
& Ellis ,
R .
(2003). Study abroad
,
language
proficiency,
and leamer
beliefs
aboutlanguage
lean血
g. L尻乙τ
ソ
:
ournal
,
25
,
63−85・ Tarone
, E.
(1985).
Variability in interlanguage use :Astudy of style
−
shift血
g in morphology and
syntax. Language Leaming,
35
,373
−404,
van der Linden
,
W , J.
(2007). A hierarchical frarnework for modeling speed and accuracy on test items. Psychometrika
, 73
,
287−308.
Appen 〔hx
Foreign
Language
(]ran 〃natical Carefalness Scale(FLGcs )
Item 1
(P)
ltem 2
(P)
ltem 3
(P)
ltem 4 (P} Item 5
(P)
Item 6
(LS) ltem 8
(LS)
外国 語 を使 うとき,会話の流 れの不自 然さ に
つ いてよ く考 え る
外 国語を使う とき,表現が文脈にあわ ないと
考えこんでしま う 外 国語 を使 うと き,
一
貫してない表 現や曖昧
な表 現にはよく気がつ く
外 国語 を使 うとき, 一
貫していない表現があ
る と考えこんでしま う
外国語 を使うと き,失礼な表現や丁寧過ぎる
表現 が よ く気にな る
外国語を使うと き、語の形の変化の誤りには
よく気がつ く方だ
外 国語 を使 うと き,単 語のつづりが間 違っ て
いるとよく気になる
1tem 9
(LS)
ltem 10
(LS)
Item
11
(PH)
Item t
2
(PH)
Item 13 (PH) item 14
(PH)
ltem 15
(PH)
外国 語を使 うとき,文章の中で間違っ た単 語 があると よく気 がっ く
外 国語 を使 うと き,単語の間 違いに はよく気
づく方 だ
外国 語を使 う とき,発音 が正確 が考 え ること が多 い
外国 語を使うとき,いつ も発音 が正 しいか ど うか気になる
外 国語 を使うとき,発音が正確でない と考え
こんで しまう
外国語を使う と き,発 音が誤っていると気に
なっ て しま うことが多い
外国 語を使 うとき,発音が本当に正確か 確認
することがあ る Note. Item 7 was
deleted
(see
the
section concerning scale development). Note
,
just
for reference , Item7 :
“
外 国語 を使 うと き,文法 規則に合わない表現によく気 がつ く
” .
P
=
pragmatic
carefUlness, LS =
lexical
−
syntactic carefUlless
, PH =
phonological carefUlness. Supplementary data including
(
a
)the
mitial
item
pool,(
b
)theC
−test, and(c
)the final
version
of
the questio皿 aire
used
in
the
present
stUdy
, are
available
at the lilst author
’
swebsite :
hゆs:1/sites.
google.com
/site!kUsanagikt ni!home!
pr(}
jects
/gc
92
N工 工
一