A Bayesian regression approach to the poisson distribution
著者 平館 道子
journal or
publication title
金沢大学経済学部論集 = Economic Review of Kanazawa University
volume 7
number 2
page range 31‑40
year 1987‑03‑30
URL http://hdl.handle.net/2297/24013
ABayesianregressionapproachtothePoissondistribution(Hiradate)
ABayesianregressionapproachto thepoissondistribution
MichikoHiradate
Countd2t2
Modelamdamethodofestimation Numericalexamples
DiSCuggion RefCreyDces
IⅡⅢⅣ
ICountdata
Regressionanalysisofthepoissondistributeddatahasbeendiscussedby
severalauthors([3],[4],[7]).ButNelderandWedderbumfbrmulatedthegeneralizedlmearmodelinl972([5])andsmcethenwecanviewthis prob1emmawiderscopeincludingtheusualnormalregression,Thepoisson
is,theoreticaUyandpracticany,themostimportantdistribution,whenwe analysethecountdata・Butweoftenobservecovariables,orfactors,togQther withresponsevariablemeconomlcorsociologicalstudies,ormotherfields、Inthesecases,multipleregressionanalysiSprovidesanmtereStmgteChniqUe tomterpretedataandobtaininfbnnationaboutanundedyingmechanism6
However,wesomtimesobtainthepOorfitinre図essionanalysis,especiaUyinanalysingthecountdata・Thismaybebecausethecountobservationsare aggregatedoversomefactors,orimportantexplanatolyvariablesarenotob‐
servedorunavaUable、Asaresult,overdiversificationissuspected,
Hereweexammeamethodtoaccomodatesuchunexplainedvariations
-31-
金沢大学経済学部論集第7巻第2号1987.3
duetomisspecificationbymtroducingnormaUyrandompartmtothelinear
regressionmodelfbrthepoissondistribution.ⅡModelandamethodofestim2tiOn
NelderandWedderbumproposethegeneralizedlmearmodelwhichre‐
latesthenaturalparameterorsomefimctionofitofexponentialhlmilydis‐
tributionstoexplanatolyvariables([sDLikelihoodfimctionandlinear
regressionpartaregivenbyf(y18,`)=exp(。[8y-a(8)]}b(。,y)………(1)
and
7(8)=h'β………(2) where8isthenaturalparameter,disthescaleparameter,hisapxlvector ofexplanatolyvariables,andβisapxlvectorofregressioncoefTicientsm cludmgconstantterm・WhenyfbUowsthestandardpoissondistribution,。
=1,a(8)=e0.Contmuousdistributionsofexponentialfamnyareas flexibleastoaccountfbrthevariationmthedatabygivingavarietyofvalues tothescaleparameter,However,discretedistributionshavefixedvalueofd andarenotsoflexible・West([7])discussesthisproblemindetaUfrom theBayesianpomtofview,andproposesscaledexponentialfamnylikelihood asanapproximatesamplingmodelwhichkeepsthescaleparameterfreeand usesthedeviancefimction、ThescaledexponentialfamUylikelihoodhasthe samemean/variancerelationshipthatoforigmalfbrmulation,andreducesto theoriginalN-Wmodelwhend=LIfwekeepdfreeandtlytolearnits behavior,someapproximationisnecessary,becausefactorb(d,y)is usuallydifficulttobemcorporatedintheanalysisofdiscretedata・Sweeting ([6])discussesthisprobleminmoregeneralcontextwithscaleparameter havmgimproperprior・Ourfbnnulationintroducesnormanyrandompart mtotheregressionrelation(2),fbrthepurposeoffindmgthesourcesofsUch variationsandinterpretmgdatamamoreappropriatemodeL
Supposeweobservensamplesofy,andlety=(y1…….,y、).Wesuppose thecomponentsofyfbUowindependentlytheexponentialfhmUydistrip
-32-
ABayesianregresslonapproachtothepoissondistributioH1(Hiradate)
butionsofthesametype、Then,hkelihoodfimctionisgivenby
。(y'@ルなIexp[川y,-3(``D]b(M)|………(3)
where,=(8,,………,0,)isavectorofthenaturalparameters・Wesuppose thefbUowmglinearrelationmsteadof(2).
7(,‘)=h'β+uj・………・………・………(4) Theaddedtennuaccomodateserrorsduetothemisspecificationorthefail‐
ureofobservingsomeimportanthlctors.Inthecaseofthepoissonof
pammeterル,relation(4)mayreduceto8`=logeルーh/β+ui………(5) Letubeavectorofuterm、AsfOrthepriordistributions,wehaVenoconju・
gateanalysisheleexceptthesimplestcase,andweassumeβanduare
mdependentandnormallydistributed,andalsouhavezeromeans・Then jomtpriordistributionisasfbUowsp(β,ulb,B,V)=p(β|b,B)p(ulV)
…p{-÷(β-b)B(β-b)}・exp{-÷uw}……(6)
wherebisavectorofpriormeansofβ,BandVareprecisionmatricesof βandurespectively・ItisnotpracticaltoassumethatplecisionmatrixB
andVareknowncompletely.Anunknownscalefactorisassumedtoexist ineachmatrixandithassomehyperpriordistlibutionNamely,weset B=γBo,V=シVo,whereBo,VoarespeciHed、Unknownfactors7,yfbUowchi-sqUaredistributionsfbrsomefo,do,ko,go,thatis,fo7~Z2ao,
koy~X23。、Thenintegrating(6)withrespectp(γ),p(ソ)gives
p(β,ulhfo,。。,k・’9b)。c[fo+(β-b).B・(β-b)]~苧・[ko+uVou]~且夢上…………(7) Thekemelofposteliordistributionisgivenbymakingproductof(3)and (7),andincorporatingtherelation(4).Posteriormeansofparametersare
-33-
金沢大学経済学部論築第7巻第2号1987.3
notobtaindanalytiCallyinthisfbrmulation,sowetaketheposteriormodesaS convenientestimates6Letp'=(β,,u'),thentheposteriorscorefimction
andinfOrmationfimctionaredefinedrespectivelyby&(刺y俳念Iogp〃側………'8)
and
Myルー為g(畷',小………(9)
Inthecaseofrelation(5),theyare
…作|:雪歴貯。Ⅲ)
whereHispxnmatrixwithcolumnhf,
zisnxlvecterwithelements(yi-a'(h;β+u‘)),i=1..…、’
7(硬)=f・+(蒜B・(β_b)=E(''y川………(u)
7(璽)一意語荒TE(,M)………(',
G(Fly,‘)
-(鰍讐)…B-2M1;隅"Wo-21)
………(lD
whereA(P)isnxnmatrixwithdiagonalelementsa,'(h;β+uj),
and
D戸~:;筈←B伽Ⅲ-b川
D2=ZU2LvounW・
go+nTheposteriormodesareobtainediterativelybyusualNewton‐Raphson method・Let蛭bethevalueofpofi,thiteration,andtheiterationeqUation
isaSfbUowS
酪十,=巫+G(野|y,`)-19(蛭Iy,。)………00
-34-
ABayesianregressionapproachtothepoissondistribution(Hiradate)
mNumericalexamples l・AmodiffedfbrmuIation
Weshowtwonumericalexamplesmthissection・Forconvemence,wemod‐
ifythefbrmulationAssumewehavemgroupsofobservationsonthepoL ssonvariableyandi,thgroupconsistsofrfobservations,andwehavep
explanatoryvariableswithobservationsh,`=(h】`,………,hpf),whereh,`=1
fbralli,i=1,.…,m・Thenwehavempoissonparametersル,i=1,….mand logeルー&・Ojisexpressedbyregressionmodelas(5).Asfbrpriors,wesupposeinsection2βanduhaveindependentnormal distributions,butbetweencomponentsofβandu,theremaybecorre‐
lationsrespectively・Herewesimplyassumethatcomponentsofβandu distributeindependentlyeachotherwithdifferentscaleparameters”,ツガ、
Then,thepriordistributionofβanduis
,(β皿川,)。cJi小xp[-,(β号M1ルトexp[-竺乳
Ashyperpriorsofyi,ハ,weassessthatfo”~Z。.;koy`~Z図。2.Then priordistributionofβanduis
p(β,ulfok…。)"Jil[(昨b臘川。}樂堂[u`圏十k.1-鶚
…=舅y腱小P。‘t・…giv・nby
p(β,uly,h)。C
oxp[魚{s`(h/β+uJ-r,e…}]
xhI(〃M’十fo1樂立[u'十k.1鵯 L蝕加)-f・器M製,k-い…。
耐(シ)=鵲戸i-い…、m
zbemxlvectorwithelements(sf-r:eh/β+u'),
P(F)bepxpmatrixwithdiagonalelements7MP),
-35-
金沢大学経済学部論巣第7巻第2号1987,3
andN(p)bemxmmatrixwithdiagonalelementsy,(〃).
ItfbUowssimUarrelationslike(11)and(12).
…WWI'㈹
・川棚lwwト☆塊削ト☆・)
whereA(P)ispxpmatrixWithdiagonalelementsrjeh…u`,
D,=(p)2(β-b)(β-b)’
D2=NGPP)zuu,.
2Electronicequl1pment●
Jorgenso、([4])discussesthenumberoffailuresofacomplexpieceof electroniceqUipmentusmgregressionanalysisofthepoissondistribution・
Explanatoryvariablesaretimesspentmtwooperatmgreglmesatthecycleof operationTheobservationsareshownmTableLDependentvariableisthe numberoffanuresmthei,thcycle・
Jorgenson,sfbrmulationis
yi=β1t,`+β2t2i,i=1,………,n Altematively,weset
logルーβ・+βltli+β2t2i+U`.
PosteriormodesandestimatesofAarealsoshownmTablel,wherewe takefo,ko,。。,goas5.
-36-
ABayesianregressionapproachtothepoissondistribution(Hiradate)
Tablel
〈、八 八u
T1 T2 y
14.10 8.79 13.63 24.13 27.35 26.88 23.34 18.05 21.66
682190949813121202OG●●●●●●■000000000
一一 一一
33.3 52.2 64.7 137.0 125.9 116.9 131.7 85.0 91.,
345566635●●●、■●●●●542073677213295584 59447738211222212
β,=0.015,八
β・=1.135,八 β2=0.006八
3Quine,ssociolOgicaldata
Aitken([1DanalysesdataofasociologicalstudyaboutAustralianschool childrenAresponsevariableisthenumberofdaysabsentfromschool dulingschoolyear・Childrenweresampledbyfburfactors,Le・age,sex,
culturalbackgmundandleamingability・Aitkenanalysesthedatamtennsof theanalysisofvariance,andseekstofIndthemmimaladequatemodeLHe
obtainssixfinalmmimalmodelsandshowsfittedvaluesoneachmodeLFit
seemstobenotsogood、Wesupposetheresponsevariableispoissonvariate,
andthelogarithmofparameterhasaregressionpartanderrorpart,Unfbrtu‐
nately,wecannotrefertoQuine,sorigmaldatabutonlyAitken,ssummarized data(samplemeans)withsamplesizes、Dataareshownintable2、Under
abovemodiiiedfbrmulation,Aisareestimatedandhttedtosamplemeans、
ExplanatoryvariablesareasfbUows,
C:1=Aborigina1,2=white,
S:1=Female,2=Male,
A:1=Primary,2=Firstfbnn,3=Secondfbnn,4=Thirdfbrm,
L:1=slow,2=averageOeamingability).
-37-
金沢大学経済学部鶴築第7巻第2号1987.3
WetryalsothecasewithexplanatoryvariablesincludingSxA,CxAwithout u-term,fbrcomparison・Aitkenobtamsthebestfitmthiscaseunderhis fbImulation・Theresultisshownintable2・Herewetakefo,ko,do,gbas
alntable2,ハisestimatefbrourfbrmulationandズロisestimatefbi
model(SA,CA)Withoutu.
Table2
Bへ、几
八
八、八
八uCSALyr
1111111111111122222222222222 ’111111222222211111112222222 ’’22334112233411223341122334 l212l22121212212l2l221212l22 00000940500806034034800002000005021026430503153120500205●●●●●●●DC●●●●●●由●●●90■●●●□●C93907773l2l624056399758616l3l1322221313222111 35324.77140581936723771416910111 179171983358573055979481865570524267320048122940090601139905730805402564290229300335●O●●●■0●●●●CO●□■0●●●●。●●の●●●829067731215349563997386162311322221312222111 0836851092212438597136346482399685430433216855238819900779431843292014840808600939390310064886095224598340750782●□●●●●0●●CDU●●●□■、、DC●●●●●●■0000100000001010000001000010
一一 一一 一一一 一一一一一
46261405265793533071733199513900855336597055450807787889●●■DP●●●●●●●●●●●●●●●●c●●●●●⑪4016034161616786298302929299112132321212111112111八
J8゜=1.804,人 βA=0.2321,
βt=-00191,ノas=0.0672八八
A=0.0369
-38-
ABayesianregressionapprOachtothepoissondistribution(Hiradate)
wDiscussiOn
ltisnotaeasytasktointerpretetheestimatedutermsappropriately,but
wemayexamineinausualanalysisofresidualsmanner・Asmentionedabove,wecamotrefertotheoriginalOuine,sdata,soweestimateハsandexamine
thefittosamplemeansonly・Thenwedonothavetheproblemofoverb diversification・However,ifwetrytofittotheorigmalindividualobser‐
vations,extravariationmayarise・Thisproblemmaybeapproachedeitherby thecoumpounddistributionwhichisamixturewithrespecttosomedistri‐
butionfbrtheparameter,orbythemethodthatWestproposes,whichkeeps scaleparameterunristrictedanduseshisapproximatelikelihoodBut,ifwe canconsiderextravariationsasconseqUencesofthemisspecification,u-term mayconveysomeinfbnnationaboutthekindofmisspecification,or,missing
eXplanatoryvariables・Thatsomeimportantexplanatoryvariablesareover‐lookedmaymean,mthisexample,observationsofdependentvariablewhich belongtodifferentdistributionsareaggregatedmtothesamedistribution,
andthengiverisetotheextraVariations・Anadhocmethodtodealwiththis problemmaybeto厚CupObservationsaccordmgtouvalues,Butsome
fbrmalprocedurefbrexploitingthemfbrmationthatuconveysisnecessaly・
FmaUy,asfbrthepriordistributionofu-telm,somesmoothpriormight
beused,whichisrelatedtovaluesoftheexplanatryvaIiablesmthesenseof expressmgthejudgementthatthechangesofvaluesofuarenotsolarge amongyhavmgsimilarvaluesofexplanatolyvariables,asinBhghtandOtt ([2]).RefCrence8
LAitken,M、A、1178.T11eanalysisofunbalancedcross-cl2IssifHcations(withdis・
cussion).』.R、SSA141,195--223.
2.BnghtbBJ.N、,Ott,L、1975.ABayesianapproachtomodelinadequacymr po1ynomialregressionBiometrika62,79-88.
3.Frome,E、L,Kutner,M、H、,Beauchamp,J、1973.Regressionanalysisofpoisson‐
-39-
金沢大学経済学部論巣第7巻第2号1987.3
distrmuteddata.』.A、S、A、68,,0.334,935-940.
4.Jorgenson,,.W・l96LMultipleregressionanalysisofapoissonpIocess.』.A、S、A、
235-245.
5.Nelder,』.A、、WedderbumDR.W、M、1972.Genera1izedLinearModels.J、R、S、S、
A・No.135,370-384.
6.Sweeting,T・l98LScaleParameters;aBayesianTIeatment.』.R,S、S、B、43 No.3,333-338
7.West,M、1985.GeneranzedUnearmodels、Scaleparameters,OutUerAccomo‐
dationandPrioldistributionaBayesianstatistic82,J.M・Bemardo,M、H、A・DeGroot,
nV・Lindley,A、F、M、Smith(Eds.)North-HoUand,531‐-558.
-40-