Example 5: Multinomial logit model:
The ith individual has m + 1 choices, i.e., j = 0 , 1 , · · · , m.
P(y
i= j) = exp(X
iβ
j)
∑
mj=0
exp(X
iβ
j) ≡ P
i j,
for β
0= 0. The case of m = 1 corresponds to the bivariate logit model (binary choice).
Note that
log P
i jP
i0= X
iβ
jThe log-likelihood function is:
log L( β
1, · · · , β
m) =
∑
n i=1∑
m j=0d
i jln P
i j,
where d
i j= 1 when the ith individual chooses jth choice, and d
i j= 0 otherwise.
Example 6: Nested logit model:
(i) In the 1st step, choose YES or NO. Each probability is P
Yand P
N= 1 − P
Y. (ii) Stop if NO is chosen in the 1st step. Go to the next if YES is chosen in the 1st step.
(iii) In the 2nd step, choose A or B if YES is chosen in the 1st step. Each probability is P
A|Yand P
B|Y.
For simplicity, usually we assume the logistic distribution.
So, we call the nested logit model.
The probability that the ith individual chooses NO is:
P
N,i= 1 1 + exp(X
iβ ) .
The probability that the ith individual chooses YES and A is:
P
A|Y,iP
Y,i= P
A|Y,i(1 − P
N,i) = exp(Z
iα ) 1 + exp(Z
iα )
exp(X
iβ )
1 + exp(X
iβ ) .
The probability that the ith individual chooses YES and B is:
P
B|Y,iP
Y,i= (1 − P
A|Y,i)(1 − P
N,i) = 1 1 + exp(Z
iα )
exp(X
iβ ) 1 + exp(X
iβ ) . In the 1st step, decide if the ith individual buys a car or not.
In the 2nd step, choose A or B.
X
iincludes annual income, distance from the nearest station, and so on.
Z
iare speed, fuel-e ffi ciency, car company, color, and so on.
The likelihood function is:
L( α, β ) =
∏
n i=1P
IN1i,i(
((1 − P
N,i)P
A|Y,i)
I2i((1 − P
N,i)(1 − P
A|Y,i))
1−I2i)
1−I1i=
∏
n i=1P
IN1i,i(1 − P
N,i)
1−I1i(
P
IA2i|Y,i(1 − P
A|Y,i)
1−I2i)
1−I1i,
where
I
1i=
1 , if the ith individual decides not to buy a car in the 1st step, 0 , if the ith individual decides to buy a car in the 1st step, I
2i=
1 , if the ith individual chooses A in the 2nd step,
0 , if the ith individual chooses B in the 2nd step,
Remember that E(y
i) = F(X
iβ
∗), where β
∗= β σ . Therefore, size of β
∗does not mean anything.
The marginal e ff ect is given by:
∂ E(y
i)
∂ X
i= f (X
iβ
∗) β
∗.
Thus, the marginal e ff ect depends on the height of the density function f (X
iβ
∗).
2.2 Limited Dependent Variable Model (
制限従属変数モデル)
Truncated Regression Model: Consider the following model:
y
i= X
iβ + u
i, u
i∼ N(0 , σ
2) when y
i> a, where a is a constant, for i = 1 , 2 , · · · , n.
Consider the case of y
i> a (i.e., in the case of y
i≤ a, y
iis not observed).
E(u
i| X
iβ + u
i> a) =
∫
∞a−Xiβ
u
if (u
i)
1 − F(a − X
iβ ) du
i. Suppose that u
i∼ N(0 , σ
2), i.e., u
iσ ∼ N(0 , 1).
Using the following standard normal density and distribution functions:
φ (x) = (2 π )
−1/2exp( − 1 2 x
2) , Φ (x) =
∫
x−∞
(2 π )
−1/2exp( − 1
2 z
2)dz =
∫
x−∞