Measuring Olympic achievements in DEA: a data fitting technique subject to downside-deviation restrictions
01206313
東京理科大学 関谷 和之
SEKITANI Kazuyuki05000602
東京理科大学
*趙 宇
ZHAO Yu1. Introduction
Japan has become an Olympic setting since the Summer Olympic Games were first staged in Asia in 1964. After Tokyo was selected as the host city of the 2020 Summer Olympics, the Japanese Olympic Committee (JOC) set a target of a record-high 30 gold medals. However, due to the developing global situation in light of the COVID-19 pandemic, the International Olympic Committee (IOC) and JOC announced that the Tokyo 2020 Summer Olympic games are resched- uled to a date beyond 2020 but not later than summer 2021. As the upcoming Olympic Games gets closer, interest in predicting and measuring the Olympic achievements is rising. To investi- gate and predict the performance of the upcom- ing Summer Olympics, the proposed model of this study incorporates a data fitting technique of medal prediction using ordinary least squares regression (OLS) in input multiplier restrictions of the conventional DEA model.
2. Downside-deviation least squares Suppose that there arenDMUs (j= 1, . . . , n) to be evaluated. Three output variables are con- sidered: the number of gold (y1j), silver (y2j), and bronze medals (y3j) achieved in the previ- ous Olympic games. Regarding the inputs, we adopt the GDP (x1j) and the population (x2j) recorded in the latest year. The DEA technology converts an input vector xj = (x1j, x2j, x3j)⊤ to an output vector yj = (y1j, y2j, y3j)⊤, thereby satisfying xj ≥0,xj ̸=0 and yj ≥0,yj ̸=0 for all j= 1, ..., n.
Let e be a column vector with all elements unity. The OLS estimator for predicting the total number of medalse⊤yj of DMUj (j∈ {1, ..., n})
is formulated as the following optimization prob- lem:
(OLS: Ordinary least squares) min
Xn j=1
d2j
s.t. dj =v⊤xj+ν−e⊤yj (j= 1, . . . , n).
Here, the estimation (v∗, ν∗) presents the hy- perplane passing the barycenter of n DMUs,
1 n
Pn
j=1xj,Pn
j=1e⊤yj
.
Since an upper bound v⊤xk+ν of e⊤yk im- plies v⊤xk+ν ≥ e⊤yk, we add n nonnegative constraints, d1 ≥ 0, . . . , dn ≥ 0, into the above OLS problem, which is formulated as
(DLS: Downside-deviation least squares) min
Xn j=1
d2j
s.t. 0≤dj =v⊤xj+ν−e⊤yj (j= 1, . . . , n) v≥0.
Let ( ˆd,v,ˆ ˆν) be an optimal solution to the DLS problem. Thus, ˆv⊤xk+ ˆν is one of the upper bounds of e⊤yk, and ˆdk is a downside deviation of e⊤yk. Let
a:=
Xn j=1
e⊤yj− ˆ
v⊤xj+ ˆν, (1) then we have
a= Xn j=1
dˆj.
Theorem 1. Let ( ˆd,v,ˆ ν)ˆ be an optimal solution to the DLS problem. The upper bound vˆ⊤xj + ˆν of e⊤yj is independent of the choice of optimal solutions to the DLS problem. The same is true of the equation (1).
2-D-4
日本オペレーションズ・リサーチ学会2021年 春季研究発表会
3. A multiplier DEA model with downside-deviation restrictions We define
V =
(v, ν)
Pn
j=1 v⊤xj +ν−e⊤yj
≤a e⊤yj ≤v⊤xj+ν, ∀j
v≥0
as a set of input multiplier (v, ν). Here,v⊤xk+ ν is referred to as a substantial medal total of DMUk for some (v, ν)∈V.
To evaluate the efficiency of Olympic achieve- ments based on the substantial medal totals, we propose the following DEA model (LPθ) :
θk= max. u⊤yk v⊤xk+ν s.t. u⊤yj−
v⊤xj+ν
≤0, ∀j
e⊤yj−
v⊤xj+ν
≤0, ∀j Xn
j=1
v⊤xj+ν−e⊤yj
≤a
R⊤u≥0,u≥0,v ≥0,
where R=
1 0 0
−P 1 0 0 −Q 1
. The notationsP
and Q represent the lower bounds of the ratios u1/u2 and u2/u3, respectively. If θk = 1, DMUk is efficient. Otherwise, DMUk is inefficient.
The efficiency of Olympic achievements for DMUk in model (LPθ) can be decomposed as
u⊤yk
v⊤xk+ν = e⊤yk
v⊤xk+ν ·u⊤yk
e⊤yk. Here, the first component, e⊤yk/ v⊤xk+ν
is referred to asthe achievement ratio of substantial medal total. The second component u⊤yk/e⊤yk
is termed as theunit value index of medals.
4. Examining the target of 30 gold medals in Tokyo 2020
The input vector of Japan (xJ) in Rio 2016 is used for predicting the total number of gold medals in Tokyo 2020, which is xJ =
(xJ1, xJ2, xJ3)⊤ = (32477.2,126958472,41)⊤. For the input vector xJ, we derive the output possibility set from the dual problem of (LPθ), which is
P(xJ) =
y
y≤Pn
j=1λjyj−Rd xJ ≥Pn
j=1(λj+µj)xj
Pn
j=1(λj +µj) = 1
−aµj ≤Pn
h=1µh(e⊤yh) d≥0, λ≥0, µ:free
.
Provided that the target of 30 gold medals in Tokyo 2020 is feasible, the optimal value of
max{y1|y∈ P(xJ), y is nonnegative} (2) is equal or more than 30. However, the optimal value of the problem (2) is 26, which implies the target of 30 gold medals in Tokyo 2020 is infea- sible even if we investigate all possible combina- tions of medals within P(xJ).
How many medals will Japan win if 26 gold medals are actually achieved? To answer this question from a practical point of view, it is necessary characterize the possible patterns of medal won based on the patterns of past Olympic games. Consider the following linear combina- tion:
YJ = ( 4
X
t=1
γtyJt
γ1≥0,· · · , γ4 ≥0 )
, whereγ1, ..., γ4 are scalars andy1J, . . . ,y4J are in- teger vectors of medals that Japan won in the past four Olympic games. Assume the medals Japan will achieve in Tokyo 2020 belong to YJ. Therefore, if Japan bags 26 gold medals in Tokyo 2020, the maximal number of medal totals will be the optimal solution of the following problem:
max ( 3
X
r=1
yr
y∈ P(xJ), yis nonnegative )
. (3) We found that the optimal value of the problem (3) is a total of 63, and y∗ = (y∗1, y∗2, y∗3)⊤ = (26,15,33)⊤. Thus, Japan will win 63 medals at most, which consists of 26 gold, 15 silver, and 22 bronze medals.