Lecture 3: Optimization
Advanced Microeconomics I
Yosuke YASUDA
National Graduate Institute for Policy Studies
October 10, 2013
What and How to Optimize?
Optimization is a set of mathematical procedures to find the optimal value of some function.
We frequently adopt the assumption that an economic agent seeks to maximize or minimize some function, for example:
◮ Consumer maximizes her utility function.
◮ Firm minimizes its cost function.
◮ Seller at the auction maximizes (expected) revenue function.
◮ Government tries to maximize the social welfare function.
We study (and apply) static optimization.
→ Dynamic optimization, a common tool in modern macroeconomics, will not be covered in our lectures...
Equality Constraints
Consider choosing x1 and x2 to maximize f(x1, x2), when x1 and x2 must satisfy some particular relation to each other that we write in implicit form as g(x1, x2) = 0.
Formally, we write this problem as follows:
max_{x1, x2} f(x1, x2) subject to g(x1, x2) = 0.
◮ f(x1, x2): objective function.
◮ x1 and x2: choice variables.
◮ g(x1, x2): constraint.
◮ The set of all (x1, x2) that satisfy the constraint: feasible set.
Q Does a solution always exist?
Continuity (1)
Intuitively, a function is continuous if a “small movement” in the domain does not cause a “big jump” in the range.
Def A function f : D (⊂ Rn) → R is called
1. continuous at a point x0 if, for all ε > 0, there exists δ > 0 such that d(x, x0) < δ implies that d(f(x), f(x0)) < ε.
2. continuous if it is continuous at every point in its domain.
3. uniformly continuous if, for all ε > 0, there exists δ > 0 such that, for all x, x0 ∈ D, d(x, x0) < δ implies that d(f(x), f(x0)) < ε.
Rm If a function f is uniformly continuous, then it must be continuous: the same δ works at every point, in particular at each one. However, the converse is not true; for example, f(x) = 1/x is continuous but not uniformly continuous on (0, 1).
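To see the difference concretely, the following sketch (a hypothetical numerical illustration, not from the text) looks at f(x) = x² on R: with a fixed ε and a fixed step size, the variation of f over that step grows with x0, so no single δ can serve every point, i.e., f is continuous but not uniformly continuous on R.

```python
# Hypothetical illustration: f(x) = x**2 is continuous on R but not
# uniformly continuous, because the delta needed for a given epsilon
# shrinks as |x0| grows.

def f(x):
    return x * x

eps = 0.1   # target bound on |f(x) - f(x0)|
h = 0.005   # a fixed candidate step |x - x0|

# Near x0 = 1 the fixed step keeps f within eps...
gap_small = abs(f(1 + h) - f(1))        # = 2*1*h + h**2, well below eps
# ...but near x0 = 100 the same step overshoots eps.
gap_large = abs(f(100 + h) - f(100))    # = 2*100*h + h**2, above eps

print(gap_small < eps, gap_large > eps)
```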
Continuity (2)
Now we consider more general functions.
Suppose D ⊂ Rm and the range is R = Rn, i.e., f : D (⊂ Rm) → Rn.
◮ The image of f is the set of points in the range into which some point in the domain is mapped:
I := {y|y = f (x) for some x ∈ D} ⊂ R
◮ The inverse image of a set of points S ⊂ I is defined as: f−1(S) := {x|x ∈ D, f (x) ∈ S}
Def A1.9 Let D be a subset of Rm, and let f : D → Rn. Then f is continuous at a point x0 if, for all ε > 0, there exists δ > 0 such that
f(Bδ(x0) ∩ D) ⊂ Bε(f (x0)).
Existence of Solutions
Compact Set
◮ A set S in Rn is called bounded if there exists some ε > 0 such that S ⊂ Bε(x) for some x ∈ Rn.
◮ A set S in Rn is called compact if it is closed and bounded.
Thm A1.10 (Weierstrass) Existence of Extreme Values
Let f : S → R be a continuous real-valued function, where S is a non-empty compact subset of Rn. Then f attains its maximum and minimum values on S. That is, there exist vectors x* and x̃ in S such that
f(x̃) ≤ f(x) ≤ f(x*) for all x ∈ S.
◮ Fg Figure A1.18 (see JR, pp. 522)
◮ Most problems in economics have compact domains and continuous objective functions. ⇒ Solutions guaranteed!
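The Weierstrass theorem is an existence result; a crude way to see it at work (a sketch with a made-up objective function, not from the text) is a brute-force search over a fine grid, which is feasible precisely because the domain is closed and bounded:

```python
# Sketch: approximate the extreme values of a continuous function on the
# compact set S = [0, 1] by grid search. The example function x*(1 - x)
# is hypothetical, chosen only for illustration.

def f(x):
    return x * (1 - x)   # continuous on the compact set [0, 1]

n = 1000
grid = [i / n for i in range(n + 1)]
values = [f(x) for x in grid]

f_max = max(values)   # attained at an interior point, x = 0.5
f_min = min(values)   # attained at the boundary, x = 0 and x = 1
print(f_min, f_max)
```

On an open or unbounded domain no such guarantee holds: x*(1 - x) on (0, 1) never attains the value 0.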
Partial Derivative
Def Let {hk} be an (infinite) sequence of real numbers converging to 0, and let f : S (⊂ Rn) → R be a continuous function. Then, we say that f is differentiable with respect to xi at x if

lim_{k→∞} [f(x1, ..., xi + hk, ..., xn) − f(x1, ..., xi, ..., xn)] / hk

always exists and does not depend on the choice of {hk}. We call this limit the partial derivative with respect to xi, denoted by ∂f(x)/∂xi or fi(x). Note that the partial derivative itself is a function of x.
◮ If f is differentiable with respect to every element xi, i = 1, 2, ..., n, at x, then we say that f is differentiable at x.
◮ If f is differentiable at all x ∈ S, then we say that f is a differentiable function.
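The limit in the definition can be checked numerically with a small step h standing in for the tail of {hk}; the example function below is hypothetical, chosen so the analytic partial derivatives are easy to verify.

```python
# Sketch: approximate the partial derivatives of f(x1, x2) = x1**2 * x2
# by the difference quotient from the definition.

def f(x1, x2):
    return x1 ** 2 * x2

def partial(f, i, x, h=1e-6):
    """Difference quotient [f(..., xi + h, ...) - f(x)] / h."""
    bumped = list(x)
    bumped[i] += h
    return (f(*bumped) - f(*x)) / h

x = (2.0, 3.0)
d1 = partial(f, 0, x)   # analytic value: 2*x1*x2 = 12
d2 = partial(f, 1, x)   # analytic value: x1**2   = 4
print(d1, d2)
```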
Total Derivative
Def For a differentiable function f, the total differential is defined as follows:

df(x) = (∂f(x)/∂x1) dx1 + (∂f(x)/∂x2) dx2 + · · · + (∂f(x)/∂xn) dxn.
◮ If f is a differentiable function and fi(x) is a continuous function for all i, then we say f is continuously differentiable.
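The total differential approximates the change in f for a small simultaneous move in all coordinates. This sketch (with the same kind of hypothetical example function) compares df with the exact change:

```python
# Sketch: for f(x1, x2) = x1**2 * x2, compare the total differential
# df = f1*dx1 + f2*dx2 with the exact change f(x + dx) - f(x).

def f(x1, x2):
    return x1 ** 2 * x2

x1, x2 = 2.0, 3.0
dx1, dx2 = 1e-4, -2e-4

f1 = 2 * x1 * x2    # partial derivative ∂f/∂x1 = 12
f2 = x1 ** 2        # partial derivative ∂f/∂x2 = 4

df = f1 * dx1 + f2 * dx2                      # linear approximation
actual = f(x1 + dx1, x2 + dx2) - f(x1, x2)    # exact change

print(df, actual)   # the two agree up to second-order terms
```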
Lagrange’s Method (1)
Lagrange’s method is a powerful way to solve (equality) constrained optimization problems, which essentially translates them into unconstrained problems.
Again, consider the following problem: maxx1,x2f(x1, x2) subject to g(x1, x2) = 0.
Let us construct a new function L(x1, x2, λ), called the Lagrangian function as follows:
L(x1, x2, λ) = f (x1, x2) + λg(x1, x2).
Lagrange’s Method (2)
Then maximize this Lagrangian function; that is, derive the first-order conditions:

∂L/∂x1 = ∂f(x1*, x2*)/∂x1 + λ* ∂g(x1*, x2*)/∂x1 = 0
∂L/∂x2 = ∂f(x1*, x2*)/∂x2 + λ* ∂g(x1*, x2*)/∂x2 = 0
∂L/∂λ = g(x1*, x2*) = 0.
◮ Lagrange’s method asserts that if we find values x1*, x2*, and λ* that solve these three equations simultaneously, then we will have a critical point of f along the constraint g(x1, x2) = 0.
Practice of Lagrange’s Method
Ex Example A2.11 (see JR, pp. 508)

max_{x1, x2} x1x2 subject to a − 2x1 − 4x2 = 0

Forming the Lagrangian, we get

L = x1x2 + λ[a − 2x1 − 4x2],

with first-order conditions:

∂L/∂x1 = x2 − 2λ = 0
∂L/∂x2 = x1 − 4λ = 0
∂L/∂λ = a − 2x1 − 4x2 = 0.

These can be solved to find

x1 = a/4, x2 = a/8, λ = a/16.
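The solution can be verified by plugging it back into the three first-order conditions; a quick numerical check for one particular parameter value (a = 8, chosen arbitrarily):

```python
# Sketch: verify the closed-form solution x1 = a/4, x2 = a/8, lam = a/16
# of Example A2.11 by checking all three first-order conditions at a = 8.

a = 8.0
x1, x2, lam = a / 4, a / 8, a / 16

foc1 = x2 - 2 * lam           # ∂L/∂x1 = 0
foc2 = x1 - 4 * lam           # ∂L/∂x2 = 0
foc3 = a - 2 * x1 - 4 * x2    # ∂L/∂λ  = 0 (the constraint)

print(foc1, foc2, foc3)   # all three vanish at the solution
```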
Envelope Theorem (1)
Consider the following constrained optimization problem P1:

P1 : max_x f(x, a) s.t. g(x, a) = 0,

where x is a vector of choice variables, and a := (a1, ..., am) is a vector of parameters that may enter the objective function, the constraint, or both. Suppose that for each vector a, the solution is unique and denoted by x(a).
◮ A maximum-value function, denoted by M(a), is defined as follows:

M(a) := max_x f(x, a) s.t. g(x, a) = 0,

or equivalently, M(a) := f(x(a), a).
Envelope Theorem (2)
If the objective function, constraint, and the solutions are differentiable in the parameters, there is a very powerful theorem that shows how the solutions vary with the parameters.
Thm Envelope Theorem
Consider P1 and suppose the objective function and constraint are continuously differentiable in a. For each a, let x(a) ≫ 0 uniquely solve P1 and assume that it is also continuously differentiable in the parameters a. Then, the Envelope theorem states that

∂M(a)/∂aj = ∂L/∂aj |_{x(a), λ(a)},  j = 1, ..., m,

where the right-hand side denotes the partial derivative of the Lagrangian function with respect to the parameter aj, evaluated at the point (x(a), λ(a)).
Practice of Envelope Theorem
Ex Example A2.11 (again)

max_{x1, x2} x1x2 s.t. a − 2x1 − 4x2 = 0.

We form the maximum-value function by substituting the solutions for x1 and x2 into the objective function. Thus,

M(a) = x1(a)x2(a) = (a/4) · (a/8) = a²/32.

Differentiating M(a) with respect to a, we get

dM(a)/da = a/16,

which tells us how the maximized value varies with a. Applying the Envelope theorem, we directly obtain

dM(a)/da = ∂L/∂a |_{x(a), λ(a)} = λ(a).

Since λ(a) = a/16, we have verified that the Envelope theorem works.
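The theorem can also be confirmed numerically: differentiate M(a) by central differences and compare with the multiplier λ(a) = a/16 (a = 8 is an arbitrary test value):

```python
# Sketch: check the Envelope theorem for Example A2.11 numerically.
# M(a) = f(x1(a), x2(a)) with x1(a) = a/4 and x2(a) = a/8, and the
# theorem predicts dM/da = lambda(a) = a/16.

def M(a):
    return (a / 4) * (a / 8)   # maximum-value function, = a**2 / 32

a, h = 8.0, 1e-5
dM = (M(a + h) - M(a - h)) / (2 * h)   # central-difference derivative
lam = a / 16                            # multiplier at the optimum

print(dM, lam)   # the two coincide, as the theorem predicts
```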
Intuitions of Lagrange’s Method (1)
Q Why does Lagrange’s method work?

Take the total differential of the Lagrangian:

dL = (∂L/∂x1) dx1 + (∂L/∂x2) dx2 + (∂L/∂λ) dλ.

When (x1, x2, λ) = (x1*, x2*, λ*), it can be re-written as follows:

dL = [∂f(x1*, x2*)/∂x1 + λ* ∂g(x1*, x2*)/∂x1] dx1 + [∂f(x1*, x2*)/∂x2 + λ* ∂g(x1*, x2*)/∂x2] dx2 + g(x1*, x2*) dλ = 0.

Since g(x1*, x2*) = 0,

0 = dL = (∂f(x1*, x2*)/∂x1) dx1 + (∂f(x1*, x2*)/∂x2) dx2 + λ* [(∂g(x1*, x2*)/∂x1) dx1 + (∂g(x1*, x2*)/∂x2) dx2].
Intuitions of Lagrange’s Method (2)
Note that, for moves along the constraint,

dg = (∂g(x1*, x2*)/∂x1) dx1 + (∂g(x1*, x2*)/∂x2) dx2 = 0.

So, we can show that

dL = (∂f(x1*, x2*)/∂x1) dx1 + (∂f(x1*, x2*)/∂x2) dx2 = 0

for all dx1 and dx2 that satisfy the constraint g. Thus, (x1*, x2*) is indeed a critical point of f given that the variables must satisfy the constraint.
Lagrange’s method is very clever and useful. In effect, it offers us an algorithm for identifying the constrained optima in a wide class of practical problems.
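The argument says that, at (x1*, x2*), moves (dx1, dx2) that keep g = 0 leave f unchanged to first order. For Example A2.11 with a = 8 the constraint is linear, so we can move exactly along it and watch f change only at second order (a numerical sketch):

```python
# Sketch: at the optimum (x1*, x2*) = (2, 1) of Example A2.11 with a = 8,
# move along the constraint a - 2*x1 - 4*x2 = 0 (direction (2, -1) keeps
# it satisfied exactly) and observe that f = x1*x2 changes only at
# second order in the step size t.

a = 8.0
x1s, x2s = 2.0, 1.0   # the solution x1 = a/4, x2 = a/8

def f(x1, x2):
    return x1 * x2

t = 1e-3
x1, x2 = x1s + 2 * t, x2s - t          # a move along the constraint
on_constraint = a - 2 * x1 - 4 * x2    # remains 0

delta_f = f(x1, x2) - f(x1s, x2s)      # = -2*t**2: no first-order term
print(on_constraint, delta_f)
```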