Lecture 3: Optimization
Advanced Microeconomics I
Yosuke YASUDA
Osaka University, Department of Economics yasuda@econ.osaka-u.ac.jp
October 14, 2014
What and How to Optimize?
Optimization is a set of mathematical procedures to find the optimal value of some function.
We frequently adopt the assumption that an economic agent seeks to maximize or minimize some function, for example:
A consumer maximizes her utility function.
A firm minimizes its cost function.
A seller at an auction maximizes her (expected) revenue function.
The government tries to maximize a social welfare function.
We study (and apply) static optimization.
→ Dynamic optimization, a common tool in modern macroeconomics, will not be covered in our lectures...
Equality Constraints
Consider choosing x1 and x2 to maximize f(x1, x2), when x1 and x2 must satisfy some particular relation to each other, which we write in implicit form as g(x1, x2) = 0.
Formally, we write this problem as follows:

max_{x1,x2} f(x1, x2) subject to g(x1, x2) = 0.
f(x1, x2): objective function. x1 and x2: choice variables. g(x1, x2): constraint.
The set of all (x1, x2) that satisfy the constraint: feasible set.
Q: Does a solution always exist?
Continuity (1)
Intuitively, a function is continuous if a “small movement” in the domain does not cause a “big jump” in the range.
Definition 1
A function f : D (⊂ Rn) → R is called
1 continuous at a point x0 if, for all ε > 0, there exists δ > 0 such that d(x, x0) < δ implies that d(f(x), f(x0)) < ε.
2 continuous if it is continuous at every point in its domain.
3 uniformly continuous if, for all ε > 0, there exists δ > 0 such that for all x, x0 ∈ D, d(x, x0) < δ implies that d(f(x), f(x0)) < ε.
Rm: If a function f is uniformly continuous, then it must be continuous. However, the converse is not true.
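A standard counterexample (not stated in the slides) shows why the converse fails:

```latex
% f(x) = 1/x is continuous on D = (0,1) but NOT uniformly continuous.
% Fix any \delta > 0 and take x_0 = \delta, x = \delta/2; then
% d(x, x_0) = \delta/2 < \delta, yet
% d(f(x), f(x_0)) = \left| \tfrac{2}{\delta} - \tfrac{1}{\delta} \right| = \tfrac{1}{\delta},
% which is arbitrarily large for small \delta. Hence, for a given
% \varepsilon, no single \delta works at every point of the domain:
% the required \delta shrinks as x_0 approaches 0.
f(x) = \frac{1}{x}, \qquad D = (0, 1)
```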
Continuity (2)
Now we consider more general functions.
Suppose D ⊂ Rm and the range lies in Rn, i.e., f : D (⊂ Rm) → Rn.
The image of f is the set of points in the range into which some point in the domain is mapped:
I := {y|y = f (x) for some x ∈ D} ⊂ R
The inverse image of a set of points S ⊂ I is defined as:

f^{-1}(S) := {x | x ∈ D, f(x) ∈ S}
Definition 2 (Def A1.9)
Let D be a subset of Rm, and let f : D → Rn. Then f is
continuous at a point x0 if, for all ε > 0, there exists δ > 0 such that

f(Bδ(x0) ∩ D) ⊂ Bε(f(x0)).
Existence of Solutions
Compact Set
A set S in Rn is called bounded if there exists some ε > 0 such that S ⊂ Bε(x) for some x ∈ Rn.
A set S in Rn is called compact if it is closed and bounded.

Theorem 3 (Weierstrass, Existence of Extreme Values)
Let f : S → R be a continuous real-valued function where S is a non-empty compact subset of Rn. Then f attains its maximum and minimum values on S. That is, there exist vectors x̄ and x̲ in S such that

f(x̲) ≤ f(x) ≤ f(x̄) for all x ∈ S.
Fg: Figure A1.18 (see JR, pp.522)
Most problems in economics have compact domains and continuous objective functions. ⇒ Solutions guaranteed!
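As a numerical illustration (not from JR), a coarse grid search on a compact interval approximates the maximum and minimum whose existence the Weierstrass theorem guarantees; the function x·e^(−x) on [0, 2] is my own choice here.

```python
# Illustration: a continuous function on the compact set S = [0, 2]
# attains its max and min (Weierstrass). Approximate both by
# evaluating f on a fine grid of points in S.
import math

def f(x):
    return x * math.exp(-x)  # continuous on [0, 2]

grid = [i * 2.0 / 10000 for i in range(10001)]
values = [f(x) for x in grid]
fmax, fmin = max(values), min(values)
argmax = grid[values.index(fmax)]

# The true maximizer of x*exp(-x) is x = 1, with value 1/e;
# the minimum on [0, 2] is 0, attained at x = 0.
print(argmax, fmax, fmin)
```

On a non-compact domain such as (0, ∞) with f(x) = 1/x, no minimum exists, which is why compactness matters for the guarantee.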
Partial Derivative
Definition 4
Let {hk} be an (infinite) sequence of real numbers converging to 0, and let f : S (⊂ Rn) → R be a continuous function. Then, we say that f is differentiable with respect to xi at x if

lim_{k→∞} [f(x1, ..., xi + hk, ..., xn) − f(x1, ..., xi, ..., xn)] / hk

always exists and does not depend on the choice of {hk}. We call this limit the partial derivative of f with respect to xi, denoted by ∂f(x)/∂xi or fi(x).
Note that the partial derivative itself is a function of x.
If f is differentiable with respect to every element xi, i = 1, 2, ..., n, at x, then we say that f is differentiable at x. If f is differentiable at all x ∈ S, then we say that f is a differentiable function.
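The limit in Definition 4 can be sketched numerically by fixing one small h in the difference quotient; the helper `partial` and the example function below are illustrative choices, not part of the lecture.

```python
# Sketch: approximate the partial derivative f_i(x) by the
# difference quotient (f(x + h*e_i) - f(x)) / h with a small h.
def partial(f, x, i, h=1e-6):
    """Difference quotient from Definition 4 along coordinate i."""
    xh = list(x)
    xh[i] += h
    return (f(xh) - f(list(x))) / h

# Example function: f(x1, x2) = x1^2 * x2,
# so f_1(x) = 2*x1*x2 and f_2(x) = x1^2.
f = lambda x: x[0] ** 2 * x[1]
print(partial(f, [3.0, 2.0], 0))  # close to 2*3*2 = 12
print(partial(f, [3.0, 2.0], 1))  # close to 3^2 = 9
```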
Total Derivative
Definition 5
For a differentiable function f, the total differential is defined as follows:

df(x) = ∂f(x)/∂x1 dx1 + ∂f(x)/∂x2 dx2 + · · · + ∂f(x)/∂xn dxn.
If f is a differentiable function and fi(x) is a continuous function for all i, then we say f is continuously differentiable.
Lagrange’s Method (1)
Lagrange’s method is a powerful way to solve (equality) constrained optimization problems, which essentially translates them into unconstrained problems.
Again, consider the following problem:

max_{x1,x2} f(x1, x2) subject to g(x1, x2) = 0.
Let us construct a new function L(x1, x2, λ), called the Lagrangian function as follows:
L(x1, x2, λ) = f (x1, x2) + λg(x1, x2).
Lagrange’s Method (2)
Then maximize this Lagrangian function, that is, derive the first order conditions:
∂L/∂x1 = ∂f(x1*, x2*)/∂x1 + λ* ∂g(x1*, x2*)/∂x1 = 0
∂L/∂x2 = ∂f(x1*, x2*)/∂x2 + λ* ∂g(x1*, x2*)/∂x2 = 0
∂L/∂λ = g(x1*, x2*) = 0.
Lagrange’s method asserts that if we find values x1*, x2*, and λ* that solve these three equations simultaneously, then we will have a critical point of f along the constraint g(x1, x2) = 0.
Practice of Lagrange’s Method
Ex: Example A2.11 (see JR, pp.508)
max_{x1,x2} x1 x2 subject to a − 2x1 − 4x2 = 0

Forming the Lagrangian, we get

L = x1 x2 + λ[a − 2x1 − 4x2],

with first order conditions:

∂L/∂x1 = x2 − 2λ = 0,
∂L/∂x2 = x1 − 4λ = 0,
∂L/∂λ = a − 2x1 − 4x2 = 0.

These can be solved to find

x1 = a/4, x2 = a/8, λ = a/16.
Note that the solution of the problem is a function of parameter a.
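The closed-form solution above can be checked numerically; the concrete value a = 8 below is an arbitrary choice for illustration.

```python
# Check Example A2.11 at a = 8: the candidate x1 = a/4, x2 = a/8,
# lambda = a/16 satisfies all three first-order conditions.
a = 8.0
x1, x2, lam = a / 4, a / 8, a / 16

assert abs(x2 - 2 * lam) < 1e-12         # dL/dx1 = 0
assert abs(x1 - 4 * lam) < 1e-12         # dL/dx2 = 0
assert abs(a - 2 * x1 - 4 * x2) < 1e-12  # dL/dlambda = 0 (feasibility)

# A coarse search along the constraint (substitute x2 = (a - 2*x1)/4)
# confirms that x1 = a/4 maximizes x1*x2 among feasible points.
best = max((t * (a - 2 * t) / 4, t)
           for t in [i * a / 2 / 1000 for i in range(1001)])
print(best[1])  # close to x1 = a/4 = 2.0
```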
Envelope Theorem (1)
Consider the following constrained optimization problem P1:

P1 : max_x f(x, a) s.t. g(x, a) = 0,
where x is a vector of choice variables, and a := (a1, ..., am) is a vector of parameters that may enter the objective function, the constraint, or both. Suppose that for each vector a, the solution is unique and denoted by x(a).
A maximum-value function, denoted by M(a), is defined as follows:

M(a) := max_x f(x, a) s.t. g(x, a) = 0,

or equivalently, M(a) := f(x(a), a).
Envelope Theorem (2)
If the objective function, constraint, and the solutions are differentiable in the parameters, there is a very powerful theorem that shows how the solutions vary with the parameters.
Theorem 6 (Envelope Theorem)
Consider P1 and suppose the objective function and constraint are continuously differentiable in a. For each a, let x(a) ≫ 0 uniquely solve P1 and assume that it is also continuously differentiable in the parameters a. Then, the Envelope theorem states that
∂M(a)/∂aj = ∂L/∂aj |_(x(a), λ(a)),   j = 1, ..., m,

where the right-hand side denotes the partial derivative of the Lagrangian function with respect to the parameter aj, evaluated at the point (x(a), λ(a)).
See JR, pp.506-507 for the proof.
Practice of Envelope Theorem
Ex: Example A2.11 (again)
max_{x1,x2} x1 x2 s.t. a − 2x1 − 4x2 = 0.

We form the maximum-value function by substituting the solutions for x1 and x2 into the objective function. Thus,

M(a) = x1(a) x2(a) = (a/4) · (a/8) = a²/32.

Differentiating M(a) with respect to a, we get

dM(a)/da = a/16,

which tells us how the maximized value varies with a. Applying the Envelope theorem, we directly obtain

dM(a)/da = ∂L/∂a |_(x(a), λ(a)) = λ(a).

Since λ(a) = a/16, we have verified that the Envelope theorem works.
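A minimal numerical sketch of this verification (the value a = 8 is an arbitrary choice): a finite difference of M(a) = a²/32 should match the multiplier λ(a) = a/16.

```python
# Envelope theorem check for Example A2.11: dM/da should equal
# lambda(a) = a/16, where M(a) = a^2/32 is the maximum-value function.
def M(a):
    return a * a / 32

a, h = 8.0, 1e-6
dM = (M(a + h) - M(a - h)) / (2 * h)  # central finite difference
lam = a / 16                          # multiplier from Lagrange's method
print(dM, lam)  # both close to 0.5
```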
Intuitions of Lagrange’s Method (1)
Q: Why does Lagrange's method work?

Take the total differential of the Lagrangian:
dL = ∂L/∂x1 dx1 + ∂L/∂x2 dx2 + ∂L/∂λ dλ.

When (x1, x2, λ) = (x1*, x2*, λ*), it can be re-written as follows:

dL = (∂f(x1*, x2*)/∂x1 + λ* ∂g(x1*, x2*)/∂x1) dx1 + (∂f(x1*, x2*)/∂x2 + λ* ∂g(x1*, x2*)/∂x2) dx2 + g(x1*, x2*) dλ = 0.

Since g(x1*, x2*) = 0,

0 = dL = ∂f(x1*, x2*)/∂x1 dx1 + ∂f(x1*, x2*)/∂x2 dx2 + λ* (∂g(x1*, x2*)/∂x1 dx1 + ∂g(x1*, x2*)/∂x2 dx2)

for all dx1 and dx2 that satisfy the constraint g.
Intuitions of Lagrange’s Method (2)
Note that

dg = ∂g(x1*, x2*)/∂x1 dx1 + ∂g(x1*, x2*)/∂x2 dx2 = 0.

So, we can show that

dL = ∂f(x1*, x2*)/∂x1 dx1 + ∂f(x1*, x2*)/∂x2 dx2 = 0
for all dx1 and dx2 that satisfy the constraint g. Thus, (x∗1, x∗2) is indeed a critical point of f given that the variables must satisfy the constraint.
Lagrange’s method is very clever and useful. In effect, it offers us an algorithm for identifying the constrained optima in a wide class of practical problems.