• 検索結果がありません。

2018-12-20 Taiji Suzuki e-mail: [email protected] Solve the following problems.

N/A
N/A
Protected

Academic year: 2021

シェア "2018-12-20 Taiji Suzuki e-mail: [email protected] Solve the following problems."

Copied!
1
0
0

読み込み中.... (全文を見る)

全文

(1)

Reporting assignment

京都大学集中講義

「機械学習と深層学習の数理と応用」

2018-12-20 Taiji Suzuki e-mail: [email protected] Solve the following problems.

Due date: 10 January/2019.

1. Consider a linear model

Y = + ϵ

where Y R n , X R n × p , and ϵ = (ϵ i ) n i=1 R n . Suppose that ϵ i is an i.i.d. noise such that E[ϵ i ] = 0 and E[ϵ 2 i ] = σ 2 and X X is full rank. In this setting, evaluate the in-sample predictive error

E Y | X

[ 1

n X β ˆ LS 2 ]

. of the least squared estimator ˆ β LS R p .

2. (Stein’s shrinkage estimator) Let X = [X 1 , . . . , X d ] R d be distributed from multivariate normal N(µ, I) (mean µ and variance-covariance I). Assume d 3. Then, show that

δ = (

1 (d 2)

X 2 )

X satisfies

E X N (µ,I) [ X µ 2 ] > E X N (µ,I) [ δ(X) µ 2 ] ( µ R d ).

You may use the following Stein’s identy without proof: For a function f : R d R d (X 7→ f (X ) = [f 1 (X), . . . , f d (X )] ) such that E X [f i (X )] exists and f i is differentiable almost everywhere for all i, it holds that

E X [ 2 µ X, f (X ) ] = 2σ 2 E X

[ d

i=1

∂f i (X )

∂X i

] .

3. For 1 q < , let w q := ( ∑ d

i=1 | w i | q ) 1/q for w R d , and H q := { f (x) = w x (x R d ) | ∥ w q 1, w R d } . Given x 1 , . . . , x n R n , its empirical Rademacher complexity is denoted by

R ˆ n ( H q ) = E σ [

sup

f ∈H

q

1 n

n i=1

σ i f (x i )

x 1 , . . . , x d ]

.

Now, suppose that 1 p, q < satisfies q < p. Then, show that R ˆ n ( H p ) d 1/p

1/q

R ˆ n ( H q ) where p = p/(p 1) and q = q/(q 1).

4. Prove the Massart’s theorem: For a finite set of functions F = { f 1 , . . . , f M } where each f i (i = 1, . . . , M ) is a function from R d to R satisfying sup x | f i (x) | ≤ R, it holds that

R ˆ n ( F ) R

√ 2 log M

n .

Hint: You may use the following inequalities:

Jensen’s inequality: exp(sE σ [g(σ 1 , . . . , σ n )]) E σ [exp(sg(σ 1 , . . . , σ n ))] for s R and g : R n R .

exp(max f ∈F F(f ))

f ∈F exp(F (f)) for F : F → R .

Hoeffding’s inequality: E σ

i

[exp(σ i a)] exp(a 2 /2) for a R .

5. Suppose that x < 1 (a.s.). Derive an upper bound of the Rademacher complexity of a neural network model:

F =

 

M j=1

α j η(a j x) | α j R , a j R d , max

1 j M a j p C 1 , α q C 2

 

 where 1 p, q ≤ ∞ and η(u) = max { u, 0 } .

1

参照

関連したドキュメント

In terms of the i-invariants, the absolute p-adic Grothendieck conjecture is reduced to the following two

のようにすべきだと考えていますか。 やっと開通します。長野、太田地区方面  

Motivated by the new perturbation results of closed linear generalized inverses [12], in this paper, we initiate the study of the following problems for bounded homogeneous

For the lighting and air conditioning equipment, which account for more than half of the building’s energy consumption, energy efficient systems have been adopted, such as a

I have been visiting The Nippon Foundation, Kashiwa Company, Japan Aerospace Exploration Agency (JAXA), Ariake Water Reclamation Center, Tokyo University of Marine Science and

Operators attempted to use the diesel-driven fire pump, which was developed for use as a so-called AM measure in order to further enhance plant safety, to inject water into

[r]

日髙真吾 企画課長 日髙真吾 園田直子 企画課長 鈴木 紀 丹羽典生 樫永真佐夫 樫永真佐夫 樫永真佐夫 川瀬 慈 齋藤玲子 樫永真佐夫 三島禎子 山中由里子 川瀬