nonneg_least_squares

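IMSL_DROP_TOLERANCE, float tol (Input)

This is the rank-determination tolerance. A candidate column

    $a = \begin{bmatrix} c \\ d \end{bmatrix}$

has the values below the first entry of d eliminated. The result must satisfy the relative condition

    $\|d\|_2 > tol \cdot \|c\|_2$.

Otherwise, column a is linearly dependent on previously dropped columns, and the constraint remains satisfied.

Default: tol = sqrt(imsl_f_machine(3))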

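IMSL_SUPPLY_WORK_ARRAYS, int lwork, float work[], int liwork, int iwork[] (Input/Output)

With this optional argument the user provides the sizes and locations of the work arrays work and iwork. For large problems this improves efficiency and avoids memory fragmentation. With maxt denoting the maximum number of threads that will be active, the following conditions must hold:

    lwork ≥ maxt*(m*(n+2) + n) and liwork ≥ maxt*n

When OpenMP parallel threading is not used, maxt = 1.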
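As a quick check of the size conditions, the following sketch computes the minimum array lengths. The function names are hypothetical helpers, not part of the library, and `size_t` is used here for convenience even though the library arguments are `int`.

```c
#include <stddef.h>

/* Minimum length of work[]: maxt*(m*(n+2) + n). Hypothetical helper. */
static size_t nnls_min_lwork(size_t m, size_t n, size_t maxt)
{
    return maxt * (m * (n + 2) + n);
}

/* Minimum length of iwork[]: maxt*n. Hypothetical helper. */
static size_t nnls_min_liwork(size_t n, size_t maxt)
{
    return maxt * n;
}
```

For m = 21, n = 3 and maxt = 1 this gives 108 floats for work and 3 ints for iwork; with four threads the requirements grow to 432 and 12.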

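IMSL_OPTIMIZED, int *flag (Output)

A 0-1 flag indicating whether the optimum residual norm was obtained. A value of 1 indicates that the optimum residual norm was obtained. The value is 0 if the maximum number of iterations was reached.

    flag    Description
    0       The maximum number of iterations was reached.
    1       The optimum residual norm was obtained.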

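IMSL_DUAL_SOLUTION, float **dual (Output)

An array of length n containing the dual vector

    $w = A^T (Ax - b)$.

If the maximum number of iterations is reached first, this vector may not be optimal (not every component necessarily satisfies w ≤ 0).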
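The dual vector can be recomputed from A, x and b for checking purposes. The sketch below is a plain C illustration under the assumption of a row-major m-by-n matrix; the function name `dual_vector` and the scratch array `r` are hypothetical and not part of the library.

```c
#include <stddef.h>

/* Computes w = A^T (A x - b).
   A is m-by-n in row-major order, r is scratch storage of length m
   that receives the residual A x - b, and w has length n. */
static void dual_vector(size_t m, size_t n, const double *A,
                        const double *x, const double *b,
                        double *r, double *w)
{
    size_t i, j;
    for (i = 0; i < m; i++) {
        double s = -b[i];
        for (j = 0; j < n; j++)
            s += A[i * n + j] * x[j];
        r[i] = s;                      /* r = A x - b */
    }
    for (j = 0; j < n; j++) {
        double s = 0.0;
        for (i = 0; i < m; i++)
            s += A[i * n + j] * r[i];
        w[j] = s;                      /* w = A^T r */
    }
}
```

When b lies exactly in the range of A and x reproduces it, the residual is zero and so is every component of w.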

IMSL_DUAL_SOLUTION_USER, float dual[] (Output)

A user-supplied storage area for the dual vector. See IMSL_DUAL_SOLUTION.

IMSL_RESIDUAL_NORM, float *rnorm (Output)

The norm of the residual vector,

    $\|Ax - b\|_2$.

IMSL_RETURN_USER, float x[] (Output)

A user-allocated array of length n containing the approximate solution vector, x ≥ 0.

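Description

Function imsl_f_nonneg_least_squares computes the least squares solution of Ax ≈ b by minimizing

    $\|Ax - b\|_2$

subject to the constraint x ≥ 0. The algorithm used is NNLS, described in Charles L. Lawson and Richard J. Hanson, Solving Least Squares Problems, SIAM Publications, Chap. 23, (1995).

The multithreading support and the constraint drop tolerance adopted by this function are new features. The original NNLS method does not address multiple threads, and it computes all dual components even when only one of them is used. Making the first eligible variable encountered inactive usually improves performance. An optimum solution is obtained with either approach. There is no restriction on the relative sizes of m and n.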
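To make the problem statement concrete, here is a small self-contained solver for min ||Ax - b||_2 subject to x ≥ 0. It deliberately uses cyclic coordinate descent, which is a different and much simpler technique than the Lawson-Hanson active-set NNLS algorithm that the library documents; it is offered only as a sketch for experimenting with tiny problems. The function name, the row-major layout, and the fixed sweep count are all assumptions made for this illustration.

```c
#include <stddef.h>

/* Cyclic coordinate descent for min ||A x - b||_2 subject to x >= 0.
   NOT the Lawson-Hanson NNLS algorithm; an illustrative substitute.
   A is m-by-n, row-major. On exit x holds the iterate and r = A x - b. */
static void nnls_coordinate_descent(size_t m, size_t n, const double *A,
                                    const double *b, double *x, double *r,
                                    int sweeps)
{
    size_t i, j;
    int s;

    for (j = 0; j < n; j++)
        x[j] = 0.0;                    /* start from the feasible point 0 */
    for (i = 0; i < m; i++)
        r[i] = -b[i];                  /* residual for x = 0 */

    for (s = 0; s < sweeps; s++) {
        for (j = 0; j < n; j++) {
            double g = 0.0, h = 0.0, xnew, dx;
            for (i = 0; i < m; i++) {
                g += A[i * n + j] * r[i];           /* j-th gradient entry */
                h += A[i * n + j] * A[i * n + j];   /* squared column norm */
            }
            if (h == 0.0)
                continue;              /* zero column: leave x[j] alone */
            xnew = x[j] - g / h;       /* exact minimizer along coordinate j */
            if (xnew < 0.0)
                xnew = 0.0;            /* project onto x[j] >= 0 */
            dx = xnew - x[j];
            for (i = 0; i < m; i++)
                r[i] += dx * A[i * n + j];          /* keep r = A x - b */
            x[j] = xnew;
        }
    }
}
```

For A equal to the 2-by-2 identity and b = (1, -1), the unconstrained solution would have a negative second component, and the sketch returns x = (1, 0). Convergence can be slow for ill-conditioned matrices, which is one reason an active-set method is preferred in practice.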

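Example 1

Consider the following exponential function:

    $f(t) = c_1 + c_2 e^{-\lambda_2 t} + c_3 e^{-\lambda_3 t}, \quad t \ge 0$

The parameters in the exponents, $\lambda_2 = 1$ and $\lambda_3 = 5$, are fixed. The coefficients $c_j \ge 0$, $j = 1, 2, 3$, are estimated by nonnegative least squares from the 21 sample values $f(t_i)$. The data use the points $t_i = 0.25\,i$, $i = 0, \ldots, 20$, with

    $c_1 = 1,\; c_2 = 0.2,\; c_3 = 0.3$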

#include <imsl.h>
#include <math.h>

#define M 21
#define N 3

int main()
{
    int i;
    float a[M][N], b[M], *c;

    for (i = 0; i < M; i++) {
        /* Generate exponential values. This model is
           y(t) = c_0 + c_1*exp(-t) + c_2*exp(-5*t) */
        a[i][0] = 1.0;
        a[i][1] = exp(-(i*0.25));
        a[i][2] = exp(-(i*0.25)*5.0);

        /* Compute sample values */
        b[i] = a[i][0] + 0.2*a[i][1] + 0.3*a[i][2];
    }

    /* Solve for coefficients, constraining values to be non-negative. */
    c = imsl_f_nonneg_least_squares(M, N, &a[0][0], b, 0);

    /* With noise level = 0, solution should be (1, 0.2, 0.3) */
    imsl_f_write_matrix("Coefficients", 1, N, c, 0);
}

Output

        Coefficients
      1        2        3
    1.0      0.2      0.3

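Example 2

Consider the exponential function

    $f(t) = c_1 + c_2 e^{-\lambda_2 t} + c_3 e^{-\lambda_3 t} + n(t), \quad t \ge 0$

Here the values of $\lambda_2$ and $\lambda_3$ are the same as in Example 1. The function n(t) represents random noise that is normally distributed with standard deviation $\sigma = 10^{-3}$. A simulation with $n_s = 10001$ samples of n(t) was carried out. The resulting problems are solved in parallel with OpenMP. To check that the OpenMP results are correct, a loop that does not use OpenMP is also computed. The residual norms of the two loops agree, which shows that the parallel OpenMP run and the sequential run give the same results.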

#include <imsl.h>
#include <stdio.h>
#include <stdlib.h>
#include <math.h>
#include <omp.h>

#define M 21
#define N 3
#define NS 10001

int main()
{
#define BS(i_,j_) bs[(i_)*M + (j_)]
#define X(i_,j_) x[(i_)*N + (j_)]

    int thread_safe=1, seed=123457, i, *iwork, j, lwork, liwork, maxt;
    float b[M], *work, sigma=1.0e-3, a[M][N], rseq[NS], rpar[NS], *bs, *x;

    /* Allocate work memory for all threads that are used in the
       loops below. */
    maxt = omp_get_max_threads();
    lwork = maxt*(M*(N+2)+N);
    liwork = maxt*N;
    work = (float *) malloc(lwork * sizeof(float));
    iwork = (int *) malloc(liwork * sizeof(int));
    x = (float *) malloc(NS*N * sizeof(float));
    bs = (float *) malloc(NS*M * sizeof(float));

    for (i = 0; i < M; i++) {
        /* Generate matrix values. This model is
           y(t) = c_0 + c_1*exp(-t) + c_2*exp(-5*t) + n(t) */
        a[i][0] = 1.0;
        a[i][1] = exp(-(i*0.25));
        a[i][2] = exp(-(i*0.25)*5.0);
    }

    /* Solve for coefficients, constraining values to be non-negative.
       First use a sequential for loop. Then a parallel for loop.
       Record the residual norms and compare them. */
    imsl_random_seed_set(seed);

    /* First the sequential loop.
       Working memory is not included as an argument. */
    for (j = 0; j < NS; j++) {
        imsl_f_random_normal(M, IMSL_RETURN_USER, b, 0);

        /* Add normal pdf noise at the level sigma. */
        for (i=0; i<M; i++) {
            b[i] = sigma*b[i] + a[i][0] + 0.2*a[i][1] + 0.3*a[i][2];
            BS(j,i) = b[i];
        }
        imsl_f_nonneg_least_squares(M, N, &a[0][0], &BS(j,0),
            IMSL_RETURN_USER, &X(j,0),
            IMSL_RESIDUAL_NORM, &rseq[j],
            0);
    }

    /* Then the parallel for loop using OpenMP.
       Working memory is an optional argument. This is not required
       but helps prevent memory fragmentation. */

    /* Reset x for output for the OpenMP loop. */
    for (i = 0; i < NS*N; i++) x[i] = 0.0;

    /* The pragma must stand on its own line, directly before the loop. */
#pragma omp parallel for private(j)
    for (j = 0; j < NS; j++) {
        imsl_f_nonneg_least_squares(M, N, &a[0][0], &BS(j,0),
            IMSL_RETURN_USER, &X(j,0),
            IMSL_RESIDUAL_NORM, &rpar[j],
            IMSL_SUPPLY_WORK_ARRAYS, lwork, work, liwork, iwork,
            0);
    }

    /* Check that residual norms agree exactly for both loops.
       They should because the same problems are solved - one set
       sequentially and the next set in parallel. */
    for (j = 0; j < NS; j++) {
        if (rpar[j] != rseq[j]) {
            thread_safe = 0;
            break;
        }
    }

    if (thread_safe)
        printf("imsl_f_nonneg_least_squares is thread-safe.\n");
    else
        printf("imsl_f_nonneg_least_squares is not thread-safe.\n");

    system("pause");
}

Output

imsl_f_nonneg_least_squares is thread-safe.

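Warning Errors

IMSL_MAX_NNLS_ITER_REACHED    The maximum number of iterations was reached. The best answer found is returned. “itmax” = # was used; a larger value may allow the computation to complete.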

Source: imslc2016 1 math, IMSL C Library v2016.1 User's Guide: Math (pages 171-176)
