• 検索結果がありません。

A study on the noise suppression method based on the MTF concept

N/A
N/A
Protected

Academic year: 2021

シェア "A study on the noise suppression method based on the MTF concept"

Copied!
4
0
0

読み込み中.... (全文を見る)

全文

(1)

Japan Advanced Institute of Science and Technology

JAIST Repository

https://dspace.jaist.ac.jp/

Title 変調伝達関数に基づいた雑音抑圧に関する研究

Author(s) 山崎, 悠

Citation

Issue Date 2009‑03

Type Thesis or Dissertation Text version author

URL http://hdl.handle.net/10119/8099 Rights

Description Supervisor:鵜木 祐史, 情報科学研究科, 修士

(2)

A study on the noise suppression method based on the MTF concept

Yutaka Yamasaki (0710073) School of Information Science,

Japan Advanced Institute of Science and Technology February 5, 2009

Keywords: Modulation Transfer Function, Temporal power envelope, Noisy environments, Noise suppression.

In real environments, significant features of speech are smeared due to noise and reverberation so that sound quality and intelligibility of the ob- served speech are drastically reduced. Improvement of noisy reverberant speech (noise suppression and dereverberation) is, therefore, needed in vari- ous speech signal processing, such as hearing aid systems and preprocessing for automatic speech recognition systems.

There are several well-known suppression methods which can be used to remove the effects of noise or reverberation in either noisy or reverberant environments. There are, for example, spectral subtraction method pro- posed by Boll, Kalman filtering method proposed by Paliwal and Basu, minimum-phase inverse filtering method proposed by Neely and Allen, and multiple input/output inverse theorem (MINT) method proposed by Miyoshi and Kaneda. Although these methods can work well in either noisy or reverberant environments, it cannot work in both noisy and rever- berant environments, simultaneously. Recently, Kinoshita et al. studied a strategy of the speech enhancement method in noisy reverberant envi- ronments, by considering two sequential processes: noise reduction using spectral subtraction for noisy reverberant speech and then dereverberation using linear prediction for noise-reduced reverberant speech. However, it seems to be more complex in Kinoshita’s modeling. We thought that the

Copyright c2009 by Yutaka Yamasaki

1

(3)

best solution should be able to deal with both additive noise and reverber- ant effect simultaneously.

On the other hand, Houtgast and Steeneken proposed a prediction method that can assess the effects of the enclosure on speech intelligibility in both noisy and reverberant environments by using the modulation transfer func- tion (MTF). The MTF concept makes the suppression method of both noise and reverberation simultaneously realization.

Unoki et al. proposed the temporal power envelope inverse filtering method based on the MTF concept. Their method assumed an environ- ment that is the reverberation. These methods improved about 30% of the reduction in speech intelligibility caused by the reverberation. If we could propose a noise suppression method based on MTF concept, we can propose an MTF-based speech enhancement method which can suppress noise and reverberation simultaneously, so that this method can improve the loss of speech intelligibility suffered from additive noise and reverber- ation. The goal of our work is to propose a speech enhancement method for noise reduction and dereverberation.

We propose a noise suppression method by restoring the smeared MTF.

The modulation index and the averaged power in the temporal output power envelope (e2

y(t)) are affected by noise. We restore the averaged power levels and modulation index using MTF to suppress the noise effects.

Input information is needed for the calculation of MTF. The average of temporal noise power envelope (e2

n) is estimated from non-speech sections.

The average of temporal input power envelope is estimated by subtracting e2

n from the average of temporal output power envelope. MTF can be calculated by using this SNR.

We carried out three simulations on noise reduction to evaluate whether the proposed method can adequately restore the desired speech from noisy speech signals. Three Japanese sentences uttered by ten speakers (five males and five females) were used in the evaluations. Three types of noise, white, pink, and babble noise signals, were used in the simulations. Signal to noise ratios (SNRs) were fixed at 20, 10, 5, 0, and -5 dB. In these simulations, correlation (Corr), SNR, and log spectrum distortion (LSD) and weighted LSD of speech intelligibility were used as evaluation measures to show the improvement the restoration accuracy achieved through our

2

(4)

method. As the results of simulations, the maximum improvement in LSD was about 31 dB, the maximum improvement in weighted LSD was about 8 dB, the improvement in Corr was constant and the improvements in SNR increased as SNRs decreased. These results show that the proposed method can improve the temporal power envelope and the waveform of the input signal from the noisy signal. We found that the proposed method can be used to adequately restore the temporal power envelopes and to suppress the noise effects of noisy signal.

3

参照

関連したドキュメント

In this paper, we extend the results of [14, 20] to general minimization-based noise level- free parameter choice rules and general spectral filter-based regularization operators..

Using the semigroup approach for stochastic evolution equations in Banach spaces we obtain existence and uniqueness of solutions with sample paths in the space of continuous

These authors make the following objection to the classical Cahn-Hilliard theory: it does not seem to arise from an exact macroscopic description of microscopic models of

These authors make the following objection to the classical Cahn-Hilliard theory: it does not seem to arise from an exact macroscopic description of microscopic models of

Finally, in Section 7 we illustrate numerically how the results of the fractional integration significantly depends on the definition we choose, and moreover we illustrate the

The aim of this work is to prove the uniform boundedness and the existence of global solutions for Gierer-Meinhardt model of three substance described by reaction-diffusion

In this paper, based on a new general ans¨atz and B¨acklund transformation of the fractional Riccati equation with known solutions, we propose a new method called extended

First main point: A general solution obeying the 4 requirements above can be given for lattices in simple algebraic groups and general domains B t , using a method based on