• 検索結果がありません。

Noise Reduction Based on Microphone Array and Post‑filtering for Robust Hands‑free Speech Recognition in Adverse Environments

N/A
N/A
Protected

Academic year: 2021

シェア "Noise Reduction Based on Microphone Array and Post‑filtering for Robust Hands‑free Speech Recognition in Adverse Environments"

Copied!
2
0
0

読み込み中.... (全文を見る)

全文

(1)

Japan Advanced Institute of Science and Technology

JAIST Repository

https://dspace.jaist.ac.jp/

Title

Noise Reduction Based on Microphone Array and Post‑filtering for Robust Hands‑free Speech Recognition in Adverse Environments

Author(s) 李, 軍鋒

Citation

Issue Date 2006‑03

Type Thesis or Dissertation Text version author

URL http://hdl.handle.net/10119/973 Rights

Description Supervisor:赤木 正人, 情報科学研究科, 博士

(2)

Noise Reduction Based on Microphone Array and Post-filtering for Robust Hands-free Speech Recognition in Adverse

Environments

Junfeng Li

School of Information Science,

Japan Advanced Institute of Science and Technology March, 2006

Abstract

This research proposes a noise reduction system using microphone array and post-filtering with the goal of improving the recognition accuracy and robustness of hands-free speech recognition systems in adverse environments.

Acoustic noise signals dramatically degrade the performance of many speech applications, such as speech recognition system, in practical environments. Though the problem of dealing with acoustic noises has been researched for several decades and is still a challenging topic due to the complex and time-varying characteristics of signals and acoustic environments. Noises with different characteristics from various kinds of sources make it difficult to construct an effective noise reduction system. Moreover, only the system with small physical size is preferable because of the limited space, e.g., in car environments.

To deal with localized noise, we propose a hybrid noise estimation technique by combining the multi- channel estimation approach and a single-channel estimation approach. The estimation accuracy of this hybrid technique is further improved by integrating arobust and accurate speech absence probability (RA- SAP) estimator. The estimated spectrum of localized noise is then compensated and suppressed from that of noisy observation on each microphone. Moreover, we further develop a generalized subtractive beamformer by relaxing the assumption of a perfectly coherent noise field to the one of an arbitrary noise field. The theoretical analysis is also presented to show the linkage between these two beamformers and to show the theoretical performance of the generalized algorithm in the theoretically defined noise fields.

To further deal with the residual non-localized noise post-filtering is normally used at beamformer output. We propose a hybrid post-filter for microphone arrays with an assumption of a diffuse noise field.

In the proposed hybrid post-filter, a modified Zelinski post-filter is applied to the high frequencies; a single-channel Wiener filter is applied to the low frequencies. The proposed hybrid post-filter was proven to be able to deal with both high-correlated and low-correlated noise components in a diffuse noise field.

The performance of the proposed noise reduction system is finally investigated using speech recognition results. The speech recognition results show that the proposed noise reduction algorithm outperforms the other traditional ones in improving the speech recognition performance in the tested adverse environments.

Compared with other traditional noise reduction algorithms, this proposed algorithm demonstrates some advantages: (1) in theory, it provides the optimal solution to the problem of multi-channel noise reduction for broad-band inputs inminimum mean square error sense; (2) it is able to deal with various kinds of noise signals, including localized and non-localized noise, stationary and non-stationary (e.g., sudden) noise; (3) it avoids the problems of slow convergence rate and low stability in practical environ- ments; (4) it can be implemented in real-time mode; (5) it is successful in improving the performance of hands-free speech recognition systems in adverse environments.

In addition to hands-free speech recognition systems, the noise reduction system proposed in this thesis is also useful and preferable to many other applications. For example, for hearing aid, it is able to provide more clean and intelligible speech, enhancing the performance of hearing aid to hearing impaired with a small-size microphone array at a low computational complexity in adverse conditions.

Key Words: Hands-free speech recognition, noise reduction, microphone array, beamforming, post-filtering, multi-channel Wiener filter.

参照

関連したドキュメント

In this paper, we extend the results of [14, 20] to general minimization-based noise level- free parameter choice rules and general spectral filter-based regularization operators..

At the same time, a new multiplicative noise removal algorithm based on fourth-order PDE model is proposed for the restoration of noisy image.. To apply the proposed model for

In order to demonstrate that the CAB algorithm provides a better performance, it has been compared to other optimization approaches such as metaheuristic algorithms Section 4.2

– Classical solutions to a multidimensional free boundary problem arising in combustion theory, Commun.. – Mathematics contribute to the progress of combustion science, in

In this paper, we propose an exact algorithm based on dichotomic search to solve the two-dimensional strip packing problem with guillotine cut2. In Section 2 we present some

Definition An embeddable tiled surface is a tiled surface which is actually achieved as the graph of singular leaves of some embedded orientable surface with closed braid

From the- orems about applications of Fourier and Laplace transforms, for system of linear partial differential equations with constant coefficients, we see that in this case if

In order to solve this problem we in- troduce generalized uniformly continuous solution operators and use them to obtain the unique solution on a certain Colombeau space1. In