
Japan Advanced Institute of Science and Technology

JAIST Repository

https://dspace.jaist.ac.jp/

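Title Study of a speech enhancement technique supporting binaural selective hearing in noisy reverberant environments

Author(s) Sasaki, Yuuki (佐々木, 裕吉)

Citation

Issue Date 2012‑03

Type Thesis or Dissertation

Text version author

URL http://hdl.handle.net/10119/10436

Rights

Description Supervisor: Masato Akagi (赤木 正人), School of Information Science, Master's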


Speech enhancement technique supporting binaural selective hearing in noisy reverberant environment

Yuuki Sasaki (0910025)

School of Information Science, Japan Advanced Institute of Science and Technology

Jun 31, 2012

Keywords: Speech Enhancement Technique, Binaural Selective Hearing, Noisy Reverberant Environment, Two–Stage Binaural Speech Enhancement with Wiener Filter, Cepstral Mean Subtraction.

Speech recognition becomes difficult under the influence of noise and/or reverberation. In addition, some reports indicate that the listening ability of hearing-impaired people declines markedly in noisy reverberant environments. Speech enhancement techniques that suppress noise and/or reverberation have therefore been introduced into applications such as hearing aids and speech recognition. Among the techniques proposed so far, some focus on the binaural hearing of humans.

The frequency domain binaural model (FDBM), based on Lindemann's binaural hearing model, was proposed by Usagawa et al. This method calculates the interaural phase difference and the interaural level difference to estimate the direction of the target signal, and then uses that estimate to enhance the received signal in noisy environments. Two–Stage Binaural Speech Enhancement with Wiener Filter (TS–BASE/WF) was proposed by Li et al. to suppress noise in two steps: a noise estimation stage followed by a noise suppression stage. TS–BASE/WF achieves excellent noise reduction because of this two-step processing.
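As a rough illustration of the interaural cues mentioned above, the following sketch computes the interaural phase difference (IPD) and interaural level difference (ILD) for each frequency bin from the left and right spectra of one frame. The function name `interaural_cues`, the use of the cross-spectrum phase for the IPD, and the dB definition of the ILD are assumptions made for illustration; the abstract does not give the exact formulas used by FDBM.

```python
import cmath
import math

def interaural_cues(left_spec, right_spec, eps=1e-12):
    """Return a list of (IPD in radians, ILD in dB), one pair per frequency bin.

    left_spec, right_spec: sequences of complex spectral bins of equal length.
    Assumed definitions (not taken from the thesis):
      IPD = phase of the cross-spectrum L * conj(R)
      ILD = 20 * log10(|L| / |R|)
    """
    if len(left_spec) != len(right_spec):
        raise ValueError("spectra must have the same number of bins")
    cues = []
    for l_bin, r_bin in zip(left_spec, right_spec):
        cross = l_bin * r_bin.conjugate()
        ipd = cmath.phase(cross)  # value in [-pi, pi]
        # eps avoids log10(0) and division by zero for silent bins
        ild = 20.0 * math.log10((abs(l_bin) + eps) / (abs(r_bin) + eps))
        cues.append((ipd, ild))
    return cues

# Toy frame: in bin 0 the left channel leads by 90 degrees and has twice the amplitude.
left = [2.0 * cmath.exp(1j * math.pi / 2), 1.0 + 0j]
right = [1.0 + 0j, 1.0 + 0j]
cues = interaural_cues(left, right)
```

In a direction estimator, cues like these would be compared per bin against the values expected for the target direction; that comparison step is not shown here.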

When these speech enhancement techniques are used indoors, they must suppress noise and reverberation simultaneously.

Copyright © 2012 by Yuuki Sasaki


A room impulse response (RIR) can be divided into early reflections and late reverberation, with the boundary at a time that depends on the size of the room. Early reflections are correlated with the target signal. Late reverberation, which is the superposition of many reflected sounds, is less correlated with the target signal. Moreover, late reverberation is diffuse throughout the room.
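The division of an RIR described above can be sketched as follows. The boundary time is passed in as a parameter because the abstract only states that it depends on the room size; the 50 ms boundary, the sampling rate, and the function name `split_rir` in this example are hypothetical.

```python
def split_rir(rir, fs, boundary_s):
    """Split a room impulse response into early and late parts.

    rir: sequence of samples; fs: sampling rate in Hz;
    boundary_s: boundary time in seconds (room dependent, chosen by the caller).
    Both parts are zero-padded to the original length so that
    early[i] + late[i] == rir[i] for every sample.
    """
    n = len(rir)
    k = min(max(int(round(boundary_s * fs)), 0), n)  # boundary sample index
    early = list(rir[:k]) + [0.0] * (n - k)
    late = [0.0] * k + list(rir[k:])
    return early, late

# Toy RIR: 10 samples at 100 Hz with a hypothetical boundary at 50 ms (sample 5).
rir = [1.0, 0.6, 0.4, 0.3, 0.2, 0.1, 0.08, 0.05, 0.03, 0.01]
early, late = split_rir(rir, fs=100, boundary_s=0.05)
```

Convolving a clean signal with `early` and with `late` separately gives one way to study the two components in isolation.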

FDBM estimates the direction of the target signal by using the cross-spectrum, so it may not work well under the influence of early reflections. The noise estimation stage of TS–BASE/WF does not use the cross-spectrum and could therefore work without being affected by early reflections or late reverberation. On the other hand, the noise suppression stage of TS–BASE/WF adopts a Wiener filter, which assumes that the target signal and the noise are uncorrelated. Hence, the enhanced signal could be affected by early reflections.
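To make the uncorrelatedness assumption concrete, here is a minimal sketch of a per-bin Wiener gain computed from the observed power spectrum and an estimated noise power spectrum. It uses the textbook form G = (P_x - P_n) / P_x, which relies on P_x = P_s + P_n and therefore only holds when target and noise are uncorrelated. It is not taken from the TS–BASE/WF implementation, and the function name and the gain floor are assumptions.

```python
def wiener_gain(observed_power, noise_power, floor=0.0):
    """Per-bin Wiener gain G = (P_x - P_n) / P_x, clipped to [floor, 1].

    observed_power: power spectrum of the noisy input (P_x) per bin.
    noise_power: estimated noise power spectrum (P_n) per bin.
    The formula assumes P_x = P_s + P_n, i.e. no cross term between
    target and noise (illustrative textbook form, not the thesis code).
    """
    gains = []
    for p_x, p_n in zip(observed_power, noise_power):
        if p_x <= 0.0:
            gains.append(floor)  # nothing observed in this bin
            continue
        g = (p_x - p_n) / p_x
        gains.append(min(1.0, max(floor, g)))
    return gains

# Three bins: mostly target, all noise, and an over-estimated noise level.
gains = wiener_gain([4.0, 1.0, 2.0], [1.0, 1.0, 3.0])
```

When a component such as an early reflection is correlated with the target, the cross term no longer vanishes and P_x is not simply P_s + P_n, which is why a gain of this form can distort the enhanced signal.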

Almost none of the speech enhancement techniques that support binaural selective hearing can suppress reverberation. This paper aims to construct a speech enhancement technique that supports binaural selective hearing in noisy reverberant environments. The performance of TS–BASE/WF in reverberant environments is evaluated. In addition, experiments verify whether TS–BASE/WF can suppress early reflections and late reverberation. The results show that TS–BASE/WF can suppress late reverberation. However, early reflections affect the signals enhanced by TS–BASE/WF because it uses a Wiener filter.

Based on these experimental results, Cepstral Mean Subtraction (CMS) is used as a front end for TS–BASE/WF in order to suppress early reflections. Experiments are then carried out to determine whether the modified method is superior to TS–BASE/WF in reverberant and/or noisy environments. The results indicate that the modified method outperforms TS–BASE/WF in reverberant environments and in noisy reverberant environments. From these results, a speech enhancement technique that supports binaural selective hearing in noisy reverberant environments was constructed. The performance of applications such as hearing aids and speech recognition is expected to improve when the modified TS–BASE/WF method is introduced into them.
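Cepstral mean subtraction can be illustrated on a sequence of cepstral vectors. A convolutive distortion that is short relative to the analysis frame appears as an approximately constant additive offset in the cepstral domain, so subtracting the time average of each coefficient removes it. The sketch below assumes the cepstra have already been computed; how the thesis couples CMS to TS–BASE/WF is not specified in the abstract, and `cepstral_mean_subtraction` is a hypothetical name.

```python
def cepstral_mean_subtraction(cepstra):
    """Subtract the per-coefficient time average from a list of cepstral frames.

    cepstra: list of frames, each a list of cepstral coefficients (equal length).
    Returns a new list of frames; the input is not modified.
    """
    if not cepstra:
        return []
    n_frames = len(cepstra)
    n_coeffs = len(cepstra[0])
    mean = [sum(frame[i] for frame in cepstra) / n_frames for i in range(n_coeffs)]
    return [[frame[i] - mean[i] for i in range(n_coeffs)] for frame in cepstra]

# Toy data: a constant cepstral offset stands in for a fixed channel effect.
clean = [[1.0, -1.0], [3.0, 1.0], [2.0, 0.0]]
offset = [0.5, -0.25]
distorted = [[c + o for c, o in zip(frame, offset)] for frame in clean]
normalized = cepstral_mean_subtraction(distorted)
```

In this toy case `normalized` matches the mean-normalized clean cepstra, because the constant offset is absorbed entirely into the subtracted mean.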
