• 検索結果がありません。

Japan Advanced Institute of Science and Technology

N/A
N/A
Protected

Academic year: 2021

シェア "Japan Advanced Institute of Science and Technology"

Copied!
3
0
0

読み込み中.... (全文を見る)

全文

(1)

Japan Advanced Institute of Science and Technology

JAIST Repository

https://dspace.jaist.ac.jp/

Title

音の分離抽出における聴覚の計算理論に関する研究

Author(s)

鵜木, 祐史

Citation

Issue Date

1999‑03

Type

Thesis or Dissertation

Text version

author

URL

http://hdl.handle.net/10119/877

Rights

Description

Supervisor:赤木 正人, 情報科学研究科, 博士

(2)

concerned with sound segregation

Masashi Unoki

Scho ol of Information Science

Japan Institute of Science and Technology

14 January1999

Abstract

The aim of this paper is to construct a computational theory of audition. This work is to

explain the following questions: \what is a purp ose of auditory processing?" and \why must

auditory systemcompute it?",based onresearcheson psychology,physiology, and information

science. This computational theory corresponds to the auditory edition of the computational

theoryofvision proposedbyMarr. Ifthecomputational theoryofaudition canbeconstructed,

it can not only clarify human auditory functions but also contribute to some applications

such as a signal processing, robust speech recognition, and a modeling of psychoacoustical

phenomena. HoweverthecomputationaltheoryofauditioninanalogytoMarr'stheoryhasnot

b een constructedcompletely becausepsychoacousticaland physiologicalknowledgeofaudition

is not sucient toconstruct it inanalogy to Marr's theory.

This paper proposes a computational theory of audition concerned with sound segregation

based on the following approaches in analogy to Marr's theory: constraints on sound waves

and environment conditions are necessary in order to uniquely solve the problem (ill-p osed

inverse problem) of segregating the desired signal from mixed signals. This paper adopts

the following idea as a construction method of the computational theory: psychoacoustical

constraints that auditory system uses to solve the problem of auditory scene analysis, that is

the fourregularitiesproposedby Bregman,canb eusedtouniquelysolvethesignalsegregation

problem as mathematical constraints. This paper focuses on \segregation of two sounds" as

a fundamental auditory function. Therefore, the problem of segregating sounds is set to \the

problem ofsegregatingtwoacousticsources." It issupp osedthat thedesiredsignalis\an AM-

FMharmoniccomplextone"suchasvowelandinstrumentalsound. Moreover,acomputational

theory of audition is dened as a strategy of sound segregation, \how are the problem of

segregating twoacousticsources solveduniquely using the constraints ?"

In this paper, the problem of segregating two acoustic sources based on an amplitude and

phase spectra wasformulated. The four regularities proposed by Bregman were formulatedas

mathematicalconstraints: (i)common onsetandosetforthe comp onentofthe complextone,

(ii) continuitydenedby thepiecewise-polynomialapproximationandthe splineinterp olation,

(iii) harmonicity,and (iv)correlationbetween the amplitudeenvelopes. A methodof segregat-

ing AM{FMharmonic complextonefromthemixed signalusing theconstraintswasprop osed.

This metho d was examined whether it could be segregated the desired signal from the mixed

(3)

the sucient constraints. The derivedstrategy is touniquely solve the problem of segregating

two acoustic sources by regarding it as the piecewise linear problem and by constraining the

temp oral uctuations of the amplitude and the phase of the desired signal. Finally, the de-

rivedstrategy was examinedby applying ittheory into tworeal segregation problems: (1) the

problem of segregating the desiredreal speech(vowels) fromnoisyspeechand (2) the problem

of segregating pure tone from masked signal, that is co-modulation masking release (CMR).

These examinations showed the derivedstrategy of sound segregationcan be used tolead the

solution ofthe problems.

This strategy can contributetothe applications such asa preprocessor of the robust speech

recognition system and as a modeling of psychoacoustical phenomena. Moreover, it can also

contributetoanewconstructionmethodof the computationaltheory ofauditioninanalogy of

Marr's computational theory.

Key words: computational theory, auditoryscene analysis, the problem of segre-

gating two acoustic sources, Bregman's four regularities, mathematical constraint

c

参照

関連したドキュメント

Massoudi and Phuoc 44 proposed that for granular materials the slip velocity is proportional to the stress vector at the wall, that is, u s gT s n x , T s n y , where T s is the

– Classical solutions to a multidimensional free boundary problem arising in combustion theory, Commun.. – Mathematics contribute to the progress of combustion science, in

Definition An embeddable tiled surface is a tiled surface which is actually achieved as the graph of singular leaves of some embedded orientable surface with closed braid

The purpose of this paper is to apply a new method, based on the envelope theory of the family of planes, to derive necessary and sufficient conditions for the partial

We study the classical invariant theory of the B´ ezoutiant R(A, B) of a pair of binary forms A, B.. We also describe a ‘generic reduc- tion formula’ which recovers B from R(A, B)

The scattering structure is assumed to be buried in the fluid seabed bellow a water waveguide and is a circular elastic shell filled with a fluid that may have different properties

Giuseppe Rosolini, Universit` a di Genova: [email protected] Alex Simpson, University of Edinburgh: [email protected] James Stasheff, University of North

p≤x a 2 p log p/p k−1 which is proved in Section 4 using Shimura’s split of the Rankin–Selberg L -function into the ordinary Riemann zeta-function and the sym- metric square