• 検索結果がありません。

JAIST Repository https://dspace.jaist.ac.jp/

N/A
N/A
Protected

Academic year: 2021

シェア "JAIST Repository https://dspace.jaist.ac.jp/"

Copied!
3
0
0

読み込み中.... (全文を見る)

全文

(1)

Japan Advanced Institute of Science and Technology

JAIST Repository

https://dspace.jaist.ac.jp/

Title 声区表現を可能とする歌声合成を目的としたARX‑LFモ

デルの制御法に関する研究

Author(s) 元田, 紘樹

Citation

Issue Date 2013‑03

Type Thesis or Dissertation Text version author

URL http://hdl.handle.net/10119/11326 Rights

Description Supervisor:赤木正人, 情報科学研究科, 修士

(2)

A method for controlling ARX-LF model for synthesizing singing voices characterizing vocal registers

Hiroki Motoda (1110061) School of Information Science,

Japan Advanced Institute of Science and Technology February 6, 2013

Keywords: Vocal registers, Singing voices synthesis, ARX-LF model, Voice production mechanisms.

Singing voices synthesis is one of the important topics in speech science. Construc- tion of singing voices synthesis system having naturalness and variety contributes to mu- sic information processing and elucidation of the relationships between voice production mechanisms, acoustic features, and perceptions. However, singing voices synthesis cannot represent ‘Vocal registers’ appropriately.

Vocal register is a particular series of tones in the human voice that are produced by one particular vibratory pattern of the vocal folds and therefore possess a common quality.

Human can sing songs naturally in wide range of frequency by training how to use vocal fold vibrations to represent vocal registers. Many reserchers investigated relationships be- tween vocal registers and characteristics of glottal sources. However, even state-of-the-art singing voices synthesis systems cannot produce vocal registers appropriately. Natural- ness of the synthesized singing voices using these systems is degraded in low and high tone ranges. One of the methods for improving naturalness is adding characteristics of glottal sources for each vocal register. This paper considers a model that can represent glot- tal sources with characteristics of vocal registers for an advanced singing voice synthesis system.

This paper proposes a method for controlling characteristics of glottal sources to syn- thesize singing voices having characteristics of vocal registers. This paper constructs a singing voice synthesis system with the ARX-LF model that can describe glottal sources for each vocal register by simulating human voice production mechanisms.A model for controlling ARX-LF parameters corresponding to characteristics of glottal sources was constructed in accordance with tone ranges. A singing voices synthesis system for syn- thesizing singing voices characterizing vocal registers was proposed, and the model for

Copyright c2013 by Hiroki Motoda

1

(3)

controlling ARX-LF parameters is constructed. Singing voices were systhesized by the proposed system, and the control model was evaluated by performing both of objective and subjective evaluation. This paper reports these results.

A singing voices sysnthesis system based on ARX-LF model was constructed to add characteristics of glottal sources for each vocal register, that has analysis - control - synthesis procedures. Singing voice data of each vocal register were analyzed by the ARX-LF model, and ARX-LF parameter values corresponding to glottal source of each vocal register were obtained. The control model was constructed using the results of the analysis. ARX-LF parameters for each vocal register were interpolated in a linear manner.

Singing voices were systhesized by the proposed system, and objective evaluation was performed. Spectral tilt in low frequency ranges was analyzed. As the results, almost the same tendency of spectral tilt were obtained from the synthesized singing voices as those from actual singing voices in each vocal register. Quality of the synthesized voices was evaluated for subjective evaluation. As the results, almost the same impressions were obtained from the synthesized singing voices as those from actual singing voices in each vocal register. Additionally, it was suggested that controlling characteristics of glottal sources of each vocal register is important in high and low tone ranges. Results revealed effectiveness of the control model for synthesizing singing voices to characterize vocal registers.

Methods and findings on this study are though of as leading to the realization of singing voices synthesis having naturalness and variety, and the elucidation of the relationships between voice production mechanisms, acoustic features, and perceptions.

2

参照

関連したドキュメント

Causation and effectuation processes: A validation study , Journal of Business Venturing, 26, pp.375-390. [4] McKelvie, Alexander & Chandler, Gaylen & Detienne, Dawn

Previous studies have reported phase separation of phospholipid membranes containing charged lipids by the addition of metal ions and phase separation induced by osmotic application

It is separated into several subsections, including introduction, research and development, open innovation, international R&D management, cross-cultural collaboration,

UBICOMM2008 BEST PAPER AWARD 丹   康 雄 情報科学研究科 教 授 平成20年11月. マルチメディア・仮想環境基礎研究会MVE賞

To investigate the synthesizability, we have performed electronic structure simulations based on density functional theory (DFT) and phonon simulations combined with DFT for the

During the implementation stage, we explored appropriate creative pedagogy in foreign language classrooms We conducted practical lectures using the creative teaching method

講演 1 「多様性の尊重とわたしたちにできること:LGBTQ+と無意識の 偏見」 (北陸先端科学技術大学院大学グローバルコミュニケーションセンター 講師 元山

Come with considering two features of collaboration, unstructured collaboration (information collaboration) and structured collaboration (process collaboration); we