Title: Abstract of Doctoral Thesis

(1)

[様式－学 5]

Abstract of Doctoral Thesis

Title: A Study of a Highly-Realistic Singing-Experience System Based on Real-Time Vocoder and Sound Field Reproduction

Doctoral Program In Information Science and Engineering Graduate School of Information Science and Engineering Ritsumeikan University

ふりがななかのこうた氏名 NAKANO Kota

Karaoke is one of the most popular forms of entertainment. Karaoke is singing, and it provides everyone opportunities for enjoyment. Karaoke is often used as a tool for communication, however, singing is also a skill. Some people, such as tone-deaf people, tend to avoid singing Karaoke with their companions. This tendency prevents good communication for them. In this thesis, I propose a system to solve the problem. The proposed system provides virtual singing-experience to users. The system modifies the singing-style of the users' singing-voice to professional-like one in real-time. The system also reproduces sound fields such as performance halls. Accordingly, the system provides users with good singing-experiences. The system supports singing for tone-deaf people, and also provides more entertainment than regular Karaoke to other users.

Firstly, I propose a singing morphing system by using vocoder framework based on the source-filter model. The system transcripts the singing-style of professional singers to amateur users’ singing-voice in real-time. To achieve the system, I propose an approach for a high quality vocoder, STRAIGHT to rapidly process the singing-voice, which depends on inverse-filtering method with STRAIGHT spectrum. According to the evaluations, I confirm that the proposed system can transcript singing-style of professional singers to users' singing-voice in real time.

Secondly, I propose a sound-reproduction system by using a semi-transaural loudspeaker-system and improved sound-field simulator. The semi-transaural

loudspeaker-system achieves high robustness for crosstalk of loudspeaker-system. The sound-field simulator depends on the finite-difference time-domain method. I propose an improvement for the simulator by employing spectral method and coordinate conversion for accurate computation. According to the evaluations, I confirm that the proposed system can robustly present fine sound-images to users.

Finally, I propose a singing-experience system with high-realistic sensation. The system depends on the integrated systems which is proposed in the previous sections in this thesis.

According to the evaluation, the sound-field simulator complemented the quality of singing morphing system. It was indicated that the integrated-systems could effectively provide good singing-experiences to users.