News Release

14th December 2018
Techno-Speech, Inc.

Reproducing high-quality singing voice
with state-of-the-art AI technology

Techno-Speech, Inc. and the Nagoya Institute of Technology Speech and Language Processing Laboratory have developed a singing voice synthesis technology that reproduces a person's voice quality, unique characteristics, and singing style more precisely than ever.

Techno-Speech, Inc. and Nagoya Institute of Technology are collaborating on the research and development of speech and singing-voice synthesis technology. The technologies they have developed so far have already been applied in the commercial karaoke system “JOYSOUND,” the voice creation software “CeVIO Creative Studio,” and elsewhere. In this research, AI technology such as deep learning is applied to a database of about two hours of singing recorded by a specific singer in order to learn that singer’s voice quality, unique characteristics, and singing style. At synthesis time, a high-quality singing voice can be produced simply by entering any musical score with lyrics.

Languages: Japanese, English, Chinese
Samples: New technology (mix and a cappella), current technology (a cappella)

Input: Musical score with lyrics that has not been manually adjusted

* Singing voice database providers

Japanese: CeVIO Project “Sato Sasara” http://www.cevio.jp/
English: 1st PLACE co., Ltd. “IA” (Voice source: Lia) http://1stplace.co.jp/ia/world/

[Japanese] Diamonds

New (mix)
New (a cappella)
Current (a cappella)

[Japanese] 瞳 (Hitomi)

New (mix)
New (a cappella)
Current (a cappella)

[English] Rolling In The Deep

New (mix)
New (a cappella)
Current (a cappella)

[English] Everytime

New (mix)
New (a cappella)
Current (a cappella)

[Chinese] 爱情转移 (Ai Qing Zhuan Yi)

New (mix)
New (a cappella)
Current (a cappella)

This research is based on a joint research project between Techno-Speech, Inc. and Nagoya Institute of Technology. The research results will be presented at the spring meeting of the Acoustical Society of Japan, to be held in March 2019. Possible applications of this technology include the following:

Reproduction of an artist’s singing voice (including that of a deceased person)
Use in music production and game development
Video streaming and live events conducted by virtual YouTubers
Post-recording system for virtual actors
Vocalization module for AI or speech dialogue systems
Generation of flexible reference speech for foreign language and singing education
Speech devices for ALS or laryngeal cancer patients
Digital signage for nursing facilities

[Contact Details]
Techno-Speech, Inc. (President: Keiichiro Oura)

URL: https://www.techno-speech.com/

Business: Research and development of software related to multimedia
Address: Nagoya Life Science Incubator, 2-22-8, Chikusa, Chikusa-ku, Nagoya, 464-0858, Japan
E-mail: info@techno-speech.com

Nagoya Institute of Technology Speech and Language Processing Laboratory (Director: Keiichi Tokuda)

URL: http://www.sp.nitech.ac.jp/index.php?HOME

Address: Gokiso-cho, Showa-ku, Nagoya, 466-8555, Japan

E-mail: tokuda@nitech.ac.jp
