Here's a thread where I try to catalogue many different vocal synthesis engines for the convenience of curious aspiring producers. Note that this post is current as of January 2, 2024, unless I edit it with new information. Much of this information comes from VocaDB and various wiki pages, though some of it I already knew myself.
One-time purchase:
VOCALOID by YAMAHA (please note that I'm counting voice packages via the wiki pages so the numbers may be a little off in some ways, and some voices have been discontinued but I'm counting them as historically available):
VOCALOID (aka VOCALOID1), 2004, 5 available voice packages (its 20th anniversary is this year!)
VOCALOID2, 2007, 22 available voice packages (incl. Kagamine Rin and Len Acts 1 and 2 as separate)
VOCALOID3, 2011, 42 available voice packages
VOCALOID4, 2014, 44 available voice packages
VOCALOID5, 2018, 10 available voice packages (counting the 4 default vocals as one package as they come together with the editor)
VOCALOID6, 2022, current engine, 6 voice packages available as of now (counting the 8 current default vocals as one package as they come with the editor)
Synthesizer V by Dreamtonics:
Synthesizer V (2018): "R1" edition, no longer sold. It still has the renewable year-long "evaluation" that was previously available, which is much longer than Vocaloid's two-week to one-month trial periods. 8 available voice packages.
Synthesizer V Studio (2020): "R2" edition. Has a free Basic edition with stripped-down features; "Lite" version voices are stripped down and not permitted for commercial use. One free full voicebank, Yamine Renri, is available via a Google form (she might not be allowed for commercial use but is fully functioning), and Mai is available free with the purchase of the Pro editor. Currently has 60 voices available, counting Tsurumaki Maki's Japanese and English voicebanks as separate.
Emvoice One, 2019, 5 available voices that are sold individually (free tier with a 7-semitone vocal range available).
Chipspeech by Plogue: Retro-style vocal synthesizer emulating old sound chips, released 2015, 12 released voicebanks (Daisy is discontinued)
CeVIO by Techno-Speech (both speech and singing synthesis):
CeVIO Creative Studio (2013): Uses a hidden Markov model (HMM); human-like tone for the time, but lots of engine noise.
CeVIO AI (2021): Uses deep neural network technology and is significantly more realistic than the Creative Studio version.
Cantor by VirSyn: Vocal synthesis engine released in 2004, shortly after the release of Vocaloid. Unlike Vocaloid, which offers a few voicebanks sampled from human voices, Cantor offers many voices that are artificially synthesized. Cantor 2 was released in 2007, the same year as VOCALOID2. A demo version is available, but it requires the purchase of a software protection dongle. Cantor is still available to purchase but is no longer being updated.
LaLaVoice by TOSHIBA: A speech and singing synthesis engine released in 2001. Its singing synthesis side, LaLaSong, uses a sheet music layout rather than a piano roll. It is known for being used in voicing Vibri in the PS1 video game Vib-Ribbon.
Virtual Singer by Myriad: An early singing synthesis engine released in 2000 by Myriad, the makers of Harmony Assistant; it is itself an add-on for Melody Assistant. I believe it was one of the first to sample the human voice (as opposed to generating a "voice" from scratch), an approach Vocaloid would later become famous for.
Realivox by Realitone: A Kontakt-based vocal synthesizer released in 2012, coming in two forms: The Ladies (a group of singers who can sing in vocables rather than full English) and Blue (a single singer who can sing preset phrases as well as entered English lyrics).
Piapro Studio (VOCALOID version) by Crypton Future Media: A vocal synthesizer built on the Vocaloid API, released in 2013 with Crypton's "KAITO V3" package for use with its V3 (and later V4X) vocals; support for other Vocaloids was added a little later. Piapro Studio for V4X was released in 2015 with Megurine Luka V4X. Most vocals come with a VSTi version that is compatible with all Vocaloids up to VOCALOID4 (or VOCALOID3 if only V3 Crypton vocals are owned); Hatsune Miku V4 Chinese includes a standalone variant, but it can only be used with that vocal.
Piapro Studio NT (New Type) by Crypton Future Media: A vocal synthesizer engine released in 2020 out of Crypton's dissatisfaction with the Vocaloid engine's sound. So far only Hatsune Miku NT is available, but 5 other vocals are being tested in the SEGA rhythm game Project SEKAI: Colorful Stage ft. Hatsune Miku (known worldwide as Hatsune Miku: Colorful Stage). As of now it is only available in Japanese. It uses a resampling synthesis method involving very short samples, and is more robotic and "crunchy" sounding than Miku's original Vocaloid versions. For those who dislike the sound of NT, the older versions of Hatsune Miku are still available alongside the new version.
Subscription:
ACE Studio by Beijing Timedomain Technology, 2022 (started out free, became paid via subscription in 2023), 40 available voices and counting
Pocket Singer by ACCIDENTAL AI, mobile, 2023 (also started out free but became subscription-based, from the developers of ACE), practically unlimited "original" voices made by mixing "voice seeds"
Freemium/partially free:
VoiSona (fmr. CeVIO Pro) by Techno-Speech: The editor and one voice (Chis-A Japanese) are free; other voices are either bought through a one-time purchase or used through a paid subscription. Singing and speech versions available. 11 "song" voices available, 6 being VoiSona exclusives and the other 5 being CeVIO AI ports.
Maghni AI by VocaTone and Misbah Studios: Upcoming American vocal synth engine promising an advanced AI voice synthesis model and many new features, including support for around 40 languages. 20 voices teased as of now, 2 being free test vocals while the other 18 seem to be paid. Similar to VoiSona, the editor itself and the aforementioned test vocals will be free. It was partly funded via a crowdfunding campaign; the goal has been reached, but people are still welcome to contribute.
Splash Pro: An AI music generator that includes a couple of singing voices. The plug-in version was discontinued, but it is still available as a website.