Profile Picture

Avatar voice engines

Posted By jawalsh 6 Years Ago
You don't have permission to rate!
Author
Message
jawalsh
jawalsh
Posted 6 Years Ago
View Quick Profile
Senior Member

Senior Member (308 reputation)Senior Member (308 reputation)Senior Member (308 reputation)Senior Member (308 reputation)Senior Member (308 reputation)Senior Member (308 reputation)Senior Member (308 reputation)Senior Member (308 reputation)Senior Member (308 reputation)

Group: Forum Members
Last Active: 5 Years Ago
Posts: 2, Visits: 12
Simple question here. Where can I get the best, most authentic sounding voices for the avatars? I've purchased a couple of NextUp voice engines for a variety of uses but they still have an artificial sound to them. Any suggestions?
planetstardragon
planetstardragon
Posted 6 Years Ago
View Quick Profile
Distinguished Member

Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)

Group: Forum Members
Last Active: Last Month
Posts: 10.5K, Visits: 34.1K
Hi jawalsh,

Welcome to the forum! BigGrin

they all sound choppy to be honest - the best one's I've heard are from at&t -
http://www2.research.att.com/~ttsweb/tts/demo.php

when you use these voices in artificial intelligence software like http://zabaware.com the software itself adds a more life like cadence to the voice.

- the most diverse one's I heard are from http://www.voiceforge.com/demo?uservoice=Belle

and a few more like Cepstral, ivonna and neo-speech...

all 3rd party TTS developers ( less the microsoft royalty free that already come with windows ) want to charge you an arm and 3 legs for distribution because they are not royalty free - not even for non commercial youtube use.

if you don't mind getting a little technical you could always try your hand at Utauloids - which is a do it yourself synthetic voice software - never tried it myself but looks interesting - http://utau.wikia.com/wiki/UTAUloids

then there are voice changing tools - http://www.screamingbee.com/product/MorphVOX.aspx

Cheers!


animagic
animagic
Posted 6 Years Ago
View Quick Profile
Distinguished Member

Distinguished Member (19.3K reputation)Distinguished Member (19.3K reputation)Distinguished Member (19.3K reputation)Distinguished Member (19.3K reputation)Distinguished Member (19.3K reputation)Distinguished Member (19.3K reputation)Distinguished Member (19.3K reputation)Distinguished Member (19.3K reputation)Distinguished Member (19.3K reputation)

Group: Forum Members
Last Active: 36 minutes ago
Posts: 11.3K, Visits: 19.6K
I like Acapela: http://www.acapela-group.com/.

I use their online version, called acapela-box: http://www.acapela-box.com/, which allows me to use just the voices I need for as much as I need them (it is "pay-per-word"). The big advantage also is that the recorded voice files do not have the usage restrictions that come with most TTS packages (as discussed on this forum).


https://forum.reallusion.com/uploads/images/1a09220f-ab50-42ac-ad1a-33ec.pnghttps://forum.reallusion.com/Uploads/Images/d14339d0-cd32-4b35-88f9-40a0.png


planetstardragon
planetstardragon
Posted 6 Years Ago
View Quick Profile
Distinguished Member

Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)

Group: Forum Members
Last Active: Last Month
Posts: 10.5K, Visits: 34.1K
there are ways to make them sound more natural, but it might be more trouble than what it's worth. - it's the spacing and pitch that make it sound un natural.

gather a collection of common words from the service ...truncate them efficiently ...then add them to a sampler and play them from a keyboard, this would give you access to pitchbend, timing, timestretching etc. this way you buy a word once and build a library.

the musical equivalent of keyframing


RobertoColombo
RobertoColombo
Posted 6 Years Ago
View Quick Profile
Distinguished Member

Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)

Group: Forum Members
Last Active: 6 days ago
Posts: 1.6K, Visits: 3.0K
I still dream to have some TTS SW that:

1. is able to assign different intonation, i.e. the possibility to assign different "moods" to each word or sentence (e.g happy, angry, surprise, sad, etc.) ...
2. is able to generate different accents (i.e italian ppl speaking english or vice verse, indian ppl speaking english, etc.)
3. has no legal issues based on where the final voice is being used
4. has a reasonable price

Why no SW house made this ? Is all of this more difficult (in terms of SW programming and algos) than building the whole iClone SW & tools ?
Maybe RL will think about this as the next "killing plug-in"... Wink



My PC:
OS: Windows 10 Pro English 64-bit / CPU: Intel i7-9700 3.6GHz / MB: ASUS ROG Strix Z390  RAM: 32GB DDR4 2.6GHz / HD: 2TB+3TB  /  
SSD: 2x512GB Samsung 860 EVO / 
VB: Palit GTX2080 TI GamingPro 11GB / AB: embedded in the MB and VB (audio from the VB with an HDMI splitter) / DirectX: 12

Edited
6 Years Ago by RobertoColombo
wendyluvscatz
wendyluvscatz
Posted 6 Years Ago
View Quick Profile
Senior Forum Member

Senior Forum Member (8.0K reputation)Senior Forum Member (8.0K reputation)Senior Forum Member (8.0K reputation)Senior Forum Member (8.0K reputation)Senior Forum Member (8.0K reputation)Senior Forum Member (8.0K reputation)Senior Forum Member (8.0K reputation)Senior Forum Member (8.0K reputation)Senior Forum Member (8.0K reputation)

Group: Forum Members
Last Active: Yesterday
Posts: 2.4K, Visits: 14.7K
I use the MS SAPI5 ones as being redistributable usage of the voices would hold no legal ramifications as since you CAN redistribute the whole files etc for creating them in the first place.
You CAN edit the wav using other software too like Audacity using the Rovee plugin for variation of voices or use screaming bee on the wav file too.
None sound as good as using real people though.
I use Balabolka to export the wav files as iClone only sees MS Anna unfortunately.




RobertoColombo
RobertoColombo
Posted 6 Years Ago
View Quick Profile
Distinguished Member

Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)

Group: Forum Members
Last Active: 6 days ago
Posts: 1.6K, Visits: 3.0K
"None sound as good as using real people though."

That's the point nr. 5, which I forgot in my list:

5. the voice shall sound real


My PC:
OS: Windows 10 Pro English 64-bit / CPU: Intel i7-9700 3.6GHz / MB: ASUS ROG Strix Z390  RAM: 32GB DDR4 2.6GHz / HD: 2TB+3TB  /  
SSD: 2x512GB Samsung 860 EVO / 
VB: Palit GTX2080 TI GamingPro 11GB / AB: embedded in the MB and VB (audio from the VB with an HDMI splitter) / DirectX: 12

Edited
6 Years Ago by RobertoColombo
planetstardragon
planetstardragon
Posted 6 Years Ago
View Quick Profile
Distinguished Member

Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)Distinguished Member (17.7K reputation)

Group: Forum Members
Last Active: Last Month
Posts: 10.5K, Visits: 34.1K
@swoop, yes, thats exactly how I feel about some 3D functions. Wink

I could spend every waking moment insisting you learn about audio production in extreme detail and calling you names when you don't want to though, you just got to take it reaally slow, I'll learnz ya!!. :p


Edited
6 Years Ago by planetstardragon
animagic
animagic
Posted 6 Years Ago
View Quick Profile
Distinguished Member

Distinguished Member (19.3K reputation)Distinguished Member (19.3K reputation)Distinguished Member (19.3K reputation)Distinguished Member (19.3K reputation)Distinguished Member (19.3K reputation)Distinguished Member (19.3K reputation)Distinguished Member (19.3K reputation)Distinguished Member (19.3K reputation)Distinguished Member (19.3K reputation)

Group: Forum Members
Last Active: 36 minutes ago
Posts: 11.3K, Visits: 19.6K
sw00000p (8/26/2013)
For me, that's way more trouble than it's worth!

It's a challenge, just like animation... It would be easier to just have a bunch of real people talking, but not as much fun...BigGrin

Apart from Pinhead, who sounds like Pinhead (Australian Lee), the voices in my Breezaway commercial were done with Acapela Box.


https://forum.reallusion.com/uploads/images/1a09220f-ab50-42ac-ad1a-33ec.pnghttps://forum.reallusion.com/Uploads/Images/d14339d0-cd32-4b35-88f9-40a0.png


RobertoColombo
RobertoColombo
Posted 6 Years Ago
View Quick Profile
Distinguished Member

Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)Distinguished Member (7.2K reputation)

Group: Forum Members
Last Active: 6 days ago
Posts: 1.6K, Visits: 3.0K
Hi Animagic,

nice... nice!! I like the tone of the voices on Acapela!
It is also easy to produce non-native English speakers by changing the text with the right phoneme (e.g. instead of Hi or I => type AI, otherwise an Italian/Spanish voice will pronounce it like you pronounce "E" in English) .

But.... why they do not show the prices and the characteristics of the SW ?
Also they mention about the NDA to be signed in order to get these "super-secret" information...
That's the most ridiculous thing I have seen about a SW house...

Honestly, they can only scare people away instead of understanding that they have a very good technology and just openly show what their SW can do and at what price.

My PC:
OS: Windows 10 Pro English 64-bit / CPU: Intel i7-9700 3.6GHz / MB: ASUS ROG Strix Z390  RAM: 32GB DDR4 2.6GHz / HD: 2TB+3TB  /  
SSD: 2x512GB Samsung 860 EVO / 
VB: Palit GTX2080 TI GamingPro 11GB / AB: embedded in the MB and VB (audio from the VB with an HDMI splitter) / DirectX: 12




Reading This Topic