I like the Acculips feature, but I'm finding its speech-to-text conversion is fairly bad.
I have a song that I want my character to sing, with lip movements that match the vocals of the song.
First I used software to separate the vocals from the music. The separation was actually really good and clear.
I loaded that vocals-only track into Acculips and it failed miserably at converting the speech to text. The result was just a bunch of garbled words that didn't even resemble the lyrics.
It would be impractical to edit every word with the correct timing.
So I recorded myself speaking the words clearly in time with the song (not singing, just speaking) while the song played in my headphones.
Once again I loaded the vocals-only track, this time my spoken recording, into Acculips. The conversion to text was ever so slightly better than the last attempt, but still terrible, full of words that weren't even real words.
Just to make sure it wasn't my voice or microphone that was unclear, I uploaded both vocal files to an online speech-to-text converter, and the conversion was almost perfect.
That suggests the Acculips speech-to-text engine is the problem.
Also I'm not sure if there is a limit to how much speech Acculips will convert, but it only seemed to convert maybe half of the vocal file.
Also, the iClone instruction manual for Acculips shows an icon that lets you jump to the word currently being played. That icon no longer exists, and it would be a handy feature to have back.
What I'd like to see is an overhaul of Acculips: update the speech-to-text engine to something more current, and ensure all features are working (as per the iClone user manual).