Lips Editor... is this supposed to be phonics based?


https://forum.reallusion.com/Topic298814.aspx

By dickymac - 8 Years Ago
When I let CT auto lip sync the audio file, it was way out and looked really unnatural, so I deleted all the edits... not simple... and then started from scratch.
So, once I'd discovered how to get the Lip Editor up and started working my way through the song, I noticed that there was something missing, and that's when it got me thinking about how the CT creators had worked this editor out. I thought it was phonics based, but as an ex-Primary Teacher, it isn't anything I was familiar with. Some of the options were obvious, but others not so, and when you started to put these together, you got some really strange expressions!

Here is an idea... that may help... according to the Oxford Dictionary... the 100 words below make up about 50% of words used daily, but more importantly, they form a good phonic base... Could one of the team create a bank of words and have them as an option to select? Each word will obviously take several Lip Editions to form the word, but I think this would not only help users of the program, but help them to develop an understanding of how the letters and words are formed. It's an idea... I think most examples I've seen seem to use one option from the Lip Editor for each word, or phrase.
Rank Word
1 the
2 be
3 to
4 of
5 and
6 a
7 in
8 that
9 have
10 I
11 it
12 for
13 not
14 on
15 with
16 he
17 as
18 you
19 do
20 at
21 this
22 but
23 his
24 by
25 from
26 they
27 we
28 say
29 her
30 she
31 or
32 an
33 will
34 my
35 one
36 all
37 would
38 there
39 their
40 what
41 so
42 up
43 out
44 if
45 about
46 who
47 get
48 which
49 go
50 me
51 when
52 make
53 can
54 like
55 time
56 no
57 just
58 him
59 know
60 take
61 people
62 into
63 year
64 your
65 good
66 some
67 could
68 them
69 see
70 other
71 than
72 then
73 now
74 look
75 only
76 come
77 its
78 over
79 think
80 also
81 back
82 after
83 use
84 two
85 how
86 our
87 work
88 first
89 well
90 way
91 even
92 new
93 want
94 because
95 any
96 these
97 give
98 day
99 most
100 us
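
To picture what such a word bank might look like, here is a minimal sketch in Python. It is not anything CT provides: the `WORD_BANK` table and the `lookup` helper are hypothetical names, only the first five words are filled in, and the sound symbols are ARPAbet-style spellings chosen for illustration.

```python
# Hypothetical word bank: each common word maps to the sequence of
# sounds (ARPAbet-style symbols) needed to say it. Only five entries
# are filled in here as an illustration.
WORD_BANK = {
    "the": ["DH", "AH"],
    "be": ["B", "IY"],
    "to": ["T", "UW"],
    "of": ["AH", "V"],
    "and": ["AE", "N", "D"],
}


def lookup(sentence):
    """Return (word, sounds) pairs for a sentence.

    Words missing from the bank come back with None so they can be
    worked out by hand.
    """
    result = []
    for raw in sentence.lower().split():
        word = raw.strip(".,!?")  # drop simple punctuation
        result.append((word, WORD_BANK.get(word)))
    return result


for word, sounds in lookup("To be, and the dog"):
    print(word, sounds)
```

Each word still needs several lip shapes, as noted above; the bank would just save looking them up every time.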


I would like to hear what others think of this idea and any other ideas to help?

It seems one of those things you see all the time... lip syncing... just not quite there!?
By Peter (RL) - 8 Years Ago
Thank you for the feedback. The quality of the lip-sync will be based on the quality of the source audio. Any audio that has background noise, music or other sounds will cause the lip-sync to be affected so it is always wise to check the quality of the source audio. This should help greatly.

For your suggestions, please be sure to submit any requests via the Feedback Tracker. This way the development teams can review and keep you updated on any suggestions that have been put forward. You can find the Feedback Tracker below.

http://www.reallusion.com/FeedBackTracker/
By Kelleytoons - 8 Years Ago
The other thing most lip sync programs offer is the ability to type the text that is being said so that it can also be matched up -- this can improve the accuracy of sync 1000%.

I'm not much for putting in suggestions, though -- my experience has been they seldom make any difference unless you're a beta (and I'm a beta on a LOT of software and can see the results of my suggestions a lot better there).

For the OP -- you might want to hold out for the promised land (that is to say, SOMEDAY there may actually be facial mocap, maybe even in our lifetime.  This should improve things a LOT).
By dickymac - 8 Years Ago
Hi SW00000p... sorry I don't know your name!

Thanks for taking the time to reply and offering such good advice.

I did try to do as you suggest. Although the concept is certainly sound, doing this with CT 8 Pro isn't that easy. Could you produce a quick tutorial? You being a tutor, I'm sure others would benefit from this and I'm certain I would! Just something simple like, Hello! My name is Finn, or using a simple audio file to demonstrate the best way to use it and produce an acceptable finished piece?

The track I used was a single track. I've added a clip... 1MB zip... if you want to check it out and any advice about how to improve quality would be gratefully received!?

Can I ask you a few questions regarding your reply, please:

1. You mention a PROPER audio file... can you please clarify what would be best? I did try aif and mp3?

2. What would you say is the best way to use Reallusion's Speech Engine, as there seem to be different ways possible?

3. What is the best method to refine the result, if needed? I did try expanding the Timeline, but couldn't zoom in enough to see each segment.

4. This seems a bit laborious for each word... I did try, but found it impossible to move by keyframes, or know what keyframe I was on!?
Second Pass:
3. At frame one.... Hit EVERY vowel.
Set a Key (For Each Vowel)......"3 KeyFrames BEFORE THE ACTUAL SOUND"  (with the proper strength)
Note:
Now you focus on each vowel.

Third Pass:
3. Before and After... each vowel.... Blend the proper Consonant.
_________________________________________________________
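
As far as I can follow it, the vowel pass quoted above works out to something like this. It is a rough sketch in Python, not something done inside CT: it assumes the sounds have already been timed as (frame, symbol) pairs, and the vowel labels and the `vowel_keys` name are made up for illustration.

```python
# Assumed vowel labels for illustration; not CT's official list.
VOWELS = {"AH", "EE", "IH", "OH", "OO"}
LEAD_FRAMES = 3  # "3 KeyFrames BEFORE THE ACTUAL SOUND"


def vowel_keys(timed_sounds, lead=LEAD_FRAMES, first_frame=1):
    """Return a key for every vowel, placed `lead` frames early.

    Consonants are skipped here; they would be blended in before and
    after each vowel in a later pass. Keys never go before first_frame.
    """
    keys = []
    for frame, symbol in timed_sounds:
        if symbol in VOWELS:
            keys.append((max(first_frame, frame - lead), symbol))
    return keys


# Made-up timing for a short word: vowel, consonant, vowel.
print(vowel_keys([(2, "AH"), (10, "M"), (14, "EE")]))
```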

could you please demonstrate? 

Hopefully, with your help, I will be able to use CT to produce good speech!

Many thanks for your help!





By dickymac - 8 Years Ago
Hi again!

I'm not 100% sure what you mean, or how I do this within CT, so could you give a tutorial... please?
By dickymac - 8 Years Ago
Hi Kelleytoons! 

Thanks for taking the time to reply! It seems that you have experience with lip sync programs!? Typing in the words would certainly help, so can I do this with CT? 

Is CT the best way to go with my attempt at getting my dog talking, or is there some other way I should go? Other than an operation! :)


By dickymac - 8 Years Ago
Hi Again!

The graph you've shown, RL's Phoneme Chart, is awesome and ideal for what I'm looking for... so how do I access this in CT please?


The Lip Sync Editor I've been using in CT is obviously not the right place!

You agreed 100% with Kelleytoons about typing in text... so how do I do this please?
By Kelleytoons - 8 Years Ago
dickymac (10/26/2016)
Hi Kelleytoons! 

Thanks for taking the time to reply! It seems that you have experience with lip sync programs!? Typing in the words would certainly help, so can I do this with CT? 

Is CT the best way to go with my attempt at getting my dog talking, or is there some other way I should go? Other than an operation! :)



I do a lot of animation with many other programs but for lip sync work I mostly use Anime Studio (strictly 2D animation).  However, let me see if I can help you with CrazyTalk.

What you have to understand is that CT is the very low end of the spectrum in terms of "professional" versus "amateur" software.  It's really strictly designed so that folks like yourself, who might want to have their dog talk, can do so without much (or any) understanding of the basics of animation.  It doesn't have higher end tools because there aren't higher end users using it.  It's also why Swoop's advice is difficult to follow because he's advising you to use techniques that are way above your pay grade, no offense meant.

I don't think there are any easier programs to do what you want, although Anime Studio (the lower end that isn't very expensive) might be worth looking at, as recently they added some very good image manipulation that would do it.  But it isn't nearly the canned solution that CT is.  The problem, as you've found, is that a canned solution that gives you a "one-size-fits-all" isn't a very good fit for anyone, really.

Okay, so here's (hopefully) something that will help.  What you really need is the ability to translate your text into those symbols that CT *does* support.  I *think* you may well be able to use the freeware program Papagayo to do this (Google "Papagayo lip sync" to find it -- Mac or PC versions are available).  Papagayo DOES allow you to type in your text and will sync this to an audio file.  In my years as a professional animator I've found nothing better.  The only problem (and it's real and I don't want to minimize it) is that the file it produces can't be easily imported into anything other than Anime Studio (where it does an amazing job -- as I said, I've found nothing better and I've paid thousands for programs to do this process alone.  It's how we were able to do 22 minutes of mostly facial animation for our series each week).

The file it produces IS an ASCII file, though, so that will help.  You will get a file that has the phonemes broken down, as well as the frames on which they exist.  Reading this will help you work through what you need to do in CT.  Because Papagayo uses a standard ASCII dictionary for its phoneme breakdown, you can even edit THIS to show the symbols you want, so you could (in theory at least) edit it to reflect what is available in CT.
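
To make that concrete, here is a minimal Python sketch of reading such a file and translating it. It assumes the export is plain text with a one-line header followed by `frame phoneme` pairs, and that Papagayo's default mouth-shape names are in use; check an actual exported file before relying on either. The sample data is made up, and the right-hand names in the `TO_CT` conversion table are placeholders, not CT's real labels.

```python
# Made-up sample in the assumed layout: a header line, then one
# "frame phoneme" pair per line.
SAMPLE = """MohoSwitch1
1 rest
4 MBP
6 AI
9 L
12 O
15 rest
"""

# Hypothetical conversion table: Papagayo-style name -> placeholder
# name for whatever lip shape the target editor offers.
TO_CT = {
    "rest": "None",
    "MBP": "B_M_P",
    "AI": "Ah",
    "L": "T_L_D",
    "O": "Oh",
}


def parse_switch_text(text):
    """Return a list of (frame, phoneme) pairs, skipping the header."""
    keys = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) == 2 and parts[0].isdigit():
            keys.append((int(parts[0]), parts[1]))
    return keys


def convert(keys, table):
    """Translate each phoneme through the table; unknown ones become '?'."""
    return [(frame, table.get(symbol, "?")) for frame, symbol in keys]


for frame, shape in convert(parse_switch_text(SAMPLE), TO_CT):
    print(f"frame {frame:>3}: {shape}")
```

The printed list is only a cheat sheet; each shape would still have to be set by hand on the matching frame.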

But the main reason I'm mentioning this is it will at least give you an inside look at the breakdown of each word without having to parse it yourself.  Then you can try (manually, in CT) to align things.  It won't be easy because, again, CT doesn't give you the right tools.  But it might make it a bit easier.

(Or you can, as I said, wait for facial mocap -- this would be the ultimate solution to EVERYONE'S issues, and RL has said it will come with iClone 7, unless I was dreaming).

By dickymac - 8 Years Ago
Hi KelleyToons and thanks for the prompt reply!

No offence taken... thick skinned and I'm certainly no animator! 

For the first YouTube video I tried, I used an app on my iPhone called Talking Pet, and if you check out the video, https://www.youtube.com/watch?v=vVO6rAe_O2M you'll see that although nowhere near perfect, compared to the results in CT... it is much better! Or that's what I think. I just wanted to add a bit more realism and thought CT would help... not sure I'm right. I was hoping that I could get a good auto generated baseline to work from and it would be a fairly simple matter of adjusting a few things!? I must say that working with the program... it doesn't follow many everyday conventions... but I suppose it is Windows based and I'm a solid user of Mac.

I've looked at your suggestion about Papagayo and it looks good at what it does... it adds a cartoon based mouth... but I want to animate the dog's mouth, so it looks like he is singing. Anime doesn't appear to give you this option either, so is there a program that does animate a given image, rather than adding a mouth on top of an image?





By dickymac - 8 Years Ago
Hi Swoop .... did you miss this post?
By dickymac - 8 Years Ago
Hi Swoop.. and thanks for the information about Magpie!

Did you get a chance to see my other questions to you about your last post?

You also gave me this image showing some very good Lip Shape Data, but how do I get to this in CT?

https://forum.reallusion.com/uploads/images/ae7ca0bc-e52f-4f84-902f-1acb.jpg
By dickymac - 8 Years Ago
Hi Kelleytoons and thanks for the information.

I didn't see CT as a low end product, but I suppose that would account for some of the issues!?

I've seen Papagayo before, when I was looking at the idea of animation... but I think this is intended as a prompt sheet editor for animators, rather than having a mirror in front of them, but I can see what you mean in terms of being an aid.

Swoop suggested Magpie Pro by ThirdWishSoftware, but although it certainly has some great features... especially the timeline design and use, it doesn't seem to have had any development over the last 5 years and a lot has happened in that time.

I've been looking at Adobe's Character Animator... this is a simple, intuitive and different approach to animation and lip syncing. Interesting that the new feature of RL Animator 3... is here too!?  Downside... only cloud version... cc and although the program is stand alone, you can't buy it as a stand alone! RL should check it out!? :)

My feeling is that CT is too esoteric in its design... for example, I couldn't get to grips with the flags on the timeline and the speed and loop options. Although they looked like the same options in many other programs.... they don't work in the same way. Jeff from RL did a great job at explaining this, but I still couldn't get to use them in my workflow... they were just so different!

RL should strip CT down... remove a lot of the stuff that doesn't make intuitive sense; make the timeline user friendly... can't stand zooming in, pressing play to review and it zooms out again and you have to start all over... frustrating and time consuming... It should look at the algorithm used to create the basic lip sync track... it adds too many phonemes.. needs basic ones inputted and then a simple ability to edit, add, or delete.. Shortcut keys for this would be a great help... and I'm still looking for this option that Swoop showed me:

https://forum.reallusion.com/uploads/images/b26576be-6a1a-42c2-9043-3881.jpg

Thanks for your time again, much appreciated!
By Kelleytoons - 8 Years Ago
Magpie is one of the programs I bought and used before Papagayo -- as I said, I have spent thousands of dollars trying to get good lip sync.

Papagayo will give you lip sync -- the cartoon mouth is only a representation of what it produces, not what it is intended for.  As you note, it's a bit of a clue sheet for animators but an automated one.  That is, it produces the phonemes so you can load them into AS and manipulate what you want (whether it's cartoon or not).  In the case of your dog animation, what *might* be possible (using the new tools in AS 12) is to create mouth shapes from the actual dog picture and then have Papagayo load in the right phoneme manipulation.  I say "might" because although I have AS 12 I haven't worked with it much (my own fault -- I haven't done much animation this year).

If I get the time I'll take a closer look and see how realistic that might be -- can you post an image of your dog here that you are trying to do this with?

But what I had suggested before was using the file that Papagayo produces (an ASCII file) and by reading the file yourself you could see what phonemes were required and then (possibly) set those in CrazyTalk yourself.  I realize that at the start of this thread you were complaining that CT doesn't actually DO phonemes, so there is that problem, but if you had some kind of conversion table for what it does produce you might still be able to use this approach.  It would not be automatic, though, not in the same way that loading that Papagayo file into AS would be, but it might be a start (although I have CrazyTalk I ONLY use it for the ability to create 3D heads, so I can't tell you anything about lip sync in it).
By dickymac - 8 Years Ago
Hi.. 

I've been working on CT all day and it has just about fried my brain! It just doesn't seem to be fit for purpose.... When I look at it and I put the audio file in and the image into place after going through the setting up... I thought it should be simple!? I was happy to try and adjust the lip editions that CT had attempted, but this process seems just impossible! It's as if CT is fighting the user all the way... Not intuitive, or friendly at all!  I think I'll leave it tomorrow and go and do a bit of bomb disposal... not so taxing, or unpredictable!

I'll put the image here... when the RL server is up and running again.... but please don't feel you have to have a go.... I'm grateful for your input.. I'm waiting to hear from Swoop... I asked him some questions and if he could do a quick tutorial as he seemed to know what I was doing wrong!? Nothing yet!

https://forum.reallusion.com/uploads/images/cd3bc084-c418-47d3-a093-e3a2.png