LTX 2.3 Tutorial Part 2: Total Narrative Control with First & Last Frame Workflow

Watch on YouTube

Now Playing

Transcript

288 segments

0:00

Hi guys. Hi everyone. Thank you to be

0:03

here again with me with Edo in this

0:06

channel. I want to thank you for your

0:08

support, for your feedback, for

0:10

everything you ask me in the comments.

0:13

I'm It's a pleasure to me. Um today I'm

0:18

I want to make a part two video, okay?

0:22

um of the video with LTX 2.3 where we

0:28

saw how to install how to surfing around

0:32

this you know workflow simple workflow

0:36

for one image for one image

0:40

uh to have something uh more specific uh

0:44

with you know um just prompt and first

0:48

frame. Today I want to show you

0:52

something different.

0:53

This workflow here uh useful to create

0:58

something with first and last frame

1:01

video. Uh it's powerful and I think it's

1:05

very cool. Um

1:08

and thanks to what dream cost confi

1:12

nodes. Um I tried many many many many of

1:17

these first last frame workflow and no

1:20

one worked. No one no single one worked

1:25

well.

1:26

This is the only one worked for me.

1:30

Worked well. Okay. So I just create I

1:35

can create a real first last frame video

1:39

in local. Remember in local with my 5060

1:42

Ti 16 gig of RAM and 74 gig of RAM in my

1:48

spec. I'm working with the Ryzen 9. So

1:51

this worked for me.

1:55

I hope will work for you as well. Okay.

1:59

Uh the workflow it's quite simple. It's

2:04

quite clever. Again compliment again was

2:07

the dream cost. Very very very cool. uh

2:10

workflow. Um, and I want to show you how

2:14

to work with this workflow.

2:17

Uh, he made um he made also uh you know

2:22

2:24

um a a

2:27

video a YouTube video you can find in

2:29

your uh in this uh tutorial if you want

2:32

to follow him the the the creator. I

2:35

want to show you my my my my opinion and

2:39

my tests. Okay. Um as you can see we

2:43

have three parts of the the the you know

2:46

the model. The first part is the loader

2:49

with everything in here. Okay. The

2:53

second part is the multi- image loader

2:57

created to uh to have the control of the

3:01

images. And then we have the upscale

3:04

part.

3:07

Okay.

3:08

With two two stage of us upscaling if

3:12

you want. I'm using just one. Okay. So

3:17

the the idea it's to insert two you know

3:22

two images the first and the last

3:25

uh create um a sequencer.

3:29

Okay. If I remove everything, it will

3:33

remove also the images. But I want to

3:37

insert again. Okay. And it will create

3:40

this. We work with frames. Uh and we

3:44

saying to the model, insert the first

3:47

images here. Okay. Uh and the last frame

3:51

is the second one. If I'm if I die the

3:55

deal minus one as here. Okay. I tell to

4:00

the model this is the last frame so work

4:03

as you want. By the way, we can work

4:06

with the seconds

4:09

and we want to insert the second frame

4:12

when we want in terms of seconds and

4:15

work with frames. So uh you know I'm

4:19

just leave this and we

4:23

have to deal with something different

4:27

from the previous one model. As you can

4:31

remember this model here we have just

4:36

one powerful model or the checkpoint on

4:39

one Lora. Everything else is in this

4:42

model here. For this workflow,

4:46

we need a different

4:49

loader.

4:51

And for this workflow, you have to

4:55

update your KJ confi nodes. How to do

5:00

this? Right here, right here, and

5:04

update.

5:06

If you don't update these notes here,

5:09

this workflow will not work. Okay. So

5:13

update everything everything and then

5:16

use the the last checkpoint we used for

5:19

LTX. Okay, I'm using the dev FB8

5:24

less powerful but very fast. And we have

5:27

to download this

5:31

um v

5:33

here. This they here and this also here

5:39

you can find in your uh right there uh

5:45

you can find here here and here there

5:48

are different um

5:53

different this is the the the workflow

5:55

for for LTX. Okay. This is the uh

6:00

upscaler. Okay. Okay. And this is the V

6:05

LTX.

6:08

Uh,

6:09

and we are here. I'm using the Laura I

6:13

used previously in the LTX single frame

6:17

and the two text encoder Gemma and LTX

6:20

checkpoint. And this is the spe special

6:24

upscaler special upscaler. Okay. And

6:27

here we have the value of seconds. We

6:30

can say seven whatsoever. And it's very

6:34

clever because he create this node with

6:38

a mathemat math um uh you know

6:42

operation. Uh it's very very simple but

6:45

clever cool because we don't have to

6:48

make everything every time the

6:50

conversion

6:51

uh from uh the number of frame and the

6:54

value in seconds.

6:57

So it's it's simple but it's cool.

7:00

Great, great job.

7:02

And and this is everything you need to

7:05

work with this uh workflow. Okay. Um

7:11

this is everything you need. Yes. I want

7:14

to show you something else. I don't

7:16

think so. No, it's very simple. Here we

7:22

have the process latent right there with

7:27

the

7:29

you know the upscale that reduce the the

7:32

the the

7:35

scale of your image the the the rate of

7:37

your image by a half and then we'll

7:40

upscale again by two. Okay, this is the

7:44

stage one

7:46

and this is then the stage two here.

7:50

Here, right there. I just uh change this

7:55

um cfg to ancestral

8:00

uh C cfgpp or uler cfgppp.

8:05

I read somewhere that it's the best for

8:08

LTX, so I work with this. Okay. Uh or

8:14

you can just leave as you want. And this

8:16

is the the second one uler as CFG. As

8:20

you can see that the noise for the

8:22

second part.

8:24

And leave everything as you see. It's

8:28

very very precise the the the flux. So

8:32

just leave it. Okay. If you want you can

8:36

create another stage of refiner with a

8:39

third part but you know for me it's fine

8:44

just this okay as you can see another

8:47

another another stage of upscale

8:52

and that's it.

8:54

Okay, this is the the decoder of your of

8:56

course and we can see everything right

8:59

there with the tile the temporal size

9:02

and the overlap.

9:04

This is the default value for this

9:06

model. You can work with this. I work

9:09

with this fine. So why not use this

9:12

default values and we can

9:15

see the last part of the video uh the

9:20

prompt. Remember LTX it's very sensitive

9:25

to prompt injection. So try to be very

9:28

precise in what you see in the scene and

9:31

what you want to see in the movement.

9:34

Okay.

9:36

In this particular scene, a

9:38

photorealistic high quality cinematic

9:40

video transition. I want to see the

9:42

transition. The woman begins to stand up

9:46

gracefully from the stool. So I just

9:50

create the perfect you know match with

9:52

the the the image as she rise the camera

9:56

performance control fluid crane up and

9:59

slide up pan in movement to maintain

10:01

high levels focus and adjust blah blah

10:04

blah simultaneously she turns his body

10:06

and face the camera blah blah blah blah

10:08

blah blah and she says cheers have a

10:12

nice party the background of glowing

10:15

leor bottles and warmah

10:17

B. So be precise.

10:20

Be precise if you want a precise uh

10:24

movement and a precise video with LTX.

10:28

It's demanding. So uh if you create a

10:31

poor um poor

10:35

um prompt, the video will be poor. Okay.

10:40

Just another thing if you want these two

10:44

these two uh you know um these two

10:47

values is very important the pro the

10:50

compression the lowest value the the the

10:55

the detail will be uh more precise

11:00

crispies sometimes so balance in your

11:03

with your um you know pleasure I'm very

11:07

good with 15 20 values

11:10

Okay. And then remember the the multiply

11:15

this loader will in insert the the

11:19

images and crop it in a right way in

11:24

order to then upscale again. So in my

11:27

opinion it's very important to have a

11:30

first frame with the right ratio. Okay.

11:34

divide uh that you can divide by 30 32

11:40

or 16. Okay, in my case work well this

11:43

value that is the standard for LTX

11:48

uh se 736

11:51

and 1,280

11:54

pixels. Okay, this is divided you can

11:57

divide it by 32. It's important just to

12:02

have the right ratio and the right

12:05

dropping ratio with LTX and not to have

12:08

something strange or you know messy. So

12:13

this is all you need to know to perform

12:16

a run with this model and I want to show

12:19

you the the the result. Okay,

12:23

this is result.

12:31

Okay, again

12:34

as you can see he moved she moved the

12:36

the chair.

12:39

Cheers party. It's very good. It's very

12:43

very smooth. Okay, I g I made another

12:46

one,

12:51

but the the the torsion of the head it's

12:55

quite quite

12:57

12:59

it's good but not so good. Okay, maybe

13:02

there is another one. I think it's this.

13:05

Okay, sorry

13:07

I made a mess

13:10

from the last video I made. uh you know

13:13

I perform a a zoom in from uh this start

13:19

video

13:20

start frame to this last frame.

13:26

This is in local. This is open source

13:31

and I take 300 seconds. 300 seconds.

13:35

Okay. Five minutes. six minutes with my

13:40

6050 Ti. You know this is the value

13:46

500 seconds

13:49

for you know for six uh length video six

13:53

seconds length video or 10 second length

13:57

video I think it's impressive very very

14:01

impressive I think it's quite good uh

14:08

you can find I want to show you

14:10

something very strange change happened.

14:13

This is one of uh with a a wrong prompt.

14:17

I made a mess with the prompt and I want

14:20

and I wanted to create something very

14:22

long. 10 seconds.

14:25

Oh, whoa. What's going on? Come on.

14:27

Sorry. Again.

14:31

Okay. It crashed everything. I don't

14:35

know why.

14:40

Okay.

14:43

Strange.

14:45

Maybe I asked

14:48

something wrong to my GPU. I don't know

14:51

why. Uh, okay. And another one. Okay.

14:55

This is good. This is okay. I don't know

14:59

why. This is a 10 seconds. So, she

15:01

stands up. She go back and she moved the

15:05

chair. Blah blah blah.

15:09

Okay.

15:11

is not good but it's not so bad. Okay.

15:16

In terms of what are we doing with open

15:19

source

15:21

open source guys.

15:25

So I think it's an incredible way to

15:28

work with AI in a open source uh uh you

15:32

know perspective.

15:34

This is the top the top the aex of the

15:38

the video generation model

15:41

and I hope this

15:44

workflow could help you. Thanks again

15:48

Wild Minder.

15:50

No sorry not wild mind. Thank you also

15:53

one minder. Okay. Wet dream cost. What

15:56

dream cost? Thank you guy. Thank you

15:58

very much. It's Jonathan Jonathan

16:02

Watkins. Jonathan, thank you. Thank you

16:05

for your uh you know um your your

16:09

effort. It's powerful. It's it's very uh

16:13

you know clean, beautiful to to see and

16:16

to uh to work with. So thanks. Uh it

16:20

worked for me. I hope will work for you.

16:24

And again, thank you to to be here with

16:27

me with Edo and I hope this kind of

16:30

video could help you in your AI journey.

16:33

I work with this um advanced generation

16:36

um modus all days and I hope my effort

16:41

could help you in your journey and I

16:45

hope to see you again next video.

Interactive Summary

Ask follow-up questions or revisit key timestamps.

The video presents a part two tutorial on creating videos with first and last frames using the LTX workflow, specifically highlighting a powerful and clever method developed by "dream cost". The presenter emphasizes that this is the only first-last frame workflow that has worked well for them and can be run locally on their hardware. The tutorial details the workflow, which involves loading the first and last images, using a sequencer to handle frames, and then upscaling. Key steps include updating the "KJ confi nodes", using a specific checkpoint (dev FB8), downloading necessary components (V, LTX, upscaler), and leveraging a special upscaler. The presenter explains the math operation node for calculating seconds and frames, the two-stage upscaling process, and the importance of precise prompting for LTX due to its sensitivity to prompt injection. They also discuss compression values and the recommended image ratio for the first frame. The video showcases the results of the workflow, including smooth transitions and movements, and briefly touches upon potential issues like crashes with complex prompts or long durations. The presenter concludes by expressing gratitude to "dream cost" and "Jonathan Watkins" for their contributions and hopes the tutorial assists viewers in their AI journey.