First, create a Load Image node,
then add two CLIP Text Encode nodes,
one for the positive prompt and one for the negative prompt. Then connect the VAE to a VAE Encode node, and connect the mask to the latent noise node (I forgot the exact name of this one).
Load your favourite flux1/zimage... model and enjoy.
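If it helps, here is roughly how that wiring might look written out as a ComfyUI API-format graph in Python. This is only a sketch under several assumptions: the node class names and input names are the stock ComfyUI ones as I remember them, so check them against your install; the "latent noise" node is guessed to be Set Latent Noise Mask; the model is assumed to be a single-file checkpoint that also provides CLIP and VAE (Flux-style setups often use separate loaders instead); and the file names and sampler settings are placeholders.

```python
import json


def build_inpaint_graph(image_name, ckpt_name, positive, negative, seed=0):
    """Return a node graph as {node_id: {"class_type", "inputs"}}.

    A link to another node is written as [source_node_id, output_index].
    Class and input names are assumed to match stock ComfyUI nodes.
    """
    return {
        # Load Image: output 0 is the image, output 1 is the mask.
        "1": {"class_type": "LoadImage", "inputs": {"image": image_name}},
        # Assumed single-file checkpoint: outputs MODEL (0), CLIP (1), VAE (2).
        "2": {"class_type": "CheckpointLoaderSimple",
              "inputs": {"ckpt_name": ckpt_name}},
        # Two text encoders: positive and negative prompt.
        "3": {"class_type": "CLIPTextEncode",
              "inputs": {"text": positive, "clip": ["2", 1]}},
        "4": {"class_type": "CLIPTextEncode",
              "inputs": {"text": negative, "clip": ["2", 1]}},
        # Encode the image pixels into a latent with the VAE.
        "5": {"class_type": "VAEEncode",
              "inputs": {"pixels": ["1", 0], "vae": ["2", 2]}},
        # Guess at the forgotten node: attach the mask to the latent.
        "6": {"class_type": "SetLatentNoiseMask",
              "inputs": {"samples": ["5", 0], "mask": ["1", 1]}},
        # Placeholder sampler settings; tune them for your model.
        "7": {"class_type": "KSampler",
              "inputs": {"model": ["2", 0], "positive": ["3", 0],
                         "negative": ["4", 0], "latent_image": ["6", 0],
                         "seed": seed, "steps": 20, "cfg": 7.0,
                         "sampler_name": "euler", "scheduler": "normal",
                         "denoise": 1.0}},
        "8": {"class_type": "VAEDecode",
              "inputs": {"samples": ["7", 0], "vae": ["2", 2]}},
        "9": {"class_type": "SaveImage",
              "inputs": {"images": ["8", 0], "filename_prefix": "inpaint"}},
    }


def dangling_links(graph):
    """List (node_id, input_name) pairs whose link points at a missing node."""
    missing = []
    for node_id, node in graph.items():
        for name, value in node["inputs"].items():
            is_link = isinstance(value, list) and len(value) == 2
            if is_link and value[0] not in graph:
                missing.append((node_id, name))
    return missing


if __name__ == "__main__":
    # Hypothetical file names, only for illustration.
    graph = build_inpaint_graph("photo.png", "my_model.safetensors",
                                "a red sofa", "blurry, low quality")
    print(json.dumps(graph, indent=2))
```

The point of writing it this way is just to make the connections explicit: the mask comes from the second output of Load Image, and it is attached to the latent after VAE Encode and before the sampler.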
