The generated image differs significantly from Canny's
#9
by
demo001s
- opened
I think it should be an image generated based on the text
Connect text ouput with clip input text also, and try depth controlnet if it won't work
Is it because my picture is not 1024x1024 ?
Did you connect text to clip input also?