The main theme of this article originates from the struggle between ControlNets. Speaking specifically about ControlNets, in certain combinations, it can be challenging to replace targets in the image, such as clothing, background,
Note: This operation is highly VRAM-intensive. When creating a short video, after loading ControlNet calculations, it consumed around 16GB of VRAM. If your VRAM is insufficient, it is recommended to use the ComfyUI
Although AnimateDiff can provide a model algorithm for the flow of animation, the issue of variability in the produced images due to Stable Diffusion has led to significant problems such as video flickering