Cleaned up version of https://gist.github.com/mrsteyk/74ad3ec2f6f823111ae4c90e168505ac,
which is in turn based on the public_diff_vae.ConvUNetVAE
from https://github.com/openai/consistencydecoder.
Install the consistency decoder code (for the inference logic) and download the extracted weights: