D-Ogi

## wd1-4.md

      
harubaru / wd1-4.md

Last active September 11, 2023 04:12

### Waifu Diffusion 1.4 Overview


_An image generated at 512x512 resolution, then upscaled to 1024x1024, with Waifu Diffusion 1.3 Epoch 7._

#### Goals


- Improve image generation at different aspect ratios by using conditional masking during training. This lets the model see the entire image during training instead of a center crop, which should give better results for full-body images and portraits and improve overall composition.
- Expand the input context from 77 tokens to 231 tokens, or possibly to an unlimited number of tokens. Of the 77 input tokens, only 75 are usable, since the start and end markers each take one slot. That does not leave nearly enough room for complex prompts that require a lot of detail.
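
The token arithmetic behind the second goal can be illustrated with a small sketch. It assumes the 231-token figure means three 77-token windows (3 × 77 = 231), each framed by its own start and end marker, which leaves 3 × 75 = 225 usable prompt tokens. The function name `chunk_prompt`, the `max_windows` parameter, and the padding scheme are hypothetical and are not taken from the Waifu Diffusion training code; the special token ids are the ones used by the OpenAI CLIP vocabulary and should be treated as an assumption here.

```python
from typing import List

# Assumed CLIP-style special token ids (illustrative defaults, not from the source).
BOS_ID = 49406
EOS_ID = 49407

WINDOW = 77           # tokens the text encoder accepts per pass
USABLE = WINDOW - 2   # 75: one slot each is reserved for BOS and EOS


def chunk_prompt(token_ids: List[int], max_windows: int = 3) -> List[List[int]]:
    """Split prompt token ids into BOS/EOS-framed windows of WINDOW tokens.

    Hypothetical helper: tokens beyond USABLE * max_windows are truncated,
    and a short final window is padded with EOS_ID.
    """
    token_ids = token_ids[: USABLE * max_windows]
    windows = []
    # max(..., 1) ensures an empty prompt still yields one padded window.
    for start in range(0, max(len(token_ids), 1), USABLE):
        body = token_ids[start:start + USABLE]
        padding = [EOS_ID] * (USABLE - len(body))
        windows.append([BOS_ID] + body + [EOS_ID] + padding)
    return windows
```

In a setup like this, each window would be encoded separately and the resulting embeddings concatenated along the sequence axis, so a 100-token prompt occupies two windows (154 positions) and a prompt of 225 tokens or more fills all three (231 positions).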