Informal (vibes-based) evaluation of the following vision-language-model captioners:
- Florence-2-base-ft
- CogVLM2
- BLIP-2
- MoonDream2
- Share-Captioner
- Florence-2-SD3-Captioner
#!/usr/bin/env python3
import gradio as gr
import numpy as np
import random
import torch
from diffusers import (
    StableDiffusion3Pipeline,
    SD3Transformer2DModel,
    FlowMatchEulerDiscreteScheduler,
    AutoencoderTiny,
)
def add_profiling_markers(model):
    """Monkey-patch profiling markers into an nn.Module.

    Args:
        model: an nn.Module

    Effect:
        all model.named_modules() forward calls get wrapped in their
        own profiling scope, making traces easier to understand.
    """
    for name, module in model.named_modules():
        def wrapped(*args, _name=name or "model", _forward=module.forward, **kwargs):
            # each forward call shows up as its own named range in the trace
            with torch.profiler.record_function(_name):
                return _forward(*args, **kwargs)
        module.forward = wrapped
Stable Diffusion's VAE is a neural network that encodes images into a compressed "latent" format and decodes them back. The encoder performs 48x lossy compression, and the decoder generates new detail to fill in the gaps.
(Calling this model a "VAE" is sort of a misnomer - it's an encoder with some very slight KL regularization, and a conditional GAN decoder)
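Where the 48x comes from: the encoder downsamples 8x in each spatial dimension and maps 3 RGB channels to 4 latent channels. Counting elements (not bytes — the ratio in bytes also depends on dtypes), for an example 512×512 image:

```python
# SD VAE shapes: 8x spatial downsampling, 3 image channels -> 4 latent channels
image_elems = 512 * 512 * 3                  # elements in the RGB image
latent_elems = (512 // 8) * (512 // 8) * 4   # elements in the 64x64x4 latent
ratio = image_elems / latent_elems
```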
This document is a big pile of various links with more info.
Cleaned up version of https://gist.github.com/mrsteyk/74ad3ec2f6f823111ae4c90e168505ac,
which is in turn based on the public_diff_vae.ConvUNetVAE
from https://github.com/openai/consistencydecoder.
Install the consistency decoder code (for the inference logic) and download the extracted weights:
def summarize_tensor(x):
    return (
        f"\033[34m{str(tuple(x.shape)).ljust(24)}\033[0m "
        f"(\033[31mmin {x.min().item():+.4f}\033[0m / "
        f"\033[32mmean {x.mean().item():+.4f}\033[0m / "
        f"\033[33mmax {x.max().item():+.4f}\033[0m)"
    )
class ModelActivationPrinter:
    def __init__(self, module, submodules_to_log):
        # map module ids to dotted names so logged activations get readable labels
        self.id_to_name = {
            id(submodule): str(name) for name, submodule in module.named_modules()
        }
        self.submodules = submodules_to_log
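summarize_tensor above leans on raw ANSI SGR escape codes (`\033[34m` blue, `\033[31m` red, `\033[32m` green, `\033[33m` yellow, `\033[0m` reset). The wrap-then-reset pattern in isolation (the `colorize` helper is illustrative, not from the script):

```python
# ANSI SGR codes used by summarize_tensor
RED, GREEN, YELLOW, RESET = "\033[31m", "\033[32m", "\033[33m", "\033[0m"

def colorize(text, color):
    # wrap text in a color code, then reset so later output is unaffected
    return f"{color}{text}{RESET}"

line = " / ".join([colorize("min", RED), colorize("mean", GREEN), colorize("max", YELLOW)])
```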
#!/usr/bin/env python3
from pathlib import Path
from safetensors.torch import load_file

def summarize_tensor(x):
    if x is None:
        return "None"
    x = x.float()
    return f"({x.min().item():.3f}, {x.mean().item():.3f}, {x.max().item():.3f})"