pip install transformers torch
git clone https://huggingface.co/EleutherAI/gpt-j-6b # depends on git-lfs
from transformers import AutoTokenizer, pipeline
model_dir = "<path to cloned model>"
tokenizer = AutoTokenizer.from_pretrained(model_dir)
generator_pipe = pipeline('text-generation', model=model_dir, tokenizer=tokenizer)
output = generator_pipe("I love the Avengers", max_length=30, num_return_sequences=1)
print(output)
For large models, the PyTorch checkpoint is sharded into multiple files during cloning (with an index file mapping each weight to its shard). Running a pipeline from the model directory loads the shards and assembles them into a single state dict, equivalent to one pytorch_model.bin.
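The merge step can be sketched from the index file alone. This is a minimal sketch, not the transformers internals; the `load_fn` indirection is my addition (for a real checkout it would be `lambda p: torch.load(p, map_location="cpu")`):

```python
import json
import os

def merge_sharded_state_dict(model_dir, load_fn):
    """Merge a sharded checkpoint into one state dict.

    load_fn maps a shard path to its state dict, e.g.
    lambda p: torch.load(p, map_location="cpu").
    """
    index_path = os.path.join(model_dir, "pytorch_model.bin.index.json")
    with open(index_path) as f:
        index = json.load(f)
    merged = {}
    # weight_map: parameter name -> shard file name
    for shard_file in sorted(set(index["weight_map"].values())):
        merged.update(load_fn(os.path.join(model_dir, shard_file)))
    return merged
```

If a single file is needed on disk, the merged dict can be written back with torch.save(merged, "pytorch_model.bin").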
import torch
sd = torch.load("/path/to/pytorch_model.bin", map_location="cpu")
# Number of parameters by layer
for k, v in sd.items():
    print(f'{v.numel():10} | {k}')
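The per-tensor counts above can also be summed to get the total parameter count (the helper name is mine):

```python
def count_parameters(state_dict):
    # Sum the element count of every tensor in the state dict.
    return sum(v.numel() for v in state_dict.values())

# e.g. print(f"{count_parameters(sd) / 1e9:.2f}B parameters")
```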
8-bit quantized LLM (SmoothQuant)
git clone https://huggingface.co/mit-han-lab/opt-30b-smoothquant
See the model list here and change the clone URL accordingly (official + paper).
3/4-bit quantized LLM torch checkpoint (AWQ)
git clone https://huggingface.co/datasets/mit-han-lab/awq-model-zoo
See the model list here (official + paper). The cloned files are the optimized quantization hyperparameters (AWQ search results) of the respective models, not the quantized weights themselves.
To produce the quantized weights, follow step 3 of the usage section.
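Conceptually, that step applies the searched per-channel scales and then rounds the weights group-wise to 3/4 bits. A toy sketch of group-wise symmetric 4-bit rounding (my illustration, not the llm-awq implementation):

```python
import torch

def quantize_groupwise(w, n_bit=4, group_size=128):
    """Toy group-wise symmetric quantization sketch (not the llm-awq code).

    w: 2-D weight tensor [out_features, in_features]; in_features must be
    divisible by group_size.
    """
    out_f, in_f = w.shape
    g = w.reshape(out_f, in_f // group_size, group_size)
    # One scale per group, symmetric around zero.
    max_q = 2 ** (n_bit - 1) - 1
    scale = g.abs().amax(dim=-1, keepdim=True).clamp(min=1e-8) / max_q
    q = (g / scale).round().clamp(-max_q - 1, max_q)
    # Dequantize back to float for use in a standard matmul.
    return (q * scale).reshape(out_f, in_f)
```

The group size of 128 mirrors the "-g128" suffix seen on the model-zoo file names; smaller groups trade memory for accuracy.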