AnasKAN/acceleration.md

## acceleration.md

      
    Raw
  

              acceleration.md
            
          
    Open Neural Network Exchange (ONNX)


onnx.ai is an open format built to represent machine learning models. ONNX defines a common set of operators - the building blocks of machine learning and deep learning models - and a common file format to enable AI developers to use models with a variety of frameworks, tools, runtimes, and compilers.
Intro to onnx
Docs (very useful)
CUDA Kernels Low-level ML materials


this section list resources to learn about pytorch/tensorflow/JAX gpu accelerations which can speed up fine tuning up to 33-50%!!! instead of you waiting for 12 hours to finish training you can trim it to 6 hours, cutting costs and saving time.
You'll learn Triton, JAX, Flash attention and so much more.
GPUMODE channel have so many lectures teaching you all you need literally from the ground up. and their discord server is this.
GPUMODE discord
Unsloth is one of their kinds. Have you heard of bug bounty hunters in cybersecurity field? those guys are LLMs bug hunters they work non-stop to accelerate LLMs inference and fine-tuning, lately they branched off to multimodal models including TTS models and VLMs!
Docs
Valuable lecture
To accelerate diffusion models finetuning:
dreambooth
Efficient Finetuning

ML in production

Lightining # its pretty good to pair it with hydra btw