"A Recipe for Training Neural Networks" - Andrej Karpathy

A Recipe for Training Neural Networks - Andrej Karpathy

  1. Neural net training is a leaky abstraction
  2. Neural net training fails silently

The Recipe

  1. Become one with the data
  2. Set up the end-to-end training/evaluation skeleton + get dumb baseline
    1. fix random seed (sketch below)
    2. simplify
      1. Data augmentation
        1. Data Augmentation | How to use Deep Learning when you have Limited Data — Part 2
        2. Google ‘fixed’ its racist algorithm by removing gorillas from its image-labeling tech
        3. Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks
          1. http://people.csail.mit.edu/junyanz/
        4. Deep Photo Style Transfer
        5. NanoNets
    3. add significant digits to your eval
    4. verify loss @ init (sketch below)
    5. init well
    6. human baseline
    7. input-independent baseline (sketch below)
    8. overfit one batch (sketch below)
    9. verify decreasing training loss
    10. visualize just before the net
    11. visualize prediction dynamics
    12. use backprop to chart dependencies
    13. generalize a special case
  3. Overfit
    1. picking the model
    2. adam is safe
    3. complexify only one at a time
    4. do not trust learning rate decay defaults
  4. Regularize
    1. get more data
    2. data augment (sketch below)
    3. creative augmentation
      1. Learning Dexterity
      2. Playing for Data: Ground Truth from Computer Games
      3. Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection
    4. pretrain
    5. stick with supervised learning
    6. smaller input dimensionality
    7. smaller model size
    8. decrease the batch size
    9. dropout
      1. Understanding the Disharmony between Dropout and Batch Normalization by Variance Shift
    10. weight decay (sketch below)
    11. early stopping (sketch below)
    12. try a larger model
  5. Tune
    1. random over grid search (sketch below)
    2. hyper-parameter optimization
      1. Random Search for Hyper-Parameter Optimization
  6. Squeeze out the juice
    1. ensembles (sketch below)
      1. Distilling the Knowledge in a Neural Network
    2. leave it training
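
Below are a few minimal code sketches for the steps marked "(sketch below)" in the outline. They assume PyTorch/NumPy and use placeholder names and values, not anything taken from the original article.

For "fix random seed": seed every RNG the run touches so repeated runs are as repeatable as the hardware allows. The `seed_everything` name and the value 42 are my own placeholders.

```python
# Seed every source of randomness the training loop uses, and make cuDNN
# deterministic (at some cost in speed).
import os
import random

import numpy as np
import torch


def seed_everything(seed: int = 42) -> None:
    random.seed(seed)                    # Python's built-in RNG
    np.random.seed(seed)                 # NumPy RNG
    torch.manual_seed(seed)              # CPU and default CUDA seeding
    torch.cuda.manual_seed_all(seed)     # all GPUs, explicitly
    os.environ["PYTHONHASHSEED"] = str(seed)
    torch.backends.cudnn.deterministic = True
    torch.backends.cudnn.benchmark = False


seed_everything(42)
```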
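For "verify loss @ init": with a softmax over C classes, the first cross-entropy loss should come out near -log(1/C). The tiny linear model here is only a stand-in for the real untrained network.

```python
# Check the very first loss against the value expected from chance predictions.
import math

import torch
import torch.nn as nn

num_classes = 10
model = nn.Linear(32, num_classes)   # stand-in for the untrained network
nn.init.zeros_(model.bias)           # no class favoured at initialization

x = torch.randn(256, 32)
y = torch.randint(0, num_classes, (256,))

loss = nn.functional.cross_entropy(model(x), y)
print(f"initial loss {loss.item():.3f}, expected ~{math.log(num_classes):.3f}")
```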
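For "input-independent baseline": compare the loss on real inputs against the same labels with the inputs zeroed out; real data should do clearly better, otherwise the net is not actually using its input. This is a quick eval-time variant of the check, and `input_independent_check` is a made-up helper.

```python
# Compare loss with real inputs vs. the same labels but zeroed inputs.
import torch


@torch.no_grad()
def input_independent_check(model, loss_fn, x, y):
    model.eval()
    real_loss = loss_fn(model(x), y).item()
    zero_loss = loss_fn(model(torch.zeros_like(x)), y).item()
    print(f"real inputs: {real_loss:.3f}   zeroed inputs: {zero_loss:.3f}")
    return real_loss, zero_loss
```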
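For "overfit one batch": take a handful of examples and verify the training loss can be driven to roughly zero; if it cannot, something in the pipeline is broken. The helper name, step count, and learning rate are placeholders.

```python
# Overfit a single tiny batch; the loss should approach zero.
import torch


def overfit_one_batch(model, loss_fn, x, y, steps=500, lr=1e-3):
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    model.train()
    for step in range(steps):
        opt.zero_grad()
        loss = loss_fn(model(x), y)
        loss.backward()
        opt.step()
        if step % 100 == 0:
            print(f"step {step:4d}   loss {loss.item():.5f}")
    return loss.item()   # should end up near zero
```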
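For "data augment" under Regularize: a small torchvision pipeline applied only to the training split. CIFAR-10 and the (commonly used) normalization statistics are just a convenient example, not from the article.

```python
# Cheap, realistic augmentations for the training split only.
import torchvision.transforms as T
from torchvision.datasets import CIFAR10

normalize = T.Normalize((0.4914, 0.4822, 0.4465), (0.247, 0.243, 0.261))

train_tf = T.Compose([
    T.RandomCrop(32, padding=4),
    T.RandomHorizontalFlip(),
    T.ToTensor(),
    normalize,
])
eval_tf = T.Compose([T.ToTensor(), normalize])   # no augmentation at eval time

train_set = CIFAR10("./data", train=True, download=True, transform=train_tf)
test_set = CIFAR10("./data", train=False, download=True, transform=eval_tf)
```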
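For "weight decay": the penalty is usually set through the optimizer; AdamW decouples the decay from the adaptive update, which tends to be the safer choice with Adam-family optimizers. The model and the numbers are placeholders to tune on validation data.

```python
# Weight decay through the optimizer; tune the coefficient on a validation set.
import torch
import torch.nn as nn

model = nn.Linear(32, 10)   # stand-in for the real network

optimizer = torch.optim.AdamW(
    model.parameters(),
    lr=3e-4,
    weight_decay=1e-2,      # start small and increase if still overfitting
)
```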
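For "early stopping": track the best validation loss seen so far and stop once it has not improved for a fixed number of evaluations. The class name and the `patience` value are my own.

```python
# Stop training when the validation loss stops improving.
class EarlyStopping:
    def __init__(self, patience: int = 10):
        self.patience = patience
        self.best = float("inf")
        self.bad_evals = 0

    def step(self, val_loss: float) -> bool:
        """Return True when training should stop."""
        if val_loss < self.best:
            self.best = val_loss
            self.bad_evals = 0
        else:
            self.bad_evals += 1
        return self.bad_evals >= self.patience


stopper = EarlyStopping(patience=10)
# inside the training loop:
#     if stopper.step(validate(model)):
#         break
```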
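For "random over grid search": sample each hyper-parameter independently (log-uniform for scale-like ones) instead of walking a fixed grid, in the spirit of Bergstra & Bengio's "Random Search for Hyper-Parameter Optimization". The ranges below are arbitrary examples.

```python
# Random hyper-parameter sampling: every trial gets fresh values on each axis.
import random


def sample_config():
    return {
        "lr": 10 ** random.uniform(-5, -2),             # log-uniform
        "weight_decay": 10 ** random.uniform(-6, -2),   # log-uniform
        "dropout": random.uniform(0.0, 0.5),
        "batch_size": random.choice([32, 64, 128, 256]),
    }


trials = [sample_config() for _ in range(50)]
# run the same training/eval skeleton on each config and keep the best
```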
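For "ensembles": average the predicted class probabilities of a few independently trained models to squeeze out a little extra accuracy; distillation (per the cited paper) can later compress the ensemble back into a single model. `ensemble_predict` is a made-up helper.

```python
# Average softmax probabilities across independently trained models.
import torch


@torch.no_grad()
def ensemble_predict(models, x):
    probs = [torch.softmax(m(x), dim=-1) for m in models]   # per-model probabilities
    return torch.stack(probs).mean(dim=0)                   # ensemble average


# preds = ensemble_predict([model_a, model_b, model_c], batch).argmax(dim=-1)
```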