bhavul / Troubleshoot-dcnn.md
Created August 27, 2018 19:20 — forked from zeyademam/Troubleshoot-dcnn.md
Troubleshooting Convolutional Neural Nets

Troubleshooting Convolutional Neural Networks

Intro

This is a list of hacks, gathered primarily from prior experience and from online sources (most notably Stanford's CS231n course notes), for troubleshooting the performance of a convolutional neural network. We focus mainly on supervised learning with deep neural networks. While this guide assumes the user is coding in Python 3.6 using TensorFlow (TF), it can still be helpful as a language-agnostic guide.

Suppose we are given a convolutional neural network to train and evaluate, and the evaluation results are worse than expected. The following steps help troubleshoot and potentially improve performance. The first section covers must-do's and generally good practices to follow before you start troubleshooting. Every subsequent section header corresponds to a problem, and the section is devoted to solving it. The sections are ordered to reflect "more common" issues first and under each header the "most-eas