@alessiot
Last active September 30, 2019 17:30
Programming Neural Networks from Scratch
{
"cells": [
{
"cell_type": "markdown",
"metadata": {},
"source": [
"# Programming Neural Networks from Scratch\n",
"\n",
"To understand how to change the optimization process of a neural network model, we start by programming a neural network in plain Python, using only NumPy for the numerical work. I will be reusing the [code](https://github.com/SkalskiP/ILearnDeepLearning.py) written by Piotr Skalski and reviewed in his [article on Medium](https://towardsdatascience.com/lets-code-a-neural-network-in-plain-numpy-ae7e74410795)."
]
},
{
"cell_type": "code",
"execution_count": 1,
"metadata": {},
"outputs": [],
"source": [
"import pandas as pd\n",
"import VisualizeNN as VisNN\n",
"import numpy as np"
]
},
{
"cell_type": "code",
"execution_count": 2,
"metadata": {},
"outputs": [],
"source": [
"data = {'F1':[0,0,1,1], 'F2':[0,1,0,1], \"Target\":[0,1,1,0]} #XOR\n",
"df = pd.DataFrame(data) \n",
"\n",
"X_xor = df[[\"F1\",\"F2\"]]\n",
"y_xor = df[\"Target\"].values.tolist()"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Neural Network Architecture and Initialization"
]
},
{
"cell_type": "code",
"execution_count": 3,
"metadata": {},
"outputs": [],
"source": [
"NN_ARCHITECTURE = [\n",
" {\"input_dim\": X_xor.shape[1], \"output_dim\": 2, \"activation\": \"sigmoid\"},\n",
" {\"input_dim\": 2, \"output_dim\": 1, \"activation\": \"sigmoid\"}\n",
"]"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
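"Before training a neural network, one needs to initialize its weights. This is typically done by drawing small random values: the randomness gives each unit a different starting point, so that units in the same layer do not all learn the same thing, and the small scale keeps the sigmoid units away from their flat, saturated regions during the first iterations."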
]
},
{
"cell_type": "code",
"execution_count": 4,
"metadata": {},
"outputs": [],
"source": [
"def init_layers(nn_architecture, seed = 99):\n",
"    # random seed initiation\n",
"    np.random.seed(seed)\n",
"    # number of layers in our neural network\n",
"    number_of_layers = len(nn_architecture)\n",
"    # parameters storage initiation\n",
"    params_values = {}\n",
"\n",
"    # iteration over network layers\n",
"    for idx, layer in enumerate(nn_architecture):\n",
"        # we number network layers from 1\n",
"        layer_idx = idx + 1\n",
"\n",
"        # extracting the number of units in layers\n",
"        layer_input_size = layer[\"input_dim\"]\n",
"        layer_output_size = layer[\"output_dim\"]\n",
"\n",
"        # initiating the values of the W matrix\n",
"        # and vector b for subsequent layers\n",
"        params_values['W' + str(layer_idx)] = np.random.randn(\n",
"            layer_output_size, layer_input_size) * 0.1\n",
"        params_values['b' + str(layer_idx)] = np.random.randn(\n",
"            layer_output_size, 1) * 0.1\n",
"\n",
"    return params_values"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Activation Functions\n",
"\n",
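"Activation functions are non-linear functions applied element-wise to the values $Z$ computed at each layer; without them, a stack of layers would collapse into a single linear transformation. Sigmoid and tanh squash their input into a bounded range ($(0, 1)$ and $(-1, 1)$ respectively), whereas ReLU is unbounded above. For a review, one can refer to the Medium article [here](https://medium.com/binaryandmore/beginners-guide-to-deriving-and-implementing-backpropagation-e3c1a5a1e536)."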
]
},
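{
"cell_type": "markdown",
"metadata": {},
"source": [
"As a short worked reference for the functions implemented in this section (standard identities, not taken from the referenced article), the sigmoid and the hyperbolic tangent are closely related, and both have derivatives that can be written in terms of the function value itself:\n",
"\n",
"\\begin{equation}\n",
"\\sigma(z) = \\frac{1}{1+e^{-z}}, \\qquad \\tanh(z) = \\frac{e^{z}-e^{-z}}{e^{z}+e^{-z}} = \\frac{2}{1+e^{-2z}} - 1 = 2\\,\\sigma(2z) - 1,\n",
"\\end{equation}\n",
"\n",
"\\begin{equation}\n",
"\\sigma'(z) = \\sigma(z)\\left(1-\\sigma(z)\\right), \\qquad \\tanh'(z) = 1 - \\tanh^2(z) = \\left(1-\\tanh(z)\\right)\\left(1+\\tanh(z)\\right).\n",
"\\end{equation}\n",
"\n",
"For example, $\\sigma(0) = 0.5$ and $\\sigma'(0) = 0.25$, while $\\tanh(0) = 0$ and $\\tanh'(0) = 1$. These closed forms are what the backward functions later in the notebook rely on."
]
},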
{
"cell_type": "code",
"execution_count": 5,
"metadata": {},
"outputs": [],
"source": [
"def sigmoid(Z):\n",
"    return 1/(1+np.exp(-Z))\n",
"\n",
"def relu(Z):\n",
"    return np.maximum(0,Z)\n",
"\n",
"def tanh(Z):\n",
"    # tanh(z) = 2*sigmoid(2z) - 1\n",
"    return 2.0/(1.0 + np.exp(-2*Z)) - 1.0\n",
"\n",
"def softmax(Z):\n",
"    # stable version: subtract the column-wise max before exponentiating\n",
"    # so that np.exp cannot overflow; each column (one example) sums to 1\n",
"    expZ = np.exp(Z - np.max(Z, axis=0, keepdims=True))\n",
"    return expZ / np.sum(expZ, axis=0, keepdims=True)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Forward Propagation\n",
"\n",
"At any given layer $l$, input values $A^{l-1}$ fanned out from the previous layer $l-1$ are linearly transformed using the weights connecting layer $l-1$ to $l$, $W^{l}$, and the bias of layer $l$, $b^{l}$, as follows \n",
"\n",
"\\begin{equation}\n",
"Z^l = W^{l}\\cdot A^{l-1} + b^l.\n",
"\\end{equation}\n",
"\n",
"$Z^l$ is the vector of values held at the nodes of layer $l$.\n",
"A non-linear transformation is applied to $Z^{l}$ using activation functions as follows\n",
"\n",
"\\begin{equation}\n",
"A^l = g^{l}\\left(Z^l\\right).\n",
"\\end{equation}"
]
},
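{
"cell_type": "markdown",
"metadata": {},
"source": [
"To make the shapes concrete, here is a sketch for the XOR network defined above, assuming (as in the training call further down) that the $m = 4$ examples are stored as the columns of the input matrix. The dimension annotations are added here for illustration only:\n",
"\n",
"\\begin{equation}\n",
"\\underbrace{Z^1}_{2\\times 4} = \\underbrace{W^1}_{2\\times 2}\\cdot\\underbrace{A^0}_{2\\times 4} + \\underbrace{b^1}_{2\\times 1}, \\qquad \\underbrace{Z^2}_{1\\times 4} = \\underbrace{W^2}_{1\\times 2}\\cdot\\underbrace{A^1}_{2\\times 4} + \\underbrace{b^2}_{1\\times 1}.\n",
"\\end{equation}\n",
"\n",
"The bias column vector $b^l$ is added to every column of $W^l\\cdot A^{l-1}$ (NumPy does this by broadcasting), and the activation $g^l$ is applied element-wise, so $A^l$ always has the same shape as $Z^l$. The final $A^2$ is a $1\\times 4$ row holding one predicted probability per example."
]
},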
{
"cell_type": "code",
"execution_count": 6,
"metadata": {},
"outputs": [],
"source": [
"def single_layer_forward_propagation(A_prev, W_curr, b_curr, activation=\"relu\"):\n",
"    # calculation of the input value for the activation function\n",
"    Z_curr = np.dot(W_curr, A_prev) + b_curr\n",
"\n",
"    # selection of activation function\n",
"    # (strings are compared with ==; `is` tests object identity and is unreliable here)\n",
"    if activation == \"relu\":\n",
"        activation_func = relu\n",
"    elif activation == \"sigmoid\":\n",
"        activation_func = sigmoid\n",
"    elif activation == \"tanh\":\n",
"        activation_func = tanh\n",
"    elif activation == \"softmax\":\n",
"        activation_func = softmax\n",
"    else:\n",
"        raise Exception('Non-supported activation function')\n",
"\n",
"    # return of calculated activation A and the intermediate Z matrix\n",
"    return activation_func(Z_curr), Z_curr\n",
"\n",
"def full_forward_propagation(X, params_values, nn_architecture):\n",
"\n",
"    # creating a temporary memory to store the information needed for a backward step\n",
"    memory = {}\n",
"\n",
"    # X vector is the activation for layer 0\n",
"    A_curr = X\n",
"\n",
"    # iteration over network layers\n",
"    for idx, layer in enumerate(nn_architecture):\n",
"        # we number network layers from 1\n",
"        layer_idx = idx + 1\n",
"        # transfer the activation from the previous iteration\n",
"        A_prev = A_curr\n",
"\n",
"        # extraction of the activation function for the current layer\n",
"        activ_function_curr = layer[\"activation\"]\n",
"        # extraction of W for the current layer\n",
"        W_curr = params_values[\"W\" + str(layer_idx)]\n",
"        # extraction of b for the current layer\n",
"        b_curr = params_values[\"b\" + str(layer_idx)]\n",
"        # calculation of activation for the current layer\n",
"        A_curr, Z_curr = single_layer_forward_propagation(A_prev, W_curr, b_curr, activ_function_curr)\n",
"\n",
"        # saving calculated values in the memory\n",
"        memory[\"A\" + str(idx)] = A_prev # what's coming into this layer from previous layer\n",
"        memory[\"Z\" + str(layer_idx)] = Z_curr\n",
"        memory[\"W\" + str(layer_idx)] = W_curr\n",
"        memory[\"b\" + str(layer_idx)] = b_curr\n",
"\n",
"    # return of prediction vector and a dictionary containing intermediate values\n",
"    return A_curr, memory"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
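"## Error Backpropagation and Gradient Descent\n",
"\n",
"During backpropagation the gradients of the cost $C$ with respect to the network weights are calculated for each layer $l$, starting from the output layer and moving backwards. For layer $l$,\n",
"\n",
"\\begin{equation}\n",
"\\frac{dC}{dW^l} = \\frac{dC}{dZ^l}\\cdot (A^{l-1})^T,\n",
"\\end{equation}\n",
"\n",
"where \n",
"\n",
"\\begin{equation}\n",
"\\frac{dC}{dZ^l} = \\left[(W^{l+1})^T \\cdot \\frac{dC}{dZ^{l+1}}\\right] \\odot g^{l '} \\left(Z^l\\right) = dA^l \\odot g^{l '} \\left(Z^l\\right).\n",
"\\end{equation}\n",
"\n",
"Here $dA^l = dC/dA^l$ is the gradient arriving from layer $l+1$ and $\\odot$ denotes the element-wise product. In the code below the gradients are additionally averaged over the $m$ training examples."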
]
},
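{
"cell_type": "markdown",
"metadata": {},
"source": [
"As a worked example of how the backward pass starts at the output layer, consider a single example with label $y$ and sigmoid output $\\hat{y} = \\sigma(z)$ under the cross-entropy cost. This is a standard derivation, sketched here for illustration; the symbols $y$, $\\hat{y}$ and $z$ are per-example scalars introduced for this sketch:\n",
"\n",
"\\begin{equation}\n",
"\\frac{dC}{d\\hat{y}} = -\\left(\\frac{y}{\\hat{y}} - \\frac{1-y}{1-\\hat{y}}\\right), \\qquad \\frac{d\\hat{y}}{dz} = \\hat{y}\\,(1-\\hat{y}),\n",
"\\end{equation}\n",
"\n",
"\\begin{equation}\n",
"\\frac{dC}{dz} = \\frac{dC}{d\\hat{y}}\\cdot\\frac{d\\hat{y}}{dz} = -\\left(y\\,(1-\\hat{y}) - (1-y)\\,\\hat{y}\\right) = \\hat{y} - y.\n",
"\\end{equation}\n",
"\n",
"For instance, with $y = 1$ and $\\hat{y} = 0.8$: $dC/d\\hat{y} = -1.25$, $d\\hat{y}/dz = 0.16$, and their product is $-0.2 = \\hat{y} - y$. The first factor is the initial gradient passed into the backward pass, and the second factor is what the sigmoid backward function multiplies in."
]
},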
{
"cell_type": "code",
"execution_count": 7,
"metadata": {},
"outputs": [],
"source": [
"def sigmoid_backward(dA, Z):\n",
"    sig = sigmoid(Z)\n",
"    return dA * sig * (1 - sig)\n",
"\n",
"def relu_backward(dA, Z):\n",
"    dZ = np.array(dA, copy = True)\n",
"    dZ[Z <= 0] = 0\n",
"    return dZ\n",
"\n",
"def tanh_backward(dA, Z):\n",
"    ta = tanh(Z)\n",
"    return dA * (1.0 - ta) * (1.0 + ta)\n",
"\n",
"def softmax_grad(x):\n",
"    # x here is the result of softmax applied to an input vector\n",
"    # Reshape the 1-d softmax to 2-d so that np.dot will do the matrix multiplication\n",
"    s = x.reshape(-1,1)\n",
"    return np.diagflat(s) - np.dot(s, s.T)\n",
"\n",
"def softmax_backward(dA, Z):\n",
"    # softmax_grad expects the softmax output, not Z, and its Jacobian must be\n",
"    # contracted with dA; per column this reduces to s * (dA - sum(dA * s))\n",
"    so = softmax(Z)\n",
"    return so * (dA - np.sum(dA * so, axis=0, keepdims=True))"
]
},
{
"cell_type": "code",
"execution_count": 8,
"metadata": {},
"outputs": [],
"source": [
"def single_layer_backward_propagation(dA_curr, W_curr, b_curr, Z_curr, A_prev, activation=\"relu\"):\n",
"    # number of examples\n",
"    m = A_prev.shape[1]\n",
"\n",
"    # selection of activation function\n",
"    # (strings are compared with ==; `is` tests object identity and is unreliable here)\n",
"    if activation == \"relu\":\n",
"        backward_activation_func = relu_backward\n",
"    elif activation == \"sigmoid\":\n",
"        backward_activation_func = sigmoid_backward\n",
"    elif activation == \"tanh\":\n",
"        backward_activation_func = tanh_backward\n",
"    elif activation == \"softmax\":\n",
"        backward_activation_func = softmax_backward\n",
"    else:\n",
"        raise Exception('Non-supported activation function')\n",
"\n",
"    # calculation of the activation function derivative\n",
"    dZ_curr = backward_activation_func(dA_curr, Z_curr)\n",
"\n",
"    # derivative of the matrix W\n",
"    dW_curr = np.dot(dZ_curr, A_prev.T) / m\n",
"    # derivative of the vector b\n",
"    db_curr = np.sum(dZ_curr, axis=1, keepdims=True) / m\n",
"    # derivative of the matrix A_prev\n",
"    dA_prev = np.dot(W_curr.T, dZ_curr)\n",
"\n",
"    return dA_prev, dW_curr, db_curr\n",
"\n",
"def full_backward_propagation(Y_hat, Y, memory, params_values, nn_architecture):\n",
"    grads_values = {}\n",
"\n",
"    # number of examples\n",
"    m = Y.shape[1]\n",
"    # a hack ensuring the same shape of the prediction vector and labels vector\n",
"    Y = Y.reshape(Y_hat.shape)\n",
"\n",
"    # initiation of gradient descent algorithm - dError to minimize\n",
"    dA_prev = - (np.divide(Y, Y_hat) - np.divide(1 - Y, 1 - Y_hat)) #cross entropy\n",
"\n",
"    for layer_idx_prev, layer in reversed(list(enumerate(nn_architecture))):\n",
"        # we number network layers from 1\n",
"        layer_idx_curr = layer_idx_prev + 1\n",
"        # extraction of the activation function for the current layer\n",
"        activ_function_curr = layer[\"activation\"]\n",
"\n",
"        dA_curr = dA_prev\n",
"\n",
"        A_prev = memory[\"A\" + str(layer_idx_prev)]\n",
"        Z_curr = memory[\"Z\" + str(layer_idx_curr)]\n",
"\n",
"        W_curr = params_values[\"W\" + str(layer_idx_curr)]\n",
"        b_curr = params_values[\"b\" + str(layer_idx_curr)]\n",
"\n",
"        dA_prev, dW_curr, db_curr = single_layer_backward_propagation(\n",
"            dA_curr, W_curr, b_curr, Z_curr, A_prev, activ_function_curr)\n",
"\n",
"        grads_values[\"dW\" + str(layer_idx_curr)] = dW_curr\n",
"        grads_values[\"db\" + str(layer_idx_curr)] = db_curr\n",
"\n",
"    return grads_values"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
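"Once the gradients have been computed by backpropagation, gradient descent moves each weight and bias a small step, scaled by the learning rate, against its gradient. Repeating this update over many iterations (epochs) reduces the cost function."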
]
},
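{
"cell_type": "markdown",
"metadata": {},
"source": [
"Written as an equation, the update applied to every layer $l$ is the plain gradient descent step below. The symbol $\\alpha$ for the learning rate is notation introduced here for illustration:\n",
"\n",
"\\begin{equation}\n",
"W^l \\leftarrow W^l - \\alpha\\,\\frac{dC}{dW^l}, \\qquad b^l \\leftarrow b^l - \\alpha\\,\\frac{dC}{db^l}.\n",
"\\end{equation}\n",
"\n",
"For example, a weight of $0.50$ with gradient $0.20$ and $\\alpha = 0.1$ becomes $0.50 - 0.1\\times 0.20 = 0.48$. A positive gradient means the cost grows with the weight, so the weight is decreased, and vice versa."
]
},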
{
"cell_type": "code",
"execution_count": 9,
"metadata": {},
"outputs": [],
"source": [
"def update(params_values, grads_values, nn_architecture, learning_rate):\n",
"\n",
"    # iteration over network layers\n",
"    for layer_idx, layer in enumerate(nn_architecture, 1):\n",
"        params_values[\"W\" + str(layer_idx)] -= learning_rate * grads_values[\"dW\" + str(layer_idx)]\n",
"        params_values[\"b\" + str(layer_idx)] -= learning_rate * grads_values[\"db\" + str(layer_idx)]\n",
"\n",
"    return params_values"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Monitoring the Learning Process\n",
"\n",
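"The loss (cost) function measures how far the network's predictions are from the target values. Tracking it, together with the accuracy, at every epoch shows whether training is converging."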
]
},
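{
"cell_type": "markdown",
"metadata": {},
"source": [
"For reference, the binary cross-entropy cost averaged over $m$ examples can be written as follows, where $y_i$ is the label and $\\hat{y}_i$ the predicted probability for example $i$ (per-example notation introduced here for illustration):\n",
"\n",
"\\begin{equation}\n",
"C = -\\frac{1}{m}\\sum_{i=1}^{m}\\left[\\,y_i \\log \\hat{y}_i + (1-y_i)\\log\\left(1-\\hat{y}_i\\right)\\right].\n",
"\\end{equation}\n",
"\n",
"Two reference points help when reading the cost curve. If every prediction is $\\hat{y}_i = 0.5$, each term contributes $-\\log 0.5$, so $C = \\ln 2 \\approx 0.693$; with small initial weights the sigmoid outputs are close to $0.5$, so the cost is expected to start near this value. A confident correct prediction such as $\\hat{y}_i = 0.99$ for $y_i = 1$ contributes only $-\\ln 0.99 \\approx 0.01$."
]
},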
{
"cell_type": "code",
"execution_count": 10,
"metadata": {},
"outputs": [],
"source": [
"# Cross-entropy loss function\n",
"def get_cost_value(Y_hat, Y):\n",
"    # number of examples\n",
"    m = Y_hat.shape[1]\n",
"    # calculation of the cost according to the formula\n",
"    cost = -1 / m * (np.dot(Y, np.log(Y_hat).T) + np.dot(1 - Y, np.log(1 - Y_hat).T))\n",
"    return np.squeeze(cost)\n",
"\n",
"def convert_prob_into_class(probs):\n",
"    probs_ = np.copy(probs)\n",
"    probs_[probs_ > 0.5] = 1\n",
"    probs_[probs_ <= 0.5] = 0\n",
"    return probs_\n",
"\n",
"def get_accuracy_value(Y_hat, Y):\n",
"    Y_hat_ = convert_prob_into_class(Y_hat)\n",
"    return (Y_hat_ == Y).all(axis=0).mean()\n",
"\n",
"def train(X, Y, nn_architecture, epochs, learning_rate, verbose=False, callback=None):\n",
"    # initiation of neural net parameters\n",
"    params_values = init_layers(nn_architecture, 2)\n",
"    # initiation of lists storing the history\n",
"    # of metrics calculated during the learning process\n",
"    cost_history = []\n",
"    accuracy_history = []\n",
"\n",
"    # Store weights over time (key -1 holds the initial weights)\n",
"    weights_over_time = {}\n",
"    for layer_idx, layer in enumerate(nn_architecture, 1):\n",
"        if layer_idx == 1:\n",
"            weights_over_time[-1] = params_values[\"W\" + str(layer_idx)].flatten()\n",
"        else:\n",
"            weights_over_time[-1] = np.append(weights_over_time[-1], params_values[\"W\" + str(layer_idx)].flatten())\n",
"\n",
"    # performing calculations for subsequent iterations\n",
"    for i in range(epochs):\n",
"        # step forward\n",
"        Y_hat, cache = full_forward_propagation(X, params_values, nn_architecture)\n",
"\n",
"        # calculating metrics and saving them in history\n",
"        cost = get_cost_value(Y_hat, Y)\n",
"        cost_history.append(cost)\n",
"        accuracy = get_accuracy_value(Y_hat, Y)\n",
"        accuracy_history.append(accuracy)\n",
"\n",
"        # step backward - calculating gradient\n",
"        grads_values = full_backward_propagation(Y_hat, Y, cache, params_values, nn_architecture)\n",
"        # updating model state\n",
"        params_values = update(params_values, grads_values, nn_architecture, learning_rate)\n",
"\n",
"        for layer_idx, layer in enumerate(nn_architecture, 1):\n",
"            if layer_idx == 1:\n",
"                weights_over_time[i] = params_values[\"W\" + str(layer_idx)].flatten()\n",
"            else:\n",
"                weights_over_time[i] = np.append(weights_over_time[i], params_values[\"W\" + str(layer_idx)].flatten())\n",
"\n",
"        if(i % 50 == 0):\n",
"            if(verbose):\n",
"                print(\"Iteration: {:05} - cost: {:.5f} - accuracy: {:.5f}\".format(i, cost, accuracy))\n",
"            if(callback is not None):\n",
"                callback(i, params_values)\n",
"\n",
"    return params_values, cost_history, accuracy_history, weights_over_time"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## XOR Again"
]
},
{
"cell_type": "code",
"execution_count": 11,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"array([[0, 0],\n",
" [0, 1],\n",
" [1, 0],\n",
" [1, 1]])"
]
},
"execution_count": 11,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"X_xor = X_xor.to_numpy()\n",
"y_xor = np.array(y_xor)\n",
"\n",
"X_xor"
]
},
{
"cell_type": "code",
"execution_count": 12,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"CPU times: user 13.5 s, sys: 68.2 ms, total: 13.6 s\n",
"Wall time: 13.6 s\n"
]
}
],
"source": [
"%time params_values, cost_history_bp, accuracy_history_bp, weights_over_time = train(np.transpose(X_xor), np.transpose(y_xor.reshape((y_xor.shape[0], 1))), NN_ARCHITECTURE, 100000, 0.1, verbose=False)"
]
},
{
"cell_type": "code",
"execution_count": 13,
"metadata": {},
"outputs": [],
"source": [
"import matplotlib.pyplot as plt\n",
"import matplotlib.axes as axes\n",
"%matplotlib inline"
]
},
{
"cell_type": "code",
"execution_count": 14,
"metadata": {},
"outputs": [
{
"data": {
"image/png": "\n",
"text/plain": [
"<Figure size 432x288 with 1 Axes>"
]
},
"metadata": {
"needs_background": "light"
},
"output_type": "display_data"
}
],
"source": [
"weights_over_time_lst = sorted(weights_over_time.items()) # sorted by key, return a list of tuples\n",
"\n",
"e, w = zip(*weights_over_time_lst) # unpack a list of pairs into two tuples\n",
"\n",
"plt.plot(e, w)\n",
"plt.show()"
]
},
{
"cell_type": "code",
"execution_count": 15,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"[<matplotlib.lines.Line2D at 0x11f032c50>]"
]
},
"execution_count": 15,
"metadata": {},
"output_type": "execute_result"
},
{
"data": {
"image/png": "iVBORw0KGgoAAAANSUhEUgAAAXcAAAD8CAYAAACMwORRAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADl0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uIDMuMC4zLCBodHRwOi8vbWF0cGxvdGxpYi5vcmcvnQurowAAEs9JREFUeJzt3X+MXWl93/H3J3YNDaFlyQ7V1j+wCU4qq4lYmC5LU6U0gcTLVjZSSLGTKtCSWEnqJmSrtraotu1WkTa0CjSq1axDqNIo4N1sI3B2B1kE6B+NYOPZsgHsxWEwm3hYmh1+hEStyGLy7R/3GO4OdzxnZu74zjzzfkmjOc9znjnne3yuP3Pm3HPuSVUhSWrLt0y6AEnS+BnuktQgw12SGmS4S1KDDHdJapDhLkkNMtwlqUGGuyQ1yHCXpAZtn9SKb7755tq7d++kVi9Jm9Kjjz76+aqaWm7cxMJ97969zM7OTmr1krQpJfmjPuM8LSNJDTLcJalBhrskNahXuCc5mORSkrkkJ0bMf1uSx7qvP0zyp+MvVZLU17JvqCbZBpwCXg3MA+eTnK2qi9fGVNXPD43/58Ct61CrJKmnPlfL3AbMVdVlgCRngMPAxSXGHwX+7XjKe6a/9ZYZvvK1bzxc5Nnbwid/4TXrsSpJ2tT6hPtO4MpQex54+aiBSV4I7AM+uPbSnmlxsAN85WvF3hMPj3tVknp64t47J12CltDnnHtG9C31bL4jwINV9bWRC0qOJZlNMruwsNC3RoBvCnZJk7f3xMMeYG1QfcJ9Htg91N4FPLnE2CPAu5daUFWdrqrpqpqemlr2BitJm4QBv/H0CffzwP4k+5LsYBDgZxcPSvJdwE3Ah8dboqTNwIDfWJYN96q6ChwHzgGPAw9U1YUk9yQ5NDT0KHCmqjx/Im1RBvzG0euzZapqBphZ1Hf3ova/G19ZkqS12DR3qPquvCT1N7FPhVwNA16aHE+5bC6b5shd0mQ9ce+dvQ6w/CWwMRjuklbEv6A3B8NdkhpkuEtSgwx3SWqQ4S5pxTzvvvEZ7pLUIMNd0tjt83LIiTPcJY2dHzA1eYa7JDXIcJekBhnuktQgw13Sqng55MZmuEtSgwx3SWqQ4S5JDTLcJalBhrukdfHik96lOkm9wj3JwSSXkswlObHEmH+U5GKSC0neNd4yJW02V71NdaKWfYZqkm3AKeDVwDxwPsnZqro4NGY/cBL43qr6UpIXrFfBkqTl9Tlyvw2Yq6rLVfU0cAY4vGjMTwKnqupLAFX11HjLlCStRJ9w3wlcGWrPd33DvhP4ziS/l+QjSQ6Oq0BJ0sr1CfeM6Ft8Nm07sB94JXAUeEeS533TgpJjSWaTzC4sLKy0VkkbjHepblx9wn0e2D3U3gU8OWLMe6vqq1X1GeASg7B/hqo6XVXTVTU9NTW12polScvoE+7ngf1J9iXZARwBzi4a8x7gHwAkuZnBaZrL4yxUktTfsuFeVVeB48A54HHggaq6kOSeJIe6YeeALyS5CHwI+JdV9YX1KlqSdH3LXgoJUFUzwMyivruHpgu4q/uSJE2Yd6hKWjc/9qsfnnQJW5bhLmnd/N6nvzjpErYsw12SGmS4S1KDDHdJapDhLmlNvEt1YzLcJalBhrskNchwl6QGGe6S1CDDXdK6evkvvH/SJWxJhrukdfUnf/70pEvYkgx3SWqQ4S5JDTLcJalBhrukNfMu1Y3HcJekBhnuktQgw12SGmS4S1KDeoV7koNJLiWZS3JixPw3JllI8lj39RPjL1WS1Nf25QYk2QacAl4NzAPnk5ytqouLht5fVcfXoUZJ0gr1OXK/DZirqstV9TRwBji8vmVJktaiT7jvBK4Mtee7vsV+OMnHkjyYZPeoBSU5lmQ2yezCwsIqypUk9dEn3DOirxa1fwfYW1XfA/wu8OujFlRVp6tquqqmp6amVlapJKm3PuE+Dwwfie8Cnh
weUFVfqKq/6Jq/CrxsPOVJasHeEw9PuoQtp0+4nwf2J9mXZAdwBDg7PCDJLUPNQ8Dj4ytRkrRSy14tU1VXkxwHzgHbgHdW1YUk9wCzVXUW+Nkkh4CrwBeBN65jzZKkZSwb7gBVNQPMLOq7e2j6JHByvKVJklbLO1QljcXbX/+SSZegIYa7pLF47a2jrpDWpBjuktQgw12SGmS4S1KDDHdJapDhLkkNMtwlqUGGuyQ1yHCXpAYZ7pLUIMNdkhpkuEtSgwx3SWqQ4S5JDTLcJalBhrskNchwl6QGGe6S1KBe4Z7kYJJLSeaSnLjOuNclqSTT4ytRkrRSy4Z7km3AKeAO4ABwNMmBEeOeC/ws8Mi4i5QkrUyfI/fbgLmqulxVTwNngMMjxv0H4K3AV8ZYnyRpFfqE+07gylB7vuv7uiS3Arur6qEx1iZJWqU+4Z4RffX1mcm3AG8D/sWyC0qOJZlNMruwsNC/Skmb3otPPjzpEraUPuE+D+weau8CnhxqPxf428D/TPIEcDtwdtSbqlV1uqqmq2p6ampq9VVL2nSu1vJjND59wv08sD/JviQ7gCPA2Wszq+rLVXVzVe2tqr3AR4BDVTW7LhVLkpa1bLhX1VXgOHAOeBx4oKouJLknyaH1LlCStHLb+wyqqhlgZlHf3UuMfeXay5K0Gb399S/hzfc/NukyhHeoShqj1966c/lBuiEMd0lqkOEuSQ0y3CWpQYa7JDXIcJekBhnuktQgw12SGmS4S1KDDHdJapDhLkkNMtwlqUGGuyQ1yHCXpAYZ7pLUIMNdkhpkuEtSgwx3SWqQ4S5JDTLcJalBvcI9ycEkl5LMJTkxYv5PJfl4kseS/K8kB8ZfqiSpr2XDPck24BRwB3AAODoivN9VVd9dVS8B3gr80tgrlST11ufI/TZgrqouV9XTwBng8PCAqvqzoeZzgBpfiZKkldreY8xO4MpQex54+eJBSf4ZcBewA/j+UQtKcgw4BrBnz56V1ipJ6qnPkXtG9H3TkXlVnaqq7wD+NfBvRi2oqk5X1XRVTU9NTa2sUklSb33CfR7YPdTeBTx5nfFngNeupShJ0tr0CffzwP4k+5LsAI4AZ4cHJNk/1LwT+NT4SpQkrdSy59yr6mqS48A5YBvwzqq6kOQeYLaqzgLHk7wK+CrwJeAN61m0JOn6+ryhSlXNADOL+u4emv65MdclSVoD71CVpAYZ7pJumPd89LOTLmHLMNwl3TBvvv+xSZewZRjuktQgw12SGmS4SxqrUbe068Yz3CWN1WfuvXPSJQjDXZKaZLhLUoMMd0lqkOEuSQ0y3CWpQYa7JDXIcJekBhnuktQgw12SGmS4S1KDDHdJapDhLkkN6hXuSQ4muZRkLsmJEfPvSnIxyceSfCDJC8dfqiSpr2XDPck24BRwB3AAOJrkwKJhHwWmq+p7gAeBt467UElSf32O3G8D5qrqclU9DZwBDg8PqKoPVdX/65ofAXaNt0xJ0kr0CfedwJWh9nzXt5Q3Ae9bS1GSpLXZ3mPMqAer1MiByT8GpoG/v8T8Y8AxgD179vQsUZK0Un2O3OeB3UPtXcCTiwcleRXwFuBQVf3FqAVV1emqmq6q6ampqdXUK0nqoU+4nwf2J9mXZAdwBDg7PCDJrcB9DIL9qfGXKUlaiWXDvaquAseBc8DjwANVdSHJPUkOdcP+I/BtwG8leSzJ2SUWJ0m6Afqcc6eqZoCZRX13D02/asx1SZLWwDtUJd1Qe088POkStgTDXZIaZLhLUoMMd0lqkOEuaez2v+A5ky5hyzPcJY3d++965aRL2PIMd0lqkOEuSQ0y3CWpQYa7JDXIcJekBhnuktQgw12SGmS4S1KDDHdJapDhLumG82N/15/hLkkNMtwlqUGGuyQ1yHCXpAb1CvckB5NcSjKX5MSI+d+X5H8nuZrkdeMvU9Jm88S9d066hC1t2XBPsg04BdwBHACOJjmwaNgfA28E3jXuAiVJK7e9x5jbgLmqug
yQ5AxwGLh4bUBVPdHN+8t1qFGStEJ9TsvsBK4Mtee7PknSBtUn3DOir1azsiTHkswmmV1YWFjNIiQ1whuZ1lefcJ8Hdg+1dwFPrmZlVXW6qqaranpqamo1i5Ak9dAn3M8D+5PsS7IDOAKcXd+yJElrsWy4V9VV4DhwDngceKCqLiS5J8khgCR/J8k88CPAfUkurGfRkqTr63O1DFU1A8ws6rt7aPo8g9M1kqQNwDtUJa2b5W5k8k3V9WO4S1KDDHdJapDhLkkNMtwlTZTn3deH4S5pXfnpkJNhuEtSgwx3SRPnqZnxM9wlqUGGu6R11+e8u0fv42W4S9owDPjxMdwlbSgG/HgY7pJuiJVcEmnAr53hLmlD2nviYUN+DQx3STfMam5oMuRXJ1Wrehzqmk1PT9fs7OxE1i1pstYa1lv5rtckj1bV9LLjDHdJkzDuo/GtEviGu6QN70afbmnhF4DhLmlT2Kzn0yf1i2Ks4Z7kIPCfgW3AO6rq3kXznwX8d+BlwBeA11fVE9dbpuEu6ZrNGvDjsNJfEn3DfdmrZZJsA04BdwAHgKNJDiwa9ibgS1X1YuBtwC+uqFpJW9oT997ZxCmT1VivX2x9LoW8DZirqstV9TRwBji8aMxh4Ne76QeBH0iS8ZUpaSvYyiE/btt7jNkJXBlqzwMvX2pMVV1N8mXg24HPj6NISVvLcMBv5VM2a9En3EcdgS8+Ud9nDEmOAccA9uzZ02PVkra6UUfyBv7y+oT7PLB7qL0LeHKJMfNJtgN/Hfji4gVV1WngNAzeUF1NwZLkRwgvr0+4nwf2J9kHfBY4AvzoojFngTcAHwZeB3ywJnWNpSQx3ksV1/MXxXq9x7BsuHfn0I8D5xhcCvnOqrqQ5B5gtqrOAr8G/EaSOQZH7EfWpVpJmoDN+CZvnyN3qmoGmFnUd/fQ9FeAHxlvaZKk1fJTISWpQYa7JDXIcJekBhnuktSgiX0qZJIF4I9W+eM3s/XufnWbtwa3eWtYyza/sKqmlhs0sXBfiySzfT4VrSVu89bgNm8NN2KbPS0jSQ0y3CWpQZs13E9PuoAJcJu3Brd5a1j3bd6U59wlSde3WY/cJUnXsenCPcnBJJeSzCU5Mel6ViLJ7iQfSvJ4kgtJfq7rf36S9yf5VPf9pq4/SX6529aPJXnp0LLe0I3/VJI3DPW/LMnHu5/55Y3yRKwk25J8NMlDXXtfkke6+u9PsqPrf1bXnuvm7x1axsmu/1KSHxrq33CviSTPS/Jgkk92+/sVre/nJD/fva4/keTdSZ7d2n5O8s4kTyX5xFDfuu/XpdZxXVW1ab4YfCrlp4EXATuAPwAOTLquFdR/C/DSbvq5wB8yeC7tW4ETXf8J4Be76dcA72PwMJTbgUe6/ucDl7vvN3XTN3Xzfh94Rfcz7wPumPR2d3XdBbwLeKhrPwAc6aZ/BfjpbvpngF/ppo8A93fTB7r9/SxgX/c62LZRXxMMHjv5E930DuB5Le9nBk9j+wzwV4f27xtb28/A9wEvBT4x1Lfu+3WpdVy31kn/J1jhP+wrgHND7ZPAyUnXtYbteS/wauAScEvXdwtwqZu+Dzg6NP5SN/8ocN9Q/31d3y3AJ4f6nzFugtu5C/gA8P3AQ90L9/PA9sX7lcFHS7+im97ejcvifX1t3EZ8TQB/rQu6LOpvdj/zjUdtPr/bbw8BP9Tifgb28sxwX/f9utQ6rve12U7LjHqe684J1bIm3Z+htwKPAH+jqj4H0H1/QTdsqe29Xv/8iP5Jezvwr4C/7NrfDvxpVV3t2sN1PuN5vMC15/Gu9N9ikl4ELAD/rTsV9Y4kz6Hh/VxVnwX+E/DHwOcY7LdHaXs/X3Mj9utS61jSZgv3Xs9q3eiSfBvwP4A3V9WfXW/oiL5aRf/EJPmHwFNV9ehw94ihtcy8TbPNDI5EXwr816q6Ffi/DP6UXsqm3+buHPBhBqdS/ibwHOCOEUNb2s/Lmeg2brZw7/M81w0tyV9hEO
y/WVW/3XX/SZJbuvm3AE91/Utt7/X6d43on6TvBQ4leQI4w+DUzNuB52XwvF14Zp1f37Y883m8K/23mKR5YL6qHunaDzII+5b386uAz1TVQlV9Ffht4O/S9n6+5kbs16XWsaTNFu5ff55r9677EQbPb90Uune+fw14vKp+aWjWtWfQ0n1/71D/j3fvut8OfLn7k+wc8INJbuqOmH6QwfnIzwF/nuT2bl0/PrSsiaiqk1W1q6r2MthfH6yqHwM+xOB5u/DN23zt32L4ebxngSPdVRb7gP0M3nzacK+Jqvo/wJUk39V1/QBwkYb3M4PTMbcn+daupmvb3Ox+HnIj9utS61jaJN+EWeWbGa9hcJXJp4G3TLqeFdb+9xj8mfUx4LHu6zUMzjV+APhU9/353fgAp7pt/TgwPbSsfwrMdV//ZKh/GvhE9zP/hUVv6k14+1/JN66WeRGD/7RzwG8Bz+r6n92157r5Lxr6+bd023WJoatDNuJrAngJMNvt6/cwuCqi6f0M/Hvgk11dv8Hgipem9jPwbgbvKXyVwZH2m27Efl1qHdf78g5VSWrQZjstI0nqwXCXpAYZ7pLUIMNdkhpkuEtSgwx3SWqQ4S5JDTLcJalB/x+XetQZACV5OgAAAABJRU5ErkJggg==\n",
"text/plain": [
"<Figure size 432x288 with 1 Axes>"
]
},
"metadata": {
"needs_background": "light"
},
"output_type": "display_data"
}
],
"source": [
"plt.plot(range(len(cost_history_bp)), cost_history_bp, 'o-.')"
]
},
{
"cell_type": "code",
"execution_count": 16,
"metadata": {},
"outputs": [],
"source": [
"# Prediction\n",
"Y_xor_hat, _ = full_forward_propagation(np.transpose(X_xor), params_values, NN_ARCHITECTURE)"
]
},
{
"cell_type": "code",
"execution_count": 17,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Test set accuracy: 1.00\n"
]
}
],
"source": [
"# Accuracy on the four XOR points used for training (no separate test set is held out here)\n",
"acc_xor = get_accuracy_value(Y_xor_hat, np.transpose(y_xor.reshape((y_xor.shape[0], 1))))\n",
"print(\"Test set accuracy: {:.2f}\".format(acc_xor))"
]
},
{
"cell_type": "code",
"execution_count": 18,
"metadata": {},
"outputs": [],
"source": [
"from sklearn.metrics import classification_report, confusion_matrix"
]
},
{
"cell_type": "code",
"execution_count": 19,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
" precision recall f1-score support\n",
"\n",
" 0.0 1.00 1.00 1.00 2\n",
" 1.0 1.00 1.00 1.00 2\n",
"\n",
" micro avg 1.00 1.00 1.00 4\n",
" macro avg 1.00 1.00 1.00 4\n",
"weighted avg 1.00 1.00 1.00 4\n",
"\n",
"[[2 0]\n",
" [0 2]]\n"
]
}
],
"source": [
"# scikit-learn expects the true labels first, then the predictions\n",
"y_xor_pred = convert_prob_into_class(Y_xor_hat).reshape(y_xor.shape)\n",
"print(classification_report(y_xor, y_xor_pred))\n",
"print(confusion_matrix(y_xor, y_xor_pred))"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": []
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.7.3"
},
"toc": {
"base_numbering": 1,
"nav_menu": {},
"number_sections": true,
"sideBar": true,
"skip_h1_title": false,
"title_cell": "Table of Contents",
"title_sidebar": "Contents",
"toc_cell": false,
"toc_position": {},
"toc_section_display": true,
"toc_window_display": false
}
},
"nbformat": 4,
"nbformat_minor": 2
}