I'm reading "Grokking deep learning" - I've enjoyed chapters 1-4 but Chapter 5's speed, mistakes and omitted materials have lead to a bit of a roadblock for me.
I believe I've managed to create a simple neural net, that uses gradient descent to learn but when I use the raw MNIST dataset, without transforming the pixel values from large numbers (numbers above 1) to binary (0,1) then I can't train my model, I get "double scalar" errors and all sorts of weird things (see below for examples).
I was wondering if you could look over my code, tell me where I've went wrong (tweet me: @bunsen or email: auston.bunsen@gmail.com) & if you'd like you can use this code for free, under the MIT license.
Hey! Your training code looks good but I can't tell if the second set of testing code is correct.