Skip to content

Instantly share code, notes, and snippets.

@hughperkins
Created July 25, 2016 03:41
Show Gist options
  • Save hughperkins/b13f07e553dea1b742dc73b6686f367e to your computer and use it in GitHub Desktop.
Save hughperkins/b13f07e553dea1b742dc73b6686f367e to your computer and use it in GitHub Desktop.
ubuntu@peach:~/git/DeepCL/build$ ./deepcl_unittests tests=testlogicaloperators.Convolve_2layers_relu_Xor
args: ./deepcl_unittests --gtest_filter=testlogicaloperators.Convolve_2layers_relu_Xor
Note: Google Test filter = testlogicaloperators.Convolve_2layers_relu_Xor
[==========] Running 1 test from 1 test case.
[----------] Global test environment set-up.
[----------] 1 test from testlogicaloperators
[ RUN ] testlogicaloperators.Convolve_2layers_relu_Xor
Xor, convolve
Using NVIDIA Corporation , OpenCL platform: NVIDIA CUDA
Using OpenCL device: GeForce 940M
initializing clblas
hand-setting weights...
statefultimer v0.7
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
... seems valid
ForwardAuto: kernel 1 0ms
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
... seems valid
ForwardAuto: kernel 1 0ms
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
... seems valid
BackwardAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
... seems valid
BackpropWeightsAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
... seems valid
BackpropWeightsAuto: kernel 1 0ms
Loss L 0.152638
forward try kernel 2
... seems valid
ForwardAuto: kernel 2 0ms
forward try kernel 2
... seems valid
ForwardAuto: kernel 2 0ms
backward try kernel 2
... seems valid
BackwardAuto: kernel 2 0ms
calcGradWeights try kernel 2
... seems valid
BackpropWeightsAuto: kernel 2 0ms
calcGradWeights try kernel 2
... seems valid
BackpropWeightsAuto: kernel 2 0ms
forward try kernel 3
... seems valid
ForwardAuto: kernel 3 0ms
forward try kernel 3
... seems valid
ForwardAuto: kernel 3 0ms
backward try kernel 3
... seems valid
BackwardAuto: kernel 3 5ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=2 -D gInputPlanes=2 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
... seems valid
BackpropWeightsAuto: kernel 3 0ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=2 -D gInputPlanes=2 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
... seems valid
BackpropWeightsAuto: kernel 3 0ms
forward try kernel 4
... seems valid
ForwardAuto: kernel 4 0ms
forward try kernel 4
... seems valid
ForwardAuto: kernel 4 0ms
backward kernel 0: cannot be used
backward kernel 1 time: 0ms
backward kernel 2 time: 0ms
backward kernel 3 time: 5ms
backward layer selected kernel 1
calcGradWeights try kernel 4
... seems valid
BackpropWeightsAuto: kernel 4 4ms
calcGradWeights try kernel 4
... seems valid
BackpropWeightsAuto: kernel 4 4ms
forward try kernel 5
... seems valid
ForwardAuto: kernel 5 0ms
forward try kernel 5
... seems valid
ForwardAuto: kernel 5 0ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 0ms
calcGradWeights kernel 2 time: 0ms
calcGradWeights kernel 3 time: 0ms
calcGradWeights kernel 4 time: 4ms
calcGradWeights layer selected kernel 1
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 0ms
calcGradWeights kernel 2 time: 0ms
calcGradWeights kernel 3 time: 0ms
calcGradWeights kernel 4 time: 4ms
calcGradWeights layer selected kernel 1
forward try kernel 6
... seems valid
ForwardAuto: kernel 6 0ms
forward try kernel 6
... seems valid
ForwardAuto: kernel 6 0ms
Loss L 0.00640068
forward try kernel 7
... seems valid
ForwardAuto: kernel 7 2ms
forward try kernel 7
... seems valid
ForwardAuto: kernel 7 2ms
forward kernel 0: cannot be used
forward kernel 1 time: 0ms
forward kernel 2 time: 0ms
forward kernel 3 time: 0ms
forward kernel 4 time: 0ms
forward kernel 5 time: 0ms
forward kernel 6 time: 0ms
forward kernel 7 time: 2ms
forward layer selected kernel 1
forward kernel 0: cannot be used
forward kernel 1 time: 0ms
forward kernel 2 time: 0ms
forward kernel 3 time: 0ms
forward kernel 4 time: 0ms
forward kernel 5 time: 0ms
forward kernel 6 time: 0ms
forward kernel 7 time: 2ms
forward layer selected kernel 1
Loss L 0.00139435
Loss L 0.000383307
Loss L 0.000117079
Loss L 4.63626e-05
Loss L 1.8873e-05
Loss L 7.15534e-06
Loss L 2.83958e-06
Loss L 1.12727e-06
Loss L 4.44109e-07
Loss L 1.72233e-07
Loss L 6.82345e-08
Loss L 2.76343e-08
Loss L 1.04286e-08
Loss L 4.13357e-09
Loss L 1.67201e-09
Loss L 6.29148e-10
Loss L 2.4837e-10
Loss L 1.00833e-10
Loss L 3.80673e-11
Loss L 1.5131e-11
Loss L 5.84421e-12
Loss L 2.16893e-12
Loss L 9.52127e-13
Loss L 3.58824e-13
Loss L 1.56319e-13
Loss L 9.9476e-14
Loss L 9.9476e-14
Loss L 9.9476e-14
Loss L 9.9476e-14
Loss L 9.23706e-14
Loss L 9.23706e-14
Loss L 9.41469e-14
Loss L 8.70415e-14
Loss L 9.41469e-14
Loss L 8.52651e-14
Loss L 8.52651e-14
Loss L 8.52651e-14
Loss L 8.52651e-14
layer 0:InputLayer{ outputPlanes=2 outputSize=1 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=1 numFilters=2 filterSize=1 outputSize=1 padZeros=0 biased=1 skip=0} }
layer 2:ActivationLayer{ RELU }
layer 3:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=1 numFilters=2 filterSize=1 outputSize=1 padZeros=0 biased=1 skip=0} }
layer 4:ActivationLayer{ RELU }
layer 5:SquareLossLayer{}
Parameters overview: (skipping 4 layers with 0 params)
layer 1: params=6 50.0%
layer 3: params=6 50.0%
TOTAL : params=12
accuracy: 4/4 100%
loss, E, 8.52651e-14
clblas teardown
[ OK ] testlogicaloperators.Convolve_2layers_relu_Xor (297 ms)
[----------] 1 test from testlogicaloperators (297 ms total)
[----------] Global test environment tear-down
[==========] 1 test from 1 test case ran. (297 ms total)
[ PASSED ] 1 test.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment