Skip to content

Instantly share code, notes, and snippets.

@jakakonda
Created October 30, 2015 08:49
Show Gist options
  • Save jakakonda/9e5c83ca10c7b1f64968 to your computer and use it in GitHub Desktop.
Save jakakonda/9e5c83ca10c7b1f64968 to your computer and use it in GitHub Desktop.
args: deepcl_unittests.exe --gtest_filter=-SLOW*
Note: Google Test filter = -SLOW*
[==========] Running 160 tests from 29 test cases.
[----------] Global test environment set-up.
[----------] 8 tests from testClBlas
[ RUN ] testClBlas.basic
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
clblas teardown
[ OK ] testClBlas.basic (503 ms)
[ RUN ] testClBlas.basicthrice
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
clblas teardown
initializing clblas
clblas teardown
initializing clblas
clblas teardown
[ OK ] testClBlas.basicthrice (157 ms)
[ RUN ] testClBlas.transA
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
1 2 9
3 7 5
initializing clblas
clblas teardown
[ OK ] testClBlas.transA (57 ms)
[ RUN ] testClBlas.transB
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
3
-1
initializing clblas
clblas teardown
[ OK ] testClBlas.transB (57 ms)
[ RUN ] testClBlas.colMajor
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
&AWrap->getBuffer()676423513200
&BWrap->getBuffer()676423512720
&CWrap->getBuffer()676423511520
clblas teardown
[ OK ] testClBlas.colMajor (57 ms)
[ RUN ] testClBlas.colMajor2
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
clblas teardown
[ OK ] testClBlas.colMajor2 (58 ms)
[ RUN ] testClBlas.colMajorTransA
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
clblas teardown
[ OK ] testClBlas.colMajorTransA (58 ms)
[ RUN ] testClBlas.colMajorTransB
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
clblas teardown
[ OK ] testClBlas.colMajorTransB (57 ms)
[----------] 8 tests from testClBlas (1006 ms total)
[----------] 1 test from testDeepCL
[ RUN ] testDeepCL.basic
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
expected number of output: 4
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
clblas teardown
[ OK ] testDeepCL.basic (266 ms)
[----------] 1 test from testDeepCL (266 ms total)
[----------] 23 tests from testupdateweights
[ RUN ] testupdateweights.conv1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
layer 0:InputLayer{ outputPlanes=2 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=5 numFilters=2 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:SquareLossLayer{}
layer 0:InputLayer{ outputPlanes=2 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=5 numFilters=2 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:SquareLossLayer{}
batchSize: 4
inputtotalsize=200 outputTotalSize=72
layer ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=5 numFilters=2 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
weightsize=36 biassize=0
statefultimer v0.7
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
layer 0:InputLayer{ outputPlanes=2 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=5 numFilters=2 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:SquareLossLayer{}
Parameters overview: (skipping 2 layers with 0 params)
layer 1: params=36 100.0%
TOTAL : params=36
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
idx=8 predicted losschange=0.000111445 actual=0.000112534
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 0ms
idx=13 predicted losschange=-0.000886715 actual=-0.000884056
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
idx=0 predicted losschange=0.000210491 actual=0.000212669
forward try kernel 5
ForwardAuto: kernel 5: this instance cant be used: For ForwardFc, filtersize and inputimagesize must be identical
... not valid
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
idx=22 predicted losschange=-0.000164224 actual=-0.000163078
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 120ms
idx=22 predicted losschange=-0.000164224 actual=-0.000163078
forward kernel 0: cannot be used
forward kernel 1 time: 0ms
forward kernel 2 time: 0ms
forward kernel 3 time: 0ms
forward kernel 4 time: 1ms
forward kernel 5: cannot be used
forward kernel 6 time: 1ms
forward kernel 7 time: 120ms
forward layer selected kernel 1
idx=35 predicted losschange=-0.000391028 actual=-0.000391006
idx=26 predicted losschange=2.23142e-05 actual=2.57492e-05
idx=27 predicted losschange=9.38328e-05 actual=9.44138e-05
idx=27 predicted losschange=9.38328e-05 actual=9.44138e-05
idx=10 predicted losschange=0.00186697 actual=0.00187111
clblas teardown
[ OK ] testupdateweights.conv1 (830 ms)
[ RUN ] testupdateweights.conv1z
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
layer 0:InputLayer{ outputPlanes=2 outputSize=3 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=3 numFilters=2 filterSize=3 outputSize=3 padZeros=1 biased=0 skip=0} }
layer 2:SquareLossLayer{}
layer 0:InputLayer{ outputPlanes=2 outputSize=3 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=3 numFilters=2 filterSize=3 outputSize=3 padZeros=1 biased=0 skip=0} }
layer 2:SquareLossLayer{}
batchSize: 4
inputtotalsize=72 outputTotalSize=72
layer ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=3 numFilters=2 filterSize=3 outputSize=3 padZeros=1 biased=0 skip=0} }
weightsize=36 biassize=0
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
layer 0:InputLayer{ outputPlanes=2 outputSize=3 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=3 numFilters=2 filterSize=3 outputSize=3 padZeros=1 biased=0 skip=0} }
layer 2:SquareLossLayer{}
Parameters overview: (skipping 2 layers with 0 params)
layer 1: params=36 100.0%
TOTAL : params=36
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
idx=8 predicted losschange=0.00039831 actual=0.000397682
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 2ms
idx=13 predicted losschange=-0.000426502 actual=-0.000426292
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 0ms
idx=0 predicted losschange=0.000143287 actual=0.000144005
forward try kernel 5
ForwardAuto: kernel 5: this instance cant be used: For ForwardFc, padzeros must be disabled
... not valid
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
idx=22 predicted losschange=-1.7916e-06 actual=0
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 119ms
idx=22 predicted losschange=-1.7916e-06 actual=0
forward kernel 0: cannot be used
forward kernel 1 time: 0ms
forward kernel 2 time: 1ms
forward kernel 3 time: 2ms
forward kernel 4 time: 0ms
forward kernel 5: cannot be used
forward kernel 6 time: 1ms
forward kernel 7 time: 119ms
forward layer selected kernel 1
idx=35 predicted losschange=-2.82565e-05 actual=-2.76566e-05
idx=26 predicted losschange=3.62191e-05 actual=3.71933e-05
idx=27 predicted losschange=-0.000319862 actual=-0.000317574
idx=27 predicted losschange=-0.000319862 actual=-0.000317574
idx=10 predicted losschange=-0.000883857 actual=-0.000883102
clblas teardown
[ OK ] testupdateweights.conv1z (767 ms)
[ RUN ] testupdateweights.numericallytest
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
layer 0:InputLayer{ outputPlanes=1 outputSize=1 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=1 numFilters=1 filterSize=1 outputSize=1 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=1 100.0%
TOTAL : params=1
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
layer 0:InputLayer{ outputPlanes=1 outputSize=1 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=1 numFilters=1 filterSize=1 outputSize=1 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=1 100.0%
TOTAL : params=1
loss 0.0367983 loss2 0.0367913 change: 7.01472e-06
sumweightsdiff -0.000264842
loss change 7.01472e-06
estimatedLossChangeFromW 7.01413e-06
[ OK ] testupdateweights.numericallytest (555 ms)
[ RUN ] testupdateweights.numericallytest_imagesize3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 2ms
layer 0:InputLayer{ outputPlanes=1 outputSize=3 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=3 numFilters=1 filterSize=1 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=1 100.0%
TOTAL : params=1
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 2ms
layer 0:InputLayer{ outputPlanes=1 outputSize=3 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=3 numFilters=1 filterSize=1 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=1 100.0%
TOTAL : params=1
loss 1.23358 loss2 1.21612 change: 0.0174605
sumweightsdiff -0.0132709
loss change 0.0174605
estimatedLossChangeFromW 0.0176118
[ OK ] testupdateweights.numericallytest_imagesize3 (562 ms)
[ RUN ] testupdateweights.numericallytest_imagesize5
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
layer 0:InputLayer{ outputPlanes=1 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=5 numFilters=1 filterSize=1 outputSize=5 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=1 100.0%
TOTAL : params=1
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 2ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 0ms
layer 0:InputLayer{ outputPlanes=1 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=5 numFilters=1 filterSize=1 outputSize=5 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=1 100.0%
TOTAL : params=1
loss 4.12958 loss2 4.11952 change: 0.0100665
sumweightsdiff -0.0101708
loss change 0.0100665
estimatedLossChangeFromW 0.0103444
[ OK ] testupdateweights.numericallytest_imagesize5 (582 ms)
[ RUN ] testupdateweights.numericallytest_imagesize9
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
layer 0:InputLayer{ outputPlanes=1 outputSize=9 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=9 numFilters=1 filterSize=1 outputSize=9 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=1 100.0%
TOTAL : params=1
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 0ms
layer 0:InputLayer{ outputPlanes=1 outputSize=9 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=9 numFilters=1 filterSize=1 outputSize=9 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=1 100.0%
TOTAL : params=1
loss 13.4341 loss2 13.4339 change: 0.000207901
sumweightsdiff 0.00153953
loss change 0.000207901
estimatedLossChangeFromW 0.000237015
[ OK ] testupdateweights.numericallytest_imagesize9 (602 ms)
[ RUN ] testupdateweights.numericallytest_imagesize9_filtersize9
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
layer 0:InputLayer{ outputPlanes=1 outputSize=9 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=9 numFilters=1 filterSize=9 outputSize=1 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=81 100.0%
TOTAL : params=81
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 0ms
layer 0:InputLayer{ outputPlanes=1 outputSize=9 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=9 numFilters=1 filterSize=9 outputSize=1 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=81 100.0%
TOTAL : params=81
loss 0.135896 loss2 0.0848782 change: 0.0510182
sumweightsdiff -0.0322406
loss change 0.0510182
estimatedLossChangeFromW 0.0555841
[ OK ] testupdateweights.numericallytest_imagesize9_filtersize9 (614 ms)
[ RUN ] testupdateweights.numericallytest_imagesize9_filtersize3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
layer 0:InputLayer{ outputPlanes=1 outputSize=9 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=9 numFilters=1 filterSize=3 outputSize=7 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=9 100.0%
TOTAL : params=9
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 0ms
layer 0:InputLayer{ outputPlanes=1 outputSize=9 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=9 numFilters=1 filterSize=3 outputSize=7 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=9 100.0%
TOTAL : params=9
loss 7.70633 loss2 7.41581 change: 0.290529
sumweightsdiff -0.0898813
loss change 0.290529
estimatedLossChangeFromW 0.316231
[ OK ] testupdateweights.numericallytest_imagesize9_filtersize3 (619 ms)
[ RUN ] testupdateweights.numericallytest_imagesize3_filtersize3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 2ms
layer 0:InputLayer{ outputPlanes=1 outputSize=3 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=3 numFilters=1 filterSize=3 outputSize=1 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=9 100.0%
TOTAL : params=9
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 2ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 2ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
layer 0:InputLayer{ outputPlanes=1 outputSize=3 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=3 numFilters=1 filterSize=3 outputSize=1 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=9 100.0%
TOTAL : params=9
loss 0.0719101 loss2 0.0694461 change: 0.00246408
sumweightsdiff -0.0110647
loss change 0.00246408
estimatedLossChangeFromW 0.00248372
[ OK ] testupdateweights.numericallytest_imagesize3_filtersize3 (563 ms)
[ RUN ] testupdateweights.numericallytest_imagesize5_filtersize3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
layer 0:InputLayer{ outputPlanes=1 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=9 100.0%
TOTAL : params=9
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
layer 0:InputLayer{ outputPlanes=1 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=9 100.0%
TOTAL : params=9
loss 1.20022 loss2 1.17241 change: 0.0278131
sumweightsdiff -0.0203888
loss change 0.0278131
estimatedLossChangeFromW 0.0280929
[ OK ] testupdateweights.numericallytest_imagesize5_filtersize3 (576 ms)
[ RUN ] testupdateweights.numericallytest_imagesize5_filtersize3_batchsize3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 2ms
layer 0:InputLayer{ outputPlanes=1 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=9 100.0%
TOTAL : params=9
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 2ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 0ms
layer 0:InputLayer{ outputPlanes=1 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=9 100.0%
TOTAL : params=9
loss 4.97142 loss2 4.78768 change: 0.183745
sumweightsdiff -0.056004
loss change 0.183745
estimatedLossChangeFromW 0.193264
[ OK ] testupdateweights.numericallytest_imagesize5_filtersize3_batchsize3 (571 ms)
[ RUN ] testupdateweights.numericallytest_imagesize5_filtersize3_planes3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
layer 0:InputLayer{ outputPlanes=3 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=3 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=27 100.0%
TOTAL : params=27
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 2ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 0ms
layer 0:InputLayer{ outputPlanes=3 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=3 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=27 100.0%
TOTAL : params=27
loss 1.08887 loss2 0.9575 change: 0.13137
sumweightsdiff -0.00764532
loss change 0.13137
estimatedLossChangeFromW 0.134379
[ OK ] testupdateweights.numericallytest_imagesize5_filtersize3_planes3 (630 ms)
[ RUN ] testupdateweights.numericallytest_imagesize5_filtersize3_planes3_batchsize3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
layer 0:InputLayer{ outputPlanes=3 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=3 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=27 100.0%
TOTAL : params=27
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
layer 0:InputLayer{ outputPlanes=3 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=3 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=27 100.0%
TOTAL : params=27
loss 4.76631 loss2 4.18154 change: 0.584769
sumweightsdiff 0.029606
loss change 0.584769
estimatedLossChangeFromW 0.620442
[ OK ] testupdateweights.numericallytest_imagesize5_filtersize3_planes3_batchsize3 (628 ms)
[ RUN ] testupdateweights.backprop_weights_2
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=1 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
mismatch for i 0
[ OK ] testupdateweights.backprop_weights_2 (53 ms)
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize2
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=2 -D gInputSizeSquared=4 -D gNumFilters=1 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=2 -D gOutputSizeSquared=4 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=2 -DgInputStripeOuterNumRows=2 -DgInputStripeInnerSize=4 -DgInputStripeOuterSize=4 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=2 -DgOutputStripeSize=4
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
mismatch for i 0
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize2 (61 ms)
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize3_filtersize3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=3 -D gInputSizeSquared=9 -D gNumFilters=1 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=3 -DgInputStripeOuterNumRows=7 -DgInputStripeInnerSize=9 -DgInputStripeOuterSize=21 -DgInputStripeMarginSize=6 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
mismatch for i 0
mismatch for i 1
mismatch for i 2
mismatch for i 3
mismatch for i 4
mismatch for i 5
mismatch for i 6
mismatch for i 7
mismatch for i 8
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize3_filtersize3 (57 ms)
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize4_filtersize3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=4 -D gInputSizeSquared=16 -D gNumFilters=1 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=2 -D gOutputSizeSquared=4 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=4 -DgInputStripeOuterNumRows=8 -DgInputStripeInnerSize=16 -DgInputStripeOuterSize=32 -DgInputStripeMarginSize=8 -DgOutputStripeNumRows=2 -DgOutputStripeSize=4
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
mismatch for i 0
mismatch for i 8
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize4_filtersize3 (64 ms)
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize5_filtersize3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=5 -D gInputSizeSquared=25 -D gNumFilters=1 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=3 -D gOutputSizeSquared=9 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=5 -DgInputStripeOuterNumRows=9 -DgInputStripeInnerSize=25 -DgInputStripeOuterSize=45 -DgInputStripeMarginSize=10 -DgOutputStripeNumRows=3 -DgOutputStripeSize=9
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
mismatch for i 0
mismatch for i 4
mismatch for i 8
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize5_filtersize3 (75 ms)
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize3_filtersize1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=3 -D gInputSizeSquared=9 -D gNumFilters=1 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=3 -D gOutputSizeSquared=9 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=3 -DgInputStripeOuterNumRows=3 -DgInputStripeInnerSize=9 -DgInputStripeOuterSize=9 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=3 -DgOutputStripeSize=9
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
mismatch for i 0
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize3_filtersize1 (65 ms)
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize16_filtersize1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=16 -D gInputSizeSquared=256 -D gNumFilters=1 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=16 -D gOutputSizeSquared=256 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=16 -DgInputStripeOuterNumRows=16 -DgInputStripeInnerSize=256 -DgInputStripeOuterSize=256 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=16 -DgOutputStripeSize=256
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
mismatch for i 0
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize16_filtersize1 (111 ms)
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize17_filtersize1
LayerDimensions{ inputPlanes=1 inputSize=17 numFilters=1 filterSize=1 outputSize=17 padZeros=0 biased=0 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=17 -D gInputSizeSquared=289 -D gNumFilters=1 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=17 -D gOutputSizeSquared=289 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=17 -DgInputStripeOuterNumRows=17 -DgInputStripeInnerSize=289 -DgInputStripeOuterSize=289 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=17 -DgOutputStripeSize=289
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
mismatch for i 0
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize17_filtersize1 (90 ms)
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize17_filtersize1_moredata
expectedresult: -958.715
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=17 -D gInputSizeSquared=289 -D gNumFilters=1 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=17 -D gOutputSizeSquared=289 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=17 -DgInputStripeOuterNumRows=17 -DgInputStripeInnerSize=289 -DgInputStripeOuterSize=289 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=17 -DgOutputStripeSize=289
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
mismatch for i 0
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize17_filtersize1_moredata (90 ms)
[ RUN ] testupdateweights.backprop_instance3_smaller2
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
numweights: 36
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=96 -D gInputSizeSquared=9216 -D gNumFilters=1 -D gFilterSize=6 -D gHalfFilterSize=3 -D gFilterSizeSquared=36 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=91 -D gOutputSizeSquared=8281 -D gPadZeros=0 -D gMargin=0 -D gEven=1 -D gSkip=0 -DgNumStripes=8 -DgInputStripeMarginRows=5 -DgInputStripeInnerNumRows=12 -DgInputStripeOuterNumRows=22 -DgInputStripeInnerSize=1152 -DgInputStripeOuterSize=2112 -DgInputStripeMarginSize=480 -DgOutputStripeNumRows=12 -DgOutputStripeSize=1092
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
138 0 0 0 0 0
132 0 0 0 0 0
138 0 0 0 0 0
138 0 0 0 0 0
138 0 0 0 0 0
132 0 0 0 0 0
138 0 0 0 0 0
132 0 0 0 0 0
138 0 0 0 0 0
138 0 0 0 0 0
138 0 0 0 0 0
132 0 0 0 0 0
......
......
......
......
......
......
0=0 0 0 0 0 0 0 0
1=0 0 0 0 0 0 0 0
2=0 0 0 0 0 0 0 0
3=0 0 0 0 0 0 0 0
4=0 0 0 0 0 0 0 0
5=0 0 0 0 0 0 0 0
6=0 0 0 0 0 0 0 0
7=0 0 0 0 0 0 0 0
8=0 0 0 0 0 0 0 0
9=0 0 0 0 0 0 0 0
10=0 0 0 0 0 0 0 0
11=0 0 0 0 0 0 0 0
0=0 0 0 0 0 0 0 0
1=0 0 0 0 0 0 0 0
2=0 0 0 0 0 0 0 0
3=0 0 0 0 0 0 0 0
4=0 0 0 0 0 0 0 0
5=0 0 0 0 0 0 0 0
6=0 0 0 0 0 0 0 0
7=0 0 0 0 0 0 0 0
8=0 0 0 0 0 0 0 0
9=0 0 0 0 0 0 0 0
10=0 0 0 0 0 0 0 0
11=0 0 0 0 0 0 0 0
12=0 0 0 0 0 0 0 0
13=0 0 0 0 0 0 0 0
14=0 0 0 0 0 0 0 0
15=0 0 0 0 0 0 0 0
16=0 0 0 0 0 0 0 0
17=0 0 0 0 0 0 0 0
18=0 0 0 0 0 0 0 0
19=0 0 0 0 0 0 0 0
[ OK ] testupdateweights.backprop_instance3_smaller2 (168 ms)
[----------] 23 tests from testupdateweights (8935 ms total)
[----------] 17 tests from testforward
[ RUN ] testforward.imagesize2_nopadzeros
expected number of output: 4
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testforward.imagesize2_nopadzeros (241 ms)
[ RUN ] testforward.imagesize2_padzeros
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
checking result[0]=0 expecting: 0
checking result[1]=0 expecting: 0
checking result[2]=0 expecting: 0
checking result[3]=0.2 expecting: 0.2
checking result[4]=-0.13 expecting: -0.13
checking result[5]=-0.15 expecting: -0.15
checking result[6]=0 expecting: 0
checking result[7]=0 expecting: 0
checking result[8]=0 expecting: 0
checking result[9]=0 expecting: 0
checking result[10]=0 expecting: 0
checking result[11]=0 expecting: 0
checking result[12]=-0.55 expecting: -0.55
checking result[13]=0.02 expecting: 0.02
checking result[14]=0.21 expecting: 0.21
checking result[27]=-14.3 expecting: -14.3
checking result[28]=-9.6 expecting: -9.6
checking result[29]=11.9 expecting: 11.9
checking result[35]=0.46 expecting: 0.46
[ OK ] testforward.imagesize2_padzeros (96 ms)
[ RUN ] testforward.imagesize3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
test1 ok
[ OK ] testforward.imagesize3 (93 ms)
[ RUN ] testforward.test2
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testforward.test2 (86 ms)
[ RUN ] testforward.test3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testforward.test3 (89 ms)
[ RUN ] testforward.compare_0_1_biased_nopad
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
clblas teardown
[ OK ] testforward.compare_0_1_biased_nopad (167 ms)
[ RUN ] testforward.compare_0_1_biased_pad
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
clblas teardown
[ OK ] testforward.compare_0_1_biased_pad (175 ms)
[ RUN ] testforward.compare_1_n_biased_nopad
instance: 2
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
clblas teardown
instance: 3
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
clblas teardown
instance: 4
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
clblas teardown
instance: 6
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
clblas teardown
instance: 7
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
clblas teardown
[ OK ] testforward.compare_1_n_biased_nopad (1384 ms)
[ RUN ] testforward.compare_1_n_biased_pad
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
instance: 2
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
clblas teardown
instance: 3
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
clblas teardown
instance: 4
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
clblas teardown
instance: 6
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
clblas teardown
instance: 7
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
clblas teardown
[ OK ] testforward.compare_1_n_biased_pad (1430 ms)
[ RUN ] testforward.compare_1_5_biased_nopad
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=19 outputSize=1 padZeros=0 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=19 outputSize=1 padZeros=0 biased=1 skip=0}
clblas teardown
[ OK ] testforward.compare_1_5_biased_nopad (204 ms)
[ RUN ] testforward.compare_1_4_fcscenario
LayerDimensions{ inputPlanes=10 inputSize=24 numFilters=10 filterSize=24 outputSize=1 padZeros=0 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=10 inputSize=24 numFilters=10 filterSize=24 outputSize=1 padZeros=0 biased=1 skip=0}
clblas teardown
[ OK ] testforward.compare_1_4_fcscenario (244 ms)
[ RUN ] testforward.compare_break1_0_1
LayerDimensions{ inputPlanes=1 inputSize=33 numFilters=1 filterSize=1 outputSize=33 padZeros=0 biased=0 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 1
dump enabled=0
batch 0 batchsize 1
dump enabled=0
LayerDimensions{ inputPlanes=1 inputSize=33 numFilters=1 filterSize=1 outputSize=33 padZeros=0 biased=0 skip=0}
clblas teardown
[ OK ] testforward.compare_break1_0_1 (88 ms)
[ RUN ] testforward.compare_break1_0_4
LayerDimensions{ inputPlanes=1 inputSize=33 numFilters=1 filterSize=1 outputSize=33 padZeros=0 biased=0 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 1
dump enabled=0
batch 0 batchsize 1
dump enabled=0
LayerDimensions{ inputPlanes=1 inputSize=33 numFilters=1 filterSize=1 outputSize=33 padZeros=0 biased=0 skip=0}
clblas teardown
[ OK ] testforward.compare_break1_0_4 (95 ms)
[ RUN ] testforward.comparespecific_break2
LayerDimensions{ inputPlanes=64 inputSize=19 numFilters=64 filterSize=19 outputSize=1 padZeros=0 biased=0 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=64 inputSize=19 numFilters=64 filterSize=19 outputSize=1 padZeros=0 biased=0 skip=0}
clblas teardown
[ OK ] testforward.comparespecific_break2 (277 ms)
[ RUN ] testforward.softmax
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
output[0]=0.0320586
output[1]=0.0871443
output[2]=0.643914
output[3]=0.236883
loss 0.44019
loss 3.44019
loss 2.44019
loss 1.44019
[ OK ] testforward.softmax (7 ms)
[ RUN ] testforward.softmax_byplane
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
output[0]=0.0320586
output[1]=0.0871443
output[2]=0.643914
output[3]=0.236883
loss 0.44019
loss 3.44019
loss 2.44019
loss 1.44019
[ OK ] testforward.softmax_byplane (5 ms)
[ RUN ] testforward.crash_from_jm
-D gNumInputPlanes=32 -D gInputPlanes=32 -D gInputSize=28 -D gInputSizeSquared=784 -D gNumFilters=20 -D gFilterSize=28 -D gHalfFilterSize=14 -D gFilterSizeSquared=784 -D gNumOutputPlanes=20 -D gOutputPlanes=20 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=1 -D gSkip=0
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
dump enabled=0
[ OK ] testforward.crash_from_jm (241 ms)
[----------] 17 tests from testforward (4923 ms total)
[----------] 2 tests from testfilehelper
[ RUN ] testfilehelper.testfilehelper
[ OK ] testfilehelper.testfilehelper (19 ms)
[ RUN ] testfilehelper.testreadchunk
[ OK ] testfilehelper.testreadchunk (6 ms)
[----------] 2 tests from testfilehelper (25 ms total)
[----------] 12 tests from testsimpleconvolvenet
[ RUN ] testsimpleconvolvenet.imagesize1_planes2_filters2_unbiased_tanh
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
loss, E, 0.141046
accuracy: 2/2 100%
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 0ms
calcGradWeights try kernel 3
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 134ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 134ms
calcGradWeights layer selected kernel 1
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 100ms
forward kernel 0: cannot be used
forward kernel 1 time: 0ms
forward kernel 2 time: 0ms
forward kernel 3 time: 0ms
forward kernel 4 time: 1ms
forward kernel 5 time: 1ms
forward kernel 6 time: 1ms
forward kernel 7 time: 100ms
forward layer selected kernel 1
loss, E, 0.0733092
accuracy: 2/2 100%
loss, E, 0.0426809
accuracy: 2/2 100%
loss, E, 0.0262453
accuracy: 2/2 100%
loss, E, 0.0164245
accuracy: 2/2 100%
loss, E, 0.0107573
accuracy: 2/2 100%
accuracy: 2/2
clblas teardown
[ OK ] testsimpleconvolvenet.imagesize1_planes2_filters2_unbiased_tanh (1260 ms)
[ RUN ] testsimpleconvolvenet.imagesize1_planes2_filters2_tanh
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
loss, E, 0.964924
accuracy: 1/2 50%
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 0ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 229ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 0ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 0ms
calcGradWeights kernel 4 time: 229ms
calcGradWeights layer selected kernel 1
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 94ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5 time: 1ms
forward kernel 6 time: 1ms
forward kernel 7 time: 94ms
forward layer selected kernel 1
loss, E, 0.0057046
accuracy: 2/2 100%
loss, E, 1.34826e-05
accuracy: 2/2 100%
loss, E, 3.61967e-08
accuracy: 2/2 100%
accuracy: 2/2
clblas teardown
[ OK ] testsimpleconvolvenet.imagesize1_planes2_filters2_tanh (1341 ms)
[ RUN ] testsimpleconvolvenet.imagesize3_n4_filtersize3_tanh
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
loss, E, 1.13283
accuracy: 3/4 75%
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 0ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=3 -D gInputSizeSquared=9 -D gNumFilters=2 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=3 -DgInputStripeOuterNumRows=7 -DgInputStripeInnerSize=9 -DgInputStripeOuterSize=21 -DgInputStripeMarginSize=6 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 226ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 0ms
calcGradWeights kernel 2 time: 0ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 226ms
calcGradWeights layer selected kernel 1
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 2ms
loss, E, 0.00996344
accuracy: 4/4 100%
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 105ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5 time: 1ms
forward kernel 6 time: 2ms
forward kernel 7 time: 105ms
forward layer selected kernel 1
loss, E, 4.70675e-05
accuracy: 4/4 100%
loss, E, 4.09645e-07
accuracy: 4/4 100%
accuracy: 4/4
clblas teardown
[ OK ] testsimpleconvolvenet.imagesize3_n4_filtersize3_tanh (1302 ms)
[ RUN ] testsimpleconvolvenet.imagesize1_2planes_filtersize1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 4ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
loss, E, 0.751601
accuracy: 2/2 100%
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 3ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 0ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 0ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 238ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 2ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 4ms
calcGradWeights kernel 2 time: 0ms
calcGradWeights kernel 3 time: 0ms
calcGradWeights kernel 4 time: 238ms
calcGradWeights layer selected kernel 2
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
loss, E, 0.195916
accuracy: 2/2 100%
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 96ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 3ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5 time: 2ms
forward kernel 6 time: 1ms
forward kernel 7 time: 96ms
forward layer selected kernel 1
loss, E, 0.0679117
accuracy: 2/2 100%
loss, E, 0.023677
accuracy: 2/2 100%
loss, E, 0.00825563
accuracy: 2/2 100%
loss, E, 0.00287856
accuracy: 2/2 100%
loss, E, 0.00100369
accuracy: 2/2 100%
loss, E, 0.000349964
accuracy: 2/2 100%
accuracy: 2/2 100%
accuracy: 2/2
loss, E, 0.000150648
clblas teardown
[ OK ] testsimpleconvolvenet.imagesize1_2planes_filtersize1 (1314 ms)
[ RUN ] testsimpleconvolvenet.imagesize3_n4_filtersize3_relu
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
loss, E, 1.48951
accuracy: 2/4 50%
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 0ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 0ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=3 -D gInputSizeSquared=9 -D gNumFilters=2 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=3 -DgInputStripeOuterNumRows=7 -DgInputStripeInnerSize=9 -DgInputStripeOuterSize=21 -DgInputStripeMarginSize=6 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 0ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 226ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 3ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 0ms
calcGradWeights kernel 3 time: 0ms
calcGradWeights kernel 4 time: 226ms
calcGradWeights layer selected kernel 2
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 2ms
loss, E, 1.12957
accuracy: 2/4 50%
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 102ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 0ms
forward kernel 4 time: 1ms
forward kernel 5 time: 3ms
forward kernel 6 time: 2ms
forward kernel 7 time: 102ms
forward layer selected kernel 3
loss, E, 0.070782
accuracy: 4/4 100%
loss, E, 0.003026
accuracy: 4/4 100%
loss, E, 0.00021158
accuracy: 4/4 100%
loss, E, 1.96858e-05
accuracy: 4/4 100%
loss, E, 2.03002e-06
accuracy: 4/4 100%
loss, E, 2.15572e-07
accuracy: 4/4 100%
loss, E, 2.3083e-08
accuracy: 4/4 100%
loss, E, 2.48239e-09
accuracy: 4/4 100%
loss, E, 4.14442e-10
accuracy: 4/4 100%
accuracy: 4/4
loss, E, 4.14442e-10
clblas teardown
[ OK ] testsimpleconvolvenet.imagesize3_n4_filtersize3_relu (1382 ms)
[ RUN ] testsimpleconvolvenet.imagesize3_n4_filtersize3_linear
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
loss, E, 0.50604
accuracy: 4/4 100%
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=3 -D gInputSizeSquared=9 -D gNumFilters=2 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=3 -DgInputStripeOuterNumRows=7 -DgInputStripeInnerSize=9 -DgInputStripeOuterSize=21 -DgInputStripeMarginSize=6 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 0ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 227ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 0ms
calcGradWeights kernel 4 time: 227ms
calcGradWeights layer selected kernel 3
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 2ms
loss, E, 0.0565529
accuracy: 4/4 100%
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 98ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5 time: 1ms
forward kernel 6 time: 2ms
forward kernel 7 time: 98ms
forward layer selected kernel 1
loss, E, 0.00777245
accuracy: 4/4 100%
loss, E, 0.00106831
accuracy: 4/4 100%
loss, E, 0.000218376
accuracy: 4/4 100%
accuracy: 4/4
loss, E, 0.000218376
clblas teardown
[ OK ] testsimpleconvolvenet.imagesize3_n4_filtersize3_linear (1191 ms)
[ RUN ] testsimpleconvolvenet.imagesize1_n2_2layers_unbiased
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
epoch 0 loss, E, 0.0559531
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
backward try kernel 2
cl/backward_cached.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 2 0ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
epoch 1 loss, E, 0.0254554
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
backward try kernel 3
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackwardAuto: kernel 3 94ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=2 -D gInputPlanes=2 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 0ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
epoch 2 loss, E, 0.0172943
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 0ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 0ms
backward kernel 0: cannot be used
backward kernel 1 time: 0ms
backward kernel 2 time: 0ms
backward kernel 3 time: 94ms
backward layer selected kernel 1
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 221ms
calcGradWeights try kernel 4
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 53ms
epoch 3 loss, E, 0.0138013
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 2ms
forward try kernel 5
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 0ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 0ms
calcGradWeights kernel 4 time: 221ms
calcGradWeights layer selected kernel 1
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 0ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 53ms
calcGradWeights layer selected kernel 1
epoch 4 loss, E, 0.0115848
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 0ms
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
epoch 5 loss, E, 0.00987036
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 95ms
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 53ms
epoch 6 loss, E, 0.00844797
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 0ms
forward kernel 5 time: 2ms
forward kernel 6 time: 0ms
forward kernel 7 time: 95ms
forward layer selected kernel 4
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 0ms
forward kernel 5 time: 1ms
forward kernel 6 time: 1ms
forward kernel 7 time: 53ms
forward layer selected kernel 4
epoch 7 loss, E, 0.00724182
epoch 8 loss, E, 0.00621212
epoch 9 loss, E, 0.00533106
epoch 10 loss, E, 0.00457645
epoch 11 loss, E, 0.00392979
epoch 12 loss, E, 0.00337539
epoch 13 loss, E, 0.00289992
epoch 14 loss, E, 0.002492
epoch 15 loss, E, 0.00214191
epoch 16 loss, E, 0.00184138
epoch 17 loss, E, 0.00158331
epoch 18 loss, E, 0.00136164
epoch 19 loss, E, 0.0011712
epoch 20 loss, E, 0.00100754
epoch 21 loss, E, 0.000866877
epoch 22 loss, E, 0.000745946
epoch 23 loss, E, 0.000641966
epoch 24 loss, E, 0.000552543
epoch 25 loss, E, 0.000475625
epoch 26 loss, E, 0.000409454
epoch 27 loss, E, 0.000352522
epoch 28 loss, E, 0.000303531
epoch 29 loss, E, 0.00026137
epoch 30 loss, E, 0.000225082
epoch 31 loss, E, 0.000193845
epoch 32 loss, E, 0.000166954
epoch 33 loss, E, 0.000143801
epoch 34 loss, E, 0.000123866
epoch 35 loss, E, 0.000106699
epoch 36 loss, E, 9.19176e-05
epoch 37 loss, E, 7.91864e-05
epoch 38 loss, E, 6.82211e-05
epoch 39 loss, E, 5.87767e-05
layer 0:InputLayer{ outputPlanes=1 outputSize=1 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=1 numFilters=2 filterSize=1 outputSize=1 padZeros=0 biased=1 skip=0} }
layer 2:ActivationLayer{ RELU }
layer 3:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=1 numFilters=2 filterSize=1 outputSize=1 padZeros=0 biased=1 skip=0} }
layer 4:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=4 40.0%
layer 3: params=6 60.0%
TOTAL : params=10
loss, E, 5.87767e-05
accuracy: 2/2 100%
accuracy: 2/2
loss, E, 5.87767e-05
loss, E, 5.87767e-05
layer 0:InputLayer{ outputPlanes=1 outputSize=1 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=1 numFilters=2 filterSize=1 outputSize=1 padZeros=0 biased=1 skip=0} }
layer 2:ActivationLayer{ RELU }
layer 3:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=1 numFilters=2 filterSize=1 outputSize=1 padZeros=0 biased=1 skip=0} }
layer 4:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=4 40.0%
layer 3: params=6 60.0%
TOTAL : params=10
float weights1[] = {-0.303866f, -1.59823f};
float weights3[] = {0.426358f, -0.719592f, -0.420361f, 0.719566f};
float bias1[] = {-0.324465f, 0.60279f};
float bias3[] = {0.506862f, -0.506837f};
clblas teardown
[ OK ] testsimpleconvolvenet.imagesize1_n2_2layers_unbiased (2189 ms)
[ RUN ] testsimpleconvolvenet.imagesize1_n2_2layers_biased
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
loss, E, 1.19067
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
backward try kernel 2
cl/backward_cached.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 2 1ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 0ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 0ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
backward try kernel 3
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackwardAuto: kernel 3 94ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=2 -D gInputPlanes=2 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 0ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
backward kernel 0: cannot be used
backward kernel 1 time: 0ms
backward kernel 2 time: 1ms
backward kernel 3 time: 94ms
backward layer selected kernel 1
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 228ms
calcGradWeights try kernel 4
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 59ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
forward try kernel 5
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 0ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 228ms
calcGradWeights layer selected kernel 2
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 0ms
calcGradWeights kernel 2 time: 0ms
calcGradWeights kernel 3 time: 0ms
calcGradWeights kernel 4 time: 59ms
calcGradWeights layer selected kernel 1
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
loss, E, 0.0667568
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 95ms
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 53ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5 time: 1ms
forward kernel 6 time: 1ms
forward kernel 7 time: 95ms
forward layer selected kernel 1
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5 time: 1ms
forward kernel 6 time: 1ms
forward kernel 7 time: 53ms
forward layer selected kernel 1
loss, E, 0.00923595
loss, E, 0.00112611
loss, E, 0.000117399
loss, E, 1.15641e-05
dump enabled=0
loss, E, 1.78565e-06
accuracy: 2/2 100%
accuracy: 2/2
loss, E, 1.78565e-06
clblas teardown
[ OK ] testsimpleconvolvenet.imagesize1_n2_2layers_biased (2125 ms)
[ RUN ] testsimpleconvolvenet.imagesize_5_4_2layers_filtersize_2_4_biased_n3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
loss, E, 1.70364
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
backward try kernel 2
cl/backward_cached.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 2 1ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 0ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
backward try kernel 3
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackwardAuto: kernel 3 105ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=3 -D gInputPlanes=3 -D gInputSize=4 -D gInputSizeSquared=16 -D gNumFilters=3 -D gFilterSize=4 -D gHalfFilterSize=2 -D gFilterSizeSquared=16 -D gNumOutputPlanes=3 -D gOutputPlanes=3 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=1 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=3 -DgInputStripeInnerNumRows=4 -DgInputStripeOuterNumRows=10 -DgInputStripeInnerSize=16 -DgInputStripeOuterSize=40 -DgInputStripeMarginSize=12 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=5 -D gInputSizeSquared=25 -D gNumFilters=3 -D gFilterSize=2 -D gHalfFilterSize=1 -D gFilterSizeSquared=4 -D gNumOutputPlanes=3 -D gOutputPlanes=3 -D gOutputSize=4 -D gOutputSizeSquared=16 -D gPadZeros=0 -D gMargin=0 -D gEven=1 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=1 -DgInputStripeInnerNumRows=5 -DgInputStripeOuterNumRows=7 -DgInputStripeInnerSize=25 -DgInputStripeOuterSize=35 -DgInputStripeMarginSize=5 -DgOutputStripeNumRows=4 -DgOutputStripeSize=16
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
backward kernel 0: cannot be used
backward kernel 1 time: 1ms
backward kernel 2 time: 1ms
backward kernel 3 time: 105ms
backward layer selected kernel 1
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 235ms
calcGradWeights try kernel 4
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 271ms
forward try kernel 5
ForwardAuto: kernel 5: this instance cant be used: For ForwardFc, filtersize and inputimagesize must be identical
... not valid
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 0ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 235ms
calcGradWeights layer selected kernel 2
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 271ms
calcGradWeights layer selected kernel 1
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 107ms
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5: cannot be used
forward kernel 6 time: 1ms
forward kernel 7 time: 107ms
forward layer selected kernel 1
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 170ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5 time: 1ms
forward kernel 6 time: 1ms
forward kernel 7 time: 170ms
forward layer selected kernel 1
loss, E, 0.000912283
loss, E, 3.47041e-07
loss, E, 5.0589e-10
loss, E, 1.80567e-12
loss, E, 2.88658e-13
loss, E, 1.87406e-13
loss, E, 1.67866e-13
loss, E, 1.51879e-13
loss, E, 1.51879e-13
loss, E, 1.51879e-13
accuracy: 3/3 100%
accuracy: 3/3
loss, E, 1.51879e-13
clblas teardown
[ OK ] testsimpleconvolvenet.imagesize_5_4_2layers_filtersize_2_4_biased_n3 (8581 ms)
[ RUN ] testsimpleconvolvenet.imagesize_5_4_2layers_filtersize_2_4_biased_n6
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
loss, E, 3.64011
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
backward try kernel 2
cl/backward_cached.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 2 1ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 0ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 0ms
backward try kernel 3
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackwardAuto: kernel 3 107ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=3 -D gInputPlanes=3 -D gInputSize=4 -D gInputSizeSquared=16 -D gNumFilters=3 -D gFilterSize=4 -D gHalfFilterSize=2 -D gFilterSizeSquared=16 -D gNumOutputPlanes=3 -D gOutputPlanes=3 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=1 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=3 -DgInputStripeInnerNumRows=4 -DgInputStripeOuterNumRows=10 -DgInputStripeInnerSize=16 -DgInputStripeOuterSize=40 -DgInputStripeMarginSize=12 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 0ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=5 -D gInputSizeSquared=25 -D gNumFilters=3 -D gFilterSize=2 -D gHalfFilterSize=1 -D gFilterSizeSquared=4 -D gNumOutputPlanes=3 -D gOutputPlanes=3 -D gOutputSize=4 -D gOutputSizeSquared=16 -D gPadZeros=0 -D gMargin=0 -D gEven=1 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=1 -DgInputStripeInnerNumRows=5 -DgInputStripeOuterNumRows=7 -DgInputStripeInnerSize=25 -DgInputStripeOuterSize=35 -DgInputStripeMarginSize=5 -DgOutputStripeNumRows=4 -DgOutputStripeSize=16
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 0ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 0ms
backward kernel 0: cannot be used
backward kernel 1 time: 1ms
backward kernel 2 time: 1ms
backward kernel 3 time: 107ms
backward layer selected kernel 1
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 238ms
calcGradWeights try kernel 4
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 267ms
forward try kernel 5
ForwardAuto: kernel 5: this instance cant be used: For ForwardFc, filtersize and inputimagesize must be identical
... not valid
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 2ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 2ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 0ms
calcGradWeights kernel 3 time: 0ms
calcGradWeights kernel 4 time: 238ms
calcGradWeights layer selected kernel 2
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 0ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 0ms
calcGradWeights kernel 4 time: 267ms
calcGradWeights layer selected kernel 1
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 104ms
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
forward kernel 0: cannot be used
forward kernel 1 time: 0ms
forward kernel 2 time: 0ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5: cannot be used
forward kernel 6 time: 2ms
forward kernel 7 time: 104ms
forward layer selected kernel 1
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 169ms
forward kernel 0: cannot be used
forward kernel 1 time: 0ms
forward kernel 2 time: 0ms
forward kernel 3 time: 0ms
forward kernel 4 time: 0ms
forward kernel 5 time: 2ms
forward kernel 6 time: 1ms
forward kernel 7 time: 169ms
forward layer selected kernel 1
loss, E, 4.07193e-10
loss, E, 5.90639e-14
loss, E, 1.28342e-13
loss, E, 7.10543e-15
loss, E, 5.32907e-15
accuracy: 6/6 100%
accuracy: 6/6
loss, E, 5.32907e-15
clblas teardown
[ OK ] testsimpleconvolvenet.imagesize_5_4_2layers_filtersize_2_4_biased_n6 (5562 ms)
[ RUN ] testsimpleconvolvenet.imagesize_5_3_2layers_filtersize_3_3_biased_n6
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
loss, E, 4.00796
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
backward try kernel 2
cl/backward_cached.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 2 1ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 0ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 0ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
backward try kernel 3
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackwardAuto: kernel 3 144ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=3 -D gInputPlanes=3 -D gInputSize=3 -D gInputSizeSquared=9 -D gNumFilters=3 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=3 -D gOutputPlanes=3 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=3 -DgInputStripeOuterNumRows=7 -DgInputStripeInnerSize=9 -DgInputStripeOuterSize=21 -DgInputStripeMarginSize=6 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 0ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=5 -D gInputSizeSquared=25 -D gNumFilters=3 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=3 -D gOutputPlanes=3 -D gOutputSize=3 -D gOutputSizeSquared=9 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=5 -DgInputStripeOuterNumRows=9 -DgInputStripeInnerSize=25 -DgInputStripeOuterSize=45 -DgInputStripeMarginSize=10 -DgOutputStripeNumRows=3 -DgOutputStripeSize=9
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 0ms
backward kernel 0: cannot be used
backward kernel 1 time: 0ms
backward kernel 2 time: 1ms
backward kernel 3 time: 144ms
backward layer selected kernel 1
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 271ms
calcGradWeights try kernel 4
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 175ms
forward try kernel 5
ForwardAuto: kernel 5: this instance cant be used: For ForwardFc, filtersize and inputimagesize must be identical
... not valid
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 0ms
calcGradWeights kernel 2 time: 0ms
calcGradWeights kernel 3 time: 0ms
calcGradWeights kernel 4 time: 271ms
calcGradWeights layer selected kernel 1
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 0ms
calcGradWeights kernel 2 time: 0ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 175ms
calcGradWeights layer selected kernel 1
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 113ms
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5: cannot be used
forward kernel 6 time: 1ms
forward kernel 7 time: 113ms
forward layer selected kernel 1
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 58ms
forward kernel 0: cannot be used
forward kernel 1 time: 0ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 0ms
forward kernel 5 time: 1ms
forward kernel 6 time: 1ms
forward kernel 7 time: 58ms
forward layer selected kernel 1
loss, E, 1.87685e-08
loss, E, 3.41949e-14
loss, E, 5.77316e-15
accuracy: 6/6 100%
accuracy: 6/6
loss, E, 5.77316e-15
clblas teardown
[ OK ] testsimpleconvolvenet.imagesize_5_3_2layers_filtersize_3_3_biased_n6 (4194 ms)
[ RUN ] testsimpleconvolvenet.imagesize_5_3_2layers_filtersize_3_3_biased_n18
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
loss, E, 10.6129
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
backward try kernel 2
cl/backward_cached.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 2 0ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 0ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 0ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
backward try kernel 3
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackwardAuto: kernel 3 144ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=3 -D gInputPlanes=3 -D gInputSize=3 -D gInputSizeSquared=9 -D gNumFilters=3 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=3 -D gOutputPlanes=3 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=3 -DgInputStripeOuterNumRows=7 -DgInputStripeInnerSize=9 -DgInputStripeOuterSize=21 -DgInputStripeMarginSize=6 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=5 -D gInputSizeSquared=25 -D gNumFilters=3 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=3 -D gOutputPlanes=3 -D gOutputSize=3 -D gOutputSizeSquared=9 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=5 -DgInputStripeOuterNumRows=9 -DgInputStripeInnerSize=25 -DgInputStripeOuterSize=45 -DgInputStripeMarginSize=10 -DgOutputStripeNumRows=3 -DgOutputStripeSize=9
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 2ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
backward kernel 0: cannot be used
backward kernel 1 time: 0ms
backward kernel 2 time: 0ms
backward kernel 3 time: 144ms
backward layer selected kernel 1
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 271ms
calcGradWeights try kernel 4
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 175ms
forward try kernel 5
ForwardAuto: kernel 5: this instance cant be used: For ForwardFc, filtersize and inputimagesize must be identical
... not valid
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 0ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 271ms
calcGradWeights layer selected kernel 2
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 175ms
calcGradWeights layer selected kernel 1
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 113ms
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 2ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 0ms
forward kernel 4 time: 2ms
forward kernel 5: cannot be used
forward kernel 6 time: 1ms
forward kernel 7 time: 113ms
forward layer selected kernel 3
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 57ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5 time: 1ms
forward kernel 6 time: 2ms
forward kernel 7 time: 57ms
forward layer selected kernel 1
loss, E, 0.534475
loss, E, 0.0609361
loss, E, 0.0234247
loss, E, 0.0115108
loss, E, 0.00646704
loss, E, 0.00379292
loss, E, 0.00236515
loss, E, 0.00154625
loss, E, 0.00106407
loss, E, 0.000774096
loss, E, 0.000594543
loss, E, 0.000478669
loss, E, 0.000399373
loss, E, 0.00034237
loss, E, 0.000300228
loss, E, 0.000267169
loss, E, 0.00023959
loss, E, 0.000216118
loss, E, 0.000195875
loss, E, 0.000178242
loss, E, 0.000162761
loss, E, 0.000149073
loss, E, 0.000136902
loss, E, 0.000126021
loss, E, 0.00011625
loss, E, 0.000107442
loss, E, 9.94714e-05
loss, E, 9.22389e-05
loss, E, 8.56547e-05
layer 0:InputLayer{ outputPlanes=1 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=5 numFilters=3 filterSize=3 outputSize=3 padZeros=0 biased=1 skip=0} }
layer 2:ActivationLayer{ RELU }
layer 3:ConvolutionalLayer{ LayerDimensions{ inputPlanes=3 inputSize=3 numFilters=3 filterSize=3 outputSize=1 padZeros=0 biased=1 skip=0} }
layer 4:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=30 26.3%
layer 3: params=84 73.7%
TOTAL : params=114
loss, E, 7.97045e-05
accuracy: 18/18 100%
accuracy: 18/18
loss, E, 7.97045e-05
clblas teardown
[ OK ] testsimpleconvolvenet.imagesize_5_3_2layers_filtersize_3_3_biased_n18 (19955 ms)
[----------] 12 tests from testsimpleconvolvenet (50397 ms total)
[----------] 3 tests from testlogicaloperators
[ RUN ] testlogicaloperators.Convolve_1layer_biased_And
And
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
Loss L 6.84146
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 0ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=2 -D gInputPlanes=2 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 0ms
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 221ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 2ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 0ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 221ms
calcGradWeights layer selected kernel 2
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 2ms
Loss L 0.907687
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 97ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 0ms
forward kernel 5 time: 2ms
forward kernel 6 time: 2ms
forward kernel 7 time: 97ms
forward layer selected kernel 4
Loss L 0.469401
Loss L 0.326441
accuracy: 4/4
loss, E, 0.282905
clblas teardown
[ OK ] testlogicaloperators.Convolve_1layer_biased_And (1170 ms)
[ RUN ] testlogicaloperators.Convolve_1layerbiased_Or
Or, convolve
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
Loss L 4.36978
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 0ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=2 -D gInputPlanes=2 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 223ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 2ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 0ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 223ms
calcGradWeights layer selected kernel 1
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
Loss L 0.74123
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 96ms
forward kernel 0: cannot be used
forward kernel 1 time: 0ms
forward kernel 2 time: 1ms
forward kernel 3 time: 0ms
forward kernel 4 time: 1ms
forward kernel 5 time: 2ms
forward kernel 6 time: 1ms
forward kernel 7 time: 96ms
forward layer selected kernel 1
Loss L 0.415566
Loss L 0.307695
accuracy: 4/4 100%
loss, E, 0.274835
clblas teardown
[ OK ] testlogicaloperators.Convolve_1layerbiased_Or (1179 ms)
[ RUN ] testlogicaloperators.Convolve_2layers_relu_Xor
Xor, convolve
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
hand-setting weights...
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
Loss L 0.152638
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
backward try kernel 2
cl/backward_cached.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 2 1ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 0ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 0ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
backward try kernel 3
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackwardAuto: kernel 3 95ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=2 -D gInputPlanes=2 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 0ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=2 -D gInputPlanes=2 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
backward kernel 0: cannot be used
backward kernel 1 time: 1ms
backward kernel 2 time: 1ms
backward kernel 3 time: 95ms
backward layer selected kernel 1
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 222ms
calcGradWeights try kernel 4
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 53ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
forward try kernel 5
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 0ms
calcGradWeights kernel 2 time: 0ms
calcGradWeights kernel 3 time: 0ms
calcGradWeights kernel 4 time: 222ms
calcGradWeights layer selected kernel 1
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 0ms
calcGradWeights kernel 2 time: 0ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 53ms
calcGradWeights layer selected kernel 1
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 2ms
Loss L 0.00640068
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 95ms
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 53ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 0ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5 time: 1ms
forward kernel 6 time: 1ms
forward kernel 7 time: 95ms
forward layer selected kernel 2
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5 time: 1ms
forward kernel 6 time: 2ms
forward kernel 7 time: 53ms
forward layer selected kernel 1
Loss L 0.00139435
Loss L 0.000383307
Loss L 0.000117079
Loss L 4.63626e-05
Loss L 1.8873e-05
Loss L 7.15534e-06
Loss L 2.83958e-06
Loss L 1.12727e-06
Loss L 4.44109e-07
Loss L 1.72233e-07
Loss L 6.82345e-08
Loss L 2.76343e-08
Loss L 1.04286e-08
Loss L 4.13357e-09
Loss L 1.67201e-09
Loss L 6.29148e-10
Loss L 2.4837e-10
Loss L 1.00833e-10
Loss L 3.80673e-11
Loss L 1.5131e-11
Loss L 5.84421e-12
Loss L 2.16893e-12
Loss L 9.52127e-13
Loss L 3.58824e-13
Loss L 1.56319e-13
Loss L 9.9476e-14
Loss L 9.9476e-14
Loss L 9.9476e-14
Loss L 9.9476e-14
Loss L 9.23706e-14
Loss L 9.23706e-14
Loss L 9.41469e-14
Loss L 8.70415e-14
Loss L 9.41469e-14
Loss L 8.52651e-14
Loss L 8.52651e-14
Loss L 8.52651e-14
Loss L 8.52651e-14
layer 0:InputLayer{ outputPlanes=2 outputSize=1 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=1 numFilters=2 filterSize=1 outputSize=1 padZeros=0 biased=1 skip=0} }
layer 2:ActivationLayer{ RELU }
layer 3:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=1 numFilters=2 filterSize=1 outputSize=1 padZeros=0 biased=1 skip=0} }
layer 4:ActivationLayer{ RELU }
layer 5:SquareLossLayer{}
Parameters overview: (skipping 4 layers with 0 params)
layer 1: params=6 50.0%
layer 3: params=6 50.0%
TOTAL : params=12
accuracy: 4/4 100%
loss, E, 8.52651e-14
clblas teardown
[ OK ] testlogicaloperators.Convolve_2layers_relu_Xor (3149 ms)
[----------] 3 tests from testlogicaloperators (5498 ms total)
[----------] 12 tests from testbackward
[ RUN ] testbackward.squareloss
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
layer 0:InputLayer{ outputPlanes=3 outputSize=5 }
layer 1:ForceBackpropLayer{ outputPlanes=3 outputSize=5 }
layer 2:SquareLossLayer{}
inputtotalsize=2400 outputTotalSize=2400
layer 0:InputLayer{ outputPlanes=3 outputSize=5 }
layer 1:ForceBackpropLayer{ outputPlanes=3 outputSize=5 }
layer 2:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
TOTAL : params=0
idx=44 predicted losschange=-0.000912508 actual=-0.000976563
idx=2245 predicted losschange=0.00785823 actual=0.00805664
idx=648 predicted losschange=0.00965759 actual=0.00976563
idx=586 predicted losschange=0.0136895 actual=0.0136719
idx=730 predicted losschange=0.00117897 actual=0.00146484
idx=611 predicted losschange=0.00152302 actual=0.00195313
idx=1130 predicted losschange=0.0159167 actual=0.0161133
idx=15 predicted losschange=0.0434798 actual=0.0439453
idx=1923 predicted losschange=-0.00790002 actual=-0.0078125
idx=670 predicted losschange=0.0335141 actual=0.0336914
[ OK ] testbackward.squareloss (7 ms)
[ RUN ] testbackward.crossentropyloss
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
layer 0:InputLayer{ outputPlanes=3 outputSize=5 }
layer 1:ForceBackpropLayer{ outputPlanes=3 outputSize=5 }
layer 2:Layer{}
inputtotalsize=300 outputTotalSize=300
layer 0:InputLayer{ outputPlanes=3 outputSize=5 }
layer 1:ForceBackpropLayer{ outputPlanes=3 outputSize=5 }
layer 2:Layer{}
Parameters overview: (skipping 3 layers with 0 params)
TOTAL : params=0
idx=44 predicted losschange=0.000274935 actual=0.000274658
idx=145 predicted losschange=-0.000885784 actual=-0.00088501
idx=48 predicted losschange=-0.000859834 actual=-0.000854492
idx=286 predicted losschange=0.00713042 actual=0.00717163
idx=130 predicted losschange=-0.000264829 actual=-0.000244141
idx=11 predicted losschange=-1.98163e-05 actual=0
idx=230 predicted losschange=-0.000594819 actual=-0.000610352
idx=15 predicted losschange=-0.0006499 actual=-0.000640869
idx=123 predicted losschange=-0.000846121 actual=-0.000823975
idx=70 predicted losschange=0.000790196 actual=0.000793457
[ OK ] testbackward.crossentropyloss (7 ms)
[ RUN ] testbackward.softmaxloss
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
layer 0:InputLayer{ outputPlanes=5 outputSize=1 }
layer 1:ForceBackpropLayer{ outputPlanes=5 outputSize=1 }
layer 2:SoftMaxLayer{ perPlane=0 numPlanes=5 imageSize=1 }
inputtotalsize=10 outputTotalSize=10
layer 0:InputLayer{ outputPlanes=5 outputSize=1 }
layer 1:ForceBackpropLayer{ outputPlanes=5 outputSize=1 }
layer 2:SoftMaxLayer{ perPlane=0 numPlanes=5 imageSize=1 }
Parameters overview: (skipping 3 layers with 0 params)
TOTAL : params=0
idx=4 predicted losschange=0.000113075 actual=0.00011301
idx=5 predicted losschange=0.000145627 actual=0.000145674
idx=8 predicted losschange=3.16699e-05 actual=3.19481e-05
idx=6 predicted losschange=4.89271e-06 actual=5.24521e-06
idx=0 predicted losschange=2.29469e-05 actual=2.28882e-05
idx=1 predicted losschange=-8.26119e-05 actual=-8.27312e-05
idx=0 predicted losschange=2.29469e-05 actual=2.28882e-05
idx=5 predicted losschange=0.000145627 actual=0.000145674
idx=3 predicted losschange=-5.50179e-05 actual=-5.50747e-05
idx=0 predicted losschange=2.29469e-05 actual=2.28882e-05
[ OK ] testbackward.softmaxloss (8 ms)
[ RUN ] testbackward.squareloss2
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
layer 0:InputLayer{ outputPlanes=5 outputSize=1 }
layer 1:ForceBackpropLayer{ outputPlanes=5 outputSize=1 }
layer 2:SquareLossLayer{}
layer 0:InputLayer{ outputPlanes=5 outputSize=1 }
layer 1:ForceBackpropLayer{ outputPlanes=5 outputSize=1 }
layer 2:SquareLossLayer{}
batchSize: 32
inputtotalsize=160 outputTotalSize=160
layer SquareLossLayer{}
layer 0:InputLayer{ outputPlanes=5 outputSize=1 }
layer 1:ForceBackpropLayer{ outputPlanes=5 outputSize=1 }
layer 2:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
TOTAL : params=0
idx=44 predicted losschange=0.000126406 actual=0.000125885
idx=5 predicted losschange=0.00461891 actual=0.00464439
idx=8 predicted losschange=0.000356787 actual=0.000356674
idx=106 predicted losschange=0.00716324 actual=0.00719643
idx=90 predicted losschange=0.000474759 actual=0.000480652
idx=131 predicted losschange=0.000979017 actual=0.000984192
idx=10 predicted losschange=0.000660134 actual=0.000663757
idx=15 predicted losschange=0.00961313 actual=0.00965118
idx=3 predicted losschange=0.00264732 actual=0.00267029
idx=30 predicted losschange=0.00865312 actual=0.00868607
[ OK ] testbackward.squareloss2 (7 ms)
[ RUN ] testbackward.crossentropy2
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
layer 0:InputLayer{ outputPlanes=5 outputSize=1 }
layer 1:ForceBackpropLayer{ outputPlanes=5 outputSize=1 }
layer 2:Layer{}
layer 0:InputLayer{ outputPlanes=5 outputSize=1 }
layer 1:ForceBackpropLayer{ outputPlanes=5 outputSize=1 }
layer 2:Layer{}
batchSize: 2
inputtotalsize=10 outputTotalSize=10
layer Layer{}
layer 0:InputLayer{ outputPlanes=5 outputSize=1 }
layer 1:ForceBackpropLayer{ outputPlanes=5 outputSize=1 }
layer 2:Layer{}
Parameters overview: (skipping 3 layers with 0 params)
TOTAL : params=0
idx=4 predicted losschange=0.00258649 actual=-nan(ind)
idx=5 predicted losschange=0.0227095 actual=-nan(ind)
idx=8 predicted losschange=-0.00202714 actual=-nan(ind)
idx=6 predicted losschange=-0.000846508 actual=-nan(ind)
idx=0 predicted losschange=-0.000424821 actual=-nan(ind)
idx=1 predicted losschange=-0.00171216 actual=-nan(ind)
idx=0 predicted losschange=-0.000424821 actual=-nan(ind)
idx=5 predicted losschange=0.0227095 actual=-nan(ind)
idx=3 predicted losschange=0.0123444 actual=-nan(ind)
idx=0 predicted losschange=-0.000424821 actual=-nan(ind)
[ OK ] testbackward.crossentropy2 (8 ms)
[ RUN ] testbackward.softmax2
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
layer 0:InputLayer{ outputPlanes=5 outputSize=1 }
layer 1:ForceBackpropLayer{ outputPlanes=5 outputSize=1 }
layer 2:SoftMaxLayer{ perPlane=0 numPlanes=5 imageSize=1 }
layer 0:InputLayer{ outputPlanes=5 outputSize=1 }
layer 1:ForceBackpropLayer{ outputPlanes=5 outputSize=1 }
layer 2:SoftMaxLayer{ perPlane=0 numPlanes=5 imageSize=1 }
batchSize: 2
inputtotalsize=10 outputTotalSize=10
layer SoftMaxLayer{ perPlane=0 numPlanes=5 imageSize=1 }
layer 0:InputLayer{ outputPlanes=5 outputSize=1 }
layer 1:ForceBackpropLayer{ outputPlanes=5 outputSize=1 }
layer 2:SoftMaxLayer{ perPlane=0 numPlanes=5 imageSize=1 }
Parameters overview: (skipping 3 layers with 0 params)
TOTAL : params=0
idx=4 predicted losschange=0.00035729 actual=0.000357628
idx=5 predicted losschange=0.0015055 actual=0.00151086
idx=8 predicted losschange=-5.63632e-05 actual=-5.65052e-05
idx=6 predicted losschange=-1.48864e-05 actual=-1.4782e-05
idx=0 predicted losschange=1.96542e-05 actual=1.97887e-05
idx=1 predicted losschange=-0.000287167 actual=-0.000287056
idx=0 predicted losschange=1.96542e-05 actual=1.97887e-05
idx=5 predicted losschange=0.0015055 actual=0.00151086
idx=3 predicted losschange=-0.000152824 actual=-0.00014782
idx=0 predicted losschange=1.96542e-05 actual=1.97887e-05
[ OK ] testbackward.softmax2 (8 ms)
[ RUN ] testbackward.conv1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
layer 0:InputLayer{ outputPlanes=2 outputSize=4 }
layer 1:ForceBackpropLayer{ outputPlanes=2 outputSize=4 }
layer 2:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=4 numFilters=2 filterSize=3 outputSize=2 padZeros=0 biased=0 skip=0} }
layer 3:SquareLossLayer{}
layer 0:InputLayer{ outputPlanes=2 outputSize=4 }
layer 1:ForceBackpropLayer{ outputPlanes=2 outputSize=4 }
layer 2:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=4 numFilters=2 filterSize=3 outputSize=2 padZeros=0 biased=0 skip=0} }
layer 3:SquareLossLayer{}
batchSize: 4
inputtotalsize=128 outputTotalSize=32
layer ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=4 numFilters=2 filterSize=3 outputSize=2 padZeros=0 biased=0 skip=0} }
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
layer 0:InputLayer{ outputPlanes=2 outputSize=4 }
layer 1:ForceBackpropLayer{ outputPlanes=2 outputSize=4 }
layer 2:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=4 numFilters=2 filterSize=3 outputSize=2 padZeros=0 biased=0 skip=0} }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 2: params=36 100.0%
TOTAL : params=36
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
idx=44 predicted losschange=-0.000247746 actual=-0.000247955
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 0ms
idx=37 predicted losschange=0.0104374 actual=0.0104504
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
idx=40 predicted losschange=-0.00114652 actual=-0.00113678
forward try kernel 5
ForwardAuto: kernel 5: this instance cant be used: For ForwardFc, filtersize and inputimagesize must be identical
... not valid
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
idx=106 predicted losschange=-0.000407703 actual=-0.00039959
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 121ms
idx=122 predicted losschange=-0.00107492 actual=-0.00039959
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 0ms
forward kernel 3 time: 0ms
forward kernel 4 time: 1ms
forward kernel 5: cannot be used
forward kernel 6 time: 1ms
forward kernel 7 time: 121ms
forward layer selected kernel 2
idx=99 predicted losschange=-0.00114029 actual=-0.00113869
idx=10 predicted losschange=0.00172434 actual=0.0017252
idx=47 predicted losschange=-0.00161501 actual=-0.00161457
idx=67 predicted losschange=-0.000216964 actual=-0.000211716
idx=126 predicted losschange=-0.000768316 actual=-0.000762939
clblas teardown
[ OK ] testbackward.conv1 (794 ms)
[ RUN ] testbackward.fc1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
layer 0:InputLayer{ outputPlanes=2 outputSize=4 }
layer 1:ForceBackpropLayer{ outputPlanes=2 outputSize=4 }
layer 2:FullyConnectedLayer{ numPlanes=4 imageSize=1 }
layer 3:SquareLossLayer{}
layer 0:InputLayer{ outputPlanes=2 outputSize=4 }
layer 1:ForceBackpropLayer{ outputPlanes=2 outputSize=4 }
layer 2:FullyConnectedLayer{ numPlanes=4 imageSize=1 }
layer 3:SquareLossLayer{}
batchSize: 4
inputtotalsize=128 outputTotalSize=16
layer FullyConnectedLayer{ numPlanes=4 imageSize=1 }
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
layer 0:InputLayer{ outputPlanes=2 outputSize=4 }
layer 1:ForceBackpropLayer{ outputPlanes=2 outputSize=4 }
layer 2:FullyConnectedLayer{ numPlanes=4 imageSize=1 }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 2: params=128 100.0%
TOTAL : params=128
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
idx=44 predicted losschange=0.000143006 actual=0.000143051
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
idx=37 predicted losschange=0.00272767 actual=0.00273252
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
idx=40 predicted losschange=0.00326425 actual=0.003268
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 2ms
idx=106 predicted losschange=0.000116936 actual=0.000119925
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
idx=122 predicted losschange=0.000618769 actual=0.000622034
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 155ms
idx=99 predicted losschange=-0.000108237 actual=0.000622034
forward kernel 0: cannot be used
forward kernel 1 time: 0ms
forward kernel 2 time: 0ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5 time: 2ms
forward kernel 6 time: 1ms
forward kernel 7 time: 155ms
forward layer selected kernel 1
idx=10 predicted losschange=0.000239861 actual=0.000240326
idx=47 predicted losschange=-0.00088519 actual=-0.000884056
idx=67 predicted losschange=0.00203682 actual=0.00203872
idx=126 predicted losschange=-0.000419895 actual=-0.000416517
clblas teardown
[ OK ] testbackward.fc1 (888 ms)
[ RUN ] testbackward.act1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
layer 0:InputLayer{ outputPlanes=1 outputSize=2 }
layer 1:ForceBackpropLayer{ outputPlanes=1 outputSize=2 }
layer 2:ActivationLayer{ RELU }
layer 3:SquareLossLayer{}
layer 0:InputLayer{ outputPlanes=1 outputSize=2 }
layer 1:ForceBackpropLayer{ outputPlanes=1 outputSize=2 }
layer 2:ActivationLayer{ RELU }
layer 3:SquareLossLayer{}
batchSize: 1
inputtotalsize=4 outputTotalSize=4
layer ActivationLayer{ RELU }
layer 0:InputLayer{ outputPlanes=1 outputSize=2 }
layer 1:ForceBackpropLayer{ outputPlanes=1 outputSize=2 }
layer 2:ActivationLayer{ RELU }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 4 layers with 0 params)
TOTAL : params=0
idx=0 predicted losschange=-0.000880961 actual=-0.00088048
idx=1 predicted losschange=-0.00151209 actual=-0.00151044
idx=0 predicted losschange=-0.000880961 actual=-0.00088048
idx=2 predicted losschange=-0.00245153 actual=-0.0024423
idx=2 predicted losschange=-0.00245153 actual=-0.0024423
idx=3 predicted losschange=-0.00214455 actual=-0.00212085
idx=2 predicted losschange=-0.00245153 actual=-0.0024423
idx=3 predicted losschange=-0.00214455 actual=-0.00212085
idx=3 predicted losschange=-0.00214455 actual=-0.00212085
idx=2 predicted losschange=-0.00245153 actual=-0.0024423
[ OK ] testbackward.act1 (102 ms)
[ RUN ] testbackward.checknumerically
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 0ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 0ms
loss 0.0986296 loss2 0.0984814 change: 0.000148199
sumweightsdiff 0.0038507
loss change 0.000148199
estimatedLossChangeFromW 0.000148279
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 0ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 0ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
forward try kernel 5
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 2ms
backward try kernel 2
cl/backward_cached.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 2 0ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 0ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 0ms
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
loss 0.0984814 loss2 0.0983336 change: 0.000147872
sumweightsdiff 0.00384641
loss change 0.000147872
estimatedLossChangeFromW 0.000147948
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 94ms
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 52ms
forward kernel 0: cannot be used
forward kernel 1 time: 0ms
forward kernel 2 time: 0ms
forward kernel 3 time: 0ms
forward kernel 4 time: 0ms
forward kernel 5 time: 1ms
forward kernel 6 time: 1ms
forward kernel 7 time: 94ms
forward layer selected kernel 1
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 0ms
forward kernel 3 time: 0ms
forward kernel 4 time: 0ms
forward kernel 5 time: 2ms
forward kernel 6 time: 1ms
forward kernel 7 time: 52ms
forward layer selected kernel 2
backward try kernel 3
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackwardAuto: kernel 3 95ms
calcGradWeights try kernel 3
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=1 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
calcGradWeights try kernel 3
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=1 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
loss 0.0983336 loss2 0.098186 change: 0.000147529
sumweightsdiff 0.00384223
loss change 0.000147529
estimatedLossChangeFromW 0.000147628
backward kernel 0: cannot be used
backward kernel 1 time: 0ms
backward kernel 2 time: 0ms
backward kernel 3 time: 95ms
backward layer selected kernel 1
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 132ms
calcGradWeights try kernel 4
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 52ms
loss 0.098186 loss2 0.0980388 change: 0.000147231
sumweightsdiff 0.00383794
loss change 0.000147231
estimatedLossChangeFromW 0.000147298
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 0ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 132ms
calcGradWeights layer selected kernel 2
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 0ms
calcGradWeights kernel 2 time: 0ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 52ms
calcGradWeights layer selected kernel 1
loss 0.0980388 loss2 0.0978919 change: 0.000146888
sumweightsdiff 0.00383377
loss change 0.000146888
estimatedLossChangeFromW 0.000146978
clblas teardown
[ OK ] testbackward.checknumerically (1985 ms)
[ RUN ] testbackward.checknumerically_imagesize5_filter3_relu
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
loss 630.466 loss2 608.021 change: 22.4443
sumweightsdiff -0.035685
loss change 22.4443
estimatedLossChangeFromW 22.6629
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 0ms
forward try kernel 5
ForwardAuto: kernel 5: this instance cant be used: For ForwardFc, filtersize and inputimagesize must be identical
... not valid
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
forward try kernel 5
ForwardAuto: kernel 5: this instance cant be used: For ForwardFc, filtersize and inputimagesize must be identical
... not valid
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 0ms
backward try kernel 2
cl/backward_cached.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 2 0ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 159ms
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 70ms
loss 608.021 loss2 586.349 change: 21.672
sumweightsdiff -0.0350289
loss change 21.672
estimatedLossChangeFromW 21.7974
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5: cannot be used
forward kernel 6 time: 1ms
forward kernel 7 time: 159ms
forward layer selected kernel 1
forward kernel 0: cannot be used
forward kernel 1 time: 0ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 0ms
forward kernel 5: cannot be used
forward kernel 6 time: 0ms
forward kernel 7 time: 70ms
forward layer selected kernel 1
backward try kernel 3
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackwardAuto: kernel 3 155ms
calcGradWeights try kernel 3
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=5 -D gInputSizeSquared=25 -D gNumFilters=1 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=5 -D gOutputSizeSquared=25 -D gPadZeros=1 -D gMargin=1 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=5 -DgInputStripeOuterNumRows=9 -DgInputStripeInnerSize=25 -DgInputStripeOuterSize=45 -DgInputStripeMarginSize=10 -DgOutputStripeNumRows=5 -DgOutputStripeSize=25
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
calcGradWeights try kernel 3
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=5 -D gInputSizeSquared=25 -D gNumFilters=1 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=5 -D gOutputSizeSquared=25 -D gPadZeros=1 -D gMargin=1 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=5 -DgInputStripeOuterNumRows=9 -DgInputStripeInnerSize=25 -DgInputStripeOuterSize=45 -DgInputStripeMarginSize=10 -DgOutputStripeNumRows=5 -DgOutputStripeSize=25
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
loss 586.349 loss2 565.324 change: 21.025
sumweightsdiff -0.0345262
loss change 21.025
estimatedLossChangeFromW 21.2378
backward kernel 0: cannot be used
backward kernel 1 time: 0ms
backward kernel 2 time: 0ms
backward kernel 3 time: 155ms
backward layer selected kernel 1
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 153ms
calcGradWeights try kernel 4
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 71ms
loss 565.324 loss2 545.133 change: 20.1916
sumweightsdiff -0.0338754
loss change 20.1916
estimatedLossChangeFromW 20.3956
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 153ms
calcGradWeights layer selected kernel 1
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 71ms
calcGradWeights layer selected kernel 1
loss 545.133 loss2 525.742 change: 19.3912
sumweightsdiff -0.0332378
loss change 19.3912
estimatedLossChangeFromW 19.5872
loss 525.742 loss2 507.119 change: 18.6229
sumweightsdiff -0.0326132
loss change 18.6229
estimatedLossChangeFromW 18.8111
loss 507.119 loss2 489.233 change: 17.8853
sumweightsdiff -0.032001
loss change 17.8853
estimatedLossChangeFromW 18.066
loss 489.233 loss2 472.056 change: 17.1772
sumweightsdiff -0.0314012
loss change 17.1772
estimatedLossChangeFromW 17.3506
loss 472.056 loss2 455.559 change: 16.4975
sumweightsdiff -0.0308135
loss change 16.4975
estimatedLossChangeFromW 16.6639
loss 455.559 loss2 439.714 change: 15.8447
sumweightsdiff -0.0302379
loss change 15.8447
estimatedLossChangeFromW 16.0046
loss 439.714 loss2 424.416 change: 15.2976
sumweightsdiff -0.0296733
loss change 15.2976
estimatedLossChangeFromW 15.3717
loss 424.416 loss2 409.545 change: 14.871
sumweightsdiff -0.0299227
loss change 14.871
estimatedLossChangeFromW 15.0234
loss 409.545 loss2 395.271 change: 14.274
sumweightsdiff -0.0293575
loss change 14.274
estimatedLossChangeFromW 14.4202
loss 395.271 loss2 381.57 change: 13.7013
sumweightsdiff -0.0288033
loss change 13.7013
estimatedLossChangeFromW 13.8415
loss 381.57 loss2 368.418 change: 13.1519
sumweightsdiff -0.0282608
loss change 13.1519
estimatedLossChangeFromW 13.2864
loss 368.418 loss2 355.794 change: 12.6248
sumweightsdiff -0.0277294
loss change 12.6248
estimatedLossChangeFromW 12.7538
loss 355.794 loss2 343.675 change: 12.119
sumweightsdiff -0.027209
loss change 12.119
estimatedLossChangeFromW 12.2429
loss 343.675 loss2 332.041 change: 11.634
sumweightsdiff -0.0266991
loss change 11.634
estimatedLossChangeFromW 11.7526
loss 332.041 loss2 320.872 change: 11.1684
sumweightsdiff -0.0261997
loss change 11.1684
estimatedLossChangeFromW 11.2823
loss 320.872 loss2 310.15 change: 10.7218
sumweightsdiff -0.0257105
loss change 10.7218
estimatedLossChangeFromW 10.8312
clblas teardown
[ OK ] testbackward.checknumerically_imagesize5_filter3_relu (2769 ms)
[ RUN ] testbackward.compare_1_n_kgsgo_32c5
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
-D BIASED -D gNumInputPlanes=32 -D gInputPlanes=32 -D gInputSize=19 -D gInputSizeSquared=361 -D gNumFilters=32 -D gFilterSize=5 -D gHalfFilterSize=2 -D gFilterSizeSquared=25 -D gNumOutputPlanes=32 -D gOutputPlanes=32 -D gOutputSize=19 -D gOutputSizeSquared=361 -D gPadZeros=1 -D gMargin=2 -D gEven=0 -D gSkip=0
batchsize=8 LayerDimensions{ inputPlanes=32 inputSize=19 numFilters=32 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
LayerDimensions{ inputPlanes=32 inputSize=19 numFilters=32 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
output[0]=-0.0308112 -0.0308112 SAME || -0.129603 || -0.048413 || 0.07916 || -0.118675 || 0.0416933 || 0.100887 || -0.106013
output[1]=-0.0574008 -0.0574008 SAME || 0.099984 || 0.0155394 || 0.00411644 || 0.131031 || -0.0107744 || 0.121347 || 0.0437087
output[2]=-0.0227139 -0.0227139 SAME || -0.0115189 || -0.190989 || -0.0445787 || -0.013341 || -0.04953 || -0.109186 || 0.104814
output[3]=-0.0805896 -0.0805896 SAME || 0.0216207 || -0.128649 || -0.0159031 || 0.0534839 || 0.0301581 || 0.104269 || -0.0841106
output[4]=-0.0723994 -0.0723994 SAME || -0.0164838 || -0.00649171 || -0.042007 || 0.147102 || -0.0702085 || -0.0120931 || 0.0597854
output[5]=0.130336 0.130336 SAME || -0.0816751 || -0.272227 || 0.0707071 || 0.133967 || 0.0323092 || 0.124248 || -0.0138626
output[6]=-0.00415662 -0.00415662 SAME || -0.0920411 || 0.0352436 || 0.0541946 || 0.00491123 || -0.0805987 || 0.0834764 || 0.0631893
output[7]=-0.0915931 -0.0915931 SAME || -0.0358497 || 0.0445722 || -0.0472172 || 0.0778742 || -0.0550363 || -0.179262 || -0.0812755
output[8]=0.0556533 0.0556533 SAME || -0.0684331 || -0.0243033 || -0.0822076 || -0.0104788 || -0.043145 || -0.0481164 || 0.0538944
output[9]=-0.0725742 -0.0725742 SAME || 0.0486592 || -0.0286811 || -0.0249626 || 0.0394469 || -0.144496 || 0.0909432 || -0.0152857
output[10]=-0.0153476 -0.0153476 SAME || -0.0677297 || -0.140709 || -0.0161164 || 0.131645 || 0.0545684 || -0.0210541 || 0.0611338
output[11]=-0.0212713 -0.0212713 SAME || 0.100494 || 0.2122 || -0.0812487 || 0.0532493 || -0.0183774 || -0.0937923 || -0.069912
output[12]=0.0389741 0.0389741 SAME || 0.0809882 || 0.0370538 || 0.0241565 || -0.0582968 || 0.0437625 || 0.139931 || -0.065007
output[13]=0.0349705 0.0349705 SAME || -0.0251775 || -0.0759114 || 0.0945214 || 0.00389841 || -0.0377205 || 0.17624 || -0.114476
output[14]=0.0366689 0.0366689 SAME || -0.0348694 || -0.0581568 || 0.0376178 || -0.0298947 || -0.0299259 || -0.0913825 || -0.0745193
output[15]=0.0186965 0.0186965 SAME || 0.0281147 || 0.00937999 || 0.108983 || -0.0505074 || -0.0573388 || 0.067382 || 0.0387854
output[16]=0.0658136 0.0658136 SAME || -0.0412163 || -0.128719 || 0.150029 || 0.0555238 || -0.0203267 || -0.0795422 || -0.123847
output[17]=0.0705919 0.0705919 SAME || 0.147334 || 0.151016 || -0.0122364 || 0.0360484 || -0.0609187 || 0.0166715 || -0.141399
output[18]=-0.0508929 -0.0508929 SAME || 0.0131358 || -0.0101773 || -0.120741 || -0.00821514 || 0.00894922 || -0.117651 || 0.0631629
output[19]=-0.0110406 -0.0110406 SAME || 0.189081 || 0.0665268 || 0.0622702 || 0.151629 || -0.0172241 || -0.0215623 || 0.0457666
clblas teardown
instance 2
batchsize=8 LayerDimensions{ inputPlanes=32 inputSize=19 numFilters=32 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/backward_cached.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
LayerDimensions{ inputPlanes=32 inputSize=19 numFilters=32 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
output[0]=-0.0308112 -0.0308112 SAME || -0.129603 || -0.048413 || 0.07916 || -0.118675 || 0.0416933 || 0.100887 || -0.106013
output[1]=-0.0574008 -0.0574008 SAME || 0.099984 || 0.0155394 || 0.00411644 || 0.131031 || -0.0107744 || 0.121347 || 0.0437087
output[2]=-0.0227139 -0.0227139 SAME || -0.0115189 || -0.190989 || -0.0445787 || -0.013341 || -0.04953 || -0.109186 || 0.104814
output[3]=-0.0805896 -0.0805896 SAME || 0.0216207 || -0.128649 || -0.0159031 || 0.0534839 || 0.0301581 || 0.104269 || -0.0841106
output[4]=-0.0723994 -0.0723994 SAME || -0.0164838 || -0.00649171 || -0.042007 || 0.147102 || -0.0702085 || -0.0120931 || 0.0597854
output[5]=0.130336 0.130336 SAME || -0.0816751 || -0.272227 || 0.0707071 || 0.133967 || 0.0323092 || 0.124248 || -0.0138626
output[6]=-0.00415662 -0.00415662 SAME || -0.0920411 || 0.0352436 || 0.0541946 || 0.00491123 || -0.0805987 || 0.0834764 || 0.0631893
output[7]=-0.0915931 -0.0915931 SAME || -0.0358497 || 0.0445722 || -0.0472172 || 0.0778742 || -0.0550363 || -0.179262 || -0.0812755
output[8]=0.0556533 0.0556533 SAME || -0.0684331 || -0.0243033 || -0.0822076 || -0.0104788 || -0.043145 || -0.0481164 || 0.0538944
output[9]=-0.0725742 -0.0725742 SAME || 0.0486592 || -0.0286811 || -0.0249626 || 0.0394469 || -0.144496 || 0.0909432 || -0.0152857
output[10]=-0.0153476 -0.0153476 SAME || -0.0677297 || -0.140709 || -0.0161164 || 0.131645 || 0.0545684 || -0.0210541 || 0.0611338
output[11]=-0.0212713 -0.0212713 SAME || 0.100494 || 0.2122 || -0.0812487 || 0.0532493 || -0.0183774 || -0.0937923 || -0.069912
output[12]=0.0389741 0.0389741 SAME || 0.0809882 || 0.0370538 || 0.0241565 || -0.0582968 || 0.0437625 || 0.139931 || -0.065007
output[13]=0.0349705 0.0349705 SAME || -0.0251775 || -0.0759114 || 0.0945214 || 0.00389841 || -0.0377205 || 0.17624 || -0.114476
output[14]=0.0366689 0.0366689 SAME || -0.0348694 || -0.0581568 || 0.0376178 || -0.0298947 || -0.0299259 || -0.0913825 || -0.0745193
output[15]=0.0186965 0.0186965 SAME || 0.0281147 || 0.00937999 || 0.108983 || -0.0505074 || -0.0573388 || 0.067382 || 0.0387854
output[16]=0.0658136 0.0658136 SAME || -0.0412163 || -0.128719 || 0.150029 || 0.0555238 || -0.0203267 || -0.0795422 || -0.123847
output[17]=0.0705919 0.0705919 SAME || 0.147334 || 0.151016 || -0.0122364 || 0.0360484 || -0.0609187 || 0.0166715 || -0.141399
output[18]=-0.0508929 -0.0508929 SAME || 0.0131358 || -0.0101773 || -0.120741 || -0.00821514 || 0.00894922 || -0.117651 || 0.0631629
output[19]=-0.0110406 -0.0110406 SAME || 0.189081 || 0.0665268 || 0.0622702 || 0.151629 || -0.0172241 || -0.0215623 || 0.0457666
clblas teardown
instance 3
batchsize=8 LayerDimensions{ inputPlanes=32 inputSize=19 numFilters=32 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
LayerDimensions{ inputPlanes=32 inputSize=19 numFilters=32 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
output[0]=-0.0308112 -0.0308112 SAME || -0.129603 || -0.048413 || 0.07916 || -0.118675 || 0.0416933 || 0.100887 || -0.106013
output[1]=-0.0574008 -0.0574008 SAME || 0.0999841 || 0.0155394 || 0.00411649 || 0.131031 || -0.0107744 || 0.121347 || 0.0437087
output[2]=-0.0227139 -0.0227139 SAME || -0.0115189 || -0.190989 || -0.0445787 || -0.013341 || -0.0495299 || -0.109186 || 0.104814
output[3]=-0.0805896 -0.0805895 SAME || 0.0216206 || -0.128649 || -0.0159031 || 0.053484 || 0.0301581 || 0.104269 || -0.0841106
output[4]=-0.0723994 -0.0723994 SAME || -0.0164838 || -0.00649171 || -0.042007 || 0.147102 || -0.0702085 || -0.0120931 || 0.0597853
output[5]=0.130336 0.130336 SAME || -0.0816751 || -0.272227 || 0.0707071 || 0.133967 || 0.0323092 || 0.124248 || -0.0138626
output[6]=-0.00415662 -0.00415662 SAME || -0.0920411 || 0.0352437 || 0.0541946 || 0.00491122 || -0.0805988 || 0.0834764 || 0.0631893
output[7]=-0.0915931 -0.0915931 SAME || -0.0358497 || 0.0445722 || -0.0472172 || 0.0778742 || -0.0550363 || -0.179262 || -0.0812755
output[8]=0.0556533 0.0556533 SAME || -0.0684331 || -0.0243033 || -0.0822076 || -0.0104788 || -0.043145 || -0.0481164 || 0.0538944
output[9]=-0.0725742 -0.0725742 SAME || 0.0486591 || -0.0286811 || -0.0249626 || 0.0394469 || -0.144496 || 0.0909432 || -0.0152857
output[10]=-0.0153476 -0.0153476 SAME || -0.0677297 || -0.140709 || -0.0161163 || 0.131645 || 0.0545684 || -0.0210541 || 0.0611338
output[11]=-0.0212713 -0.0212713 SAME || 0.100494 || 0.2122 || -0.0812488 || 0.0532493 || -0.0183774 || -0.0937924 || -0.069912
output[12]=0.0389741 0.0389741 SAME || 0.0809881 || 0.0370538 || 0.0241565 || -0.0582968 || 0.0437625 || 0.139931 || -0.0650069
output[13]=0.0349705 0.0349705 SAME || -0.0251774 || -0.0759114 || 0.0945214 || 0.00389844 || -0.0377205 || 0.17624 || -0.114476
output[14]=0.0366689 0.0366688 SAME || -0.0348695 || -0.0581568 || 0.0376178 || -0.0298947 || -0.0299259 || -0.0913827 || -0.0745193
output[15]=0.0186965 0.0186966 SAME || 0.0281147 || 0.00937996 || 0.108983 || -0.0505074 || -0.0573388 || 0.067382 || 0.0387854
output[16]=0.0658136 0.0658136 SAME || -0.0412163 || -0.128719 || 0.150029 || 0.0555237 || -0.0203267 || -0.0795422 || -0.123847
output[17]=0.0705919 0.0705919 SAME || 0.147334 || 0.151016 || -0.0122364 || 0.0360484 || -0.0609187 || 0.0166715 || -0.141399
output[18]=-0.0508929 -0.0508929 SAME || 0.0131358 || -0.0101772 || -0.120741 || -0.00821514 || 0.00894922 || -0.117651 || 0.0631629
output[19]=-0.0110406 -0.0110407 SAME || 0.189081 || 0.0665268 || 0.0622704 || 0.151629 || -0.0172242 || -0.0215623 || 0.0457666
clblas teardown
[ OK ] testbackward.compare_1_n_kgsgo_32c5 (1574 ms)
[----------] 12 tests from testbackward (8159 ms total)
[----------] 6 tests from testsinglebatch
[ RUN ] testsinglebatch.imagesize5_filtersize3_batchsize2
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
layer 0:InputLayer{ outputPlanes=1 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=5 numFilters=5 filterSize=3 outputSize=3 padZeros=0 biased=1 skip=0} }
layer 2:ActivationLayer{ LINEAR }
layer 3:FullyConnectedLayer{ numPlanes=5 imageSize=1 }
layer 4:ActivationLayer{ TANH }
layer 5:SquareLossLayer{}
Parameters overview: (skipping 4 layers with 0 params)
layer 1: params=50 17.9%
layer 3: params=230 82.1%
TOTAL : params=280
weightsTotalSize=280
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
backward try kernel 2
cl/backward_cached.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 2 0ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 9ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 2ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 0ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 0ms
forward try kernel 5
ForwardAuto: kernel 5: this instance cant be used: For ForwardFc, filtersize and inputimagesize must be identical
... not valid
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 2ms
backward try kernel 3
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackwardAuto: kernel 3 148ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=5 -D gInputPlanes=5 -D gInputSize=3 -D gInputSizeSquared=9 -D gNumFilters=5 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=5 -D gOutputPlanes=5 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=3 -DgInputStripeOuterNumRows=7 -DgInputStripeInnerSize=9 -DgInputStripeOuterSize=21 -DgInputStripeMarginSize=6 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=5 -D gInputSizeSquared=25 -D gNumFilters=5 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=5 -D gOutputPlanes=5 -D gOutputSize=3 -D gOutputSizeSquared=9 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=5 -DgInputStripeOuterNumRows=9 -DgInputStripeInnerSize=25 -DgInputStripeOuterSize=45 -DgInputStripeMarginSize=10 -DgOutputStripeNumRows=3 -DgOutputStripeSize=9
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 116ms
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 0ms
forward kernel 5: cannot be used
forward kernel 6 time: 1ms
forward kernel 7 time: 116ms
forward layer selected kernel 4
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 61ms
backward kernel 0: cannot be used
backward kernel 1 time: 1ms
backward kernel 2 time: 0ms
backward kernel 3 time: 148ms
backward layer selected kernel 2
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 299ms
calcGradWeights try kernel 4
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 216ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 0ms
forward kernel 3 time: 1ms
forward kernel 4 time: 0ms
forward kernel 5 time: 2ms
forward kernel 6 time: 1ms
forward kernel 7 time: 61ms
forward layer selected kernel 2
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 9ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 299ms
calcGradWeights layer selected kernel 1
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 2ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 216ms
calcGradWeights layer selected kernel 1
batch time 2711 ms
dump enabled=0
clblas teardown
[ OK ] testsinglebatch.imagesize5_filtersize3_batchsize2 (2994 ms)
[ RUN ] testsinglebatch.imagesize5_filtersize3_batchsize2_10filters
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
layer 0:InputLayer{ outputPlanes=1 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=5 numFilters=10 filterSize=3 outputSize=3 padZeros=0 biased=1 skip=0} }
layer 2:ActivationLayer{ RELU }
layer 3:FullyConnectedLayer{ numPlanes=10 imageSize=1 }
layer 4:ActivationLayer{ TANH }
layer 5:SquareLossLayer{}
Parameters overview: (skipping 4 layers with 0 params)
layer 1: params=100 9.9%
layer 3: params=910 90.1%
TOTAL : params=1010
weightsTotalSize=1010
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
backward try kernel 2
cl/backward_cached.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 2 0ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 4ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 0ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 3ms
forward try kernel 5
ForwardAuto: kernel 5: this instance cant be used: For ForwardFc, filtersize and inputimagesize must be identical
... not valid
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 3ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
backward try kernel 3
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackwardAuto: kernel 3 155ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=10 -D gInputPlanes=10 -D gInputSize=3 -D gInputSizeSquared=9 -D gNumFilters=10 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=10 -D gOutputPlanes=10 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=3 -DgInputStripeOuterNumRows=7 -DgInputStripeInnerSize=9 -DgInputStripeOuterSize=21 -DgInputStripeMarginSize=6 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=5 -D gInputSizeSquared=25 -D gNumFilters=10 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=10 -D gOutputPlanes=10 -D gOutputSize=3 -D gOutputSizeSquared=9 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=5 -DgInputStripeOuterNumRows=9 -DgInputStripeInnerSize=25 -DgInputStripeOuterSize=45 -DgInputStripeMarginSize=10 -DgOutputStripeNumRows=3 -DgOutputStripeSize=9
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 0ms
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 114ms
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 2ms
forward kernel 0: cannot be used
forward kernel 1 time: 0ms
forward kernel 2 time: 0ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5: cannot be used
forward kernel 6 time: 3ms
forward kernel 7 time: 114ms
forward layer selected kernel 1
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 59ms
backward kernel 0: cannot be used
backward kernel 1 time: 1ms
backward kernel 2 time: 0ms
backward kernel 3 time: 155ms
backward layer selected kernel 2
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 289ms
calcGradWeights try kernel 4
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 212ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 0ms
forward kernel 3 time: 1ms
forward kernel 4 time: 3ms
forward kernel 5 time: 1ms
forward kernel 6 time: 2ms
forward kernel 7 time: 59ms
forward layer selected kernel 2
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 4ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 289ms
calcGradWeights layer selected kernel 1
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 0ms
calcGradWeights kernel 3 time: 0ms
calcGradWeights kernel 4 time: 212ms
calcGradWeights layer selected kernel 2
tolerance 1.3
DIFF, epoch=68 :
losschangefromw 2.72785e-05
actual loss change 1.83154e-05
tolerance 1.3
DIFF, epoch=78 :
losschangefromw 1.6472e-05
actual loss change 7.9358e-06
tolerance 1.3
DIFF, epoch=91 :
losschangefromw 1.24148e-05
actual loss change 4.21237e-06
batch time 3499 ms
dump enabled=0
D:\DeepCL\test\testsinglebatch.cpp(221): error: Value of: allOk
Actual: false
Expected: true
clblas teardown
[ FAILED ] testsinglebatch.imagesize5_filtersize3_batchsize2_10filters (3896 ms)
[ RUN ] testsinglebatch.imagesize28
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
layer 0:InputLayer{ outputPlanes=1 outputSize=28 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=28 numFilters=10 filterSize=3 outputSize=26 padZeros=0 biased=1 skip=0} }
layer 2:ActivationLayer{ RELU }
layer 3:FullyConnectedLayer{ numPlanes=10 imageSize=1 }
layer 4:ActivationLayer{ TANH }
layer 5:SquareLossLayer{}
Parameters overview: (skipping 4 layers with 0 params)
layer 1: params=100 0.1%
layer 3: params=67610 99.9%
TOTAL : params=67710
weightsTotalSize=67710
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 2ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 2
ForwardAuto: kernel 2: this instance cant be used: cannot use forward2, since outputimagesize * outputimagesize > maxworkgroupsize
... not valid
forward try kernel 3
ForwardAuto: kernel 3: this instance cant be used: cannot use forward3, since outputimagesize * outputimagesize > maxworkgroupsize
... not valid
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 5ms
forward try kernel 5
ForwardAuto: kernel 5: this instance cant be used: For ForwardFc, filtersize and inputimagesize must be identical
... not valid
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 10ms
backward try kernel 2
BackwardAuto: kernel 2: this instance cant be used: cannot use BackwardGpuCached, since inputSize * inputSize > maxworkgroupsize
... not valid
backward try kernel 3
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackwardAuto: kernel 3 170ms
calcGradWeights try kernel 2
BackpropWeightsAuto: kernel 2: this instance cant be used: cannot use BackpropWeightsScratch, since filterSize * filterSize > maxworkgroupsize
... not valid
calcGradWeights try kernel 3
BackpropWeightsAuto: kernel 3: this instance cant be used: cannot use BackpropWeightsScratchLarge, since filterSize * filterSize > maxworkgroupsize
... not valid
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 328ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 157ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 4ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2: cannot be used
forward kernel 3: cannot be used
forward kernel 4 time: 1ms
forward kernel 5: cannot be used
forward kernel 6 time: 1ms
forward kernel 7 time: 157ms
forward layer selected kernel 1
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 2ms
backward kernel 0: cannot be used
backward kernel 1 time: 1ms
backward kernel 2: cannot be used
backward kernel 3 time: 170ms
backward layer selected kernel 1
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2: cannot be used
calcGradWeights kernel 3: cannot be used
calcGradWeights kernel 4 time: 328ms
calcGradWeights layer selected kernel 1
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=28 -D gInputSizeSquared=784 -D gNumFilters=10 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=10 -D gOutputPlanes=10 -D gOutputSize=26 -D gOutputSizeSquared=676 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=28 -DgInputStripeOuterNumRows=32 -DgInputStripeInnerSize=784 -DgInputStripeOuterSize=896 -DgInputStripeMarginSize=56 -DgOutputStripeNumRows=26 -DgOutputStripeSize=676
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 3ms
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 136ms
calcGradWeights try kernel 4
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 185ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 5ms
forward kernel 3 time: 10ms
forward kernel 4 time: 4ms
forward kernel 5 time: 2ms
forward kernel 6 time: 3ms
forward kernel 7 time: 136ms
forward layer selected kernel 1
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 2ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 185ms
calcGradWeights layer selected kernel 2
batch time 3843 ms
dump enabled=0
clblas teardown
[ OK ] testsinglebatch.imagesize28 (4129 ms)
[ RUN ] testsinglebatch.imagesize28_filtersize5
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
layer 0:InputLayer{ outputPlanes=1 outputSize=28 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=28 numFilters=10 filterSize=5 outputSize=24 padZeros=0 biased=1 skip=0} }
layer 2:ActivationLayer{ RELU }
layer 3:FullyConnectedLayer{ numPlanes=10 imageSize=1 }
layer 4:ActivationLayer{ TANH }
layer 5:SquareLossLayer{}
Parameters overview: (skipping 4 layers with 0 params)
layer 1: params=260 0.4%
layer 3: params=57610 99.6%
TOTAL : params=57870
weightsTotalSize=57870
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 2
ForwardAuto: kernel 2: this instance cant be used: cannot use forward2, since outputimagesize * outputimagesize > maxworkgroupsize
... not valid
forward try kernel 3
ForwardAuto: kernel 3: this instance cant be used: cannot use forward3, since outputimagesize * outputimagesize > maxworkgroupsize
... not valid
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 4ms
forward try kernel 5
ForwardAuto: kernel 5: this instance cant be used: For ForwardFc, filtersize and inputimagesize must be identical
... not valid
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 2ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 11ms
backward try kernel 2
BackwardAuto: kernel 2: this instance cant be used: cannot use BackwardGpuCached, since inputSize * inputSize > maxworkgroupsize
... not valid
backward try kernel 3
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackwardAuto: kernel 3 125ms
calcGradWeights try kernel 2
BackpropWeightsAuto: kernel 2: this instance cant be used: cannot use BackpropWeightsScratch, since filterSize * filterSize > maxworkgroupsize
... not valid
calcGradWeights try kernel 3
BackpropWeightsAuto: kernel 3: this instance cant be used: cannot use BackpropWeightsScratchLarge, since filterSize * filterSize > maxworkgroupsize
... not valid
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 285ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 159ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 3ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2: cannot be used
forward kernel 3: cannot be used
forward kernel 4 time: 1ms
forward kernel 5: cannot be used
forward kernel 6 time: 2ms
forward kernel 7 time: 159ms
forward layer selected kernel 1
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
backward kernel 0: cannot be used
backward kernel 1 time: 0ms
backward kernel 2: cannot be used
backward kernel 3 time: 125ms
backward layer selected kernel 1
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2: cannot be used
calcGradWeights kernel 3: cannot be used
calcGradWeights kernel 4 time: 285ms
calcGradWeights layer selected kernel 1
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=28 -D gInputSizeSquared=784 -D gNumFilters=10 -D gFilterSize=5 -D gHalfFilterSize=2 -D gFilterSizeSquared=25 -D gNumOutputPlanes=10 -D gOutputPlanes=10 -D gOutputSize=24 -D gOutputSizeSquared=576 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=4 -DgInputStripeInnerNumRows=28 -DgInputStripeOuterNumRows=36 -DgInputStripeInnerSize=784 -DgInputStripeOuterSize=1008 -DgInputStripeMarginSize=112 -DgOutputStripeNumRows=24 -DgOutputStripeSize=576
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 5ms
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 3ms
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 198ms
calcGradWeights try kernel 4
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 318ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 4ms
forward kernel 3 time: 11ms
forward kernel 4 time: 3ms
forward kernel 5 time: 1ms
forward kernel 6 time: 3ms
forward kernel 7 time: 198ms
forward layer selected kernel 1
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 5ms
calcGradWeights kernel 4 time: 318ms
calcGradWeights layer selected kernel 1
batch time 4219 ms
dump enabled=0
clblas teardown
[ OK ] testsinglebatch.imagesize28_filtersize5 (4506 ms)
[ RUN ] testsinglebatch.imagesize5_filtersize3_batchsize2_softmax
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
layer 0:InputLayer{ outputPlanes=1 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=5 numFilters=5 filterSize=3 outputSize=5 padZeros=1 biased=1 skip=0} }
layer 2:ActivationLayer{ RELU }
layer 3:ConvolutionalLayer{ LayerDimensions{ inputPlanes=5 inputSize=5 numFilters=5 filterSize=3 outputSize=5 padZeros=1 biased=1 skip=0} }
layer 4:ActivationLayer{ RELU }
layer 5:FullyConnectedLayer{ numPlanes=5 imageSize=1 }
layer 6:SoftMaxLayer{ perPlane=0 numPlanes=5 imageSize=1 }
Parameters overview: (skipping 4 layers with 0 params)
layer 1: params=50 5.5%
layer 3: params=230 25.3%
layer 5: params=630 69.2%
TOTAL : params=910
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 1 2ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
layer 1 offset: 0
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
layer 1
from w: 0
actual: -3.37811
layer 2 offset: 50
layer 3 offset: 50
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
layer 3
from w: 0
actual: -3.37811
layer 4 offset: 280
layer 5 offset: 280
forward try kernel 5
ForwardAuto: kernel 5: this instance cant be used: For ForwardFc, filtersize and inputimagesize must be identical
... not valid
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
forward try kernel 5
ForwardAuto: kernel 5: this instance cant be used: For ForwardFc, filtersize and inputimagesize must be identical
... not valid
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
layer 5
from w: 0
actual: -3.37811
layer 6 offset: 910
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 168ms
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 75ms
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
full thisloss: 3.37811
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5: cannot be used
forward kernel 6 time: 1ms
forward kernel 7 time: 168ms
forward layer selected kernel 1
forward kernel 0: cannot be used
forward kernel 1 time: 0ms
forward kernel 2 time: 0ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5: cannot be used
forward kernel 6 time: 1ms
forward kernel 7 time: 75ms
forward layer selected kernel 1
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 65ms
backward try kernel 2
cl/backward_cached.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 2 0ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
backward try kernel 2
cl/backward_cached.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 2 1ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 0ms
layer 1 offset: 0
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5 time: 1ms
forward kernel 6 time: 1ms
forward kernel 7 time: 65ms
forward layer selected kernel 1
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 3
from w: 0
actual: 0
layer 4 offset: 280
layer 5 offset: 280
layer 5
from w: 0
actual: 0
layer 6 offset: 910
full thisloss: 3.37811
backward try kernel 3
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackwardAuto: kernel 3 148ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=5 -D gInputPlanes=5 -D gInputSize=5 -D gInputSizeSquared=25 -D gNumFilters=5 -D gFilterSize=5 -D gHalfFilterSize=2 -D gFilterSizeSquared=25 -D gNumOutputPlanes=5 -D gOutputPlanes=5 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=4 -DgInputStripeInnerNumRows=5 -DgInputStripeOuterNumRows=13 -DgInputStripeInnerSize=25 -DgInputStripeOuterSize=65 -DgInputStripeMarginSize=20 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 0ms
backward try kernel 3
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackwardAuto: kernel 3 162ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=5 -D gInputPlanes=5 -D gInputSize=5 -D gInputSizeSquared=25 -D gNumFilters=5 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=5 -D gOutputPlanes=5 -D gOutputSize=5 -D gOutputSizeSquared=25 -D gPadZeros=1 -D gMargin=1 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=5 -DgInputStripeOuterNumRows=9 -DgInputStripeInnerSize=25 -DgInputStripeOuterSize=45 -DgInputStripeMarginSize=10 -DgOutputStripeNumRows=5 -DgOutputStripeSize=25
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=5 -D gInputSizeSquared=25 -D gNumFilters=5 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=5 -D gOutputPlanes=5 -D gOutputSize=5 -D gOutputSizeSquared=25 -D gPadZeros=1 -D gMargin=1 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=5 -DgInputStripeOuterNumRows=9 -DgInputStripeInnerSize=25 -DgInputStripeOuterSize=45 -DgInputStripeMarginSize=10 -DgOutputStripeNumRows=5 -DgOutputStripeSize=25
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 0ms
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 3
from w: 0
actual: 0
layer 4 offset: 280
layer 5 offset: 280
layer 5
from w: 0
actual: 0
layer 6 offset: 910
full thisloss: 3.37811
backward kernel 0: cannot be used
backward kernel 1 time: 2ms
backward kernel 2 time: 0ms
backward kernel 3 time: 148ms
backward layer selected kernel 2
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 289ms
backward kernel 0: cannot be used
backward kernel 1 time: 0ms
backward kernel 2 time: 1ms
backward kernel 3 time: 162ms
backward layer selected kernel 1
calcGradWeights try kernel 4
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 213ms
calcGradWeights try kernel 4
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 70ms
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 3
from w: 0
actual: 0
layer 4 offset: 280
layer 5 offset: 280
layer 5
from w: 0
actual: 0
layer 6 offset: 910
full thisloss: 3.37811
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 0ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 0ms
calcGradWeights kernel 4 time: 289ms
calcGradWeights layer selected kernel 1
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 213ms
calcGradWeights layer selected kernel 1
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 0ms
calcGradWeights kernel 3 time: 0ms
calcGradWeights kernel 4 time: 70ms
calcGradWeights layer selected kernel 2
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 3
from w: 0
actual: 0
layer 4 offset: 280
layer 5 offset: 280
layer 5
from w: 0
actual: 0
layer 6 offset: 910
full thisloss: 3.37811
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 3
from w: 0
actual: 0
layer 4 offset: 280
layer 5 offset: 280
layer 5
from w: 0
actual: 0
layer 6 offset: 910
full thisloss: 3.37811
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 3
from w: 0
actual: 0
layer 4 offset: 280
layer 5 offset: 280
layer 5
from w: 0
actual: 0
layer 6 offset: 910
full thisloss: 3.37811
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 3
from w: 0
actual: 0
layer 4 offset: 280
layer 5 offset: 280
layer 5
from w: 0
actual: 0
layer 6 offset: 910
full thisloss: 3.37811
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 3
from w: 0
actual: 0
layer 4 offset: 280
layer 5 offset: 280
layer 5
from w: 0
actual: 0
layer 6 offset: 910
full thisloss: 3.37811
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 3
from w: 0
actual: 0
layer 4 offset: 280
layer 5 offset: 280
layer 5
from w: 0
actual: 0
layer 6 offset: 910
full thisloss: 3.37811
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 3
from w: 0
actual: 0
layer 4 offset: 280
layer 5 offset: 280
layer 5
from w: 0
actual: 0
layer 6 offset: 910
full thisloss: 3.37811
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 3
from w: 0
actual: 0
layer 4 offset: 280
layer 5 offset: 280
layer 5
from w: 0
actual: 0
layer 6 offset: 910
full thisloss: 3.37811
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 3
from w: 0
actual: 0
layer 4 offset: 280
layer 5 offset: 280
layer 5
from w: 0
actual: 0
layer 6 offset: 910
full thisloss: 3.37811
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 3
from w: 0
actual: 0
layer 4 offset: 280
layer 5 offset: 280
layer 5
from w: 0
actual: 0
layer 6 offset: 910
full thisloss: 3.37811
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 3
from w: 0
actual: 0
layer 4 offset: 280
layer 5 offset: 280
layer 5
from w: 0
actual: 0
layer 6 offset: 910
full thisloss: 3.37811
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 3
from w: 0
actual: 0
layer 4 offset: 280
layer 5 offset: 280
layer 5
from w: 0
actual: 0
layer 6 offset: 910
full thisloss: 3.37811
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 3
from w: 0
actual: 0
layer 4 offset: 280
layer 5 offset: 280
layer 5
from w: 0
actual: 0
layer 6 offset: 910
full thisloss: 3.37811
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 3
from w: 0
actual: 0
layer 4 offset: 280
layer 5 offset: 280
layer 5
from w: 0
actual: 0
layer 6 offset: 910
full thisloss: 3.37811
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 3
from w: 0
actual: 0
layer 4 offset: 280
layer 5 offset: 280
layer 5
from w: 0
actual: 0
layer 6 offset: 910
full thisloss: 3.37811
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 3
from w: 0
actual: 0
layer 4 offset: 280
layer 5 offset: 280
layer 5
from w: 0
actual: 0
layer 6 offset: 910
full thisloss: 3.37811
batch time 3976 ms
dump enabled=0
clblas teardown
[ OK ] testsinglebatch.imagesize5_filtersize3_batchsize2_softmax (4242 ms)
[ RUN ] testsinglebatch.imagesize4_filtersize3_batchsize2_pooling
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/pooling.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/PoolingBackwardGpuNaive.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/memset.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/pooling.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/PoolingBackwardGpuNaive.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/memset.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
layer 0:InputLayer{ outputPlanes=1 outputSize=12 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=12 numFilters=5 filterSize=3 outputSize=12 padZeros=1 biased=1 skip=0} }
layer 2:ActivationLayer{ RELU }
layer 3:PoolingLayer{ inputPlanes=5 inputSize=12 poolingSize=2 }
layer 4:ConvolutionalLayer{ LayerDimensions{ inputPlanes=5 inputSize=6 numFilters=5 filterSize=3 outputSize=6 padZeros=1 biased=1 skip=0} }
layer 5:ActivationLayer{ RELU }
layer 6:PoolingLayer{ inputPlanes=5 inputSize=6 poolingSize=2 }
layer 7:FullyConnectedLayer{ numPlanes=5 imageSize=1 }
layer 8:SoftMaxLayer{ perPlane=0 numPlanes=5 imageSize=1 }
Parameters overview: (skipping 6 layers with 0 params)
layer 1: params=50 9.8%
layer 4: params=230 45.1%
layer 7: params=230 45.1%
TOTAL : params=510
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
layer 1 offset: 0
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 0ms
layer 1
from w: 0
actual: -3.56465
layer 2 offset: 50
layer 3 offset: 50
layer 4 offset: 50
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 0ms
layer 4
from w: 0
actual: -3.56465
layer 5 offset: 280
layer 6 offset: 280
layer 7 offset: 280
forward try kernel 5
ForwardAuto: kernel 5: this instance cant be used: For ForwardFc, filtersize and inputimagesize must be identical
... not valid
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
forward try kernel 5
ForwardAuto: kernel 5: this instance cant be used: For ForwardFc, filtersize and inputimagesize must be identical
... not valid
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
layer 7
from w: 0
actual: -3.56465
layer 8 offset: 510
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 123ms
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 123ms
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
full thisloss: 3.56465
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5: cannot be used
forward kernel 6 time: 1ms
forward kernel 7 time: 123ms
forward layer selected kernel 1
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5: cannot be used
forward kernel 6 time: 1ms
forward kernel 7 time: 123ms
forward layer selected kernel 1
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardAuto: kernel 7 59ms
backward try kernel 2
cl/backward_cached.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 2 0ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
backward try kernel 2
cl/backward_cached.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 2 1ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 0ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
layer 1 offset: 0
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 0ms
forward kernel 3 time: 0ms
forward kernel 4 time: 0ms
forward kernel 5 time: 1ms
forward kernel 6 time: 1ms
forward kernel 7 time: 59ms
forward layer selected kernel 2
layer 1
from w: 0
actual: 2.38419e-07
layer 2 offset: 50
layer 3 offset: 50
layer 4 offset: 50
layer 4
from w: 0
actual: 2.38419e-07
layer 5 offset: 280
layer 6 offset: 280
layer 7 offset: 280
layer 7
from w: 0
actual: 2.38419e-07
layer 8 offset: 510
full thisloss: 3.56465
backward try kernel 3
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackwardAuto: kernel 3 142ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=5 -D gInputPlanes=5 -D gInputSize=3 -D gInputSizeSquared=9 -D gNumFilters=5 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=5 -D gOutputPlanes=5 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=3 -DgInputStripeOuterNumRows=7 -DgInputStripeInnerSize=9 -DgInputStripeOuterSize=21 -DgInputStripeMarginSize=6 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
backward try kernel 3
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackwardAuto: kernel 3 153ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=5 -D gInputPlanes=5 -D gInputSize=6 -D gInputSizeSquared=36 -D gNumFilters=5 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=5 -D gOutputPlanes=5 -D gOutputSize=6 -D gOutputSizeSquared=36 -D gPadZeros=1 -D gMargin=1 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=6 -DgInputStripeOuterNumRows=10 -DgInputStripeInnerSize=36 -DgInputStripeOuterSize=60 -DgInputStripeMarginSize=12 -DgOutputStripeNumRows=6 -DgOutputStripeSize=36
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=12 -D gInputSizeSquared=144 -D gNumFilters=5 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=5 -D gOutputPlanes=5 -D gOutputSize=12 -D gOutputSizeSquared=144 -D gPadZeros=1 -D gMargin=1 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=12 -DgInputStripeOuterNumRows=16 -DgInputStripeInnerSize=144 -DgInputStripeOuterSize=192 -DgInputStripeMarginSize=24 -DgOutputStripeNumRows=12 -DgOutputStripeSize=144
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 4 offset: 50
layer 4
from w: 0
actual: 0
layer 5 offset: 280
layer 6 offset: 280
layer 7 offset: 280
layer 7
from w: 0
actual: 0
layer 8 offset: 510
full thisloss: 3.56465
backward kernel 0: cannot be used
backward kernel 1 time: 1ms
backward kernel 2 time: 0ms
backward kernel 3 time: 142ms
backward layer selected kernel 2
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 286ms
backward kernel 0: cannot be used
backward kernel 1 time: 1ms
backward kernel 2 time: 1ms
backward kernel 3 time: 153ms
backward layer selected kernel 1
calcGradWeights try kernel 4
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 185ms
calcGradWeights try kernel 4
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
BackpropWeightsAuto: kernel 4 166ms
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 4 offset: 50
layer 4
from w: 0
actual: 0
layer 5 offset: 280
layer 6 offset: 280
layer 7 offset: 280
layer 7
from w: 0
actual: 0
layer 8 offset: 510
full thisloss: 3.56465
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 286ms
calcGradWeights layer selected kernel 1
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 0ms
calcGradWeights kernel 2 time: 0ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 185ms
calcGradWeights layer selected kernel 1
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 166ms
calcGradWeights layer selected kernel 1
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 4 offset: 50
layer 4
from w: 0
actual: 0
layer 5 offset: 280
layer 6 offset: 280
layer 7 offset: 280
layer 7
from w: 0
actual: 0
layer 8 offset: 510
full thisloss: 3.56465
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 4 offset: 50
layer 4
from w: 0
actual: 0
layer 5 offset: 280
layer 6 offset: 280
layer 7 offset: 280
layer 7
from w: 0
actual: 0
layer 8 offset: 510
full thisloss: 3.56465
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 4 offset: 50
layer 4
from w: 0
actual: 0
layer 5 offset: 280
layer 6 offset: 280
layer 7 offset: 280
layer 7
from w: 0
actual: 0
layer 8 offset: 510
full thisloss: 3.56465
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 4 offset: 50
layer 4
from w: 0
actual: 0
layer 5 offset: 280
layer 6 offset: 280
layer 7 offset: 280
layer 7
from w: 0
actual: 0
layer 8 offset: 510
full thisloss: 3.56465
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 4 offset: 50
layer 4
from w: 0
actual: 0
layer 5 offset: 280
layer 6 offset: 280
layer 7 offset: 280
layer 7
from w: 0
actual: 0
layer 8 offset: 510
full thisloss: 3.56465
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 4 offset: 50
layer 4
from w: 0
actual: 0
layer 5 offset: 280
layer 6 offset: 280
layer 7 offset: 280
layer 7
from w: 0
actual: 0
layer 8 offset: 510
full thisloss: 3.56465
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 4 offset: 50
layer 4
from w: 0
actual: 0
layer 5 offset: 280
layer 6 offset: 280
layer 7 offset: 280
layer 7
from w: 0
actual: 0
layer 8 offset: 510
full thisloss: 3.56465
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 4 offset: 50
layer 4
from w: 0
actual: 0
layer 5 offset: 280
layer 6 offset: 280
layer 7 offset: 280
layer 7
from w: 0
actual: 0
layer 8 offset: 510
full thisloss: 3.56465
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 4 offset: 50
layer 4
from w: 0
actual: 0
layer 5 offset: 280
layer 6 offset: 280
layer 7 offset: 280
layer 7
from w: 0
actual: 0
layer 8 offset: 510
full thisloss: 3.56465
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 4 offset: 50
layer 4
from w: 0
actual: 0
layer 5 offset: 280
layer 6 offset: 280
layer 7 offset: 280
layer 7
from w: 0
actual: 0
layer 8 offset: 510
full thisloss: 3.56465
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 4 offset: 50
layer 4
from w: 0
actual: 0
layer 5 offset: 280
layer 6 offset: 280
layer 7 offset: 280
layer 7
from w: 0
actual: 0
layer 8 offset: 510
full thisloss: 3.56465
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 4 offset: 50
layer 4
from w: 0
actual: 0
layer 5 offset: 280
layer 6 offset: 280
layer 7 offset: 280
layer 7
from w: 0
actual: 0
layer 8 offset: 510
full thisloss: 3.56465
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 4 offset: 50
layer 4
from w: 0
actual: 0
layer 5 offset: 280
layer 6 offset: 280
layer 7 offset: 280
layer 7
from w: 0
actual: 0
layer 8 offset: 510
full thisloss: 3.56465
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 4 offset: 50
layer 4
from w: 0
actual: 0
layer 5 offset: 280
layer 6 offset: 280
layer 7 offset: 280
layer 7
from w: 0
actual: 0
layer 8 offset: 510
full thisloss: 3.56465
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 4 offset: 50
layer 4
from w: 0
actual: 0
layer 5 offset: 280
layer 6 offset: 280
layer 7 offset: 280
layer 7
from w: 0
actual: 0
layer 8 offset: 510
full thisloss: 3.56465
layer 1 offset: 0
layer 1
from w: 0
actual: 0
layer 2 offset: 50
layer 3 offset: 50
layer 4 offset: 50
layer 4
from w: 0
actual: 0
layer 5 offset: 280
layer 6 offset: 280
layer 7 offset: 280
layer 7
from w: 0
actual: 0
layer 8 offset: 510
full thisloss: 3.56465
batch time 4485 ms
dump enabled=0
clblas teardown
[ OK ] testsinglebatch.imagesize4_filtersize3_batchsize2_pooling (4958 ms)
[----------] 6 tests from testsinglebatch (24726 ms total)
[----------] 9 tests from testpoolingforward
[ RUN ] testpoolingforward.basic
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/pooling.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testpoolingforward.basic (49 ms)
[ RUN ] testpoolingforward.basic_2plane_batchsize2
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/pooling.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testpoolingforward.basic_2plane_batchsize2 (45 ms)
[ RUN ] testpoolingforward.fromwrappers
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/pooling.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testpoolingforward.fromwrappers (51 ms)
[ RUN ] testpoolingforward.comparespecific_0_1_pooling2
instance0: 0
instance1: 1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/pooling.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testpoolingforward.comparespecific_0_1_pooling2 (57 ms)
[ RUN ] testpoolingforward.comparespecific_0_1_pooling3
instance0: 0
instance1: 1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/pooling.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testpoolingforward.comparespecific_0_1_pooling3 (63 ms)
[ RUN ] testpoolingforward.comparespecific_0_1_pooling2_pz
instance0: 0
instance1: 1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/pooling.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testpoolingforward.comparespecific_0_1_pooling2_pz (54 ms)
[ RUN ] testpoolingforward.comparespecific_0_1_pooling3_pz
instance0: 0
instance1: 1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/pooling.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testpoolingforward.comparespecific_0_1_pooling3_pz (59 ms)
[ RUN ] testpoolingforward.comparespecific_0_1_pooling3_small
instance0: 0
instance1: 1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/pooling.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testpoolingforward.comparespecific_0_1_pooling3_small (48 ms)
[ RUN ] testpoolingforward.comparespecific_0_1_pooling3_small2
instance0: 0
instance1: 1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/pooling.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testpoolingforward.comparespecific_0_1_pooling3_small2 (49 ms)
[----------] 9 tests from testpoolingforward (475 ms total)
[----------] 2 tests from testpoolingbackward
[ RUN ] testpoolingbackward.basic
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
[ OK ] testpoolingbackward.basic (5 ms)
[ RUN ] testpoolingbackward.basic_2plane_batchsize2
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
[ OK ] testpoolingbackward.basic_2plane_batchsize2 (5 ms)
[----------] 2 tests from testpoolingbackward (10 ms total)
[----------] 1 test from testNorbLoader
[ RUN ] testNorbLoader.load1000
unknown file: error: C++ exception with description "failed to open file: ..\data\norb\training-shuffled-dat.mat" thrown in the test body.
[ FAILED ] testNorbLoader.load1000 (1 ms)
[----------] 1 test from testNorbLoader (1 ms total)
[----------] 7 tests from teststringhelper
[ RUN ] teststringhelper.split
[ OK ] teststringhelper.split (0 ms)
[ RUN ] teststringhelper.split2
[ OK ] teststringhelper.split2 (0 ms)
[ RUN ] teststringhelper.split3
[ OK ] teststringhelper.split3 (0 ms)
[ RUN ] teststringhelper.tolower
[ OK ] teststringhelper.tolower (0 ms)
[ RUN ] teststringhelper.replace
[ OK ] teststringhelper.replace (0 ms)
[ RUN ] teststringhelper.replaceglobal
[ OK ] teststringhelper.replaceglobal (0 ms)
[ RUN ] teststringhelper.strcpy_safe
[ OK ] teststringhelper.strcpy_safe (0 ms)
[----------] 7 tests from teststringhelper (1 ms total)
[----------] 1 test from testGtestGlobals
[ RUN ] testGtestGlobals.basic
There are 1 parameters:
argv[0]=deepcl_unittests.exe
[ OK ] testGtestGlobals.basic (0 ms)
[----------] 1 test from testGtestGlobals (0 ms total)
[----------] 1 test from testMemset
[ RUN ] testMemset.basic
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/memset.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
myArray[0]=99
myArray[1]=99
myArray[2]=99
myArray[3]=99
myArray[4]=99
myArray[5]=99
myArray[6]=99
myArray[7]=99
myArray[8]=99
myArray[9]=99
[ OK ] testMemset.basic (43 ms)
[----------] 1 test from testMemset (43 ms total)
[----------] 2 tests from testCopyBuffer
[ RUN ] testCopyBuffer.floats
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
3
4
5
6
7
8
9
10
11
12
build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
3
4
5
6
7
8
9
10
11
12
build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
3
4
5
6
7
8
9
10
11
12
[ OK ] testCopyBuffer.floats (113 ms)
[ RUN ] testCopyBuffer.nits
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
3
4
5
6
7
8
9
10
11
12
build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
3
4
5
6
7
8
9
10
11
12
build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
3
4
5
6
7
8
9
10
11
12
[ OK ] testCopyBuffer.nits (115 ms)
[----------] 2 tests from testCopyBuffer (228 ms total)
[----------] 2 tests from testCopyBlock
[ RUN ] testCopyBlock.testPos
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
in[0]=3076
in[1]=8
in[2]=14
test/testCopyBlock.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
res[0]=3
res[1]=4
res[2]=8206
res[3]=8
res[4]=14
[ OK ] testCopyBlock.testPos (50 ms)
[ RUN ] testCopyBlock.basic
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
test/testCopyBlock.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
2 3 4
6 7 8
0 0 0 0
5 6 7
9 10 11
0 0 0 0
[ OK ] testCopyBlock.basic (52 ms)
[----------] 2 tests from testCopyBlock (102 ms total)
[----------] 1 test from testCopyLocal
[ RUN ] testCopyLocal.basic
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
0 0 0 0
test/testCopyLocal.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
1 2 3 4
5 6 7 8
9 10 11 12
0 0 0 0
[ OK ] testCopyLocal.basic (47 ms)
[----------] 1 test from testCopyLocal (48 ms total)
[----------] 8 tests from testNetdefToNet
[ RUN ] testNetdefToNet.empty
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
[ OK ] testNetdefToNet.empty (5 ms)
[ RUN ] testNetdefToNet.onefc
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testNetdefToNet.onefc (90 ms)
[ RUN ] testNetdefToNet.onefclinear
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testNetdefToNet.onefclinear (90 ms)
[ RUN ] testNetdefToNet.150n_10n
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testNetdefToNet.150n_10n (90 ms)
[ RUN ] testNetdefToNet.3xfclinear
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
nnString: [3]
repeatNum 3
remainderString [150n]
inner [150n]
multiplied string: 150n-150n-150n
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
layer 0:InputLayer{ outputPlanes=1 outputSize=19 }
layer 1:FullyConnectedLayer{ numPlanes=150 imageSize=1 }
layer 2:FullyConnectedLayer{ numPlanes=150 imageSize=1 }
layer 3:FullyConnectedLayer{ numPlanes=150 imageSize=1 }
layer 4:SoftMaxLayer{ perPlane=0 numPlanes=150 imageSize=1 }
Parameters overview: (skipping 2 layers with 0 params)
layer 1: params=54300 54.5%
layer 2: params=22650 22.7%
layer 3: params=22650 22.7%
TOTAL : params=99600
[ OK ] testNetdefToNet.3xfclinear (102 ms)
[ RUN ] testNetdefToNet.mp2_3x32c5z_10n
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
prefix: [mp2]
nnString: [3]
repeatNum 3
remainderString [32c5z-10n ]
postfix [10n ]
inner [32c5z]
multiplied string: mp2-32c5z-32c5z-32c5z-10n
cl/pooling.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/PoolingBackwardGpuNaive.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/memset.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
layer 0:InputLayer{ outputPlanes=1 outputSize=19 }
layer 1:PoolingLayer{ inputPlanes=1 inputSize=19 poolingSize=2 }
layer 2:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=9 numFilters=32 filterSize=5 outputSize=9 padZeros=1 biased=1 skip=0} }
layer 3:ConvolutionalLayer{ LayerDimensions{ inputPlanes=32 inputSize=9 numFilters=32 filterSize=5 outputSize=9 padZeros=1 biased=1 skip=0} }
layer 4:ConvolutionalLayer{ LayerDimensions{ inputPlanes=32 inputSize=9 numFilters=32 filterSize=5 outputSize=9 padZeros=1 biased=1 skip=0} }
layer 5:FullyConnectedLayer{ numPlanes=10 imageSize=1 }
layer 6:SoftMaxLayer{ perPlane=0 numPlanes=10 imageSize=1 }
Parameters overview: (skipping 3 layers with 0 params)
layer 2: params=832 1.1%
layer 3: params=25632 32.9%
layer 4: params=25632 32.9%
layer 5: params=25930 33.2%
TOTAL : params=78026
[ OK ] testNetdefToNet.mp2_3x32c5z_10n (208 ms)
[ RUN ] testNetdefToNet.3x32c5zmp2
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
nnString: [3]
repeatNum 3
remainderString [(32c5z-mp2)-10n]
inner [32c5z-mp2]
newRemainder [-10n]
postfix [10n]
multiplied string: 32c5z-mp2-32c5z-mp2-32c5z-mp2-10n
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/pooling.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/PoolingBackwardGpuNaive.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/memset.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/pooling.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/PoolingBackwardGpuNaive.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/memset.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/pooling.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/PoolingBackwardGpuNaive.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/memset.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
layer 0:InputLayer{ outputPlanes=1 outputSize=128 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=128 numFilters=32 filterSize=5 outputSize=128 padZeros=1 biased=1 skip=0} }
layer 2:PoolingLayer{ inputPlanes=32 inputSize=128 poolingSize=2 }
layer 3:ConvolutionalLayer{ LayerDimensions{ inputPlanes=32 inputSize=64 numFilters=32 filterSize=5 outputSize=64 padZeros=1 biased=1 skip=0} }
layer 4:PoolingLayer{ inputPlanes=32 inputSize=64 poolingSize=2 }
layer 5:ConvolutionalLayer{ LayerDimensions{ inputPlanes=32 inputSize=32 numFilters=32 filterSize=5 outputSize=32 padZeros=1 biased=1 skip=0} }
layer 6:PoolingLayer{ inputPlanes=32 inputSize=32 poolingSize=2 }
layer 7:FullyConnectedLayer{ numPlanes=10 imageSize=1 }
layer 8:SoftMaxLayer{ perPlane=0 numPlanes=10 imageSize=1 }
Parameters overview: (skipping 5 layers with 0 params)
layer 1: params=832 0.6%
layer 3: params=25632 19.1%
layer 5: params=25632 19.1%
layer 7: params=81930 61.1%
TOTAL : params=134026
[ OK ] testNetdefToNet.3x32c5zmp2 (442 ms)
[ RUN ] testNetdefToNet.2x32c7_3x32c5z
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
nnString: [2]
repeatNum 2
remainderString [32c7z-3*32c5z-10n]
postfix [3*32c5z-10n]
inner [32c7z]
nnString: [3]
repeatNum 3
remainderString [32c5z-10n]
postfix [10n]
inner [32c5z]
multiplied string: 32c5z-32c5z-32c5z-10n
multiplied string: 32c7z-32c7z-32c5z-32c5z-32c5z-10n
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
layer 0:InputLayer{ outputPlanes=1 outputSize=19 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=19 numFilters=32 filterSize=7 outputSize=19 padZeros=1 biased=1 skip=0} }
layer 2:ConvolutionalLayer{ LayerDimensions{ inputPlanes=32 inputSize=19 numFilters=32 filterSize=7 outputSize=19 padZeros=1 biased=1 skip=0} }
layer 3:ConvolutionalLayer{ LayerDimensions{ inputPlanes=32 inputSize=19 numFilters=32 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0} }
layer 4:ConvolutionalLayer{ LayerDimensions{ inputPlanes=32 inputSize=19 numFilters=32 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0} }
layer 5:ConvolutionalLayer{ LayerDimensions{ inputPlanes=32 inputSize=19 numFilters=32 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0} }
layer 6:FullyConnectedLayer{ numPlanes=10 imageSize=1 }
layer 7:SoftMaxLayer{ perPlane=0 numPlanes=10 imageSize=1 }
Parameters overview: (skipping 2 layers with 0 params)
layer 1: params=1600 0.7%
layer 2: params=50208 20.6%
layer 3: params=25632 10.5%
layer 4: params=25632 10.5%
layer 5: params=25632 10.5%
layer 6: params=115530 47.3%
TOTAL : params=244234
[ OK ] testNetdefToNet.2x32c7_3x32c5z (114 ms)
[----------] 8 tests from testNetdefToNet (1141 ms total)
[----------] 10 tests from testactivationforward
[ RUN ] testactivationforward.basic
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
[ OK ] testactivationforward.basic (5 ms)
[ RUN ] testactivationforward.basic_2plane_batchsize2
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
[ OK ] testactivationforward.basic_2plane_batchsize2 (5 ms)
[ RUN ] testactivationforward.fromwrappers
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testactivationforward.fromwrappers (47 ms)
[ RUN ] testactivationforward.comparespecific_0_1_activation2
instance0: 0
instance1: 1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testactivationforward.comparespecific_0_1_activation2 (48 ms)
[ RUN ] testactivationforward.comparespecific_0_1_activation3
instance0: 0
instance1: 1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testactivationforward.comparespecific_0_1_activation3 (49 ms)
[ RUN ] testactivationforward.comparespecific_0_1_activation2_pz
instance0: 0
instance1: 1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testactivationforward.comparespecific_0_1_activation2_pz (45 ms)
[ RUN ] testactivationforward.comparespecific_0_1_activation3_pz
instance0: 0
instance1: 1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testactivationforward.comparespecific_0_1_activation3_pz (46 ms)
[ RUN ] testactivationforward.comparespecific_0_1_activation3_small
instance0: 0
instance1: 1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testactivationforward.comparespecific_0_1_activation3_small (46 ms)
[ RUN ] testactivationforward.comparespecific_0_1_activation3_small2
instance0: 0
instance1: 1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testactivationforward.comparespecific_0_1_activation3_small2 (46 ms)
[ RUN ] testactivationforward.comparespecific_0_1_activation3_small2_tanh
instance0: 0
instance1: 1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testactivationforward.comparespecific_0_1_activation3_small2_tanh (81 ms)
[----------] 10 tests from testactivationforward (419 ms total)
[----------] 2 tests from testactivationbackward
[ RUN ] testactivationbackward.basic
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
gradInput=3
gradInput=0
gradInput=-2.7
gradInput=2
gradInput=-0
gradInput=2.1
gradInput=0
gradInput=-1.1
gradInput=0
[ OK ] testactivationbackward.basic (4 ms)
[ RUN ] testactivationbackward.basic_2plane_batchsize2
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
gradInput=3
gradInput=0
gradInput=0
gradInput=9
[ OK ] testactivationbackward.basic_2plane_batchsize2 (5 ms)
[----------] 2 tests from testactivationbackward (9 ms total)
[----------] 1 test from testRandomSingleton
[ RUN ] testRandomSingleton.testMockRandom
0.462129
0.930973
0.838558
0.209218
0.259291
0.157555
0.728508
0.968573
0.240331
0.914489
[ OK ] testRandomSingleton.testMockRandom (1 ms)
[----------] 1 test from testRandomSingleton (1 ms total)
[----------] 10 tests from testdropoutforward
[ RUN ] testdropoutforward.basic
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
[ OK ] testdropoutforward.basic (4 ms)
[ RUN ] testdropoutforward.basic_2plane_batchsize2
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
[ OK ] testdropoutforward.basic_2plane_batchsize2 (5 ms)
[ RUN ] testdropoutforward.fromwrappers
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
[ OK ] testdropoutforward.fromwrappers (7 ms)
[ RUN ] testdropoutforward.comparespecific_0_1_dropout2
instance0: 0
instance1: 1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/dropout.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testdropoutforward.comparespecific_0_1_dropout2 (47 ms)
[ RUN ] testdropoutforward.comparespecific_0_1_dropout3
instance0: 0
instance1: 1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/dropout.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testdropoutforward.comparespecific_0_1_dropout3 (47 ms)
[ RUN ] testdropoutforward.comparespecific_0_1_dropout2_pz
instance0: 0
instance1: 1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/dropout.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testdropoutforward.comparespecific_0_1_dropout2_pz (47 ms)
[ RUN ] testdropoutforward.comparespecific_0_1_dropout3_pz
instance0: 0
instance1: 1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/dropout.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testdropoutforward.comparespecific_0_1_dropout3_pz (48 ms)
[ RUN ] testdropoutforward.comparespecific_0_1_dropout3_small
instance0: 0
instance1: 1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/dropout.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testdropoutforward.comparespecific_0_1_dropout3_small (45 ms)
[ RUN ] testdropoutforward.comparespecific_0_1_dropout3_small2
instance0: 0
instance1: 1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/dropout.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testdropoutforward.comparespecific_0_1_dropout3_small2 (46 ms)
[ RUN ] testdropoutforward.comparespecific_0_1_dropout3_small2_tanh
instance0: 0
instance1: 1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/dropout.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testdropoutforward.comparespecific_0_1_dropout3_small2_tanh (46 ms)
[----------] 10 tests from testdropoutforward (344 ms total)
[----------] 3 tests from testdropoutbackward
[ RUN ] testdropoutbackward.basic
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/dropout.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testdropoutbackward.basic (43 ms)
[ RUN ] testdropoutbackward.basic_2plane_batchsize2
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/dropout.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testdropoutbackward.basic_2plane_batchsize2 (44 ms)
[ RUN ] testdropoutbackward.compare_args
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/dropout.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testdropoutbackward.compare_args (48 ms)
[----------] 3 tests from testdropoutbackward (136 ms total)
[----------] 1 test from testsgd
[ RUN ] testsgd.basic
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
layer 0:InputLayer{ outputPlanes=1 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:SquareLossLayer{}
inputtotalsize=50 outputTotalSize=18
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testsgd.basic (364 ms)
[----------] 1 test from testsgd (364 ms total)
[----------] 9 tests from testCLMathWrapper
[ RUN ] testCLMathWrapper.assign
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
a[0]=4
a[1]=2.1
a[2]=5
a[3]=3
a[4]=9.2
[ OK ] testCLMathWrapper.assign (47 ms)
[ RUN ] testCLMathWrapper.assignScalar
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
a[0]=3.4
a[1]=3.4
a[2]=3.4
a[3]=3.4
a[4]=3.4
[ OK ] testCLMathWrapper.assignScalar (46 ms)
[ RUN ] testCLMathWrapper.addinplace
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
a[0]=5
a[1]=5.1
a[2]=14
a[3]=15.5
a[4]=11.7
[ OK ] testCLMathWrapper.addinplace (46 ms)
[ RUN ] testCLMathWrapper.multiplyinplace
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
a[0]=1.5
a[1]=4.5
a[2]=13.5
a[3]=18.75
a[4]=3.75
[ OK ] testCLMathWrapper.multiplyinplace (46 ms)
[ RUN ] testCLMathWrapper.addscalar
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
a[0]=2.5
a[1]=4.5
a[2]=10.5
a[3]=14
a[4]=4
[ OK ] testCLMathWrapper.addscalar (45 ms)
[ RUN ] testCLMathWrapper.sqrt
cl/per_element_op1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
a[0]=1
a[1]=1.73205
a[2]=3
a[3]=3.53553
a[4]=1.58114
[ OK ] testCLMathWrapper.sqrt (46 ms)
[ RUN ] testCLMathWrapper.squared
cl/per_element_op1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
a[0]=1
a[1]=9
a[2]=81
a[3]=156.25
a[4]=6.25
[ OK ] testCLMathWrapper.squared (45 ms)
[ RUN ] testCLMathWrapper.inverse
cl/per_element_op1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
a[0]=1
a[1]=0.333333
a[2]=0.111111
a[3]=0.08
a[4]=0.4
[ OK ] testCLMathWrapper.inverse (45 ms)
[ RUN ] testCLMathWrapper.perelementmult
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
a[0]=4
a[1]=6.3
a[2]=45
a[3]=37.5
a[4]=23
[ OK ] testCLMathWrapper.perelementmult (47 ms)
[----------] 9 tests from testCLMathWrapper (413 ms total)
[----------] 1 test from testreducesegments
[ RUN ] testreducesegments.basic
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testreducesegments.basic (46 ms)
[----------] 1 test from testreducesegments (46 ms total)
[----------] 4 tests from testGpuOp
[ RUN ] testGpuOp.addinplace
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
a[0]=5
a[1]=5.1
a[2]=14
a[3]=15.5
a[4]=11.7
[ OK ] testGpuOp.addinplace (46 ms)
[ RUN ] testGpuOp.addoutofplace
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
a[0]=1
a[1]=3
a[2]=9
a[3]=12.5
a[4]=2.5
c[0]=5
c[1]=5.1
c[2]=14
c[3]=15.5
c[4]=11.7
[ OK ] testGpuOp.addoutofplace (45 ms)
[ RUN ] testGpuOp.inverse
cl/per_element_op1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
a[0]=1
a[1]=0.333333
a[2]=0.111111
a[3]=0.08
a[4]=0.4
[ OK ] testGpuOp.inverse (46 ms)
[ RUN ] testGpuOp.addscalarinplace
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
a[0]=5.2
a[1]=7.2
a[2]=13.2
a[3]=16.7
a[4]=6.7
[ OK ] testGpuOp.addscalarinplace (45 ms)
[----------] 4 tests from testGpuOp (182 ms total)
[----------] 1 test from testjpeghelper
[ RUN ] testjpeghelper.writeread
[ OK ] testjpeghelper.writeread (5 ms)
[----------] 1 test from testjpeghelper (5 ms total)
[----------] Global test environment tear-down
[==========] 160 tests from 29 test cases ran. (107904 ms total)
[ PASSED ] 158 tests.
[ FAILED ] 2 tests, listed below:
[ FAILED ] testsinglebatch.imagesize5_filtersize3_batchsize2_10filters
[ FAILED ] testNorbLoader.load1000
2 FAILED TESTS
YOU HAVE 2 DISABLED TESTS
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment