-
-
Save jakakonda/590a71d0175e79b8bdda to your computer and use it in GitHub Desktop.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
args: deepcl_unittests.exe --gtest_filter=-SLOW* | |
Note: Google Test filter = -SLOW* | |
[==========] Running 160 tests from 29 test cases. | |
[----------] Global test environment set-up. | |
[----------] 8 tests from testClBlas | |
[ RUN ] testClBlas.basic | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
clblas teardown | |
[ OK ] testClBlas.basic (402 ms) | |
[ RUN ] testClBlas.basicthrice | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
clblas teardown | |
initializing clblas | |
clblas teardown | |
initializing clblas | |
clblas teardown | |
[ OK ] testClBlas.basicthrice (166 ms) | |
[ RUN ] testClBlas.transA | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
1 2 9 | |
3 7 5 | |
initializing clblas | |
clblas teardown | |
[ OK ] testClBlas.transA (57 ms) | |
[ RUN ] testClBlas.transB | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
3 | |
-1 | |
initializing clblas | |
clblas teardown | |
[ OK ] testClBlas.transB (55 ms) | |
[ RUN ] testClBlas.colMajor | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
&AWrap->getBuffer()1019762231104 | |
&BWrap->getBuffer()1019762232832 | |
&CWrap->getBuffer()1019762232160 | |
clblas teardown | |
[ OK ] testClBlas.colMajor (59 ms) | |
[ RUN ] testClBlas.colMajor2 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
clblas teardown | |
[ OK ] testClBlas.colMajor2 (59 ms) | |
[ RUN ] testClBlas.colMajorTransA | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
clblas teardown | |
[ OK ] testClBlas.colMajorTransA (53 ms) | |
[ RUN ] testClBlas.colMajorTransB | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
clblas teardown | |
[ OK ] testClBlas.colMajorTransB (54 ms) | |
[----------] 8 tests from testClBlas (905 ms total) | |
[----------] 1 test from testDeepCL | |
[ RUN ] testDeepCL.basic | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
expected number of output: 4 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
clblas teardown | |
[ OK ] testDeepCL.basic (259 ms) | |
[----------] 1 test from testDeepCL (259 ms total) | |
[----------] 23 tests from testupdateweights | |
[ RUN ] testupdateweights.conv1 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/copy.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
layer 0:InputLayer{ outputPlanes=2 outputSize=5 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=5 numFilters=2 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} } | |
layer 2:SquareLossLayer{} | |
layer 0:InputLayer{ outputPlanes=2 outputSize=5 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=5 numFilters=2 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} } | |
layer 2:SquareLossLayer{} | |
batchSize: 4 | |
inputtotalsize=200 outputTotalSize=72 | |
layer ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=5 numFilters=2 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} } | |
weightsize=36 biassize=0 | |
statefultimer v0.7 | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 0ms | |
layer 0:InputLayer{ outputPlanes=2 outputSize=5 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=5 numFilters=2 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} } | |
layer 2:SquareLossLayer{} | |
Parameters overview: (skipping 2 layers with 0 params) | |
layer 1: params=36 100.0% | |
TOTAL : params=36 | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 0ms | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 0ms | |
idx=8 predicted losschange=0.000111445 actual=0.000112534 | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 1ms | |
idx=13 predicted losschange=-0.000886715 actual=-0.000884056 | |
forward try kernel 4 | |
cl/forward4.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 4 0ms | |
idx=0 predicted losschange=0.000210491 actual=0.000212669 | |
forward try kernel 5 | |
ForwardAuto: kernel 5: this instance cant be used: For ForwardFc, filtersize and inputimagesize must be identical | |
... not valid | |
forward try kernel 6 | |
cl/forward_byinputplane.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 6 1ms | |
idx=22 predicted losschange=-0.000164224 actual=-0.000163078 | |
forward try kernel 7 | |
... seems valid | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
ForwardAuto: kernel 7 126ms | |
idx=22 predicted losschange=-0.000164224 actual=-0.000163078 | |
forward kernel 0: cannot be used | |
forward kernel 1 time: 0ms | |
forward kernel 2 time: 0ms | |
forward kernel 3 time: 1ms | |
forward kernel 4 time: 0ms | |
forward kernel 5: cannot be used | |
forward kernel 6 time: 1ms | |
forward kernel 7 time: 126ms | |
forward layer selected kernel 1 | |
idx=35 predicted losschange=-0.000391028 actual=-0.000391006 | |
idx=26 predicted losschange=2.23142e-05 actual=2.57492e-05 | |
idx=27 predicted losschange=9.38328e-05 actual=9.44138e-05 | |
idx=27 predicted losschange=9.38328e-05 actual=9.44138e-05 | |
idx=10 predicted losschange=0.00186697 actual=0.00187111 | |
clblas teardown | |
[ OK ] testupdateweights.conv1 (886 ms) | |
[ RUN ] testupdateweights.conv1z | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/copy.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
layer 0:InputLayer{ outputPlanes=2 outputSize=3 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=3 numFilters=2 filterSize=3 outputSize=3 padZeros=1 biased=0 skip=0} } | |
layer 2:SquareLossLayer{} | |
layer 0:InputLayer{ outputPlanes=2 outputSize=3 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=3 numFilters=2 filterSize=3 outputSize=3 padZeros=1 biased=0 skip=0} } | |
layer 2:SquareLossLayer{} | |
batchSize: 4 | |
inputtotalsize=72 outputTotalSize=72 | |
layer ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=3 numFilters=2 filterSize=3 outputSize=3 padZeros=1 biased=0 skip=0} } | |
weightsize=36 biassize=0 | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 0ms | |
layer 0:InputLayer{ outputPlanes=2 outputSize=3 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=3 numFilters=2 filterSize=3 outputSize=3 padZeros=1 biased=0 skip=0} } | |
layer 2:SquareLossLayer{} | |
Parameters overview: (skipping 2 layers with 0 params) | |
layer 1: params=36 100.0% | |
TOTAL : params=36 | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 1ms | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 0ms | |
idx=8 predicted losschange=0.00039831 actual=0.000397682 | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 1ms | |
idx=13 predicted losschange=-0.000426502 actual=-0.000426292 | |
forward try kernel 4 | |
cl/forward4.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 4 1ms | |
idx=0 predicted losschange=0.000143287 actual=0.000144005 | |
forward try kernel 5 | |
ForwardAuto: kernel 5: this instance cant be used: For ForwardFc, padzeros must be disabled | |
... not valid | |
forward try kernel 6 | |
cl/forward_byinputplane.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 6 1ms | |
idx=22 predicted losschange=-1.7916e-06 actual=0 | |
forward try kernel 7 | |
... seems valid | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
ForwardAuto: kernel 7 123ms | |
idx=22 predicted losschange=-1.7916e-06 actual=0 | |
forward kernel 0: cannot be used | |
forward kernel 1 time: 0ms | |
forward kernel 2 time: 0ms | |
forward kernel 3 time: 1ms | |
forward kernel 4 time: 1ms | |
forward kernel 5: cannot be used | |
forward kernel 6 time: 1ms | |
forward kernel 7 time: 123ms | |
forward layer selected kernel 1 | |
idx=35 predicted losschange=-2.82565e-05 actual=-2.76566e-05 | |
idx=26 predicted losschange=3.62191e-05 actual=3.71933e-05 | |
idx=27 predicted losschange=-0.000319862 actual=-0.000317574 | |
idx=27 predicted losschange=-0.000319862 actual=-0.000317574 | |
idx=10 predicted losschange=-0.000883857 actual=-0.000883102 | |
clblas teardown | |
[ OK ] testupdateweights.conv1z (770 ms) | |
[ RUN ] testupdateweights.numericallytest | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/copy.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/activate.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/applyActivationDeriv.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 2ms | |
layer 0:InputLayer{ outputPlanes=1 outputSize=1 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=1 numFilters=1 filterSize=1 outputSize=1 padZeros=0 biased=0 skip=0} } | |
layer 2:ActivationLayer{ TANH } | |
layer 3:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=1 100.0% | |
TOTAL : params=1 | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 1ms | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 1ms | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 0ms | |
layer 0:InputLayer{ outputPlanes=1 outputSize=1 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=1 numFilters=1 filterSize=1 outputSize=1 padZeros=0 biased=0 skip=0} } | |
layer 2:ActivationLayer{ TANH } | |
layer 3:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=1 100.0% | |
TOTAL : params=1 | |
loss 0.0367983 loss2 0.0367913 change: 7.01472e-06 | |
sumweightsdiff -0.000264842 | |
loss change 7.01472e-06 | |
estimatedLossChangeFromW 7.01413e-06 | |
[ OK ] testupdateweights.numericallytest (547 ms) | |
[ RUN ] testupdateweights.numericallytest_imagesize3 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/copy.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/activate.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/applyActivationDeriv.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 0ms | |
layer 0:InputLayer{ outputPlanes=1 outputSize=3 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=3 numFilters=1 filterSize=1 outputSize=3 padZeros=0 biased=0 skip=0} } | |
layer 2:ActivationLayer{ TANH } | |
layer 3:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=1 100.0% | |
TOTAL : params=1 | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 0ms | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 1ms | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 1ms | |
layer 0:InputLayer{ outputPlanes=1 outputSize=3 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=3 numFilters=1 filterSize=1 outputSize=3 padZeros=0 biased=0 skip=0} } | |
layer 2:ActivationLayer{ TANH } | |
layer 3:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=1 100.0% | |
TOTAL : params=1 | |
loss 1.23358 loss2 1.21612 change: 0.0174605 | |
sumweightsdiff -0.0132709 | |
loss change 0.0174605 | |
estimatedLossChangeFromW 0.0176118 | |
[ OK ] testupdateweights.numericallytest_imagesize3 (570 ms) | |
[ RUN ] testupdateweights.numericallytest_imagesize5 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/copy.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/activate.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/applyActivationDeriv.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 1ms | |
layer 0:InputLayer{ outputPlanes=1 outputSize=5 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=5 numFilters=1 filterSize=1 outputSize=5 padZeros=0 biased=0 skip=0} } | |
layer 2:ActivationLayer{ TANH } | |
layer 3:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=1 100.0% | |
TOTAL : params=1 | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 1ms | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 1ms | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 0ms | |
layer 0:InputLayer{ outputPlanes=1 outputSize=5 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=5 numFilters=1 filterSize=1 outputSize=5 padZeros=0 biased=0 skip=0} } | |
layer 2:ActivationLayer{ TANH } | |
layer 3:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=1 100.0% | |
TOTAL : params=1 | |
loss 4.12958 loss2 4.11952 change: 0.0100665 | |
sumweightsdiff -0.0101708 | |
loss change 0.0100665 | |
estimatedLossChangeFromW 0.0103444 | |
[ OK ] testupdateweights.numericallytest_imagesize5 (592 ms) | |
[ RUN ] testupdateweights.numericallytest_imagesize9 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/copy.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/activate.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/applyActivationDeriv.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 0ms | |
layer 0:InputLayer{ outputPlanes=1 outputSize=9 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=9 numFilters=1 filterSize=1 outputSize=9 padZeros=0 biased=0 skip=0} } | |
layer 2:ActivationLayer{ TANH } | |
layer 3:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=1 100.0% | |
TOTAL : params=1 | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 1ms | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 1ms | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 1ms | |
layer 0:InputLayer{ outputPlanes=1 outputSize=9 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=9 numFilters=1 filterSize=1 outputSize=9 padZeros=0 biased=0 skip=0} } | |
layer 2:ActivationLayer{ TANH } | |
layer 3:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=1 100.0% | |
TOTAL : params=1 | |
loss 13.4341 loss2 13.4339 change: 0.000207901 | |
sumweightsdiff 0.00153953 | |
loss change 0.000207901 | |
estimatedLossChangeFromW 0.000237015 | |
[ OK ] testupdateweights.numericallytest_imagesize9 (578 ms) | |
[ RUN ] testupdateweights.numericallytest_imagesize9_filtersize9 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/copy.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/activate.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/applyActivationDeriv.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 1ms | |
layer 0:InputLayer{ outputPlanes=1 outputSize=9 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=9 numFilters=1 filterSize=9 outputSize=1 padZeros=0 biased=0 skip=0} } | |
layer 2:ActivationLayer{ TANH } | |
layer 3:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=81 100.0% | |
TOTAL : params=81 | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 1ms | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 1ms | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 1ms | |
layer 0:InputLayer{ outputPlanes=1 outputSize=9 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=9 numFilters=1 filterSize=9 outputSize=1 padZeros=0 biased=0 skip=0} } | |
layer 2:ActivationLayer{ TANH } | |
layer 3:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=81 100.0% | |
TOTAL : params=81 | |
loss 0.135896 loss2 0.0848782 change: 0.0510182 | |
sumweightsdiff -0.0322406 | |
loss change 0.0510182 | |
estimatedLossChangeFromW 0.0555841 | |
[ OK ] testupdateweights.numericallytest_imagesize9_filtersize9 (590 ms) | |
[ RUN ] testupdateweights.numericallytest_imagesize9_filtersize3 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/copy.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/activate.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/applyActivationDeriv.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 1ms | |
layer 0:InputLayer{ outputPlanes=1 outputSize=9 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=9 numFilters=1 filterSize=3 outputSize=7 padZeros=0 biased=0 skip=0} } | |
layer 2:ActivationLayer{ TANH } | |
layer 3:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=9 100.0% | |
TOTAL : params=9 | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 0ms | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 1ms | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 0ms | |
layer 0:InputLayer{ outputPlanes=1 outputSize=9 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=9 numFilters=1 filterSize=3 outputSize=7 padZeros=0 biased=0 skip=0} } | |
layer 2:ActivationLayer{ TANH } | |
layer 3:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=9 100.0% | |
TOTAL : params=9 | |
loss 7.70633 loss2 7.41581 change: 0.290529 | |
sumweightsdiff -0.0898813 | |
loss change 0.290529 | |
estimatedLossChangeFromW 0.316231 | |
[ OK ] testupdateweights.numericallytest_imagesize9_filtersize3 (575 ms) | |
[ RUN ] testupdateweights.numericallytest_imagesize3_filtersize3 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/copy.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/activate.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/applyActivationDeriv.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 1ms | |
layer 0:InputLayer{ outputPlanes=1 outputSize=3 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=3 numFilters=1 filterSize=3 outputSize=1 padZeros=0 biased=0 skip=0} } | |
layer 2:ActivationLayer{ TANH } | |
layer 3:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=9 100.0% | |
TOTAL : params=9 | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 1ms | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 1ms | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 1ms | |
layer 0:InputLayer{ outputPlanes=1 outputSize=3 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=3 numFilters=1 filterSize=3 outputSize=1 padZeros=0 biased=0 skip=0} } | |
layer 2:ActivationLayer{ TANH } | |
layer 3:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=9 100.0% | |
TOTAL : params=9 | |
loss 0.0719101 loss2 0.0694461 change: 0.00246408 | |
sumweightsdiff -0.0110647 | |
loss change 0.00246408 | |
estimatedLossChangeFromW 0.00248372 | |
[ OK ] testupdateweights.numericallytest_imagesize3_filtersize3 (541 ms) | |
[ RUN ] testupdateweights.numericallytest_imagesize5_filtersize3 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/copy.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/activate.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/applyActivationDeriv.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 1ms | |
layer 0:InputLayer{ outputPlanes=1 outputSize=5 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} } | |
layer 2:ActivationLayer{ TANH } | |
layer 3:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=9 100.0% | |
TOTAL : params=9 | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 0ms | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 1ms | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 1ms | |
layer 0:InputLayer{ outputPlanes=1 outputSize=5 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} } | |
layer 2:ActivationLayer{ TANH } | |
layer 3:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=9 100.0% | |
TOTAL : params=9 | |
loss 1.20022 loss2 1.17241 change: 0.0278131 | |
sumweightsdiff -0.0203888 | |
loss change 0.0278131 | |
estimatedLossChangeFromW 0.0280929 | |
[ OK ] testupdateweights.numericallytest_imagesize5_filtersize3 (575 ms) | |
[ RUN ] testupdateweights.numericallytest_imagesize5_filtersize3_batchsize3 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/copy.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/activate.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/applyActivationDeriv.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 1ms | |
layer 0:InputLayer{ outputPlanes=1 outputSize=5 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} } | |
layer 2:ActivationLayer{ TANH } | |
layer 3:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=9 100.0% | |
TOTAL : params=9 | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 1ms | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 1ms | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 1ms | |
layer 0:InputLayer{ outputPlanes=1 outputSize=5 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} } | |
layer 2:ActivationLayer{ TANH } | |
layer 3:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=9 100.0% | |
TOTAL : params=9 | |
loss 4.97142 loss2 4.78768 change: 0.183745 | |
sumweightsdiff -0.056004 | |
loss change 0.183745 | |
estimatedLossChangeFromW 0.193264 | |
[ OK ] testupdateweights.numericallytest_imagesize5_filtersize3_batchsize3 (579 ms) | |
[ RUN ] testupdateweights.numericallytest_imagesize5_filtersize3_planes3 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/copy.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/activate.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/applyActivationDeriv.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 1ms | |
layer 0:InputLayer{ outputPlanes=3 outputSize=5 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=3 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} } | |
layer 2:ActivationLayer{ TANH } | |
layer 3:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=27 100.0% | |
TOTAL : params=27 | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 1ms | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 1ms | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 1ms | |
layer 0:InputLayer{ outputPlanes=3 outputSize=5 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=3 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} } | |
layer 2:ActivationLayer{ TANH } | |
layer 3:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=27 100.0% | |
TOTAL : params=27 | |
loss 1.08887 loss2 0.9575 change: 0.13137 | |
sumweightsdiff -0.00764532 | |
loss change 0.13137 | |
estimatedLossChangeFromW 0.134379 | |
[ OK ] testupdateweights.numericallytest_imagesize5_filtersize3_planes3 (636 ms) | |
[ RUN ] testupdateweights.numericallytest_imagesize5_filtersize3_planes3_batchsize3 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/copy.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/activate.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/applyActivationDeriv.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 1ms | |
layer 0:InputLayer{ outputPlanes=3 outputSize=5 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=3 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} } | |
layer 2:ActivationLayer{ TANH } | |
layer 3:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=27 100.0% | |
TOTAL : params=27 | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 0ms | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 0ms | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 1ms | |
layer 0:InputLayer{ outputPlanes=3 outputSize=5 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=3 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} } | |
layer 2:ActivationLayer{ TANH } | |
layer 3:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=27 100.0% | |
TOTAL : params=27 | |
loss 4.76631 loss2 4.18154 change: 0.584769 | |
sumweightsdiff 0.029606 | |
loss change 0.584769 | |
estimatedLossChangeFromW 0.620442 | |
[ OK ] testupdateweights.numericallytest_imagesize5_filtersize3_planes3_batchsize3 (630 ms) | |
[ RUN ] testupdateweights.backprop_weights_2 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=1 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1 | |
cl/BackpropWeightsScratchLarge.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
mismatch for i 0 | |
[ OK ] testupdateweights.backprop_weights_2 (56 ms) | |
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize2 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=2 -D gInputSizeSquared=4 -D gNumFilters=1 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=2 -D gOutputSizeSquared=4 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=2 -DgInputStripeOuterNumRows=2 -DgInputStripeInnerSize=4 -DgInputStripeOuterSize=4 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=2 -DgOutputStripeSize=4 | |
cl/BackpropWeightsScratchLarge.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
mismatch for i 0 | |
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize2 (59 ms) | |
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize3_filtersize3 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=3 -D gInputSizeSquared=9 -D gNumFilters=1 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=3 -DgInputStripeOuterNumRows=7 -DgInputStripeInnerSize=9 -DgInputStripeOuterSize=21 -DgInputStripeMarginSize=6 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1 | |
cl/BackpropWeightsScratchLarge.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
mismatch for i 0 | |
mismatch for i 1 | |
mismatch for i 2 | |
mismatch for i 3 | |
mismatch for i 4 | |
mismatch for i 5 | |
mismatch for i 6 | |
mismatch for i 7 | |
mismatch for i 8 | |
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize3_filtersize3 (55 ms) | |
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize4_filtersize3 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=4 -D gInputSizeSquared=16 -D gNumFilters=1 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=2 -D gOutputSizeSquared=4 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=4 -DgInputStripeOuterNumRows=8 -DgInputStripeInnerSize=16 -DgInputStripeOuterSize=32 -DgInputStripeMarginSize=8 -DgOutputStripeNumRows=2 -DgOutputStripeSize=4 | |
cl/BackpropWeightsScratchLarge.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
mismatch for i 0 | |
mismatch for i 8 | |
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize4_filtersize3 (64 ms) | |
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize5_filtersize3 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=5 -D gInputSizeSquared=25 -D gNumFilters=1 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=3 -D gOutputSizeSquared=9 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=5 -DgInputStripeOuterNumRows=9 -DgInputStripeInnerSize=25 -DgInputStripeOuterSize=45 -DgInputStripeMarginSize=10 -DgOutputStripeNumRows=3 -DgOutputStripeSize=9 | |
cl/BackpropWeightsScratchLarge.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
mismatch for i 0 | |
mismatch for i 4 | |
mismatch for i 8 | |
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize5_filtersize3 (78 ms) | |
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize3_filtersize1 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=3 -D gInputSizeSquared=9 -D gNumFilters=1 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=3 -D gOutputSizeSquared=9 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=3 -DgInputStripeOuterNumRows=3 -DgInputStripeInnerSize=9 -DgInputStripeOuterSize=9 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=3 -DgOutputStripeSize=9 | |
cl/BackpropWeightsScratchLarge.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
mismatch for i 0 | |
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize3_filtersize1 (65 ms) | |
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize16_filtersize1 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=16 -D gInputSizeSquared=256 -D gNumFilters=1 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=16 -D gOutputSizeSquared=256 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=16 -DgInputStripeOuterNumRows=16 -DgInputStripeInnerSize=256 -DgInputStripeOuterSize=256 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=16 -DgOutputStripeSize=256 | |
cl/BackpropWeightsScratchLarge.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
mismatch for i 0 | |
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize16_filtersize1 (111 ms) | |
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize17_filtersize1 | |
LayerDimensions{ inputPlanes=1 inputSize=17 numFilters=1 filterSize=1 outputSize=17 padZeros=0 biased=0 skip=0} | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=17 -D gInputSizeSquared=289 -D gNumFilters=1 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=17 -D gOutputSizeSquared=289 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=17 -DgInputStripeOuterNumRows=17 -DgInputStripeInnerSize=289 -DgInputStripeOuterSize=289 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=17 -DgOutputStripeSize=289 | |
cl/BackpropWeightsScratchLarge.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
mismatch for i 0 | |
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize17_filtersize1 (98 ms) | |
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize17_filtersize1_moredata | |
expectedresult: -958.715 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=17 -D gInputSizeSquared=289 -D gNumFilters=1 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=17 -D gOutputSizeSquared=289 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=17 -DgInputStripeOuterNumRows=17 -DgInputStripeInnerSize=289 -DgInputStripeOuterSize=289 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=17 -DgOutputStripeSize=289 | |
cl/BackpropWeightsScratchLarge.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
mismatch for i 0 | |
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize17_filtersize1_moredata (90 ms) | |
[ RUN ] testupdateweights.backprop_instance3_smaller2 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
numweights: 36 | |
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=96 -D gInputSizeSquared=9216 -D gNumFilters=1 -D gFilterSize=6 -D gHalfFilterSize=3 -D gFilterSizeSquared=36 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=91 -D gOutputSizeSquared=8281 -D gPadZeros=0 -D gMargin=0 -D gEven=1 -D gSkip=0 -DgNumStripes=8 -DgInputStripeMarginRows=5 -DgInputStripeInnerNumRows=12 -DgInputStripeOuterNumRows=22 -DgInputStripeInnerSize=1152 -DgInputStripeOuterSize=2112 -DgInputStripeMarginSize=480 -DgOutputStripeNumRows=12 -DgOutputStripeSize=1092 | |
cl/BackpropWeightsScratchLarge.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
138 0 0 0 0 0 | |
132 0 0 0 0 0 | |
138 0 0 0 0 0 | |
138 0 0 0 0 0 | |
138 0 0 0 0 0 | |
132 0 0 0 0 0 | |
138 0 0 0 0 0 | |
132 0 0 0 0 0 | |
138 0 0 0 0 0 | |
138 0 0 0 0 0 | |
138 0 0 0 0 0 | |
132 0 0 0 0 0 | |
...... | |
...... | |
...... | |
...... | |
...... | |
...... | |
0=0 0 0 0 0 0 0 0 | |
1=0 0 0 0 0 0 0 0 | |
2=0 0 0 0 0 0 0 0 | |
3=0 0 0 0 0 0 0 0 | |
4=0 0 0 0 0 0 0 0 | |
5=0 0 0 0 0 0 0 0 | |
6=0 0 0 0 0 0 0 0 | |
7=0 0 0 0 0 0 0 0 | |
8=0 0 0 0 0 0 0 0 | |
9=0 0 0 0 0 0 0 0 | |
10=0 0 0 0 0 0 0 0 | |
11=0 0 0 0 0 0 0 0 | |
0=0 0 0 0 0 0 0 0 | |
1=0 0 0 0 0 0 0 0 | |
2=0 0 0 0 0 0 0 0 | |
3=0 0 0 0 0 0 0 0 | |
4=0 0 0 0 0 0 0 0 | |
5=0 0 0 0 0 0 0 0 | |
6=0 0 0 0 0 0 0 0 | |
7=0 0 0 0 0 0 0 0 | |
8=0 0 0 0 0 0 0 0 | |
9=0 0 0 0 0 0 0 0 | |
10=0 0 0 0 0 0 0 0 | |
11=0 0 0 0 0 0 0 0 | |
12=0 0 0 0 0 0 0 0 | |
13=0 0 0 0 0 0 0 0 | |
14=0 0 0 0 0 0 0 0 | |
15=0 0 0 0 0 0 0 0 | |
16=0 0 0 0 0 0 0 0 | |
17=0 0 0 0 0 0 0 0 | |
18=0 0 0 0 0 0 0 0 | |
19=0 0 0 0 0 0 0 0 | |
[ OK ] testupdateweights.backprop_instance3_smaller2 (169 ms) | |
[----------] 23 tests from testupdateweights (8915 ms total) | |
[----------] 17 tests from testforward | |
[ RUN ] testforward.imagesize2_nopadzeros | |
expected number of output: 4 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
[ OK ] testforward.imagesize2_nopadzeros (255 ms) | |
[ RUN ] testforward.imagesize2_padzeros | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
checking result[0]=0 expecting: 0 | |
checking result[1]=0 expecting: 0 | |
checking result[2]=0 expecting: 0 | |
checking result[3]=0.2 expecting: 0.2 | |
checking result[4]=-0.13 expecting: -0.13 | |
checking result[5]=-0.15 expecting: -0.15 | |
checking result[6]=0 expecting: 0 | |
checking result[7]=0 expecting: 0 | |
checking result[8]=0 expecting: 0 | |
checking result[9]=0 expecting: 0 | |
checking result[10]=0 expecting: 0 | |
checking result[11]=0 expecting: 0 | |
checking result[12]=-0.55 expecting: -0.55 | |
checking result[13]=0.02 expecting: 0.02 | |
checking result[14]=0.21 expecting: 0.21 | |
checking result[27]=-14.3 expecting: -14.3 | |
checking result[28]=-9.6 expecting: -9.6 | |
checking result[29]=11.9 expecting: 11.9 | |
checking result[35]=0.46 expecting: 0.46 | |
[ OK ] testforward.imagesize2_padzeros (97 ms) | |
[ RUN ] testforward.imagesize3 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
test1 ok | |
[ OK ] testforward.imagesize3 (102 ms) | |
[ RUN ] testforward.test2 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
[ OK ] testforward.test2 (87 ms) | |
[ RUN ] testforward.test3 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
[ OK ] testforward.test3 (89 ms) | |
[ RUN ] testforward.compare_0_1_biased_nopad | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0} | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
batch 0 batchsize 4 | |
dump enabled=0 | |
batch 0 batchsize 4 | |
dump enabled=0 | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0} | |
clblas teardown | |
[ OK ] testforward.compare_0_1_biased_nopad (171 ms) | |
[ RUN ] testforward.compare_0_1_biased_pad | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0} | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
batch 0 batchsize 4 | |
dump enabled=0 | |
batch 0 batchsize 4 | |
dump enabled=0 | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0} | |
clblas teardown | |
[ OK ] testforward.compare_0_1_biased_pad (182 ms) | |
[ RUN ] testforward.compare_1_n_biased_nopad | |
instance: 2 | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0} | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
batch 0 batchsize 4 | |
dump enabled=0 | |
batch 0 batchsize 4 | |
dump enabled=0 | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0} | |
clblas teardown | |
instance: 3 | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0} | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
batch 0 batchsize 4 | |
dump enabled=0 | |
batch 0 batchsize 4 | |
dump enabled=0 | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0} | |
clblas teardown | |
instance: 4 | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0} | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward4.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
batch 0 batchsize 4 | |
dump enabled=0 | |
batch 0 batchsize 4 | |
dump enabled=0 | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0} | |
clblas teardown | |
instance: 6 | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0} | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward_byinputplane.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
batch 0 batchsize 4 | |
dump enabled=0 | |
batch 0 batchsize 4 | |
dump enabled=0 | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0} | |
clblas teardown | |
instance: 7 | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0} | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
batch 0 batchsize 4 | |
dump enabled=0 | |
batch 0 batchsize 4 | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
dump enabled=0 | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0} | |
clblas teardown | |
[ OK ] testforward.compare_1_n_biased_nopad (1420 ms) | |
[ RUN ] testforward.compare_1_n_biased_pad | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
instance: 2 | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0} | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
batch 0 batchsize 4 | |
dump enabled=0 | |
batch 0 batchsize 4 | |
dump enabled=0 | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0} | |
clblas teardown | |
instance: 3 | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0} | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
batch 0 batchsize 4 | |
dump enabled=0 | |
batch 0 batchsize 4 | |
dump enabled=0 | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0} | |
clblas teardown | |
instance: 4 | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0} | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward4.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
batch 0 batchsize 4 | |
dump enabled=0 | |
batch 0 batchsize 4 | |
dump enabled=0 | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0} | |
clblas teardown | |
instance: 6 | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0} | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward_byinputplane.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
batch 0 batchsize 4 | |
dump enabled=0 | |
batch 0 batchsize 4 | |
dump enabled=0 | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0} | |
clblas teardown | |
instance: 7 | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0} | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
batch 0 batchsize 4 | |
dump enabled=0 | |
batch 0 batchsize 4 | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
dump enabled=0 | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0} | |
clblas teardown | |
[ OK ] testforward.compare_1_n_biased_pad (1433 ms) | |
[ RUN ] testforward.compare_1_5_biased_nopad | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=19 outputSize=1 padZeros=0 biased=1 skip=0} | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward_fc_wgperrow.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
batch 0 batchsize 4 | |
dump enabled=0 | |
batch 0 batchsize 4 | |
dump enabled=0 | |
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=19 outputSize=1 padZeros=0 biased=1 skip=0} | |
clblas teardown | |
[ OK ] testforward.compare_1_5_biased_nopad (205 ms) | |
[ RUN ] testforward.compare_1_4_fcscenario | |
LayerDimensions{ inputPlanes=10 inputSize=24 numFilters=10 filterSize=24 outputSize=1 padZeros=0 biased=1 skip=0} | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward4.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
batch 0 batchsize 4 | |
dump enabled=0 | |
batch 0 batchsize 4 | |
dump enabled=0 | |
LayerDimensions{ inputPlanes=10 inputSize=24 numFilters=10 filterSize=24 outputSize=1 padZeros=0 biased=1 skip=0} | |
clblas teardown | |
[ OK ] testforward.compare_1_4_fcscenario (247 ms) | |
[ RUN ] testforward.compare_break1_0_1 | |
LayerDimensions{ inputPlanes=1 inputSize=33 numFilters=1 filterSize=1 outputSize=33 padZeros=0 biased=0 skip=0} | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
batch 0 batchsize 1 | |
dump enabled=0 | |
batch 0 batchsize 1 | |
dump enabled=0 | |
LayerDimensions{ inputPlanes=1 inputSize=33 numFilters=1 filterSize=1 outputSize=33 padZeros=0 biased=0 skip=0} | |
clblas teardown | |
[ OK ] testforward.compare_break1_0_1 (85 ms) | |
[ RUN ] testforward.compare_break1_0_4 | |
LayerDimensions{ inputPlanes=1 inputSize=33 numFilters=1 filterSize=1 outputSize=33 padZeros=0 biased=0 skip=0} | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward4.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
batch 0 batchsize 1 | |
dump enabled=0 | |
batch 0 batchsize 1 | |
dump enabled=0 | |
LayerDimensions{ inputPlanes=1 inputSize=33 numFilters=1 filterSize=1 outputSize=33 padZeros=0 biased=0 skip=0} | |
clblas teardown | |
[ OK ] testforward.compare_break1_0_4 (97 ms) | |
[ RUN ] testforward.comparespecific_break2 | |
LayerDimensions{ inputPlanes=64 inputSize=19 numFilters=64 filterSize=19 outputSize=1 padZeros=0 biased=0 skip=0} | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward_fc_wgperrow.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
batch 0 batchsize 4 | |
dump enabled=0 | |
batch 0 batchsize 4 | |
dump enabled=0 | |
LayerDimensions{ inputPlanes=64 inputSize=19 numFilters=64 filterSize=19 outputSize=1 padZeros=0 biased=0 skip=0} | |
clblas teardown | |
[ OK ] testforward.comparespecific_break2 (281 ms) | |
[ RUN ] testforward.softmax | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
output[0]=0.0320586 | |
output[1]=0.0871443 | |
output[2]=0.643914 | |
output[3]=0.236883 | |
loss 0.44019 | |
loss 3.44019 | |
loss 2.44019 | |
loss 1.44019 | |
[ OK ] testforward.softmax (5 ms) | |
[ RUN ] testforward.softmax_byplane | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
output[0]=0.0320586 | |
output[1]=0.0871443 | |
output[2]=0.643914 | |
output[3]=0.236883 | |
loss 0.44019 | |
loss 3.44019 | |
loss 2.44019 | |
loss 1.44019 | |
[ OK ] testforward.softmax_byplane (5 ms) | |
[ RUN ] testforward.crash_from_jm | |
-D gNumInputPlanes=32 -D gInputPlanes=32 -D gInputSize=28 -D gInputSizeSquared=784 -D gNumFilters=20 -D gFilterSize=28 -D gHalfFilterSize=14 -D gFilterSizeSquared=784 -D gNumOutputPlanes=20 -D gOutputPlanes=20 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=1 -D gSkip=0 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
dump enabled=0 | |
[ OK ] testforward.crash_from_jm (246 ms) | |
[----------] 17 tests from testforward (5010 ms total) | |
[----------] 2 tests from testfilehelper | |
[ RUN ] testfilehelper.testfilehelper | |
[ OK ] testfilehelper.testfilehelper (18 ms) | |
[ RUN ] testfilehelper.testreadchunk | |
[ OK ] testfilehelper.testreadchunk (6 ms) | |
[----------] 2 tests from testfilehelper (24 ms total) | |
[----------] 12 tests from testsimpleconvolvenet | |
[ RUN ] testsimpleconvolvenet.imagesize1_planes2_filters2_unbiased_tanh | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/copy.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/activate.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/applyActivationDeriv.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 0ms | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 1ms | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
loss, E, 0.141046 | |
accuracy: 2/2 100% | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 1ms | |
calcGradWeights try kernel 2 | |
cl/BackpropWeightsScratch.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 2 0ms | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 0ms | |
calcGradWeights try kernel 3 | |
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1 | |
cl/BackpropWeightsScratchLarge.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 3 1ms | |
forward try kernel 4 | |
cl/forward4.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 4 1ms | |
calcGradWeights try kernel 4 | |
... seems valid | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
BackpropWeightsAuto: kernel 4 133ms | |
forward try kernel 5 | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward_fc_wgperrow.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 5 1ms | |
calcGradWeights kernel 0: cannot be used | |
calcGradWeights kernel 1 time: 1ms | |
calcGradWeights kernel 2 time: 0ms | |
calcGradWeights kernel 3 time: 1ms | |
calcGradWeights kernel 4 time: 133ms | |
calcGradWeights layer selected kernel 2 | |
forward try kernel 6 | |
cl/forward_byinputplane.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 6 1ms | |
forward try kernel 7 | |
... seems valid | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
ForwardAuto: kernel 7 106ms | |
forward kernel 0: cannot be used | |
forward kernel 1 time: 0ms | |
forward kernel 2 time: 1ms | |
forward kernel 3 time: 0ms | |
forward kernel 4 time: 1ms | |
forward kernel 5 time: 1ms | |
forward kernel 6 time: 1ms | |
forward kernel 7 time: 106ms | |
forward layer selected kernel 1 | |
loss, E, 0.0733092 | |
accuracy: 2/2 100% | |
loss, E, 0.0426809 | |
accuracy: 2/2 100% | |
loss, E, 0.0262453 | |
accuracy: 2/2 100% | |
loss, E, 0.0164245 | |
accuracy: 2/2 100% | |
loss, E, 0.0107573 | |
accuracy: 2/2 100% | |
accuracy: 2/2 | |
clblas teardown | |
[ OK ] testsimpleconvolvenet.imagesize1_planes2_filters2_unbiased_tanh (1229 ms) | |
[ RUN ] testsimpleconvolvenet.imagesize1_planes2_filters2_tanh | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/copy.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/activate.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/applyActivationDeriv.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 1ms | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 0ms | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
loss, E, 0.964924 | |
accuracy: 1/2 50% | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 0ms | |
calcGradWeights try kernel 2 | |
cl/BackpropWeightsScratch.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 2 0ms | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 1ms | |
calcGradWeights try kernel 3 | |
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1 | |
cl/BackpropWeightsScratchLarge.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 3 1ms | |
forward try kernel 4 | |
cl/forward4.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 4 1ms | |
calcGradWeights try kernel 4 | |
... seems valid | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
BackpropWeightsAuto: kernel 4 226ms | |
forward try kernel 5 | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward_fc_wgperrow.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 5 1ms | |
calcGradWeights kernel 0: cannot be used | |
calcGradWeights kernel 1 time: 0ms | |
calcGradWeights kernel 2 time: 0ms | |
calcGradWeights kernel 3 time: 1ms | |
calcGradWeights kernel 4 time: 226ms | |
calcGradWeights layer selected kernel 1 | |
forward try kernel 6 | |
cl/forward_byinputplane.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 6 1ms | |
forward try kernel 7 | |
... seems valid | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
ForwardAuto: kernel 7 98ms | |
forward kernel 0: cannot be used | |
forward kernel 1 time: 1ms | |
forward kernel 2 time: 0ms | |
forward kernel 3 time: 1ms | |
forward kernel 4 time: 1ms | |
forward kernel 5 time: 1ms | |
forward kernel 6 time: 1ms | |
forward kernel 7 time: 98ms | |
forward layer selected kernel 2 | |
loss, E, 0.0057046 | |
accuracy: 2/2 100% | |
loss, E, 1.34826e-05 | |
accuracy: 2/2 100% | |
loss, E, 3.61967e-08 | |
accuracy: 2/2 100% | |
accuracy: 2/2 | |
clblas teardown | |
[ OK ] testsimpleconvolvenet.imagesize1_planes2_filters2_tanh (1337 ms) | |
[ RUN ] testsimpleconvolvenet.imagesize3_n4_filtersize3_tanh | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/copy.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/activate.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/applyActivationDeriv.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 1ms | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 0ms | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
loss, E, 1.13283 | |
accuracy: 3/4 75% | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 0ms | |
calcGradWeights try kernel 2 | |
cl/BackpropWeightsScratch.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 2 1ms | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 0ms | |
calcGradWeights try kernel 3 | |
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=3 -D gInputSizeSquared=9 -D gNumFilters=2 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=3 -DgInputStripeOuterNumRows=7 -DgInputStripeInnerSize=9 -DgInputStripeOuterSize=21 -DgInputStripeMarginSize=6 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1 | |
cl/BackpropWeightsScratchLarge.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 3 0ms | |
forward try kernel 4 | |
cl/forward4.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 4 1ms | |
calcGradWeights try kernel 4 | |
... seems valid | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
BackpropWeightsAuto: kernel 4 231ms | |
forward try kernel 5 | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward_fc_wgperrow.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 5 1ms | |
calcGradWeights kernel 0: cannot be used | |
calcGradWeights kernel 1 time: 0ms | |
calcGradWeights kernel 2 time: 1ms | |
calcGradWeights kernel 3 time: 0ms | |
calcGradWeights kernel 4 time: 231ms | |
calcGradWeights layer selected kernel 1 | |
forward try kernel 6 | |
cl/forward_byinputplane.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 6 0ms | |
loss, E, 0.00996344 | |
accuracy: 4/4 100% | |
forward try kernel 7 | |
... seems valid | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
ForwardAuto: kernel 7 101ms | |
forward kernel 0: cannot be used | |
forward kernel 1 time: 1ms | |
forward kernel 2 time: 0ms | |
forward kernel 3 time: 0ms | |
forward kernel 4 time: 1ms | |
forward kernel 5 time: 1ms | |
forward kernel 6 time: 0ms | |
forward kernel 7 time: 101ms | |
forward layer selected kernel 2 | |
loss, E, 4.70675e-05 | |
accuracy: 4/4 100% | |
loss, E, 4.09645e-07 | |
accuracy: 4/4 100% | |
accuracy: 4/4 | |
clblas teardown | |
[ OK ] testsimpleconvolvenet.imagesize3_n4_filtersize3_tanh (1327 ms) | |
[ RUN ] testsimpleconvolvenet.imagesize1_2planes_filtersize1 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/copy.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 0ms | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 1ms | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
loss, E, 0.751601 | |
accuracy: 2/2 100% | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 0ms | |
calcGradWeights try kernel 2 | |
cl/BackpropWeightsScratch.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 2 0ms | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 1ms | |
calcGradWeights try kernel 3 | |
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1 | |
cl/BackpropWeightsScratchLarge.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 3 0ms | |
forward try kernel 4 | |
cl/forward4.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 4 0ms | |
calcGradWeights try kernel 4 | |
... seems valid | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
BackpropWeightsAuto: kernel 4 226ms | |
forward try kernel 5 | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward_fc_wgperrow.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 5 2ms | |
calcGradWeights kernel 0: cannot be used | |
calcGradWeights kernel 1 time: 1ms | |
calcGradWeights kernel 2 time: 0ms | |
calcGradWeights kernel 3 time: 0ms | |
calcGradWeights kernel 4 time: 226ms | |
calcGradWeights layer selected kernel 2 | |
forward try kernel 6 | |
cl/forward_byinputplane.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 6 1ms | |
loss, E, 0.195916 | |
accuracy: 2/2 100% | |
forward try kernel 7 | |
... seems valid | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
ForwardAuto: kernel 7 98ms | |
forward kernel 0: cannot be used | |
forward kernel 1 time: 0ms | |
forward kernel 2 time: 0ms | |
forward kernel 3 time: 1ms | |
forward kernel 4 time: 0ms | |
forward kernel 5 time: 2ms | |
forward kernel 6 time: 1ms | |
forward kernel 7 time: 98ms | |
forward layer selected kernel 1 | |
loss, E, 0.0679117 | |
accuracy: 2/2 100% | |
loss, E, 0.023677 | |
accuracy: 2/2 100% | |
loss, E, 0.00825563 | |
accuracy: 2/2 100% | |
loss, E, 0.00287856 | |
accuracy: 2/2 100% | |
loss, E, 0.00100369 | |
accuracy: 2/2 100% | |
loss, E, 0.000349964 | |
accuracy: 2/2 100% | |
accuracy: 2/2 100% | |
accuracy: 2/2 | |
loss, E, 0.000150648 | |
clblas teardown | |
[ OK ] testsimpleconvolvenet.imagesize1_2planes_filtersize1 (1216 ms) | |
[ RUN ] testsimpleconvolvenet.imagesize3_n4_filtersize3_relu | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/copy.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/activate.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/applyActivationDeriv.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 0ms | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 1ms | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
loss, E, 1.48951 | |
accuracy: 2/4 50% | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 0ms | |
calcGradWeights try kernel 2 | |
cl/BackpropWeightsScratch.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 2 0ms | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 0ms | |
calcGradWeights try kernel 3 | |
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=3 -D gInputSizeSquared=9 -D gNumFilters=2 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=3 -DgInputStripeOuterNumRows=7 -DgInputStripeInnerSize=9 -DgInputStripeOuterSize=21 -DgInputStripeMarginSize=6 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1 | |
cl/BackpropWeightsScratchLarge.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 3 0ms | |
forward try kernel 4 | |
cl/forward4.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 4 0ms | |
calcGradWeights try kernel 4 | |
... seems valid | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
BackpropWeightsAuto: kernel 4 231ms | |
forward try kernel 5 | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward_fc_wgperrow.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 5 1ms | |
calcGradWeights kernel 0: cannot be used | |
calcGradWeights kernel 1 time: 1ms | |
calcGradWeights kernel 2 time: 0ms | |
calcGradWeights kernel 3 time: 0ms | |
calcGradWeights kernel 4 time: 231ms | |
calcGradWeights layer selected kernel 2 | |
forward try kernel 6 | |
cl/forward_byinputplane.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 6 1ms | |
loss, E, 1.12957 | |
accuracy: 2/4 50% | |
forward try kernel 7 | |
... seems valid | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
ForwardAuto: kernel 7 111ms | |
forward kernel 0: cannot be used | |
forward kernel 1 time: 0ms | |
forward kernel 2 time: 0ms | |
forward kernel 3 time: 0ms | |
forward kernel 4 time: 0ms | |
forward kernel 5 time: 1ms | |
forward kernel 6 time: 1ms | |
forward kernel 7 time: 111ms | |
forward layer selected kernel 1 | |
loss, E, 0.070782 | |
accuracy: 4/4 100% | |
loss, E, 0.003026 | |
accuracy: 4/4 100% | |
loss, E, 0.00021158 | |
accuracy: 4/4 100% | |
loss, E, 1.96858e-05 | |
accuracy: 4/4 100% | |
loss, E, 2.03002e-06 | |
accuracy: 4/4 100% | |
loss, E, 2.15572e-07 | |
accuracy: 4/4 100% | |
loss, E, 2.3083e-08 | |
accuracy: 4/4 100% | |
loss, E, 2.48239e-09 | |
accuracy: 4/4 100% | |
loss, E, 4.14442e-10 | |
accuracy: 4/4 100% | |
accuracy: 4/4 | |
loss, E, 4.14442e-10 | |
clblas teardown | |
[ OK ] testsimpleconvolvenet.imagesize3_n4_filtersize3_relu (1453 ms) | |
[ RUN ] testsimpleconvolvenet.imagesize3_n4_filtersize3_linear | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/copy.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 0ms | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 1ms | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
loss, E, 0.50604 | |
accuracy: 4/4 100% | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 1ms | |
calcGradWeights try kernel 2 | |
cl/BackpropWeightsScratch.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 2 1ms | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 1ms | |
calcGradWeights try kernel 3 | |
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=3 -D gInputSizeSquared=9 -D gNumFilters=2 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=3 -DgInputStripeOuterNumRows=7 -DgInputStripeInnerSize=9 -DgInputStripeOuterSize=21 -DgInputStripeMarginSize=6 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1 | |
cl/BackpropWeightsScratchLarge.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 3 0ms | |
forward try kernel 4 | |
cl/forward4.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 4 1ms | |
calcGradWeights try kernel 4 | |
... seems valid | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
BackpropWeightsAuto: kernel 4 231ms | |
forward try kernel 5 | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward_fc_wgperrow.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 5 1ms | |
calcGradWeights kernel 0: cannot be used | |
calcGradWeights kernel 1 time: 1ms | |
calcGradWeights kernel 2 time: 1ms | |
calcGradWeights kernel 3 time: 0ms | |
calcGradWeights kernel 4 time: 231ms | |
calcGradWeights layer selected kernel 3 | |
forward try kernel 6 | |
cl/forward_byinputplane.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 6 1ms | |
loss, E, 0.0565529 | |
accuracy: 4/4 100% | |
forward try kernel 7 | |
... seems valid | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
ForwardAuto: kernel 7 103ms | |
forward kernel 0: cannot be used | |
forward kernel 1 time: 0ms | |
forward kernel 2 time: 1ms | |
forward kernel 3 time: 1ms | |
forward kernel 4 time: 1ms | |
forward kernel 5 time: 1ms | |
forward kernel 6 time: 1ms | |
forward kernel 7 time: 103ms | |
forward layer selected kernel 1 | |
loss, E, 0.00777245 | |
accuracy: 4/4 100% | |
loss, E, 0.00106831 | |
accuracy: 4/4 100% | |
loss, E, 0.000218376 | |
accuracy: 4/4 100% | |
accuracy: 4/4 | |
loss, E, 0.000218376 | |
clblas teardown | |
[ OK ] testsimpleconvolvenet.imagesize3_n4_filtersize3_linear (1221 ms) | |
[ RUN ] testsimpleconvolvenet.imagesize1_n2_2layers_unbiased | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/copy.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/activate.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/applyActivationDeriv.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 0ms | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 0ms | |
backward try kernel 0 | |
... not plausibly optimal, skipping | |
backward try kernel 1 | |
cl/backward.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackwardAuto: kernel 1 1ms | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 1ms | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 2ms | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
epoch 0 loss, E, 0.0559531 | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 1ms | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 1ms | |
backward try kernel 2 | |
cl/backward_cached.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackwardAuto: kernel 2 0ms | |
calcGradWeights try kernel 2 | |
cl/BackpropWeightsScratch.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 2 0ms | |
calcGradWeights try kernel 2 | |
cl/BackpropWeightsScratch.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 2 1ms | |
epoch 1 loss, E, 0.0254554 | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 0ms | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 0ms | |
backward try kernel 3 | |
... seems valid | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
BackwardAuto: kernel 3 98ms | |
calcGradWeights try kernel 3 | |
options: -D BIASED -D gNumInputPlanes=2 -D gInputPlanes=2 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1 | |
cl/BackpropWeightsScratchLarge.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 3 1ms | |
calcGradWeights try kernel 3 | |
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1 | |
cl/BackpropWeightsScratchLarge.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 3 1ms | |
epoch 2 loss, E, 0.0172943 | |
forward try kernel 4 | |
cl/forward4.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 4 0ms | |
forward try kernel 4 | |
cl/forward4.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 4 0ms | |
backward kernel 0: cannot be used | |
backward kernel 1 time: 1ms | |
backward kernel 2 time: 0ms | |
backward kernel 3 time: 98ms | |
backward layer selected kernel 2 | |
calcGradWeights try kernel 4 | |
... seems valid | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
BackpropWeightsAuto: kernel 4 230ms | |
calcGradWeights try kernel 4 | |
... seems valid | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
BackpropWeightsAuto: kernel 4 58ms | |
epoch 3 loss, E, 0.0138013 | |
forward try kernel 5 | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward_fc_wgperrow.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 5 1ms | |
forward try kernel 5 | |
cl/forward_fc_wgperrow.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 5 2ms | |
calcGradWeights kernel 0: cannot be used | |
calcGradWeights kernel 1 time: 1ms | |
calcGradWeights kernel 2 time: 0ms | |
calcGradWeights kernel 3 time: 1ms | |
calcGradWeights kernel 4 time: 230ms | |
calcGradWeights layer selected kernel 2 | |
calcGradWeights kernel 0: cannot be used | |
calcGradWeights kernel 1 time: 2ms | |
calcGradWeights kernel 2 time: 1ms | |
calcGradWeights kernel 3 time: 1ms | |
calcGradWeights kernel 4 time: 58ms | |
calcGradWeights layer selected kernel 2 | |
epoch 4 loss, E, 0.0115848 | |
forward try kernel 6 | |
cl/forward_byinputplane.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 6 1ms | |
forward try kernel 6 | |
cl/forward_byinputplane.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 6 1ms | |
epoch 5 loss, E, 0.00987036 | |
forward try kernel 7 | |
... seems valid | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
ForwardAuto: kernel 7 95ms | |
forward try kernel 7 | |
... seems valid | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
ForwardAuto: kernel 7 52ms | |
epoch 6 loss, E, 0.00844797 | |
forward kernel 0: cannot be used | |
forward kernel 1 time: 0ms | |
forward kernel 2 time: 1ms | |
forward kernel 3 time: 0ms | |
forward kernel 4 time: 0ms | |
forward kernel 5 time: 1ms | |
forward kernel 6 time: 1ms | |
forward kernel 7 time: 95ms | |
forward layer selected kernel 1 | |
forward kernel 0: cannot be used | |
forward kernel 1 time: 0ms | |
forward kernel 2 time: 1ms | |
forward kernel 3 time: 0ms | |
forward kernel 4 time: 0ms | |
forward kernel 5 time: 2ms | |
forward kernel 6 time: 1ms | |
forward kernel 7 time: 52ms | |
forward layer selected kernel 1 | |
epoch 7 loss, E, 0.00724182 | |
epoch 8 loss, E, 0.00621212 | |
epoch 9 loss, E, 0.00533106 | |
epoch 10 loss, E, 0.00457645 | |
epoch 11 loss, E, 0.00392979 | |
epoch 12 loss, E, 0.00337539 | |
epoch 13 loss, E, 0.00289992 | |
epoch 14 loss, E, 0.002492 | |
epoch 15 loss, E, 0.00214191 | |
epoch 16 loss, E, 0.00184138 | |
epoch 17 loss, E, 0.00158331 | |
epoch 18 loss, E, 0.00136164 | |
epoch 19 loss, E, 0.0011712 | |
epoch 20 loss, E, 0.00100754 | |
epoch 21 loss, E, 0.000866877 | |
epoch 22 loss, E, 0.000745946 | |
epoch 23 loss, E, 0.000641966 | |
epoch 24 loss, E, 0.000552543 | |
epoch 25 loss, E, 0.000475625 | |
epoch 26 loss, E, 0.000409454 | |
epoch 27 loss, E, 0.000352522 | |
epoch 28 loss, E, 0.000303531 | |
epoch 29 loss, E, 0.00026137 | |
epoch 30 loss, E, 0.000225082 | |
epoch 31 loss, E, 0.000193845 | |
epoch 32 loss, E, 0.000166954 | |
epoch 33 loss, E, 0.000143801 | |
epoch 34 loss, E, 0.000123866 | |
epoch 35 loss, E, 0.000106699 | |
epoch 36 loss, E, 9.19176e-05 | |
epoch 37 loss, E, 7.91864e-05 | |
epoch 38 loss, E, 6.82211e-05 | |
epoch 39 loss, E, 5.87767e-05 | |
layer 0:InputLayer{ outputPlanes=1 outputSize=1 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=1 numFilters=2 filterSize=1 outputSize=1 padZeros=0 biased=1 skip=0} } | |
layer 2:ActivationLayer{ RELU } | |
layer 3:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=1 numFilters=2 filterSize=1 outputSize=1 padZeros=0 biased=1 skip=0} } | |
layer 4:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=4 40.0% | |
layer 3: params=6 60.0% | |
TOTAL : params=10 | |
loss, E, 5.87767e-05 | |
accuracy: 2/2 100% | |
accuracy: 2/2 | |
loss, E, 5.87767e-05 | |
loss, E, 5.87767e-05 | |
layer 0:InputLayer{ outputPlanes=1 outputSize=1 } | |
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=1 numFilters=2 filterSize=1 outputSize=1 padZeros=0 biased=1 skip=0} } | |
layer 2:ActivationLayer{ RELU } | |
layer 3:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=1 numFilters=2 filterSize=1 outputSize=1 padZeros=0 biased=1 skip=0} } | |
layer 4:SquareLossLayer{} | |
Parameters overview: (skipping 3 layers with 0 params) | |
layer 1: params=4 40.0% | |
layer 3: params=6 60.0% | |
TOTAL : params=10 | |
float weights1[] = {-0.303866f, -1.59823f}; | |
float weights3[] = {0.426358f, -0.719592f, -0.420361f, 0.719566f}; | |
float bias1[] = {-0.324465f, 0.60279f}; | |
float bias3[] = {0.506862f, -0.506837f}; | |
clblas teardown | |
[ OK ] testsimpleconvolvenet.imagesize1_n2_2layers_unbiased (2208 ms) | |
[ RUN ] testsimpleconvolvenet.imagesize1_n2_2layers_biased | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/copy.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/activate.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/applyActivationDeriv.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 1ms | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 1ms | |
backward try kernel 0 | |
... not plausibly optimal, skipping | |
backward try kernel 1 | |
cl/backward.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackwardAuto: kernel 1 1ms | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 0ms | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 2ms | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
loss, E, 1.19067 | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 1ms | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 1ms | |
backward try kernel 2 | |
cl/backward_cached.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackwardAuto: kernel 2 1ms | |
calcGradWeights try kernel 2 | |
cl/BackpropWeightsScratch.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 2 2ms | |
calcGradWeights try kernel 2 | |
cl/BackpropWeightsScratch.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 2 0ms | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 2ms | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 0ms | |
backward try kernel 3 | |
... seems valid | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
BackwardAuto: kernel 3 100ms | |
calcGradWeights try kernel 3 | |
options: -D BIASED -D gNumInputPlanes=2 -D gInputPlanes=2 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1 | |
cl/BackpropWeightsScratchLarge.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 3 0ms | |
calcGradWeights try kernel 3 | |
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1 | |
cl/BackpropWeightsScratchLarge.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 3 0ms | |
forward try kernel 4 | |
cl/forward4.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 4 2ms | |
forward try kernel 4 | |
cl/forward4.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 4 1ms | |
backward kernel 0: cannot be used | |
backward kernel 1 time: 1ms | |
backward kernel 2 time: 1ms | |
backward kernel 3 time: 100ms | |
backward layer selected kernel 1 | |
calcGradWeights try kernel 4 | |
... seems valid | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
BackpropWeightsAuto: kernel 4 223ms | |
calcGradWeights try kernel 4 | |
... seems valid | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
BackpropWeightsAuto: kernel 4 53ms | |
forward try kernel 5 | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward_fc_wgperrow.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 5 2ms | |
forward try kernel 5 | |
cl/forward_fc_wgperrow.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 5 0ms | |
calcGradWeights kernel 0: cannot be used | |
calcGradWeights kernel 1 time: 0ms | |
calcGradWeights kernel 2 time: 2ms | |
calcGradWeights kernel 3 time: 0ms | |
calcGradWeights kernel 4 time: 223ms | |
calcGradWeights layer selected kernel 1 | |
calcGradWeights kernel 0: cannot be used | |
calcGradWeights kernel 1 time: 2ms | |
calcGradWeights kernel 2 time: 0ms | |
calcGradWeights kernel 3 time: 0ms | |
calcGradWeights kernel 4 time: 53ms | |
calcGradWeights layer selected kernel 2 | |
forward try kernel 6 | |
cl/forward_byinputplane.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 6 1ms | |
forward try kernel 6 | |
cl/forward_byinputplane.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/reduce_segments.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 6 1ms | |
loss, E, 0.0667568 | |
forward try kernel 7 | |
... seems valid | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
ForwardAuto: kernel 7 102ms | |
forward try kernel 7 | |
... seems valid | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
ForwardAuto: kernel 7 52ms | |
forward kernel 0: cannot be used | |
forward kernel 1 time: 1ms | |
forward kernel 2 time: 1ms | |
forward kernel 3 time: 2ms | |
forward kernel 4 time: 2ms | |
forward kernel 5 time: 2ms | |
forward kernel 6 time: 1ms | |
forward kernel 7 time: 102ms | |
forward layer selected kernel 1 | |
forward kernel 0: cannot be used | |
forward kernel 1 time: 1ms | |
forward kernel 2 time: 1ms | |
forward kernel 3 time: 0ms | |
forward kernel 4 time: 1ms | |
forward kernel 5 time: 0ms | |
forward kernel 6 time: 1ms | |
forward kernel 7 time: 52ms | |
forward layer selected kernel 3 | |
loss, E, 0.00923595 | |
loss, E, 0.00112611 | |
loss, E, 0.000117399 | |
loss, E, 1.15641e-05 | |
dump enabled=0 | |
loss, E, 1.78565e-06 | |
accuracy: 2/2 100% | |
accuracy: 2/2 | |
loss, E, 1.78565e-06 | |
clblas teardown | |
[ OK ] testsimpleconvolvenet.imagesize1_n2_2layers_biased (2093 ms) | |
[ RUN ] testsimpleconvolvenet.imagesize_5_4_2layers_filtersize_2_4_biased_n3 | |
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL | |
Using OpenCL device: Intel(R) HD Graphics 4000 | |
initializing clblas | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/copy.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/activate.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/applyActivationDeriv.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/per_element_add.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 1ms | |
forward try kernel 0 | |
... not plausibly optimal, skipping | |
forward try kernel 1 | |
cl/forward1.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 1 1ms | |
backward try kernel 0 | |
... not plausibly optimal, skipping | |
backward try kernel 1 | |
cl/backward.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackwardAuto: kernel 1 0ms | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 1ms | |
calcGradWeights try kernel 0 | |
... not plausibly optimal, skipping | |
calcGradWeights try kernel 1 | |
cl/backpropweights.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 1 1ms | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
cl/per_element_op2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
loss, E, 1.58166 | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 1ms | |
forward try kernel 2 | |
cl/forward2.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 2 2ms | |
backward try kernel 2 | |
cl/backward_cached.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackwardAuto: kernel 2 0ms | |
calcGradWeights try kernel 2 | |
cl/BackpropWeightsScratch.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 2 2ms | |
calcGradWeights try kernel 2 | |
cl/BackpropWeightsScratch.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 2 1ms | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 0ms | |
forward try kernel 3 | |
cl/forward3.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 3 2ms | |
backward try kernel 3 | |
... seems valid | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
BackwardAuto: kernel 3 102ms | |
calcGradWeights try kernel 3 | |
options: -D BIASED -D gNumInputPlanes=3 -D gInputPlanes=3 -D gInputSize=4 -D gInputSizeSquared=16 -D gNumFilters=3 -D gFilterSize=4 -D gHalfFilterSize=2 -D gFilterSizeSquared=16 -D gNumOutputPlanes=3 -D gOutputPlanes=3 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=1 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=3 -DgInputStripeInnerNumRows=4 -DgInputStripeOuterNumRows=10 -DgInputStripeInnerSize=16 -DgInputStripeOuterSize=40 -DgInputStripeMarginSize=12 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1 | |
cl/BackpropWeightsScratchLarge.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 3 0ms | |
calcGradWeights try kernel 3 | |
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=5 -D gInputSizeSquared=25 -D gNumFilters=3 -D gFilterSize=2 -D gHalfFilterSize=1 -D gFilterSizeSquared=4 -D gNumOutputPlanes=3 -D gOutputPlanes=3 -D gOutputSize=4 -D gOutputSizeSquared=16 -D gPadZeros=0 -D gMargin=0 -D gEven=1 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=1 -DgInputStripeInnerNumRows=5 -DgInputStripeOuterNumRows=7 -DgInputStripeInnerSize=25 -DgInputStripeOuterSize=35 -DgInputStripeMarginSize=5 -DgOutputStripeNumRows=4 -DgOutputStripeSize=16 | |
cl/BackpropWeightsScratchLarge.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
BackpropWeightsAuto: kernel 3 0ms | |
forward try kernel 4 | |
cl/forward4.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 4 1ms | |
forward try kernel 4 | |
cl/forward4.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
... seems valid | |
ForwardAuto: kernel 4 2ms | |
backward kernel 0: cannot be used | |
backward kernel 1 time: 0ms | |
backward kernel 2 time: 0ms | |
backward kernel 3 time: 102ms | |
backward layer selected kernel 1 | |
calcGradWeights try kernel 4 | |
... seems valid | |
cl/per_element_op2_scalar.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
BackpropWeightsAuto: kernel 4 232ms | |
calcGradWeights try kernel 4 | |
... seems valid | |
ForwardIm2Col.cl build log: | |
fcl build 1 succeeded. | |
fcl build 2 succeeded. | |
bcl build succeeded. | |
OpenCL error -43 on line 163 of D:\DeepCL\clMathLibraries\clBLAS\src\library\blas\xgemm.cc |
What you could do is insert the following at line 159 of xgemm.cc:
printf("build options %s\n", sourceBuildOptions);
then rebuild and rerun.
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
(though they will probably complain that we dont have a simple example to replicate the problem, but we can add that later perhaps)