Skip to content

Instantly share code, notes, and snippets.

@jakakonda
Created October 28, 2015 22:19
Show Gist options
  • Save jakakonda/1697a560cce76346b086 to your computer and use it in GitHub Desktop.
Save jakakonda/1697a560cce76346b086 to your computer and use it in GitHub Desktop.
args: deepcl_unittests.exe --gtest_filter=-SLOW*
Note: Google Test filter = -SLOW*
[==========] Running 160 tests from 29 test cases.
[----------] Global test environment set-up.
[----------] 8 tests from testClBlas
[ RUN ] testClBlas.basic
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
sgemm_Col_NN_B0_3x1x2
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" now built; returning.
clblas teardown
[ OK ] testClBlas.basic (656 ms)
[ RUN ] testClBlas.basicthrice
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
sgemm_Col_NN_B0_3x1x2
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" now built; returning.
clblas teardown
initializing clblas
sgemm_Col_NN_B0_3x1x2
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" now built; returning.
clblas teardown
initializing clblas
sgemm_Col_NN_B0_3x1x2
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" now built; returning.
clblas teardown
[ OK ] testClBlas.basicthrice (156 ms)
[ RUN ] testClBlas.transA
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
1 2 9
3 7 5
initializing clblas
sgemm_Col_NT_B0_3x1x2
makeGemmKernel: "sgemm_Col_NT_B0_ML016_NL016_KX01" now built; returning.
clblas teardown
[ OK ] testClBlas.transA (63 ms)
[ RUN ] testClBlas.transB
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
3
-1
initializing clblas
sgemm_Col_TN_B0_3x1x2
makeGemmKernel: "sgemm_Col_TN_B0_ML016_NL016_KX01" now built; returning.
clblas teardown
[ OK ] testClBlas.transB (56 ms)
[ RUN ] testClBlas.colMajor
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
&AWrap->getBuffer()893855619888
&BWrap->getBuffer()893855620800
&CWrap->getBuffer()893855619936
sgemm_Col_NN_B0_3x1x2
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" now built; returning.
clblas teardown
[ OK ] testClBlas.colMajor (59 ms)
[ RUN ] testClBlas.colMajor2
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
sgemm_Col_NN_B0_4x3x2
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" now built; returning.
clblas teardown
[ OK ] testClBlas.colMajor2 (57 ms)
[ RUN ] testClBlas.colMajorTransA
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
sgemm_Col_TN_B0_3x1x2
makeGemmKernel: "sgemm_Col_TN_B0_ML016_NL016_KX01" now built; returning.
clblas teardown
[ OK ] testClBlas.colMajorTransA (54 ms)
[ RUN ] testClBlas.colMajorTransB
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
sgemm_Col_NT_B0_3x1x2
makeGemmKernel: "sgemm_Col_NT_B0_ML016_NL016_KX01" now built; returning.
clblas teardown
[ OK ] testClBlas.colMajorTransB (56 ms)
[----------] 8 tests from testClBlas (1157 ms total)
[----------] 1 test from testDeepCL
[ RUN ] testDeepCL.basic
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
expected number of output: 4
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
clblas teardown
[ OK ] testDeepCL.basic (262 ms)
[----------] 1 test from testDeepCL (262 ms total)
[----------] 23 tests from testupdateweights
[ RUN ] testupdateweights.conv1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
layer 0:InputLayer{ outputPlanes=2 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=5 numFilters=2 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:SquareLossLayer{}
layer 0:InputLayer{ outputPlanes=2 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=5 numFilters=2 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:SquareLossLayer{}
batchSize: 4
inputtotalsize=200 outputTotalSize=72
layer ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=5 numFilters=2 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
weightsize=36 biassize=0
statefultimer v0.7
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
layer 0:InputLayer{ outputPlanes=2 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=5 numFilters=2 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:SquareLossLayer{}
Parameters overview: (skipping 2 layers with 0 params)
layer 1: params=36 100.0%
TOTAL : params=36
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
idx=8 predicted losschange=0.000111445 actual=0.000112534
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
idx=13 predicted losschange=-0.000886715 actual=-0.000884056
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 0ms
idx=0 predicted losschange=0.000210491 actual=0.000212669
forward try kernel 5
ForwardAuto: kernel 5: this instance cant be used: For ForwardFc, filtersize and inputimagesize must be identical
... not valid
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
idx=22 predicted losschange=-0.000164224 actual=-0.000163078
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_NN_B0_9x2x18
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" now built; returning.
sgemm_Col_NN_B0_9x2x18
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
sgemm_Col_NN_B0_9x2x18
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
sgemm_Col_NN_B0_9x2x18
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
ForwardAuto: kernel 7 115ms
idx=22 predicted losschange=-0.000164224 actual=-0.000163078
forward kernel 0: cannot be used
forward kernel 1 time: 0ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 0ms
forward kernel 5: cannot be used
forward kernel 6 time: 1ms
forward kernel 7 time: 115ms
forward layer selected kernel 1
idx=35 predicted losschange=-0.000391028 actual=-0.000391006
idx=26 predicted losschange=2.23142e-05 actual=2.57492e-05
idx=27 predicted losschange=9.38328e-05 actual=9.44138e-05
idx=27 predicted losschange=9.38328e-05 actual=9.44138e-05
idx=10 predicted losschange=0.00186697 actual=0.00187111
clblas teardown
[ OK ] testupdateweights.conv1 (828 ms)
[ RUN ] testupdateweights.conv1z
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
layer 0:InputLayer{ outputPlanes=2 outputSize=3 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=3 numFilters=2 filterSize=3 outputSize=3 padZeros=1 biased=0 skip=0} }
layer 2:SquareLossLayer{}
layer 0:InputLayer{ outputPlanes=2 outputSize=3 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=3 numFilters=2 filterSize=3 outputSize=3 padZeros=1 biased=0 skip=0} }
layer 2:SquareLossLayer{}
batchSize: 4
inputtotalsize=72 outputTotalSize=72
layer ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=3 numFilters=2 filterSize=3 outputSize=3 padZeros=1 biased=0 skip=0} }
weightsize=36 biassize=0
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
layer 0:InputLayer{ outputPlanes=2 outputSize=3 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=3 numFilters=2 filterSize=3 outputSize=3 padZeros=1 biased=0 skip=0} }
layer 2:SquareLossLayer{}
Parameters overview: (skipping 2 layers with 0 params)
layer 1: params=36 100.0%
TOTAL : params=36
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
idx=8 predicted losschange=0.00039831 actual=0.000397682
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
idx=13 predicted losschange=-0.000426502 actual=-0.000426292
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 0ms
idx=0 predicted losschange=0.000143287 actual=0.000144005
forward try kernel 5
ForwardAuto: kernel 5: this instance cant be used: For ForwardFc, padzeros must be disabled
... not valid
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
idx=22 predicted losschange=-1.7916e-06 actual=0
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_NN_B0_9x2x18
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" now built; returning.
sgemm_Col_NN_B0_9x2x18
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
sgemm_Col_NN_B0_9x2x18
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
sgemm_Col_NN_B0_9x2x18
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
ForwardAuto: kernel 7 114ms
idx=22 predicted losschange=-1.7916e-06 actual=0
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 0ms
forward kernel 5: cannot be used
forward kernel 6 time: 1ms
forward kernel 7 time: 114ms
forward layer selected kernel 4
idx=35 predicted losschange=-2.82565e-05 actual=-2.76566e-05
idx=26 predicted losschange=3.62191e-05 actual=3.71933e-05
idx=27 predicted losschange=-0.000319862 actual=-0.000317574
idx=27 predicted losschange=-0.000319862 actual=-0.000317574
idx=10 predicted losschange=-0.000883857 actual=-0.000883102
clblas teardown
[ OK ] testupdateweights.conv1z (748 ms)
[ RUN ] testupdateweights.numericallytest
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
layer 0:InputLayer{ outputPlanes=1 outputSize=1 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=1 numFilters=1 filterSize=1 outputSize=1 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=1 100.0%
TOTAL : params=1
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 0ms
layer 0:InputLayer{ outputPlanes=1 outputSize=1 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=1 numFilters=1 filterSize=1 outputSize=1 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=1 100.0%
TOTAL : params=1
loss 0.0367983 loss2 0.0367913 change: 7.01472e-06
sumweightsdiff -0.000264842
loss change 7.01472e-06
estimatedLossChangeFromW 7.01413e-06
[ OK ] testupdateweights.numericallytest (529 ms)
[ RUN ] testupdateweights.numericallytest_imagesize3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
layer 0:InputLayer{ outputPlanes=1 outputSize=3 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=3 numFilters=1 filterSize=1 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=1 100.0%
TOTAL : params=1
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
layer 0:InputLayer{ outputPlanes=1 outputSize=3 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=3 numFilters=1 filterSize=1 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=1 100.0%
TOTAL : params=1
loss 1.23358 loss2 1.21612 change: 0.0174605
sumweightsdiff -0.0132709
loss change 0.0174605
estimatedLossChangeFromW 0.0176118
[ OK ] testupdateweights.numericallytest_imagesize3 (543 ms)
[ RUN ] testupdateweights.numericallytest_imagesize5
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
layer 0:InputLayer{ outputPlanes=1 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=5 numFilters=1 filterSize=1 outputSize=5 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=1 100.0%
TOTAL : params=1
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
layer 0:InputLayer{ outputPlanes=1 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=5 numFilters=1 filterSize=1 outputSize=5 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=1 100.0%
TOTAL : params=1
loss 4.12958 loss2 4.11952 change: 0.0100665
sumweightsdiff -0.0101708
loss change 0.0100665
estimatedLossChangeFromW 0.0103444
[ OK ] testupdateweights.numericallytest_imagesize5 (562 ms)
[ RUN ] testupdateweights.numericallytest_imagesize9
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
layer 0:InputLayer{ outputPlanes=1 outputSize=9 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=9 numFilters=1 filterSize=1 outputSize=9 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=1 100.0%
TOTAL : params=1
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
layer 0:InputLayer{ outputPlanes=1 outputSize=9 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=9 numFilters=1 filterSize=1 outputSize=9 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=1 100.0%
TOTAL : params=1
loss 13.4341 loss2 13.4339 change: 0.000207901
sumweightsdiff 0.00153953
loss change 0.000207901
estimatedLossChangeFromW 0.000237015
[ OK ] testupdateweights.numericallytest_imagesize9 (561 ms)
[ RUN ] testupdateweights.numericallytest_imagesize9_filtersize9
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
layer 0:InputLayer{ outputPlanes=1 outputSize=9 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=9 numFilters=1 filterSize=9 outputSize=1 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=81 100.0%
TOTAL : params=81
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
layer 0:InputLayer{ outputPlanes=1 outputSize=9 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=9 numFilters=1 filterSize=9 outputSize=1 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=81 100.0%
TOTAL : params=81
loss 0.135896 loss2 0.0848782 change: 0.0510182
sumweightsdiff -0.0322406
loss change 0.0510182
estimatedLossChangeFromW 0.0555841
[ OK ] testupdateweights.numericallytest_imagesize9_filtersize9 (596 ms)
[ RUN ] testupdateweights.numericallytest_imagesize9_filtersize3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
layer 0:InputLayer{ outputPlanes=1 outputSize=9 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=9 numFilters=1 filterSize=3 outputSize=7 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=9 100.0%
TOTAL : params=9
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
layer 0:InputLayer{ outputPlanes=1 outputSize=9 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=9 numFilters=1 filterSize=3 outputSize=7 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=9 100.0%
TOTAL : params=9
loss 7.70633 loss2 7.41581 change: 0.290529
sumweightsdiff -0.0898813
loss change 0.290529
estimatedLossChangeFromW 0.316231
[ OK ] testupdateweights.numericallytest_imagesize9_filtersize3 (571 ms)
[ RUN ] testupdateweights.numericallytest_imagesize3_filtersize3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
layer 0:InputLayer{ outputPlanes=1 outputSize=3 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=3 numFilters=1 filterSize=3 outputSize=1 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=9 100.0%
TOTAL : params=9
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 0ms
layer 0:InputLayer{ outputPlanes=1 outputSize=3 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=3 numFilters=1 filterSize=3 outputSize=1 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=9 100.0%
TOTAL : params=9
loss 0.0719101 loss2 0.0694461 change: 0.00246408
sumweightsdiff -0.0110647
loss change 0.00246408
estimatedLossChangeFromW 0.00248372
[ OK ] testupdateweights.numericallytest_imagesize3_filtersize3 (539 ms)
[ RUN ] testupdateweights.numericallytest_imagesize5_filtersize3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
layer 0:InputLayer{ outputPlanes=1 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=9 100.0%
TOTAL : params=9
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
layer 0:InputLayer{ outputPlanes=1 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=9 100.0%
TOTAL : params=9
loss 1.20022 loss2 1.17241 change: 0.0278131
sumweightsdiff -0.0203888
loss change 0.0278131
estimatedLossChangeFromW 0.0280929
[ OK ] testupdateweights.numericallytest_imagesize5_filtersize3 (564 ms)
[ RUN ] testupdateweights.numericallytest_imagesize5_filtersize3_batchsize3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
layer 0:InputLayer{ outputPlanes=1 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=9 100.0%
TOTAL : params=9
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
layer 0:InputLayer{ outputPlanes=1 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=9 100.0%
TOTAL : params=9
loss 4.97142 loss2 4.78768 change: 0.183745
sumweightsdiff -0.056004
loss change 0.183745
estimatedLossChangeFromW 0.193264
[ OK ] testupdateweights.numericallytest_imagesize5_filtersize3_batchsize3 (569 ms)
[ RUN ] testupdateweights.numericallytest_imagesize5_filtersize3_planes3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
layer 0:InputLayer{ outputPlanes=3 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=3 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=27 100.0%
TOTAL : params=27
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
layer 0:InputLayer{ outputPlanes=3 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=3 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=27 100.0%
TOTAL : params=27
loss 1.08887 loss2 0.9575 change: 0.13137
sumweightsdiff -0.00764532
loss change 0.13137
estimatedLossChangeFromW 0.134379
[ OK ] testupdateweights.numericallytest_imagesize5_filtersize3_planes3 (636 ms)
[ RUN ] testupdateweights.numericallytest_imagesize5_filtersize3_planes3_batchsize3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
layer 0:InputLayer{ outputPlanes=3 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=3 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=27 100.0%
TOTAL : params=27
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 0ms
layer 0:InputLayer{ outputPlanes=3 outputSize=5 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=3 inputSize=5 numFilters=1 filterSize=3 outputSize=3 padZeros=0 biased=0 skip=0} }
layer 2:ActivationLayer{ TANH }
layer 3:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=27 100.0%
TOTAL : params=27
loss 4.76631 loss2 4.18154 change: 0.584769
sumweightsdiff 0.029606
loss change 0.584769
estimatedLossChangeFromW 0.620442
[ OK ] testupdateweights.numericallytest_imagesize5_filtersize3_planes3_batchsize3 (615 ms)
[ RUN ] testupdateweights.backprop_weights_2
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=1 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
mismatch for i 0
[ OK ] testupdateweights.backprop_weights_2 (52 ms)
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize2
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=2 -D gInputSizeSquared=4 -D gNumFilters=1 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=2 -D gOutputSizeSquared=4 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=2 -DgInputStripeOuterNumRows=2 -DgInputStripeInnerSize=4 -DgInputStripeOuterSize=4 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=2 -DgOutputStripeSize=4
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
mismatch for i 0
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize2 (58 ms)
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize3_filtersize3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=3 -D gInputSizeSquared=9 -D gNumFilters=1 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=3 -DgInputStripeOuterNumRows=7 -DgInputStripeInnerSize=9 -DgInputStripeOuterSize=21 -DgInputStripeMarginSize=6 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
mismatch for i 0
mismatch for i 1
mismatch for i 2
mismatch for i 3
mismatch for i 4
mismatch for i 5
mismatch for i 6
mismatch for i 7
mismatch for i 8
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize3_filtersize3 (56 ms)
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize4_filtersize3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=4 -D gInputSizeSquared=16 -D gNumFilters=1 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=2 -D gOutputSizeSquared=4 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=4 -DgInputStripeOuterNumRows=8 -DgInputStripeInnerSize=16 -DgInputStripeOuterSize=32 -DgInputStripeMarginSize=8 -DgOutputStripeNumRows=2 -DgOutputStripeSize=4
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
mismatch for i 0
mismatch for i 8
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize4_filtersize3 (79 ms)
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize5_filtersize3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=5 -D gInputSizeSquared=25 -D gNumFilters=1 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=3 -D gOutputSizeSquared=9 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=5 -DgInputStripeOuterNumRows=9 -DgInputStripeInnerSize=25 -DgInputStripeOuterSize=45 -DgInputStripeMarginSize=10 -DgOutputStripeNumRows=3 -DgOutputStripeSize=9
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
mismatch for i 0
mismatch for i 4
mismatch for i 8
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize5_filtersize3 (73 ms)
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize3_filtersize1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=3 -D gInputSizeSquared=9 -D gNumFilters=1 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=3 -D gOutputSizeSquared=9 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=3 -DgInputStripeOuterNumRows=3 -DgInputStripeInnerSize=9 -DgInputStripeOuterSize=9 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=3 -DgOutputStripeSize=9
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
mismatch for i 0
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize3_filtersize1 (65 ms)
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize16_filtersize1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=16 -D gInputSizeSquared=256 -D gNumFilters=1 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=16 -D gOutputSizeSquared=256 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=16 -DgInputStripeOuterNumRows=16 -DgInputStripeInnerSize=256 -DgInputStripeOuterSize=256 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=16 -DgOutputStripeSize=256
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
mismatch for i 0
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize16_filtersize1 (108 ms)
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize17_filtersize1
LayerDimensions{ inputPlanes=1 inputSize=17 numFilters=1 filterSize=1 outputSize=17 padZeros=0 biased=0 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=17 -D gInputSizeSquared=289 -D gNumFilters=1 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=17 -D gOutputSizeSquared=289 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=17 -DgInputStripeOuterNumRows=17 -DgInputStripeInnerSize=289 -DgInputStripeOuterSize=289 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=17 -DgOutputStripeSize=289
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
mismatch for i 0
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize17_filtersize1 (91 ms)
[ RUN ] testupdateweights.backprop_weights_2_upstreamimagesize17_filtersize1_moredata
expectedresult: -958.715
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=17 -D gInputSizeSquared=289 -D gNumFilters=1 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=17 -D gOutputSizeSquared=289 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=17 -DgInputStripeOuterNumRows=17 -DgInputStripeInnerSize=289 -DgInputStripeOuterSize=289 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=17 -DgOutputStripeSize=289
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
mismatch for i 0
[ OK ] testupdateweights.backprop_weights_2_upstreamimagesize17_filtersize1_moredata (88 ms)
[ RUN ] testupdateweights.backprop_instance3_smaller2
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
numweights: 36
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=96 -D gInputSizeSquared=9216 -D gNumFilters=1 -D gFilterSize=6 -D gHalfFilterSize=3 -D gFilterSizeSquared=36 -D gNumOutputPlanes=1 -D gOutputPlanes=1 -D gOutputSize=91 -D gOutputSizeSquared=8281 -D gPadZeros=0 -D gMargin=0 -D gEven=1 -D gSkip=0 -DgNumStripes=8 -DgInputStripeMarginRows=5 -DgInputStripeInnerNumRows=12 -DgInputStripeOuterNumRows=22 -DgInputStripeInnerSize=1152 -DgInputStripeOuterSize=2112 -DgInputStripeMarginSize=480 -DgOutputStripeNumRows=12 -DgOutputStripeSize=1092
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
138 0 0 0 0 0
132 0 0 0 0 0
138 0 0 0 0 0
138 0 0 0 0 0
138 0 0 0 0 0
132 0 0 0 0 0
138 0 0 0 0 0
132 0 0 0 0 0
138 0 0 0 0 0
138 0 0 0 0 0
138 0 0 0 0 0
132 0 0 0 0 0
......
......
......
......
......
......
0=0 0 0 0 0 0 0 0
1=0 0 0 0 0 0 0 0
2=0 0 0 0 0 0 0 0
3=0 0 0 0 0 0 0 0
4=0 0 0 0 0 0 0 0
5=0 0 0 0 0 0 0 0
6=0 0 0 0 0 0 0 0
7=0 0 0 0 0 0 0 0
8=0 0 0 0 0 0 0 0
9=0 0 0 0 0 0 0 0
10=0 0 0 0 0 0 0 0
11=0 0 0 0 0 0 0 0
0=0 0 0 0 0 0 0 0
1=0 0 0 0 0 0 0 0
2=0 0 0 0 0 0 0 0
3=0 0 0 0 0 0 0 0
4=0 0 0 0 0 0 0 0
5=0 0 0 0 0 0 0 0
6=0 0 0 0 0 0 0 0
7=0 0 0 0 0 0 0 0
8=0 0 0 0 0 0 0 0
9=0 0 0 0 0 0 0 0
10=0 0 0 0 0 0 0 0
11=0 0 0 0 0 0 0 0
12=0 0 0 0 0 0 0 0
13=0 0 0 0 0 0 0 0
14=0 0 0 0 0 0 0 0
15=0 0 0 0 0 0 0 0
16=0 0 0 0 0 0 0 0
17=0 0 0 0 0 0 0 0
18=0 0 0 0 0 0 0 0
19=0 0 0 0 0 0 0 0
[ OK ] testupdateweights.backprop_instance3_smaller2 (162 ms)
[----------] 23 tests from testupdateweights (8695 ms total)
[----------] 17 tests from testforward
[ RUN ] testforward.imagesize2_nopadzeros
expected number of output: 4
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testforward.imagesize2_nopadzeros (248 ms)
[ RUN ] testforward.imagesize2_padzeros
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
checking result[0]=0 expecting: 0
checking result[1]=0 expecting: 0
checking result[2]=0 expecting: 0
checking result[3]=0.2 expecting: 0.2
checking result[4]=-0.13 expecting: -0.13
checking result[5]=-0.15 expecting: -0.15
checking result[6]=0 expecting: 0
checking result[7]=0 expecting: 0
checking result[8]=0 expecting: 0
checking result[9]=0 expecting: 0
checking result[10]=0 expecting: 0
checking result[11]=0 expecting: 0
checking result[12]=-0.55 expecting: -0.55
checking result[13]=0.02 expecting: 0.02
checking result[14]=0.21 expecting: 0.21
checking result[27]=-14.3 expecting: -14.3
checking result[28]=-9.6 expecting: -9.6
checking result[29]=11.9 expecting: 11.9
checking result[35]=0.46 expecting: 0.46
[ OK ] testforward.imagesize2_padzeros (96 ms)
[ RUN ] testforward.imagesize3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
test1 ok
[ OK ] testforward.imagesize3 (94 ms)
[ RUN ] testforward.test2
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testforward.test2 (87 ms)
[ RUN ] testforward.test3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
[ OK ] testforward.test3 (90 ms)
[ RUN ] testforward.compare_0_1_biased_nopad
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
clblas teardown
[ OK ] testforward.compare_0_1_biased_nopad (161 ms)
[ RUN ] testforward.compare_0_1_biased_pad
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
clblas teardown
[ OK ] testforward.compare_0_1_biased_pad (172 ms)
[ RUN ] testforward.compare_1_n_biased_nopad
instance: 2
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
clblas teardown
instance: 3
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
clblas teardown
instance: 4
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
clblas teardown
instance: 6
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
clblas teardown
instance: 7
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_NN_B0_225x8x200
makeGemmKernel: "sgemm_Col_NN_B0_MX016_NL016_KX08" now built; returning.
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX08" now built; returning.
sgemm_Col_NN_B0_225x8x200
makeGemmKernel: "sgemm_Col_NN_B0_MX016_NL016_KX08" already built; returning.
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX08" already built; returning.
sgemm_Col_NN_B0_225x8x200
makeGemmKernel: "sgemm_Col_NN_B0_MX016_NL016_KX08" already built; returning.
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX08" already built; returning.
sgemm_Col_NN_B0_225x8x200
makeGemmKernel: "sgemm_Col_NN_B0_MX016_NL016_KX08" already built; returning.
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX08" already built; returning.
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=15 padZeros=0 biased=1 skip=0}
clblas teardown
[ OK ] testforward.compare_1_n_biased_nopad (1383 ms)
[ RUN ] testforward.compare_1_n_biased_pad
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
instance: 2
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
clblas teardown
instance: 3
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
clblas teardown
instance: 4
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
clblas teardown
instance: 6
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
clblas teardown
instance: 7
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_NN_B0_361x8x200
makeGemmKernel: "sgemm_Col_NN_B0_MX016_NL016_KX08" now built; returning.
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX08" now built; returning.
sgemm_Col_NN_B0_361x8x200
makeGemmKernel: "sgemm_Col_NN_B0_MX016_NL016_KX08" already built; returning.
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX08" already built; returning.
sgemm_Col_NN_B0_361x8x200
makeGemmKernel: "sgemm_Col_NN_B0_MX016_NL016_KX08" already built; returning.
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX08" already built; returning.
sgemm_Col_NN_B0_361x8x200
makeGemmKernel: "sgemm_Col_NN_B0_MX016_NL016_KX08" already built; returning.
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX08" already built; returning.
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=5 outputSize=19 padZeros=1 biased=1 skip=0}
clblas teardown
[ OK ] testforward.compare_1_n_biased_pad (1438 ms)
[ RUN ] testforward.compare_1_5_biased_nopad
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=19 outputSize=1 padZeros=0 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=8 inputSize=19 numFilters=8 filterSize=19 outputSize=1 padZeros=0 biased=1 skip=0}
clblas teardown
[ OK ] testforward.compare_1_5_biased_nopad (198 ms)
[ RUN ] testforward.compare_1_4_fcscenario
LayerDimensions{ inputPlanes=10 inputSize=24 numFilters=10 filterSize=24 outputSize=1 padZeros=0 biased=1 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=10 inputSize=24 numFilters=10 filterSize=24 outputSize=1 padZeros=0 biased=1 skip=0}
clblas teardown
[ OK ] testforward.compare_1_4_fcscenario (240 ms)
[ RUN ] testforward.compare_break1_0_1
LayerDimensions{ inputPlanes=1 inputSize=33 numFilters=1 filterSize=1 outputSize=33 padZeros=0 biased=0 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 1
dump enabled=0
batch 0 batchsize 1
dump enabled=0
LayerDimensions{ inputPlanes=1 inputSize=33 numFilters=1 filterSize=1 outputSize=33 padZeros=0 biased=0 skip=0}
clblas teardown
[ OK ] testforward.compare_break1_0_1 (87 ms)
[ RUN ] testforward.compare_break1_0_4
LayerDimensions{ inputPlanes=1 inputSize=33 numFilters=1 filterSize=1 outputSize=33 padZeros=0 biased=0 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 1
dump enabled=0
batch 0 batchsize 1
dump enabled=0
LayerDimensions{ inputPlanes=1 inputSize=33 numFilters=1 filterSize=1 outputSize=33 padZeros=0 biased=0 skip=0}
clblas teardown
[ OK ] testforward.compare_break1_0_4 (94 ms)
[ RUN ] testforward.comparespecific_break2
LayerDimensions{ inputPlanes=64 inputSize=19 numFilters=64 filterSize=19 outputSize=1 padZeros=0 biased=0 skip=0}
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
batch 0 batchsize 4
dump enabled=0
batch 0 batchsize 4
dump enabled=0
LayerDimensions{ inputPlanes=64 inputSize=19 numFilters=64 filterSize=19 outputSize=1 padZeros=0 biased=0 skip=0}
clblas teardown
[ OK ] testforward.comparespecific_break2 (274 ms)
[ RUN ] testforward.softmax
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
output[0]=0.0320586
output[1]=0.0871443
output[2]=0.643914
output[3]=0.236883
loss 0.44019
loss 3.44019
loss 2.44019
loss 1.44019
[ OK ] testforward.softmax (6 ms)
[ RUN ] testforward.softmax_byplane
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
output[0]=0.0320586
output[1]=0.0871443
output[2]=0.643914
output[3]=0.236883
loss 0.44019
loss 3.44019
loss 2.44019
loss 1.44019
[ OK ] testforward.softmax_byplane (5 ms)
[ RUN ] testforward.crash_from_jm
-D gNumInputPlanes=32 -D gInputPlanes=32 -D gInputSize=28 -D gInputSizeSquared=784 -D gNumFilters=20 -D gFilterSize=28 -D gHalfFilterSize=14 -D gFilterSizeSquared=784 -D gNumOutputPlanes=20 -D gOutputPlanes=20 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=1 -D gSkip=0
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
dump enabled=0
[ OK ] testforward.crash_from_jm (241 ms)
[----------] 17 tests from testforward (4915 ms total)
[----------] 2 tests from testfilehelper
[ RUN ] testfilehelper.testfilehelper
[ OK ] testfilehelper.testfilehelper (19 ms)
[ RUN ] testfilehelper.testreadchunk
[ OK ] testfilehelper.testreadchunk (5 ms)
[----------] 2 tests from testfilehelper (24 ms total)
[----------] 12 tests from testsimpleconvolvenet
[ RUN ] testsimpleconvolvenet.imagesize1_planes2_filters2_unbiased_tanh
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
loss, E, 0.141046
accuracy: 2/2 100%
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
calcGradWeights try kernel 3
options: -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_TN_B1_1x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" now built; returning.
sgemm_Col_TN_B1_1x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" already built; returning.
BackpropWeightsAuto: kernel 4 132ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 132ms
calcGradWeights layer selected kernel 1
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_NN_B0_1x2x1
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" now built; returning.
sgemm_Col_NN_B0_1x2x1
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
ForwardAuto: kernel 7 97ms
forward kernel 0: cannot be used
forward kernel 1 time: 0ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5 time: 1ms
forward kernel 6 time: 1ms
forward kernel 7 time: 97ms
forward layer selected kernel 1
loss, E, 0.0733092
accuracy: 2/2 100%
loss, E, 0.0426809
accuracy: 2/2 100%
loss, E, 0.0262453
accuracy: 2/2 100%
loss, E, 0.0164245
accuracy: 2/2 100%
loss, E, 0.0107573
accuracy: 2/2 100%
accuracy: 2/2
clblas teardown
[ OK ] testsimpleconvolvenet.imagesize1_planes2_filters2_unbiased_tanh (1218 ms)
[ RUN ] testsimpleconvolvenet.imagesize1_planes2_filters2_tanh
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
loss, E, 0.964924
accuracy: 1/2 50%
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 2ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 0ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 0ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_TN_B1_1x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" now built; returning.
sgemm_Col_TN_B1_1x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" already built; returning.
BackpropWeightsAuto: kernel 4 223ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 0ms
calcGradWeights kernel 2 time: 2ms
calcGradWeights kernel 3 time: 0ms
calcGradWeights kernel 4 time: 223ms
calcGradWeights layer selected kernel 1
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_NN_B0_1x2x1
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" now built; returning.
sgemm_Col_NN_B0_1x2x1
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
ForwardAuto: kernel 7 99ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 0ms
forward kernel 4 time: 1ms
forward kernel 5 time: 1ms
forward kernel 6 time: 1ms
forward kernel 7 time: 99ms
forward layer selected kernel 3
loss, E, 0.0057046
accuracy: 2/2 100%
loss, E, 1.34826e-05
accuracy: 2/2 100%
loss, E, 3.61967e-08
accuracy: 2/2 100%
accuracy: 2/2
clblas teardown
[ OK ] testsimpleconvolvenet.imagesize1_planes2_filters2_tanh (1301 ms)
[ RUN ] testsimpleconvolvenet.imagesize3_n4_filtersize3_tanh
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
loss, E, 1.13283
accuracy: 3/4 75%
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=3 -D gInputSizeSquared=9 -D gNumFilters=2 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=3 -DgInputStripeOuterNumRows=7 -DgInputStripeInnerSize=9 -DgInputStripeOuterSize=21 -DgInputStripeMarginSize=6 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 0ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_TN_B1_9x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" now built; returning.
sgemm_Col_TN_B1_9x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" already built; returning.
sgemm_Col_TN_B1_9x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" already built; returning.
sgemm_Col_TN_B1_9x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" already built; returning.
BackpropWeightsAuto: kernel 4 224ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 0ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 0ms
calcGradWeights kernel 4 time: 224ms
calcGradWeights layer selected kernel 1
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
loss, E, 0.00996344
accuracy: 4/4 100%
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_NN_B0_1x2x9
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" now built; returning.
sgemm_Col_NN_B0_1x2x9
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
sgemm_Col_NN_B0_1x2x9
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
sgemm_Col_NN_B0_1x2x9
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
ForwardAuto: kernel 7 106ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5 time: 1ms
forward kernel 6 time: 1ms
forward kernel 7 time: 106ms
forward layer selected kernel 1
loss, E, 4.70675e-05
accuracy: 4/4 100%
loss, E, 4.09645e-07
accuracy: 4/4 100%
accuracy: 4/4
clblas teardown
[ OK ] testsimpleconvolvenet.imagesize3_n4_filtersize3_tanh (1307 ms)
[ RUN ] testsimpleconvolvenet.imagesize1_2planes_filtersize1
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
loss, E, 0.751601
accuracy: 2/2 100%
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 0ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_TN_B1_1x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" now built; returning.
sgemm_Col_TN_B1_1x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" already built; returning.
BackpropWeightsAuto: kernel 4 226ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 2ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 0ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 0ms
calcGradWeights kernel 4 time: 226ms
calcGradWeights layer selected kernel 1
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
loss, E, 0.195916
accuracy: 2/2 100%
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_NN_B0_1x2x1
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" now built; returning.
sgemm_Col_NN_B0_1x2x1
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
ForwardAuto: kernel 7 96ms
forward kernel 0: cannot be used
forward kernel 1 time: 0ms
forward kernel 2 time: 0ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5 time: 2ms
forward kernel 6 time: 1ms
forward kernel 7 time: 96ms
forward layer selected kernel 1
loss, E, 0.0679117
accuracy: 2/2 100%
loss, E, 0.023677
accuracy: 2/2 100%
loss, E, 0.00825563
accuracy: 2/2 100%
loss, E, 0.00287856
accuracy: 2/2 100%
loss, E, 0.00100369
accuracy: 2/2 100%
loss, E, 0.000349964
accuracy: 2/2 100%
accuracy: 2/2 100%
accuracy: 2/2
loss, E, 0.000150648
clblas teardown
[ OK ] testsimpleconvolvenet.imagesize1_2planes_filtersize1 (1231 ms)
[ RUN ] testsimpleconvolvenet.imagesize3_n4_filtersize3_relu
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
loss, E, 1.48951
accuracy: 2/4 50%
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=3 -D gInputSizeSquared=9 -D gNumFilters=2 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=3 -DgInputStripeOuterNumRows=7 -DgInputStripeInnerSize=9 -DgInputStripeOuterSize=21 -DgInputStripeMarginSize=6 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 2ms
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_TN_B1_9x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" now built; returning.
sgemm_Col_TN_B1_9x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" already built; returning.
sgemm_Col_TN_B1_9x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" already built; returning.
sgemm_Col_TN_B1_9x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" already built; returning.
BackpropWeightsAuto: kernel 4 254ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 254ms
calcGradWeights layer selected kernel 1
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
loss, E, 1.12957
accuracy: 2/4 50%
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_NN_B0_1x2x9
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" now built; returning.
sgemm_Col_NN_B0_1x2x9
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
sgemm_Col_NN_B0_1x2x9
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
sgemm_Col_NN_B0_1x2x9
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
ForwardAuto: kernel 7 100ms
forward kernel 0: cannot be used
forward kernel 1 time: 0ms
forward kernel 2 time: 0ms
forward kernel 3 time: 1ms
forward kernel 4 time: 2ms
forward kernel 5 time: 1ms
forward kernel 6 time: 1ms
forward kernel 7 time: 100ms
forward layer selected kernel 1
loss, E, 0.070782
accuracy: 4/4 100%
loss, E, 0.003026
accuracy: 4/4 100%
loss, E, 0.00021158
accuracy: 4/4 100%
loss, E, 1.96858e-05
accuracy: 4/4 100%
loss, E, 2.03002e-06
accuracy: 4/4 100%
loss, E, 2.15572e-07
accuracy: 4/4 100%
loss, E, 2.3083e-08
accuracy: 4/4 100%
loss, E, 2.48239e-09
accuracy: 4/4 100%
loss, E, 4.14442e-10
accuracy: 4/4 100%
accuracy: 4/4
loss, E, 4.14442e-10
clblas teardown
[ OK ] testsimpleconvolvenet.imagesize3_n4_filtersize3_relu (1548 ms)
[ RUN ] testsimpleconvolvenet.imagesize3_n4_filtersize3_linear
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
loss, E, 0.50604
accuracy: 4/4 100%
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 0ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=3 -D gInputSizeSquared=9 -D gNumFilters=2 -D gFilterSize=3 -D gHalfFilterSize=1 -D gFilterSizeSquared=9 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=2 -DgInputStripeInnerNumRows=3 -DgInputStripeOuterNumRows=7 -DgInputStripeInnerSize=9 -DgInputStripeOuterSize=21 -DgInputStripeMarginSize=6 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 0ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_TN_B1_9x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" now built; returning.
sgemm_Col_TN_B1_9x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" already built; returning.
sgemm_Col_TN_B1_9x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" already built; returning.
sgemm_Col_TN_B1_9x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" already built; returning.
BackpropWeightsAuto: kernel 4 336ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 0ms
calcGradWeights kernel 3 time: 0ms
calcGradWeights kernel 4 time: 336ms
calcGradWeights layer selected kernel 2
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
loss, E, 0.0565529
accuracy: 4/4 100%
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_NN_B0_1x2x9
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" now built; returning.
sgemm_Col_NN_B0_1x2x9
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
sgemm_Col_NN_B0_1x2x9
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
sgemm_Col_NN_B0_1x2x9
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
ForwardAuto: kernel 7 135ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5 time: 1ms
forward kernel 6 time: 1ms
forward kernel 7 time: 135ms
forward layer selected kernel 1
loss, E, 0.00777245
accuracy: 4/4 100%
loss, E, 0.00106831
accuracy: 4/4 100%
loss, E, 0.000218376
accuracy: 4/4 100%
accuracy: 4/4
loss, E, 0.000218376
clblas teardown
[ OK ] testsimpleconvolvenet.imagesize3_n4_filtersize3_linear (1662 ms)
[ RUN ] testsimpleconvolvenet.imagesize1_n2_2layers_unbiased
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
epoch 0 loss, E, 0.0559531
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
backward try kernel 2
cl/backward_cached.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 2 0ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 12ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
epoch 1 loss, E, 0.0254554
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 0ms
backward try kernel 3
... seems valid
sgemm_Col_NT_B0_1x2x2
makeGemmKernel: "sgemm_Col_NT_B0_ML016_NL016_KX01" now built; returning.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_NT_B0_1x2x2
makeGemmKernel: "sgemm_Col_NT_B0_ML016_NL016_KX01" already built; returning.
BackwardAuto: kernel 3 151ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=2 -D gInputPlanes=2 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
epoch 2 loss, E, 0.0172943
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 2ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 5ms
backward kernel 0: cannot be used
backward kernel 1 time: 0ms
backward kernel 2 time: 0ms
backward kernel 3 time: 151ms
backward layer selected kernel 1
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_TN_B1_2x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" now built; returning.
sgemm_Col_TN_B1_2x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" already built; returning.
BackpropWeightsAuto: kernel 4 309ms
calcGradWeights try kernel 4
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_TN_B1_1x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" already built; returning.
sgemm_Col_TN_B1_1x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" already built; returning.
BackpropWeightsAuto: kernel 4 75ms
epoch 3 loss, E, 0.0138013
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 2ms
forward try kernel 5
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 4ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 0ms
calcGradWeights kernel 2 time: 12ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 309ms
calcGradWeights layer selected kernel 1
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 75ms
calcGradWeights layer selected kernel 1
epoch 4 loss, E, 0.0115848
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 8ms
epoch 5 loss, E, 0.00987036
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_NN_B0_1x2x1
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" now built; returning.
sgemm_Col_NN_B0_1x2x1
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
ForwardAuto: kernel 7 146ms
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_NN_B0_1x2x2
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
sgemm_Col_NN_B0_1x2x2
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
ForwardAuto: kernel 7 63ms
epoch 6 loss, E, 0.00844797
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 2ms
forward kernel 5 time: 2ms
forward kernel 6 time: 1ms
forward kernel 7 time: 146ms
forward layer selected kernel 1
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 0ms
forward kernel 3 time: 0ms
forward kernel 4 time: 5ms
forward kernel 5 time: 4ms
forward kernel 6 time: 8ms
forward kernel 7 time: 63ms
forward layer selected kernel 2
epoch 7 loss, E, 0.00724182
epoch 8 loss, E, 0.00621212
epoch 9 loss, E, 0.00533106
epoch 10 loss, E, 0.00457645
epoch 11 loss, E, 0.00392979
epoch 12 loss, E, 0.00337539
epoch 13 loss, E, 0.00289992
epoch 14 loss, E, 0.002492
epoch 15 loss, E, 0.00214191
epoch 16 loss, E, 0.00184138
epoch 17 loss, E, 0.00158331
epoch 18 loss, E, 0.00136164
epoch 19 loss, E, 0.0011712
epoch 20 loss, E, 0.00100754
epoch 21 loss, E, 0.000866877
epoch 22 loss, E, 0.000745946
epoch 23 loss, E, 0.000641966
epoch 24 loss, E, 0.000552543
epoch 25 loss, E, 0.000475625
epoch 26 loss, E, 0.000409454
epoch 27 loss, E, 0.000352522
epoch 28 loss, E, 0.000303531
epoch 29 loss, E, 0.00026137
epoch 30 loss, E, 0.000225082
epoch 31 loss, E, 0.000193845
epoch 32 loss, E, 0.000166954
epoch 33 loss, E, 0.000143801
epoch 34 loss, E, 0.000123866
epoch 35 loss, E, 0.000106699
epoch 36 loss, E, 9.19176e-05
epoch 37 loss, E, 7.91864e-05
epoch 38 loss, E, 6.82211e-05
epoch 39 loss, E, 5.87767e-05
layer 0:InputLayer{ outputPlanes=1 outputSize=1 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=1 numFilters=2 filterSize=1 outputSize=1 padZeros=0 biased=1 skip=0} }
layer 2:ActivationLayer{ RELU }
layer 3:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=1 numFilters=2 filterSize=1 outputSize=1 padZeros=0 biased=1 skip=0} }
layer 4:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=4 40.0%
layer 3: params=6 60.0%
TOTAL : params=10
loss, E, 5.87767e-05
accuracy: 2/2 100%
accuracy: 2/2
loss, E, 5.87767e-05
loss, E, 5.87767e-05
layer 0:InputLayer{ outputPlanes=1 outputSize=1 }
layer 1:ConvolutionalLayer{ LayerDimensions{ inputPlanes=1 inputSize=1 numFilters=2 filterSize=1 outputSize=1 padZeros=0 biased=1 skip=0} }
layer 2:ActivationLayer{ RELU }
layer 3:ConvolutionalLayer{ LayerDimensions{ inputPlanes=2 inputSize=1 numFilters=2 filterSize=1 outputSize=1 padZeros=0 biased=1 skip=0} }
layer 4:SquareLossLayer{}
Parameters overview: (skipping 3 layers with 0 params)
layer 1: params=4 40.0%
layer 3: params=6 60.0%
TOTAL : params=10
float weights1[] = {-0.303866f, -1.59823f};
float weights3[] = {0.426358f, -0.719592f, -0.420361f, 0.719566f};
float bias1[] = {-0.324465f, 0.60279f};
float bias3[] = {0.506862f, -0.506837f};
clblas teardown
[ OK ] testsimpleconvolvenet.imagesize1_n2_2layers_unbiased (3275 ms)
[ RUN ] testsimpleconvolvenet.imagesize1_n2_2layers_biased
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 3ms
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
loss, E, 1.19067
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
backward try kernel 2
cl/backward_cached.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 2 0ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 7ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 0ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 9ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 2ms
backward try kernel 3
... seems valid
sgemm_Col_NT_B0_1x2x2
makeGemmKernel: "sgemm_Col_NT_B0_ML016_NL016_KX01" now built; returning.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_NT_B0_1x2x2
makeGemmKernel: "sgemm_Col_NT_B0_ML016_NL016_KX01" already built; returning.
BackwardAuto: kernel 3 144ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=2 -D gInputPlanes=2 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=1 -D gInputSizeSquared=1 -D gNumFilters=2 -D gFilterSize=1 -D gHalfFilterSize=0 -D gFilterSizeSquared=1 -D gNumOutputPlanes=2 -D gOutputPlanes=2 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=0 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=0 -DgInputStripeInnerNumRows=1 -DgInputStripeOuterNumRows=1 -DgInputStripeInnerSize=1 -DgInputStripeOuterSize=1 -DgInputStripeMarginSize=0 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 0ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 0ms
backward kernel 0: cannot be used
backward kernel 1 time: 1ms
backward kernel 2 time: 0ms
backward kernel 3 time: 144ms
backward layer selected kernel 2
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_TN_B1_2x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" now built; returning.
sgemm_Col_TN_B1_2x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" already built; returning.
BackpropWeightsAuto: kernel 4 292ms
calcGradWeights try kernel 4
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_TN_B1_1x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" already built; returning.
sgemm_Col_TN_B1_1x2x1
makeGemmKernel: "sgemm_Col_TN_B1_ML016_NL016_KX01" already built; returning.
BackpropWeightsAuto: kernel 4 74ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 3ms
forward try kernel 5
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 0ms
calcGradWeights kernel 2 time: 7ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 292ms
calcGradWeights layer selected kernel 1
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 1ms
calcGradWeights kernel 2 time: 0ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 74ms
calcGradWeights layer selected kernel 2
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 0ms
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
loss, E, 0.0667568
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_NN_B0_1x2x1
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" now built; returning.
sgemm_Col_NN_B0_1x2x1
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
ForwardAuto: kernel 7 145ms
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_NN_B0_1x2x2
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
sgemm_Col_NN_B0_1x2x2
makeGemmKernel: "sgemm_Col_NN_B0_ML016_NL016_KX01" already built; returning.
ForwardAuto: kernel 7 70ms
forward kernel 0: cannot be used
forward kernel 1 time: 1ms
forward kernel 2 time: 0ms
forward kernel 3 time: 9ms
forward kernel 4 time: 0ms
forward kernel 5 time: 3ms
forward kernel 6 time: 0ms
forward kernel 7 time: 145ms
forward layer selected kernel 2
forward kernel 0: cannot be used
forward kernel 1 time: 3ms
forward kernel 2 time: 1ms
forward kernel 3 time: 2ms
forward kernel 4 time: 0ms
forward kernel 5 time: 1ms
forward kernel 6 time: 1ms
forward kernel 7 time: 70ms
forward layer selected kernel 4
loss, E, 0.00923595
loss, E, 0.00112611
loss, E, 0.000117399
loss, E, 1.15641e-05
dump enabled=0
loss, E, 1.78565e-06
accuracy: 2/2 100%
accuracy: 2/2
loss, E, 1.78565e-06
clblas teardown
[ OK ] testsimpleconvolvenet.imagesize1_n2_2layers_biased (2961 ms)
[ RUN ] testsimpleconvolvenet.imagesize_5_4_2layers_filtersize_2_4_biased_n3
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 0ms
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 0ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
loss, E, 1.33412
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 0ms
backward try kernel 2
cl/backward_cached.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 2 0ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 1ms
backward try kernel 3
... seems valid
sgemm_Col_NT_B0_1x48x3
makeGemmKernel: "sgemm_Col_NT_B0_ML016_NX016_KX01" now built; returning.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_NT_B0_1x48x3
makeGemmKernel: "sgemm_Col_NT_B0_ML016_NX016_KX01" already built; returning.
sgemm_Col_NT_B0_1x48x3
makeGemmKernel: "sgemm_Col_NT_B0_ML016_NX016_KX01" already built; returning.
BackwardAuto: kernel 3 105ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=3 -D gInputPlanes=3 -D gInputSize=4 -D gInputSizeSquared=16 -D gNumFilters=3 -D gFilterSize=4 -D gHalfFilterSize=2 -D gFilterSizeSquared=16 -D gNumOutputPlanes=3 -D gOutputPlanes=3 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=1 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=3 -DgInputStripeInnerNumRows=4 -DgInputStripeOuterNumRows=10 -DgInputStripeInnerSize=16 -DgInputStripeOuterSize=40 -DgInputStripeMarginSize=12 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=5 -D gInputSizeSquared=25 -D gNumFilters=3 -D gFilterSize=2 -D gHalfFilterSize=1 -D gFilterSizeSquared=4 -D gNumOutputPlanes=3 -D gOutputPlanes=3 -D gOutputSize=4 -D gOutputSizeSquared=16 -D gPadZeros=0 -D gMargin=0 -D gEven=1 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=1 -DgInputStripeInnerNumRows=5 -DgInputStripeOuterNumRows=7 -DgInputStripeInnerSize=25 -DgInputStripeOuterSize=35 -DgInputStripeMarginSize=5 -DgOutputStripeNumRows=4 -DgOutputStripeSize=16
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
backward kernel 0: cannot be used
backward kernel 1 time: 0ms
backward kernel 2 time: 0ms
backward kernel 3 time: 105ms
backward layer selected kernel 1
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_TN_B1_48x3x1
makeGemmKernel: "sgemm_Col_TN_B1_MX016_NL016_KX01" now built; returning.
sgemm_Col_TN_B1_48x3x1
makeGemmKernel: "sgemm_Col_TN_B1_MX016_NL016_KX01" already built; returning.
sgemm_Col_TN_B1_48x3x1
makeGemmKernel: "sgemm_Col_TN_B1_MX016_NL016_KX01" already built; returning.
BackpropWeightsAuto: kernel 4 231ms
calcGradWeights try kernel 4
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
kernel 140712488066176 binary 0 size 0
makeGemmKernel: "sgemm_Col_TN_B1_MX032_NX032_KX16_BRANCH_src" now built; returning.
kernel 140712488066176 binary 0 size 0
makeGemmKernel: "sgemm_Col_TN_B1_MX032_NX032_KX16_BRANCH_src" already built; returning.
kernel 140712488066176 binary 0 size 0
makeGemmKernel: "sgemm_Col_TN_B1_MX032_NX032_KX16_BRANCH_src" already built; returning.
BackpropWeightsAuto: kernel 4 259ms
forward try kernel 5
ForwardAuto: kernel 5: this instance cant be used: For ForwardFc, filtersize and inputimagesize must be identical
... not valid
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 1ms
forward try kernel 5
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward_fc_wgperrow.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 5 1ms
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 0ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 231ms
calcGradWeights layer selected kernel 1
calcGradWeights kernel 0: cannot be used
calcGradWeights kernel 1 time: 0ms
calcGradWeights kernel 2 time: 1ms
calcGradWeights kernel 3 time: 1ms
calcGradWeights kernel 4 time: 259ms
calcGradWeights layer selected kernel 1
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_NN_B0_16x3x4
makeGemmKernel: "sgemm_Col_NN_B0_MX016_NL016_KX01" now built; returning.
sgemm_Col_NN_B0_16x3x4
makeGemmKernel: "sgemm_Col_NN_B0_MX016_NL016_KX01" already built; returning.
sgemm_Col_NN_B0_16x3x4
makeGemmKernel: "sgemm_Col_NN_B0_MX016_NL016_KX01" already built; returning.
ForwardAuto: kernel 7 104ms
forward try kernel 6
cl/forward_byinputplane.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/reduce_segments.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 6 2ms
forward kernel 0: cannot be used
forward kernel 1 time: 0ms
forward kernel 2 time: 1ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5: cannot be used
forward kernel 6 time: 1ms
forward kernel 7 time: 104ms
forward layer selected kernel 1
forward try kernel 7
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
makeGemmKernel: "sgemm_Col_NN_B1_MX032_NX032_KX16_BRANCH" now built; returning.
makeGemmKernel: "sgemm_Col_NN_B1_MX032_NX032_KX16_BRANCH" already built; returning.
makeGemmKernel: "sgemm_Col_NN_B1_MX032_NX032_KX16_BRANCH" already built; returning.
ForwardAuto: kernel 7 155ms
forward kernel 0: cannot be used
forward kernel 1 time: 0ms
forward kernel 2 time: 0ms
forward kernel 3 time: 1ms
forward kernel 4 time: 1ms
forward kernel 5 time: 1ms
forward kernel 6 time: 2ms
forward kernel 7 time: 155ms
forward layer selected kernel 1
loss, E, 0.0064664
loss, E, 1.10358e-05
loss, E, 3.56415e-08
loss, E, 1.29538e-10
loss, E, 9.03277e-13
loss, E, 2.17604e-13
loss, E, 1.48326e-13
loss, E, 1.16351e-13
loss, E, 1.09246e-13
loss, E, 1.09246e-13
accuracy: 3/3 100%
accuracy: 3/3
loss, E, 1.09246e-13
clblas teardown
[ OK ] testsimpleconvolvenet.imagesize_5_4_2layers_filtersize_2_4_biased_n3 (7463 ms)
[ RUN ] testsimpleconvolvenet.imagesize_5_4_2layers_filtersize_2_4_biased_n6
Using Intel(R) Corporation , OpenCL platform: Intel(R) OpenCL
Using OpenCL device: Intel(R) HD Graphics 4000
initializing clblas
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/copy.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/activate.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/applyActivationDeriv.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/per_element_add.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
forward try kernel 0
... not plausibly optimal, skipping
forward try kernel 1
cl/forward1.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 1 1ms
backward try kernel 0
... not plausibly optimal, skipping
backward try kernel 1
cl/backward.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
calcGradWeights try kernel 0
... not plausibly optimal, skipping
calcGradWeights try kernel 1
cl/backpropweights.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 1 1ms
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
cl/per_element_op2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
loss, E, 3.64011
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
forward try kernel 2
cl/forward2.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 2 1ms
backward try kernel 2
cl/backward_cached.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackwardAuto: kernel 2 1ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 2ms
calcGradWeights try kernel 2
cl/BackpropWeightsScratch.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 2 1ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 0ms
forward try kernel 3
cl/forward3.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 3 0ms
backward try kernel 3
... seems valid
sgemm_Col_NT_B0_1x48x3
makeGemmKernel: "sgemm_Col_NT_B0_ML016_NX016_KX01" now built; returning.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_NT_B0_1x48x3
makeGemmKernel: "sgemm_Col_NT_B0_ML016_NX016_KX01" already built; returning.
sgemm_Col_NT_B0_1x48x3
makeGemmKernel: "sgemm_Col_NT_B0_ML016_NX016_KX01" already built; returning.
sgemm_Col_NT_B0_1x48x3
makeGemmKernel: "sgemm_Col_NT_B0_ML016_NX016_KX01" already built; returning.
sgemm_Col_NT_B0_1x48x3
makeGemmKernel: "sgemm_Col_NT_B0_ML016_NX016_KX01" already built; returning.
sgemm_Col_NT_B0_1x48x3
makeGemmKernel: "sgemm_Col_NT_B0_ML016_NX016_KX01" already built; returning.
BackwardAuto: kernel 3 100ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=3 -D gInputPlanes=3 -D gInputSize=4 -D gInputSizeSquared=16 -D gNumFilters=3 -D gFilterSize=4 -D gHalfFilterSize=2 -D gFilterSizeSquared=16 -D gNumOutputPlanes=3 -D gOutputPlanes=3 -D gOutputSize=1 -D gOutputSizeSquared=1 -D gPadZeros=0 -D gMargin=0 -D gEven=1 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=3 -DgInputStripeInnerNumRows=4 -DgInputStripeOuterNumRows=10 -DgInputStripeInnerSize=16 -DgInputStripeOuterSize=40 -DgInputStripeMarginSize=12 -DgOutputStripeNumRows=1 -DgOutputStripeSize=1
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 2ms
calcGradWeights try kernel 3
options: -D BIASED -D gNumInputPlanes=1 -D gInputPlanes=1 -D gInputSize=5 -D gInputSizeSquared=25 -D gNumFilters=3 -D gFilterSize=2 -D gHalfFilterSize=1 -D gFilterSizeSquared=4 -D gNumOutputPlanes=3 -D gOutputPlanes=3 -D gOutputSize=4 -D gOutputSizeSquared=16 -D gPadZeros=0 -D gMargin=0 -D gEven=1 -D gSkip=0 -DgNumStripes=1 -DgInputStripeMarginRows=1 -DgInputStripeInnerNumRows=5 -DgInputStripeOuterNumRows=7 -DgInputStripeInnerSize=25 -DgInputStripeOuterSize=35 -DgInputStripeMarginSize=5 -DgOutputStripeNumRows=4 -DgOutputStripeSize=16
cl/BackpropWeightsScratchLarge.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
BackpropWeightsAuto: kernel 3 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 1ms
forward try kernel 4
cl/forward4.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
... seems valid
ForwardAuto: kernel 4 2ms
backward kernel 0: cannot be used
backward kernel 1 time: 1ms
backward kernel 2 time: 1ms
backward kernel 3 time: 100ms
backward layer selected kernel 1
calcGradWeights try kernel 4
... seems valid
cl/per_element_op2_scalar.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
sgemm_Col_TN_B1_48x3x1
makeGemmKernel: "sgemm_Col_TN_B1_MX016_NL016_KX01" now built; returning.
sgemm_Col_TN_B1_48x3x1
makeGemmKernel: "sgemm_Col_TN_B1_MX016_NL016_KX01" already built; returning.
sgemm_Col_TN_B1_48x3x1
makeGemmKernel: "sgemm_Col_TN_B1_MX016_NL016_KX01" already built; returning.
sgemm_Col_TN_B1_48x3x1
makeGemmKernel: "sgemm_Col_TN_B1_MX016_NL016_KX01" already built; returning.
sgemm_Col_TN_B1_48x3x1
makeGemmKernel: "sgemm_Col_TN_B1_MX016_NL016_KX01" already built; returning.
sgemm_Col_TN_B1_48x3x1
makeGemmKernel: "sgemm_Col_TN_B1_MX016_NL016_KX01" already built; returning.
BackpropWeightsAuto: kernel 4 239ms
calcGradWeights try kernel 4
... seems valid
ForwardIm2Col.cl build log:
fcl build 1 succeeded.
fcl build 2 succeeded.
bcl build succeeded.
kernel 140712488066176 binary 0 size 0
makeGemmKernel: "sgemm_Col_TN_B1_MX032_NX032_KX16_BRANCH_src" already built; returning.
OpenCL error -38 on line 674
Assertion failed: false, file D:\DeepCL\clMathLibraries\clBLAS\src\library\blas\specialCases\GemmSpecialCases.cpp, line 674
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment