You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@singa.apache.org by GitBox <gi...@apache.org> on 2019/11/06 13:02:45 UTC

[GitHub] [singa] chrishkchris commented on issue #555: SINGA-490 Optimization on GPU Malloc and Cudastream

chrishkchris commented on issue #555: SINGA-490 Optimization on GPU Malloc and Cudastream
URL: https://github.com/apache/singa/pull/555#issuecomment-550299437
 
 
   Before I do further checking of the code, I submit the passed unit test results first: (1) CPP level operation unit test, (2) Python level operation unit test:
   
   1. CPP unit test
   ```
   ubuntu@ip-172-31-34-232:~/singa/build/bin$ ./test_singa
   Running main() from gtest_main.cc
   [==========] Running 238 tests from 48 test cases.
   [----------] Global test environment set-up.
   [----------] 1 test from Accuracy
   [ RUN      ] Accuracy.Compute
   [       OK ] Accuracy.Compute (0 ms)
   [----------] 1 test from Accuracy (0 ms total)
   
   [----------] 3 tests from Activation
   [ RUN      ] Activation.Setup
   [       OK ] Activation.Setup (0 ms)
   [ RUN      ] Activation.Forward
   [       OK ] Activation.Forward (0 ms)
   [ RUN      ] Activation.Backward
   [       OK ] Activation.Backward (0 ms)
   [----------] 3 tests from Activation (0 ms total)
   
   [----------] 2 tests from AdaGrad
   [ RUN      ] AdaGrad.ApplyCPU
   [       OK ] AdaGrad.ApplyCPU (0 ms)
   [ RUN      ] AdaGrad.ApplyCUDA
   [       OK ] AdaGrad.ApplyCUDA (869 ms)
   [----------] 2 tests from AdaGrad (869 ms total)
   
   [----------] 3 tests from BatchNorm
   [ RUN      ] BatchNorm.Setup
   [       OK ] BatchNorm.Setup (0 ms)
   [ RUN      ] BatchNorm.Forward
   [       OK ] BatchNorm.Forward (0 ms)
   [ RUN      ] BatchNorm.Backward
   [       OK ] BatchNorm.Backward (0 ms)
   [----------] 3 tests from BatchNorm (0 ms total)
   
   [----------] 2 tests from BinFileWriter
   [ RUN      ] BinFileWriter.Create
   [       OK ] BinFileWriter.Create (0 ms)
   [ RUN      ] BinFileWriter.Append
   [       OK ] BinFileWriter.Append (0 ms)
   [----------] 2 tests from BinFileWriter (0 ms total)
   
   [----------] 2 tests from BinFileReader
   [ RUN      ] BinFileReader.Read
   [       OK ] BinFileReader.Read (1 ms)
   [ RUN      ] BinFileReader.SeekToFirst
   [       OK ] BinFileReader.SeekToFirst (0 ms)
   [----------] 2 tests from BinFileReader (1 ms total)
   
   [----------] 3 tests from Channel
   [ RUN      ] Channel.InitChannel
   [       OK ] Channel.InitChannel (0 ms)
   [ RUN      ] Channel.SendStringToFile
   [W d1106 t12:49:58 p04664:-240 /home/ubuntu/singa/src/utils/channel.cc:70] Messages will be appended to an existed file: /tmp/test_channel
   [       OK ] Channel.SendStringToFile (0 ms)
   [ RUN      ] Channel.SendStringToFileAndStderr
   test to both file and stderr
   [       OK ] Channel.SendStringToFileAndStderr (0 ms)
   [----------] 3 tests from Channel (0 ms total)
   
   [----------] 9 tests from Concat
   [ RUN      ] Concat.Setup
   [       OK ] Concat.Setup (0 ms)
   [ RUN      ] Concat.ForwardConcatRowCpp
   [       OK ] Concat.ForwardConcatRowCpp (0 ms)
   [ RUN      ] Concat.ForwardConcatColumnCpp
   [       OK ] Concat.ForwardConcatColumnCpp (0 ms)
   [ RUN      ] Concat.ForwardConcatRowCuda
   [       OK ] Concat.ForwardConcatRowCuda (35 ms)
   [ RUN      ] Concat.ForwardConcatColumnCuda
   [       OK ] Concat.ForwardConcatColumnCuda (37 ms)
   [ RUN      ] Concat.BackwardConcatRowCpp
   [       OK ] Concat.BackwardConcatRowCpp (0 ms)
   [ RUN      ] Concat.BackwardConcatColumn
   [       OK ] Concat.BackwardConcatColumn (0 ms)
   [ RUN      ] Concat.BackwardConcatRowCuda
   [       OK ] Concat.BackwardConcatRowCuda (52 ms)
   [ RUN      ] Concat.BackwardConcatColumnCuda
   [       OK ] Concat.BackwardConcatColumnCuda (51 ms)
   [----------] 9 tests from Concat (175 ms total)
   
   [----------] 3 tests from Convolution
   [ RUN      ] Convolution.Setup
   [       OK ] Convolution.Setup (0 ms)
   [ RUN      ] Convolution.Forward
   [       OK ] Convolution.Forward (0 ms)
   [ RUN      ] Convolution.Backward
   [       OK ] Convolution.Backward (0 ms)
   [----------] 3 tests from Convolution (0 ms total)
   
   [----------] 4 tests from CppCPU
   [ RUN      ] CppCPU.Constructor
   [       OK ] CppCPU.Constructor (0 ms)
   [ RUN      ] CppCPU.MemoryMallocFree
   [       OK ] CppCPU.MemoryMallocFree (0 ms)
   [ RUN      ] CppCPU.Exec
   [       OK ] CppCPU.Exec (0 ms)
   [ RUN      ] CppCPU.CopyData
   [       OK ] CppCPU.CopyData (0 ms)
   [----------] 4 tests from CppCPU (0 ms total)
   
   [----------] 8 tests from TestSoftmaxCrossEntropy
   [ RUN      ] TestSoftmaxCrossEntropy.CppForward
   [       OK ] TestSoftmaxCrossEntropy.CppForward (0 ms)
   [ RUN      ] TestSoftmaxCrossEntropy.CppForwardAryTarget
   [       OK ] TestSoftmaxCrossEntropy.CppForwardAryTarget (0 ms)
   [ RUN      ] TestSoftmaxCrossEntropy.CppBackward
   [       OK ] TestSoftmaxCrossEntropy.CppBackward (0 ms)
   [ RUN      ] TestSoftmaxCrossEntropy.CppBackwardAryTarget
   [       OK ] TestSoftmaxCrossEntropy.CppBackwardAryTarget (0 ms)
   [ RUN      ] TestSoftmaxCrossEntropy.CudaForward
   [       OK ] TestSoftmaxCrossEntropy.CudaForward (32 ms)
   [ RUN      ] TestSoftmaxCrossEntropy.CudaForwardAryTarget
   [       OK ] TestSoftmaxCrossEntropy.CudaForwardAryTarget (32 ms)
   [ RUN      ] TestSoftmaxCrossEntropy.CudaBackward
   [       OK ] TestSoftmaxCrossEntropy.CudaBackward (33 ms)
   [ RUN      ] TestSoftmaxCrossEntropy.CudaBackwardAryTarget
   [       OK ] TestSoftmaxCrossEntropy.CudaBackwardAryTarget (33 ms)
   [----------] 8 tests from TestSoftmaxCrossEntropy (131 ms total)
   
   [----------] 1 test from CSV
   [ RUN      ] CSV.EncoderDecode
   [       OK ] CSV.EncoderDecode (0 ms)
   [----------] 1 test from CSV (0 ms total)
   
   [----------] 3 tests from CudnnActivation
   [ RUN      ] CudnnActivation.Setup
   [       OK ] CudnnActivation.Setup (0 ms)
   [ RUN      ] CudnnActivation.Forward
   [       OK ] CudnnActivation.Forward (31 ms)
   [ RUN      ] CudnnActivation.Backward
   [       OK ] CudnnActivation.Backward (31 ms)
   [----------] 3 tests from CudnnActivation (62 ms total)
   
   [----------] 3 tests from CudnnBatchNorm
   [ RUN      ] CudnnBatchNorm.Setup
   [       OK ] CudnnBatchNorm.Setup (0 ms)
   [ RUN      ] CudnnBatchNorm.Forward
   [       OK ] CudnnBatchNorm.Forward (30 ms)
   [ RUN      ] CudnnBatchNorm.Backward
   [       OK ] CudnnBatchNorm.Backward (30 ms)
   [----------] 3 tests from CudnnBatchNorm (60 ms total)
   
   [----------] 3 tests from CudnnConvolution
   [ RUN      ] CudnnConvolution.Setup
   [       OK ] CudnnConvolution.Setup (0 ms)
   [ RUN      ] CudnnConvolution.Forward
   [       OK ] CudnnConvolution.Forward (31 ms)
   [ RUN      ] CudnnConvolution.Backward
   [       OK ] CudnnConvolution.Backward (32 ms)
   [----------] 3 tests from CudnnConvolution (63 ms total)
   
   [----------] 3 tests from CudnnConvolution_AT
   [ RUN      ] CudnnConvolution_AT.Setup
   [       OK ] CudnnConvolution_AT.Setup (0 ms)
   [ RUN      ] CudnnConvolution_AT.Forward
   [       OK ] CudnnConvolution_AT.Forward (31 ms)
   [ RUN      ] CudnnConvolution_AT.Backward
   [       OK ] CudnnConvolution_AT.Backward (30 ms)
   [----------] 3 tests from CudnnConvolution_AT (61 ms total)
   
   [----------] 3 tests from CudnnDropout
   [ RUN      ] CudnnDropout.Setup
   [       OK ] CudnnDropout.Setup (0 ms)
   [ RUN      ] CudnnDropout.Forward
   [       OK ] CudnnDropout.Forward (52 ms)
   [ RUN      ] CudnnDropout.Backward
   [       OK ] CudnnDropout.Backward (53 ms)
   [----------] 3 tests from CudnnDropout (106 ms total)
   
   [----------] 3 tests from CudnnLRN
   [ RUN      ] CudnnLRN.Setup
   [       OK ] CudnnLRN.Setup (0 ms)
   [ RUN      ] CudnnLRN.Forward
   [       OK ] CudnnLRN.Forward (25 ms)
   [ RUN      ] CudnnLRN.Backward
   [       OK ] CudnnLRN.Backward (25 ms)
   [----------] 3 tests from CudnnLRN (50 ms total)
   
   [----------] 3 tests from CudnnPooling
   [ RUN      ] CudnnPooling.Setup
   [       OK ] CudnnPooling.Setup (0 ms)
   [ RUN      ] CudnnPooling.Forward
   [       OK ] CudnnPooling.Forward (25 ms)
   [ RUN      ] CudnnPooling.Backward
   [       OK ] CudnnPooling.Backward (24 ms)
   [----------] 3 tests from CudnnPooling (49 ms total)
   
   [----------] 3 tests from TestCudnnRNN
   [ RUN      ] TestCudnnRNN.Setup
   [       OK ] TestCudnnRNN.Setup (0 ms)
   [ RUN      ] TestCudnnRNN.Forward
   [       OK ] TestCudnnRNN.Forward (48 ms)
   [ RUN      ] TestCudnnRNN.Backward
   [       OK ] TestCudnnRNN.Backward (48 ms)
   [----------] 3 tests from TestCudnnRNN (96 ms total)
   
   [----------] 5 tests from CudnnSoftmax
   [ RUN      ] CudnnSoftmax.Setup
   [       OK ] CudnnSoftmax.Setup (0 ms)
   [ RUN      ] CudnnSoftmax.Forward1D
   [       OK ] CudnnSoftmax.Forward1D (24 ms)
   [ RUN      ] CudnnSoftmax.Backward1D
   [       OK ] CudnnSoftmax.Backward1D (24 ms)
   [ RUN      ] CudnnSoftmax.Forward2D
   [       OK ] CudnnSoftmax.Forward2D (25 ms)
   [ RUN      ] CudnnSoftmax.Backward2D
   [       OK ] CudnnSoftmax.Backward2D (24 ms)
   [----------] 5 tests from CudnnSoftmax (97 ms total)
   
   [----------] 5 tests from Dense
   [ RUN      ] Dense.Setup
   [       OK ] Dense.Setup (0 ms)
   [ RUN      ] Dense.ForwardCpp
   [       OK ] Dense.ForwardCpp (0 ms)
   [ RUN      ] Dense.BackwardCpp
   [       OK ] Dense.BackwardCpp (0 ms)
   [ RUN      ] Dense.ForwardCuda
   [       OK ] Dense.ForwardCuda (25 ms)
   [ RUN      ] Dense.BackwardCuda
   [       OK ] Dense.BackwardCuda (24 ms)
   [----------] 5 tests from Dense (49 ms total)
   
   [----------] 3 tests from Dropout
   [ RUN      ] Dropout.Setup
   [       OK ] Dropout.Setup (0 ms)
   [ RUN      ] Dropout.Forward
   [       OK ] Dropout.Forward (0 ms)
   [ RUN      ] Dropout.Backward
   [       OK ] Dropout.Backward (0 ms)
   [----------] 3 tests from Dropout (0 ms total)
   
   [----------] 5 tests from Flatten
   [ RUN      ] Flatten.Setup
   [       OK ] Flatten.Setup (0 ms)
   [ RUN      ] Flatten.ForwardCPU
   [       OK ] Flatten.ForwardCPU (0 ms)
   [ RUN      ] Flatten.BackwardCPU
   [       OK ] Flatten.BackwardCPU (0 ms)
   [ RUN      ] Flatten.ForwardGPU
   [       OK ] Flatten.ForwardGPU (25 ms)
   [ RUN      ] Flatten.BackwardGPU
   [       OK ] Flatten.BackwardGPU (24 ms)
   [----------] 5 tests from Flatten (49 ms total)
   
   [----------] 5 tests from ImageTransformer
   [ RUN      ] ImageTransformer.Setup
   [       OK ] ImageTransformer.Setup (0 ms)
   [ RUN      ] ImageTransformer.Apply3D
   [       OK ] ImageTransformer.Apply3D (0 ms)
   [ RUN      ] ImageTransformer.Apply2D
   [       OK ] ImageTransformer.Apply2D (0 ms)
   [ RUN      ] ImageTransformer.Crop
   [       OK ] ImageTransformer.Crop (0 ms)
   [ RUN      ] ImageTransformer.Mirror
   [       OK ] ImageTransformer.Mirror (0 ms)
   [----------] 5 tests from ImageTransformer (0 ms total)
   
   [----------] 5 tests from Initializer
   [ RUN      ] Initializer.Constant
   [       OK ] Initializer.Constant (0 ms)
   [ RUN      ] Initializer.Gaussian
   [       OK ] Initializer.Gaussian (0 ms)
   [ RUN      ] Initializer.ConstantCUDA
   [       OK ] Initializer.ConstantCUDA (24 ms)
   [ RUN      ] Initializer.GaussianCUDA
   [       OK ] Initializer.GaussianCUDA (35 ms)
   [ RUN      ] Initializer.XavierCUDA
   [       OK ] Initializer.XavierCUDA (24 ms)
   [----------] 5 tests from Initializer (83 ms total)
   
   [----------] 2 tests from Layer
   [ RUN      ] Layer.CreateLayer
   [       OK ] Layer.CreateLayer (0 ms)
   [ RUN      ] Layer.CreateCudnnLayer
   [       OK ] Layer.CreateCudnnLayer (0 ms)
   [----------] 2 tests from Layer (0 ms total)
   
   [----------] 6 tests from Logging
   [ RUN      ] Logging.InfoLogging
   [I d1106 t12:49:59 p04664:-240 /home/ubuntu/singa/test/singa/test_logging.cc:29] test info logging
   [       OK ] Logging.InfoLogging (0 ms)
   [ RUN      ] Logging.WarningLogging
   [W d1106 t12:49:59 p04664:-240 /home/ubuntu/singa/test/singa/test_logging.cc:35] test warning logging
   [       OK ] Logging.WarningLogging (0 ms)
   [ RUN      ] Logging.ErrorLogging
   [E d1106 t12:49:59 p04664:-240 /home/ubuntu/singa/test/singa/test_logging.cc:41] test error logging
   [       OK ] Logging.ErrorLogging (0 ms)
   [ RUN      ] Logging.FatalLogging
   [       OK ] Logging.FatalLogging (0 ms)
   [ RUN      ] Logging.SetLogDestination
   [       OK ] Logging.SetLogDestination (0 ms)
   [ RUN      ] Logging.StderrLoggingLevel
   [W d1106 t12:49:59 p04664:-240 /home/ubuntu/singa/test/singa/test_logging.cc:62] test warning logging to stderr and file
   [E d1106 t12:49:59 p04664:-240 /home/ubuntu/singa/test/singa/test_logging.cc:63] test error logging to stderr and file
   [       OK ] Logging.StderrLoggingLevel (0 ms)
   [----------] 6 tests from Logging (1 ms total)
   
   [----------] 3 tests from LRN
   [ RUN      ] LRN.Setup
   [       OK ] LRN.Setup (0 ms)
   [ RUN      ] LRN.Forward
   [       OK ] LRN.Forward (0 ms)
   [ RUN      ] LRN.Backward
   [       OK ] LRN.Backward (0 ms)
   [----------] 3 tests from LRN (0 ms total)
   
   [----------] 1 test from MemPool
   [ RUN      ] MemPool.CompareCudaCnmem
   [       OK ] MemPool.CompareCudaCnmem (947 ms)
   [----------] 1 test from MemPool (947 ms total)
   
   [----------] 4 tests from TestMSE
   [ RUN      ] TestMSE.CppForward
   [       OK ] TestMSE.CppForward (0 ms)
   [ RUN      ] TestMSE.CppBackward
   [       OK ] TestMSE.CppBackward (0 ms)
   [ RUN      ] TestMSE.CudaForward
   [       OK ] TestMSE.CudaForward (24 ms)
   [ RUN      ] TestMSE.CudaBackward
   [       OK ] TestMSE.CudaBackward (24 ms)
   [----------] 4 tests from TestMSE (48 ms total)
   
   [----------] 2 tests from Nesterov
   [ RUN      ] Nesterov.ApplyCPU
   [       OK ] Nesterov.ApplyCPU (0 ms)
   [ RUN      ] Nesterov.ApplyCUDA
   [       OK ] Nesterov.ApplyCUDA (25 ms)
   [----------] 2 tests from Nesterov (25 ms total)
   
   [----------] 4 tests from OperationBatchNorm
   [ RUN      ] OperationBatchNorm.ForwardInference
   [       OK ] OperationBatchNorm.ForwardInference (2 ms)
   [ RUN      ] OperationBatchNorm.ForwardInference4D
   [       OK ] OperationBatchNorm.ForwardInference4D (1 ms)
   [ RUN      ] OperationBatchNorm.ForwardTraining
   [       OK ] OperationBatchNorm.ForwardTraining (0 ms)
   [ RUN      ] OperationBatchNorm.Backward
   [       OK ] OperationBatchNorm.Backward (0 ms)
   [----------] 4 tests from OperationBatchNorm (3 ms total)
   
   [----------] 2 tests from Operation_Convolution
   [ RUN      ] Operation_Convolution.Forward
   [       OK ] Operation_Convolution.Forward (24 ms)
   [ RUN      ] Operation_Convolution.Backward
   [       OK ] Operation_Convolution.Backward (1 ms)
   [----------] 2 tests from Operation_Convolution (25 ms total)
   
   [----------] 4 tests from OperationPooling
   [ RUN      ] OperationPooling.Forward
   [       OK ] OperationPooling.Forward (0 ms)
   [ RUN      ] OperationPooling.ForwardAverage
   [       OK ] OperationPooling.ForwardAverage (0 ms)
   [ RUN      ] OperationPooling.Backward
   [       OK ] OperationPooling.Backward (0 ms)
   [ RUN      ] OperationPooling.BackwardAvg
   [       OK ] OperationPooling.BackwardAvg (0 ms)
   [----------] 4 tests from OperationPooling (0 ms total)
   
   [----------] 5 tests from Platform
   [ RUN      ] Platform.CreateMultDevice
   [       OK ] Platform.CreateMultDevice (4 ms)
   [ RUN      ] Platform.NumGPUs
   [       OK ] Platform.NumGPUs (0 ms)
   [ RUN      ] Platform.QueryMem
   [       OK ] Platform.QueryMem (0 ms)
   [ RUN      ] Platform.CreateDevice
   [       OK ] Platform.CreateDevice (1 ms)
   [ RUN      ] Platform.CreatTensor
   [       OK ] Platform.CreatTensor (0 ms)
   [----------] 5 tests from Platform (5 ms total)
   
   [----------] 3 tests from Pooling
   [ RUN      ] Pooling.Setup
   [       OK ] Pooling.Setup (0 ms)
   [ RUN      ] Pooling.Forward
   [       OK ] Pooling.Forward (0 ms)
   [ RUN      ] Pooling.Backward
   [       OK ] Pooling.Backward (0 ms)
   [----------] 3 tests from Pooling (0 ms total)
   
   [----------] 5 tests from PReLU
   [ RUN      ] PReLU.Setup
   [       OK ] PReLU.Setup (0 ms)
   [ RUN      ] PReLU.ForwardCPU
   [       OK ] PReLU.ForwardCPU (0 ms)
   [ RUN      ] PReLU.BackwardCPU
   [       OK ] PReLU.BackwardCPU (0 ms)
   [ RUN      ] PReLU.ForwardGPU
   [       OK ] PReLU.ForwardGPU (2 ms)
   [ RUN      ] PReLU.BackwardGPU
   [       OK ] PReLU.BackwardGPU (33 ms)
   [----------] 5 tests from PReLU (35 ms total)
   
   [----------] 2 tests from RMSProp
   [ RUN      ] RMSProp.ApplyCPU
   [       OK ] RMSProp.ApplyCPU (0 ms)
   [ RUN      ] RMSProp.ApplyCUDA
   [       OK ] RMSProp.ApplyCUDA (23 ms)
   [----------] 2 tests from RMSProp (23 ms total)
   
   [----------] 4 tests from SGD
   [ RUN      ] SGD.ApplyWithoutMomentum
   [       OK ] SGD.ApplyWithoutMomentum (0 ms)
   [ RUN      ] SGD.ApplyWithMomentum
   [       OK ] SGD.ApplyWithMomentum (0 ms)
   [ RUN      ] SGD.ApplyWithoutMomentumCuda
   [       OK ] SGD.ApplyWithoutMomentumCuda (22 ms)
   [ RUN      ] SGD.ApplyWithMomentumCuda
   [       OK ] SGD.ApplyWithMomentumCuda (23 ms)
   [----------] 4 tests from SGD (45 ms total)
   
   [----------] 9 tests from Slice
   [ RUN      ] Slice.Setup
   [       OK ] Slice.Setup (0 ms)
   [ RUN      ] Slice.ForwardSliceRowCpp
   [       OK ] Slice.ForwardSliceRowCpp (0 ms)
   [ RUN      ] Slice.ForwardSliceColumn
   [       OK ] Slice.ForwardSliceColumn (0 ms)
   [ RUN      ] Slice.ForwardSliceRowCuda
   [       OK ] Slice.ForwardSliceRowCuda (33 ms)
   [ RUN      ] Slice.ForwardSliceColumnCuda
   [       OK ] Slice.ForwardSliceColumnCuda (33 ms)
   [ RUN      ] Slice.BackwardSliceRowCpp
   [       OK ] Slice.BackwardSliceRowCpp (0 ms)
   [ RUN      ] Slice.BackwardSliceColumn
   [       OK ] Slice.BackwardSliceColumn (0 ms)
   [ RUN      ] Slice.BackwardSliceRowCuda
   [       OK ] Slice.BackwardSliceRowCuda (22 ms)
   [ RUN      ] Slice.BackwardSliceColumnCuda
   [       OK ] Slice.BackwardSliceColumnCuda (23 ms)
   [----------] 9 tests from Slice (111 ms total)
   
   [----------] 3 tests from Snapshot
   [ RUN      ] Snapshot.WriteTest
   [       OK ] Snapshot.WriteTest (0 ms)
   [ RUN      ] Snapshot.ReadTest
   [       OK ] Snapshot.ReadTest (0 ms)
   [ RUN      ] Snapshot.ReadIntTest
   [       OK ] Snapshot.ReadIntTest (0 ms)
   [----------] 3 tests from Snapshot (0 ms total)
   
   [----------] 3 tests from Softmax
   [ RUN      ] Softmax.Setup
   [       OK ] Softmax.Setup (0 ms)
   [ RUN      ] Softmax.Forward
   [       OK ] Softmax.Forward (0 ms)
   [ RUN      ] Softmax.Backward
   [       OK ] Softmax.Backward (0 ms)
   [----------] 3 tests from Softmax (0 ms total)
   
   [----------] 11 tests from TensorClass
   [ RUN      ] TensorClass.Constructor
   [       OK ] TensorClass.Constructor (0 ms)
   [ RUN      ] TensorClass.Reshape
   [       OK ] TensorClass.Reshape (0 ms)
   [ RUN      ] TensorClass.AsType
   [       OK ] TensorClass.AsType (0 ms)
   [ RUN      ] TensorClass.ToDevice
   [       OK ] TensorClass.ToDevice (0 ms)
   [ RUN      ] TensorClass.CopyDataFromHostPtr
   [       OK ] TensorClass.CopyDataFromHostPtr (0 ms)
   [ RUN      ] TensorClass.CopyData
   [       OK ] TensorClass.CopyData (0 ms)
   [ RUN      ] TensorClass.Clone
   [       OK ] TensorClass.Clone (0 ms)
   [ RUN      ] TensorClass.T
   [       OK ] TensorClass.T (1 ms)
   [ RUN      ] TensorClass.Repeat
   [       OK ] TensorClass.Repeat (0 ms)
   [ RUN      ] TensorClass.RepeatData
   [       OK ] TensorClass.RepeatData (0 ms)
   [ RUN      ] TensorClass.Broadcast
   [       OK ] TensorClass.Broadcast (0 ms)
   [----------] 11 tests from TensorClass (1 ms total)
   
   [----------] 67 tests from TensorMath
   [ RUN      ] TensorMath.AbsCpp
   [       OK ] TensorMath.AbsCpp (0 ms)
   [ RUN      ] TensorMath.ExpCpp
   [       OK ] TensorMath.ExpCpp (0 ms)
   [ RUN      ] TensorMath.ExpStrideCpp
   WARNING: Logging before InitGoogleLogging() is written to STDERR
   I1106 12:50:01.186179  4664 tensor_math_cpp.h:146] not equal stride
   [       OK ] TensorMath.ExpStrideCpp (0 ms)
   [ RUN      ] TensorMath.LogCpp
   [       OK ] TensorMath.LogCpp (0 ms)
   [ RUN      ] TensorMath.ReLUCpp
   [       OK ] TensorMath.ReLUCpp (0 ms)
   [ RUN      ] TensorMath.SigmoidCpp
   [       OK ] TensorMath.SigmoidCpp (0 ms)
   [ RUN      ] TensorMath.SignCpp
   [       OK ] TensorMath.SignCpp (0 ms)
   [ RUN      ] TensorMath.SqrtCpp
   [       OK ] TensorMath.SqrtCpp (0 ms)
   [ RUN      ] TensorMath.SquareCpp
   [       OK ] TensorMath.SquareCpp (0 ms)
   [ RUN      ] TensorMath.TanhCpp
   [       OK ] TensorMath.TanhCpp (0 ms)
   [ RUN      ] TensorMath.SumCpp
   [       OK ] TensorMath.SumCpp (0 ms)
   [ RUN      ] TensorMath.SoftMaxCpp
   [       OK ] TensorMath.SoftMaxCpp (0 ms)
   [ RUN      ] TensorMath.SoftMaxOnAxis
   [       OK ] TensorMath.SoftMaxOnAxis (32 ms)
   [ RUN      ] TensorMath.LTCpp
   [       OK ] TensorMath.LTCpp (0 ms)
   [ RUN      ] TensorMath.LECpp
   [       OK ] TensorMath.LECpp (0 ms)
   [ RUN      ] TensorMath.GTCpp
   [       OK ] TensorMath.GTCpp (0 ms)
   [ RUN      ] TensorMath.GECpp
   [       OK ] TensorMath.GECpp (0 ms)
   [ RUN      ] TensorMath.PowCpp
   [       OK ] TensorMath.PowCpp (0 ms)
   [ RUN      ] TensorMath.SubCpp
   [       OK ] TensorMath.SubCpp (0 ms)
   [ RUN      ] TensorMath.EltwiseMultCpp
   [       OK ] TensorMath.EltwiseMultCpp (0 ms)
   [ RUN      ] TensorMath.DivCpp
   [       OK ] TensorMath.DivCpp (0 ms)
   [ RUN      ] TensorMath.BernoulliCpp
   [       OK ] TensorMath.BernoulliCpp (0 ms)
   [ RUN      ] TensorMath.UniformCpp
   [       OK ] TensorMath.UniformCpp (1 ms)
   [ RUN      ] TensorMath.GaussianCpp
   [       OK ] TensorMath.GaussianCpp (2 ms)
   [ RUN      ] TensorMath.AddTensorCpp
   [       OK ] TensorMath.AddTensorCpp (0 ms)
   [ RUN      ] TensorMath.AddTensorsCpp
   [       OK ] TensorMath.AddTensorsCpp (0 ms)
   [ RUN      ] TensorMath.SetValueCpp
   [       OK ] TensorMath.SetValueCpp (0 ms)
   [ RUN      ] TensorMath.ReshapeCpp
   [       OK ] TensorMath.ReshapeCpp (0 ms)
   [ RUN      ] TensorMath.TransposeReshapeCpp
   I1106 12:50:01.221532  4664 tensor_math_cpp.h:146] not equal stride
   I1106 12:50:01.221565  4664 tensor_math_cpp.h:146] not equal stride
   [       OK ] TensorMath.TransposeReshapeCpp (0 ms)
   [ RUN      ] TensorMath.TransposeFloatCpp
   I1106 12:50:01.221587  4664 tensor_math_cpp.h:146] not equal stride
   [       OK ] TensorMath.TransposeFloatCpp (0 ms)
   [ RUN      ] TensorMath.TransposeIntCpp
   I1106 12:50:01.221616  4664 tensor_math_cpp.h:146] not equal stride
   [       OK ] TensorMath.TransposeIntCpp (0 ms)
   [ RUN      ] TensorMath.BroadcastCpp
   [       OK ] TensorMath.BroadcastCpp (0 ms)
   [ RUN      ] TensorMath.L2Cpp
   [       OK ] TensorMath.L2Cpp (0 ms)
   [ RUN      ] TensorMath.MultCpp
   [       OK ] TensorMath.MultCpp (0 ms)
   [ RUN      ] TensorMath.AddColumnCpp
   [       OK ] TensorMath.AddColumnCpp (0 ms)
   [ RUN      ] TensorMath.SubColumnCpp
   [       OK ] TensorMath.SubColumnCpp (0 ms)
   [ RUN      ] TensorMath.DivColumnCpp
   [       OK ] TensorMath.DivColumnCpp (0 ms)
   [ RUN      ] TensorMath.AddRowCpp
   [       OK ] TensorMath.AddRowCpp (0 ms)
   [ RUN      ] TensorMath.SubRowCpp
   [       OK ] TensorMath.SubRowCpp (0 ms)
   [ RUN      ] TensorMath.MultRowCpp
   [       OK ] TensorMath.MultRowCpp (0 ms)
   [ RUN      ] TensorMath.MultColumnCpp
   [       OK ] TensorMath.MultColumnCpp (0 ms)
   [ RUN      ] TensorMath.DivRowCpp
   [       OK ] TensorMath.DivRowCpp (0 ms)
   [ RUN      ] TensorMath.SumRowsCpp
   [       OK ] TensorMath.SumRowsCpp (0 ms)
   [ RUN      ] TensorMath.SumColumnsCpp
   [       OK ] TensorMath.SumColumnsCpp (0 ms)
   [ RUN      ] TensorMath.ConcatenateRowsCpp
   [       OK ] TensorMath.ConcatenateRowsCpp (0 ms)
   [ RUN      ] TensorMath.ConcatenateColumnsCpp
   [       OK ] TensorMath.ConcatenateColumnsCpp (0 ms)
   [ RUN      ] TensorMath.CopyRowsCpp
   [       OK ] TensorMath.CopyRowsCpp (0 ms)
   [ RUN      ] TensorMath.CopyColumnsCpp
   [       OK ] TensorMath.CopyColumnsCpp (0 ms)
   [ RUN      ] TensorMath.L2Cuda
   [       OK ] TensorMath.L2Cuda (18 ms)
   [ RUN      ] TensorMath.MultCuda
   [       OK ] TensorMath.MultCuda (23 ms)
   [ RUN      ] TensorMath.AddColumnCuda
   [       OK ] TensorMath.AddColumnCuda (22 ms)
   [ RUN      ] TensorMath.SubColumnCuda
   [       OK ] TensorMath.SubColumnCuda (23 ms)
   [ RUN      ] TensorMath.MultColumnCuda
   [       OK ] TensorMath.MultColumnCuda (22 ms)
   [ RUN      ] TensorMath.DivColumnCuda
   [       OK ] TensorMath.DivColumnCuda (23 ms)
   [ RUN      ] TensorMath.AddRowCuda
   [       OK ] TensorMath.AddRowCuda (22 ms)
   [ RUN      ] TensorMath.SubRowCuda
   [       OK ] TensorMath.SubRowCuda (23 ms)
   [ RUN      ] TensorMath.MultRowCuda
   [       OK ] TensorMath.MultRowCuda (22 ms)
   [ RUN      ] TensorMath.DivRowCuda
   [       OK ] TensorMath.DivRowCuda (23 ms)
   [ RUN      ] TensorMath.SumRowsCuda
   [       OK ] TensorMath.SumRowsCuda (22 ms)
   [ RUN      ] TensorMath.SumColumnCuda
   [       OK ] TensorMath.SumColumnCuda (22 ms)
   [ RUN      ] TensorMath.ExpStrideCuda
   [       OK ] TensorMath.ExpStrideCuda (23 ms)
   [ RUN      ] TensorMath.ConcatenateRowsCuda
   [       OK ] TensorMath.ConcatenateRowsCuda (22 ms)
   [ RUN      ] TensorMath.ConcatenateColumnsCuda
   [       OK ] TensorMath.ConcatenateColumnsCuda (23 ms)
   [ RUN      ] TensorMath.CopyRowsCuda
   [       OK ] TensorMath.CopyRowsCuda (22 ms)
   [ RUN      ] TensorMath.CopyColumnsCuda
   [       OK ] TensorMath.CopyColumnsCuda (23 ms)
   [ RUN      ] TensorMath.RowMaxCuda
   [       OK ] TensorMath.RowMaxCuda (22 ms)
   [ RUN      ] TensorMath.BroadcastCuda
   [       OK ] TensorMath.BroadcastCuda (23 ms)
   [----------] 67 tests from TensorMath (459 ms total)
   
   [----------] 2 tests from TextFileWriter
   [ RUN      ] TextFileWriter.Create
   [       OK ] TextFileWriter.Create (0 ms)
   [ RUN      ] TextFileWriter.Append
   [       OK ] TextFileWriter.Append (0 ms)
   [----------] 2 tests from TextFileWriter (0 ms total)
   
   [----------] 2 tests from TextFileReader
   [ RUN      ] TextFileReader.Read
   [       OK ] TextFileReader.Read (0 ms)
   [ RUN      ] TextFileReader.SeekToFirst
   [       OK ] TextFileReader.SeekToFirst (1 ms)
   [----------] 2 tests from TextFileReader (1 ms total)
   
   [----------] 1 test from TimerTest
   [ RUN      ] TimerTest.TestTick
   [       OK ] TimerTest.TestTick (10 ms)
   [----------] 1 test from TimerTest (10 ms total)
   
   [----------] Global test environment tear-down
   [==========] 238 tests from 48 test cases ran. (3741 ms total)
   [  PASSED  ] 238 tests.
   ```
   
   2. Python Unit test:
   ```
   ubuntu@ip-172-31-34-232:~/singa/test/python$ python3 test_operation.py
   ......................................................................................WARNING: Logging before InitGoogleLogging() is written to STDERR
   I1106 12:53:42.393013  4684 tensor_math_cpp.h:146] not equal stride
   I1106 12:53:42.393343  4684 tensor_math_cpp.h:146] not equal stride
   .I1106 12:53:42.393951  4684 tensor_math_cpp.h:146] not equal stride
   I1106 12:53:42.394306  4684 tensor_math_cpp.h:146] not equal stride
   ......
   ----------------------------------------------------------------------
   Ran 93 tests in 0.805s
   
   OK
   ubuntu@ip-172-31-34-232:~/singa/test/python$ python3 test_layer.py
   ...................
   ----------------------------------------------------------------------
   Ran 19 tests in 0.008s
   
   OK
   ubuntu@ip-172-31-34-232:~/singa/test/python$ python3 test_mkldnn.py
   .TEST CONV2D FORWARD
   TEST CONV2D DATA BACKWARD
   TEST CONV2D WEIGHT BACKWARD
   TEST CONV2D DATA BACKWARD
   [2.]
   ..
   ----------------------------------------------------------------------
   Ran 3 tests in 0.070s
   
   OK
   ubuntu@ip-172-31-34-232:~/singa/test/python$ python3 test_optimizer.py
   ..............
   ----------------------------------------------------------------------
   Ran 14 tests in 0.021s
   
   OK
   ubuntu@ip-172-31-34-232:~/singa/test/python$ python3 test_net.py
   ('relu1', (2,))
   ('relu2', (2,))
   ('merge', [(2,), (2,)])
   .('relu1', (2,))
   ('relu2', (2,))
   ('merge', [(2,), (2,)])
   ('split', [(2,), (2,)])
   ('split1', (2,))
   ('split2', (2,))
   .('conv', (4, 12, 12))
   ('flat', (576,))
   ('dense', (4,))
   NOTE: If your model was saved using pickle, then set use_pickle=True for loading it
   NOTE: If your model was saved using Snapshot, then set use_pickle=False for loading it
   .('relu1', (2,))
   ('relu2', (2,))
   .('conv', (4, 12, 12))
   ('flat', (576,))
   ('dense', (4,))
   .
   ----------------------------------------------------------------------
   Ran 5 tests in 0.010s
   
   OK
   ubuntu@ip-172-31-34-232:~/singa/test/python$ python3 test_loss.py
   ...
   ----------------------------------------------------------------------
   Ran 3 tests in 0.005s
   
   OK
   ubuntu@ip-172-31-34-232:~/singa/test/python$ python3 test_api.py
   .
   ----------------------------------------------------------------------
   Ran 1 test in 0.013s
   
   OK
   ```
   
   
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services