This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sum of 10 trials used to generate results | |
---------------------------------------------------------------- | |
CatArray for 2 size 1 1D Tensors took 0.00030994415283203 seconds. | |
CatArray for 4 size 1 1D Tensors took 0.00022506713867188 seconds. | |
CatArray for 8 size 1 1D Tensors took 0.00023889541625977 seconds. | |
CatArray for 32 size 1 1D Tensors took 0.00027704238891602 seconds. | |
CatArray for 128 size 1 1D Tensors took 0.00044894218444824 seconds. | |
CatArray for 512 size 1 1D Tensors took 0.0012199878692627 seconds. | |
CatArray for 1024 size 1 1D Tensors took 0.0020511150360107 seconds. | |
CatArray for 2048 size 1 1D Tensors took 0.0042030811309814 seconds. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
---------------------------------------------------------------- | |
Soumith's Benchmarks (The times are the sum of 5 trials) | |
---------------------------------------------------------------- | |
ms: 0.280992 | |
ms: 0.218592 | |
ms: 0.218208 | |
ms: 0.253888 | |
ms: 0.243072 | |
CatArray for 32 1x600 Tensors along dim=1 took 0.00028080940246582 seconds | |
ms: 0.898176 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sum of 10 trials used to generate results | |
---------------------------------------------------------------- | |
CatArray for 2 size 1 1D Tensors took 0.0005650520324707 seconds. | |
CatArray for 4 size 1 1D Tensors took 0.00078177452087402 seconds. | |
CatArray for 8 size 1 1D Tensors took 0.0015158653259277 seconds. | |
CatArray for 32 size 1 1D Tensors took 0.0047669410705566 seconds. | |
CatArray for 128 size 1 1D Tensors took 0.021991968154907 seconds. | |
CatArray for 512 size 1 1D Tensors took 0.079319953918457 seconds. | |
CatArray for 1024 size 1 1D Tensors took 0.15872097015381 seconds. | |
CatArray for 2048 size 1 1D Tensors took 0.30642604827881 seconds. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
---------------------------------------------------------------- | |
Soumith's Benchmarks (The times are the sum of 5 trials) | |
---------------------------------------------------------------- | |
ms: 0.066048 | |
ms: 0.028448 | |
ms: 0.027008 | |
ms: 0.026304 | |
ms: 0.026624 | |
CatArray for 32 1x600 Tensors along dim=1 took 7.6389312744141e-05 seconds | |
ms: 0.041856 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sum of 10 trials used to generate results | |
---------------------------------------------------------------- | |
CatArray for 2 size 1 1D Tensors took 0.00029397010803223 seconds. | |
CatArray for 4 size 1 1D Tensors took 0.00024986267089844 seconds. | |
CatArray for 8 size 1 1D Tensors took 0.00025296211242676 seconds. | |
CatArray for 32 size 1 1D Tensors took 0.00030803680419922 seconds. | |
CatArray for 128 size 1 1D Tensors took 0.00056195259094238 seconds. | |
CatArray for 512 size 1 1D Tensors took 0.0013229846954346 seconds. | |
CatArray for 1024 size 1 1D Tensors took 0.0022039413452148 seconds. | |
CatArray for 2048 size 1 1D Tensors took 0.004547119140625 seconds. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
#ifndef THC_TENSOR_MODE_CUH | |
#define THC_TENSOR_MODE_CUH | |
#include "THCNumerics.cuh" | |
#include "THCSortUtils.cuh" | |
struct ThrustHalfLess | |
{ | |
__host__ __device__ inline bool operator()(const half& lhs, const half& rhs) { | |
return THCNumerics<half>::lt(lhs, rhs); |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
#ifndef THC_GENERIC_FILE | |
#define THC_GENERIC_FILE "generic/THCTensorMode.cu" | |
#else | |
THC_API void THCTensor_(calculateMode)(THCState *state, | |
THCTensor *values, | |
THCudaLongTensor *indices, | |
THCTensor *input, | |
THCudaLongStorage *sortBuffer, | |
int dimension, |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
local runtests = false | |
if not cutorch then | |
require 'cutorch' | |
runtests = true | |
end | |
local test = {} | |
local minsize = 5 | |
local maxsize = 10 | |
local minvalue = 2 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
#include "TH.h" | |
#include "THC.h" | |
#include <cstdio> | |
#include "sys/time.h" | |
#include <vector> | |
#include "cuda_profiler_api.h" | |
const long loops = 10; | |
void compare(THIntTensor *input, THIntTensor *mode, THLongTensor *indices, |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Testing average duration for 10 loops | |
Testing 1D Tensor of size 8: 6 usec (TH), 49 usec (THC) | |
Testing 1D Tensor of size 16: 1 usec (TH), 39 usec (THC) | |
Testing 1D Tensor of size 32: 1 usec (TH), 41 usec (THC) | |
Testing 1D Tensor of size 64: 1 usec (TH), 51 usec (THC) | |
Testing 1D Tensor of size 128: 2 usec (TH), 53 usec (THC) | |
Testing 1D Tensor of size 256: 6 usec (TH), 83 usec (THC) | |
Testing 1D Tensor of size 512: 10 usec (TH), 108 usec (THC) | |
Testing 1D Tensor of size 1024: 23 usec (TH), 109 usec (THC) | |
Testing 1D Tensor of size 2048: 48 usec (TH), 1370 usec (THC) |