Shicong dcslin

## d9.py
input="""R 1
D 1
L 1
D 1
L 2
U 2
D 2
R 1
D 1
L 2

## gist:aa90e5258724cb9add9e9d67ae43e5fd
pass

## train.py
'''pass'''

## train.py
'''
pass
'''

# nvprof python3 examples/cnn/train_cnn.py cnn mnist -m1 -pfloat16

## train.py
'''
Diff https://github.com/kuangliu/pytorch-cifar/blob/master/main.py
'''

from apex import amp

net, optimizer = amp.initialize(net, optimizer, opt_level=args.opt_level)

#if device == 'cuda':
#    net = torch.nn.DataParallel(net)

## train.py
"""
This script is modified from https://github.com/pytorch/examples.git
"""
from __future__ import print_function
import argparse
import torch
import torch.nn as nn
import torch.nn.functional as F
import torch.optim as optim
from torchvision import datasets, transforms

## 15jul.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                dcslin
                / 15jul.md
            
            
              Last active
              July 15, 2020 01:47
            
              
                15jul.md
              
          
    qabot:


fix cos sim bug, value same as pytorch, ok now
lstm+cos sim+margin loss, loss ok, top1 accuracy 4%~10%, http://ncrs/:8888/notebooks/singa-etc/notebook/singa-qabot-train.ipynb
lstm+cos sim+pooling+margin, loss not ok, top1 accuracy not ok
pytorch: lstm+cos sim+pooling+margin, loss ok

kint:


follow numpy convension, if int tensor + int tensor, return int tensor, else return float
check input types and cast to float in the GenTensorScalarFn when necessary
pr merged


## 15jul.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                dcslin
                / 15jul.md
            
            
              Created
              July 15, 2020 01:29
            
              
                15jul.md
              
          
    qabot:

fix cos sim bug, value ok
lstm+cos sim+margin loss, loss ok, top1 accuracy 4%~10%
lstm+cos sim+pooling+margin, loss not ok, top1 accuracy not ok
pytorch: lstm+cos sim+pooling+margin, loss ok

kint:

follow numpy convension, if int tensor + int tensor, return int tensor, else return float
check input types and cast to float in the GenTensorScalarFn when necessary
pr merged


## singa-print-trace-stack
root@39516c62233d:~/singa-hf2# cd build/
root@39516c62233d:~/singa-hf2/build# ./bin/test_singa --gtest_filter=*RNN*
Running main() from gtest_main.cc
Note: Google Test filter = *RNN*
[==========] Running 5 tests from 3 test cases.
[----------] Global test environment set-up.
[----------] 3 tests from TestCudnnRNN
[ RUN      ] TestCudnnRNN.Setup
[       OK ] TestCudnnRNN.Setup (0 ms)
[ RUN      ] TestCudnnRNN.Forward

## 8jul.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                dcslin
                / 8jul.md
            
            
              Last active
              July 8, 2020 02:12
            
              
                8jul.md
              
          
    kint:


removed static_assert(std::is_same<SType, DType>::value,"The Scalar type must match the Tensor data type");, compilation is ok
static_assert SType == DType fails at compile time, now after remove, fails at runtime (will see more XX op Not implemented)
to expect some hidden bug when SType != DType
ok:

int tensor and int tensor operation (test_onnx_backend.py)
int tensor and int scalar
nrm2() on int tensor (used tmp tensor with type cast)
float tensor and int scalar ops
cuda + int (after added cuda+int in TYPE_LANG_SWITCH
	'''
	pass
	'''

	# nvprof python3 examples/cnn/train_cnn.py cnn mnist -m1 -pfloat16
	'''
	Diff https://github.com/kuangliu/pytorch-cifar/blob/master/main.py
	'''

	from apex import amp

	net, optimizer = amp.initialize(net, optimizer, opt_level=args.opt_level)

	#if device == 'cuda':
	# net = torch.nn.DataParallel(net)
	"""
	This script is modified from https://github.com/pytorch/examples.git
	"""
	from __future__ import print_function
	import argparse
	import torch
	import torch.nn as nn
	import torch.nn.functional as F
	import torch.optim as optim
	from torchvision import datasets, transforms
	root@39516c62233d:~/singa-hf2# cd build/
	root@39516c62233d:~/singa-hf2/build# ./bin/test_singa --gtest_filter=RNN
	Running main() from gtest_main.cc
	Note: Google Test filter = RNN
	[==========] Running 5 tests from 3 test cases.
	[----------] Global test environment set-up.
	[----------] 3 tests from TestCudnnRNN
	[ RUN ] TestCudnnRNN.Setup
	[ OK ] TestCudnnRNN.Setup (0 ms)
	[ RUN ] TestCudnnRNN.Forward