Script:
from mxnet import nd
from benchmark.opperf.utils.benchmark_utils import run_performance_test
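# Forward-only Embedding benchmark; an input_dim of 2**32 exceeds the 32-bit index range
inputs = [{'data': (2**32, 1), 'weight': (2**32, 1), 'input_dim': 2**32, 'output_dim': 1}]
print(run_performance_test(nd.Embedding, run_backward=False, inputs=inputs, warmup=5, runs=5))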
Output:
INFO:root:Begin Benchmark - Embedding
INFO:root:Complete Benchmark - Embedding
[{'Embedding': [{'inputs': {'data': (4294967296, 1), 'weight': (4294967296, 1), 'input_dim': 4294967296, 'output_dim': 1}, 'max_storage_mem_alloc_cpu/0': 24696062.0, 'avg_time_forward_Embedding': 2051.3845}]}]
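
For reference, the printed result is a list holding a dictionary keyed by operator name, whose value is a list of per-input result dictionaries. A minimal sketch of pulling the forward time out of that structure follows; the `result` literal is copied from the output above, and `forward_time` is a hypothetical helper, not part of opperf:

```python
# Result copied verbatim from the benchmark output above.
result = [{'Embedding': [{
    'inputs': {'data': (4294967296, 1), 'weight': (4294967296, 1),
               'input_dim': 4294967296, 'output_dim': 1},
    'max_storage_mem_alloc_cpu/0': 24696062.0,
    'avg_time_forward_Embedding': 2051.3845,
}]}]


def forward_time(result, op_name):
    """Hypothetical helper: forward time of the first input set for op_name."""
    for entry in result:
        if op_name in entry:
            # The key name follows the pattern seen in the output above.
            return entry[op_name][0]['avg_time_forward_' + op_name]
    raise KeyError(op_name)


print(forward_time(result, 'Embedding'))
```

The lookup assumes the key layout shown in this one run; other operators or opperf versions may report different keys.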