run ONNX model inference using onnxruntime
import onnxruntime as nxrun
import numpy as np
import time
def mills():
    """function to get current time in milliseconds"""
    return int(round(time.time() * 1000))
## start inference session
sess = nxrun.InferenceSession("path/to/model_file.onnx")
## input, output shape
input_name = sess.get_inputs()[0].name
output_name = sess.get_outputs()[0].name
input_shape = sess.get_inputs()[0].shape
output_shape = sess.get_outputs()[0].shape
# dummy_input shape should be equal to input_shape
dummy_input = np.ones([1, 416, 416, 3], dtype=np.float32)
# start the timer after the input is built so only inference is measured
start_time = mills()
## run onnx model with onnx runtime python
result = sess.run([output_name], {input_name: dummy_input})
print("model single inference in milliseconds on onnxruntime: ", mills() - start_time)
print("Output", result)
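
The script above hard-codes the dummy input shape and times a single call, which usually includes one-time start-up cost. A minimal sketch of two helpers that may make the measurement more robust is given below. Both `concrete_shape` and `time_ms` are hypothetical names that are not part of onnxruntime, and the sketch assumes that dynamic dimensions in `input_shape` show up as `None` or as a string name:

```python
import time


def concrete_shape(shape, default=1):
    """Replace unknown dimensions (None or a symbolic name) with `default`.

    Hypothetical helper; assumes fixed dimensions are plain ints.
    """
    return [dim if isinstance(dim, int) else default for dim in shape]


def time_ms(fn, warmup=1, runs=10):
    """Return the mean wall-clock time of fn() in milliseconds over `runs` calls.

    Hypothetical helper; the `warmup` calls are executed but not timed.
    """
    for _ in range(warmup):
        fn()
    start = time.perf_counter()
    for _ in range(runs):
        fn()
    return (time.perf_counter() - start) * 1000.0 / runs
```

With the names from the script above, the dummy input could then be built as `np.ones(concrete_shape(input_shape), dtype=np.float32)` and the inference timed as `time_ms(lambda: sess.run([output_name], {input_name: dummy_input}))`. Averaging over several runs after a warm-up call tends to give a steadier number than one cold call.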