Created
July 25, 2023 18:25
-
-
Save erikr/a807dfff6ae410d1974e8407615ddce1 to your computer and use it in GitHub Desktop.
error
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
(base) root@93318d67c350:~/capsule# python src/example.py | |
2023-07-25 18:24:43.540497: I tensorflow/core/platform/cpu_feature_guard.cc:193] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations: AVX2 AVX512F AVX512_VNNI FMA | |
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags. | |
2023-07-25 18:24:43.652319: I tensorflow/core/util/port.cc:104] oneDNN custom operations are on. You may see slightly different numerical results due to floating-point round-off errors from different computation orders. To turn them off, set the environment variable `TF_ENABLE_ONEDNN_OPTS=0`. | |
2023-07-25 18:24:44.328614: W tensorflow/compiler/xla/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libnvinfer.so.7'; dlerror: libnvinfer.so.7: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/nvidia/lib:/usr/local/nvidia/lib64 | |
2023-07-25 18:24:44.328693: W tensorflow/compiler/xla/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libnvinfer_plugin.so.7'; dlerror: libnvinfer_plugin.so.7: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/nvidia/lib:/usr/local/nvidia/lib64 | |
2023-07-25 18:24:44.328704: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:38] TF-TRT Warning: Cannot dlopen some TensorRT libraries. If you would like to use Nvidia GPU with TensorRT, please make sure the missing libraries mentioned above are installed properly. | |
/opt/conda/lib/python3.10/site-packages/tensorflow_io/python/ops/__init__.py:98: UserWarning: unable to load libtensorflow_io_plugins.so: unable to open file: libtensorflow_io_plugins.so, from paths: ['/opt/conda/lib/python3.10/site-packages/tensorflow_io/python/ops/libtensorflow_io_plugins.so'] | |
caused by: ['/opt/conda/lib/python3.10/site-packages/tensorflow_io/python/ops/libtensorflow_io_plugins.so: undefined symbol: _ZN3tsl6StatusC1EN10tensorflow5error4CodeESt17basic_string_viewIcSt11char_traitsIcEENS_18SourceLocationImplE'] | |
warnings.warn(f"unable to load libtensorflow_io_plugins.so: {e}") | |
/opt/conda/lib/python3.10/site-packages/tensorflow_io/python/ops/__init__.py:104: UserWarning: file system plugins are not loaded: unable to open file: libtensorflow_io.so, from paths: ['/opt/conda/lib/python3.10/site-packages/tensorflow_io/python/ops/libtensorflow_io.so'] | |
caused by: ['/opt/conda/lib/python3.10/site-packages/tensorflow_io/python/ops/libtensorflow_io.so: undefined symbol: _ZTVN3tsl13GcsFileSystemE'] | |
warnings.warn(f"file system plugins are not loaded: {e}") | |
Downloading data from https://www.cs.toronto.edu/~kriz/cifar-10-python.tar.gz | |
170498071/170498071 [==============================] - 2s 0us/step | |
2023-07-25 18:24:51.238617: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:981] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero | |
2023-07-25 18:24:51.276782: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:981] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero | |
2023-07-25 18:24:51.279462: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:981] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero | |
2023-07-25 18:24:51.282344: I tensorflow/core/platform/cpu_feature_guard.cc:193] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations: AVX2 AVX512F AVX512_VNNI FMA | |
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags. | |
2023-07-25 18:24:51.284029: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:981] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero | |
2023-07-25 18:24:51.286678: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:981] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero | |
2023-07-25 18:24:51.289318: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:981] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero | |
2023-07-25 18:24:52.048505: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:981] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero | |
2023-07-25 18:24:52.050209: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:981] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero | |
2023-07-25 18:24:52.051706: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:981] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero | |
2023-07-25 18:24:52.053160: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1613] Created device /job:localhost/replica:0/task:0/device:GPU:0 with 13582 MB memory: -> device: 0, name: Tesla T4, pci bus id: 0000:00:1e.0, compute capability: 7.5 | |
Epoch 1/10 | |
2023-07-25 18:24:54.275782: E tensorflow/compiler/xla/stream_executor/cuda/cuda_dnn.cc:433] Could not create cudnn handle: CUDNN_STATUS_NOT_INITIALIZED | |
2023-07-25 18:24:54.275833: E tensorflow/compiler/xla/stream_executor/cuda/cuda_dnn.cc:442] Possibly insufficient driver version: 510.47.3 | |
2023-07-25 18:24:54.275861: W tensorflow/core/framework/op_kernel.cc:1830] OP_REQUIRES failed at conv_ops_fused_impl.h:621 : UNIMPLEMENTED: DNN library is not found. | |
Traceback (most recent call last): | |
File "/root/capsule/src/example.py", line 26, in <module> | |
history = model.fit(train_images, train_labels, epochs=10, | |
File "/opt/conda/lib/python3.10/site-packages/keras/utils/traceback_utils.py", line 70, in error_handler | |
raise e.with_traceback(filtered_tb) from None | |
File "/opt/conda/lib/python3.10/site-packages/tensorflow/python/eager/execute.py", line 52, in quick_execute | |
tensors = pywrap_tfe.TFE_Py_Execute(ctx._handle, device_name, op_name, | |
tensorflow.python.framework.errors_impl.UnimplementedError: Graph execution error: | |
Detected at node 'sequential/conv2d/Relu' defined at (most recent call last): | |
File "/root/capsule/src/example.py", line 26, in <module> | |
history = model.fit(train_images, train_labels, epochs=10, | |
File "/opt/conda/lib/python3.10/site-packages/keras/utils/traceback_utils.py", line 65, in error_handler | |
return fn(*args, **kwargs) | |
File "/opt/conda/lib/python3.10/site-packages/keras/engine/training.py", line 1650, in fit | |
tmp_logs = self.train_function(iterator) | |
File "/opt/conda/lib/python3.10/site-packages/keras/engine/training.py", line 1249, in train_function | |
return step_function(self, iterator) | |
File "/opt/conda/lib/python3.10/site-packages/keras/engine/training.py", line 1233, in step_function | |
outputs = model.distribute_strategy.run(run_step, args=(data,)) | |
File "/opt/conda/lib/python3.10/site-packages/keras/engine/training.py", line 1222, in run_step | |
outputs = model.train_step(data) | |
File "/opt/conda/lib/python3.10/site-packages/keras/engine/training.py", line 1023, in train_step | |
y_pred = self(x, training=True) | |
File "/opt/conda/lib/python3.10/site-packages/keras/utils/traceback_utils.py", line 65, in error_handler | |
return fn(*args, **kwargs) | |
File "/opt/conda/lib/python3.10/site-packages/keras/engine/training.py", line 561, in __call__ | |
return super().__call__(*args, **kwargs) | |
File "/opt/conda/lib/python3.10/site-packages/keras/utils/traceback_utils.py", line 65, in error_handler | |
return fn(*args, **kwargs) | |
File "/opt/conda/lib/python3.10/site-packages/keras/engine/base_layer.py", line 1132, in __call__ | |
outputs = call_fn(inputs, *args, **kwargs) | |
File "/opt/conda/lib/python3.10/site-packages/keras/utils/traceback_utils.py", line 96, in error_handler | |
return fn(*args, **kwargs) | |
File "/opt/conda/lib/python3.10/site-packages/keras/engine/sequential.py", line 413, in call | |
return super().call(inputs, training=training, mask=mask) | |
File "/opt/conda/lib/python3.10/site-packages/keras/engine/functional.py", line 511, in call | |
return self._run_internal_graph(inputs, training=training, mask=mask) | |
File "/opt/conda/lib/python3.10/site-packages/keras/engine/functional.py", line 668, in _run_internal_graph | |
outputs = node.layer(*args, **kwargs) | |
File "/opt/conda/lib/python3.10/site-packages/keras/utils/traceback_utils.py", line 65, in error_handler | |
return fn(*args, **kwargs) | |
File "/opt/conda/lib/python3.10/site-packages/keras/engine/base_layer.py", line 1132, in __call__ | |
outputs = call_fn(inputs, *args, **kwargs) | |
File "/opt/conda/lib/python3.10/site-packages/keras/utils/traceback_utils.py", line 96, in error_handler | |
return fn(*args, **kwargs) | |
File "/opt/conda/lib/python3.10/site-packages/keras/layers/convolutional/base_conv.py", line 314, in call | |
return self.activation(outputs) | |
File "/opt/conda/lib/python3.10/site-packages/keras/activations.py", line 317, in relu | |
return backend.relu( | |
File "/opt/conda/lib/python3.10/site-packages/keras/backend.py", line 5369, in relu | |
x = tf.nn.relu(x) | |
Node: 'sequential/conv2d/Relu' | |
DNN library is not found. | |
[[{{node sequential/conv2d/Relu}}]] [Op:__inference_train_function_1310] |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment