fenar@macpro71 acm-observability % oc get pods -n nvidia-gpu-operator
NAME READY STATUS RESTARTS AGE
gpu-feature-discovery-s9jsp 1/1 Running 0 42m
gpu-operator-9f47fbdc-d2g9k 1/1 Running 0 44m
nvidia-container-toolkit-daemonset-7k46j 1/1 Running 0 42m
nvidia-cuda-validator-4br5g 0/1 Completed 0 40m
nvidia-dcgm-2z4px 1/1 Running 0 42m
nvidia-dcgm-exporter-7wgq2 1/1 Running 0 42m
nvidia-device-plugin-daemonset-mlvcv 1/1 Running 0 42m
nvidia-driver-daemonset-415.92.202405281402-0-ww2nb 2/2 Running 0 43m
(1) Download the latest NVIDIA DCGM Exporter Dashboard from the DCGM Exporter repository on GitHub:
curl -LfO https://github.com/NVIDIA/dcgm-exporter/raw/main/grafana/dcgm-exporter-dashboard.json
(2) Create a config map from the downloaded file in the openshift-config-managed namespace:
oc create configmap nvidia-dcgm-exporter-dashboard -n openshift-config-managed --from-file=dcgm-exporter-dashboard.json
(3) Label the config map to expose the dashboard in the Administrator perspective of the web console:
#!/bin/bash | |
# Ensure the script is run as root | |
if [ "$EUID" -ne 0 ]; then | |
echo "Please run as root" | |
exit | |
fi | |
# List all the hard drives | |
drives=$(lsblk -dpno NAME,TYPE | grep 'disk' | awk '{print $1}') |
#!/bin/bash | |
# Sample Script for VM migration betweek Openstack Deployments | |
# This is for inspiration purposes, use it wisely. | |
# Variables | |
SOURCE_OS_AUTH_URL="http://source-openstack:5000/v3" | |
SOURCE_OS_PROJECT_NAME="source_project" | |
SOURCE_OS_USERNAME="source_user" | |
SOURCE_OS_PASSWORD="source_password" | |
SOURCE_IMAGE_ID="source_image_id" |
--- | |
apiVersion: tuned.openshift.io/v1 | |
kind: Tuned | |
metadata: | |
name: sno-hugepages-patch | |
namespace: openshift-cluster-node-tuning-operator | |
spec: | |
profile: | |
- data: | | |
[main] |
### Main Guideline | |
https://docs.openshift.com/container-platform/4.14/scalability_and_performance/ztp_far_edge/ztp-reference-cluster-configuration-for-vdu.html#sno-configure-for-vdu | |
### Enable SCTP | |
https://docs.openshift.com/container-platform/4.14/networking/using-sctp.html | |
For SNO make sure -> machineconfiguration.openshift.io/role: master | |
### Install Service Mesh | |
https://docs.openshift.com/container-platform/4.14/service_mesh/v2x/installing-ossm.html |
--- | |
apiVersion: tuned.openshift.io/v1 | |
kind: Tuned | |
metadata: | |
name: sno-hugepages-patch | |
namespace: openshift-cluster-node-tuning-operator | |
spec: | |
profile: | |
- data: | | |
[main] |
{ | |
"annotations": { | |
"list": [ | |
{ | |
"builtIn": 1, | |
"datasource": { | |
"type": "grafana", | |
"uid": "-- Grafana --" | |
}, | |
"enable": true, |
{ | |
"annotations": { | |
"list": [ | |
{ | |
"builtIn": 1, | |
"datasource": { | |
"type": "grafana", | |
"uid": "-- Grafana --" | |
}, | |
"enable": true, |
---Incase You Have a Messed-Up Bios---- Start
(A) Clear all pending jobs via iDRAC WebUI: Maintenance -> Job Queue
(B) Revert back to initial BIOS
ssh to iDRAC terminal
% ssh root@
racadm>>racadm recover BIOS.Setup.1-1
---Incase You Have a Messed-Up Bios---- End
Enabling Dell-Intel-E810 on Server Side