Skip to content

Instantly share code, notes, and snippets.

View stgraber's full-sized avatar

Stéphane Graber stgraber

View GitHub Profile
@stgraber
stgraber / commands
Created March 22, 2022 22:23
LXD cluster with CEPH, OVN and Grafana on Google Compute
# Deploy the LXD cluster
juju bootstrap google/northamerica-northeast1 gce
juju deploy ./lxd-gce.yaml
juju add-storage ceph-osd/0 osd-devices=gce,100G,1
juju add-storage ceph-osd/1 osd-devices=gce,100G,1
juju add-storage ceph-osd/2 osd-devices=gce,100G,1
juju add-storage ceph-osd/3 osd-devices=gce,100G,1
juju add-storage ceph-osd/4 osd-devices=gce,100G,1
# Expose the services
# Show system
lscpu
lspci -nnn
nvidia-smi
# Enable MIG
nvidia-smi -mig 1
# List GPU instance profiles
nvidia-smi mig -lgip
recirc_id(0x3e76e),in_port(4),ct_state(+new-est-rel-rpl-inv+trk),ct_label(0/0x1),eth(src=00:16:3e:ad:66:b1,dst=00:16:3e:ad:66:b1),eth_type(0x0800),ipv4(src=64.0.0.0/224.0.0.0,dst=45.45.148.136,proto=6,ttl=66,frag=no), packets:0, bytes:0, used:never, actions:set(ipv4(ttl=65)),check_pkt_len(size=1518,gt(ct_clear,check_pkt_len(size=1518,gt(ct(zone=54,nat),recirc(0x3e76d)),le(ct(zone=54,nat),recirc(0x3e76e)))),le(ct_clear,check_pkt_len(size=1518,gt(ct(zone=54,nat),recirc(0x3e76d)),le(ct(zone=54,nat),recirc(0x3e76e)))))
recirc_id(0x3e76e),in_port(4),ct_state(+new-est-rel-rpl-inv+trk),ct_label(0/0x1),eth(src=00:16:3e:ad:66:b1,dst=00:16:3e:ad:66:b1),eth_type(0x0800),ipv4(src=2.0.0.0/254.0.0.0,dst=45.45.148.136,proto=6,ttl=31,frag=no), packets:0, bytes:0, used:never, actions:set(ipv4(ttl=30)),check_pkt_len(size=1518,gt(ct_clear,check_pkt_len(size=1518,gt(ct(zone=54,nat),recirc(0x3e76d)),le(ct(zone=54,nat),recirc(0x3e76e)))),le(ct_clear,check_pkt_len(size=1518,gt(ct(zone=54,nat),recirc(0x3e76d)),le(ct(zone=54,nat),rec
@stgraber
stgraber / metrics
Created October 1, 2021 18:39
/1.0/metrics output
# HELP lxd_cpu_seconds_total The total number of CPU seconds used in milliseconds.
# TYPE lxd_cpu_seconds_total counter
lxd_cpu_seconds_total{cpu="0",mode="system",name="lxd-build",project="default",type="container"} 3097
lxd_cpu_seconds_total{cpu="0",mode="user",name="lxd-build",project="default",type="container"} 50065
lxd_cpu_seconds_total{cpu="10",mode="system",name="lxd-build",project="default",type="container"} 8449
lxd_cpu_seconds_total{cpu="10",mode="user",name="lxd-build",project="default",type="container"} 126718
lxd_cpu_seconds_total{cpu="31",mode="system",name="lxd-build",project="default",type="container"} 0
lxd_cpu_seconds_total{cpu="31",mode="user",name="lxd-build",project="default",type="container"} 0
lxd_cpu_seconds_total{cpu="21",mode="system",name="lxd-build",project="default",type="container"} 0
lxd_cpu_seconds_total{cpu="21",mode="user",name="lxd-build",project="default",type="container"} 0
Nov 23 03:06:01 delmak kernel: Booting Linux on physical CPU 0x0000000001 [0x510f8000]
Nov 23 03:06:01 delmak kernel: Linux version 5.15.2+ (ubuntu@c2400) (gcc (Ubuntu 9.3.0-17ubuntu1~20.04) 9.3.0, GNU ld (GNU Binutils for Ubuntu) 2.34) #1 SMP Thu Nov 18 22:08:36 UTC 2021
Nov 23 03:06:01 delmak kernel: efi: EFI v2.60 by Qualcomm
Nov 23 03:06:01 delmak kernel: efi: ACPI 2.0=0x95e0000 TPMFinalLog=0x9330000 SMBIOS 3.0=0x5720000 ESRT=0xb945398 MEMATTR=0xb932018 TPMEventLog=0x92e0018 RNG=0xe7ce498 MEMRESERVE=0x97d0b98
Nov 23 03:06:01 delmak kernel: efi: seeding entropy pool
Nov 23 03:06:01 delmak kernel: random: fast init done
Nov 23 03:06:01 delmak kernel: [Firmware Bug]: Failed to parse event in TPM Final Events Log
Nov 23 03:06:01 delmak kernel: esrt: Reserving ESRT space from 0x000000000b945398 to 0x000000000b945470.
Nov 23 03:06:01 delmak kernel: ACPI: Early table checksum verification disabled
Nov 23 03:06:01 delmak kernel: ACPI: RSDP 0x00000000095E0000 000024 (v02 QCOM )
Nov 23 03:10:28 delmak kernel: ixgbe 0002:01:00.0: registered PHC device on enP2s2
Nov 23 03:10:29 delmak kernel: ixgbe 0002:01:00.0 enP2s2: detected SFP+: 3
Nov 23 03:10:29 delmak kernel: ixgbe 0002:01:00.0 enP2s2: NIC Link is Up 10 Gbps, Flow Control: RX/TX
Nov 23 03:10:29 delmak kernel: IPv6: ADDRCONF(NETDEV_CHANGE): enP2s2: link becomes ready
Nov 23 03:10:34 delmak kernel: ------------[ cut here ]------------
Nov 23 03:10:34 delmak kernel: NETDEV WATCHDOG: enP2s2 (ixgbe): transmit queue 14 timed out
Nov 23 03:10:34 delmak kernel: WARNING: CPU: 14 PID: 0 at net/sched/sch_generic.c:477 dev_watchdog+0x374/0x380
Nov 23 03:10:34 delmak kernel: Modules linked in: nls_iso8859_1 dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua joydev input_leds cdc_acm efi_pstore acpi_ipmi ipmi_ssif ipmi_devintf cppc_cpufreq ipmi_msghandler hdma_mgmt hdma sch_fq_codel drm ip_tables x_tables autofs4 btrfs blake2b_generic zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor xor_neon raid6_pq
[ 0.000000] Booting Linux on physical CPU 0x0000000000 [0x501f0000]
[ 0.000000] Linux version 5.15.2+ (ubuntu@c2400) (gcc (Ubuntu 9.3.0-17ubuntu1~20.04) 9.3.0, GNU ld (GNU Binutils for Ubuntu) 2.34) #1 SMP Thu Nov 18 22:08:36 UTC 2021
[ 0.000000] Machine model: APM X-Gene Merlin board
[ 0.000000] efi: EFI v2.40 by X-Gene Merlin Board EFI Aug 19 2016 18:09:46
[ 0.000000] efi: ACPI=0x4ffa808000 ACPI 2.0=0x4ffa808014 SMBIOS 3.0=0x4ffa806000 ESRT=0x4fff039818 MEMRESERVE=0x4ffa3e1d18
[ 0.000000] esrt: Reserving ESRT space from 0x0000004fff039818 to 0x0000004fff039878.
[ 0.000000] NUMA: No NUMA configuration found
[ 0.000000] NUMA: Faking a node at [mem 0x0000004000000000-0x0000004fffffffff]
[ 0.000000] NUMA: NODE_DATA [mem 0x4ff7d27340-0x4ff7d2bfff]
[ 0.000000] Zone ranges:
[ 0.000000] Booting Linux on physical CPU 0x0000000000 [0x501f0000]
[ 0.000000] Linux version 5.15.2 (ubuntu@c2400) (gcc (Ubuntu 9.3.0-17ubuntu1~20.04) 9.3.0, GNU ld (GNU Binutils for Ubuntu) 2.34) #1 SMP Thu Nov 18 16:27:22 UTC 2021
[ 0.000000] Machine model: APM X-Gene Merlin board
[ 0.000000] efi: EFI v2.40 by X-Gene Merlin Board EFI Aug 19 2016 18:09:46
[ 0.000000] efi: ACPI=0x4ffa808000 ACPI 2.0=0x4ffa808014 SMBIOS 3.0=0x4ffa806000 ESRT=0x4fff039818 MEMRESERVE=0x4ffa3e1c98
[ 0.000000] esrt: Reserving ESRT space from 0x0000004fff039818 to 0x0000004fff039878.
[ 0.000000] NUMA: No NUMA configuration found
[ 0.000000] NUMA: Faking a node at [mem 0x0000004000000000-0x0000004fffffffff]
[ 0.000000] NUMA: NODE_DATA [mem 0x4ff7d27340-0x4ff7d2bfff]
[ 0.000000] Zone ranges:
[ 0.000000] Booting Linux on physical CPU 0x0000000000 [0x501f0000]
[ 0.000000] Linux version 5.15.2+ (ubuntu@c2400) (gcc (Ubuntu 9.3.0-17ubuntu1~20.04) 9.3.0, GNU ld (GNU Binutils for Ubuntu) 2.34) #1 SMP Wed Nov 17 21:09:01 UTC 2021
[ 0.000000] Machine model: APM X-Gene Merlin board
[ 0.000000] efi: EFI v2.40 by X-Gene Merlin Board EFI Aug 19 2016 18:09:46
[ 0.000000] efi: ACPI=0x4ffa808000 ACPI 2.0=0x4ffa808014 SMBIOS 3.0=0x4ffa806000 ESRT=0x4fff039818 MEMRESERVE=0x4ffa3e1c98
[ 0.000000] esrt: Reserving ESRT space from 0x0000004fff039818 to 0x0000004fff039878.
[ 0.000000] NUMA: No NUMA configuration found
[ 0.000000] NUMA: Faking a node at [mem 0x0000004000000000-0x0000004fffffffff]
[ 0.000000] NUMA: NODE_DATA [mem 0x4ff7d27340-0x4ff7d2bfff]
[ 0.000000] Zone ranges:
[ 0.000000] Booting Linux on physical CPU 0x0000000000 [0x501f0000]
[ 0.000000] Linux version 5.15.2+ (ubuntu@c2400) (gcc (Ubuntu 9.3.0-17ubuntu1~20.04) 9.3.0, GNU ld (GNU Binutils for Ubuntu) 2.34) #1 SMP Wed Nov 17 21:09:01 UTC 2021
[ 0.000000] Machine model: APM X-Gene Merlin board
[ 0.000000] efi: EFI v2.40 by X-Gene Merlin Board EFI Aug 19 2016 18:09:46
[ 0.000000] efi: ACPI=0x4ffa808000 ACPI 2.0=0x4ffa808014 SMBIOS 3.0=0x4ffa806000 ESRT=0x4fff039818 MEMRESERVE=0x4ffa3e1d18
[ 0.000000] esrt: Reserving ESRT space from 0x0000004fff039818 to 0x0000004fff039878.
[ 0.000000] NUMA: No NUMA configuration found
[ 0.000000] NUMA: Faking a node at [mem 0x0000004000000000-0x0000004fffffffff]
[ 0.000000] NUMA: NODE_DATA [mem 0x4ff7d27340-0x4ff7d2bfff]
[ 0.000000] Zone ranges: