cmake -DCUTLASS_LIBRARY_KERNELS=cutlass3x_sm90* -DCUTLASS_ENABLE_CUBLAS=ON -DCUTLASS_ENABLE_CUDNN=ON -DCUTLASS_NVCC_ARCHS="90a" ..
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
sudo add-apt-repository ppa:deadsnakes/ppa | |
sudo apt install python3.11 python3.11-distutils python3.11-venv libpython3.11-dev | |
curl -sS https://bootstrap.pypa.io/get-pip.py | python3.11 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
# Ignore `tvm.save_and_copy_tensor` packed functions inserted for debugging | |
@R.function | |
def decode(input_ids1: R.Tensor((1, 1), dtype="int32"), all_seq_len: R.Shape(["n"]), kv_cache: R.Tuple(R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Object, R.Obj |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
import os | |
import torch | |
import torch.distributed as dist | |
def read_file_and_all_reduce(): | |
# Get the rank and world size from environment variables | |
rank = int(os.environ['LOCAL_RANK']) | |
world_size = int(os.environ['WORLD_SIZE']) | |
# initialize the process group |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
The fence we walked between the years | |
Did balance us serene; | |
It was a place half in the sky where | |
In the green of leaf and promising of peach | |
We’d reach our hand to touch, and almost touch the sky. | |
If we could reach and touch, we said, | |
‘Twould teach us not to, never to, be dead. | |
We ached and almost touched that stuff; | |
Our reach was never quite enough. |
- modern CMake is required for building a lot of new software
- CMake is dependency for many packages (e.g. ROS related)
- we don't want to remove CMake (which would remove packages that depend on it)
- we want safe procedure to update CMake that can be reversed easily
Check current version
Authored-by: Eric Lunderberg
Notes summarizing discussion between @Lunderberg and @csullivan on 2022_10_25
From previous conversation, possibility of representing pad/crop separately from the layout transform. This would allow algebraic
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Source: https://www.retrospect.com/au/support/kb/repair_efi_bootloader | |
This guide details how to rebuild the BCD boot store for windows. | |
Use diskpart to make System partition available. | |
(This will allow us to use bcdedit later in this guide to repair the bootloader.) | |
Boot from the Windows installation media |
On an Ubuntu 18.04 system with two NVIDIA GPUs only the change to Grub below was necessary, I did not need to mess with modprobe or initramfs. In addition, I needed to set the PCI bus for the correct nvidia xconfig to the non-blacklisted GPU following these steps
sudo nvidia-xconfig
This will create a new xorg config at /etc/X11/xorg.conf.
Then add the BusID
line in the following part of the file /etc/X11/xorg.conf:
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
#!/bin/bash | |
function run() { | |
device_id=$1 | |
echo "Starting rpc server on adb device: $device_id" | |
preamble=' | |
spawn adb -s' | |
body='shell | |
expect "#" | |
set cmd "cd /data/local/tmp/; LD_LIBRARY_PATH=/data/local/tmp /data/local/tmp/tvm_rpc server --tracker=IP_ADDRESS:PORT --key=\"your_key_here\"" | |
send $cmd |
NewerOlder