First setup mountpoint following the instructions in the docs.
Next we'll tell S3 Mountpoint to cache on the 28TB of local NVME available on each P5 instance.
- Mount the nvme disks as a single mount - this needs to be done on each p5 instance:
First setup mountpoint following the instructions in the docs.
Next we'll tell S3 Mountpoint to cache on the 28TB of local NVME available on each P5 instance.
Make sure the instance has arn:aws:iam::aws:policy/AmazonEC2FullAccess
permissions.
Create a script called resize.sh
with the following contents:
#!/bin/bash
# Specify the desired volume size in GiB as a command line argument. If not specified, default to 20 GiB.
SIZE=${1:-20}
cd /fsx
install-nccl-aws-ofi.sh
to install AWS OFI NCCL:#!/usr/bin/env python | |
# Copyright (c) Facebook, Inc. and its affiliates. | |
# | |
# This source code is licensed under the MIT license found in the | |
# LICENSE file in the root directory of this source tree. | |
# | |
import os | |
import sys |
#!/bin/bash | |
# run as root, then validate with: | |
# chronyc sources -v | |
# chronyc tracking | |
# see https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/set-time.html#configure-time-sync | |
apt install -y chrony | |
sed -i '/\# See http:\/\/www.pool.ntp.org\/join.html for more information./a server 169.254.169.123 prefer iburst minpoll 4 maxpoll 4\npool time.aws.com iburst' /etc/chrony/chrony.conf | |
systemctl enable --now chrony | |
/etc/init.d/chrony restart |
sudo apt-get install virtualenvwrapper
4
is the number of compute nodes: