Skip to content

Instantly share code, notes, and snippets.

@chauhang
Created March 29, 2024 07:13
Show Gist options
  • Save chauhang/518cbdd437ea6f7c32ab469a75ae8777 to your computer and use it in GitHub Desktop.
Save chauhang/518cbdd437ea6f7c32ab469a75ae8777 to your computer and use it in GitHub Desktop.
rocm-bnb install compile error
## Installing ROCm version of BnB from https://github.com/TimDettmers/bitsandbytes/issues/107#issuecomment-2016586255
## insta.sh is failing with errors:
./install.sh -ic
Creating project build directory in: /home/gchauhan/meta/hipBLASLt/build
~/meta/hipBLASLt ~/meta/hipBLASLt
~/meta/hipBLASLt/build/release ~/meta/hipBLASLt/build/release ~/meta/hipBLASLt
2024-03-29 00:10:47 URL:https://objects.githubusercontent.com/github-production-release-asset-2e65be/80030341/2c121300-b9d7-11e9-9538-1012fba8c3b6?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIAVCODYLSA53PQK4ZA%2F20240329%2Fus-east-1%2Fs3%2Faws4_request&X-Amz-Date=20240329T071043Z&X-Amz-Expires=300&X-Amz-Signature=686266cabf80e9f66a0dccea94cbcdea219b70eb4bf02979c85848807e4b3eb1&X-Amz-SignedHeaders=host&actor_id=0&key_id=0&repo_id=80030341&response-content-disposition=attachment%3B%20filename%3Daocl-blis-mt-centos-2.0.tar.gz&response-content-type=application%2Foctet-stream [2736770/2736770] -> "blis.tar.gz" [1]
amd-blis-mt/
amd-blis-mt/examples/
amd-blis-mt/examples/input.general
amd-blis-mt/examples/input.operations
amd-blis-mt/examples/test_libblis.x
amd-blis-mt/LICENSE
amd-blis-mt/include/
amd-blis-mt/include/blis/
amd-blis-mt/include/blis/blis.h
amd-blis-mt/include/blis/cblas.h
amd-blis-mt/lib/
amd-blis-mt/lib/libblis.so.2.0.0
amd-blis-mt/lib/libblis.so.2
amd-blis-mt/lib/libblis-mt.a
amd-blis-mt/lib/libblis-mt.so
amd-blis-mt/ReleaseNotes.txt
~/meta/hipBLASLt/build/release ~/meta/hipBLASLt
-DAMDGPU_TARGETS=all -DCMAKE_BUILD_TYPE=Release
-- The CXX compiler identification is Clang 17.0.0
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Check for working CXX compiler: /opt/rocm/bin/hipcc - skipped
-- Detecting CXX compile features
-- Detecting CXX compile features - done
-- Using hip-clang to build for amdgpu backend
*******************************************************************************
*------------------------------- ROCMChecks WARNING --------------------------*
Options and properties should be set on a cmake target where possible. The
variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by
calling 'cmake -DCMAKE_CXX_FLAGS=" -D__HIP_HCC_COMPAT_MODE__=1"'
or set in a toolchain file and added with
'cmake -DCMAKE_TOOLCHAIN_FILE=<toolchain-file>'. ROCMChecks now calling:
CMake Warning at /opt/rocm/share/rocm/cmake/ROCMChecks.cmake:46 (message):
'CMAKE_CXX_FLAGS' is set at
/home/gchauhan/meta/hipBLASLt/CMakeLists.txt:<line#> shown below:
Call Stack (most recent call first):
CMakeLists.txt:9223372036854775807 (rocm_check_toolchain_var)
CMakeLists.txt:130 (set)
*-----------------------------------------------------------------------------*
*******************************************************************************
-- Found Git: /usr/local/bin/git (found version "2.43.0")
CMake Warning (dev) at cmake/findBLIS.cmake:41 (set):
Cannot set "BLIS_FOUND": current scope has no parent.
Call Stack (most recent call first):
CMakeLists.txt:141 (include)
This warning is for project developers. Use -Wno-dev to suppress it.
CMake Warning (dev) at cmake/findBLIS.cmake:42 (set):
Cannot set "BLIS_INCLUDE_DIR": current scope has no parent.
Call Stack (most recent call first):
CMakeLists.txt:141 (include)
This warning is for project developers. Use -Wno-dev to suppress it.
CMake Warning (dev) at cmake/findBLIS.cmake:43 (set):
Cannot set "BLIS_LIB": current scope has no parent.
Call Stack (most recent call first):
CMakeLists.txt:141 (include)
This warning is for project developers. Use -Wno-dev to suppress it.
BLIS heeader directory found: /home/gchauhan/meta/hipBLASLt/build/deps/blis/include/blis
BLIS lib found: /home/gchauhan/meta/hipBLASLt/build/deps/blis/lib/libblis.a
-- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_on
-- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_on - Success
-- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_off
-- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_off - Success
-- Performing Test COMPILER_HAS_TARGET_ID_gfx940
-- Performing Test COMPILER_HAS_TARGET_ID_gfx940 - Success
-- Performing Test COMPILER_HAS_TARGET_ID_gfx941
-- Performing Test COMPILER_HAS_TARGET_ID_gfx941 - Success
-- Performing Test COMPILER_HAS_TARGET_ID_gfx942
-- Performing Test COMPILER_HAS_TARGET_ID_gfx942 - Success
-- AMDGPU_TARGETS: gfx90a:xnack+;gfx90a:xnack-;gfx940;gfx941;gfx942
-- Performing Test CMAKE_HAVE_LIBC_PTHREAD
-- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success
-- Found Threads: TRUE
-- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS
-- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS - Success
/home/gchauhan/my_envs/llm-amd/bin/python3 -m venv /home/gchauhan/meta/hipBLASLt/build/release/virtualenv --system-site-packages --clear
/home/gchauhan/meta/hipBLASLt/build/release/virtualenv/bin/python3 -m pip install /home/gchauhan/meta/hipBLASLt/tensilelite
Processing /home/gchauhan/meta/hipBLASLt/tensilelite
Preparing metadata (setup.py): started
Preparing metadata (setup.py): finished with status 'done'
Requirement already satisfied: pyyaml in /home/gchauhan/my_envs/llm-amd/lib/python3.11/site-packages (from Tensile==4.33.0) (6.0.1)
Collecting msgpack (from Tensile==4.33.0)
Obtaining dependency information for msgpack from https://files.pythonhosted.org/packages/f6/f0/a7bdb48223cd21b9abed814b08fca8fe6a40931e70ec97c24d2f15d68ef3/msgpack-1.0.8-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata
Downloading msgpack-1.0.8-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (9.1 kB)
Requirement already satisfied: joblib in /home/gchauhan/my_envs/llm-amd/lib/python3.11/site-packages (from Tensile==4.33.0) (1.3.2)
Downloading msgpack-1.0.8-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (409 kB)
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 409.3/409.3 kB 251.8 kB/s eta 0:00:00
Building wheels for collected packages: Tensile
Building wheel for Tensile (setup.py): started
Building wheel for Tensile (setup.py): finished with status 'done'
Created wheel for Tensile: filename=Tensile-4.33.0-py3-none-any.whl size=11068245 sha256=3d277271a83b00a869d49c8387ac3904d0a56bd9fd9f7b7c6c085ddf52860918
Stored in directory: /tmp/pip-ephem-wheel-cache-88jkrhv2/wheels/bd/ab/6e/a0075d1e03a10adb1547cd00de58d9feebe26458ce989db6a6
Successfully built Tensile
Installing collected packages: msgpack, Tensile
Successfully installed Tensile-4.33.0 msgpack-1.0.8
[notice] A new release of pip is available: 23.2.1 -> 24.0
[notice] To update, run: python3 -m pip install --upgrade pip
-- Adding /home/gchauhan/meta/hipBLASLt/build/release/virtualenv to CMAKE_PREFIX_PATH
-- The C compiler identification is GNU 11.4.1
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Check for working C compiler: /usr/bin/cc - skipped
-- Detecting C compile features
-- Detecting C compile features - done
CMake Error at build/release/virtualenv/lib/python3.11/site-packages/Tensile/Source/lib/CMakeLists.txt:105 (find_package):
By not providing "Findmsgpack.cmake" in CMAKE_MODULE_PATH this project has
asked CMake to find a package configuration file provided by "msgpack", but
CMake did not find one.
Could not find a package configuration file provided by "msgpack" with any
of the following names:
msgpackConfig.cmake
msgpack-config.cmake
Add the installation prefix of "msgpack" to CMAKE_PREFIX_PATH or set
"msgpack_DIR" to a directory containing one of the above files. If
"msgpack" provides a separate development package or SDK, be sure it has
been installed.
-- Configuring incomplete, errors occurred!
@chauhang
Copy link
Author

chauhang commented Mar 29, 2024

Environment

python -m torch.utils.collect_env

Collecting environment information...
PyTorch version: 2.4.0.dev20240326+rocm6.0
Is debug build: False
CUDA used to build PyTorch: N/A
ROCM used to build PyTorch: 6.0.32830-d62f6a171

OS: CentOS Stream 9 (x86_64)
GCC version: (GCC) 11.4.1 20231218 (Red Hat 11.4.1-3)
Clang version: Could not collect
CMake version: version 3.26.5
Libc version: glibc-2.34

Python version: 3.11.5 (main, Sep 11 2023, 13:54:46) [GCC 11.2.0] (64-bit runtime)
Python platform: Linux-6.4.3-0_fbk1_zion_755_ga25447393a1d-x86_64-with-glibc2.34
Is CUDA available: True
CUDA runtime version: Could not collect
CUDA_MODULE_LOADING set to: LAZY
GPU models and configuration: AMD Instinct MI250X / MI250 (gfx90a:sramecc+:xnack-)
Nvidia driver version: Could not collect
cuDNN version: Could not collect
HIP runtime version: 6.0.32830
MIOpen runtime version: 3.0.0
Is XNNPACK available: True

CPU:
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Address sizes: 48 bits physical, 48 bits virtual
Byte Order: Little Endian
CPU(s): 256
On-line CPU(s) list: 0-255
Vendor ID: AuthenticAMD
Model name: AMD EPYC 7713 64-Core Processor
CPU family: 25
Model: 1
Thread(s) per core: 2
Core(s) per socket: 64
Socket(s): 2
Stepping: 1
Frequency boost: enabled
CPU(s) scaling MHz: 100%
CPU max MHz: 2000.0000
CPU min MHz: 1500.0000
BogoMIPS: 3992.39
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc cpuid extd_apicid aperfmperf rapl pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt aes xsave avx f16c rdrand lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb cat_l3 cdp_l3 invpcid_single hw_pstate ssbd mba ibrs ibpb stibp vmmcall fsgsbase bmi1 avx2 smep bmi2 invpcid cqm rdt_a rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local clzero irperf xsaveerptr rdpru wbnoinvd amd_ppin brs arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold v_vmsave_vmload vgif v_spec_ctrl umip pku ospke vaes vpclmulqdq rdpid overflow_recov succor smca
Virtualization: AMD-V
L1d cache: 4 MiB (128 instances)
L1i cache: 4 MiB (128 instances)
L2 cache: 64 MiB (128 instances)
L3 cache: 512 MiB (16 instances)
NUMA node(s): 2
NUMA node0 CPU(s): 0-63,128-191
NUMA node1 CPU(s): 64-127,192-255
Vulnerability Itlb multihit: Not affected
Vulnerability L1tf: Not affected
Vulnerability Mds: Not affected
Vulnerability Meltdown: Not affected
Vulnerability Mmio stale data: Not affected
Vulnerability Retbleed: Not affected
Vulnerability Spec store bypass: Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization
Vulnerability Spectre v2: Vulnerable, IBPB: conditional, IBRS_FW, STIBP: always-on, RSB filling, PBRSB-eIBRS: Not affected
Vulnerability Srbds: Not affected
Vulnerability Tsx async abort: Not affected

Versions of relevant libraries:
[pip3] mypy-extensions==1.0.0
[pip3] numpy==1.26.0
[pip3] pytorch-triton-rocm==3.0.0+0a22a91d04
[pip3] torch==2.4.0.dev20240326+rocm6.0
[pip3] torchao-nightly==2024.3.25
[pip3] torchtune==0.0.1
[conda] numpy 1.26.0 pypi_0 pypi
[conda] pytorch-triton-rocm 3.0.0+0a22a91d04 pypi_0 pypi
[conda] torch 2.4.0.dev20240326+rocm6.0 pypi_0 pypi
[conda] torchao-nightly 2024.3.25 pypi_0 pypi
[conda] torchtune 0.0.1 pypi_0 pypi

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment