Skip to content

Instantly share code, notes, and snippets.

View hppritcha's full-sized avatar

Howard Pritchard hppritcha

View GitHub Profile
@hppritcha
hppritcha / compilers.yaml
Created June 4, 2018 16:40
content of compilers.yaml
compilers:
- compiler:
environment: {}
extra_rpaths: []
flags: {}
modules:
- PrgEnv-cray
- cce/8.4.6
operating_system: cnl6
paths:
From 827eec3753a7b178c12f3e42c2e13dcf80aa5905 Mon Sep 17 00:00:00 2001
From: Howard Pritchard <howardp@lanl.gov>
Date: Wed, 12 Apr 2017 16:19:03 -0500
Subject: [PATCH] CH4/OFI: fix provider selection bug
In working with a libfabric that has multiple providers,
and trying to use the MPIR_CVAR_OFI_USE_PROVIDER env.
variable, I noticed it wasn't working and leading to
aborts with unable to pick the selected provider.
From 2a654e59ff06657d8991eb15d727b3488d3cb4c6 Mon Sep 17 00:00:00 2001
From: Howard Pritchard <howardp@lanl.gov>
Date: Tue, 11 Apr 2017 14:38:40 -0500
Subject: [PATCH] fix usage of PMI2_Info_GetNodeAttr/PutNodeAttr
When using the CH4/OFI device, there are two separate
shared memory regions set up, both relying on the
PMI2_Info_GetNodeAttr/PutNodeAttr of the same key
somehow working. At least on Cray it wasn't. The
non-local-rank zero processes were seeing the old
From 73474b98d33cb7d56daaa93d77bef54269db99e1 Mon Sep 17 00:00:00 2001
From: Ken Raffenetti <raffenet@mcs.anl.gov>
Date: Wed, 22 Feb 2017 15:13:34 -0600
Subject: [PATCH 1/5] pmi: Move headers out of top-level include dir
If we are building with an external pmi library, we should be using the
external version of the pmi.h/pmi2.h. Move the MPICH copies into subdirs
that will be conditionally added to CPPFLAGS based on configure options.
---
src/include/Makefile.mk | 2 -
[RUN ] rdm_sr::inject
registering a region and taking a ref on dom 0x930270 nic 0x952310 handle 0x5e8ffffffffde82b
registering a region and taking a ref on dom 0x98d680 nic 0x9af590 handle 0x5e9ffffffffde82c
registering a region and taking a ref on dom 0x930270 nic 0x952310 handle 0x5ea00007fff922fc
registering a region and taking a ref on dom 0x930270 nic 0x952310 handle 0x5eb00007fff91efa
registering a region and taking a ref on dom 0x930270 nic 0x952310 handle 0x5ec00007fff8eae6
registering a region and taking a ref on dom 0x930270 nic 0x952310 handle 0x5ed00007fff8e2e5
registering a region and taking a ref on dom 0x98d680 nic 0x9af590 handle 0x5ee00007fff903f6
registering a region and taking a ref on dom 0x98d680 nic 0x9af590 handle 0x5ef00007fff8fff4
registering a region and taking a ref on dom 0x98d680 nic 0x9af590 handle 0x5f000007fff8cbe0
hpp@tt-login2[3.45]:/usr/projects/hpctools/hpp/osu-micro-benchmarks-5.3.2/mpi/pt2pt>mpirun -np 2 -N 1 --mca pml ob1 ./osu_latency
# OSU MPI Latency Test v5.3.2
# Size Latency (us)
0 1.18
1 1.21
2 1.20
4 1.20
8 1.20
16 1.21
32 1.20
master
# OSU MPI Multiple Bandwidth / Message Rate Test v5.0
# [ pairs: 1 ] [ window size: 64 ]
# Size MB/s Messages/s
[nid00077:21441] mca: base: close: component cm closed
[nid00077:21441] mca: base: close: unloading component cm
[nid00076:22676] mca: base: close: component cm closed
[nid00076:22676] mca: base: close: unloading component cm
1 1.42 1424865.51
2 2.85 1425744.13
Using this configure option for OMPI:
./configure --prefix=/users/hpp/ompi_install_dir --with-pmi --with-slurm
I ran osu_mbw_mr multiple times on a LANL omnipath system (Intel(R) Xeon(R) CPU E5-2695 v4 @ 2.10GHz processors)
1.10.x:
srun -n 2 -N 2 --label ./osu_mbw_mr_1.10.x
0: # OSU MPI Multiple Bandwidth / Message Rate Test v5.3
0: # [ pairs: 1 ] [ window size: 64 ]
From f0b4bfdb0ebdd37fd48d0ec5ecadde9b77eb1741 Mon Sep 17 00:00:00 2001
From: Howard Pritchard <howardp@lanl.gov>
Date: Thu, 31 Mar 2016 12:48:16 -0600
Subject: [PATCH 2/2] prov/verbs: fix a shutdown bug
Signed-off-by: Howard Pritchard <howardp@lanl.gov>
---
prov/verbs/src/ep_rdm/verbs_av_ep_rdm.c | 7 ++++---
1 files changed, 4 insertions(+), 3 deletions(-)
From a44735011ce8396e1f6bb9d3513c1bb618cf552c Mon Sep 17 00:00:00 2001
From: Howard Pritchard <howardp@lanl.gov>
Date: Thu, 31 Mar 2016 12:48:16 -0600
Subject: [PATCH 2/2] prov/verbs: fix a shutdown bug
Signed-off-by: Howard Pritchard <howardp@lanl.gov>
---
prov/verbs/src/ep_rdm/verbs_av_ep_rdm.c | 7 ++++---
1 files changed, 4 insertions(+), 3 deletions(-)