Skip to content

Instantly share code, notes, and snippets.

Created May 8, 2014 19:54
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save anonymous/fbd83343f456f0688cea to your computer and use it in GitHub Desktop.
Save anonymous/fbd83343f456f0688cea to your computer and use it in GitHub Desktop.
spenser@scotch:src$ mpirun -np 2 valgrind ./transpose2 8
==31275== Memcheck, a memory error detector
==31275== Copyright (C) 2002-2012, and GNU GPL'd, by Julian Seward et al.
==31275== Using Valgrind-3.8.1 and LibVEX; rerun with -h for copyright info
==31275== Command: ./transpose2 8
==31275==
==31276== Memcheck, a memory error detector
==31276== Copyright (C) 2002-2012, and GNU GPL'd, by Julian Seward et al.
==31276== Using Valgrind-3.8.1 and LibVEX; rerun with -h for copyright info
==31276== Command: ./transpose2 8
==31276==
==31275== Syscall param writev(vector[...]) points to uninitialised byte(s)
==31275== at 0x34414E0C1B: writev (in /lib64/libc-2.12.so)
==31275== by 0x6066D66: ??? (in /usr/lib64/openmpi/lib/openmpi/mca_oob_tcp.so)
==31275== by 0x606833C: ??? (in /usr/lib64/openmpi/lib/openmpi/mca_oob_tcp.so)
==31275== by 0x606A94C: ??? (in /usr/lib64/openmpi/lib/openmpi/mca_oob_tcp.so)
==31275== by 0x5E5E3D5: ??? (in /usr/lib64/openmpi/lib/openmpi/mca_rml_oob.so)
==31275== by 0x5E5E61F: ??? (in /usr/lib64/openmpi/lib/openmpi/mca_rml_oob.so)
==31275== by 0x6475E97: ??? (in /usr/lib64/openmpi/lib/openmpi/mca_grpcomm_bad.so)
==31275== by 0x344244BB69: ??? (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31275== by 0x3442463C6F: PMPI_Init (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31275== by 0x401147: main (transpose2.c:82)
==31275== Address 0x4fd40e1 is 161 bytes inside a block of size 256 alloc'd
==31275== at 0x4A06BE0: realloc (vg_replace_malloc.c:662)
==31275== by 0x34424D3562: ??? (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31275== by 0x34424D3FD4: ??? (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31275== by 0x34424AF2D6: orte_grpcomm_base_pack_modex_entries (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31275== by 0x6475E4C: ??? (in /usr/lib64/openmpi/lib/openmpi/mca_grpcomm_bad.so)
==31275== by 0x344244BB69: ??? (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31275== by 0x3442463C6F: PMPI_Init (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31275== by 0x401147: main (transpose2.c:82)
==31275==
==31276== Syscall param writev(vector[...]) points to uninitialised byte(s)
==31276== at 0x34414E0C1B: writev (in /lib64/libc-2.12.so)
==31276== by 0x6066D66: ??? (in /usr/lib64/openmpi/lib/openmpi/mca_oob_tcp.so)
==31276== by 0x606833C: ??? (in /usr/lib64/openmpi/lib/openmpi/mca_oob_tcp.so)
==31276== by 0x606A94C: ??? (in /usr/lib64/openmpi/lib/openmpi/mca_oob_tcp.so)
==31276== by 0x5E5E3D5: ??? (in /usr/lib64/openmpi/lib/openmpi/mca_rml_oob.so)
==31276== by 0x5E5E61F: ??? (in /usr/lib64/openmpi/lib/openmpi/mca_rml_oob.so)
==31276== by 0x6475E97: ??? (in /usr/lib64/openmpi/lib/openmpi/mca_grpcomm_bad.so)
==31276== by 0x344244BB69: ??? (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31276== by 0x3442463C6F: PMPI_Init (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31276== by 0x401147: main (transpose2.c:82)
==31276== Address 0x4fd3cc1 is 161 bytes inside a block of size 256 alloc'd
==31276== at 0x4A06BE0: realloc (vg_replace_malloc.c:662)
==31276== by 0x34424D3562: ??? (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31276== by 0x34424D3FD4: ??? (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31276== by 0x34424AF2D6: orte_grpcomm_base_pack_modex_entries (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31276== by 0x6475E4C: ??? (in /usr/lib64/openmpi/lib/openmpi/mca_grpcomm_bad.so)
==31276== by 0x344244BB69: ??? (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31276== by 0x3442463C6F: PMPI_Init (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31276== by 0x401147: main (transpose2.c:82)
==31276==
N = 8
Matrix =
0: 0 1 2 3 4 5 6 7
0: 8 9 10 11 12 13 14 15
0: 16 17 18 19 20 21 22 23
0: 24 25 26 27 28 29 30 31
1: 32 33 34 35 36 37 38 39
1: 40 41 42 43 44 45 46 47
1: 48 49 50 51 52 53 54 55
1: 56 57 58 59 60 61 62 63
DOING TRANSPOSE
DOING TRANSPOSE
==31276== Invalid read of size 2
==31276== at 0x4A08D6C: memcpy (mc_replace_strmem.c:882)
==31276== by 0x34424DD616: ??? (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31276== by 0x8D4723C: ??? (in /usr/lib64/openmpi/lib/openmpi/mca_btl_sm.so)
==31276== by 0x7CDD813: ??? (in /usr/lib64/openmpi/lib/openmpi/mca_pml_ob1.so)
==31276== by 0x7CD41BF: ??? (in /usr/lib64/openmpi/lib/openmpi/mca_pml_ob1.so)
==31276== by 0x9BEDB22: ??? (in /usr/lib64/openmpi/lib/openmpi/mca_coll_tuned.so)
==31276== by 0x9BF2C18: ??? (in /usr/lib64/openmpi/lib/openmpi/mca_coll_tuned.so)
==31276== by 0x344245752C: PMPI_Alltoall (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31276== by 0x401100: mpi_transpose (transpose2.c:68)
==31276== by 0x40130D: main (transpose2.c:118)
==31276== Address 0x840201e is 14 bytes after a block of size 256 alloc'd
==31276== at 0x4A069EE: malloc (vg_replace_malloc.c:270)
==31276== by 0x400F0F: mpi_transpose (transpose2.c:40)
==31276== by 0x40130D: main (transpose2.c:118)
==31276==
==31276== Invalid write of size 2
==31276== at 0x4A08C4C: memcpy (mc_replace_strmem.c:882)
==31276== by 0x34424DE9EE: ??? (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31276== by 0x7CD5F1A: ??? (in /usr/lib64/openmpi/lib/openmpi/mca_pml_ob1.so)
==31276== by 0x8D4842C: ??? (in /usr/lib64/openmpi/lib/openmpi/mca_btl_sm.so)
==31276== by 0x34424D1AC9: opal_progress (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31276== by 0x344244A9F4: ??? (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31276== by 0x9BEDB3D: ??? (in /usr/lib64/openmpi/lib/openmpi/mca_coll_tuned.so)
==31276== by 0x9BF2C18: ??? (in /usr/lib64/openmpi/lib/openmpi/mca_coll_tuned.so)
==31276== by 0x344245752C: PMPI_Alltoall (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31276== by 0x401100: mpi_transpose (transpose2.c:68)
==31276== by 0x40130D: main (transpose2.c:118)
==31276== Address 0x4fed5a0 is 0 bytes after a block of size 256 alloc'd
==31276== at 0x4A069EE: malloc (vg_replace_malloc.c:270)
==31276== by 0x401212: main (transpose2.c:97)
==31276==
==31276== Invalid read of size 8
==31276== at 0x4A08BE8: memcpy (mc_replace_strmem.c:882)
==31276== by 0x34424DB707: ??? (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31276== by 0x344245528E: ompi_datatype_sndrcv (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31276== by 0x344245752C: PMPI_Alltoall (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31276== by 0x401100: mpi_transpose (transpose2.c:68)
==31276== by 0x40130D: main (transpose2.c:118)
==31276== Address 0x8402020 is 16 bytes after a block of size 256 alloc'd
==31276== at 0x4A069EE: malloc (vg_replace_malloc.c:270)
==31276== by 0x400F0F: mpi_transpose (transpose2.c:40)
==31276== by 0x40130D: main (transpose2.c:118)
==31276==
==31276== Invalid write of size 8
==31276== at 0x4A08BF3: memcpy (mc_replace_strmem.c:882)
==31276== by 0x34424DB707: ??? (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31276== by 0x344245528E: ompi_datatype_sndrcv (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31276== by 0x344245752C: PMPI_Alltoall (in /usr/lib64/openmpi/lib/libmpi.so.1.0.2)
==31276== by 0x401100: mpi_transpose (transpose2.c:68)
==31276== by 0x40130D: main (transpose2.c:118)
==31276== Address 0x4fed5b0 is 16 bytes after a block of size 256 alloc'd
==31276== at 0x4A069EE: malloc (vg_replace_malloc.c:270)
==31276== by 0x401212: main (transpose2.c:97)
==31276==
Matrix =
0: 0 8 16 24 32 40 48 56
0: 1 9 17 25 33 41 49 57
0: 2 10 18 26 34 42 50 58
0: 3 11 19 27 35 43 51 59
1: 4 12 20 28 36 44 52 60
1: 5 13 21 29 37 45 53 61
1: 6 14 22 30 38 46 54 62
1: 7 15 23 31 39 47 55 63
==31276==
==31276== HEAP SUMMARY:
==31276== in use at exit: 159,364 bytes in 171 blocks
==31276== total heap usage: 6,757 allocs, 6,586 frees, 13,173,643 bytes allocated
==31276==
==31276== LEAK SUMMARY:
==31276== definitely lost: 51,469 bytes in 68 blocks
==31276== indirectly lost: 15,066 bytes in 40 blocks
==31276== possibly lost: 0 bytes in 0 blocks
==31276== still reachable: 92,829 bytes in 63 blocks
==31276== suppressed: 0 bytes in 0 blocks
==31276== Rerun with --leak-check=full to see details of leaked memory
==31276==
==31276== For counts of detected and suppressed errors, rerun with: -v
==31276== Use --track-origins=yes to see where uninitialised values come from
==31276== ERROR SUMMARY: 82 errors from 5 contexts (suppressed: 6 from 6)
--------------------------------------------------------------------------
mpirun noticed that process rank 1 with PID 31276 on node scotch.ecasp.org exited on signal 27 (Profiling timer expired).
--------------------------------------------------------------------------
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment