Skip to content

Instantly share code, notes, and snippets.

@yalue
Created August 14, 2020 21:22
Show Gist options
  • Save yalue/c69f0c7e46a6d87ebc6425c037f8edb8 to your computer and use it in GitHub Desktop.
Save yalue/c69f0c7e46a6d87ebc6425c037f8edb8 to your computer and use it in GitHub Desktop.
dmesg output when I tried running "import torch"
[ 347.582485] BUG: kernel NULL pointer dereference, address: 0000000000000028
[ 347.582489] #PF: supervisor write access in kernel mode
[ 347.582490] #PF: error_code(0x0002) - not-present page
[ 347.582491] PGD 0 P4D 0
[ 347.582494] Oops: 0002 [#1] PREEMPT SMP
[ 347.582496] CPU: 15 PID: 7937 Comm: python Not tainted 5.8.0+ #1
[ 347.582497] Hardware name: Dell Inc. Precision Tower 7910/0NK5PH, BIOS A13 05/20/2016
[ 347.582626] RIP: 0010:amdgpu_amdkfd_gpuvm_alloc_memory_of_gpu+0x48d/0x900 [amdgpu]
[ 347.582628] Code: 03 c1 e8 96 bf b2 dd e9 94 fd ff ff 83 bd 40 ff ff ff 02 48 8b 85 78 ff ff ff 75 12 48 8b 90 30 02 00 00 4c 89 b0 90 02 00 00 <4c> 89 72 28 48 8b 13 48 83 bd 48 ff ff ff 00 48 89 90 90 03 00 00
[ 347.582629] RSP: 0018:ffff9c00c2037cc0 EFLAGS: 00010246
[ 347.582631] RAX: ffff8f00455aec00 RBX: ffff9c00c2037dc0 RCX: 0000000000000000
[ 347.582632] RDX: 0000000000000000 RSI: ffff8f00423c4fe8 RDI: ffff8f00455aed50
[ 347.582633] RBP: ffff9c00c2037da0 R08: ffff8f00423c4f68 R09: 0000000000000000
[ 347.582634] R10: ffff8f003f6c6030 R11: ffff8f00455aec70 R12: 0000000000000000
[ 347.582635] R13: ffff8f00423c0000 R14: ffff8f004b9593b0 R15: 0000000000001000
[ 347.582636] FS: 00007fe2a8c84740(0000) GS:ffff8f009fdc0000(0000) knlGS:0000000000000000
[ 347.582637] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 347.582638] CR2: 0000000000000028 CR3: 00000004098e9003 CR4: 00000000001706e0
[ 347.582639] Call Trace:
[ 347.582713] kfd_ioctl_alloc_memory_of_gpu+0xbc/0x210 [amdgpu]
[ 347.582784] kfd_ioctl+0x274/0x500 [amdgpu]
[ 347.582854] ? kfd_dev_is_large_bar+0x90/0x90 [amdgpu]
[ 347.582860] ? __do_munmap+0x2db/0x510
[ 347.582864] ? tomoyo_file_ioctl+0x19/0x20
[ 347.582869] ksys_ioctl+0x98/0xb0
[ 347.582871] __x64_sys_ioctl+0x1a/0x20
[ 347.582876] do_syscall_64+0x37/0x80
[ 347.582878] entry_SYSCALL_64_after_hwframe+0x44/0xa9
[ 347.582880] RIP: 0033:0x7fe2a859c6d7
[ 347.582882] Code: b3 66 90 48 8b 05 b1 47 2d 00 64 c7 00 26 00 00 00 48 c7 c0 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 81 47 2d 00 f7 d8 64 89 01 48
[ 347.582883] RSP: 002b:00007ffec5b155a8 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
[ 347.582884] RAX: ffffffffffffffda RBX: 00007fe22269f3a0 RCX: 00007fe2a859c6d7
[ 347.582885] RDX: 00007ffec5b155f0 RSI: 00000000c0284b16 RDI: 0000000000000004
[ 347.582886] RBP: 00007ffec5b155f0 R08: 00007ffec5b15730 R09: 0000000084000010
[ 347.582887] R10: 0000000000004022 R11: 0000000000000246 R12: 00000000c0284b16
[ 347.582888] R13: 0000000000000004 R14: 0000000000001000 R15: 00007fe22269f438
[ 347.582890] Modules linked in: nf_conntrack_netbios_ns nf_conntrack_broadcast ipt_REJECT nf_reject_ipv4 xt_tcpudp xt_conntrack nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 iptable_filter binfmt_misc amdgpu intel_rapl_msr intel_rapl_common sb_edac x86_pkg_temp_thermal intel_powerclamp iommu_v2 snd_hda_codec_realtek gpu_sched ttm snd_hda_codec_generic snd_hda_codec_hdmi ledtrig_audio drm_kms_helper coretemp snd_hda_intel kvm_intel snd_intel_dspcfg snd_hda_codec snd_usb_audio drm snd_hda_core kvm snd_seq_midi snd_usbmidi_lib snd_hwdep snd_seq_midi_event mc snd_rawmidi snd_seq fb_sys_fops syscopyarea snd_pcm irqbypass snd_seq_device sysfillrect snd_timer sysimgblt crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel crypto_simd joydev snd cryptd hid_sony glue_helper input_leds ff_memless rapl intel_cstate soundcore dell_smm_hwmon mei_me dell_smbios mei dcdbas wmi_bmof dell_wmi_descriptor lpc_ich intel_wmi_thunderbolt mxm_wmi mac_hid sch_fq_codel parport_pc ppdev lp parport sunrpc
[ 347.582916] ip_tables x_tables autofs4 btrfs blake2b_generic xor zstd_compress raid6_pq libcrc32c hid_generic igb usbhid e1000e ahci i2c_algo_bit mpt3sas hid libahci dca raid_class scsi_transport_sas wmi
[ 347.582927] CR2: 0000000000000028
[ 347.582929] ---[ end trace eab3ba5088578679 ]---
[ 347.583002] RIP: 0010:amdgpu_amdkfd_gpuvm_alloc_memory_of_gpu+0x48d/0x900 [amdgpu]
[ 347.583004] Code: 03 c1 e8 96 bf b2 dd e9 94 fd ff ff 83 bd 40 ff ff ff 02 48 8b 85 78 ff ff ff 75 12 48 8b 90 30 02 00 00 4c 89 b0 90 02 00 00 <4c> 89 72 28 48 8b 13 48 83 bd 48 ff ff ff 00 48 89 90 90 03 00 00
[ 347.583005] RSP: 0018:ffff9c00c2037cc0 EFLAGS: 00010246
[ 347.583007] RAX: ffff8f00455aec00 RBX: ffff9c00c2037dc0 RCX: 0000000000000000
[ 347.583008] RDX: 0000000000000000 RSI: ffff8f00423c4fe8 RDI: ffff8f00455aed50
[ 347.583009] RBP: ffff9c00c2037da0 R08: ffff8f00423c4f68 R09: 0000000000000000
[ 347.583009] R10: ffff8f003f6c6030 R11: ffff8f00455aec70 R12: 0000000000000000
[ 347.583010] R13: ffff8f00423c0000 R14: ffff8f004b9593b0 R15: 0000000000001000
[ 347.583012] FS: 00007fe2a8c84740(0000) GS:ffff8f009fdc0000(0000) knlGS:0000000000000000
[ 347.583013] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 347.583014] CR2: 0000000000000028 CR3: 00000004098e9003 CR4: 00000000001706e0
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment