Skip to content

Instantly share code, notes, and snippets.

@ericfont
Created April 12, 2022 21:03
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save ericfont/2a6b8703bf7aa1a423fcdcd6f7449f63 to your computer and use it in GitHub Desktop.
Save ericfont/2a6b8703bf7aa1a423fcdcd6f7449f63 to your computer and use it in GitHub Desktop.
mce errors
sudo journalctl -b -0 | grep mce
Apr 12 15:09:33 archdesktop kernel: mce: [Hardware Error]: Machine check events logged
Apr 12 15:09:33 archdesktop kernel: mce: [Hardware Error]: CPU 2: Machine Check: 0 Bank 5: bea0000000000108
Apr 12 15:09:33 archdesktop kernel: mce: [Hardware Error]: TSC 0 ADDR 1ffff85f3508a MISC d012000100000000 SYND 4d000000 IPID 500b000000000
Apr 12 15:09:33 archdesktop kernel: mce: [Hardware Error]: PROCESSOR 2:800f11 TIME 1649790570 SOCKET 0 APIC 4 microcode 8001138
Apr 12 15:09:34 archdesktop rasdaemon[420]: mce:mce_record event enabled
Apr 12 15:09:34 archdesktop rasdaemon[420]: rasdaemon: mce:mce_record event enabled
Apr 12 15:09:34 archdesktop rasdaemon[419]: mce:mce_record event enabled
Apr 12 15:09:34 archdesktop rasdaemon[419]: rasdaemon: mce:mce_record event enabled
Apr 12 15:09:34 archdesktop rasdaemon[419]: rasdaemon: Enabled event mce:mce_record
Apr 12 15:09:34 archdesktop rasdaemon[419]: Enabled event mce:mce_record
Apr 12 15:09:34 archdesktop rasdaemon[419]: rasdaemon: Recording mce_record events
Apr 12 15:09:34 archdesktop rasdaemon[419]: overriding event (113) mce:mce_record with new print handler
Apr 12 17:01:30 archdesktop dbus-daemon[416]: [system] Activating via systemd: service name='org.freedesktop.home1' unit='dbus-org.freedesktop.home1.service' requested by ':1.68' (uid=0 pid=7985 comm="sudo mcelog")
Apr 12 17:01:30 archdesktop sudo[7985]: e : TTY=pts/2 ; PWD=/home/e ; USER=root ; COMMAND=/usr/bin/mcelog
@ericfont
Copy link
Author

didn't get any errors after adding kernel parameter.

Then upgraded firmware and removed the kernel parameter, but got a mce error reboot after ~2 days:

sudo journalctl -b -1 | grep mce
Apr 19 14:02:27 archdesktop kernel: mce: [Hardware Error]: Machine check events logged
Apr 19 14:02:27 archdesktop kernel: mce: [Hardware Error]: CPU 7: Machine Check: 0 Bank 5: bea0000000000108
Apr 19 14:02:27 archdesktop kernel: mce: [Hardware Error]: TSC 0 ADDR 1ffff9fc6f922 MISC d012000100000000 SYND 4d000000 IPID 500b000000000 
Apr 19 14:02:27 archdesktop kernel: mce: [Hardware Error]: PROCESSOR 2:800f11 TIME 1650391342 SOCKET 0 APIC 3 microcode 8001138
Apr 19 14:02:28 archdesktop rasdaemon[421]: rasdaemon: mce:mce_record event enabled
Apr 19 14:02:28 archdesktop rasdaemon[421]: mce:mce_record event enabled
Apr 19 14:02:28 archdesktop rasdaemon[420]: mce:mce_record event enabled
Apr 19 14:02:28 archdesktop rasdaemon[420]: rasdaemon: mce:mce_record event enabled
Apr 19 14:02:28 archdesktop rasdaemon[420]: rasdaemon: Enabled event mce:mce_record
Apr 19 14:02:28 archdesktop rasdaemon[420]: Enabled event mce:mce_record
Apr 19 14:02:28 archdesktop rasdaemon[420]: rasdaemon: Recording mce_record events
Apr 19 14:02:28 archdesktop rasdaemon[420]: overriding event (113) mce:mce_record with new print handler

Then I rebooted again, and got another spontaneous reboot after roughly 5 minutes.

@ericfont
Copy link
Author

now I've added sudo zenstates --c6-disable to my .xinitrc

@ericfont
Copy link
Author

somehow got a reboot:

sudo journalctl -b -0 | grep mce
May 15 21:43:23 archdesktop kernel: mce: [Hardware Error]: Machine check events logged
May 15 21:43:23 archdesktop kernel: mce: [Hardware Error]: CPU 2: Machine Check: 0 Bank 5: bea0000000000108
May 15 21:43:23 archdesktop kernel: mce: [Hardware Error]: TSC 0 ADDR 1ffffb7b19940 MISC d012000100000000 SYND 4d000000 IPID 500b000000000 
May 15 21:43:23 archdesktop kernel: mce: [Hardware Error]: PROCESSOR 2:800f11 TIME 1652665398 SOCKET 0 APIC 4 microcode 8001138
May 15 21:43:24 archdesktop rasdaemon[423]: rasdaemon: mce:mce_record event enabled
May 15 21:43:24 archdesktop rasdaemon[422]: rasdaemon: mce:mce_record event enabled
May 15 21:43:24 archdesktop rasdaemon[422]: rasdaemon: Enabled event mce:mce_record
May 15 21:43:24 archdesktop rasdaemon[423]: mce:mce_record event enabled
May 15 21:43:24 archdesktop rasdaemon[422]: mce:mce_record event enabled
May 15 21:43:24 archdesktop rasdaemon[422]: Enabled event mce:mce_record
May 15 21:43:24 archdesktop rasdaemon[422]: rasdaemon: Recording mce_record events

@ericfont
Copy link
Author

ericfont commented Jun 9, 2022

Jun 07 20:07:32 archdesktop kernel: mce: [Hardware Error]: Machine check events logged
Jun 07 20:07:32 archdesktop kernel: mce: [Hardware Error]: CPU 6: Machine Check: 0 Bank 5: bea0000000000108
Jun 07 20:07:32 archdesktop kernel: mce: [Hardware Error]: TSC 0 ADDR 1ffff9366e82e MISC d012000100000000 SYND 4d000000 IPID 500b000000000 
Jun 07 20:07:32 archdesktop kernel: mce: [Hardware Error]: PROCESSOR 2:800f11 TIME 1654646847 SOCKET 0 APIC 1 microcode 8001138
Jun 07 20:07:32 archdesktop kernel: mce: [Hardware Error]: Machine check events logged
Jun 07 20:07:32 archdesktop kernel: mce: [Hardware Error]: CPU 9: Machine Check: 0 Bank 5: bea0000000000108
Jun 07 20:07:32 archdesktop kernel: mce: [Hardware Error]: TSC 0 ADDR 1ffff9366e882 MISC d012000100000000 SYND 4d000000 IPID 500b000000000 
Jun 07 20:07:32 archdesktop kernel: mce: [Hardware Error]: PROCESSOR 2:800f11 TIME 1654646847 SOCKET 0 APIC 9 microcode 8001138

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment