- GPU: AMD Radeon RX 7700 XT / 7800 XT (Navi 32, gfx1101)
- OS: Ubuntu 25.10
- Kernel: 6.17.0-6-generic
- Desktop Environment: KDE Plasma (Wayland)
- Mesa: 25.2.3-1ubuntu1
- Driver: amdgpu (in-kernel)
Important: GPU freezes began after upgrading to Ubuntu 25.10
Analysis of system logs reveals multiple issues causing GPU instability:
amdgpu: smu driver if version = 0x0000003d, smu fw if version = 0x00000040
amdgpu: SMU driver if version not matched
Impact: The System Management Unit (SMU) version mismatch can cause:
- Power management instability
- GPU frequency scaling issues
- Thermal management problems
- Random freezes/hangs
[drm:drm_sched_entity_push_job [gpu_sched]] *ERROR* Trying to push to a killed entity
Impact: GPU jobs are being terminated prematurely, indicating:
- GPU hang/recovery cycles
- Driver failing to properly queue work
- Potential memory management issues
workqueue: dm_irq_work_func [amdgpu] hogged CPU for >10000us 5 times
Impact: Display manager interrupt handling is taking too long, which can cause:
- Screen freezes
- Desktop compositor stuttering
- System responsiveness issues
Multiple amdgpu_tlb_fence_work entries in workqueue indicate memory management issues with the GPU's Translation Lookaside Buffer.
Ubuntu 25.10 uses kernel 6.17.0-6, which is very new and still in development. RDNA3 (Navi 32) support in this kernel version has stability issues:
- GPU scheduler regressions
- SMU firmware compatibility issues
- Memory management bugs
- Wayland Compositor Issues: KDE Plasma on Wayland with bleeding-edge amdgpu stack can trigger compositor hangs
- Mesa 25.2.3: Very recent Mesa release may have regressions not yet fixed
- Firmware Mismatch: SMU firmware interface version incompatibility
The most effective solution is to use a more stable kernel:
# Install a stable kernel (6.11 or 6.8 LTS)
sudo apt install linux-image-6.11.0-generic linux-headers-6.11.0-generic
# Or use the LTS kernel
sudo apt install linux-image-6.8.0-generic linux-headers-6.8.0-generic
# Reboot and select the older kernel from GRUB menu
sudo rebootAfter rebooting, verify you're on the stable kernel:
uname -rIf stable, you can remove the problematic 6.17 kernel:
sudo apt remove linux-image-6.17.0-6-generic linux-headers-6.17.0-6-genericAdd stability-improving kernel parameters to /etc/default/grub:
sudo nano /etc/default/grubModify the GRUB_CMDLINE_LINUX_DEFAULT line to include:
GRUB_CMDLINE_LINUX_DEFAULT="quiet splash amdgpu.ppfeaturemask=0xffffffff amdgpu.gpu_recovery=1 amdgpu.runpm=0"
Parameter explanations:
amdgpu.ppfeaturemask=0xffffffff- Enables all power play features (may help with SMU issues)amdgpu.gpu_recovery=1- Enables GPU hang detection and recoveryamdgpu.runpm=0- Disables runtime power management (reduces SMU-related hangs)
Update GRUB and reboot:
sudo update-grub
sudo rebootEnsure you have the latest AMD GPU firmware:
sudo apt update
sudo apt install --reinstall linux-firmware
sudo update-initramfs -u
sudo rebootIf Wayland is contributing to freezes:
- Log out of KDE Plasma
- At the login screen (SDDM), click the session selector (usually bottom-left)
- Select "Plasma (X11)" instead of "Plasma (Wayland)"
- Log in and test stability
If runtime power management is causing issues, create a udev rule:
sudo nano /etc/udev/rules.d/99-amdgpu-power.rulesAdd:
KERNEL=="card0", SUBSYSTEM=="drm", DRIVERS=="amdgpu", ATTR{device/power_dpm_force_performance_level}="high"
Reload udev:
sudo udevadm control --reload-rules
sudo udevadm triggerUbuntu 25.10 is still receiving updates. Monitor for:
- Kernel updates that fix RDNA3 issues
- Mesa updates with amdgpu stability fixes
- Firmware updates
Check for updates regularly:
sudo apt update && sudo apt upgradeAfter applying fixes, verify stability:
-
Check kernel messages for errors:
sudo dmesg | grep -i amdgpu | grep -i error
-
Monitor GPU status:
rocm-smi
-
Check for scheduler errors:
sudo journalctl -b | grep "killed entity"
-
Stress test GPU (optional):
# Run a GPU-intensive task to test stability glxgears -fullscreen # Or use a benchmark tool
- Best case: Downgrading to kernel 6.11 or 6.8 should eliminate freezes
- Good case: Kernel parameters + firmware update improve stability significantly
- Acceptable: X11 session provides stable environment until Wayland issues are resolved
- Ubuntu Kernel Team Bug Reports
- AMD GPU Firmware Repository
- Mesa 3D Graphics Library
- AMDGPU Driver Documentation
- Stay on LTS kernels for production systems requiring stability
- Test bleeding-edge kernels in VMs or on non-critical systems first
- Monitor AMD GPU mailing lists for known RDNA3 issues
- Consider filing a bug report with Ubuntu/kernel.org if issues persist on stable kernels
Note: This diagnosis was generated using Claude Code based on actual system logs from an affected system. GPU freezes began immediately after upgrading to Ubuntu 25.10. While the information provided here is based on real diagnostic data and established troubleshooting practices, users should validate these remediation steps in their specific environment and maintain proper backups before making system changes.
If you experience similar issues, please test the solutions in order (kernel downgrade first) and report your results to help the community.