Lenovo Legion 7i Gen8 Kernel NULL pointer dereference

homunkulus

Hi,
I get strange crashes on my Lenovo Legion 7i Gen8 (Intel 13900HX, 32GB DDR5 5600MHz, NVidia RTX 4080).
The system is a NixOS running unstable using btrfs + LUKS on a Kingston FURY Renegade PCIe 4.0 NVMe M.2 SSD.

The crashes happen at random points in time: sometimes the system runs for several days without a crash, on other days it crashes a couple of seconds after I enter the LUKS password.

Kernel version is: Linux markusnix 6.7.6 #1-NixOS SMP PREEMPT_DYNAMIC Fri Feb 23 08:51:59 UTC 2024 x86_64 GNU/Linux


Journalctl gives me the following a bunch of times:

Code:
Mär 03 12:03:03 markusnix kernel: BUG: kernel NULL pointer dereference, address: 00000000000000eb
Mär 03 12:03:03 markusnix systemd[1]: Started Journal Service.
Mär 03 12:03:03 markusnix kernel: #PF: supervisor read access in kernel mode
Mär 03 12:03:03 markusnix kernel: #PF: error_code(0x0000) - not-present page
Mär 03 12:03:03 markusnix kernel: PGD 0 P4D 0
Mär 03 12:03:03 markusnix kernel: Oops: 0000 [#15] PREEMPT SMP NOPTI
Mär 03 12:03:03 markusnix kernel: CPU: 8 PID: 2793 Comm: systemd-journal Tainted: P      D    O       6.7.6 #1-NixOS
Mär 03 12:03:03 markusnix kernel: Hardware name: LENOVO 82WQ/LNVNB161216, BIOS KWCN42WW 09/15/2023
Mär 03 12:03:03 markusnix kernel: RIP: 0010:simple_xattr_get+0x31/0xa0
Mär 03 12:03:03 markusnix kernel: Code: 00 00 41 56 49 89 ce 41 55 4c 8d 6f 08 41 54 49 89 d4 55 48 89 f5 53 48 89 fb 4c 89 ef e8 e7 e9 72 00 48 8b 1b 48 85 db 74 1b <48> 8b 7b 18 48 89 ee e8 33 1c 70 00 85 c0 78 27 74 2b 48 8b 5b 08
Mär 03 12:03:03 markusnix kernel: RSP: 0018:ffff9ac283d33cf8 EFLAGS: 00010202
Mär 03 12:03:03 markusnix kernel: RAX: 0000000000001c00 RBX: 00000000000000d3 RCX: 0000000000000067
Mär 03 12:03:03 markusnix kernel: RDX: ffff915e099fca00 RSI: ffff9ac283d33dc8 RDI: ffff915e176ce5e0
Mär 03 12:03:03 markusnix kernel: RBP: ffff9ac283d33dc8 R08: ffff915e099fca00 R09: 0000000000000067
Mär 03 12:03:03 markusnix kernel: R10: ffff9ac283d33dc8 R11: 0000000000000000 R12: ffff915e099fca00
Mär 03 12:03:03 markusnix kernel: R13: ffff915e176ce5e0 R14: 0000000000000067 R15: ffff9ac283d33dc8
Mär 03 12:03:03 markusnix kernel: FS:  00007f45be8820c0(0000) GS:ffff91658d000000(0000) knlGS:0000000000000000
Mär 03 12:03:03 markusnix kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mär 03 12:03:03 markusnix kernel: CR2: 00000000000000eb CR3: 000000015148c000 CR4: 0000000000f50ef0
Mär 03 12:03:03 markusnix kernel: PKRU: 55555554
Mär 03 12:03:03 markusnix kernel: Call Trace:
Mär 03 12:03:03 markusnix kernel:  <TASK>
Mär 03 12:03:03 markusnix kernel:  ? __die+0x23/0x70
Mär 03 12:03:03 markusnix kernel:  ? page_fault_oops+0x17d/0x4b0
Mär 03 12:03:03 markusnix kernel:  ? exc_page_fault+0x72/0x160
Mär 03 12:03:03 markusnix kernel:  ? asm_exc_page_fault+0x26/0x30
Mär 03 12:03:03 markusnix kernel:  ? simple_xattr_get+0x31/0xa0
Mär 03 12:03:03 markusnix kernel:  ? simple_xattr_get+0x29/0xa0
Mär 03 12:03:03 markusnix kernel:  __vfs_getxattr+0x7f/0xb0
Mär 03 12:03:03 markusnix kernel:  do_getxattr+0xf0/0x180
Mär 03 12:03:03 markusnix kernel:  getxattr+0x91/0xd0
Mär 03 12:03:03 markusnix kernel:  path_getxattr+0x7b/0xd0
Mär 03 12:03:03 markusnix kernel:  do_syscall_64+0x44/0x100
Mär 03 12:03:03 markusnix kernel:  entry_SYSCALL_64_after_hwframe+0x6f/0x77
Mär 03 12:03:03 markusnix kernel: RIP: 0033:0x7f45bed20c8e
Mär 03 12:03:03 markusnix kernel: Code: 01 0d 00 f7 d8 64 89 02 ba ff ff ff ff eb d5 e8 f8 37 01 00 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 49 89 ca b8 c0 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 72 01 0d 00 f7 d8 64 89 01 48
Mär 03 12:03:03 markusnix kernel: RSP: 002b:00007ffed7f80038 EFLAGS: 00000246 ORIG_RAX: 00000000000000c0
Mär 03 12:03:03 markusnix kernel: RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007f45bed20c8e
Mär 03 12:03:03 markusnix kernel: RDX: 0000555c1412be80 RSI: 0000555c137c78f0 RDI: 0000555c1412bbe0
Mär 03 12:03:03 markusnix kernel: RBP: 0000555c1412be80 R08: 0000000000000070 R09: 0000000000000001
Mär 03 12:03:03 markusnix kernel: R10: 0000000000000067 R11: 0000000000000246 R12: 0000555c1412bbe0
Mär 03 12:03:03 markusnix kernel: R13: 0000555c137c78f0 R14: 0000000000000068 R15: 0000000000000007
Mär 03 12:03:03 markusnix kernel:  </TASK>
Mär 03 12:03:03 markusnix kernel: Modules linked in: nft_chain_nat xt_MASQUERADE nf_conntrack_netlink xfrm_user xfrm_algo xt_addrtype ccm af_packet cmac algif_hash algif_skcipher af_alg bnep msr xt_conntrack ip6t_rpfilter ipt_rpfilter xt_pkttype xt_LOG nf_log_syslog xt_tcpudp nft_compat nf_tables sch_fq_codel snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio uinput snd_sof_pci_intel_tgl snd_sof_intel_hda_common snd_soc_hdac_hda soundwire_intel snd_sof_intel_hda_mlink soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_hda_codec_hdmi snd_sof snd_sof_utils snd_hda_ext_core snd_soc_acpi_intel_match snd_soc_acpi soundwire_generic_allocation soundwire_bus iwlmvm r8153_ecm cdc_ether hid_multitouch usbnet i915 mac80211 nls_iso8859_1 ptp nls_cp437 pps_core libarc4 vfat fat btusb uvcvideo btrtl btintel snd_hda_scodec_tas2781_i2c btbcm intel_uncore_frequency videobuf2_vmalloc snd_soc_tas2781_fmwlib btmtk intel_uncore_frequency_common uvc videobuf2_memops intel_tcc_cooling snd_hda_intel videobuf2_v4l2 snd_soc_tas2781_comlib
Mär 03 12:03:03 markusnix kernel:  bluetooth drm_buddy snd_intel_dspcfg x86_pkg_temp_thermal snd_intel_sdw_acpi videodev snd_soc_core ttm cmdlinepart iTCO_wdt intel_powerclamp snd_hda_codec intel_pmc_bxt spi_nor iwlwifi drm_display_helper watchdog intel_rapl_msr joydev videobuf2_common mousedev processor_thermal_device_pci snd_compress snd_hda_core processor_thermal_device r8152 coretemp processor_thermal_wt_hint ac97_bus mtd mc processor_thermal_rfim snd_pcm_dmaengine crc32_pclmul snd_hwdep polyval_clmulni mii mei_pxp mei_hdcp cec processor_thermal_rapl polyval_generic snd_pcm ecdh_generic r8169 gf128mul pmt_telemetry intel_rapl_common intel_gtt ecc pmt_class ghash_clmulni_intel ucsi_acpi crc16 snd_timer intel_lpss_pci ideapad_laptop realtek typec_ucsi agpgart processor_thermal_wt_req i2c_i801 evdev tpm_crb sparse_keymap spi_intel_pci processor_thermal_power_floor mdio_devres intel_lpss rapl wmi_bmof nvidia_wmi_ec_backlight snd idma64 intel_cstate cfg80211 i2c_hid_acpi intel_uncore libphy typec platform_profile tpm_tis spi_intel i2c_smbus
Mär 03 12:03:03 markusnix kernel:  i2c_algo_bit soundcore roles virt_dma processor_thermal_mbox mac_hid intel_vsec rfkill crc8 i2c_hid mei_me int3403_thermal battery int340x_thermal_zone int3400_thermal tpm_tis_core intel_pmc_core serio_raw pinctrl_alderlake acpi_thermal_rel acpi_tad acpi_pad mei tiny_power_button ac button nvidia_drm(PO) drm_kms_helper nvidia_modeset(PO) video wmi nvidia_uvm(PO) nvidia(PO) ctr loop cpufreq_powersave xt_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 br_netfilter veth tun tap macvlan bridge stp llc kvm_intel kvm drm irqbypass fuse efi_pstore backlight configfs nfnetlink efivarfs dmi_sysfs ip_tables x_tables autofs4 dm_crypt aes_generic cbc encrypted_keys trusted asn1_encoder tee tpm rng_core hid_generic usbhid hid xhci_pci xhci_pci_renesas input_leds firmware_class led_class xhci_hcd atkbd nvme libps2 thunderbolt vivaldi_fmap usbcore sha512_ssse3 nvme_core sha512_generic sha256_ssse3 sha1_ssse3 aesni_intel t10_pi libaes crypto_simd cryptd crc64_rocksoft rtc_cmos crc64 crc_t10dif i8042
Mär 03 12:03:03 markusnix kernel:  crct10dif_generic crct10dif_pclmul usb_common crct10dif_common serio dm_mod dax btrfs blake2b_generic libcrc32c crc32c_generic crc32c_intel xor raid6_pq
Mär 03 12:03:03 markusnix kernel: CR2: 00000000000000eb
Mär 03 12:03:03 markusnix kernel: ---[ end trace 0000000000000000 ]---
Mär 03 12:03:03 markusnix kernel: RIP: 0010:simple_xattr_get+0x31/0xa0
Mär 03 12:03:03 markusnix kernel: Code: 00 00 41 56 49 89 ce 41 55 4c 8d 6f 08 41 54 49 89 d4 55 48 89 f5 53 48 89 fb 4c 89 ef e8 e7 e9 72 00 48 8b 1b 48 85 db 74 1b <48> 8b 7b 18 48 89 ee e8 33 1c 70 00 85 c0 78 27 74 2b 48 8b 5b 08
Mär 03 12:03:03 markusnix kernel: RSP: 0018:ffff9ac280bcfcf8 EFLAGS: 00010202
Mär 03 12:03:03 markusnix kernel: RAX: 0000000000000000 RBX: 00000000000000d3 RCX: 0000000000000067
Mär 03 12:03:03 markusnix kernel: RDX: ffff915e04f4bd80 RSI: ffff9ac280bcfdc8 RDI: ffff915e176ce5e0
Mär 03 12:03:03 markusnix kernel: RBP: ffff9ac280bcfdc8 R08: ffff915e04f4bd80 R09: 0000000000000067
Mär 03 12:03:03 markusnix kernel: R10: ffff9ac280bcfdc8 R11: 0000000000000000 R12: ffff915e04f4bd80
Mär 03 12:03:03 markusnix kernel: R13: ffff915e176ce5e0 R14: 0000000000000067 R15: ffff9ac280bcfdc8
Mär 03 12:03:03 markusnix kernel: FS:  00007f45be8820c0(0000) GS:ffff91658d000000(0000) knlGS:0000000000000000
Mär 03 12:03:03 markusnix kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mär 03 12:03:03 markusnix kernel: CR2: 00000000000000eb CR3: 000000015148c000 CR4: 0000000000f50ef0
Mär 03 12:03:03 markusnix kernel: PKRU: 55555554
Mär 03 12:03:03 markusnix kernel: note: systemd-journal[2793] exited with irqs disabled
Mär 03 12:03:03 markusnix kernel: note: systemd-journal[2793] exited with preempt_count 1
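Since the journal repeats this report many times, it can help to boil each one down to a few fields. The following is a minimal Python sketch, not an official tool: the function name `summarize_oopses`, the regular expressions, and the shortened `SAMPLE` text are illustrative assumptions, fitted only to the log lines shown in this thread. The taint letters follow the kernel's documented meanings (P: proprietary module loaded, D: the kernel has already oopsed, O: out-of-tree module loaded, E: unsigned module loaded).

```python
import re

# Patterns fitted to the oops lines quoted in this thread (assumption:
# other kernels or architectures may format these lines differently).
BUG_RE = re.compile(r"BUG: kernel NULL pointer dereference, address: ([0-9a-f]+)")
OOPS_RE = re.compile(r"Oops: [0-9a-f]+ \[#(\d+)\]")
CPU_RE = re.compile(r"Comm: (\S+) Tainted: (.*?)\s+(\d+\.\d+\.\d+\S*)")
RIP_RE = re.compile(r"RIP: 0010:([\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+")

TAINT_FLAGS = {
    "P": "proprietary module loaded",
    "D": "kernel has already oopsed",
    "O": "out-of-tree module loaded",
    "E": "unsigned module loaded",
}


def summarize_oopses(log_text):
    """Return one dict per 'BUG: kernel NULL pointer dereference' report."""
    reports = []
    current = None
    for line in log_text.splitlines():
        match = BUG_RE.search(line)
        if match:
            # A new report starts here; later lines fill in its details.
            current = {"address": int(match.group(1), 16)}
            reports.append(current)
            continue
        if current is None:
            continue
        match = OOPS_RE.search(line)
        if match:
            current["oops_number"] = int(match.group(1))
        match = CPU_RE.search(line)
        if match:
            current["comm"] = match.group(1)
            current["taint"] = match.group(2).replace(" ", "")
            current["kernel"] = match.group(3)
        match = RIP_RE.search(line)
        if match and "rip" not in current:
            # Keep only the first kernel-mode RIP (the faulting function).
            current["rip"] = match.group(1)
    return reports


# Shortened copy of the report above, used only as sample input.
SAMPLE = """\
kernel: BUG: kernel NULL pointer dereference, address: 00000000000000eb
kernel: Oops: 0000 [#15] PREEMPT SMP NOPTI
kernel: CPU: 8 PID: 2793 Comm: systemd-journal Tainted: P      D    O       6.7.6 #1-NixOS
kernel: RIP: 0010:simple_xattr_get+0x31/0xa0
kernel: RIP: 0033:0x7f45bed20c8e
"""

if __name__ == "__main__":
    for report in summarize_oopses(SAMPLE):
        flags = [TAINT_FLAGS.get(flag, "other flag") for flag in report["taint"]]
        print(report, flags)
```

To run it against a real log, the output of `journalctl -k` could be saved to a file and its contents passed to `summarize_oopses`. The `[#15]` counter shows this was already the fifteenth oops since boot, so the first report in the journal is usually the most informative one.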
 


markusnix kernel: BUG: kernel NULL pointer dereference
The system is a NixOS running unstable using btrfs + LUKS
G'day homunkulus, Welcome to Linux.org

Any PC running that particular setup is well outside my experience.

However, there will be those here who have experience in that area.

Good Luck.
 
I'm having a very similar issue, starting about a week ago. My journal only goes back to March 1st. I'm on Arch Linux, and I update almost daily.

I'm on a Legion Pro 7 Gen 8, with an AMD Ryzen 9 7945HX CPU and GeForce RTX 4080 GPU.

My journal looks like this when it happens:
Code:
Mar 08 08:02:33 zach-laptop kernel: BUG: kernel NULL pointer dereference, address: 00000000000000e8
Mar 08 08:02:33 zach-laptop kernel: #PF: supervisor read access in kernel mode
Mar 08 08:02:33 zach-laptop kernel: #PF: error_code(0x0000) - not-present page
Mar 08 08:02:33 zach-laptop kernel: PGD 45ca16067 P4D 45ca16067 PUD 0
Mar 08 08:02:33 zach-laptop kernel: Oops: 0000 [#1] PREEMPT SMP NOPTI
Mar 08 08:02:33 zach-laptop kernel: CPU: 20 PID: 839 Comm: kworker/20:3 Tainted: P           OE      6.7.8-arch1-1 #1 dafab5364db1ad2baad864a7454abc1e303295d4
Mar 08 08:02:33 zach-laptop kernel: Hardware name: LENOVO 82WS/LNVNB161216, BIOS LPCN45WW 08/02/2023
Mar 08 08:02:33 zach-laptop kernel: Workqueue: cgroup_destroy css_free_rwork_fn
Mar 08 08:02:33 zach-laptop kernel: RIP: 0010:rb_first+0xf/0x30
Mar 08 08:02:33 zach-laptop kernel: Code: 10 e9 40 20 18 00 0f 1f 44 00 00 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 f3 0f 1e fa 48 8b 07 48 85 c0 74 14 48 89 c2 <48> 8b 40 10 48 85 c0 75 f4 48 89 d0 e9 0b 20 18 00 31 d2 eb f4 66
Mar 08 08:02:33 zach-laptop kernel: RSP: 0018:ffffa4ac47adbdd8 EFLAGS: 00010206
Mar 08 08:02:33 zach-laptop kernel: RAX: 00000000000000d8 RBX: ffff8cfbd9b3e780 RCX: 0000000000400020
Mar 08 08:02:33 zach-laptop kernel: RDX: 00000000000000d8 RSI: 0000000000000000 RDI: ffff8cfac1ddfee8
Mar 08 08:02:33 zach-laptop kernel: RBP: ffff8cfaf10d2900 R08: 0000000000000000 R09: 0000000000400020
Mar 08 08:02:33 zach-laptop kernel: R10: ffff8cfb7570e340 R11: fefefefefefefeff R12: 0000000000000000
Mar 08 08:02:33 zach-laptop kernel: R13: ffff8cfac1ddfee8 R14: ffff8cfb49444000 R15: ffff8cfb49444090
Mar 08 08:02:33 zach-laptop kernel: FS:  0000000000000000(0000) GS:ffff8d01cdb00000(0000) knlGS:0000000000000000
Mar 08 08:02:33 zach-laptop kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 08 08:02:33 zach-laptop kernel: CR2: 00000000000000e8 CR3: 0000000225e52000 CR4: 0000000000f50ef0
Mar 08 08:02:33 zach-laptop kernel: PKRU: 55555554
Mar 08 08:02:33 zach-laptop kernel: Call Trace:
Mar 08 08:02:33 zach-laptop kernel:  <TASK>
Mar 08 08:02:33 zach-laptop kernel:  ? __die+0x23/0x70
Mar 08 08:02:33 zach-laptop kernel:  ? page_fault_oops+0x171/0x4e0
Mar 08 08:02:33 zach-laptop kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Mar 08 08:02:33 zach-laptop kernel:  ? __update_load_avg_cfs_rq+0x26c/0x2e0
Mar 08 08:02:33 zach-laptop kernel:  ? exc_page_fault+0x7f/0x180
Mar 08 08:02:33 zach-laptop kernel:  ? asm_exc_page_fault+0x26/0x30
Mar 08 08:02:33 zach-laptop kernel:  ? rb_first+0xf/0x30
Mar 08 08:02:33 zach-laptop kernel:  simple_xattrs_free+0x29/0x90
Mar 08 08:02:33 zach-laptop kernel:  kernfs_put.part.0+0x60/0x150
Mar 08 08:02:33 zach-laptop kernel:  css_free_rwork_fn+0x131/0x430
Mar 08 08:02:33 zach-laptop kernel:  process_one_work+0x178/0x350
Mar 08 08:02:33 zach-laptop kernel:  worker_thread+0x30f/0x450
Mar 08 08:02:33 zach-laptop kernel:  ? __pfx_worker_thread+0x10/0x10
Mar 08 08:02:33 zach-laptop kernel:  kthread+0xe5/0x120
Mar 08 08:02:33 zach-laptop kernel:  ? __pfx_kthread+0x10/0x10
Mar 08 08:02:33 zach-laptop kernel:  ret_from_fork+0x31/0x50
Mar 08 08:02:33 zach-laptop kernel:  ? __pfx_kthread+0x10/0x10
Mar 08 08:02:33 zach-laptop kernel:  ret_from_fork_asm+0x1b/0x30
Mar 08 08:02:33 zach-laptop kernel:  </TASK>
Mar 08 08:02:33 zach-laptop kernel: Modules linked in: xt_conntrack xt_MASQUERADE nf_conntrack_netlink xfrm_user xfrm_algo iptable_nat xt_addrtype iptable_filter br_netfilter bridge stp llc nls_utf8 cifs cifs_arc4 nls_ucs2_utils rdma_cm iw_cm ib_cm ib_core cifs_md4 dns_resolver fscache netfs rfcomm snd_seq_dummy snd_hrtimer snd_seq cmac algif_hash algif_skcipher af_alg bnep overlay nft_redir nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nf_tables libcrc32c uvcvideo videobuf2_vmalloc uvc videobuf2_memops snd_usb_audio videobuf2_v4l2 snd_usbmidi_lib snd_ump videodev snd_rawmidi videobuf2_common snd_seq_device mc joydev intel_rapl_msr mousedev intel_rapl_common snd_sof_amd_acp63 snd_sof_amd_vangogh snd_sof_amd_rembrandt snd_sof_amd_renoir snd_sof_amd_acp snd_sof_pci kvm_amd snd_sof_xtensa_dsp mt7921e snd_hda_codec_realtek mt7921_common snd_sof snd_hda_codec_generic kvm mt792x_lib snd_sof_utils ledtrig_audio snd_pci_ps mt76_connac_lib snd_hda_codec_hdmi snd_rpl_pci_acp6x irqbypass snd_acp_pci crct10dif_pclmul mt76 vfat
Mar 08 08:02:33 zach-laptop kernel:  snd_hda_scodec_tas2781_i2c snd_acp_legacy_common crc32_pclmul fat btusb snd_soc_tas2781_fmwlib snd_hda_intel snd_pci_acp6x polyval_clmulni snd_soc_tas2781_comlib btrtl snd_pci_acp5x snd_intel_dspcfg polyval_generic mac80211 btintel snd_rn_pci_acp3x gf128mul snd_intel_sdw_acpi snd_soc_core ghash_clmulni_intel btbcm snd_acp_config snd_hda_codec snd_soc_acpi sha512_ssse3 nvidia_drm(POE) btmtk libarc4 snd_compress usbhid ccp snd_pci_acp3x nvidia_modeset(POE) snd_hda_core sha256_ssse3 ac97_bus sha1_ssse3 snd_pcm_dmaengine snd_hwdep bluetooth cfg80211 r8169 aesni_intel snd_pcm ucsi_acpi ideapad_laptop realtek hid_multitouch sp5100_tco typec_ucsi mdio_devres snd_timer crypto_simd sparse_keymap hid_generic ecdh_generic platform_profile wmi_bmof cryptd rapl wdat_wdt typec pcspkr snd video k10temp i2c_piix4 libphy rfkill roles soundcore wmi i2c_hid_acpi crc8 i2c_hid psmouse mac_hid squashfs vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) sg nvidia_uvm(POE) nvidia(POE) i2c_dev crypto_user fuse loop dm_mod nfnetlink
Mar 08 08:02:33 zach-laptop kernel:  ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 serio_raw atkbd libps2 vivaldi_fmap nvme crc32c_intel nvme_core xhci_pci xhci_pci_renesas i8042 nvme_auth serio
Mar 08 08:02:33 zach-laptop kernel: CR2: 00000000000000e8
Mar 08 08:02:33 zach-laptop kernel: ---[ end trace 0000000000000000 ]---
Mar 08 08:02:33 zach-laptop kernel: RIP: 0010:rb_first+0xf/0x30
Mar 08 08:02:33 zach-laptop kernel: Code: 10 e9 40 20 18 00 0f 1f 44 00 00 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 f3 0f 1e fa 48 8b 07 48 85 c0 74 14 48 89 c2 <48> 8b 40 10 48 85 c0 75 f4 48 89 d0 e9 0b 20 18 00 31 d2 eb f4 66
Mar 08 08:02:33 zach-laptop kernel: RSP: 0018:ffffa4ac47adbdd8 EFLAGS: 00010206
Mar 08 08:02:33 zach-laptop kernel: RAX: 00000000000000d8 RBX: ffff8cfbd9b3e780 RCX: 0000000000400020
Mar 08 08:02:33 zach-laptop kernel: RDX: 00000000000000d8 RSI: 0000000000000000 RDI: ffff8cfac1ddfee8
Mar 08 08:02:33 zach-laptop kernel: RBP: ffff8cfaf10d2900 R08: 0000000000000000 R09: 0000000000400020
Mar 08 08:02:33 zach-laptop kernel: R10: ffff8cfb7570e340 R11: fefefefefefefeff R12: 0000000000000000
Mar 08 08:02:33 zach-laptop kernel: R13: ffff8cfac1ddfee8 R14: ffff8cfb49444000 R15: ffff8cfb49444090
Mar 08 08:02:33 zach-laptop kernel: FS:  0000000000000000(0000) GS:ffff8d01cdb00000(0000) knlGS:0000000000000000
Mar 08 08:02:33 zach-laptop kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 08 08:02:33 zach-laptop kernel: CR2: 00000000000000e8 CR3: 0000000225e52000 CR4: 0000000000f50ef0
Mar 08 08:02:33 zach-laptop kernel: PKRU: 55555554
Mar 08 08:02:33 zach-laptop kernel: note: kworker/20:3[839] exited with irqs disabled
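One detail both traces share can be checked with a little arithmetic. In the `Code:` line, the byte in angle brackets marks the start of the faulting instruction. In both reports it is a 64-bit `mov` with an 8-bit displacement, and the base register plus that displacement equals the fault address in CR2. The Python sketch below decodes only this one instruction form; the helper names `decode_mov_disp8` and `fault_address` are made up for illustration, and this is in no way a general disassembler.

```python
REGS = ["rax", "rcx", "rdx", "rbx", "rsp", "rbp", "rsi", "rdi"]


def decode_mov_disp8(insn):
    """Decode 'REX.W 8B /r' with mod == 01: mov disp8(%base), %dest."""
    rex, opcode, modrm, disp = insn[:4]
    if rex != 0x48 or opcode != 0x8B or (modrm >> 6) != 0b01:
        raise ValueError("not a 64-bit mov with an 8-bit displacement")
    rm = modrm & 0b111
    if rm == 0b100:
        raise ValueError("SIB addressing is not handled in this sketch")
    dest = REGS[(modrm >> 3) & 0b111]
    base = REGS[rm]
    # The displacement byte is signed.
    if disp >= 0x80:
        disp -= 0x100
    return dest, base, disp


def fault_address(insn, registers):
    """Address the instruction reads, given the register dump."""
    _dest, base, disp = decode_mov_disp8(insn)
    return registers[base] + disp


# Bytes after the <48> marker and register values, copied from the two logs.
TRACES = [
    ("simple_xattr_get", bytes.fromhex("488b7b18"), {"rbx": 0xD3}, 0xEB),
    ("rb_first", bytes.fromhex("488b4010"), {"rax": 0xD8}, 0xE8),
]

if __name__ == "__main__":
    for name, insn, regs, cr2 in TRACES:
        dest, base, disp = decode_mov_disp8(insn)
        addr = fault_address(insn, regs)
        print(f"{name}: mov {disp:#x}(%{base}),%{dest} reads {addr:#x}, CR2 is {cr2:#x}")
```

In both cases the base register is not NULL but a small bogus value (0xd3 and 0xd8), and both traces pass through the simple xattr code. That might point to a corrupted tree node pointer rather than a plain missing NULL check, but this is only a reading of the register dumps, not a confirmed diagnosis.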

I'm probably going to explore bisecting my kernel version or whatever else might have updated recently. In other searches for this issue, Nvidia or nouveau seem to crop up a bit, though that might be a red herring. I might also try using only my on-board GPU, without Nvidia, and see if it persists. It happens to me about twice a day, maybe more, and primarily when I do CPU-intensive operations such as compiling programming projects with many things happening in parallel.
 
It's difficult, in that other thread, to even know if it's the same problem, as his photos of his monitor cut off after the start of the error each time. It doesn't seem to me to match, though.
 
It's actually part of the journal output above, but it's "6.7.8-arch1-1". It would have been "6.7.6-arch1-1" as of March 1st, the earliest date I know for sure this problem occurred (although due to journal rotation, it might have started a few days earlier).

Since my earlier message, I've been using the onboard GPU exclusively and running a bunch of heavy CPU load stuff, so far without issues. We'll see how that continues through the day.
 
@zachcallear
I suspect either some incompatibility with the Nvidia driver or otherwise a kernel bug.

I'm running the 6.5.0 kernel with the latest Nvidia driver and have no issues.
Other than this, I don't know.
 
As a research note, it has happened to me twice today while using the onboard, non-Nvidia GPU (the laptop UEFI setup set in switchable graphics mode, without switching to the Nvidia GPU).
 
I still think it has something to do with the NVidia GPU. Now that I am using driver 545 instead of the current 550 line, together with Firefox, the system runs stable (Plasma 5).

The random crashes appeared with the 550 NVidia driver, which is currently the default driver line. There is a thread about laptops freezing when using the 550 driver, but so far no solution to it. Someone said the problem does not happen with the NVidia 550 Vulkan Dev driver, but I have not had time to test that yet.

But I had already been getting crashes for a long time whenever I was using VSCode with a YouTube video playing in the background in a Chrome-based browser. The crash looks like some kind of system overload: the sound starts to stutter, I can still move the mouse jerkily for maybe one second, then everything freezes and I have to press the power button.
This doesn't happen when I use Firefox, only in a Chrome-based browser, and I think only on YouTube.
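For anyone comparing notes, a short Python sketch can report which NVidia driver branch is loaded. It assumes the proprietary driver exposes its version in `/proc/driver/nvidia/version`; the function names, the `EXAMPLE` string and the `THREAD_REPORTS` table are illustrative, and the table only restates what posters in this thread observed, which is anecdotal.

```python
import re
from pathlib import Path

VERSION_FILE = Path("/proc/driver/nvidia/version")

# Anecdotal results from this thread only, not an authoritative list.
THREAD_REPORTS = {
    550: "random crashes reported",
    545: "reported stable",
    535: "reported stable",
}


def nvidia_branch(version_text):
    """Return the major driver branch (e.g. 550), or None if not found."""
    match = re.search(r"(\d+)\.(\d+)(?:\.\d+)?", version_text)
    return int(match.group(1)) if match else None


def describe(version_text):
    branch = nvidia_branch(version_text)
    if branch is None:
        return "no NVidia driver version found"
    note = THREAD_REPORTS.get(branch, "not mentioned in this thread")
    return f"driver branch {branch}: {note}"


# Illustrative sample text; the real file contents may differ in detail.
EXAMPLE = "NVRM version: NVIDIA UNIX x86_64 Kernel Module  550.54.14  (example)"

if __name__ == "__main__":
    if VERSION_FILE.exists():
        print(describe(VERSION_FILE.read_text()))
    else:
        print("driver not loaded; sample:", describe(EXAMPLE))
```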
 
There is a thread about laptop freezing when using the 550 driver, but so far no solution to it.
There are so many unresolved threads online it's impossible to count them, but there are always other similar threads that either provide a solution or hints as to what the problem is.

All it takes is dedication to troubleshoot the problem and readiness to radically change things on your PC. I sometimes spend an entire day or even multiple days fixing an issue, googling like crazy. Everything is fixable; you just have to keep trying and not give up.

From my experience helping people in forums, in 90% of the difficult cases people lose patience, give up, no longer want to follow advice and stop responding to their thread. Most people care only about the solution, not about what the problem is.

I'm not saying you're one of those ofc. I'm just saying how I see it.
 
I know what you mean, but lack of time is also a problem, and I cannot really afford much downtime on a machine I use for my income.

Unfortunately, according to the thread I mentioned, the Vulkan driver did not fix the freezes either.
 
If there are no crashes with driver 545, then OK, you've found out what the problem is, but I find it hard to believe because I'm using 550 and have no issues.
 
I finally downgraded to Nvidia driver 545 this morning, and I didn't have any lockups the whole day. I would have normally had two or three. I'm also on the latest Arch version of Plasma, but I use Brave/Chromium and not Firefox.
 
I am now on the 535 "production" driver, as NixOS no longer offers 545 and the latest 545 version doesn't seem to work with the new kernel 6.8.
535 runs stable, also in KDE6, except when I log out in Wayland: SDDM does not come up again, but I can still use the TTY to reboot. So that is the driver I am using currently.

My Chrome+YouTube issues with 545 seem to be unrelated to the crashes people observe with the 550 series. I need to check whether they also happen with the 535 driver.
 
I was hoping to find a link to an upstream bug report even though I have filed one on my distro's bugzilla.

I'm experiencing frequent crashes due to this bug and the crashes revolve around xattr functions.

First observed in kernel 6.7.6, still present in 6.8.1.

What triggers it for me is the wonky dc jack on my portable - yes I should get a new one, but it shouldn't crash either just because of a few quick AC<->BAT events in succession.

Anyhow, thought it might help you narrow it down/be aware of what triggers it seeing how the reports come from folks using laptops.
 
Thanks for adding some specifics. I've still been having my machine work great on Nvidia 535. I had truly tested 545 previously, with success, but there's an Arch AUR package for 535, so it's easier to use. I haven't had the time to drill down into the exact cause or research kernel bug reports. I was hoping that by at least getting things out there in a forum post it might help searchers.
 
I just got a reply to my issue on my distro's bugtracker, word is this commit fixes it.

I've only been rocking 6.8.6 for a few hours, but it looks promising.

Cheers
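A quick way to see where a machine stands relative to the versions mentioned in this thread is to compare the running kernel release against 6.8.6. This is only a sketch: the helper names are made up, and 6.8.6 is simply the version reported above as looking promising, not a confirmed fix boundary. Nothing here says whether other stable series carry the same commit.

```python
import platform
import re


def parse_kernel_release(release):
    """Turn '6.7.8-arch1-1' into (6, 7, 8); a missing patch level counts as 0."""
    match = re.match(r"(\d+)\.(\d+)(?:\.(\d+))?", release)
    if not match:
        raise ValueError(f"unrecognised kernel release: {release!r}")
    return tuple(int(part or 0) for part in match.groups())


def at_least(release, minimum=(6, 8, 6)):
    """True if the release is at or above the given (major, minor, patch)."""
    return parse_kernel_release(release) >= minimum


if __name__ == "__main__":
    running = platform.release()
    status = "at or above" if at_least(running) else "below"
    print(f"running kernel {running} is {status} 6.8.6")
```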
 
