[elrepo] Kernel Crash with wireguard module

Jens Kuehnel elrepo-general-ml at jens.kuehnel.org
Sat Jan 14 16:12:37 EST 2023


Hi all,

I have a strange problem. I run a Server with mutiples VMs with RHEL8 
(Developer subscription) and some elrepo modules.

I run wireguard with this rpm:
kmod-wireguard-1.0.20220627-3.el8_7.elrepo.x86_64

Every thing works fine with 4.18.0-425.3.1.el8.x86_64, but after update 
to kernel-4.18.0-425.10.1.el8_7.x86_64 I got after about 20 seconds:

* 100% CPU load with a kworker
* cpu soft lockup

and about 30-60 second after that the system hangs, no ssh, no console, 
only ping works.

The dmesg output at the end of this mail.

When I disable wireguard everything works fine. So it is the wireguard 
module. I run at the moment the 425.3.1 kernel again, because wireguard 
is important.

Can another recompile of the kernel help, has anyone the same problem, 
or is this a uniq problem with my hardware?

Thanks for the info.

Greeting from Frankfurt Germany.
CU
Jens Kühnel


------------------------------------------------------------
[   85.035861] wireguard: WireGuard 1.0.20220627 loaded. See 
www.wireguard.com for information.
[   85.035868] wireguard: Copyright (C) 2015-2019 Jason A. Donenfeld 
<Jason at zx2c4.com>. All Rights Reserved.
[  112.088711] watchdog: BUG: soft lockup - CPU#5 stuck for 22s! 
[kworker/5:1:81]
[  112.088718] Modules linked in: xt_CHECKSUM wireguard ip6_udp_tunnel 
udp_tunnel binfmt_misc br_netfilter bridge stp llc xt_physdev ipt_REJECT 
nf_reject_ipv4 nft_counter xt_LOG nf_log_syslog ip6t_REJECT 
nft_chain_nat nf_reject_ipv6 ipt_MASQUERADE nf_nat xt_conntrack 
nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nft_compat nf_tables 
libcrc32c nfnetlink sunrpc vfat fat intel_rapl_msr intel_rapl_common 
x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm mei_wdt 
iTCO_wdt irqbypass iTCO_vendor_support rapl intel_cstate intel_uncore 
pcspkr wmi mei_me intel_pch_thermal mei acpi_pad i2c_i801 ie31200_edac 
intel_pmc_core ext4 mbcache jbd2 dm_crypt raid1 sd_mod t10_pi sg i915 
i2c_algo_bit cec intel_gtt drm_buddy drm_dp_helper drm_kms_helper 
syscopyarea sysfillrect sysimgblt crct10dif_pclmul fb_sys_fops ttm 
crc32_pclmul crc32c_intel ahci libahci e1000e drm libata 
ghash_clmulni_intel serio_raw video dm_mirror dm_region_hash dm_log 
dm_mod ftsteutates(O) fuse
[  112.088835] CPU: 5 PID: 81 Comm: kworker/5:1 Tainted: G          IO 
   --------- -  - 4.18.0-425.10.1.el8_7.x86_64 #1
[  112.088839] Hardware name: FUJITSU D3417-B1/D3417-B1, BIOS V5.0.0.11 
R1.28.0.SR.1 for D3417-B1x               07/25/2019
[  112.088842] Workqueue: events_power_efficient 
wg_ratelimiter_gc_entries [wireguard]
[  112.088851] RIP: 0010:native_queued_spin_lock_slowpath+0x5f/0x1c0
[  112.088856] Code: 71 f0 0f ba 2f 08 0f 92 c0 0f b6 c0 c1 e0 08 89 c2 
8b 07 30 e4 09 d0 a9 00 01 ff ff 75 4b 85 c0 74 0e 8b 07 84 c0 74 08 f3 
90 <8b> 07 84 c0 75 f8 b8 01 00 00 00 66 89 07 e9 1e b7 aa 00 8b 37 81
[  112.088861] RSP: 0018:ffffb780c657be58 EFLAGS: 00000202 ORIG_RAX: 
ffffffffffffff13
[  112.088864] RAX: 0000000000000101 RBX: ffffffffc0f05160 RCX: 
ffffffffb07b9a40
[  112.088866] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 
ffffffffc0f05fb8
[  112.088869] RBP: 00000013e2e12d11 R08: ffffffffb07b9ae0 R09: 
0000746e65696369
[  112.088871] R10: 8080808080808080 R11: 0000000000000018 R12: 
dead000000000200
[  112.088873] R13: 0000000000000001 R14: ffff98bf06e6b780 R15: 
0000000000000001
[  112.088876] FS:  0000000000000000(0000) GS:ffff98cdee540000(0000) 
knlGS:0000000000000000
[  112.088878] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  112.088881] CR2: 00007f211802e240 CR3: 00000004eec10001 CR4: 
00000000003706e0
[  112.088883] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 
0000000000000000
[  112.088885] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 
0000000000000400
[  112.088888] Call Trace:
[  112.088890]  _raw_spin_lock+0x1e/0x30
[  112.088894]  wg_ratelimiter_gc_entries+0x49/0x170 [wireguard]
[  112.088901]  process_one_work+0x1a7/0x360
[  112.088904]  ? create_worker+0x1a0/0x1a0
[  112.088907]  worker_thread+0x30/0x390
[  112.088909]  ? create_worker+0x1a0/0x1a0
[  112.088911]  kthread+0x10b/0x130
[  112.088915]  ? set_kthread_struct+0x50/0x50
[  112.088918]  ret_from_fork+0x1f/0x40

Message from syslogd at vmhost at Jan 14 21:24:04 ...
  kernel:watchdog: BUG: soft lockup - CPU#5 stuck for 22s! [kworker/5:1:81]
[  140.088627] watchdog: BUG: soft lockup - CPU#5 stuck for 22s! 
[kworker/5:1:81]


More information about the elrepo mailing list