VM di Proxmox VE: KERNEL PANIC

Ciao,

da qualche mese e in maniera del tutto imprevedibile una VM ospitata su PROXMOX VE si blocca con un Kernel Panic. L’unico modo per ripristinare il corretto funzionamento è quello di eseguire uno stop brutale e successivamente riavviare la VM.

Inizialmente ho pensato si trattasse di una caso isolato ma da distanza di 1 mese si è verificato nuovamente il problema: così ho indagato il problema.

La VM in questione è un server Linux Ubuntu quindi sono andato a leggere il log in /var/log/kernel.log . Questo è l’estratto:

Sep 23 16:40:13 agenda kernel: [1883749.471708] BUG: unable to handle kernel NULL pointer dereference at 000000a4
Sep 23 16:40:13 agenda kernel: [1883749.475356] IP: [<f824b89c>] e1000_clean_tx_irq+0xcc/0x320 [e1000]
Sep 23 16:40:13 agenda kernel: [1883749.475356] *pdpt = 000000003680c001 *pde = 0000000000000000
Sep 23 16:40:13 agenda kernel: [1883749.475356] Thread overran stack, or stack corrupted
Sep 23 16:40:13 agenda kernel: [1883749.475356] Oops: 0000 [#1] SMP
Sep 23 16:40:13 agenda kernel: [1883750.312055] last sysfs file: /sys/devices/pci0000:00/0000:00:1f.0/local_cpus
Sep 23 16:40:13 agenda kernel: [1883750.312055] Modules linked in: lp fbcon tileblit font bitblit softcursor i2c_piix4 vga16fb vgastate psmouse virtio_balloon serio_raw joydev shpchp parport usbhid hid floppy e1000
Sep 23 16:40:13 agenda kernel: [1883750.312055]
Sep 23 16:40:13 agenda kernel: [1883750.312055] Pid: 0, comm: swapper Not tainted (2.6.32-74-generic-pae #142-Ubuntu) Standard PC (i440FX + PIIX, 1996)
Sep 23 16:40:13 agenda kernel: [1883751.313929] EIP: 0060:[<f824b89c>] EFLAGS: 00010246 CPU: 0
Sep 23 16:40:13 agenda kernel: [1883751.313929] EIP is at e1000_clean_tx_irq+0xcc/0x320 [e1000]
Sep 23 16:40:13 agenda kernel: [1883751.313929] EAX: 00000000 EBX: 00000046 ECX: f6ccd460 EDX: f84db690
Sep 23 16:40:13 agenda kernel: [1883751.313929] ESI: f6efca40 EDI: f6ccd000 EBP: c07a3e60 ESP: c07a3df8
Sep 23 16:40:13 agenda kernel: [1883751.313929] DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068
Sep 23 16:40:13 agenda kernel: [1883751.313929] Process swapper (pid: 0, ti=c07a2000 task=c07b2c80 task.ti=c07a2000)
Sep 23 16:40:13 agenda kernel: [1883752.315550] Stack:
Sep 23 16:40:13 agenda kernel: [1883752.315550] 00174e3f f683b6e0 f683b5e0 f683b450 f683b000 ceea7f15 00174e3f c07a3e2c
Sep 23 16:40:13 agenda kernel: [1883752.315550] <0> c036af23 ceea7f15 00174e3f 00174e3f 00000001 c07a3e70 f683b000 ceea7f15
Sep 23 16:40:13 agenda kernel: [1883752.315550] <0> 00000076 00000001 f683b3a0 00000046 00000076 ffffff01 00000002 f683b3a0
Sep 23 16:40:13 agenda kernel: [1883753.317097] Call Trace:
Sep 23 16:40:13 agenda kernel: [1883753.317097] [<c036af23>] ? atomic64_cmpxchg+0x23/0x30
Sep 23 16:40:13 agenda kernel: [1883753.317097] [<f824bb1c>] ? e1000_clean+0x2c/0x210 [e1000]
Sep 23 16:40:13 agenda kernel: [1883753.317097] [<c04f665d>] ? net_rx_action+0xed/0x1d0
Sep 23 16:40:13 agenda kernel: [1883754.318764] [<c015bebc>] ? __do_softirq+0xac/0x1e0
Sep 23 16:40:13 agenda kernel: [1883754.318764] [<c012929b>] ? ack_apic_level+0x6b/0x170
Sep 23 16:40:13 agenda kernel: [1883754.318764] [<c01772be>] ? sched_clock_tick+0x5e/0xa0
Sep 23 16:40:13 agenda kernel: [1883754.318764] [<c015c035>] ? do_softirq+0x45/0x50
Sep 23 16:40:13 agenda kernel: [1883754.989141] [<c015c185>] ? irq_exit+0x65/0x70
Sep 23 16:40:13 agenda kernel: [1883754.989141] [<c05c6c75>] ? do_IRQ+0x55/0xc0
Sep 23 16:40:13 agenda kernel: [1883755.320249] [<c0109e30>] ? common_interrupt+0x30/0x40
Sep 23 16:40:13 agenda kernel: [1883755.320249] [<c013066a>] ? native_safe_halt+0xa/0x10
Sep 23 16:40:13 agenda kernel: [1883755.320249] [<c0110d00>] ? default_idle+0x40/0x90
Sep 23 16:40:13 agenda kernel: [1883755.320249] [<c0108444>] ? cpu_idle+0x94/0xd0
Sep 23 16:40:13 agenda kernel: [1883755.320249] [<c05adb72>] ? rest_init+0x62/0x70
Sep 23 16:40:13 agenda kernel: [1883755.320249] [<c07f690c>] ? start_kernel+0x356/0x35c
Sep 23 16:40:13 agenda kernel: [1883755.320249] [<c07f63e2>] ? unknown_bootoption+0x0/0x19e
Sep 23 16:40:13 agenda kernel: [1883756.321468] [<c07f60bb>] ? i386_start_kernel+0xaa/0xb1
Sep 23 16:40:13 agenda kernel: [1883756.321468] Code: 84 3d 01 00 00 8d 0c 5b c1 e1 03 83 45 f0 01 80 7d ec 00 75 42 8b 3e 8b 46 1c 3b 5d e4 8d 14 5b 8d 14 d0 0f 94 45 ec 75 b9 8b 02 <8b> 88 a4 00 00 00 0f b7 49 10 85 c9 74 8e 89 4d c8 8b 48 50 89
Sep 23 16:40:13 agenda kernel: [1883756.321468] EIP: [<f824b89c>] e1000_clean_tx_irq+0xcc/0x320 [e1000] SS:ESP 0068:c07a3df8
Sep 23 16:40:13 agenda kernel: [1883757.322865] CR2: 00000000000000a4
Sep 23 16:40:13 agenda kernel: [1883757.322865] BUG: scheduling while atomic: swapper/0/0x10000100
Sep 23 16:40:13 agenda kernel: [1883757.322865] Modules linked in: lp fbcon tileblit font bitblit softcursor i2c_piix4 vga16fb vgastate psmouse virtio_balloon serio_raw joydev shpchp parport usbhid hid floppy e1000
Sep 23 16:40:13 agenda kernel: [1883758.324260]
Sep 23 16:40:13 agenda kernel: [1883758.324260] Pid: 0, comm: swapper Not tainted (2.6.32-74-generic-pae #142-Ubuntu) Standard PC (i440FX + PIIX, 1996)
Sep 23 16:40:13 agenda kernel: [1883758.324260] EIP: 0060:[<c013066a>] EFLAGS: 00000246 CPU: 0
Sep 23 16:40:13 agenda kernel: [1883759.325426] EIP is at native_safe_halt+0xa/0x10
Sep 23 16:40:13 agenda kernel: [1883759.325426] EAX: c07a2000 EBX: c07f2da8 ECX: c2204a60 EDX: 00000000
Sep 23 16:40:13 agenda kernel: [1883759.325426] ESI: 00000000 EDI: c07af000 EBP: c07a3f7c ESP: c07a3f7c
Sep 23 16:40:13 agenda kernel: [1883759.325426] DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068
Sep 23 16:40:13 agenda kernel: [1883759.325426] CR0: 8005003b CR2: 000000a4 CR3: 36c7c000 CR4: 000006f0
Sep 23 16:40:13 agenda kernel: [1883759.325426] DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
Sep 23 16:40:13 agenda kernel: [1883759.325426] DR6: ffff0ff0 DR7: 00000400
Sep 23 16:40:13 agenda kernel: [1883760.326988] Call Trace:
Sep 23 16:40:13 agenda kernel: [1883760.326988] [<c0110d00>] default_idle+0x40/0x90
Sep 23 16:40:13 agenda kernel: [1883760.326988] [<c0108444>] cpu_idle+0x94/0xd0
Sep 23 16:40:13 agenda kernel: [1883760.326988] [<c05adb72>] rest_init+0x62/0x70
Sep 23 16:40:13 agenda kernel: [1883760.326988] [<c07f690c>] start_kernel+0x356/0x35c
Sep 23 16:40:13 agenda kernel: [1883760.326988] [<c07f63e2>] ? unknown_bootoption+0x0/0x19e
Sep 23 16:40:13 agenda kernel: [1883760.326988] [<c07f60bb>] i386_start_kernel+0xaa/0xb1
Sep 23 16:40:14 agenda kernel: [1883761.869221] ---[ end trace 16cd3d378cf96811 ]---
Sep 23 16:40:14 agenda kernel: [1883762.019210] Kernel panic - not syncing: Fatal exception in interrupt

Da una ricerca ho scoperto che a causare il problema è il Kernel di PROXMOX VE in particolare la versione 4.15.

La soluzione più ovvia è quella di eseguire un aggiornamento per avanzare di versione. A tale scopo ti rimando alla guida disponibile sul mio sito: Aggiornare Proxmox VE da 5.3 a 5.4 e Aggiornare Proxmox VE da 5.4 a 6.0