[LTP] ltp/read_all_sys (read_all -d /sys -q -r 10) cause system panic with kernel-4.18.0-rc1
    Rafael David Tinoco 
    rafael.tinoco@linaro.org
       
    Thu Jul  5 12:54:38 CEST 2018
    
    
  
We've faced similar issue in Bug:
https://bugs.linaro.org/show_bug.cgi?id=3903
And it turned out to be a regmap area declared wrongly (likely) for a
PMIC device, caught when trying to read "registers" file from debugfs
in that particular device. Best way to identify the issue is to check
which file in /sys is causing the NULL dereference:
for file in $(find /sys -type f -print); do echo $file; sudo cat $file
> /dev/null; done
when being read (and check how it is implementing the VFS proxy fops
read function).
On Thu, Jul 5, 2018 at 2:50 AM Li Wang <liwang@redhat.com> wrote:
>
> Hi there,
>
> The problem is still in kernel-v4.18-rc3.
>
> Panic was caused by command:
>   # ./read_all  -d /sys -q -r 10
>
>
> Or, you can try the full steps to reproduce:
>
> # git clone https://github.com/linux-test-project/ltp/
> # cd ltp
> # ./build.sh
> # cd ../ltp-install
> # ./runltp -s read_all_sys
>
>
> dmesg log:
> -----------------------
> # dmesg |grep qla2
> [    4.764969] qla2xxx [0000:00:00.0]-0005: : QLogic Fibre Channel HBA
> Driver: 10.00.00.07-k.
> [    4.775414] qla2xxx [0000:04:00.0]-001a: : MSI-X vector count: 32.
> [    4.783705] qla2xxx [0000:04:00.0]-001d: : Found an ISP2532 irq 27
> iobase 0x(____ptrval____).
> [    4.793972] qla2xxx [0000:04:00.0]-00c6:4: MSI-X: Failed to enable
> support with 32 vectors, using 10 vectors.
> [    5.214159] scsi host4: qla2xxx
> [    5.214514] qla2xxx [0000:04:00.0]-00fb:4: QLogic QLE2562 -
> PCI-Express Dual Channel 8Gb Fibre Channel HBA.
> [    5.214520] qla2xxx [0000:04:00.0]-00fc:4: ISP2532: PCIe (5.0GT/s
> x8) @ 0000:04:00.0 hdma+ host#=4 fw=8.07.00 (90d5).
> [    5.214622] qla2xxx [0000:04:00.1]-001a: : MSI-X vector count: 32.
> [    5.214624] qla2xxx [0000:04:00.1]-001d: : Found an ISP2532 irq 57
> iobase 0x(____ptrval____).
> [    5.215149] qla2xxx [0000:04:00.1]-00c6:5: MSI-X: Failed to enable
> support with 32 vectors, using 10 vectors.
> [    5.631289] scsi host5: qla2xxx
> [    5.635270] qla2xxx [0000:04:00.1]-00fb:5: QLogic QLE2562 -
> PCI-Express Dual Channel 8Gb Fibre Channel HBA.
> [    5.646156] qla2xxx [0000:04:00.1]-00fc:5: ISP2532: PCIe (5.0GT/s
> x8) @ 0000:04:00.1 hdma+ host#=5 fw=8.07.00 (90d5).
> [   26.393233] qla2xxx [0000:04:00.0]-8038:4: Cable is unplugged...
> [   26.841214] qla2xxx [0000:04:00.1]-8038:5: Cable is unplugged...
>
>
> ---------------------
> [ 8162.297286] BUG: unable to handle kernel NULL pointer dereference
> at 00000000000000b8
> [ 8162.306027] PGD 800000042a986067 P4D 800000042a986067 PUD 42a990067 PMD 0
> [ 8162.313695] Oops: 0000 [#1] SMP PTI
> [ 8162.317588] CPU: 7 PID: 1677 Comm: read_all Tainted: G          I
>     4.18.0-rc3+ #1
> [ 8162.326422] Hardware name: FUJITSU
> PRIMERGY RX200 S6             /D3031, BIOS 6.00 Rev. 1.10.3031
>      01/20/2012
> [ 8162.341096] RIP: 0010:qla_dfs_tgt_counters_show+0x92/0x2a0 [qla2xxx]
> [ 8162.348176] Code: b6 86 22 01 00 00 66 85 c0 74 63 83 e8 01 4c 8b
> 9e b8 00 00 00 31 f6 0f b7 c0 48 8d 3c c5 08 00 00 00 49 8b 04 33 48
> 83 c6 08 <48> 03 90 b8 00 00 00 48 03 88 c0 00 00 00 4c 03 80 c8 00 00
> 00 4c
> [ 8162.369260] RSP: 0018:ffffadee829d7d60 EFLAGS: 00010202
> [ 8162.375088] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
> [ 8162.383049] RDX: 0000000000000000 RSI: 0000000000000008 RDI: 0000000000000040
> [ 8162.391010] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
> [ 8162.398973] R10: ffff893be3dce798 R11: ffff893be3a004c0 R12: 0000000000000000
> [ 8162.406935] R13: 0000000000000000 R14: ffff893be5006f00 R15: 0000000000000000
> [ 8162.414898] FS:  00007f7611b68740(0000) GS:ffff893bffdc0000(0000)
> knlGS:0000000000000000
> [ 8162.423926] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [ 8162.430337] CR2: 00000000000000b8 CR3: 000000042a98e006 CR4: 00000000000206e0
> [ 8162.438298] Call Trace:
> [ 8162.441031]  ? _cond_resched+0x15/0x30
> [ 8162.445215]  ? __kmalloc_node+0x1cd/0x280
> [ 8162.449690]  ? seq_read+0x33e/0x3f0
> [ 8162.453581]  seq_read+0x120/0x3f0
> [ 8162.457280]  full_proxy_read+0x50/0x70
> [ 8162.461462]  __vfs_read+0x36/0x190
> [ 8162.465257]  vfs_read+0x87/0x130
> [ 8162.468855]  ksys_read+0x52/0xc0
> [ 8162.472456]  do_syscall_64+0x5b/0x180
> [ 8162.476541]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
> [ 8162.482177] RIP: 0033:0x7f76117476d0
> [ 8162.486163] Code: 73 01 c3 48 8b 0d d8 88 20 00 f7 d8 64 89 01 48
> 83 c8 ff c3 66 0f 1f 44 00 00 83 3d 19 cd 20 00 00 75 10 b8 00 00 00
> 00 0f 05 <48> 3d 01 f0 ff ff 73 31 c3 48 83 ec 08 e8 ee fc ff ff 48 89
> 04 24
> [ 8162.507237] RSP: 002b:00007fff35e015f8 EFLAGS: 00000246 ORIG_RAX:
> 0000000000000000
> [ 8162.515685] RAX: ffffffffffffffda RBX: 0000000000000003 RCX: 00007f76117476d0
> [ 8162.523647] RDX: 00000000000003ff RSI: 00007fff35e01ab0 RDI: 0000000000000003
> [ 8162.531607] RBP: 0000000000001c0c R08: 0000000000000000 R09: 00007fff35e01550
> [ 8162.539569] R10: 0000000000000000 R11: 0000000000000246 R12: 00007f7611b59000
> [ 8162.547530] R13: 00007fff35e01ab0 R14: 0000000000000028 R15: 0000000000000030
> [ 8162.555493] Modules linked in: sunrpc intel_powerclamp coretemp
> kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul
> ghash_clmulni_intel ipmi_ssif pcbc aesni_intel crypto_simd cryptd
> ipmi_si iTCO_wdt glue_helper iTCO_vendor_support gpio_ich ipmi_devintf
> sg pcspkr i2c_i801 ipmi_msghandler acpi_power_meter lpc_ich
> i7core_edac acpi_cpufreq ip_tables xfs libcrc32c sd_mod sr_mod cdrom
> mgag200 drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops
> ttm ata_generic qla2xxx pata_acpi drm igb ata_piix mptsas libata
> nvme_fc scsi_transport_sas nvme_fabrics mptscsih nvme_core
> crc32c_intel mptbase scsi_transport_fc dca i2c_algo_bit dm_mirror
> dm_region_hash dm_log dm_mod
> [ 8162.621090] CR2: 00000000000000b8
>
>
> On Tue, Jun 19, 2018 at 6:41 PM, Li Wang <liwang@redhat.com> wrote:
> > Hi,
> >
> > I'm hitting this panic when running ltp/read_all_sys on kernel-v4.18-rc1.
> >
> > Test env:
> > FUJITSU PRIMERGY RX200 S6 GS01
> > Intel(R) Xeon(R) CPU E5620 @ 2.40GHz
> > 16384 MB memory, 598 GB disk space
> >
> >
> > [ 5915.705844] BUG: unable to handle kernel NULL pointer dereference
> > at 00000000000000b8
> > [ 5915.714587] PGD 800000042bcf7067 P4D 800000042bcf7067 PUD 423f4e067 PMD 0
> > [ 5915.722254] Oops: 0000 [#1] SMP PTI
> > [ 5915.726147] CPU: 6 PID: 18535 Comm: read_all Tainted: P
> > IOE     4.18.0-rc1 #1
> > [ 5915.734980] Hardware name: FUJITSU
> > PRIMERGY RX200 S6             /D3031, BIOS 6.00 Rev. 1.10.3031
> >      01/20/2012
> > [ 5915.749654] RIP: 0010:qla_dfs_tgt_counters_show+0x92/0x2a0 [qla2xxx]
> > [ 5915.756733] Code: b6 86 22 01 00 00 66 85 c0 74 63 83 e8 01 4c 8b
> > 9e b8 00 00 00 31 f6 0f b7 c0 48 8d 3c c5 08 00 00 00 49 8b 04 33 48
> > 83 c6 08 <48> 03 90 b8 00 00 00 48 03 88 c0 00 00 00 4c 03 80 c8 00 00
> > 00 4c
> > [ 5915.777816] RSP: 0018:ffffaf04109e3d60 EFLAGS: 00010202
> > [ 5915.783645] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
> > [ 5915.791606] RDX: 0000000000000000 RSI: 0000000000000008 RDI: 0000000000000040
> > [ 5915.799568] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
> > [ 5915.807529] R10: ffff956823a74798 R11: ffff956824a29000 R12: 0000000000000000
> > [ 5915.815489] R13: 0000000000000000 R14: ffff9567badfc280 R15: 0000000000000000
> > [ 5915.823451] FS:  00007f27336a1740(0000) GS:ffff95683fd80000(0000)
> > knlGS:0000000000000000
> > [ 5915.832479] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> > [ 5915.838890] CR2: 00000000000000b8 CR3: 000000042960a003 CR4: 00000000000206e0
> > [ 5915.846850] Call Trace:
> > [ 5915.849583]  ? __kmalloc_node+0x195/0x280
> > [ 5915.854056]  ? seq_read+0x33e/0x3f0
> > [ 5915.857946]  seq_read+0x120/0x3f0
> > [ 5915.861643]  full_proxy_read+0x50/0x70
> > [ 5915.865827]  __vfs_read+0x36/0x190
> > [ 5915.869622]  vfs_read+0x87/0x130
> > [ 5915.873223]  ksys_read+0x52/0xc0
> > [ 5915.876823]  do_syscall_64+0x5b/0x180
> > [ 5915.880910]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
> > [ 5915.886547] RIP: 0033:0x7f2733280790
> > [ 5915.890532] Code: 73 01 c3 48 8b 0d 18 88 20 00 f7 d8 64 89 01 48
> > 83 c8 ff c3 66 0f 1f 44 00 00 83 3d 59 cc 20 00 00 75 10 b8 00 00 00
> > 00 0f 05 <48> 3d 01 f0 ff ff 73 31 c3 48 83 ec 08 e8 1e fc ff ff 48 89
> > 04 24
> > [ 5915.911617] RSP: 002b:00007ffef181c738 EFLAGS: 00000246 ORIG_RAX:
> > 0000000000000000
> > [ 5915.920064] RAX: ffffffffffffffda RBX: 0000000000000006 RCX: 00007f2733280790
> > [ 5915.928025] RDX: 00000000000003ff RSI: 00007ffef181cbf0 RDI: 0000000000000006
> > [ 5915.935986] RBP: 0000000000000b7b R08: 0000000000000000 R09: 00007ffef181c690
> > [ 5915.943949] R10: 0000000000000000 R11: 0000000000000246 R12: 00007f2733688000
> > [ 5915.951909] R13: 00007ffef181cbf0 R14: 0000000000000028 R15: 0000000000000030
> > [ 5915.959871] Modules linked in: dummy veth binfmt_misc sctp overlay
> > tun fuse vfat fat btrfs xor zstd_decompress zstd_compress xxhash
> > raid6_pq ext4 mbcache jbd2 loop sunrpc intel_powerclamp coretemp
> > kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul
> > ghash_clmulni_intel pcbc ipmi_ssif aesni_intel crypto_simd iTCO_wdt
> > ipmi_si cryptd iTCO_vendor_support glue_helper gpio_ich ipmi_devintf
> > sg acpi_power_meter ipmi_msghandler i2c_i801 pcspkr lpc_ich
> > i7core_edac acpi_cpufreq ip_tables xfs libcrc32c sd_mod sr_mod cdrom
> > mgag200 drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops
> > ttm ata_generic pata_acpi qla2xxx drm igb ata_piix nvme_fc mptsas
> > libata nvme_fabrics scsi_transport_sas dca crc32c_intel mptscsih
> > i2c_algo_bit nvme_core i2c_core mptbase scsi_transport_fc dm_mirror
> > dm_region_hash dm_log
> > [ 5916.038566]  dm_mod [last unloaded: ltp_insmod01]
> > [ 5916.043814] CR2: 00000000000000b8
> > [ 5916.047513] BUG: unable to handle kernel NULL pointer dereference
> > at 00000000000000b8
> > [ 5916.047537] ---[ end trace 1dddacfb06305174 ]---
> >
> >
> > --
> > Regards,
> > Li Wang
>
>
>
> --
> Regards,
> Li Wang
>
> --
> Mailing list info: https://lists.linux.it/listinfo/ltp
    
    
More information about the ltp
mailing list