Kernel bug (cosa fare?)

Massimo Rossi massi.rossi@alice.it
Sab 18 Nov 2006 11:44:59 CET


Buongiorno a tutti,
abbiamo in housing un server Dell 2850 ed è la seconda volta che freeza 
in breve tempo.
Ha su RHEL4 con kernel originale non ricompilato, ecco alcune info:
> # cat /proc/version
> Linux version 2.6.9-22.0.2.ELsmp 
> (bhcompile@hs20-bc1-1.build.redhat.com) (gcc version 3.4.5 20051201 
> (Red Hat 3.4.5-2)) #1 SMP Thu Jan 5 17:13:01 EST 2006
da /var/log/messages.2:
> Nov 18 10:10:01 technicare kernel: ------------[ cut here ]------------
> Nov 18 10:10:01 technicare kernel: kernel BUG at 
> include/asm/atomic_kmap.h:60!
> Nov 18 10:10:01 technicare kernel: invalid operand: 0000 [#1]
> Nov 18 10:10:01 technicare kernel: SMP
> Nov 18 10:10:01 technicare kernel: Modules linked in: nls_utf8 loop 
> dcdipm(U) dcdbas(U) nfsd exportfs lockd md5 ipv6 autofs4 sunrpc button 
> battery ac uhci_hcd ehci_hcd hw
> _random e1000 floppy sg dm_snapshot dm_zero dm_mirror ext3 jbd dm_mod 
> megaraid_mbox megaraid_mm mptscsih mptbase sd_mod scsi_mod
> Nov 18 10:10:01 technicare kernel: CPU:    2
> Nov 18 10:10:01 technicare kernel: EIP:    0060:[<c010adfc>]    
> Tainted: P      VLI
> Nov 18 10:10:01 technicare kernel: EFLAGS: 00010206   (2.6.9-22.0.2.ELsmp)
> Nov 18 10:10:01 technicare kernel: EIP is at load_LDT_nolock+0xf8/0x1c1
> Nov 18 10:10:01 technicare kernel: eax: c000aca0   ebx: 80000000   
> ecx: 0ffc1163   edx: 00000055
> Nov 18 10:10:01 technicare kernel: esi: c000af48   edi: cffc2000   
> ebp: e23eaad4   esp: dc498f2c
> Nov 18 10:10:01 technicare kernel: ds: 007b   es: 007b   ss: 0068
> Nov 18 10:10:01 technicare kernel: Process java.bin (pid: 8795, 
> threadinfo=dc498000 task=f5060230)
> Nov 18 10:10:01 technicare kernel: Stack: fff94000 c11ff820 dc498000 
> 00000011 00000000 00000001 00000200 e23eaaf0
> Nov 18 10:10:01 technicare kernel:        00000002 dc498000 00000200 
> cffc2000 e23eaad4 c010a689 cffc1000 00000000
> Nov 18 10:10:01 technicare kernel:        00001000 00001000 00000001 
> e23eaad8 dc498000 bffff9e8 ffffffea c010ab9a
> Nov 18 10:10:01 technicare kernel: Call Trace:
> Nov 18 10:10:01 technicare kernel:  [<c010a689>] alloc_ldt+0xd3/0x108
> Nov 18 10:10:01 technicare kernel:  [<c010ab9a>] write_ldt+0x112/0x22f
> Nov 18 10:10:01 technicare kernel:  [<c010acfe>] sys_modify_ldt+0x47/0x4d
> Nov 18 10:10:01 technicare kernel:  [<c02d137f>] syscall_call+0x7/0xb
> Nov 18 10:10:01 technicare kernel: Code: 8d 42 16 c1 e0 0c 29 c6 8d 04 
> d5 00 00 00 00 89 34 24 8b 35 dc c8 40 c0 89 f1 29 c1 89 c8 8b 09 8b 
> 58 04 85 c9 75 04 85 db 74 08
> <0f> 0b 3c 00 f9 b8 2d c0 8d 04 d5 00 00 00 00 8b 2d 7c 29 32 c0
> Nov 18 10:10:01 technicare kernel:  <0>Fatal exception: panic in 5 seconds
Pare faccia riferimento a java.bin, ma non ho trovato niente negli altri 
log file.
Vorrei cercare di capire da cosa deriva e pensevo di inviare una mail a 
chi si occupa del kernel.
Secondo voi è la strada giusta ed esiste una procedura da seguire?

Grazie

Massimo Rossi



Maggiori informazioni sulla lista glug