[LTP] Holidays and LTP release

Cyril Hrubis chrubis@suse.cz
Fri Jan 15 15:00:49 CET 2021


Hi!
> > > semctl09.c:132: TINFO: Test SEM_STAT_ANY with nobody user
> > > semctl09.c:155: TPASS: SEM_INFO returned valid index 10 to semid 10
> > > semctl09.c:164: TPASS: Counted used = 1
> > > semctl09.c:112: TPASS: semset_cnt = 1
> > > semctl09.c:119: TPASS: sen_cnt = 2
> > > tst_test.c:1313: TBROK: Test killed by SIGSEGV!
> >
> > There seems to be a part of the log missing here, if it's segfaulting in
> > the libc semctl() there used to be glibc bug for SEM_STAT_ANY which may
> > cause SIGSEGV:
> >
> > https://sourceware.org/bugzilla/show_bug.cgi?id=26637

And it looks like there is something strange going on here, Martin
Doucha is looking into it at the moment.

> > > accept02.c:55: TBROK: setsockopt(3, 0, 42, 0xb6f4cf7c, 132) failed: ENODEV (19)
> >
> > Looks like muticast groups is not compiled in kernel here.
> 
> True.
> We have not enabled multicast groups.
> # CONFIG_IP_MULTICAST is not set
> 
> >
> > We should handle this and report TCONF instead, however this is not a
> > regression, the test has been like this for three years.
> 
> Reporting TCONF would be right here.

Sure, however since this is not a regression let's do that after the
release. Feel free to open an issue so that it's not forgotten.

> > > clock_gettime04.c:143: TFAIL: CLOCK_REALTIME: Time travelled backwards
> > > (5): -1610153174499293029 ns
> > > clock_gettime04.c:151: TFAIL: CLOCK_REALTIME_COARSE: Difference
> > > between successive readings greater than 5 ms (4): 9
> > > clock_gettime04.c:158: TPASS: CLOCK_MONOTONIC: Difference between
> > > successive readings is reasonable
> > > clock_gettime04.c:151: TFAIL: CLOCK_MONOTONIC_COARSE: Difference
> > > between successive readings greater than 5 ms (2): 9
> > > clock_gettime04.c:158: TPASS: CLOCK_MONOTONIC_RAW: Difference between
> > > successive readings is reasonable
> > > clock_gettime04.c:158: TPASS: CLOCK_BOOTTIME: Difference between
> > > successive readings is reasonable
> >
> > This shouldn't really happen, what this says is that time jumped 51
> > years back. That is really absurd.
> 
> This is always reproduced on arm beagleboard-x15, i386 32-bit qemu_i386
> and qemu_arm.
> 
> clock_gettime04.c:143: TFAIL: CLOCK_REALTIME: Time travelled backwards
> (5): -1610599851098100352 ns
> 
> https://lkft.validation.linaro.org/scheduler/job/2142122#L3885
> https://lkft.validation.linaro.org/scheduler/job/2144548#L2534

I've reproduced it here on RPI board, I will have a look.

> > > getrlimit03.c:168: TPASS: __NR_prlimit64(0) and __NR_ugetrlimit(0)
> > > gave consistent results
> > > tst_test.c:1313: TBROK: Test killed by SIGILL!
> >
> > Can we have a backtrace here? It's impossible to say what happened here
> > without further debugging.
> 
> I do not have backtrace log for this now.
> Let me share strace output log in this link,
> # strace -f ./getrlimit03
> https://lkft.validation.linaro.org/scheduler/job/2145166#L2569

Thanks.

I've managed to reproduce as well but I have no idea what happens there,
it looks like something gets corrupted and the processor attempts to
execute invalid code.

Maybe we pass wrong size of the rlim structure somewhere and kernel
writes over a stack.

> > > on x86_64:
> > > The ioctl_sg01 test killed and reported failed.
> > >
> > > tst_test.c:1263: TINFO: Timeout per run is 0h 15m 00s
> > > ioctl_sg01.c:81: TINFO: Found SCSI device /dev/sg1
> > > [  332.383394] ioctl_sg01 invoked oom-killer:
> >
> > That's likely due to tst_pollutte_memory().
> >
> > What are the overcommit settings on that machine?
> 
> overcommit_memory is 0.
> After changing overcommit to 2 the test got PASS.
> 
> > This is probably worth fixing before the release if we manage to figure
> > out the cause.
> 
> cat /proc/sys/vm/overcommit_memory
> 0
> cat /proc/sys/vm/overcommit_ratio
> 50
> ./runltp -s ioctl_sg01 --> FAILED
> 
> echo 2 > /proc/sys/vm/overcommit_memory
> cat /proc/sys/vm/overcommit_memory
> 2
> ./runltp -s ioctl_sg01 --> PASS

So it looks like we need better safety margin.

Have you tried to increase the safety in the lib/tst_memutils.c as
pointed out by Martin?

-- 
Cyril Hrubis
chrubis@suse.cz


More information about the ltp mailing list