[LTP] [PATCH] mem/lib: keep allocating memory until get an error in single process

Jan Stancek jstancek@redhat.com
Mon Jun 20 15:03:51 CEST 2016



----- Original Message -----
> From: "Li Wang" <liwang@redhat.com>
> To: "Jan Stancek" <jstancek@redhat.com>
> Cc: ltp@lists.linux.it
> Sent: Monday, 20 June, 2016 1:34:54 PM
> Subject: Re: [PATCH] mem/lib: keep allocating memory until get an error in single process
> 
> On Mon, Jun 20, 2016 at 06:43:18AM -0400, Jan Stancek wrote:
> > 
> > 
> > ----- Original Message -----
> > > From: "Li Wang" <liwang@redhat.com>
> > > To: jstancek@redhat.com
> > > Cc: ltp@lists.linux.it
> > > Sent: Monday, 20 June, 2016 11:53:05 AM
> > > Subject: [PATCH] mem/lib: keep allocating memory until get an error in
> > > single process
> > > 
> > > We occasionally catch errors like:
> > > oom03       0  TINFO  :  start OOM testing for KSM pages.
> > > oom03       0  TINFO  :  expected victim is 3490.
> > > oom03       6  TFAIL  :  mem.c:163: victim unexpectedly ended with
> > > retcode:
> > > 0, expected: 12
> > > oom03       0  TINFO  :  set overcommit_memory to 0
> > > 
> > > It cames from the caller testoom(0, 1, ENOMEM, 1). The full reason is
> > > that
> > > function child_alloc() go into single process mode, then successfully
> > > finish
> > > the memory allocation and return 0.
> > 
> > Description above doesn't explain why you get 0, when oom03 is set to run
> > in cgroup with memory.memsw.limit_in_bytes == TESTMEM, and then allocates
> > TESTMEM + MB.
> > 
> > My guess is a KSM scan merged some pages before you have hit the limit.
> That might be.
> 
> > Do yo get these failures always during KSM test?
> No, I just get the failures only once during my test.
> 
> > > 
> > > In this patch, let's make it (in single mode) keep allocating memory with
> > > an incresed length in order to avoid 0 returned.
> 
> Hmm, now I think these codes have two problems.
> 
> 1. As you said the KSM merge same pages and let the oom03 failed as above.
> 
> 
> 2. The child_alloc() probably should also do memory allocation with an
> infinite loop

At the time 'lite' was introduced, everything was single threaded,
the only difference was that lite == 0 would keep allocating in
infinite loop.

Your patch is effectively eliminating lite == 1, which means that
oom03 will pass even if cgroup limits are broken.

> in single process mode. Because if someone has a caller testoom(0, 1, ENOMEM,
> 1) at other
> place in future, that'll be easily get fauilures.

Likely because author is expecting behaviour other than what
we have now:

 * @lite: if non-zero, child makes only single TESTMEM+MB allocation
 *        if zero, child keeps allocating memory until it gets killed
 *        or some operation fails

> Or, can we solve them in one method?

We could skip oom(KSM) if lite == 1 in testoom(), since limit_in_bytes
may vary from run to run, which isn't reliable for oom03 cgroup test.

Or we could change alloc_mem for lite == 1 && testcase == KSM,
such that each page would have unique content (to prevent merging).

Regards,
Jan


More information about the ltp mailing list