[LTP] [PATCH/RFC] syscalls/readahead02: don't use cache size
Tue Mar 5 16:33:17 CET 2019
On Tue, Mar 5, 2019 at 5:17 PM Jan Stancek <email@example.com> wrote:
> ----- Original Message -----
> > On Tue, Mar 5, 2019 at 2:34 PM Jan Stancek <firstname.lastname@example.org> wrote:
> > >
> > > Using system-wide "Cached" size is not accurate. The test is sporadically
> > > failing with warning on ppc64le 4.18 and 5.0 kernels.
> > >
> > > Problem is that test over-estimates max readahead size, which then
> > > leads to fewer readhead calls and kernel can silently trims length
> > > in each of them:
> > > ...
> > > readahead02.c:244: INFO: Test #2: POSIX_FADV_WILLNEED on file
> > > readahead02.c:134: INFO: creating test file of size: 67108864
> > > readahead02.c:263: INFO: read_testfile(0)
> > > readahead02.c:274: INFO: read_testfile(1)
> > > readahead02.c:189: INFO: max ra estimate: 12320768
> > > readahead02.c:198: INFO: readahead calls made: 6
> > > readahead02.c:204: PASS: offset is still at 0 as expected
> > > readahead02.c:308: INFO: read_testfile(0) took: 492486 usec
> > > readahead02.c:309: INFO: read_testfile(1) took: 430627 usec
> > > readahead02.c:311: INFO: read_testfile(0) read: 67108864 bytes
> > > readahead02.c:313: INFO: read_testfile(1) read: 59244544 bytes
> > > readahead02.c:316: PASS: readahead saved some I/O
> > > readahead02.c:324: INFO: cache can hold at least: 264192 kB
> > > readahead02.c:325: INFO: read_testfile(0) used cache: 124992 kB
> > > readahead02.c:326: INFO: read_testfile(1) used cache: 12032 kB
> > > readahead02.c:338: WARN: using less cache than expected
> > >
> > > Stop relying on used cache size, and use minimal sane readahead length,
> > > that should work across all systems.
> > But now instead of over estimating readahead length you definetly
> > underestimate it resulting in way too many readahead calls
> > (reahead every 4K block), which does not validate long readahead
> > code is working at all.
> We could try to cap on backing device read_ahead_kb.
Makes sense. It should would make the test more deterministic.
> > How about this patch instead.
> > Can you say with sufficient confidence if it solves the sporadic errors?
> It likely won't. We started at 2M couple years back, but there
> were several commit that keep changing the limit: 
>  600e19afc5f8a6c18ea49cee9511c5797db02391
>  https://lkml.org/lkml/2016/7/25/308
Well, perhaps the name I chose for the constant is wrong.
I did not mean that configuration > 1MB readahead is not sane.
I meant we could call readahead syscall for at most every 1MB,
so mitigate over estimation of the loop step.
So maybe setra to 1MB on test device and call readahead
in 1MB steps without estimation?
More information about the ltp