[LTP] [PATCH v3 2/2] read_all: wait children I/O reading in limited time

Richard Palethorpe rpalethorpe@suse.de
Mon Apr 23 09:44:52 CEST 2018


Hello,

Li Wang writes:

> read_all reports some stalled messges in test:
>  # ./read_all -d /sys -q -r 10
>    tst_test.c:987: INFO: Timeout per run is 0h 05m 00s
>    read_all.c:280: BROK: Worker 26075 is stalled
>    read_all.c:280: WARN: Worker 26075 is stalled
>    read_all.c:280: WARN: Worker 26079 is stalled
>    read_all.c:280: WARN: Worker 26087 is stalled
>
> The reason is that some children are still working on the read
> I/O but parent trys to stopping them after visit_dir immediately.
> Although the stop_attemps is 65535, it still sometimes fails.
>
> Instead, we take use of TST_RETRY_FUNC maroc to loop the stop
> operation in limited seconds to wait children I/O.
>
> And, the sched_work push action in an infinite loop, here merge it
> into rep_sched_work with using TST_RETRY_FUNC macro.
>
> Signed-off-by: Li Wang <liwang@redhat.com>
> Cc: Richard Palethorpe <rpalethorpe@suse.de>
> Cc: Xiao Yang <yangx.jy@cn.fujitsu.com>
> Cc: Cyril Hrubis <chrubis@suse.cz>
> ---
>
> Notes:
>     Hi Cyril and Richard,
>     
>     The purpose of this patch is to replace the old one[1] to solve
>     the children I/O issue. Please could you consider to merge or
>     comment on the change of using new marco.
>     
>     [1] http://lists.linux.it/pipermail/ltp/2018-April/007704.html
>
>  testcases/kernel/fs/read_all/read_all.c | 45 +++++++--------------------------
>  1 file changed, 9 insertions(+), 36 deletions(-)
>
> diff --git a/testcases/kernel/fs/read_all/read_all.c b/testcases/kernel/fs/read_all/read_all.c
> index b7ed540..b420e37 100644
> --- a/testcases/kernel/fs/read_all/read_all.c
> +++ b/testcases/kernel/fs/read_all/read_all.c
> @@ -265,23 +265,14 @@ static void spawn_workers(void)
>  static void stop_workers(void)
>  {
>  	const char stop_code[1] = { '\0' };
> -	int i, stop_attempts;
> +	int i;
>  
>  	if (!workers)
>  		return;
>  
>  	for (i = 0; i < worker_count; i++) {
> -		stop_attempts = 0xffff;
> -		if (workers[i].q) {
> -			while (!queue_push(workers[i].q, stop_code)) {
> -				if (--stop_attempts < 0) {
> -					tst_brk(TBROK,
> -						"Worker %d is stalled",
> -						workers[i].pid);
> -					break;
> -				}
> -			}
> -		}
> +		if (workers[i].q)
> +			TST_RETRY_FUNC(queue_push(workers[i].q, stop_code), 1);
>  	}
>  
>  	for (i = 0; i < worker_count; i++) {
> @@ -292,33 +283,15 @@ static void stop_workers(void)
>  	}
>  }
>  
> -static void sched_work(const char *path)
> -{
> -	static int cur;
> -	int push_attempts = 0, pushed;
> -
> -	while (1) {
> -		pushed = queue_push(workers[cur].q, path);
> -
> -		if (++cur >= worker_count)
> -			cur = 0;
> -
> -		if (pushed)
> -			break;
> -
> -		if (++push_attempts > worker_count) {
> -			usleep(100);
> -			push_attempts = 0;
> -		}
> -	}
> -}
> -
>  static void rep_sched_work(const char *path, int rep)
>  {
> -	int i;
> +	int i, j;
>  
> -	for (i = 0; i < rep; i++)
> -		sched_work(path);
> +	for (i = j = 0; i < rep; i++, j++) {
> +		if (j >= worker_count)
> +			j = 0;
> +		TST_RETRY_FUNC(queue_push(workers[j].q, path), 1);
> +	}
>  }
>  
>  static void setup(void)

Patchset looks good to me!

-- 
Thank you,
Richard.


More information about the ltp mailing list