From: Pavel Begunkov <[email protected]>
To: Hao Xu <[email protected]>, Jens Axboe <[email protected]>
Cc: [email protected], Joseph Qi <[email protected]>
Subject: Re: [PATCH] io_uring: fix possible deadlock in io_uring_poll
Date: Wed, 3 Feb 2021 00:04:02 +0000 [thread overview]
Message-ID: <[email protected]> (raw)
In-Reply-To: <[email protected]>
On 02/02/2021 19:52, Hao Xu wrote:
> This might happen if we do epoll_wait on a uring fd while reading/writing
> the former epoll fd in a sqe in the former uring instance.
> So let's don't flush cqring overflow list when we fail to get the uring
> lock. This leads to less accuracy, but is still ok.
if (io_cqring_events(ctx) || test_bit(0, &ctx->cq_check_overflow))
mask |= EPOLLIN | EPOLLRDNORM;
Instead of flushing. It'd make sense if we define poll as "there might
be something, go do your peek/wait with overflow checks". Jens, is that
documented anywhere?
>
> Reported-by: Abaci <[email protected]>
> Fixes: 6c503150ae33 ("io_uring: patch up IOPOLL overflow_flush sync")
> Signed-off-by: Hao Xu <[email protected]>
> ---
>
> Here I use mutex_trylock() to fix this issue, but this causes loss of
> accuracy. I think doing cqring overflow flush in a task work maybe a
> better solution. I'm think of this. Any thoughts?
>
> fs/io_uring.c | 31 ++++++++++++++++++++++++++++++-
> 1 file changed, 30 insertions(+), 1 deletion(-)
>
> diff --git a/fs/io_uring.c b/fs/io_uring.c
> index 38c6cbe1ab38..866e45d42ac7 100644
> --- a/fs/io_uring.c
> +++ b/fs/io_uring.c
> @@ -8718,7 +8718,36 @@ static __poll_t io_uring_poll(struct file *file, poll_table *wait)
> smp_rmb();
> if (!io_sqring_full(ctx))
> mask |= EPOLLOUT | EPOLLWRNORM;
> - io_cqring_overflow_flush(ctx, false, NULL, NULL);
> +
> + if (test_bit(0, &ctx->cq_check_overflow)) {
> + bool should_flush = true;
> + /* iopoll syncs against uring_lock, not completion_lock */
> + if (ctx->flags & IORING_SETUP_IOPOLL) {
> + /*
> + * avoid ABBA deadlock.
> + * there could be contention like below:
> + * CPU0 CPU1
> + * ---- ----
> + * lock(&ctx->uring_lock);
> + * lock(&ep->mtx);
> + * lock(&ctx->uring_lock);
> + * lock(&ep->mtx);
> + *
> + * this might happen if we do epoll_wait on a uring fd while
> + * read/write the former epoll fd in a sqe in the former uring
> + * instance.
> + * We don't flush cqring overflow list when we fail to get the
> + * uring lock. This leads to less accuracy, but is still ok.
> + */
> + should_flush = mutex_trylock(&ctx->uring_lock);
> + }
> + if (should_flush) {
> + __io_cqring_overflow_flush(ctx, false, NULL, NULL);
> + if (ctx->flags & IORING_SETUP_IOPOLL)
> + mutex_unlock(&ctx->uring_lock);
> + }
> + }
> +
> if (io_cqring_events(ctx))
> mask |= EPOLLIN | EPOLLRDNORM;
>
>
--
Pavel Begunkov
next prev parent reply other threads:[~2021-02-03 0:08 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-02-02 19:52 [PATCH] io_uring: fix possible deadlock in io_uring_poll Hao Xu
2021-02-03 0:04 ` Pavel Begunkov [this message]
2021-02-03 1:48 ` Jens Axboe
2021-02-03 16:48 ` Pavel Begunkov
2021-02-04 16:48 ` Pavel Begunkov
2021-02-04 16:54 ` Hao Xu
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
[email protected] \
[email protected] \
[email protected] \
[email protected] \
[email protected] \
[email protected] \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox