public inbox for [email protected]
 help / color / mirror / Atom feed
* [RFC] do_iopoll() and *grab_env()
@ 2020-06-11 15:54 Pavel Begunkov
  2020-06-12 17:02 ` Jens Axboe
  0 siblings, 1 reply; 9+ messages in thread
From: Pavel Begunkov @ 2020-06-11 15:54 UTC (permalink / raw)
  To: io-uring, Jens Axboe

io_do_iopoll() can async punt a request with io_queue_async_work(),
so doing io_req_work_grab_env(). The problem is that iopoll() can
be called from who knows what context, e.g. from a completely
different process with its own memory space, creds, etc.

io_do_iopoll() {
	ret = req->poll();
	if (ret == -EAGAIN)
		io_queue_async_work()
	...
}


I can't find it handled in io_uring. Can this even happen?
Wouldn't it be better to complete them with -EAGAIN?

-- 
Pavel Begunkov

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [RFC] do_iopoll() and *grab_env()
  2020-06-11 15:54 [RFC] do_iopoll() and *grab_env() Pavel Begunkov
@ 2020-06-12 17:02 ` Jens Axboe
  2020-06-12 17:30   ` Pavel Begunkov
  0 siblings, 1 reply; 9+ messages in thread
From: Jens Axboe @ 2020-06-12 17:02 UTC (permalink / raw)
  To: Pavel Begunkov, io-uring

On 6/11/20 9:54 AM, Pavel Begunkov wrote:
> io_do_iopoll() can async punt a request with io_queue_async_work(),
> so doing io_req_work_grab_env(). The problem is that iopoll() can
> be called from who knows what context, e.g. from a completely
> different process with its own memory space, creds, etc.
> 
> io_do_iopoll() {
> 	ret = req->poll();
> 	if (ret == -EAGAIN)
> 		io_queue_async_work()
> 	...
> }
> 
> 
> I can't find it handled in io_uring. Can this even happen?
> Wouldn't it be better to complete them with -EAGAIN?

I don't think a plain -EAGAIN complete would be very useful, it's kind
of a shitty thing to pass back to userspace when it can be avoided. For
polled IO, we know we're doing O_DIRECT, or using fixed buffers. For the
latter, there's no problem in retrying, regardless of context. For the
former, I think we'd get -EFAULT mapping the IO at that point, which is
probably reasonable. I'd need to double check, though.

-- 
Jens Axboe


^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [RFC] do_iopoll() and *grab_env()
  2020-06-12 17:02 ` Jens Axboe
@ 2020-06-12 17:30   ` Pavel Begunkov
  2020-06-12 17:55     ` Jens Axboe
  0 siblings, 1 reply; 9+ messages in thread
From: Pavel Begunkov @ 2020-06-12 17:30 UTC (permalink / raw)
  To: Jens Axboe, io-uring

On 12/06/2020 20:02, Jens Axboe wrote:
> On 6/11/20 9:54 AM, Pavel Begunkov wrote:
>> io_do_iopoll() can async punt a request with io_queue_async_work(),
>> so doing io_req_work_grab_env(). The problem is that iopoll() can
>> be called from who knows what context, e.g. from a completely
>> different process with its own memory space, creds, etc.
>>
>> io_do_iopoll() {
>> 	ret = req->poll();
>> 	if (ret == -EAGAIN)
>> 		io_queue_async_work()
>> 	...
>> }
>>
>>
>> I can't find it handled in io_uring. Can this even happen?
>> Wouldn't it be better to complete them with -EAGAIN?
> 
> I don't think a plain -EAGAIN complete would be very useful, it's kind
> of a shitty thing to pass back to userspace when it can be avoided. For
> polled IO, we know we're doing O_DIRECT, or using fixed buffers. For the
> latter, there's no problem in retrying, regardless of context. For the
> former, I think we'd get -EFAULT mapping the IO at that point, which is
> probably reasonable. I'd need to double check, though.

It's shitty, but -EFAULT is the best outcome. I care more about not
corrupting another process' memory if addresses coincide. AFAIK it can
happen because io_{read,write} will use iovecs for punted re-submission.

Unconditional in advance async_prep() is too heavy to be good. I'd love to
see something more clever, but with -EAGAIN users at least can handle it.

-- 
Pavel Begunkov

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [RFC] do_iopoll() and *grab_env()
  2020-06-12 17:30   ` Pavel Begunkov
@ 2020-06-12 17:55     ` Jens Axboe
  2020-06-12 18:02       ` Jens Axboe
  0 siblings, 1 reply; 9+ messages in thread
From: Jens Axboe @ 2020-06-12 17:55 UTC (permalink / raw)
  To: Pavel Begunkov, io-uring

On 6/12/20 11:30 AM, Pavel Begunkov wrote:
> On 12/06/2020 20:02, Jens Axboe wrote:
>> On 6/11/20 9:54 AM, Pavel Begunkov wrote:
>>> io_do_iopoll() can async punt a request with io_queue_async_work(),
>>> so doing io_req_work_grab_env(). The problem is that iopoll() can
>>> be called from who knows what context, e.g. from a completely
>>> different process with its own memory space, creds, etc.
>>>
>>> io_do_iopoll() {
>>> 	ret = req->poll();
>>> 	if (ret == -EAGAIN)
>>> 		io_queue_async_work()
>>> 	...
>>> }
>>>
>>>
>>> I can't find it handled in io_uring. Can this even happen?
>>> Wouldn't it be better to complete them with -EAGAIN?
>>
>> I don't think a plain -EAGAIN complete would be very useful, it's kind
>> of a shitty thing to pass back to userspace when it can be avoided. For
>> polled IO, we know we're doing O_DIRECT, or using fixed buffers. For the
>> latter, there's no problem in retrying, regardless of context. For the
>> former, I think we'd get -EFAULT mapping the IO at that point, which is
>> probably reasonable. I'd need to double check, though.
> 
> It's shitty, but -EFAULT is the best outcome. I care more about not
> corrupting another process' memory if addresses coincide. AFAIK it can
> happen because io_{read,write} will use iovecs for punted re-submission.
>
> 
> Unconditional in advance async_prep() is too heavy to be good. I'd love to
> see something more clever, but with -EAGAIN users at least can handle it.

So how about we just grab ->task for the initial issue, and retry if we
find it through -EAGAIN and ->task == current. That'll be the most
common case, by far, and it'll prevent passes back -EAGAIN when we
really don't have to. If the task is different, then -EAGAIN makes more
sense, because at that point we're passing back -EAGAIN because we
really cannot feasibly handle it rather than just as a convenience.

-- 
Jens Axboe


^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [RFC] do_iopoll() and *grab_env()
  2020-06-12 17:55     ` Jens Axboe
@ 2020-06-12 18:02       ` Jens Axboe
  2020-06-12 18:33         ` Pavel Begunkov
  0 siblings, 1 reply; 9+ messages in thread
From: Jens Axboe @ 2020-06-12 18:02 UTC (permalink / raw)
  To: Pavel Begunkov, io-uring

On 6/12/20 11:55 AM, Jens Axboe wrote:
> On 6/12/20 11:30 AM, Pavel Begunkov wrote:
>> On 12/06/2020 20:02, Jens Axboe wrote:
>>> On 6/11/20 9:54 AM, Pavel Begunkov wrote:
>>>> io_do_iopoll() can async punt a request with io_queue_async_work(),
>>>> so doing io_req_work_grab_env(). The problem is that iopoll() can
>>>> be called from who knows what context, e.g. from a completely
>>>> different process with its own memory space, creds, etc.
>>>>
>>>> io_do_iopoll() {
>>>> 	ret = req->poll();
>>>> 	if (ret == -EAGAIN)
>>>> 		io_queue_async_work()
>>>> 	...
>>>> }
>>>>
>>>>
>>>> I can't find it handled in io_uring. Can this even happen?
>>>> Wouldn't it be better to complete them with -EAGAIN?
>>>
>>> I don't think a plain -EAGAIN complete would be very useful, it's kind
>>> of a shitty thing to pass back to userspace when it can be avoided. For
>>> polled IO, we know we're doing O_DIRECT, or using fixed buffers. For the
>>> latter, there's no problem in retrying, regardless of context. For the
>>> former, I think we'd get -EFAULT mapping the IO at that point, which is
>>> probably reasonable. I'd need to double check, though.
>>
>> It's shitty, but -EFAULT is the best outcome. I care more about not
>> corrupting another process' memory if addresses coincide. AFAIK it can
>> happen because io_{read,write} will use iovecs for punted re-submission.
>>
>>
>> Unconditional in advance async_prep() is too heavy to be good. I'd love to
>> see something more clever, but with -EAGAIN users at least can handle it.
> 
> So how about we just grab ->task for the initial issue, and retry if we
> find it through -EAGAIN and ->task == current. That'll be the most
> common case, by far, and it'll prevent passes back -EAGAIN when we
> really don't have to. If the task is different, then -EAGAIN makes more
> sense, because at that point we're passing back -EAGAIN because we
> really cannot feasibly handle it rather than just as a convenience.

Something like this, totally untested. And wants a comment too.

diff --git a/fs/io_uring.c b/fs/io_uring.c
index 155f3d830ddb..15806f71b33e 100644
--- a/fs/io_uring.c
+++ b/fs/io_uring.c
@@ -1727,6 +1728,12 @@ static int io_put_kbuf(struct io_kiocb *req)
 	return cflags;
 }
 
+static inline void req_set_fail_links(struct io_kiocb *req)
+{
+	if ((req->flags & (REQ_F_LINK | REQ_F_HARDLINK)) == REQ_F_LINK)
+		req->flags |= REQ_F_FAIL_LINK;
+}
+
 /*
  * Find and free completed poll iocbs
  */
@@ -1767,8 +1774,14 @@ static void io_iopoll_queue(struct list_head *again)
 	do {
 		req = list_first_entry(again, struct io_kiocb, list);
 		list_del(&req->list);
-		refcount_inc(&req->refs);
-		io_queue_async_work(req);
+		if (req->task == current) {
+			refcount_inc(&req->refs);
+			io_queue_async_work(req);
+		} else {
+			io_cqring_add_event(req, -EAGAIN);
+			req_set_fail_links(req);
+			io_put_req(req);
+		}
 	} while (!list_empty(again));
 }
 
@@ -1937,12 +1950,6 @@ static void kiocb_end_write(struct io_kiocb *req)
 	file_end_write(req->file);
 }
 
-static inline void req_set_fail_links(struct io_kiocb *req)
-{
-	if ((req->flags & (REQ_F_LINK | REQ_F_HARDLINK)) == REQ_F_LINK)
-		req->flags |= REQ_F_FAIL_LINK;
-}
-
 static void io_complete_rw_common(struct kiocb *kiocb, long res)
 {
 	struct io_kiocb *req = container_of(kiocb, struct io_kiocb, rw.kiocb);
@@ -2137,6 +2144,8 @@ static int io_prep_rw(struct io_kiocb *req, const struct io_uring_sqe *sqe,
 
 		kiocb->ki_flags |= IOCB_HIPRI;
 		kiocb->ki_complete = io_complete_rw_iopoll;
+		req->task = current;
+		get_task_struct(current);
 		req->result = 0;
 		req->iopoll_completed = 0;
 	} else {

-- 
Jens Axboe


^ permalink raw reply related	[flat|nested] 9+ messages in thread

* Re: [RFC] do_iopoll() and *grab_env()
  2020-06-12 18:02       ` Jens Axboe
@ 2020-06-12 18:33         ` Pavel Begunkov
  2020-06-12 18:46           ` Pavel Begunkov
  2020-06-12 19:42           ` Jens Axboe
  0 siblings, 2 replies; 9+ messages in thread
From: Pavel Begunkov @ 2020-06-12 18:33 UTC (permalink / raw)
  To: Jens Axboe, io-uring

On 12/06/2020 21:02, Jens Axboe wrote:
> On 6/12/20 11:55 AM, Jens Axboe wrote:
>> On 6/12/20 11:30 AM, Pavel Begunkov wrote:
>>> On 12/06/2020 20:02, Jens Axboe wrote:
>>>> On 6/11/20 9:54 AM, Pavel Begunkov wrote:
>>>>> io_do_iopoll() can async punt a request with io_queue_async_work(),
>>>>> so doing io_req_work_grab_env(). The problem is that iopoll() can
>>>>> be called from who knows what context, e.g. from a completely
>>>>> different process with its own memory space, creds, etc.
>>>>>
>>>>> io_do_iopoll() {
>>>>> 	ret = req->poll();
>>>>> 	if (ret == -EAGAIN)
>>>>> 		io_queue_async_work()
>>>>> 	...
>>>>> }
>>>>>
>>>>>
>>>>> I can't find it handled in io_uring. Can this even happen?
>>>>> Wouldn't it be better to complete them with -EAGAIN?
>>>>
>>>> I don't think a plain -EAGAIN complete would be very useful, it's kind
>>>> of a shitty thing to pass back to userspace when it can be avoided. For
>>>> polled IO, we know we're doing O_DIRECT, or using fixed buffers. For the
>>>> latter, there's no problem in retrying, regardless of context. For the
>>>> former, I think we'd get -EFAULT mapping the IO at that point, which is
>>>> probably reasonable. I'd need to double check, though.
>>>
>>> It's shitty, but -EFAULT is the best outcome. I care more about not
>>> corrupting another process' memory if addresses coincide. AFAIK it can
>>> happen because io_{read,write} will use iovecs for punted re-submission.
>>>
>>>
>>> Unconditional in advance async_prep() is too heavy to be good. I'd love to
>>> see something more clever, but with -EAGAIN users at least can handle it.
>>
>> So how about we just grab ->task for the initial issue, and retry if we
>> find it through -EAGAIN and ->task == current. That'll be the most
>> common case, by far, and it'll prevent passes back -EAGAIN when we
>> really don't have to. If the task is different, then -EAGAIN makes more
>> sense, because at that point we're passing back -EAGAIN because we
>> really cannot feasibly handle it rather than just as a convenience.

Yeah, I was even thinking to drag it through task_work just to call
*grab_env() there. Looks reasonable to me.

> Something like this, totally untested. And wants a comment too.

Looks like it. Would you leave this to me? There is another issue with
cancellation requiring ->task, It'd be easier to keep them together.

> 
> diff --git a/fs/io_uring.c b/fs/io_uring.c
> index 155f3d830ddb..15806f71b33e 100644
> --- a/fs/io_uring.c
> +++ b/fs/io_uring.c
> @@ -1727,6 +1728,12 @@ static int io_put_kbuf(struct io_kiocb *req)
>  	return cflags;
>  }
>  
> +static inline void req_set_fail_links(struct io_kiocb *req)
> +{
> +	if ((req->flags & (REQ_F_LINK | REQ_F_HARDLINK)) == REQ_F_LINK)
> +		req->flags |= REQ_F_FAIL_LINK;
> +}
> +
>  /*
>   * Find and free completed poll iocbs
>   */
> @@ -1767,8 +1774,14 @@ static void io_iopoll_queue(struct list_head *again)
>  	do {
>  		req = list_first_entry(again, struct io_kiocb, list);
>  		list_del(&req->list);
> -		refcount_inc(&req->refs);
> -		io_queue_async_work(req);
> +		if (req->task == current) {
> +			refcount_inc(&req->refs);
> +			io_queue_async_work(req);
> +		} else {
> +			io_cqring_add_event(req, -EAGAIN);
> +			req_set_fail_links(req);
> +			io_put_req(req);
> +		}
>  	} while (!list_empty(again));
>  }
>  
> @@ -1937,12 +1950,6 @@ static void kiocb_end_write(struct io_kiocb *req)
>  	file_end_write(req->file);
>  }
>  
> -static inline void req_set_fail_links(struct io_kiocb *req)
> -{
> -	if ((req->flags & (REQ_F_LINK | REQ_F_HARDLINK)) == REQ_F_LINK)
> -		req->flags |= REQ_F_FAIL_LINK;
> -}
> -
>  static void io_complete_rw_common(struct kiocb *kiocb, long res)
>  {
>  	struct io_kiocb *req = container_of(kiocb, struct io_kiocb, rw.kiocb);
> @@ -2137,6 +2144,8 @@ static int io_prep_rw(struct io_kiocb *req, const struct io_uring_sqe *sqe,
>  
>  		kiocb->ki_flags |= IOCB_HIPRI;
>  		kiocb->ki_complete = io_complete_rw_iopoll;
> +		req->task = current;
> +		get_task_struct(current);
>  		req->result = 0;
>  		req->iopoll_completed = 0;
>  	} else {
> 

-- 
Pavel Begunkov

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [RFC] do_iopoll() and *grab_env()
  2020-06-12 18:33         ` Pavel Begunkov
@ 2020-06-12 18:46           ` Pavel Begunkov
  2020-06-12 19:42           ` Jens Axboe
  1 sibling, 0 replies; 9+ messages in thread
From: Pavel Begunkov @ 2020-06-12 18:46 UTC (permalink / raw)
  To: Jens Axboe, io-uring

On 12/06/2020 21:33, Pavel Begunkov wrote:
> On 12/06/2020 21:02, Jens Axboe wrote:
>> On 6/12/20 11:55 AM, Jens Axboe wrote:
>>> On 6/12/20 11:30 AM, Pavel Begunkov wrote:
>>>> On 12/06/2020 20:02, Jens Axboe wrote:
>>>>> On 6/11/20 9:54 AM, Pavel Begunkov wrote:
>>>>>> io_do_iopoll() can async punt a request with io_queue_async_work(),
>>>>>> so doing io_req_work_grab_env(). The problem is that iopoll() can
>>>>>> be called from who knows what context, e.g. from a completely
>>>>>> different process with its own memory space, creds, etc.
>>>>>>
>>>>>> io_do_iopoll() {
>>>>>> 	ret = req->poll();
>>>>>> 	if (ret == -EAGAIN)
>>>>>> 		io_queue_async_work()
>>>>>> 	...
>>>>>> }
>>>>>>
>>>>>>
>>>>>> I can't find it handled in io_uring. Can this even happen?
>>>>>> Wouldn't it be better to complete them with -EAGAIN?
>>>>>
>>>>> I don't think a plain -EAGAIN complete would be very useful, it's kind
>>>>> of a shitty thing to pass back to userspace when it can be avoided. For
>>>>> polled IO, we know we're doing O_DIRECT, or using fixed buffers. For the
>>>>> latter, there's no problem in retrying, regardless of context. For the
>>>>> former, I think we'd get -EFAULT mapping the IO at that point, which is
>>>>> probably reasonable. I'd need to double check, though.
>>>>
>>>> It's shitty, but -EFAULT is the best outcome. I care more about not
>>>> corrupting another process' memory if addresses coincide. AFAIK it can
>>>> happen because io_{read,write} will use iovecs for punted re-submission.
>>>>
>>>>
>>>> Unconditional in advance async_prep() is too heavy to be good. I'd love to
>>>> see something more clever, but with -EAGAIN users at least can handle it.
>>>
>>> So how about we just grab ->task for the initial issue, and retry if we
>>> find it through -EAGAIN and ->task == current. That'll be the most
>>> common case, by far, and it'll prevent passes back -EAGAIN when we
>>> really don't have to. If the task is different, then -EAGAIN makes more
>>> sense, because at that point we're passing back -EAGAIN because we
>>> really cannot feasibly handle it rather than just as a convenience.
> 
> Yeah, I was even thinking to drag it through task_work just to call
> *grab_env() there. Looks reasonable to me.

edit: *Yours looks reasonable*.
task_work is too cumbersome for such a small nuisance.

> 
>> Something like this, totally untested. And wants a comment too.
> 
> Looks like it. Would you leave this to me? There is another issue with
> cancellation requiring ->task, It'd be easier to keep them together.
> 

-- 
Pavel Begunkov

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [RFC] do_iopoll() and *grab_env()
  2020-06-12 18:33         ` Pavel Begunkov
  2020-06-12 18:46           ` Pavel Begunkov
@ 2020-06-12 19:42           ` Jens Axboe
  2020-06-13 19:12             ` Pavel Begunkov
  1 sibling, 1 reply; 9+ messages in thread
From: Jens Axboe @ 2020-06-12 19:42 UTC (permalink / raw)
  To: Pavel Begunkov, io-uring

On 6/12/20 12:33 PM, Pavel Begunkov wrote:
> On 12/06/2020 21:02, Jens Axboe wrote:
>> On 6/12/20 11:55 AM, Jens Axboe wrote:
>>> On 6/12/20 11:30 AM, Pavel Begunkov wrote:
>>>> On 12/06/2020 20:02, Jens Axboe wrote:
>>>>> On 6/11/20 9:54 AM, Pavel Begunkov wrote:
>>>>>> io_do_iopoll() can async punt a request with io_queue_async_work(),
>>>>>> so doing io_req_work_grab_env(). The problem is that iopoll() can
>>>>>> be called from who knows what context, e.g. from a completely
>>>>>> different process with its own memory space, creds, etc.
>>>>>>
>>>>>> io_do_iopoll() {
>>>>>> 	ret = req->poll();
>>>>>> 	if (ret == -EAGAIN)
>>>>>> 		io_queue_async_work()
>>>>>> 	...
>>>>>> }
>>>>>>
>>>>>>
>>>>>> I can't find it handled in io_uring. Can this even happen?
>>>>>> Wouldn't it be better to complete them with -EAGAIN?
>>>>>
>>>>> I don't think a plain -EAGAIN complete would be very useful, it's kind
>>>>> of a shitty thing to pass back to userspace when it can be avoided. For
>>>>> polled IO, we know we're doing O_DIRECT, or using fixed buffers. For the
>>>>> latter, there's no problem in retrying, regardless of context. For the
>>>>> former, I think we'd get -EFAULT mapping the IO at that point, which is
>>>>> probably reasonable. I'd need to double check, though.
>>>>
>>>> It's shitty, but -EFAULT is the best outcome. I care more about not
>>>> corrupting another process' memory if addresses coincide. AFAIK it can
>>>> happen because io_{read,write} will use iovecs for punted re-submission.
>>>>
>>>>
>>>> Unconditional in advance async_prep() is too heavy to be good. I'd love to
>>>> see something more clever, but with -EAGAIN users at least can handle it.
>>>
>>> So how about we just grab ->task for the initial issue, and retry if we
>>> find it through -EAGAIN and ->task == current. That'll be the most
>>> common case, by far, and it'll prevent passes back -EAGAIN when we
>>> really don't have to. If the task is different, then -EAGAIN makes more
>>> sense, because at that point we're passing back -EAGAIN because we
>>> really cannot feasibly handle it rather than just as a convenience.
> 
> Yeah, I was even thinking to drag it through task_work just to call
> *grab_env() there. Looks reasonable to me.
> 
>> Something like this, totally untested. And wants a comment too.
> 
> Looks like it. Would you leave this to me? There is another issue with
> cancellation requiring ->task, It'd be easier to keep them together.

Guess this ties into the next email, on using task_work? I actually
don't think that's a bad idea. If you have a low(er) queue depth device,
the -EAGAIN path is not necessarily that common. And task_work is a lot
more efficient for re-submittal than async work, plus needs to grab less
resources.

So I think you should still run with it...

-- 
Jens Axboe


^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [RFC] do_iopoll() and *grab_env()
  2020-06-12 19:42           ` Jens Axboe
@ 2020-06-13 19:12             ` Pavel Begunkov
  0 siblings, 0 replies; 9+ messages in thread
From: Pavel Begunkov @ 2020-06-13 19:12 UTC (permalink / raw)
  To: Jens Axboe, io-uring

On 12/06/2020 22:42, Jens Axboe wrote:
> On 6/12/20 12:33 PM, Pavel Begunkov wrote:
>> On 12/06/2020 21:02, Jens Axboe wrote:
>>> On 6/12/20 11:55 AM, Jens Axboe wrote:
>>>> On 6/12/20 11:30 AM, Pavel Begunkov wrote:
>>>>> On 12/06/2020 20:02, Jens Axboe wrote:
>>>>>> On 6/11/20 9:54 AM, Pavel Begunkov wrote:
>>>>>>> io_do_iopoll() can async punt a request with io_queue_async_work(),
>>>>>>> so doing io_req_work_grab_env(). The problem is that iopoll() can
>>>>>>> be called from who knows what context, e.g. from a completely
>>>>>>> different process with its own memory space, creds, etc.
>>>>>>>
>>>>>>> io_do_iopoll() {
>>>>>>> 	ret = req->poll();
>>>>>>> 	if (ret == -EAGAIN)
>>>>>>> 		io_queue_async_work()
>>>>>>> 	...
>>>>>>> }
>>>>>>>
>>>>>>>
>>>>>>> I can't find it handled in io_uring. Can this even happen?
>>>>>>> Wouldn't it be better to complete them with -EAGAIN?
>>>>>>
>>>>>> I don't think a plain -EAGAIN complete would be very useful, it's kind
>>>>>> of a shitty thing to pass back to userspace when it can be avoided. For
>>>>>> polled IO, we know we're doing O_DIRECT, or using fixed buffers. For the
>>>>>> latter, there's no problem in retrying, regardless of context. For the
>>>>>> former, I think we'd get -EFAULT mapping the IO at that point, which is
>>>>>> probably reasonable. I'd need to double check, though.
>>>>>
>>>>> It's shitty, but -EFAULT is the best outcome. I care more about not
>>>>> corrupting another process' memory if addresses coincide. AFAIK it can
>>>>> happen because io_{read,write} will use iovecs for punted re-submission.
>>>>>
>>>>>
>>>>> Unconditional in advance async_prep() is too heavy to be good. I'd love to
>>>>> see something more clever, but with -EAGAIN users at least can handle it.
>>>>
>>>> So how about we just grab ->task for the initial issue, and retry if we
>>>> find it through -EAGAIN and ->task == current. That'll be the most
>>>> common case, by far, and it'll prevent passes back -EAGAIN when we
>>>> really don't have to. If the task is different, then -EAGAIN makes more
>>>> sense, because at that point we're passing back -EAGAIN because we
>>>> really cannot feasibly handle it rather than just as a convenience.
>>
>> Yeah, I was even thinking to drag it through task_work just to call
>> *grab_env() there. Looks reasonable to me.
>>
>>> Something like this, totally untested. And wants a comment too.
>>
>> Looks like it. Would you leave this to me? There is another issue with
>> cancellation requiring ->task, It'd be easier to keep them together.
> 
> Guess this ties into the next email, on using task_work? I actually
> don't think that's a bad idea. If you have a low(er) queue depth device,
> the -EAGAIN path is not necessarily that common. And task_work is a lot
> more efficient for re-submittal than async work, plus needs to grab less
> resources.
> 
> So I think you should still run with it...

Ok, I'll look into this then


-- 
Pavel Begunkov

^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2020-06-13 19:13 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2020-06-11 15:54 [RFC] do_iopoll() and *grab_env() Pavel Begunkov
2020-06-12 17:02 ` Jens Axboe
2020-06-12 17:30   ` Pavel Begunkov
2020-06-12 17:55     ` Jens Axboe
2020-06-12 18:02       ` Jens Axboe
2020-06-12 18:33         ` Pavel Begunkov
2020-06-12 18:46           ` Pavel Begunkov
2020-06-12 19:42           ` Jens Axboe
2020-06-13 19:12             ` Pavel Begunkov

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox