public inbox for io-uring@vger.kernel.org
 help / color / mirror / Atom feed
From: Bernd Schubert <bschubert@ddn.com>
To: Joanne Koong <joannelkoong@gmail.com>
Cc: Bernd Schubert <bernd@bsbernd.com>,
	miklos@szeredi.hu, axboe@kernel.dk, asml.silence@gmail.com,
	io-uring@vger.kernel.org, csander@purestorage.com,
	xiaobing.li@samsung.com, linux-fsdevel@vger.kernel.org
Subject: Re: [PATCH v3 19/25] fuse: add io-uring kernel-managed buffer ring
Date: Thu, 5 Feb 2026 22:48:30 +0100	[thread overview]
Message-ID: <4e9d0896-e887-47bc-bc82-cb7fe17ec64e@ddn.com> (raw)
In-Reply-To: <CAJnrk1bt8X2E6estPz-xUmUBeQ93rKOpY078kVQCMmjtiVA5eA@mail.gmail.com>



On 2/5/26 22:29, Joanne Koong wrote:
> On Thu, Feb 5, 2026 at 12:49 PM Bernd Schubert <bschubert@ddn.com> wrote:
>>
>>
>>
>> On 2/5/26 21:24, Joanne Koong wrote:
>>> On Tue, Feb 3, 2026 at 3:58 PM Bernd Schubert <bernd@bsbernd.com> wrote:
>>>>
>>>>
>>>>
>>>> On 12/23/25 01:35, Joanne Koong wrote:
>>>>> Add io-uring kernel-managed buffer ring capability for fuse daemons
>>>>> communicating through the io-uring interface.
>>>>>
>>>>> This has two benefits:
>>>>> a) eliminates the overhead of pinning/unpinning user pages and
>>>>> translating virtual addresses for every server-kernel interaction
>>>>>
>>>>> b) reduces the amount of memory needed for the buffers per queue and
>>>>> allows buffers to be reused across entries. Incremental buffer
>>>>> consumption, when added, will allow a buffer to be used across multiple
>>>>> requests.
>>>>>
>>>>> Buffer ring usage is set on a per-queue basis. In order to use this, the
>>>>> daemon needs to have preregistered a kernel-managed buffer ring and a
>>>>> fixed buffer at index 0 that will hold all the headers, and set the
>>>>> "use_bufring" field during registration. The kernel-managed buffer ring
>>>>> will be pinned for the lifetime of the connection.
>>>>>
>>>>> Signed-off-by: Joanne Koong <joannelkoong@gmail.com>
>>>>> ---
>>>>>  fs/fuse/dev_uring.c       | 423 ++++++++++++++++++++++++++++++++------
>>>>>  fs/fuse/dev_uring_i.h     |  30 ++-
>>>>>  include/uapi/linux/fuse.h |  15 +-
>>>>>  3 files changed, 399 insertions(+), 69 deletions(-)
>>>>>
>>>>> diff --git a/fs/fuse/dev_uring.c b/fs/fuse/dev_uring.c
>>>>> @@ -824,21 +1040,29 @@ static void fuse_uring_add_req_to_ring_ent(struct fuse_ring_ent *ent,
>>>>>  }
>>>>>
>>>>>  /* Fetch the next fuse request if available */
>>>>> -static struct fuse_req *fuse_uring_ent_assign_req(struct fuse_ring_ent *ent)
>>>>> +static struct fuse_req *fuse_uring_ent_assign_req(struct fuse_ring_ent *ent,
>>>>> +                                               unsigned int issue_flags)
>>>>>       __must_hold(&queue->lock)
>>>>>  {
>>>>>       struct fuse_req *req;
>>>>>       struct fuse_ring_queue *queue = ent->queue;
>>>>>       struct list_head *req_queue = &queue->fuse_req_queue;
>>>>> +     int err;
>>>>>
>>>>>       lockdep_assert_held(&queue->lock);
>>>>>
>>>>>       /* get and assign the next entry while it is still holding the lock */
>>>>>       req = list_first_entry_or_null(req_queue, struct fuse_req, list);
>>>>> -     if (req)
>>>>> -             fuse_uring_add_req_to_ring_ent(ent, req);
>>>>> +     if (req) {
>>>>> +             err = fuse_uring_next_req_update_buffer(ent, req, issue_flags);
>>>>> +             if (!err) {
>>>>> +                     fuse_uring_add_req_to_ring_ent(ent, req);
>>>>> +                     return req;
>>>>> +             }
>>>>
>>>> Hmm, who/what is going to handle the request if this fails? Let's say we
>>>> have just one ring entry per queue and now it fails here - this ring
>>>> entry will go into FRRS_AVAILABLE and nothing will pull from the queue
>>>> anymore. I guess it _should_ not happen, some protection would be good.
>>>> In order to handle it, at least one other ent needs to be in flight.
>>>
>>> If the queue only has one ring ent and this fails, the request gets
>>> reassigned to the ent whenever ->send_req() is next triggered. I don't
>>> think this is a new edge case introduced by kmbufs; in the existing
>>> code, fuse_uring_commit_fetch() -> fuse_uring_get_next_fuse_req() ->
>>> fuse_uring_send_next_to_ring() -> fuse_uring_prepare_send() could fail
>>> if any of the copying fails, in which case we end up in the same
>>> position of the ent getting assigned the next request whenever
>>> ->send_req() is next triggered.
>>
>> I don't manage to check right now (need to solve another imbalance with
>> reduced rings right now), but every failed copy is *supposed* to end up
> 
> Thanks for your work on the reduced rings, I'm looking forward to
> seeing your patchset.
> 
>> in a request failure. Why should it block, if the copy failed?
>> It would be a bug if it does not right now and should be solved.
>>
>> Regarding your copy, I don't think waiting for for the next ->send_req()
>> is an option, it might be never.
>> One solution might be a single entry in any of the queues or in a
>> separate queue that doesn't have buf-rings - i.e. it can go slowly, but
>> it must not block. Some wake-up task retry might also work, but would be
>> timeout based.
> 
> Ah, so your concern is about the request taking too long to complete,
> not about the ent not being available again to send requests. In the
> existing code, yes if the next request can't be sent after a commit
> then that next request is immediately terminated. For the kmbuf case,
> the fuse_uring_next_req_update_buffer() call only fails if all the
> buffers are currently being used. The request will be picked up when
> the next buffer becomes available / is recycled back, which happens
> when the request(s) sent out to the server completes and is committed,
> if a ->send_req() hasn't already been triggered by then.


In the simple one ring entry example there wouldn't be another in-fly
request - the request would basically hang forever, if some reasons the
ring buffer is not available. It *shouldn't* happen, but what if for
example the same buffer would be used for zery-copy another subsystem
now consumes them? No idea if that could happen - with these buffers
there an additional complexity, which I don't understand from fuse point
of view. Let's just assume there would be some kind of ring buffer bug
and requests would now hang - how would we debug this if it just hangs
without any log or failure message?


Thanks,
Bernd

  reply	other threads:[~2026-02-05 21:48 UTC|newest]

Thread overview: 64+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2025-12-23  0:34 [PATCH v3 00/25] fuse/io-uring: add kernel-managed buffer rings and zero-copy Joanne Koong
2025-12-23  0:34 ` [PATCH v3 01/25] io_uring/kbuf: refactor io_buf_pbuf_register() logic into generic helpers Joanne Koong
2025-12-23  0:34 ` [PATCH v3 02/25] io_uring/kbuf: rename io_unregister_pbuf_ring() to io_unregister_buf_ring() Joanne Koong
2025-12-23  0:35 ` [PATCH v3 03/25] io_uring/kbuf: add support for kernel-managed buffer rings Joanne Koong
2025-12-23  0:35 ` [PATCH v3 04/25] io_uring/kbuf: add mmap " Joanne Koong
2025-12-23  0:35 ` [PATCH v3 05/25] io_uring/kbuf: support kernel-managed buffer rings in buffer selection Joanne Koong
2026-01-03 22:45   ` Caleb Sander Mateos
2026-01-09  0:56     ` Joanne Koong
2025-12-23  0:35 ` [PATCH v3 06/25] io_uring/kbuf: add buffer ring pinning/unpinning Joanne Koong
2025-12-29 21:07   ` Gabriel Krisman Bertazi
2025-12-30  1:27     ` Joanne Koong
2025-12-30 17:54       ` Gabriel Krisman Bertazi
2026-01-02 17:57         ` Joanne Koong
2026-01-08 18:40         ` Caleb Sander Mateos
2026-01-08 19:18   ` Caleb Sander Mateos
2026-01-09  1:04     ` Joanne Koong
2025-12-23  0:35 ` [PATCH v3 07/25] io_uring/kbuf: add recycling for kernel managed buffer rings Joanne Koong
2025-12-29 22:00   ` Gabriel Krisman Bertazi
2025-12-29 22:20     ` Gabriel Krisman Bertazi
2025-12-30  1:15       ` Joanne Koong
2026-01-05 18:49         ` Gabriel Krisman Bertazi
2026-01-08 20:37   ` Caleb Sander Mateos
2026-01-09  1:07     ` Joanne Koong
2025-12-23  0:35 ` [PATCH v3 08/25] io_uring: add io_uring_cmd_fixed_index_get() and io_uring_cmd_fixed_index_put() Joanne Koong
2026-01-08 19:02   ` Caleb Sander Mateos
2026-01-08 20:44     ` Caleb Sander Mateos
2026-01-09  0:55       ` Joanne Koong
2026-01-09  1:08         ` Caleb Sander Mateos
2025-12-23  0:35 ` [PATCH v3 09/25] io_uring/kbuf: add io_uring_cmd_is_kmbuf_ring() Joanne Koong
2025-12-23  0:35 ` [PATCH v3 10/25] io_uring/kbuf: export io_ring_buffer_select() Joanne Koong
2026-01-08 20:34   ` Caleb Sander Mateos
2026-01-09  0:38     ` Joanne Koong
2026-01-09  2:43       ` Caleb Sander Mateos
2025-12-23  0:35 ` [PATCH v3 11/25] io_uring/kbuf: return buffer id in buffer selection Joanne Koong
2025-12-23  0:35 ` [PATCH v3 12/25] io_uring/cmd: set selected buffer index in __io_uring_cmd_done() Joanne Koong
2025-12-23  0:35 ` [PATCH v3 13/25] fuse: refactor io-uring logic for getting next fuse request Joanne Koong
2025-12-23  0:35 ` [PATCH v3 14/25] fuse: refactor io-uring header copying to ring Joanne Koong
2026-01-11 16:03   ` Bernd Schubert
2026-01-16 22:33     ` Joanne Koong
2026-01-27 23:06       ` Bernd Schubert
2025-12-23  0:35 ` [PATCH v3 15/25] fuse: refactor io-uring header copying from ring Joanne Koong
2025-12-23  0:35 ` [PATCH v3 16/25] fuse: use enum types for header copying Joanne Koong
2025-12-23  0:35 ` [PATCH v3 17/25] fuse: refactor setting up copy state for payload copying Joanne Koong
2025-12-23  0:35 ` [PATCH v3 18/25] fuse: support buffer copying for kernel addresses Joanne Koong
2025-12-23  0:35 ` [PATCH v3 19/25] fuse: add io-uring kernel-managed buffer ring Joanne Koong
2026-02-03 23:58   ` Bernd Schubert
2026-02-05 20:24     ` Joanne Koong
2026-02-05 20:49       ` Bernd Schubert
2026-02-05 21:29         ` Joanne Koong
2026-02-05 21:48           ` Bernd Schubert [this message]
2026-02-05 22:19             ` Joanne Koong
2025-12-23  0:35 ` [PATCH v3 20/25] io_uring/rsrc: rename io_buffer_register_bvec()/io_buffer_unregister_bvec() Joanne Koong
2026-01-08 20:52   ` Caleb Sander Mateos
2025-12-23  0:35 ` [PATCH v3 21/25] io_uring/rsrc: split io_buffer_register_request() logic Joanne Koong
2026-01-08 21:04   ` Caleb Sander Mateos
2026-01-09  0:18     ` Joanne Koong
2025-12-23  0:35 ` [PATCH v3 22/25] io_uring/rsrc: Allow buffer release callback to be optional Joanne Koong
2025-12-23  0:35 ` [PATCH v3 23/25] io_uring/rsrc: add io_buffer_register_bvec() Joanne Koong
2026-01-08 21:09   ` Caleb Sander Mateos
2026-01-09  0:10     ` Joanne Koong
2025-12-23  0:35 ` [PATCH v3 24/25] fuse: add zero-copy over io-uring Joanne Koong
2026-01-08 21:15   ` Caleb Sander Mateos
2026-01-09  0:07     ` Joanne Koong
2025-12-23  0:35 ` [PATCH v3 25/25] docs: fuse: add io-uring bufring and zero-copy documentation Joanne Koong

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4e9d0896-e887-47bc-bc82-cb7fe17ec64e@ddn.com \
    --to=bschubert@ddn.com \
    --cc=asml.silence@gmail.com \
    --cc=axboe@kernel.dk \
    --cc=bernd@bsbernd.com \
    --cc=csander@purestorage.com \
    --cc=io-uring@vger.kernel.org \
    --cc=joannelkoong@gmail.com \
    --cc=linux-fsdevel@vger.kernel.org \
    --cc=miklos@szeredi.hu \
    --cc=xiaobing.li@samsung.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox