From: Joanne Koong <joannelkoong@gmail.com>
To: Bernd Schubert <bschubert@ddn.com>
Cc: Bernd Schubert <bernd@bsbernd.com>,
miklos@szeredi.hu, axboe@kernel.dk, asml.silence@gmail.com,
io-uring@vger.kernel.org, csander@purestorage.com,
xiaobing.li@samsung.com, linux-fsdevel@vger.kernel.org
Subject: Re: [PATCH v3 19/25] fuse: add io-uring kernel-managed buffer ring
Date: Thu, 5 Feb 2026 14:19:05 -0800 [thread overview]
Message-ID: <CAJnrk1Z7gNunoYtQoJMrm+xFAwGPTZg0brYhdeLvZ0oBMxiwoA@mail.gmail.com> (raw)
In-Reply-To: <4e9d0896-e887-47bc-bc82-cb7fe17ec64e@ddn.com>
On Thu, Feb 5, 2026 at 1:48 PM Bernd Schubert <bschubert@ddn.com> wrote:
>
>
>
> On 2/5/26 22:29, Joanne Koong wrote:
> > On Thu, Feb 5, 2026 at 12:49 PM Bernd Schubert <bschubert@ddn.com> wrote:
> >>
> >>
> >>
> >> On 2/5/26 21:24, Joanne Koong wrote:
> >>> On Tue, Feb 3, 2026 at 3:58 PM Bernd Schubert <bernd@bsbernd.com> wrote:
> >>>>
> >>>>
> >>>>
> >>>> On 12/23/25 01:35, Joanne Koong wrote:
> >>>>> Add io-uring kernel-managed buffer ring capability for fuse daemons
> >>>>> communicating through the io-uring interface.
> >>>>>
> >>>>> This has two benefits:
> >>>>> a) eliminates the overhead of pinning/unpinning user pages and
> >>>>> translating virtual addresses for every server-kernel interaction
> >>>>>
> >>>>> b) reduces the amount of memory needed for the buffers per queue and
> >>>>> allows buffers to be reused across entries. Incremental buffer
> >>>>> consumption, when added, will allow a buffer to be used across multiple
> >>>>> requests.
> >>>>>
> >>>>> Buffer ring usage is set on a per-queue basis. In order to use this, the
> >>>>> daemon needs to have preregistered a kernel-managed buffer ring and a
> >>>>> fixed buffer at index 0 that will hold all the headers, and set the
> >>>>> "use_bufring" field during registration. The kernel-managed buffer ring
> >>>>> will be pinned for the lifetime of the connection.
> >>>>>
> >>>>> Signed-off-by: Joanne Koong <joannelkoong@gmail.com>
> >>>>> ---
> >>>>> fs/fuse/dev_uring.c | 423 ++++++++++++++++++++++++++++++++------
> >>>>> fs/fuse/dev_uring_i.h | 30 ++-
> >>>>> include/uapi/linux/fuse.h | 15 +-
> >>>>> 3 files changed, 399 insertions(+), 69 deletions(-)
> >>>>>
> >>>>> diff --git a/fs/fuse/dev_uring.c b/fs/fuse/dev_uring.c
> >>>>> @@ -824,21 +1040,29 @@ static void fuse_uring_add_req_to_ring_ent(struct fuse_ring_ent *ent,
> >>>>> }
> >>>>>
> >>>>> /* Fetch the next fuse request if available */
> >>>>> -static struct fuse_req *fuse_uring_ent_assign_req(struct fuse_ring_ent *ent)
> >>>>> +static struct fuse_req *fuse_uring_ent_assign_req(struct fuse_ring_ent *ent,
> >>>>> + unsigned int issue_flags)
> >>>>> __must_hold(&queue->lock)
> >>>>> {
> >>>>> struct fuse_req *req;
> >>>>> struct fuse_ring_queue *queue = ent->queue;
> >>>>> struct list_head *req_queue = &queue->fuse_req_queue;
> >>>>> + int err;
> >>>>>
> >>>>> lockdep_assert_held(&queue->lock);
> >>>>>
> >>>>> /* get and assign the next entry while it is still holding the lock */
> >>>>> req = list_first_entry_or_null(req_queue, struct fuse_req, list);
> >>>>> - if (req)
> >>>>> - fuse_uring_add_req_to_ring_ent(ent, req);
> >>>>> + if (req) {
> >>>>> + err = fuse_uring_next_req_update_buffer(ent, req, issue_flags);
> >>>>> + if (!err) {
> >>>>> + fuse_uring_add_req_to_ring_ent(ent, req);
> >>>>> + return req;
> >>>>> + }
> >>>>
> >>>> Hmm, who/what is going to handle the request if this fails? Let's say we
> >>>> have just one ring entry per queue and now it fails here - this ring
> >>>> entry will go into FRRS_AVAILABLE and nothing will pull from the queue
> >>>> anymore. I guess it _should_ not happen, some protection would be good.
> >>>> In order to handle it, at least one other ent needs to be in flight.
> >>>
> >>> If the queue only has one ring ent and this fails, the request gets
> >>> reassigned to the ent whenever ->send_req() is next triggered. I don't
> >>> think this is a new edge case introduced by kmbufs; in the existing
> >>> code, fuse_uring_commit_fetch() -> fuse_uring_get_next_fuse_req() ->
> >>> fuse_uring_send_next_to_ring() -> fuse_uring_prepare_send() could fail
> >>> if any of the copying fails, in which case we end up in the same
> >>> position of the ent getting assigned the next request whenever
> >>> ->send_req() is next triggered.
> >>
> >> I don't manage to check right now (need to solve another imbalance with
> >> reduced rings right now), but every failed copy is *supposed* to end up
> >
> > Thanks for your work on the reduced rings, I'm looking forward to
> > seeing your patchset.
> >
> >> in a request failure. Why should it block, if the copy failed?
> >> It would be a bug if it does not right now and should be solved.
> >>
> >> Regarding your copy, I don't think waiting for for the next ->send_req()
> >> is an option, it might be never.
> >> One solution might be a single entry in any of the queues or in a
> >> separate queue that doesn't have buf-rings - i.e. it can go slowly, but
> >> it must not block. Some wake-up task retry might also work, but would be
> >> timeout based.
> >
> > Ah, so your concern is about the request taking too long to complete,
> > not about the ent not being available again to send requests. In the
> > existing code, yes if the next request can't be sent after a commit
> > then that next request is immediately terminated. For the kmbuf case,
> > the fuse_uring_next_req_update_buffer() call only fails if all the
> > buffers are currently being used. The request will be picked up when
> > the next buffer becomes available / is recycled back, which happens
> > when the request(s) sent out to the server completes and is committed,
> > if a ->send_req() hasn't already been triggered by then.
>
>
> In the simple one ring entry example there wouldn't be another in-fly
> request - the request would basically hang forever, if some reasons the
If there's no in-flight request, then the buffer will always be
available since there's no other request using it. For the one
ring-entry example with 1 registered buffer in the bufring,
request A -> reserves buffer 1 -> request A gets sent to the server
request B gets enqueued -> no ent is available -> request is added to
the list for later servicing
server sends a reply for request A -> kernel processes the reply in
fuse_uring_commit_fetch() -> buffer 1 is recycled -> server gets the
next request (request B) -> server uses buffer 1 to service request B
> ring buffer is not available. It *shouldn't* happen, but what if for
> example the same buffer would be used for zery-copy another subsystem
> now consumes them? No idea if that could happen - with these buffers
The buffer is registered by libfuse for the fuse connection and can
*only* be used by that fuse connection. It cannot be used by another
subsystem.
> there an additional complexity, which I don't understand from fuse point
> of view. Let's just assume there would be some kind of ring buffer bug
> and requests would now hang - how would we debug this if it just hangs
> without any log or failure message?
The ring buffer can only run out of entries if there are in-flight
requests that are currently using all the buffers. Those requests will
complete and recycle back the buffer they used and fetch the next
request, which will use the buffer that was just recycled back.
Thanks,
Joanne
>
>
> Thanks,
> Bernd
next prev parent reply other threads:[~2026-02-05 22:19 UTC|newest]
Thread overview: 64+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-12-23 0:34 [PATCH v3 00/25] fuse/io-uring: add kernel-managed buffer rings and zero-copy Joanne Koong
2025-12-23 0:34 ` [PATCH v3 01/25] io_uring/kbuf: refactor io_buf_pbuf_register() logic into generic helpers Joanne Koong
2025-12-23 0:34 ` [PATCH v3 02/25] io_uring/kbuf: rename io_unregister_pbuf_ring() to io_unregister_buf_ring() Joanne Koong
2025-12-23 0:35 ` [PATCH v3 03/25] io_uring/kbuf: add support for kernel-managed buffer rings Joanne Koong
2025-12-23 0:35 ` [PATCH v3 04/25] io_uring/kbuf: add mmap " Joanne Koong
2025-12-23 0:35 ` [PATCH v3 05/25] io_uring/kbuf: support kernel-managed buffer rings in buffer selection Joanne Koong
2026-01-03 22:45 ` Caleb Sander Mateos
2026-01-09 0:56 ` Joanne Koong
2025-12-23 0:35 ` [PATCH v3 06/25] io_uring/kbuf: add buffer ring pinning/unpinning Joanne Koong
2025-12-29 21:07 ` Gabriel Krisman Bertazi
2025-12-30 1:27 ` Joanne Koong
2025-12-30 17:54 ` Gabriel Krisman Bertazi
2026-01-02 17:57 ` Joanne Koong
2026-01-08 18:40 ` Caleb Sander Mateos
2026-01-08 19:18 ` Caleb Sander Mateos
2026-01-09 1:04 ` Joanne Koong
2025-12-23 0:35 ` [PATCH v3 07/25] io_uring/kbuf: add recycling for kernel managed buffer rings Joanne Koong
2025-12-29 22:00 ` Gabriel Krisman Bertazi
2025-12-29 22:20 ` Gabriel Krisman Bertazi
2025-12-30 1:15 ` Joanne Koong
2026-01-05 18:49 ` Gabriel Krisman Bertazi
2026-01-08 20:37 ` Caleb Sander Mateos
2026-01-09 1:07 ` Joanne Koong
2025-12-23 0:35 ` [PATCH v3 08/25] io_uring: add io_uring_cmd_fixed_index_get() and io_uring_cmd_fixed_index_put() Joanne Koong
2026-01-08 19:02 ` Caleb Sander Mateos
2026-01-08 20:44 ` Caleb Sander Mateos
2026-01-09 0:55 ` Joanne Koong
2026-01-09 1:08 ` Caleb Sander Mateos
2025-12-23 0:35 ` [PATCH v3 09/25] io_uring/kbuf: add io_uring_cmd_is_kmbuf_ring() Joanne Koong
2025-12-23 0:35 ` [PATCH v3 10/25] io_uring/kbuf: export io_ring_buffer_select() Joanne Koong
2026-01-08 20:34 ` Caleb Sander Mateos
2026-01-09 0:38 ` Joanne Koong
2026-01-09 2:43 ` Caleb Sander Mateos
2025-12-23 0:35 ` [PATCH v3 11/25] io_uring/kbuf: return buffer id in buffer selection Joanne Koong
2025-12-23 0:35 ` [PATCH v3 12/25] io_uring/cmd: set selected buffer index in __io_uring_cmd_done() Joanne Koong
2025-12-23 0:35 ` [PATCH v3 13/25] fuse: refactor io-uring logic for getting next fuse request Joanne Koong
2025-12-23 0:35 ` [PATCH v3 14/25] fuse: refactor io-uring header copying to ring Joanne Koong
2026-01-11 16:03 ` Bernd Schubert
2026-01-16 22:33 ` Joanne Koong
2026-01-27 23:06 ` Bernd Schubert
2025-12-23 0:35 ` [PATCH v3 15/25] fuse: refactor io-uring header copying from ring Joanne Koong
2025-12-23 0:35 ` [PATCH v3 16/25] fuse: use enum types for header copying Joanne Koong
2025-12-23 0:35 ` [PATCH v3 17/25] fuse: refactor setting up copy state for payload copying Joanne Koong
2025-12-23 0:35 ` [PATCH v3 18/25] fuse: support buffer copying for kernel addresses Joanne Koong
2025-12-23 0:35 ` [PATCH v3 19/25] fuse: add io-uring kernel-managed buffer ring Joanne Koong
2026-02-03 23:58 ` Bernd Schubert
2026-02-05 20:24 ` Joanne Koong
2026-02-05 20:49 ` Bernd Schubert
2026-02-05 21:29 ` Joanne Koong
2026-02-05 21:48 ` Bernd Schubert
2026-02-05 22:19 ` Joanne Koong [this message]
2025-12-23 0:35 ` [PATCH v3 20/25] io_uring/rsrc: rename io_buffer_register_bvec()/io_buffer_unregister_bvec() Joanne Koong
2026-01-08 20:52 ` Caleb Sander Mateos
2025-12-23 0:35 ` [PATCH v3 21/25] io_uring/rsrc: split io_buffer_register_request() logic Joanne Koong
2026-01-08 21:04 ` Caleb Sander Mateos
2026-01-09 0:18 ` Joanne Koong
2025-12-23 0:35 ` [PATCH v3 22/25] io_uring/rsrc: Allow buffer release callback to be optional Joanne Koong
2025-12-23 0:35 ` [PATCH v3 23/25] io_uring/rsrc: add io_buffer_register_bvec() Joanne Koong
2026-01-08 21:09 ` Caleb Sander Mateos
2026-01-09 0:10 ` Joanne Koong
2025-12-23 0:35 ` [PATCH v3 24/25] fuse: add zero-copy over io-uring Joanne Koong
2026-01-08 21:15 ` Caleb Sander Mateos
2026-01-09 0:07 ` Joanne Koong
2025-12-23 0:35 ` [PATCH v3 25/25] docs: fuse: add io-uring bufring and zero-copy documentation Joanne Koong
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=CAJnrk1Z7gNunoYtQoJMrm+xFAwGPTZg0brYhdeLvZ0oBMxiwoA@mail.gmail.com \
--to=joannelkoong@gmail.com \
--cc=asml.silence@gmail.com \
--cc=axboe@kernel.dk \
--cc=bernd@bsbernd.com \
--cc=bschubert@ddn.com \
--cc=csander@purestorage.com \
--cc=io-uring@vger.kernel.org \
--cc=linux-fsdevel@vger.kernel.org \
--cc=miklos@szeredi.hu \
--cc=xiaobing.li@samsung.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox