public inbox for io-uring@vger.kernel.org
 help / color / mirror / Atom feed
From: Joanne Koong <joannelkoong@gmail.com>
To: Bernd Schubert <bschubert@ddn.com>
Cc: Bernd Schubert <bernd@bsbernd.com>,
	miklos@szeredi.hu, axboe@kernel.dk,  asml.silence@gmail.com,
	io-uring@vger.kernel.org, csander@purestorage.com,
	 xiaobing.li@samsung.com, linux-fsdevel@vger.kernel.org
Subject: Re: [PATCH v3 19/25] fuse: add io-uring kernel-managed buffer ring
Date: Thu, 5 Feb 2026 13:29:56 -0800	[thread overview]
Message-ID: <CAJnrk1bt8X2E6estPz-xUmUBeQ93rKOpY078kVQCMmjtiVA5eA@mail.gmail.com> (raw)
In-Reply-To: <ffe1a340-4759-441b-b04d-2ae7732bbd35@ddn.com>

On Thu, Feb 5, 2026 at 12:49 PM Bernd Schubert <bschubert@ddn.com> wrote:
>
>
>
> On 2/5/26 21:24, Joanne Koong wrote:
> > On Tue, Feb 3, 2026 at 3:58 PM Bernd Schubert <bernd@bsbernd.com> wrote:
> >>
> >>
> >>
> >> On 12/23/25 01:35, Joanne Koong wrote:
> >>> Add io-uring kernel-managed buffer ring capability for fuse daemons
> >>> communicating through the io-uring interface.
> >>>
> >>> This has two benefits:
> >>> a) eliminates the overhead of pinning/unpinning user pages and
> >>> translating virtual addresses for every server-kernel interaction
> >>>
> >>> b) reduces the amount of memory needed for the buffers per queue and
> >>> allows buffers to be reused across entries. Incremental buffer
> >>> consumption, when added, will allow a buffer to be used across multiple
> >>> requests.
> >>>
> >>> Buffer ring usage is set on a per-queue basis. In order to use this, the
> >>> daemon needs to have preregistered a kernel-managed buffer ring and a
> >>> fixed buffer at index 0 that will hold all the headers, and set the
> >>> "use_bufring" field during registration. The kernel-managed buffer ring
> >>> will be pinned for the lifetime of the connection.
> >>>
> >>> Signed-off-by: Joanne Koong <joannelkoong@gmail.com>
> >>> ---
> >>>  fs/fuse/dev_uring.c       | 423 ++++++++++++++++++++++++++++++++------
> >>>  fs/fuse/dev_uring_i.h     |  30 ++-
> >>>  include/uapi/linux/fuse.h |  15 +-
> >>>  3 files changed, 399 insertions(+), 69 deletions(-)
> >>>
> >>> diff --git a/fs/fuse/dev_uring.c b/fs/fuse/dev_uring.c
> >>> @@ -824,21 +1040,29 @@ static void fuse_uring_add_req_to_ring_ent(struct fuse_ring_ent *ent,
> >>>  }
> >>>
> >>>  /* Fetch the next fuse request if available */
> >>> -static struct fuse_req *fuse_uring_ent_assign_req(struct fuse_ring_ent *ent)
> >>> +static struct fuse_req *fuse_uring_ent_assign_req(struct fuse_ring_ent *ent,
> >>> +                                               unsigned int issue_flags)
> >>>       __must_hold(&queue->lock)
> >>>  {
> >>>       struct fuse_req *req;
> >>>       struct fuse_ring_queue *queue = ent->queue;
> >>>       struct list_head *req_queue = &queue->fuse_req_queue;
> >>> +     int err;
> >>>
> >>>       lockdep_assert_held(&queue->lock);
> >>>
> >>>       /* get and assign the next entry while it is still holding the lock */
> >>>       req = list_first_entry_or_null(req_queue, struct fuse_req, list);
> >>> -     if (req)
> >>> -             fuse_uring_add_req_to_ring_ent(ent, req);
> >>> +     if (req) {
> >>> +             err = fuse_uring_next_req_update_buffer(ent, req, issue_flags);
> >>> +             if (!err) {
> >>> +                     fuse_uring_add_req_to_ring_ent(ent, req);
> >>> +                     return req;
> >>> +             }
> >>
> >> Hmm, who/what is going to handle the request if this fails? Let's say we
> >> have just one ring entry per queue and now it fails here - this ring
> >> entry will go into FRRS_AVAILABLE and nothing will pull from the queue
> >> anymore. I guess it _should_ not happen, some protection would be good.
> >> In order to handle it, at least one other ent needs to be in flight.
> >
> > If the queue only has one ring ent and this fails, the request gets
> > reassigned to the ent whenever ->send_req() is next triggered. I don't
> > think this is a new edge case introduced by kmbufs; in the existing
> > code, fuse_uring_commit_fetch() -> fuse_uring_get_next_fuse_req() ->
> > fuse_uring_send_next_to_ring() -> fuse_uring_prepare_send() could fail
> > if any of the copying fails, in which case we end up in the same
> > position of the ent getting assigned the next request whenever
> > ->send_req() is next triggered.
>
> I don't manage to check right now (need to solve another imbalance with
> reduced rings right now), but every failed copy is *supposed* to end up

Thanks for your work on the reduced rings, I'm looking forward to
seeing your patchset.

> in a request failure. Why should it block, if the copy failed?
> It would be a bug if it does not right now and should be solved.
>
> Regarding your copy, I don't think waiting for for the next ->send_req()
> is an option, it might be never.
> One solution might be a single entry in any of the queues or in a
> separate queue that doesn't have buf-rings - i.e. it can go slowly, but
> it must not block. Some wake-up task retry might also work, but would be
> timeout based.

Ah, so your concern is about the request taking too long to complete,
not about the ent not being available again to send requests. In the
existing code, yes if the next request can't be sent after a commit
then that next request is immediately terminated. For the kmbuf case,
the fuse_uring_next_req_update_buffer() call only fails if all the
buffers are currently being used. The request will be picked up when
the next buffer becomes available / is recycled back, which happens
when the request(s) sent out to the server completes and is committed,
if a ->send_req() hasn't already been triggered by then.

Thanks,
Joanne


>
>
> Thanks,
> Bernd
>

  reply	other threads:[~2026-02-05 21:30 UTC|newest]

Thread overview: 64+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2025-12-23  0:34 [PATCH v3 00/25] fuse/io-uring: add kernel-managed buffer rings and zero-copy Joanne Koong
2025-12-23  0:34 ` [PATCH v3 01/25] io_uring/kbuf: refactor io_buf_pbuf_register() logic into generic helpers Joanne Koong
2025-12-23  0:34 ` [PATCH v3 02/25] io_uring/kbuf: rename io_unregister_pbuf_ring() to io_unregister_buf_ring() Joanne Koong
2025-12-23  0:35 ` [PATCH v3 03/25] io_uring/kbuf: add support for kernel-managed buffer rings Joanne Koong
2025-12-23  0:35 ` [PATCH v3 04/25] io_uring/kbuf: add mmap " Joanne Koong
2025-12-23  0:35 ` [PATCH v3 05/25] io_uring/kbuf: support kernel-managed buffer rings in buffer selection Joanne Koong
2026-01-03 22:45   ` Caleb Sander Mateos
2026-01-09  0:56     ` Joanne Koong
2025-12-23  0:35 ` [PATCH v3 06/25] io_uring/kbuf: add buffer ring pinning/unpinning Joanne Koong
2025-12-29 21:07   ` Gabriel Krisman Bertazi
2025-12-30  1:27     ` Joanne Koong
2025-12-30 17:54       ` Gabriel Krisman Bertazi
2026-01-02 17:57         ` Joanne Koong
2026-01-08 18:40         ` Caleb Sander Mateos
2026-01-08 19:18   ` Caleb Sander Mateos
2026-01-09  1:04     ` Joanne Koong
2025-12-23  0:35 ` [PATCH v3 07/25] io_uring/kbuf: add recycling for kernel managed buffer rings Joanne Koong
2025-12-29 22:00   ` Gabriel Krisman Bertazi
2025-12-29 22:20     ` Gabriel Krisman Bertazi
2025-12-30  1:15       ` Joanne Koong
2026-01-05 18:49         ` Gabriel Krisman Bertazi
2026-01-08 20:37   ` Caleb Sander Mateos
2026-01-09  1:07     ` Joanne Koong
2025-12-23  0:35 ` [PATCH v3 08/25] io_uring: add io_uring_cmd_fixed_index_get() and io_uring_cmd_fixed_index_put() Joanne Koong
2026-01-08 19:02   ` Caleb Sander Mateos
2026-01-08 20:44     ` Caleb Sander Mateos
2026-01-09  0:55       ` Joanne Koong
2026-01-09  1:08         ` Caleb Sander Mateos
2025-12-23  0:35 ` [PATCH v3 09/25] io_uring/kbuf: add io_uring_cmd_is_kmbuf_ring() Joanne Koong
2025-12-23  0:35 ` [PATCH v3 10/25] io_uring/kbuf: export io_ring_buffer_select() Joanne Koong
2026-01-08 20:34   ` Caleb Sander Mateos
2026-01-09  0:38     ` Joanne Koong
2026-01-09  2:43       ` Caleb Sander Mateos
2025-12-23  0:35 ` [PATCH v3 11/25] io_uring/kbuf: return buffer id in buffer selection Joanne Koong
2025-12-23  0:35 ` [PATCH v3 12/25] io_uring/cmd: set selected buffer index in __io_uring_cmd_done() Joanne Koong
2025-12-23  0:35 ` [PATCH v3 13/25] fuse: refactor io-uring logic for getting next fuse request Joanne Koong
2025-12-23  0:35 ` [PATCH v3 14/25] fuse: refactor io-uring header copying to ring Joanne Koong
2026-01-11 16:03   ` Bernd Schubert
2026-01-16 22:33     ` Joanne Koong
2026-01-27 23:06       ` Bernd Schubert
2025-12-23  0:35 ` [PATCH v3 15/25] fuse: refactor io-uring header copying from ring Joanne Koong
2025-12-23  0:35 ` [PATCH v3 16/25] fuse: use enum types for header copying Joanne Koong
2025-12-23  0:35 ` [PATCH v3 17/25] fuse: refactor setting up copy state for payload copying Joanne Koong
2025-12-23  0:35 ` [PATCH v3 18/25] fuse: support buffer copying for kernel addresses Joanne Koong
2025-12-23  0:35 ` [PATCH v3 19/25] fuse: add io-uring kernel-managed buffer ring Joanne Koong
2026-02-03 23:58   ` Bernd Schubert
2026-02-05 20:24     ` Joanne Koong
2026-02-05 20:49       ` Bernd Schubert
2026-02-05 21:29         ` Joanne Koong [this message]
2026-02-05 21:48           ` Bernd Schubert
2026-02-05 22:19             ` Joanne Koong
2025-12-23  0:35 ` [PATCH v3 20/25] io_uring/rsrc: rename io_buffer_register_bvec()/io_buffer_unregister_bvec() Joanne Koong
2026-01-08 20:52   ` Caleb Sander Mateos
2025-12-23  0:35 ` [PATCH v3 21/25] io_uring/rsrc: split io_buffer_register_request() logic Joanne Koong
2026-01-08 21:04   ` Caleb Sander Mateos
2026-01-09  0:18     ` Joanne Koong
2025-12-23  0:35 ` [PATCH v3 22/25] io_uring/rsrc: Allow buffer release callback to be optional Joanne Koong
2025-12-23  0:35 ` [PATCH v3 23/25] io_uring/rsrc: add io_buffer_register_bvec() Joanne Koong
2026-01-08 21:09   ` Caleb Sander Mateos
2026-01-09  0:10     ` Joanne Koong
2025-12-23  0:35 ` [PATCH v3 24/25] fuse: add zero-copy over io-uring Joanne Koong
2026-01-08 21:15   ` Caleb Sander Mateos
2026-01-09  0:07     ` Joanne Koong
2025-12-23  0:35 ` [PATCH v3 25/25] docs: fuse: add io-uring bufring and zero-copy documentation Joanne Koong

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=CAJnrk1bt8X2E6estPz-xUmUBeQ93rKOpY078kVQCMmjtiVA5eA@mail.gmail.com \
    --to=joannelkoong@gmail.com \
    --cc=asml.silence@gmail.com \
    --cc=axboe@kernel.dk \
    --cc=bernd@bsbernd.com \
    --cc=bschubert@ddn.com \
    --cc=csander@purestorage.com \
    --cc=io-uring@vger.kernel.org \
    --cc=linux-fsdevel@vger.kernel.org \
    --cc=miklos@szeredi.hu \
    --cc=xiaobing.li@samsung.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox