* [PATCH v2 0/3] account zcrx area pinned memory
@ 2025-07-18 18:59 Pavel Begunkov
2025-07-18 18:59 ` [PATCH v2 1/3] io_uring: export io_[un]account_mem Pavel Begunkov
` (3 more replies)
0 siblings, 4 replies; 6+ messages in thread
From: Pavel Begunkov @ 2025-07-18 18:59 UTC (permalink / raw)
To: io-uring; +Cc: asml.silence, dw
Honour RLIMIT_MEMLOCK while pinning zcrx areas.
v2: Fix null ifq on partial destruction
Fix page leaks on io_zcrx_free_area() failure.
Pavel Begunkov (3):
io_uring: export io_[un]account_mem
io_uring/zcrx: account area memory
io_uring/zcrx: fix leaking pages on sg init fail
io_uring/rsrc.c | 4 ++--
io_uring/rsrc.h | 2 ++
io_uring/zcrx.c | 36 +++++++++++++++++++++++++++++++-----
io_uring/zcrx.h | 1 +
4 files changed, 36 insertions(+), 7 deletions(-)
--
2.49.0
^ permalink raw reply [flat|nested] 6+ messages in thread
* [PATCH v2 1/3] io_uring: export io_[un]account_mem
2025-07-18 18:59 [PATCH v2 0/3] account zcrx area pinned memory Pavel Begunkov
@ 2025-07-18 18:59 ` Pavel Begunkov
2025-07-18 18:59 ` [PATCH v2 2/3] io_uring/zcrx: account area memory Pavel Begunkov
` (2 subsequent siblings)
3 siblings, 0 replies; 6+ messages in thread
From: Pavel Begunkov @ 2025-07-18 18:59 UTC (permalink / raw)
To: io-uring; +Cc: asml.silence, dw
Export pinned memory accounting helpers, they'll be used by zcrx
shortly.
Cc: stable@vger.kernel.org
Fixes: cf96310c5f9a0 ("io_uring/zcrx: add io_zcrx_area")
Signed-off-by: Pavel Begunkov <asml.silence@gmail.com>
---
io_uring/rsrc.c | 4 ++--
io_uring/rsrc.h | 2 ++
2 files changed, 4 insertions(+), 2 deletions(-)
diff --git a/io_uring/rsrc.c b/io_uring/rsrc.c
index 043018bc9b9c..f75f5e43fa4a 100644
--- a/io_uring/rsrc.c
+++ b/io_uring/rsrc.c
@@ -55,7 +55,7 @@ int __io_account_mem(struct user_struct *user, unsigned long nr_pages)
return 0;
}
-static void io_unaccount_mem(struct io_ring_ctx *ctx, unsigned long nr_pages)
+void io_unaccount_mem(struct io_ring_ctx *ctx, unsigned long nr_pages)
{
if (ctx->user)
__io_unaccount_mem(ctx->user, nr_pages);
@@ -64,7 +64,7 @@ static void io_unaccount_mem(struct io_ring_ctx *ctx, unsigned long nr_pages)
atomic64_sub(nr_pages, &ctx->mm_account->pinned_vm);
}
-static int io_account_mem(struct io_ring_ctx *ctx, unsigned long nr_pages)
+int io_account_mem(struct io_ring_ctx *ctx, unsigned long nr_pages)
{
int ret;
diff --git a/io_uring/rsrc.h b/io_uring/rsrc.h
index 25e7e998dcfd..a3ca6ba66596 100644
--- a/io_uring/rsrc.h
+++ b/io_uring/rsrc.h
@@ -120,6 +120,8 @@ int io_files_update(struct io_kiocb *req, unsigned int issue_flags);
int io_files_update_prep(struct io_kiocb *req, const struct io_uring_sqe *sqe);
int __io_account_mem(struct user_struct *user, unsigned long nr_pages);
+int io_account_mem(struct io_ring_ctx *ctx, unsigned long nr_pages);
+void io_unaccount_mem(struct io_ring_ctx *ctx, unsigned long nr_pages);
static inline void __io_unaccount_mem(struct user_struct *user,
unsigned long nr_pages)
--
2.49.0
^ permalink raw reply related [flat|nested] 6+ messages in thread
* [PATCH v2 2/3] io_uring/zcrx: account area memory
2025-07-18 18:59 [PATCH v2 0/3] account zcrx area pinned memory Pavel Begunkov
2025-07-18 18:59 ` [PATCH v2 1/3] io_uring: export io_[un]account_mem Pavel Begunkov
@ 2025-07-18 18:59 ` Pavel Begunkov
2025-07-18 18:59 ` [PATCH v2 3/3] io_uring/zcrx: fix leaking pages on sg init fail Pavel Begunkov
2025-07-18 19:01 ` [PATCH v2 0/3] account zcrx area pinned memory Pavel Begunkov
3 siblings, 0 replies; 6+ messages in thread
From: Pavel Begunkov @ 2025-07-18 18:59 UTC (permalink / raw)
To: io-uring; +Cc: asml.silence, dw
zcrx areas can be quite large and need to be accounted and checked
against RLIMIT_MEMLOCK. In practise it shouldn't be a big issue as
the inteface already requires cap_net_admin.
Cc: stable@vger.kernel.org
Fixes: cf96310c5f9a0 ("io_uring/zcrx: add io_zcrx_area")
Signed-off-by: Pavel Begunkov <asml.silence@gmail.com>
---
io_uring/zcrx.c | 32 ++++++++++++++++++++++++++++----
io_uring/zcrx.h | 1 +
2 files changed, 29 insertions(+), 4 deletions(-)
diff --git a/io_uring/zcrx.c b/io_uring/zcrx.c
index 7d7396ce876c..4f9191f922a1 100644
--- a/io_uring/zcrx.c
+++ b/io_uring/zcrx.c
@@ -158,6 +158,23 @@ static int io_zcrx_map_area_dmabuf(struct io_zcrx_ifq *ifq, struct io_zcrx_area
area->mem.dmabuf_offset);
}
+static unsigned long io_count_account_pages(struct page **pages, unsigned nr_pages)
+{
+ struct folio *last_folio = NULL;
+ unsigned long res = 0;
+ int i;
+
+ for (i = 0; i < nr_pages; i++) {
+ struct folio *folio = page_folio(pages[i]);
+
+ if (folio == last_folio)
+ continue;
+ last_folio = folio;
+ res += 1UL << folio_order(folio);
+ }
+ return res;
+}
+
static int io_import_umem(struct io_zcrx_ifq *ifq,
struct io_zcrx_mem *mem,
struct io_uring_zcrx_area_reg *area_reg)
@@ -180,10 +197,15 @@ static int io_import_umem(struct io_zcrx_ifq *ifq,
if (ret)
return ret;
+ mem->account_pages = io_count_account_pages(pages, nr_pages);
+ ret = io_account_mem(ifq->ctx, mem->account_pages);
+ if (ret)
+ mem->account_pages = 0;
+
mem->pages = pages;
mem->nr_folios = nr_pages;
mem->size = area_reg->len;
- return 0;
+ return ret;
}
static void io_release_area_mem(struct io_zcrx_mem *mem)
@@ -353,10 +375,12 @@ static void io_free_rbuf_ring(struct io_zcrx_ifq *ifq)
static void io_zcrx_free_area(struct io_zcrx_area *area)
{
- if (area->ifq)
- io_zcrx_unmap_area(area->ifq, area);
+ io_zcrx_unmap_area(area->ifq, area);
io_release_area_mem(&area->mem);
+ if (area->mem.account_pages)
+ io_unaccount_mem(area->ifq->ctx, area->mem.account_pages);
+
kvfree(area->freelist);
kvfree(area->nia.niovs);
kvfree(area->user_refs);
@@ -384,6 +408,7 @@ static int io_zcrx_create_area(struct io_zcrx_ifq *ifq,
area = kzalloc(sizeof(*area), GFP_KERNEL);
if (!area)
goto err;
+ area->ifq = ifq;
ret = io_import_area(ifq, &area->mem, area_reg);
if (ret)
@@ -418,7 +443,6 @@ static int io_zcrx_create_area(struct io_zcrx_ifq *ifq,
}
area->free_count = nr_iovs;
- area->ifq = ifq;
/* we're only supporting one area per ifq for now */
area->area_id = 0;
area_reg->rq_area_token = (u64)area->area_id << IORING_ZCRX_AREA_SHIFT;
diff --git a/io_uring/zcrx.h b/io_uring/zcrx.h
index 89015b923911..109c4ca36434 100644
--- a/io_uring/zcrx.h
+++ b/io_uring/zcrx.h
@@ -15,6 +15,7 @@ struct io_zcrx_mem {
struct page **pages;
unsigned long nr_folios;
struct sg_table page_sg_table;
+ unsigned long account_pages;
struct dma_buf_attachment *attach;
struct dma_buf *dmabuf;
--
2.49.0
^ permalink raw reply related [flat|nested] 6+ messages in thread
* [PATCH v2 3/3] io_uring/zcrx: fix leaking pages on sg init fail
2025-07-18 18:59 [PATCH v2 0/3] account zcrx area pinned memory Pavel Begunkov
2025-07-18 18:59 ` [PATCH v2 1/3] io_uring: export io_[un]account_mem Pavel Begunkov
2025-07-18 18:59 ` [PATCH v2 2/3] io_uring/zcrx: account area memory Pavel Begunkov
@ 2025-07-18 18:59 ` Pavel Begunkov
2025-07-18 19:01 ` [PATCH v2 0/3] account zcrx area pinned memory Pavel Begunkov
3 siblings, 0 replies; 6+ messages in thread
From: Pavel Begunkov @ 2025-07-18 18:59 UTC (permalink / raw)
To: io-uring; +Cc: asml.silence, dw
If sg_alloc_table_from_pages() fails, io_import_umem() returns without
cleaning up pinned pages first. Fix it.
Fixes: b84621d96ee02 ("io_uring/zcrx: allocate sgtable for umem areas")
Signed-off-by: Pavel Begunkov <asml.silence@gmail.com>
---
io_uring/zcrx.c | 4 +++-
1 file changed, 3 insertions(+), 1 deletion(-)
diff --git a/io_uring/zcrx.c b/io_uring/zcrx.c
index 4f9191f922a1..2c5d4e7c3b47 100644
--- a/io_uring/zcrx.c
+++ b/io_uring/zcrx.c
@@ -194,8 +194,10 @@ static int io_import_umem(struct io_zcrx_ifq *ifq,
ret = sg_alloc_table_from_pages(&mem->page_sg_table, pages, nr_pages,
0, nr_pages << PAGE_SHIFT,
GFP_KERNEL_ACCOUNT);
- if (ret)
+ if (ret) {
+ unpin_user_pages(pages, nr_pages);
return ret;
+ }
mem->account_pages = io_count_account_pages(pages, nr_pages);
ret = io_account_mem(ifq->ctx, mem->account_pages);
--
2.49.0
^ permalink raw reply related [flat|nested] 6+ messages in thread
* Re: [PATCH v2 0/3] account zcrx area pinned memory
2025-07-18 18:59 [PATCH v2 0/3] account zcrx area pinned memory Pavel Begunkov
` (2 preceding siblings ...)
2025-07-18 18:59 ` [PATCH v2 3/3] io_uring/zcrx: fix leaking pages on sg init fail Pavel Begunkov
@ 2025-07-18 19:01 ` Pavel Begunkov
2025-07-18 21:18 ` Jens Axboe
3 siblings, 1 reply; 6+ messages in thread
From: Pavel Begunkov @ 2025-07-18 19:01 UTC (permalink / raw)
To: io-uring; +Cc: dw
On 7/18/25 19:59, Pavel Begunkov wrote:
> Honour RLIMIT_MEMLOCK while pinning zcrx areas.
It appeared cleaner resending the whole thing. Let me know
if a fixup patch is preferable.
--
Pavel Begunkov
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: [PATCH v2 0/3] account zcrx area pinned memory
2025-07-18 19:01 ` [PATCH v2 0/3] account zcrx area pinned memory Pavel Begunkov
@ 2025-07-18 21:18 ` Jens Axboe
0 siblings, 0 replies; 6+ messages in thread
From: Jens Axboe @ 2025-07-18 21:18 UTC (permalink / raw)
To: Pavel Begunkov, io-uring; +Cc: dw
On 7/18/25 1:01 PM, Pavel Begunkov wrote:
> On 7/18/25 19:59, Pavel Begunkov wrote:
>> Honour RLIMIT_MEMLOCK while pinning zcrx areas.
>
> It appeared cleaner resending the whole thing. Let me know
> if a fixup patch is preferable.
Fixup would be better in this case, as there's already other
patches on top at this point.
--
Jens Axboe
^ permalink raw reply [flat|nested] 6+ messages in thread
end of thread, other threads:[~2025-07-18 21:18 UTC | newest]
Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2025-07-18 18:59 [PATCH v2 0/3] account zcrx area pinned memory Pavel Begunkov
2025-07-18 18:59 ` [PATCH v2 1/3] io_uring: export io_[un]account_mem Pavel Begunkov
2025-07-18 18:59 ` [PATCH v2 2/3] io_uring/zcrx: account area memory Pavel Begunkov
2025-07-18 18:59 ` [PATCH v2 3/3] io_uring/zcrx: fix leaking pages on sg init fail Pavel Begunkov
2025-07-18 19:01 ` [PATCH v2 0/3] account zcrx area pinned memory Pavel Begunkov
2025-07-18 21:18 ` Jens Axboe
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox