From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from mail-ed1-f41.google.com (mail-ed1-f41.google.com [209.85.208.41]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 911BB3370FE for ; Fri, 9 Jan 2026 13:33:18 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.208.41 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1767965601; cv=none; b=X2yOA54Kt6BCNtENjo0XVyb8rCHb69/bnMqoFxxZQJXM2RSMSg/22JtYURT5goEz9/NVgXny4s9eLfUk4KKTfyUYYeMAcSI5lI2FNOyDXFr9pYMXh+D2xKPrg1Ct1IAh90cy9zGzYJgmTpy0z6sqxjV2H/dnLXxd0fXybCJaN2A= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1767965601; c=relaxed/simple; bh=ME5RwYYeXnXNLbQ/Bkl/rSm+h34n/XlbEQhFl4FNSPc=; h=From:To:Cc:Subject:Date:Message-ID:MIME-Version; b=FBYr3weLmQyJPQWtzE2F3Ohh6AtJ0opG/1Qyyo/Y6WKsOwXTKW8+CypCtNQwE/DuA85jK9/3ORyYcfI+wKC4mA/ybPRFSFUruZulcsN35XKalx4bmFsJKEVdELgpHaVz7v1uNecuaJgsfJV5wCO33ROypt44G3nihjusGpToUTA= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=gmail.com; spf=pass smtp.mailfrom=gmail.com; dkim=pass (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b=dIEfbt8C; arc=none smtp.client-ip=209.85.208.41 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=gmail.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=gmail.com Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b="dIEfbt8C" Received: by mail-ed1-f41.google.com with SMTP id 4fb4d7f45d1cf-64d0d41404cso6486089a12.0 for ; Fri, 09 Jan 2026 05:33:18 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1767965596; x=1768570396; darn=vger.kernel.org; h=content-transfer-encoding:mime-version:message-id:date:subject:cc :to:from:from:to:cc:subject:date:message-id:reply-to; bh=fQi3F9taD578uKhb5WeGr8+LqFJi+cgMU9mJCgFH+JI=; b=dIEfbt8CVBvqEUKnUoyCiHwB2hv3YHrq/CQa5lnGnOJDt0JGx8V31XjSd/PoigmZqB khEFWijQoQrFVFZaoC+nEzQf0l8oEGypsNo9ZyTpNHn0lvJrZI655xQb1Zq0es2Zyx3d 7RYYODu7yglubEWk2LPG+9dn8RBIqk/Az4FwC5GFnj21Phg8F1NUvcltZagWg3I1KO6B GWnuwXIqE21s3Kp7jKATgQ9HyCpPuBqg0Mse65+9JjErmmx7s1lAEi2sN91prg8S9FWj nPAe7wAr/0uKHsXFKbafqJlay5A89fuFK0yv1MWDwAFcEco43Qn/i9Bj81b2FKTD8vxM J/4Q== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1767965596; x=1768570396; h=content-transfer-encoding:mime-version:message-id:date:subject:cc :to:from:x-gm-gg:x-gm-message-state:from:to:cc:subject:date :message-id:reply-to; bh=fQi3F9taD578uKhb5WeGr8+LqFJi+cgMU9mJCgFH+JI=; b=A6IVF7LBpunsYCIAQzLy+h0+kjgtvWnNneCFPJRaAKDt1SfjRe5hzylfGKAonhEICp nsSkEq/gwMpZosCdbGozknF2GL4ZzOuhdvMn8nokKe9X3IYQrDXNEFEV+GvDGz+ySIPJ x/flo5iuWXSQCLUvpcVOrGnux2F5HLud8Is7zbFm8hglZl+Ike4rT5zkVfHjZ3kofd9V aLPyKN7Vajb5ozwyZ9FW4GVqky2oY+ada9vrmlr/Xy7dV1HziRmziYyzXubi4+yNmjPP Mnygl5RbmNayhiUOTPrwiFGMMzChSvPTnTnkJie4Etz+L9mjty7vFQiVcQHwwGj+5r4t u63A== X-Forwarded-Encrypted: i=1; AJvYcCWTexUy5IMZ7jMNms14egzbp/U3Yl/EnYHene1filfQ30uDLXKPgSxk3mGgROPkA3uYt0an+g4B4A==@vger.kernel.org X-Gm-Message-State: AOJu0YxTvGIOux6X30K/YSTG3oxLk+HKiLwGUDpvZmU3Tepv+IIgp8FR QuigMiZVji/S/a2Z0oZ58ejYXtQ7b1p81n6IsYOLIhQ4VrNHdH0pog8PgWre2Q== X-Gm-Gg: AY/fxX4J/DHTnIQ74zxCagi+wj3KdK5HviAJCQ/GdGS1Ck6I9uzaPXuGu79zUmeSSTd ooMFi9b1f5jup4djdy/9eItBTwycQFgnApXe8BA3k3VsvHTIgzOje45vOuOp8SvUhWsxddxk+b7 szWZI0FAQWwJAsT8Ygskh3qhZtnseUmCAWnhR0hj6V/GMdA8N5AOxCOk4zWH4l9qttuge/a2FJy Yw93qBV6a/DreTt/pXVtBZ+a3yIwjMjzknRQVMHa1sZOz0d5LeGt33i+Uk1+2O5zit7Qyv9zQEg hvPeyA7+wReveXr5gXs9vzBJpQN5wMm4EFSq8GfBILf2OcCaGaLMaqIE1Bfg5uaqaK/tKD6tyU8 IATYolDlaAg+ZsVrY27qaR3QOsCMnjK02tNTlzmBqegbU/BPzGi23R2LrD2/Bz34Jq5eTYHRRsg xhjBIEo5zY4JA48U+5s5S5o3nQ8lInee6YV5dHiAUrwb/Mv5SNaxaVUYLnSC7KEWxAFoojpl9bd ew2uRBx X-Google-Smtp-Source: AGHT+IHEu00dTqymHYnWX3YB9+V+M0g0pVj0f65C6gXQXrOLC70cW7LkbUbrjcu3dFBefYWTWksRNQ== X-Received: by 2002:a05:600c:4fc6:b0:477:a21c:2066 with SMTP id 5b1f17b1804b1-47d84b0a902mr94022565e9.5.1767958133975; Fri, 09 Jan 2026 03:28:53 -0800 (PST) Received: from 127.com ([2620:10d:c092:600::1:69b5]) by smtp.gmail.com with ESMTPSA id 5b1f17b1804b1-47d8636c610sm60056985e9.0.2026.01.09.03.28.51 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 09 Jan 2026 03:28:52 -0800 (PST) From: Pavel Begunkov To: netdev@vger.kernel.org Cc: "David S . Miller" , Eric Dumazet , Jakub Kicinski , Paolo Abeni , Jonathan Corbet , Michael Chan , Pavan Chebbi , Andrew Lunn , Alexei Starovoitov , Daniel Borkmann , Jesper Dangaard Brouer , John Fastabend , Joshua Washington , Harshitha Ramamurthy , Saeed Mahameed , Tariq Toukan , Mark Bloch , Leon Romanovsky , Alexander Duyck , Ilias Apalodimas , Shuah Khan , Willem de Bruijn , Ankit Garg , Tim Hostetler , Alok Tiwari , Ziwei Xiao , John Fraker , Praveen Kaligineedi , Mohsin Bashir , Joe Damato , Mina Almasry , Dimitri Daskalakis , Stanislav Fomichev , Kuniyuki Iwashima , Samiullah Khawaja , Ahmed Zaki , Alexander Lobakin , Pavel Begunkov , David Wei , Yue Haibing , Haiyue Wang , Jens Axboe , Simon Horman , Vishwanath Seshagiri , linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org, bpf@vger.kernel.org, linux-rdma@vger.kernel.org, linux-kselftest@vger.kernel.org, dtatulea@nvidia.com, io-uring@vger.kernel.org Subject: [PATCH net-next v8 0/9] Add support for providers with large rx buffer Date: Fri, 9 Jan 2026 11:28:39 +0000 Message-ID: X-Mailer: git-send-email 2.52.0 Precedence: bulk X-Mailing-List: io-uring@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Note: it's net/ only bits and doesn't include changes, which shoulf be merged separately and are posted separately. The full branch for convenience is at [1], and the patch is here: https://lore.kernel.org/io-uring/7486ab32e99be1f614b3ef8d0e9bc77015b173f7.1764265323.git.asml.silence@gmail.com Many modern NICs support configurable receive buffer lengths, and zcrx and memory providers can use buffers larger than 4K to improve performance. When paired with hw-gro larger rx buffer sizes can drastically reduce the number of buffers traversing the stack and save a lot of processing time. It also allows to give to users larger contiguous chunks of data. The idea was first floated around by Saeed during netdev conf 2024 and was asked about by a few folks. Single stream benchmarks showed up to ~30% CPU util improvement. E.g. comparison for 4K vs 32K buffers using a 200Gbit NIC: packets=23987040 (MB=2745098), rps=199559 (MB/s=22837) CPU %usr %nice %sys %iowait %irq %soft %idle 0 1.53 0.00 27.78 2.72 1.31 66.45 0.22 packets=24078368 (MB=2755550), rps=200319 (MB/s=22924) CPU %usr %nice %sys %iowait %irq %soft %idle 0 0.69 0.00 8.26 31.65 1.83 57.00 0.57 This series adds net infrastructure for memory providers configuring the size and implements it for bnxt. It's an opt-in feature for drivers, they should advertise support for the parameter in the qops and must check if the hardware supports the given size. It's limited to memory providers as it drastically simplifies implementation. It doesn't affect the fast path zcrx uAPI, and the user exposed parameter is defined in zcrx terms, which allows it to be flexible and adjusted in the future. A liburing example can be found at [2] full branch: [1] https://github.com/isilence/linux.git zcrx/large-buffers-v8 Liburing example: [2] https://github.com/isilence/liburing.git zcrx/rx-buf-len --- The following changes since commit 9ace4753a5202b02191d54e9fdf7f9e3d02b85eb: Linux 6.19-rc4 (2026-01-04 14:41:55 -0800) are available in the Git repository at: https://github.com/isilence/linux.git tags/net-queue-rx-buf-len-v8 for you to fetch changes up to 37f5abe6929963fc6086777056b59ecb034d0e19: io_uring/zcrx: document area chunking parameter (2026-01-08 11:35:20 +0000) v8: - Add stripped down qcfg - Retain the page size across resets for bnxt v7: - Add xa_destroy - Rebase v6: - Update docs and add a selftest v5: https://lore.kernel.org/netdev/cover.1760440268.git.asml.silence@gmail.com/ - Remove all unnecessary bits like configuration via netlink, and multi-stage queue configuration. v4: https://lore.kernel.org/all/cover.1760364551.git.asml.silence@gmail.com/ - Update fbnic qops - Propagate max buf len for hns3 - Use configured buf size in __bnxt_alloc_rx_netmem - Minor stylistic changes v3: https://lore.kernel.org/all/cover.1755499375.git.asml.silence@gmail.com/ - Rebased, excluded zcrx specific patches - Set agg_size_fac to 1 on warning v2: https://lore.kernel.org/all/cover.1754657711.git.asml.silence@gmail.com/ - Add MAX_PAGE_ORDER check on pp init - Applied comments rewording - Adjust pp.max_len based on order - Patch up mlx5 queue callbacks after rebase - Minor ->queue_mgmt_ops refactoring - Rebased to account for both fill level and agg_size_fac - Pass providers buf length in struct pp_memory_provider_params and apply it in __netdev_queue_confi(). - Use ->supported_ring_params to validate drivers support of set qcfg parameters. Jakub Kicinski (2): net: reduce indent of struct netdev_queue_mgmt_ops members eth: bnxt: adjust the fill level of agg queues with larger buffers Pavel Begunkov (7): net: memzero mp params when closing a queue net: add bare bone queue configs net: pass queue rx page size from memory provider eth: bnxt: store rx buffer size per queue eth: bnxt: support qcfg provided rx page size selftests: iou-zcrx: test large chunk sizes io_uring/zcrx: document area chunking parameter Documentation/networking/iou-zcrx.rst | 20 +++ drivers/net/ethernet/broadcom/bnxt/bnxt.c | 126 ++++++++++++++---- drivers/net/ethernet/broadcom/bnxt/bnxt.h | 2 + drivers/net/ethernet/broadcom/bnxt/bnxt_xdp.c | 6 +- drivers/net/ethernet/broadcom/bnxt/bnxt_xdp.h | 2 +- drivers/net/ethernet/google/gve/gve_main.c | 9 +- .../net/ethernet/mellanox/mlx5/core/en_main.c | 10 +- drivers/net/ethernet/meta/fbnic/fbnic_txrx.c | 8 +- drivers/net/netdevsim/netdev.c | 7 +- include/net/netdev_queues.h | 47 +++++-- include/net/netdev_rx_queue.h | 2 + include/net/page_pool/types.h | 1 + net/core/dev.c | 17 +++ net/core/netdev_rx_queue.c | 31 +++-- .../selftests/drivers/net/hw/iou-zcrx.c | 72 ++++++++-- .../selftests/drivers/net/hw/iou-zcrx.py | 37 +++++ 16 files changed, 318 insertions(+), 79 deletions(-) -- 2.52.0