From mboxrd@z Thu Jan  1 00:00:00 1970
Received: from yourcmc.ru (yourcmc.ru [195.209.40.11])
	(using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
	(No client certificate requested)
	by smtp.subspace.kernel.org (Postfix) with ESMTPS id 7A353269823
	for <io-uring@vger.kernel.org>; Tue,  8 Apr 2025 12:43:25 +0000 (UTC)
Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=195.209.40.11
ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116;
	t=1744116210; cv=none; b=ZLlGwKU+HQe0bOM7KdhgLjAV4P+PkL4jrsTiTa33Z5LtaaLnYKyXCfgn+kO8NZGhiHmV47XgZKUHW8voPYeSRI3HCA7Oa+ZgkiWUj3cYrXLrT65rYDAXxRUMyHWBhdCF0cT31XxZyWVvHZeuf3IZQHBV3sHUdH8FIKvH9oHtMhY=
ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org;
	s=arc-20240116; t=1744116210; c=relaxed/simple;
	bh=AYeTwKiofssUbed/G9VMxPpLD5FWMzOZlazbpXvmpUQ=;
	h=MIME-Version:Date:Content-Type:From:Message-ID:Subject:To:
	 In-Reply-To:References; b=BSaad42w6G8jxtdlD8sArhNAtn7AUmY2vMREkQ/rWMf08RO9PbeheQOXILMnBvnigflcyh11lHZyhOTaDLut8QAuSbE1l+SUV72PC85zZn6vVB5aCm0bqRAv9qrVInfQX6L04dc5FjpnNxz5IiCf3Q/ZFv7EMdHDqjdoAPpk5e8=
ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=yourcmc.ru; spf=pass smtp.mailfrom=yourcmc.ru; dkim=pass (2048-bit key) header.d=yourcmc.ru header.i=@yourcmc.ru header.b=fbFPQvGo; arc=none smtp.client-ip=195.209.40.11
Authentication-Results: smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=yourcmc.ru
Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=yourcmc.ru
Authentication-Results: smtp.subspace.kernel.org;
	dkim=pass (2048-bit key) header.d=yourcmc.ru header.i=@yourcmc.ru header.b="fbFPQvGo"
Received: from yourcmc.ru (localhost [127.0.0.1])
	by yourcmc.ru (Postfix) with ESMTP id 0B9D9FE0667;
	Tue,  8 Apr 2025 15:43:17 +0300 (MSK)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yourcmc.ru; s=mail;
	t=1744116197; bh=W/JlKwAahGfNP22k5CLM+oWZkeBwc+dqrVtjKgD0dRU=;
	h=Date:From:Subject:To:In-Reply-To:References;
	b=fbFPQvGoOqf1OBbYsohEl0zjhLleUOi/kcPgdxWkU9bguUqkv87m9VEIK/w4/GUEn
	 4qRJIZSARRH2bAD67DR1+ny3dBEzUTRyXH+8k3H7fpm9vxd8ftEijsq0pDXYrNhtUK
	 WaR4jGUJ9pWM94xxxj+6WVwB18LW0ix+9TDcSi/dy2YBHq+t1XROVjasdZtd4uCDvB
	 +fuqc7pmVprGTxpa0wHBi8NqK2A5IdrKO2YtwhThN+K61xwy4lFa65jUYVKB161M0T
	 P8nnadD+hAkMzKWnkyjrrjXGwnThW1eumI457ELpnAwfyrbuCtv3iSllGs91PdRoWy
	 VMUclQFguweBw==
Received: from rainloop.yourcmc.ru (yourcmc.ru [195.209.40.11])
	by yourcmc.ru (Postfix) with ESMTPSA id DABDFFE065F;
	Tue,  8 Apr 2025 15:43:16 +0300 (MSK)
Precedence: bulk
X-Mailing-List: io-uring@vger.kernel.org
List-Id: <io-uring.vger.kernel.org>
List-Subscribe: <mailto:io-uring+subscribe@vger.kernel.org>
List-Unsubscribe: <mailto:io-uring+unsubscribe@vger.kernel.org>
MIME-Version: 1.0
Date: Tue, 08 Apr 2025 12:43:16 +0000
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: quoted-printable
X-Mailer: RainLoop/1.14.0
From: vitalif@yourcmc.ru
Message-ID: <61b6b1d6cffae4344254ddaef9be6621@yourcmc.ru>
Subject: Re: io_uring zero-copy send test results
To: "Pavel Begunkov" <asml.silence@gmail.com>, io-uring@vger.kernel.org
In-Reply-To: <d7a31a1e-87bd-4a3b-abbb-f1e26b2a03f8@gmail.com>
References: <d7a31a1e-87bd-4a3b-abbb-f1e26b2a03f8@gmail.com>
 <5ce812ab-29a6-4132-a067-27ea27895940@gmail.com>
 <f1600745ba7b328019558611c1ad7684@yourcmc.ru>
 <f7e03e2c113fbbf45a4910538a9528ef@yourcmc.ru>
X-Virus-Scanned: ClamAV using ClamSMTP

> What kernel version you use? I'm specifically interested whether it has=
:=0A> =0A> 6fe4220912d19 ("io_uring/notif: implement notification stackin=
g")=0A> =0A> That would explain why it's slow even with huge pages.=0A=0A=
It was Linux 6.8.12-4-pve (proxmox), so yeah, it didn't include that comm=
it.=0A=0AWe repeated tests with Linux 6.11 also from proxmox:=0A=0AAMD EP=
YC GENOA 9554 MELLANOX CX-5, iommu=3Dpt, Linux 6.11=0A=0A4096 8192 10000 =
12000 16384 65435=0Azc MB/s 2288 2422 2149 2396 2506 2476=0Azc CPU 90% 67=
% 56% 56% 57% 44%=0Asend MB/s 1685 2033 2389 2343 2281 2415=0Asend CPU 95=
% 87% 49% 48% 62% 38%=0A=0AAMD EPYC GENOA 9554 MELLANOX CX-5, iommu=3Dpt,=
 -l1, Linux 6.11=0A=0A4096 8192 10000 12000 16384 65435=0Azc MB/s 2359 25=
09 2351 2508 2384 2424=0Azc CPU 85% 58% 52% 45% 37% 18%=0Asend MB/s 1503 =
1892 2325 2447 2434 2440=0Asend CPU 99% 96% 50% 49% 57% 37%=0A=0ANow it's=
 nice and quick even without huge pages and even with 4k buffers!=0A=0A> =
That doesn't make sense. Do you see anything odd in the profile?=0A=0ADid=
n't have time to repeat tests with perf on those servers yet, but I can c=
heck dmesg logs. In the default iommu mode, /sys/class/iommu is empty and=
 dmesg includes the following lines:=0A=0ADMAR-IR: IOAPIC id 8 under DRHD=
 base  0x9b7fc000 IOMMU 9=0Aiommu: Default domain type: Translated =0Aiom=
mu: DMA domain TLB invalidation policy: lazy mode =0A=0AWith iommu=3Dpt, =
dmesg has:=0A=0ADMAR-IR: IOAPIC id 8 under DRHD base  0x9b7fc000 IOMMU 9=
=0Aiommu: Default domain type: Passthrough (set via kernel command line)