From: Willem de Bruijn <[email protected]>
To: Breno Leitao <[email protected]>,
Willem de Bruijn <[email protected]>
Cc: Jens Axboe <[email protected]>, David Ahern <[email protected]>,
Willem de Bruijn <[email protected]>,
[email protected], [email protected],
[email protected], [email protected], [email protected],
[email protected], [email protected], [email protected],
[email protected], [email protected],
[email protected], [email protected],
[email protected]
Subject: Re: [PATCH 0/5] add initial io_uring_cmd support for sockets
Date: Thu, 13 Apr 2023 10:24:31 -0400
Message-ID: <[email protected]>
In-Reply-To: <ZDdGl/[email protected]>

Breno Leitao wrote:
> On Wed, Apr 12, 2023 at 10:28:41AM -0400, Willem de Bruijn wrote:
> > Breno Leitao wrote:
> > > On Tue, Apr 11, 2023 at 09:28:29AM -0600, Jens Axboe wrote:
> > > > On 4/11/23 9:24 AM, Willem de Bruijn wrote:
> > > > > Jens Axboe wrote:
> > > > >> On 4/11/23 9:00 AM, Willem de Bruijn wrote:
> > > > >> But that doesn't work, because sock->ops->ioctl() assumes the arg is
> > > > >> memory in userspace. Or do you mean change all of the sock->ops->ioctl()
> > > > >> to pass in on-stack memory (or similar) and have it work with a kernel
> > > > >> address?
> > > > >
> > > > > That was what I suggested indeed.
> > > > >
> > > > > It's about as much code change as this patch series. But it avoids
> > > > > the code duplication.
> > > >
> > > > Breno, want to tackle that as a prep patch first? Should make the
> > > > functional changes afterwards much more straightforward, and will allow
> > > > support for anything really.
> > >
> > > Absolutely. I just want to make sure that I got the proper approach that
> > > we agreed here.
> > >
> > > Let me explain what I understood taking TCP as an example:
> > >
> > > 1) Rename tcp_ioctl() to something like _tcp_ioctl() where the 'arg'
> > > argument is now just kernel memory (located in the stack frame of the
> > > caller).
> > >
> > > 2) Recreate tcp_ioctl() so that it allocates an 'arg' on the stack and
> > > calls _tcp_ioctl() with it. At the end of the new tcp_ioctl(), call
> > > `put_user(in_kernel_arg, userspace_arg)`.
> > >
> > > 3) Repeat it for the 20 protocols that implement ioctl:
> > >
> > > ag "struct proto .* = {" -A 20 net/ | grep \.ioctl
> > > net/dccp/ipv6.c .ioctl = dccp_ioctl,
> > > net/dccp/ipv4.c .ioctl = dccp_ioctl,
> > > net/ieee802154/socket.c .ioctl = dgram_ioctl,
> > > net/ipv4/udplite.c .ioctl = udp_ioctl,
> > > net/ipv4/raw.c .ioctl = raw_ioctl,
> > > net/ipv4/udp.c .ioctl = udp_ioctl,
> > > net/ipv4/tcp_ipv4.c .ioctl = tcp_ioctl,
> > > net/ipv6/raw.c .ioctl = rawv6_ioctl,
> > > net/ipv6/tcp_ipv6.c .ioctl = tcp_ioctl,
> > > net/ipv6/udp.c .ioctl = udp_ioctl,
> > > net/ipv6/udplite.c .ioctl = udp_ioctl,
> > > net/l2tp/l2tp_ip6.c .ioctl = l2tp_ioctl,
> > > net/l2tp/l2tp_ip.c .ioctl = l2tp_ioctl,
> > > net/phonet/datagram.c .ioctl = pn_ioctl,
> > > net/phonet/pep.c .ioctl = pep_ioctl,
> > > net/rds/af_rds.c .ioctl = rds_ioctl,
> > > net/sctp/socket.c .ioctl = sctp_ioctl,
> > > net/sctp/socket.c .ioctl = sctp_ioctl,
> > > net/xdp/xsk.c .ioctl = sock_no_ioctl,
> > > net/mptcp/protocol.c .ioctl = mptcp_ioctl,
> > >
> > > Am I missing something?
> >
> > The suggestion is to convert all to take kernel memory and do the
> > put_cmsg in the caller of .ioctl. Rather than create a wrapper for
> > each individual instance and add a separate .iouring_cmd for each.
> >
> > "change all of the sock->ops->ioctl() to pass in on-stack memory
> > (or similar) and have it work with a kernel address"
>
> Is it possible to do this for cases where we don't know the size
> of the buffer?
>
> For instance the raw_ioctl()/rawv6_ioctl() case. The "arg" argument is
> used in different ways (one for input and one for output):
>
> 1) If cmd == SIOCOUTQ or SIOCINQ, then the return value will be
> returned to userspace:
> put_user(amount, (int __user *)arg)
>
> 2) For the default cmd, ipmr_ioctl() is called, which reads from the `arg`
> parameter:
> copy_from_user(&vr, arg, sizeof(vr))
>
> How do we handle these contradictory behaviours ahead of time (in the
> caller, where the buffers will be prepared)?
>
> Thank you!

Ah, you found a counter-example to the simple pattern of put_user.
The answer perhaps depends on how many such counter-examples you
encounter in the list you gave. If this is the only one, exceptions
in the wrapper are reasonable. Not if there are many.

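To make "exceptions in the wrapper" concrete, here is a rough userspace
sketch, not kernel code. All names (demo_*, DEMO_*) are hypothetical, and
memcpy() stands in for copy_from_user()/copy_to_user(). The protocol
handler only sees kernel memory; the single caller-side wrapper does the
common "write an int back" copy and special-cases the one command that
also reads a struct from the caller:

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical command numbers, for illustration only. */
enum { DEMO_OUTQ = 1, DEMO_INQ = 2, DEMO_GETVIF = 3 };

/* Stand-in for a request struct that is both read and written. */
struct demo_vif_req {
	int vifi;		/* input: which interface to query */
	unsigned long icount;	/* output: filled in by the handler */
};

/* Stand-ins for copy_from_user()/copy_to_user(): plain memcpy here. */
static int demo_copy_in(void *dst, const void *usrc, size_t len)
{
	memcpy(dst, usrc, len);
	return 0;
}

static int demo_copy_out(void *udst, const void *src, size_t len)
{
	memcpy(udst, src, len);
	return 0;
}

/* Protocol handler: only ever touches kernel memory. */
static int demo_proto_ioctl(int cmd, void *karg)
{
	assert(karg != NULL);

	switch (cmd) {
	case DEMO_OUTQ:
		*(int *)karg = 128;	/* made-up queue length */
		return 0;
	case DEMO_INQ:
		*(int *)karg = 64;	/* made-up queue length */
		return 0;
	case DEMO_GETVIF: {
		struct demo_vif_req *vr = karg;

		vr->icount = 1000UL + (unsigned long)vr->vifi;
		return 0;
	}
	default:
		return -ENOTTY;
	}
}

/* Caller-side wrapper: the only place that touches "user" memory. */
static int demo_sock_ioctl(int cmd, void *uarg)
{
	int karg = 0;
	int ret;

	if (cmd == DEMO_GETVIF) {
		/* Exception: this command reads its argument first. */
		struct demo_vif_req vr;

		if (demo_copy_in(&vr, uarg, sizeof(vr)))
			return -EFAULT;
		ret = demo_proto_ioctl(cmd, &vr);
		if (ret)
			return ret;
		return demo_copy_out(uarg, &vr, sizeof(vr));
	}

	/* Common case: the handler produces an int, copy it out. */
	ret = demo_proto_ioctl(cmd, &karg);
	if (ret)
		return ret;
	return demo_copy_out(uarg, &karg, sizeof(karg));
}
```

With one such exception the wrapper stays readable. Each further
exception adds another branch like the DEMO_GETVIF one, which is why the
count of counter-examples matters.
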
Is the intent for io_uring to support all cases eventually? The
current patch series only targeted more common fast path operations.

Probably also relevant is whether/how the approach can be extended
to [gs]etsockopt, as that was another example given, with the same
challenge.
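
For reference, the size problem is already visible in the userspace
getsockopt() API: optlen is a value-result argument, so a generic caller
has to read the length, size a buffer, and then write back both the value
and the updated length. A minimal userspace illustration follows;
demo_query_type() is a made-up helper, and SO_TYPE is chosen only because
its result is predictable:

```c
#include <assert.h>
#include <sys/socket.h>
#include <unistd.h>

/*
 * Query SO_TYPE on a fresh datagram socket.
 * Returns 0 on success, -1 on failure.
 */
static int demo_query_type(int *type, socklen_t *len)
{
	int fd, ret;

	assert(type != NULL && len != NULL);

	fd = socket(AF_UNIX, SOCK_DGRAM, 0);
	if (fd < 0)
		return -1;

	/* In: capacity of the buffer. Out: bytes actually written. */
	*len = sizeof(*type);
	ret = getsockopt(fd, SOL_SOCKET, SO_TYPE, type, len);
	close(fd);
	return ret;
}
```

Here both optval and optlen are written by the kernel, and optlen is also
read, so the direction and size are not uniform across options either.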