public inbox for [email protected]
 help / color / mirror / Atom feed
From: Jens Axboe <[email protected]>
To: Jackie Liu <[email protected]>
Cc: Bob Liu <[email protected]>, [email protected]
Subject: Re: [PATCH] io_uring: fix error clear of ->file_table in io_sqe_files_register()
Date: Sun, 10 Nov 2019 20:09:42 -0800	[thread overview]
Message-ID: <[email protected]> (raw)
In-Reply-To: <[email protected]>

On 11/10/19 9:02 PM, Jackie Liu wrote:
> 
> 
>> 2019年11月11日 11:54,Jens Axboe <[email protected]> 写道:
>>
>> On 11/10/19 4:44 PM, Bob Liu wrote:
>>> On 11/10/19 11:46 PM, Jens Axboe wrote:
>>>> syzbot reports that when using failslab and friends, we can get a double
>>>> free in io_sqe_files_unregister():
>>>>
>>>> BUG: KASAN: double-free or invalid-free in
>>>> io_sqe_files_unregister+0x20b/0x300 fs/io_uring.c:3185
>>>>
>>>> CPU: 1 PID: 8819 Comm: syz-executor452 Not tainted 5.4.0-rc6-next-20191108
>>>> Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS
>>>> Google 01/01/2011
>>>> Call Trace:
>>>>    __dump_stack lib/dump_stack.c:77 [inline]
>>>>    dump_stack+0x197/0x210 lib/dump_stack.c:118
>>>>    print_address_description.constprop.0.cold+0xd4/0x30b mm/kasan/report.c:374
>>>>    kasan_report_invalid_free+0x65/0xa0 mm/kasan/report.c:468
>>>>    __kasan_slab_free+0x13a/0x150 mm/kasan/common.c:450
>>>>    kasan_slab_free+0xe/0x10 mm/kasan/common.c:480
>>>>    __cache_free mm/slab.c:3426 [inline]
>>>>    kfree+0x10a/0x2c0 mm/slab.c:3757
>>>>    io_sqe_files_unregister+0x20b/0x300 fs/io_uring.c:3185
>>>>    io_ring_ctx_free fs/io_uring.c:3998 [inline]
>>>>    io_ring_ctx_wait_and_kill+0x348/0x700 fs/io_uring.c:4060
>>>>    io_uring_release+0x42/0x50 fs/io_uring.c:4068
>>>>    __fput+0x2ff/0x890 fs/file_table.c:280
>>>>    ____fput+0x16/0x20 fs/file_table.c:313
>>>>    task_work_run+0x145/0x1c0 kernel/task_work.c:113
>>>>    exit_task_work include/linux/task_work.h:22 [inline]
>>>>    do_exit+0x904/0x2e60 kernel/exit.c:817
>>>>    do_group_exit+0x135/0x360 kernel/exit.c:921
>>>>    __do_sys_exit_group kernel/exit.c:932 [inline]
>>>>    __se_sys_exit_group kernel/exit.c:930 [inline]
>>>>    __x64_sys_exit_group+0x44/0x50 kernel/exit.c:930
>>>>    do_syscall_64+0xfa/0x760 arch/x86/entry/common.c:290
>>>>    entry_SYSCALL_64_after_hwframe+0x49/0xbe
>>>> RIP: 0033:0x43f2c8
>>>> Code: 31 b8 c5 f7 ff ff 48 8b 5c 24 28 48 8b 6c 24 30 4c 8b 64 24 38 4c 8b
>>>> 6c 24 40 4c 8b 74 24 48 4c 8b 7c 24 50 48 83 c4 58 c3 66 <0f> 1f 84 00 00
>>>> 00 00 00 48 8d 35 59 ca 00 00 0f b6 d2 48 89 fb 48
>>>> RSP: 002b:00007ffd5b976008 EFLAGS: 00000246 ORIG_RAX: 00000000000000e7
>>>> RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 000000000043f2c8
>>>> RDX: 0000000000000000 RSI: 000000000000003c RDI: 0000000000000000
>>>> RBP: 00000000004bf0a8 R08: 00000000000000e7 R09: ffffffffffffffd0
>>>> R10: 0000000000000001 R11: 0000000000000246 R12: 0000000000000001
>>>> R13: 00000000006d1180 R14: 0000000000000000 R15: 0000000000000000
>>>>
>>>> This happens if we fail allocating the file tables. For that case we do
>>>> free the file table correctly, but we forget to set it to NULL. This
>>>> means that ring teardown will see it as being non-NULL, and attempt to
>>>> free it again.
>>>>
>>>> Fix this by clearing the file_table pointer if we free the table.
>>>>
>>>> Reported-by: [email protected]
>>>> Fixes: 65e19f54d29c ("io_uring: support for larger fixed file sets")
>>>> Signed-off-by: Jens Axboe <[email protected]>
>>>>
>>>
>>> Reviewed-by: Bob Liu <[email protected]>
>>
>> Thanks, added.
>>
>>> By the way, there are many place(besides io_uring.c) which need to set
>>> pointer to NULL after free.  I saw similar fix from time to time.
>>>
>>> Do you think a safe_free() is worth? e.g
>>> #define SAFE_FREE(p) { if (p) { free(p); (p)=NULL; } }
>>
>> Hmm not sure, and would probably be better as:
>>
>> kfree_safe(&ptr);
>>
>> or something instead. I seem to recall discussions about that ages ago,
>> probably worth while to try and search and see if you can find those. I
>> suspect Linus hates it, reasons not remembered ;-)
>>
> 
> I think this may be a worthwhile solution, but kfree can handle NULL, we can
> set it to NULL directly after free is finished.
> 
> void kfree_safe(const void *ptr)
> {
> 	kfree(ptr);
> 	ptr = NULL;
> }

Sure, but that doesn't change the ptr in the caller, which was my point.
You need to pass in a pointer to the pointer for that, otherwise
clearing it in kfree_safe() is pointless:

void kfree_safe(const void **ptr)
{
	kfree(*ptr);
	*ptr = NULL;
}

and then you run into all sorts of fun, since void ** isn't the same as
'struct foo **'.

-- 
Jens Axboe


  reply	other threads:[~2019-11-11  4:09 UTC|newest]

Thread overview: 6+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2019-11-10 15:46 [PATCH] io_uring: fix error clear of ->file_table in io_sqe_files_register() Jens Axboe
2019-11-10 23:44 ` Bob Liu
2019-11-11  3:54   ` Jens Axboe
2019-11-11  4:02     ` Jackie Liu
2019-11-11  4:09       ` Jens Axboe [this message]
2019-11-11  4:19         ` Jackie Liu

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    [email protected] \
    [email protected] \
    [email protected] \
    [email protected] \
    [email protected] \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox