From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server-vie001.gnuweeb.org X-Spam-Level: X-Spam-Status: No, score=-1.2 required=5.0 tests=ALL_TRUSTED,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,URIBL_ZEN_BLOCKED_OPENDNS autolearn=ham autolearn_force=no version=3.4.6 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gnuweeb.org; s=default; t=1725550978; bh=f/L1vJBQcrayJOQ2JHYkGIgWrGSNwvUW1OTxLAbYxxQ=; h=From:To:Cc:Subject:Message-ID:MIME-Version:Content-Type: Content-Transfer-Encoding:From; b=QVmIezZxp0brODB4Wk0QOGqKQGqpzt6R7vodKy0vVUSLc0/0ghI+XRtWBA0HMW/DG KNS2vvptNC3mEjMxQ1no4Qgq1qzwW7u2JzTxAmuOsg3AjSu3ELUFwvW8WLfkWdtEne O9eo6vd0sO+tg3RPAv1Q4458FRBtXmFDryis4wap9zwU9/MxM69zQK/cxML8ddLFVJ 2aLFXdrS7+SAIPXHNbpOOHOo1zdQZz6NvpdYwXSOw7Lmwv4oCZUXOo0K+06qwz4W+j XviQYwBCuLG/6X55FhQnfhuorWRYuXb/RXKiCQ/SjbldXP5I6YmU9/b2jUHJesXenR V/HtC3fCCi/qg== Received: from biznet-home.integral.gnuweeb.org (unknown [182.253.126.159]) by server-vie001.gnuweeb.org (Postfix) with ESMTPSA id 16AFD3103E6C; Thu, 5 Sep 2024 15:42:56 +0000 (UTC) Date: Thu, 5 Sep 2024 22:42:54 +0700 From: Ammar Faizi To: Konstantin Ryabitsev , Greg Kroah-Hartman Cc: Linux Kernel Mailing List , GNU/Weeb Mailing List , Alviro Iskandar Setiawan , Ravel Kevin Ethan , Michael William Jonathan Subject: Public Inbox Output Broken Due to Double HTML Encoding Message-ID: MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: 8bit X-Bpl: hUx9VaHkTWcLO7S8CQCslj6OzqBx2hfLChRz45nPESx5VSB/xuJQVOKOB1zSXE3yc9ntP27bV1M1 List-Id: Hi, We have noticed a recent issue with the lore public inbox output where it appears to be affected by double HTML encoding. This problem surfaced after 1 September 2024, as that was the last time we observed the correct output. The issue is particularly visible in email headers, where characters such as <, >, and quotes are incorrectly encoded twice. This results in unreadable and improperly rendered email headers. It also breaks several links in the HTML output. For instance, in this thread: https://lore.kernel.org/qemu-devel/20240905141211.1253307-1-kraxel@redhat.com/ Here's the current, broken output: ``` From: Gerd Hoffmann <kraxel@redhat.com> To: qemu-devel@nongnu.org Cc: "Paolo Bonzini" <pbonzini@redhat.com>, "Yanan Wang" <wangyanan55@huawei.com>, "Zhao Liu" <zhao1.liu@intel.com>, "Eduardo Habkost" <eduardo@habkost.net>, "Richard Henderson" <richard.henderson@linaro.org>, "Marcel Apfelbaum" <marcel.apfelbaum@gmail.com>, "Philippe Mathieu-Daudé" <philmd@linaro.org>, "Michael S. Tsirkin" <mst@redhat.com>, ``` And here's the expected output: ``` From: Gerd Hoffmann To: qemu-devel@nongnu.org Cc: "Paolo Bonzini" , "Yanan Wang" , "Zhao Liu" , "Eduardo Habkost" , "Richard Henderson" , "Marcel Apfelbaum" , "Philippe Mathieu-Daudé" , "Michael S. Tsirkin" , "Gerd Hoffmann" ``` It seems like the encoding process is being applied redundantly, causing special characters like < (for <) and > (for >) to render improperly. This double encoding affects both usability and readability, especially when dealing with email threads that rely heavily on clear headers for communication between contributors. For reference, I have also attached a screenshot of the output, which can be found here: https://gist.github.com/ammarfaizi2/bbb1480822ece70cf2d261e876bd7ba2 It would be great if this could be resolved as soon as possible to restore the proper display of inbox outputs. Thank you for your attention to this matter! -- Ammar Faizi