From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <ammarfaizi2@gnuweeb.org>
X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on
	server-vie001.gnuweeb.org
X-Spam-Level: 
X-Spam-Status: No, score=-1.2 required=5.0 tests=ALL_TRUSTED,DKIM_SIGNED,
	DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,URIBL_ZEN_BLOCKED_OPENDNS
	autolearn=ham autolearn_force=no version=3.4.6
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gnuweeb.org;
	s=default; t=1725550978;
	bh=f/L1vJBQcrayJOQ2JHYkGIgWrGSNwvUW1OTxLAbYxxQ=;
	h=From:To:Cc:Subject:Message-ID:MIME-Version:Content-Type:
	 Content-Transfer-Encoding:From;
	b=QVmIezZxp0brODB4Wk0QOGqKQGqpzt6R7vodKy0vVUSLc0/0ghI+XRtWBA0HMW/DG
	 KNS2vvptNC3mEjMxQ1no4Qgq1qzwW7u2JzTxAmuOsg3AjSu3ELUFwvW8WLfkWdtEne
	 O9eo6vd0sO+tg3RPAv1Q4458FRBtXmFDryis4wap9zwU9/MxM69zQK/cxML8ddLFVJ
	 2aLFXdrS7+SAIPXHNbpOOHOo1zdQZz6NvpdYwXSOw7Lmwv4oCZUXOo0K+06qwz4W+j
	 XviQYwBCuLG/6X55FhQnfhuorWRYuXb/RXKiCQ/SjbldXP5I6YmU9/b2jUHJesXenR
	 V/HtC3fCCi/qg==
Received: from biznet-home.integral.gnuweeb.org (unknown [182.253.126.159])
	by server-vie001.gnuweeb.org (Postfix) with ESMTPSA id 16AFD3103E6C;
	Thu,  5 Sep 2024 15:42:56 +0000 (UTC)
Date: Thu, 5 Sep 2024 22:42:54 +0700
From: Ammar Faizi <ammarfaizi2@gnuweeb.org>
To: Konstantin Ryabitsev <konstantin@linuxfoundation.org>,
	Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Cc: Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
	GNU/Weeb Mailing List <gwml@gnuweeb.org>,
	Alviro Iskandar Setiawan <alviro.iskandar@gnuweeb.org>,
	Ravel Kevin Ethan <silverwolf@gnuweeb.org>,
	Michael William Jonathan <moe@gnuweeb.org>
Subject: Public Inbox Output Broken Due to Double HTML Encoding
Message-ID: <ZtnRfnHrxVeEwTwx@biznet-home.integral.gnuweeb.org>
MIME-Version: 1.0
Content-Type: text/plain; charset=iso-8859-1
Content-Disposition: inline
Content-Transfer-Encoding: 8bit
X-Bpl: hUx9VaHkTWcLO7S8CQCslj6OzqBx2hfLChRz45nPESx5VSB/xuJQVOKOB1zSXE3yc9ntP27bV1M1
List-Id: <gwml.vger.gnuweeb.org>


Hi,

We have noticed a recent issue with the lore public inbox output where
it appears to be affected by double HTML encoding. This problem
surfaced after 1 September 2024, as that was the last time we observed
the correct output.

The issue is particularly visible in email headers, where characters
such as <, >, and quotes are incorrectly encoded twice. This results in
unreadable and improperly rendered email headers. It also breaks
several links in the HTML output.

For instance, in this thread:

    https://lore.kernel.org/qemu-devel/20240905141211.1253307-1-kraxel@redhat.com/

Here's the current, broken output:
```
From: Gerd Hoffmann &lt;kraxel@redhat.com&gt;
To: qemu-devel@nongnu.org
Cc: &#34;Paolo Bonzini&#34; &lt;pbonzini@redhat.com&gt;,
	&#34;Yanan Wang&#34; &lt;wangyanan55@huawei.com&gt;,
	&#34;Zhao Liu&#34; &lt;zhao1.liu@intel.com&gt;,
	&#34;Eduardo Habkost&#34; &lt;eduardo@habkost.net&gt;,
	&#34;Richard Henderson&#34; &lt;richard.henderson@linaro.org&gt;,
	&#34;Marcel Apfelbaum&#34; &lt;marcel.apfelbaum@gmail.com&gt;,
	&#34;Philippe Mathieu-Daud&#233;&#34; &lt;philmd@linaro.org&gt;,
	&#34;Michael S. Tsirkin&#34; &lt;mst@redhat.com&gt;,
```

And here's the expected output:
```
From: Gerd Hoffmann <kraxel@redhat.com>
To: qemu-devel@nongnu.org
Cc: "Paolo Bonzini" <pbonzini@redhat.com>,
	"Yanan Wang" <wangyanan55@huawei.com>,
	"Zhao Liu" <zhao1.liu@intel.com>,
	"Eduardo Habkost" <eduardo@habkost.net>,
	"Richard Henderson" <richard.henderson@linaro.org>,
	"Marcel Apfelbaum" <marcel.apfelbaum@gmail.com>,
	"Philippe Mathieu-Daudé" <philmd@linaro.org>,
	"Michael S. Tsirkin" <mst@redhat.com>,
	"Gerd Hoffmann" <kraxel@redhat.com>
```

It seems like the encoding process is being applied redundantly, causing
special characters like &lt; (for <) and &gt; (for >) to render
improperly. This double encoding affects both usability and
readability, especially when dealing with email threads that rely
heavily on clear headers for communication between contributors.

For reference, I have also attached a screenshot of the output, which
can be found here:

    https://gist.github.com/ammarfaizi2/bbb1480822ece70cf2d261e876bd7ba2

It would be great if this could be resolved as soon as possible to
restore the proper display of inbox outputs.

Thank you for your attention to this matter!

-- 
Ammar Faizi