public inbox for gwml@vger.gnuweeb.org
 help / color / mirror / Atom feed
* Public Inbox Output Broken Due to Double HTML Encoding
@ 2024-09-05 15:42 Ammar Faizi
  2024-09-05 15:50 ` Konstantin Ryabitsev
  0 siblings, 1 reply; 2+ messages in thread
From: Ammar Faizi @ 2024-09-05 15:42 UTC (permalink / raw)
  To: Konstantin Ryabitsev, Greg Kroah-Hartman
  Cc: Linux Kernel Mailing List, GNU/Weeb Mailing List,
	Alviro Iskandar Setiawan, Ravel Kevin Ethan,
	Michael William Jonathan


Hi,

We have noticed a recent issue with the lore public inbox output where
it appears to be affected by double HTML encoding. This problem
surfaced after 1 September 2024, as that was the last time we observed
the correct output.

The issue is particularly visible in email headers, where characters
such as <, >, and quotes are incorrectly encoded twice. This results in
unreadable and improperly rendered email headers. It also breaks
several links in the HTML output.

For instance, in this thread:

    https://lore.kernel.org/qemu-devel/20240905141211.1253307-1-kraxel@redhat.com/

Here's the current, broken output:
```
From: Gerd Hoffmann &lt;kraxel@redhat.com&gt;
To: qemu-devel@nongnu.org
Cc: &#34;Paolo Bonzini&#34; &lt;pbonzini@redhat.com&gt;,
	&#34;Yanan Wang&#34; &lt;wangyanan55@huawei.com&gt;,
	&#34;Zhao Liu&#34; &lt;zhao1.liu@intel.com&gt;,
	&#34;Eduardo Habkost&#34; &lt;eduardo@habkost.net&gt;,
	&#34;Richard Henderson&#34; &lt;richard.henderson@linaro.org&gt;,
	&#34;Marcel Apfelbaum&#34; &lt;marcel.apfelbaum@gmail.com&gt;,
	&#34;Philippe Mathieu-Daud&#233;&#34; &lt;philmd@linaro.org&gt;,
	&#34;Michael S. Tsirkin&#34; &lt;mst@redhat.com&gt;,
```

And here's the expected output:
```
From: Gerd Hoffmann <kraxel@redhat.com>
To: qemu-devel@nongnu.org
Cc: "Paolo Bonzini" <pbonzini@redhat.com>,
	"Yanan Wang" <wangyanan55@huawei.com>,
	"Zhao Liu" <zhao1.liu@intel.com>,
	"Eduardo Habkost" <eduardo@habkost.net>,
	"Richard Henderson" <richard.henderson@linaro.org>,
	"Marcel Apfelbaum" <marcel.apfelbaum@gmail.com>,
	"Philippe Mathieu-Daudé" <philmd@linaro.org>,
	"Michael S. Tsirkin" <mst@redhat.com>,
	"Gerd Hoffmann" <kraxel@redhat.com>
```

It seems like the encoding process is being applied redundantly, causing
special characters like &lt; (for <) and &gt; (for >) to render
improperly. This double encoding affects both usability and
readability, especially when dealing with email threads that rely
heavily on clear headers for communication between contributors.

For reference, I have also attached a screenshot of the output, which
can be found here:

    https://gist.github.com/ammarfaizi2/bbb1480822ece70cf2d261e876bd7ba2

It would be great if this could be resolved as soon as possible to
restore the proper display of inbox outputs.

Thank you for your attention to this matter!

-- 
Ammar Faizi


^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2024-09-05 15:50 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2024-09-05 15:42 Public Inbox Output Broken Due to Double HTML Encoding Ammar Faizi
2024-09-05 15:50 ` Konstantin Ryabitsev

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox