* [PATCH v1 0/2] Improve some utility codes
@ 2022-08-03 10:27 Muhammad Rizki
2022-08-03 10:27 ` [PATCH v1 1/2] Improve code for bottom border Muhammad Rizki
` (3 more replies)
0 siblings, 4 replies; 8+ messages in thread
From: Muhammad Rizki @ 2022-08-03 10:27 UTC (permalink / raw)
To: Ammar Faizi
Cc: Muhammad Rizki, GNU/Weeb Mailing List, Alviro Iskandar Setiawan
Afternoon guys,
In this series, I'm improving some utility codes in scraper/utils.py,
like multiple dash `-` just like "-"*72, I think it's more practice than
before. I fix a bug too while receiving email payload like `<text>`,
`®`, and more using double html.escape().
There are 2 patches in this series:
- Patch 1 is to improve bottom border string template.
- Patch 2 is to use double html.escape() for fix_utf8_chars()
This should be works fine and tested too, thanks.
Muhammad Rizki (2):
Improve code for bottom border
Use html.escape() for fix_utf8_chars()
daemon/scraper/utils.py | 11 ++++-------
1 file changed, 4 insertions(+), 7 deletions(-)
base-commit: f42828bf0aa6fe23f1f5257c4a2483cb8798e89a
--
Muhammad Rizki
^ permalink raw reply [flat|nested] 8+ messages in thread
* [PATCH v1 1/2] Improve code for bottom border
2022-08-03 10:27 [PATCH v1 0/2] Improve some utility codes Muhammad Rizki
@ 2022-08-03 10:27 ` Muhammad Rizki
2022-08-03 10:27 ` [PATCH v1 2/2] Use html.escape() for fix_utf8_chars() Muhammad Rizki
` (2 subsequent siblings)
3 siblings, 0 replies; 8+ messages in thread
From: Muhammad Rizki @ 2022-08-03 10:27 UTC (permalink / raw)
To: Ammar Faizi
Cc: Muhammad Rizki, GNU/Weeb Mailing List, Alviro Iskandar Setiawan
Just improve code for creating bottom border for create_template() in
scraper/utils.py
Signed-off-by: Muhammad Rizki <[email protected]>
---
daemon/scraper/utils.py | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/daemon/scraper/utils.py b/daemon/scraper/utils.py
index e89255b..765468c 100644
--- a/daemon/scraper/utils.py
+++ b/daemon/scraper/utils.py
@@ -185,7 +185,7 @@ def create_template(thread: Message, to=None, cc=None):
ret = ret[:4000] + "..."
ret = fix_utf8_char(ret)
- ret += "\n<code>------------------------------------------------------------------------</code>"
+ ret += f"\n<code>{'-'*72}</code>"
return ret, files, is_patch
--
Muhammad Rizki
^ permalink raw reply related [flat|nested] 8+ messages in thread
* [PATCH v1 2/2] Use html.escape() for fix_utf8_chars()
2022-08-03 10:27 [PATCH v1 0/2] Improve some utility codes Muhammad Rizki
2022-08-03 10:27 ` [PATCH v1 1/2] Improve code for bottom border Muhammad Rizki
@ 2022-08-03 10:27 ` Muhammad Rizki
2022-08-03 12:49 ` Ammar Faizi
2022-08-03 12:51 ` (subset) [PATCH v1 0/2] Improve some utility codes Ammar Faizi
2022-08-03 13:07 ` Ammar Faizi
3 siblings, 1 reply; 8+ messages in thread
From: Muhammad Rizki @ 2022-08-03 10:27 UTC (permalink / raw)
To: Ammar Faizi
Cc: Muhammad Rizki, GNU/Weeb Mailing List, Alviro Iskandar Setiawan
We found a bug when receiving email payload like `<text>`, `®`, and
more. Using double html.escape() will fix the bug for now.
Signed-off-by: Muhammad Rizki <[email protected]>
Reported-by: Alviro Iskandar Setiawan <[email protected]>
---
daemon/scraper/utils.py | 9 +++------
1 file changed, 3 insertions(+), 6 deletions(-)
diff --git a/daemon/scraper/utils.py b/daemon/scraper/utils.py
index 765468c..c428a33 100644
--- a/daemon/scraper/utils.py
+++ b/daemon/scraper/utils.py
@@ -14,6 +14,7 @@ import os
import re
import shutil
import httpx
+import html
def get_email_msg_id(mail):
@@ -218,12 +219,8 @@ def clean_up_after_send_patch(tmp):
def fix_utf8_char(text: str):
- return (
- text.rstrip()
- .replace("<", "<")
- .replace(">",">")
- .replace("�"," ")
- )
+ text = text.rstrip().replace("�"," ")
+ return html.escape(html.escape(text))
EMAIL_MSG_ID_PATTERN = r"<([^\<\>]+)>"
--
Muhammad Rizki
^ permalink raw reply related [flat|nested] 8+ messages in thread
* Re: [PATCH v1 2/2] Use html.escape() for fix_utf8_chars()
2022-08-03 10:27 ` [PATCH v1 2/2] Use html.escape() for fix_utf8_chars() Muhammad Rizki
@ 2022-08-03 12:49 ` Ammar Faizi
2022-08-03 12:53 ` Ammar Faizi
2022-08-03 13:01 ` Muhammad Rizki
0 siblings, 2 replies; 8+ messages in thread
From: Ammar Faizi @ 2022-08-03 12:49 UTC (permalink / raw)
To: Muhammad Rizki; +Cc: GNU/Weeb Mailing List, Alviro Iskandar Setiawan
On 8/3/22 5:27 PM, Muhammad Rizki wrote:
> return html.escape(html.escape(text))
I doubt about this double escape. Just to clear my doubt. Did you test
this?
I didn't, but you said previously that "<" is shown as ">" which is
totally garbage if that's the actual output. Rizki?
--
Ammar Faizi
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: (subset) [PATCH v1 0/2] Improve some utility codes
2022-08-03 10:27 [PATCH v1 0/2] Improve some utility codes Muhammad Rizki
2022-08-03 10:27 ` [PATCH v1 1/2] Improve code for bottom border Muhammad Rizki
2022-08-03 10:27 ` [PATCH v1 2/2] Use html.escape() for fix_utf8_chars() Muhammad Rizki
@ 2022-08-03 12:51 ` Ammar Faizi
2022-08-03 13:07 ` Ammar Faizi
3 siblings, 0 replies; 8+ messages in thread
From: Ammar Faizi @ 2022-08-03 12:51 UTC (permalink / raw)
To: Muhammad Rizki
Cc: Ammar Faizi, Alviro Iskandar Setiawan, GNU/Weeb Mailing List
On Wed, 3 Aug 2022 17:27:16 +0700, Muhammad Rizki wrote:
> Afternoon guys,
>
> In this series, I'm improving some utility codes in scraper/utils.py,
> like multiple dash `-` just like "-"*72, I think it's more practice than
> before. I fix a bug too while receiving email payload like `<text>`,
> `®`, and more using double html.escape().
>
> [...]
Applied, thanks!
[1/2] Improve code for bottom border
commit: 09960d39aefedd07678bdadb6df26f79c9854c69
Best regards,
--
Ammar Faizi
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [PATCH v1 2/2] Use html.escape() for fix_utf8_chars()
2022-08-03 12:49 ` Ammar Faizi
@ 2022-08-03 12:53 ` Ammar Faizi
2022-08-03 13:01 ` Muhammad Rizki
1 sibling, 0 replies; 8+ messages in thread
From: Ammar Faizi @ 2022-08-03 12:53 UTC (permalink / raw)
To: Muhammad Rizki; +Cc: GNU/Weeb Mailing List, Alviro Iskandar Setiawan
On 8/3/22 7:49 PM, Ammar Faizi wrote:
> On 8/3/22 5:27 PM, Muhammad Rizki wrote:
>> return html.escape(html.escape(text))
>
> I doubt about this double escape. Just to clear my doubt. Did you test
> this?
>
> I didn't, but you said previously that "<" is shown as ">" which is
> totally garbage if that's the actual output. Rizki?
I mean ">" is shown as ">".
--
Ammar Faizi
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [PATCH v1 2/2] Use html.escape() for fix_utf8_chars()
2022-08-03 12:49 ` Ammar Faizi
2022-08-03 12:53 ` Ammar Faizi
@ 2022-08-03 13:01 ` Muhammad Rizki
1 sibling, 0 replies; 8+ messages in thread
From: Muhammad Rizki @ 2022-08-03 13:01 UTC (permalink / raw)
To: Ammar Faizi; +Cc: GNU/Weeb Mailing List, Alviro Iskandar Setiawan
On 03/08/2022 19.49, Ammar Faizi wrote:
> On 8/3/22 5:27 PM, Muhammad Rizki wrote:
>> return html.escape(html.escape(text))
>
> I doubt about this double escape. Just to clear my doubt. Did you test
> this?
Tested and work as my expected.
>
> I didn't, but you said previously that "<" is shown as ">" which is
> totally garbage if that's the actual output. Rizki?
>
Yes, with double html.escape() it can replace the previous code (with
replace())
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [PATCH v1 0/2] Improve some utility codes
2022-08-03 10:27 [PATCH v1 0/2] Improve some utility codes Muhammad Rizki
` (2 preceding siblings ...)
2022-08-03 12:51 ` (subset) [PATCH v1 0/2] Improve some utility codes Ammar Faizi
@ 2022-08-03 13:07 ` Ammar Faizi
3 siblings, 0 replies; 8+ messages in thread
From: Ammar Faizi @ 2022-08-03 13:07 UTC (permalink / raw)
To: Muhammad Rizki
Cc: Ammar Faizi, GNU/Weeb Mailing List, Alviro Iskandar Setiawan
On Wed, 3 Aug 2022 17:27:16 +0700, Muhammad Rizki wrote:
> Afternoon guys,
>
> In this series, I'm improving some utility codes in scraper/utils.py,
> like multiple dash `-` just like "-"*72, I think it's more practice than
> before. I fix a bug too while receiving email payload like `<text>`,
> `®`, and more using double html.escape().
>
> [...]
Applied, thanks!
[1/2] Improve code for bottom border
commit: 09960d39aefedd07678bdadb6df26f79c9854c69
[2/2] Use html.escape() for fix_utf8_chars()
commit: a6280937ce9623e961bb66a76fad2c2f5cd10244
Best regards,
--
Ammar Faizi
^ permalink raw reply [flat|nested] 8+ messages in thread
end of thread, other threads:[~2022-08-03 13:07 UTC | newest]
Thread overview: 8+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2022-08-03 10:27 [PATCH v1 0/2] Improve some utility codes Muhammad Rizki
2022-08-03 10:27 ` [PATCH v1 1/2] Improve code for bottom border Muhammad Rizki
2022-08-03 10:27 ` [PATCH v1 2/2] Use html.escape() for fix_utf8_chars() Muhammad Rizki
2022-08-03 12:49 ` Ammar Faizi
2022-08-03 12:53 ` Ammar Faizi
2022-08-03 13:01 ` Muhammad Rizki
2022-08-03 12:51 ` (subset) [PATCH v1 0/2] Improve some utility codes Ammar Faizi
2022-08-03 13:07 ` Ammar Faizi
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox