From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on gnuweeb.org X-Spam-Level: X-Spam-Status: No, score=-1.8 required=5.0 tests=ALL_TRUSTED,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,NICE_REPLY_A,NO_DNS_FOR_FROM, URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.6 Received: from [192.168.1.2] (unknown [101.128.125.123]) by gnuweeb.org (Postfix) with ESMTPSA id 844DA804D1; Wed, 19 Oct 2022 17:51:23 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gnuweeb.org; s=default; t=1666201884; bh=kvVRMnjUOptLM9sXv0BhMduiJjZDluePFetJIdAaI/o=; h=Date:Subject:To:Cc:References:From:In-Reply-To:From; b=TYL4jCRHs0KN5XDQwV1E/AM1TvdPXBTMpo2rvYEU7h9oP72A1wqy3IA6wH/nQkQl3 r+/TGECqfUZhNPYGpS5ZompwLluvh00hPNo8xJ/V3x4ux1eUQpTyBjlB9QgEpyAv37 7WlTIjGolTXkwmwmpp2FE/uaq6lyZQ9+j/GxaGIo2q5v7Kg/jQzDGIYA7Ns3LrRBJg 7hbmh7rH3SkIEMOY5dEEkIKo0w7UOsczKWr8uMpG02tom3crPLeKpkwAWFVMOAOVAn W709JXizvZLyEHkmvfFFpeE1+PPKQxkRu8A2eBlBrBsikFoDCQtGRXoTZ6htSMfZ0V 1s5iYqLEZ9jBA== Message-ID: <31408c83-e9d3-3fcb-c109-b4b93df104d8@gnuweeb.org> Date: Thu, 20 Oct 2022 00:51:20 +0700 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101 Thunderbird/102.3.3 Subject: Re: [PATCH v1 4/7] atom: Improve fix_utf8_char() To: Ammar Faizi Cc: Alviro Iskandar Setiawan , GNU/Weeb Mailing List References: <20221018081635.1617-1-kiizuha@gnuweeb.org> <20221018081635.1617-5-kiizuha@gnuweeb.org> <14303851-8483-0737-8edc-649ee121f0ee@gnuweeb.org> <785390ce-e3ed-f1eb-dec6-a383563d139b@gnuweeb.org> <76df4a7f-dafc-c167-3b86-ae9dd6d0a233@gnuweeb.org> Content-Language: en-US From: Muhammad Rizki In-Reply-To: <76df4a7f-dafc-c167-3b86-ae9dd6d0a233@gnuweeb.org> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit List-Id: On 20/10/2022 00.42, Ammar Faizi wrote: > On 10/20/22 12:35 AM, Muhammad Rizki wrote: >> Ugh, hate when I should digging up the chat to give a prove. So, you >> want the re.sub() to be remove or no? > > What I want is: decode the email *properly*, then send it > to Telegram intact. > > That being said, if you have a string ">" in the email > decoded email, it should be still ">" when it is sent > to Telegram. If you have a string ">" in the decode email, > it should be still ">" when it is sent to Telegram. And > so on so forth... > > But what you do here is removing all HTML special chars > after unescape() it twice. I also don't understand why > unescape() should be called twice and nested like that. > > Make me understand why it is necessary doing that is > your job as a submitter. > I know. I forgot where the conversation started, but you said this https://discord.com/channels/845302963739033611/845302963739033613/1028563014418444348 "Just send an empty email that would be fine." Sorry, I don't understand your statement about it. I thought you were asked me that the HTML contain tags should be empty or what. Please explain.