From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.help Subject: Re: Incorrect rendering of accented characters in HTML e-mail (Gnus) Date: Sat, 10 Oct 2020 17:44:26 +0300 Message-ID: <83tuv23zud.fsf@gnu.org> References: <87362mp5md.fsf@pc-117-162.ovh.com> <83v9fi41ux.fsf@gnu.org> <87tuv287za.fsf@pc-117-162.ovh.com> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="32914"; mail-complaints-to="usenet@ciao.gmane.io" To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane-mx.org@gnu.org Sat Oct 10 16:45:25 2020 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1kRG7f-0008Ov-I1 for geh-help-gnu-emacs@m.gmane-mx.org; Sat, 10 Oct 2020 16:45:23 +0200 Original-Received: from localhost ([::1]:50716 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1kRG7Z-0003dw-G4 for geh-help-gnu-emacs@m.gmane-mx.org; Sat, 10 Oct 2020 10:45:20 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:51648) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1kRG6m-0003dT-Te for help-gnu-emacs@gnu.org; Sat, 10 Oct 2020 10:44:30 -0400 Original-Received: from fencepost.gnu.org ([2001:470:142:3::e]:37727) by eggs.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1kRG6k-0001mT-TC for help-gnu-emacs@gnu.org; Sat, 10 Oct 2020 10:44:27 -0400 Original-Received: from [176.228.60.248] (port=3135 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1kRG6j-0003Tj-QW for help-gnu-emacs@gnu.org; Sat, 10 Oct 2020 10:44:26 -0400 In-Reply-To: <87tuv287za.fsf@pc-117-162.ovh.com> (message from Garjola Dindi on Sat, 10 Oct 2020 16:35:05 +0200) X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane-mx.org@gnu.org Original-Sender: "help-gnu-emacs" Xref: news.gmane.io gmane.emacs.help:124413 Archived-At: > From: Garjola Dindi > Date: Sat, 10 Oct 2020 16:35:05 +0200 > > >> The html part of the e-mails contains > >> > >> ,---- > >> | < #part type=text/plain format="flowed" charset="utf-8" > >> | disposition=inline nofile=yes> > >> `---- > >> > >> so I guess that the html renderer should pick it up. I have tested shr, > >> gnus-w3m and w3m and I always get the same result. > >> > >> I would be grateful if somebody could help me understand what happens. > > > > How does the character appear in the original HTML? > > Thanks for your quick response. > > I don't know if I am inspecting the message correctly, because when I > enter the edit mode, all characters appear OK. Therefore, I am not sure > if I an seeing the original html. Can you use some other tool, like wget or curl, to download the text as it comes from the server? > I have also noticed that the I also have the same issue with non html > e-mails. "Same issue" in what sense? Is just é replaced by i, or does something like that happen with every non-ASCII letter? > For instance, here is what I see in the article buffer: > > ,---- > | \311lodie, qui a rejoint l'\351quipe podcast, me dit que sa soeur, qui a une > | formation th\351\342trale, serait disponible ponctuellement pour faire des > | voix pour des lectures. Pour le moment on a jamais eu ce besoin mais \347a > | peut ouvrir des perspectives. > `---- > > (I have replaced the non printable chars with \xxx) What do you mean by "non printable" here? Do they look like octal escapes or do they look like something else? Btw, the above is not UTF-8 encoding, it's Latin-1 encoding. Does the problem go away if you start Emacs as "emacs -Q"?