From: Garjola Dindi <garjola@garjola.net>
To: help-gnu-emacs@gnu.org
Subject: Re: Incorrect rendering of accented characters in HTML e-mail (Gnus)
Date: Sat, 10 Oct 2020 16:35:05 +0200 [thread overview]
Message-ID: <87tuv287za.fsf@pc-117-162.ovh.com> (raw)
In-Reply-To: 83v9fi41ux.fsf@gnu.org
[-- Attachment #1: Type: text/plain, Size: 6032 bytes --]
On Sat 10-Oct-2020 at 16:00:54 +02, Eli Zaretskii <eliz@gnu.org> wrote:
>> From: Garjola Dindi <garjola@garjola.net>
>> Date: Sat, 10 Oct 2020 15:34:02 +0200
>>
>> If I use describe-char to inspect the characters, I get this before
>> «washing»:
>>
>> ,----
>> | position: 470 of 867 (54%), column: 30
>> | character: i (displayed as i) (codepoint 105, #o151, #x69)
>> | charset: ascii (ASCII (ISO646 IRV))
>> | code point in charset: 0x69
>> | script: latin
>> | syntax: w which means: word |
>> | category: .:Base, L:Left-to-right (strong), a:ASCII, l:Latin, r:Roman
>> | to input: type "C-x 8 RET 69" or "C-x 8 RET LATIN SMALL LETTER I"
>> | buffer code: #x69
>> | file code: #x69 (encoded by coding system utf-8-unix)
>> | display: by this font (glyph code)
>> | ftcrhb:-GOOG-Noto Sans-normal-normal-normal-*-19-*-*-*-*-0-iso10646-1 (#x4C
>> |
>> | Character code properties: customize what to show
>> | name: LATIN SMALL LETTER I
>> | general-category: Ll (Letter, Lowercase)
>> | decomposition: (105) ('i')
>> |
>> | There is an overlay here:
>> | From 440 to 520
>> | face hl-line
>> | priority -50
>> | window #<window 141 on *Article nnmaildir+RSSFeeds:ABlog*>
>> |
>> |
>> | There are text properties here:
>> | face variable-pitch
>> `----
>>
>> And this after «washing»
>>
>> ,----
>> | position: 472 of 871 (54%), column: 30
>> | character: é (displayed as é) (codepoint 233, #o351, #xe9)
>> | charset: unicode (Unicode (ISO10646))
>> | code point in charset: 0xE9
>> | script: latin
>> | syntax: w which means: word
>> | category: .:Base, L:Left-to-right (strong), c:Chinese, j:Japanese, l:Latin,
>> | v:Viet
>> | to input: type "C-x 8 RET e9" or "C-x 8 RET LATIN SMALL LETTER E WITH ACUTE"
>> | buffer code: #xC3 #xA9
>> | file code: #xC3 #xA9 (encoded by coding system utf-8-unix)
>> | display: by this font (glyph code)
>> | ftcrhb:-GOOG-Noto Sans-normal-normal-normal-*-19-*-*-*-*-0-iso10646-1 (#xAB)
>> |
>> | Character code properties: customize what to show
>> | name: LATIN SMALL LETTER E WITH ACUTE
>> | old-name: LATIN SMALL LETTER E ACUTE
>> | general-category: Ll (Letter, Lowercase)
>> | decomposition: (101 769) ('e' '́')
>> |
>> | There is an overlay here:
>> | From 442 to 523
>> | face hl-line
>> | priority -50
>> | window #<window 155 on *Article nnmaildir+RSSFeeds:ABlog*>
>> |
>> |
>> | There are text properties here:
>> | face variable-pitch
>> `----
>>
>> The html part of the e-mails contains
>>
>> ,----
>> | < #part type=text/plain format="flowed" charset="utf-8"
>> | disposition=inline nofile=yes>
>> `----
>>
>> so I guess that the html renderer should pick it up. I have tested shr,
>> gnus-w3m and w3m and I always get the same result.
>>
>> I would be grateful if somebody could help me understand what happens.
>
> How does the character appear in the original HTML?
Thanks for your quick response.
I don't know if I am inspecting the message correctly, because when I
enter the edit mode, all characters appear OK. Therefore, I am not sure
if I an seeing the original html.
I have also noticed that the I also have the same issue with non html
e-mails. I thought they were html, but they are just multipart.
For instance, here is what I see in the article buffer:
,----
| \311lodie, qui a rejoint l'\351quipe podcast, me dit que sa soeur, qui a une
| formation th\351\342trale, serait disponible ponctuellement pour faire des
| voix pour des lectures. Pour le moment on a jamais eu ce besoin mais \347a
| peut ouvrir des perspectives.
`----
(I have replaced the non printable chars with \xxx) and here is what I
see in edit mode:
,----
|
[-- Attachment #2.1: Type: text/plain, Size: 2 bytes --]
|
[-- Attachment #2.2: Type: text/plain, Size: 261 bytes --]
|
| Élodie, qui a rejoint l'équipe podcast, me dit que sa soeur, qui a une
| formation théâtrale, serait disponible ponctuellement pour faire des
| voix pour des lectures. Pour le moment on a jamais eu ce besoin mais ça
| peut ouvrir des perspectives.
|
[-- Attachment #2.3: Type: text/plain, Size: 171 bytes --]
| Pour connaître la configuration de la liste, gérer votre abonnement à la
| liste et vos informations personnelles :
| https://listes.april.org/wws/info/libreavous
|
[-- Attachment #3: Type: text/plain, Size: 299 bytes --]
`----
Again, when I quit the edit mode, the article buffer displays things correctly.
In the case of html I have for instace this in the article buffer:
,----
| Les groupes suprimacistes blancs ont profiti du mandat de Donald Trump et des ...
`----
and this in the edit mode buffer
,----
|
[-- Attachment #4: Type: text/plain, Size: 2 bytes --]
|
[-- Attachment #5: Type: text/plain, Size: 99 bytes --]
`----
So now I think this is not due to html, but to multipart MIME.
Thanks again for your help.
next prev parent reply other threads:[~2020-10-10 14:35 UTC|newest]
Thread overview: 13+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-10-10 13:34 Incorrect rendering of accented characters in HTML e-mail (Gnus) Garjola Dindi
2020-10-10 14:00 ` Eli Zaretskii
2020-10-10 14:35 ` Garjola Dindi [this message]
2020-10-10 14:44 ` Eli Zaretskii
2020-10-10 15:53 ` Garjola Dindi
2020-10-10 16:12 ` Eli Zaretskii
2020-10-10 20:10 ` Garjola Dindi
2020-10-11 7:15 ` Damien Collard
2020-10-11 10:27 ` Garjola Dindi
2020-10-11 11:27 ` Garjola Dindi
2020-10-11 15:26 ` Damien Collard
2020-10-12 12:01 ` Garjola Dindi
2021-01-21 7:52 ` Garjola Dindi
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=87tuv287za.fsf@pc-117-162.ovh.com \
--to=garjola@garjola.net \
--cc=help-gnu-emacs@gnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
Code repositories for project(s) associated with this external index
https://git.savannah.gnu.org/cgit/emacs.git
https://git.savannah.gnu.org/cgit/emacs/org-mode.git
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.