all messages for Emacs-related lists mirrored at yhetil.org
From: Eli Zaretskii <eliz@gnu.org>
To: help-gnu-emacs@gnu.org
Subject: Re: Incorrect rendering of accented characters in HTML e-mail (Gnus)
Date: Sat, 10 Oct 2020 17:00:54 +0300
Message-ID: <83v9fi41ux.fsf@gnu.org>
In-Reply-To: <87362mp5md.fsf@pc-117-162.ovh.com> (message from Garjola Dindi on Sat, 10 Oct 2020 15:34:02 +0200)

> From: Garjola Dindi <garjola@garjola.net>
> Date: Sat, 10 Oct 2020 15:34:02 +0200
> 
> If I use describe-char to inspect the characters, I get this before
> «washing»:
> 
> ,----
> | position: 470 of 867 (54%), column: 30                                     
> | character: i (displayed as i) (codepoint 105, #o151, #x69)                 
> | charset: ascii (ASCII (ISO646 IRV))                                        
> | code point in charset: 0x69                                                
> | script: latin                                                              
> | syntax: w 	which means: word
> | category: .:Base, L:Left-to-right (strong), a:ASCII, l:Latin, r:Roman      
> | to input: type "C-x 8 RET 69" or "C-x 8 RET LATIN SMALL LETTER I"          
> | buffer code: #x69                                                          
> | file code: #x69 (encoded by coding system utf-8-unix)                      
> | display: by this font (glyph code)                                         
> | ftcrhb:-GOOG-Noto Sans-normal-normal-normal-*-19-*-*-*-*-0-iso10646-1 (#x4C
> |                                                                            
> | Character code properties: customize what to show                          
> | name: LATIN SMALL LETTER I                                                 
> | general-category: Ll (Letter, Lowercase)                                   
> | decomposition: (105) ('i')                                                 
> |                                                                            
> | There is an overlay here:                                                  
> | From 440 to 520                                                            
> | face                 hl-line                                               
> | priority             -50                                                   
> | window               #<window 141 on *Article nnmaildir+RSSFeeds:ABlog*>   
> |                                                                            
> |                                                                            
> | There are text properties here:                                            
> | face                 variable-pitch                                        
> `----
> 
> And this after «washing»
> 
> ,----
> | position: 472 of 871 (54%), column: 30                                      
> | character: é (displayed as é) (codepoint 233, #o351, #xe9)                  
> | charset: unicode (Unicode (ISO10646))                                       
> | code point in charset: 0xE9                                                 
> | script: latin                                                               
> | syntax: w 	which means: word                                             
> | category: .:Base, L:Left-to-right (strong), c:Chinese, j:Japanese, l:Latin, 
> | v:Viet
> | to input: type "C-x 8 RET e9" or "C-x 8 RET LATIN SMALL LETTER E WITH ACUTE"
> | buffer code: #xC3 #xA9                                                      
> | file code: #xC3 #xA9 (encoded by coding system utf-8-unix)                  
> | display: by this font (glyph code)                                          
> | ftcrhb:-GOOG-Noto Sans-normal-normal-normal-*-19-*-*-*-*-0-iso10646-1 (#xAB)
> |                                                                             
> | Character code properties: customize what to show                           
> | name: LATIN SMALL LETTER E WITH ACUTE                                       
> | old-name: LATIN SMALL LETTER E ACUTE                                        
> | general-category: Ll (Letter, Lowercase)                                    
> | decomposition: (101 769) ('e' '́')                                           
> |                                                                             
> | There is an overlay here:                                                   
> | From 442 to 523                                                             
> | face                 hl-line                                                
> | priority             -50                                                    
> | window               #<window 155 on *Article nnmaildir+RSSFeeds:ABlog*>    
> |                                                                             
> |                                                                             
> | There are text properties here:                                             
> | face                 variable-pitch                                         
> `----
> 
> The html part of the e-mails contains
> 
> ,----
> | < #part type=text/plain format="flowed" charset="utf-8"
> | disposition=inline nofile=yes>
> `----
> 
> so I guess that the html renderer should pick it up. I have tested shr,
> gnus-w3m and w3m and I always get the same result.
> 
> I would be grateful if somebody could help me understand what happens.

How does the character appear in the original HTML?
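
(Not part of the original message: a minimal Python sketch of the forms the
character might take in the HTML source, for comparison with the describe-char
output quoted above. The UTF-8 bytes and the decomposition match the values
shown there; the HTML entity and the Latin-1 mis-decoding are assumptions about
what the raw HTML could contain, not something the thread has confirmed.)

```python
import html
import unicodedata

# LATIN SMALL LETTER E WITH ACUTE, the character reported after "washing".
ch = "\u00e9"

# UTF-8 encoding: corresponds to "buffer code: #xC3 #xA9" in describe-char.
utf8_bytes = ch.encode("utf-8")

# Canonical decomposition: corresponds to "decomposition: (101 769)".
decomposed = unicodedata.normalize("NFD", ch)
codepoints = [ord(c) for c in decomposed]

# Assumption: the HTML might spell the character as a named entity instead.
from_entity = html.unescape("&eacute;")

# Assumption: if the UTF-8 bytes were decoded as Latin-1 by mistake,
# each byte would become its own character.
misdecoded = utf8_bytes.decode("latin-1")

print(unicodedata.name(ch))
print([hex(b) for b in utf8_bytes])
print(codepoints)
print(from_entity == ch)
print([hex(ord(c)) for c in misdecoded])
```

Looking at the raw article (for example with `C-u g` in the Gnus summary
buffer) and comparing it against these forms may help narrow down where the
character is lost.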




Thread overview: 13+ messages
2020-10-10 13:34 Incorrect rendering of accented characters in HTML e-mail (Gnus) Garjola Dindi
2020-10-10 14:00 ` Eli Zaretskii [this message]
2020-10-10 14:35   ` Garjola Dindi
2020-10-10 14:44     ` Eli Zaretskii
2020-10-10 15:53       ` Garjola Dindi
2020-10-10 16:12         ` Eli Zaretskii
2020-10-10 20:10           ` Garjola Dindi
2020-10-11  7:15     ` Damien Collard
2020-10-11 10:27       ` Garjola Dindi
2020-10-11 11:27       ` Garjola Dindi
2020-10-11 15:26         ` Damien Collard
2020-10-12 12:01           ` Garjola Dindi
2021-01-21  7:52           ` Garjola Dindi
