unofficial mirror of help-gnu-emacs@gnu.org
 help / color / mirror / Atom feed
From: ken <gebser@mousecar.com>
To: GNU Emacs List <help-gnu-emacs@gnu.org>
Subject: Re: garbage chars when pasting French chars into emacs
Date: Wed, 01 Feb 2012 21:39:22 -0500	[thread overview]
Message-ID: <4F29F75A.9070301@mousecar.com> (raw)
In-Reply-To: <83ty3ap7t1.fsf@gnu.org>


On 02/01/2012 04:23 PM Eli Zaretskii wrote:
>> Date: Wed, 01 Feb 2012 15:41:42 -0500
>> From: ken <gebser@mousecar.com>
>>
>> Just to be comprehensive I'll state at the outset that I'm using Linux 
>> (CentOS 5.7), so this is the environment emacs is working in.  From a 
>> shell I get this:
>>
>> $ set|grep -i lang
>> LANG=en_US.UTF-8
>>
>> Now I pull up a webpage with some French on it: 
>> <http://www.wikilivres.info/wiki/Maurice_Merleau-Ponty>.  Examining the 
>> source code of this page, I see at the top:
>>
>> <meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />
>>
>> So this page is presented in UTF-8.
>>
>> Firefox is also set to present pages in UTF-8: View -> Character 
>> Encoding -> UTF-8
>>
>> But when I copy and paste the text from "Francais" to "invisible, 1964)" 
>> inclusive, many of the characters aren't rendered correctly; I get 
>> "garbage" characters in their stead, e.g., the second-to-last line 
>> appears something like this:
>>
>>      * L^[$(B!G^[$(C)+^[(Bil et l^[$(B!G^[(Besprit, Gallimard, 1960
>>
>> Other lines are improperly rendered also.
>>
>> I'd like to fix this.  And if possible understand why this doesn't work, 
>> so I might be able to diagnose these problems for myself.
> 
> What is your value of selection-coding-system?  Try setting it to
> something like ctext-with-extensions.

Thanks, Eli,

Immediately prior to doing the copy-and-paste I ran all of these:

(set-language-environment               'UTF-8)
(set-default-coding-systems             'utf-8)
(setq file-name-coding-system           'utf-8)
(setq default-buffer-file-coding-system 'utf-8)
(setq coding-system-for-write           'utf-8)
(set-keyboard-coding-system             'utf-8)
(set-terminal-coding-system             'utf-8)
(set-clipboard-coding-system            'utf-8)
(set-selection-coding-system            'utf-8)
(prefer-coding-system                   'utf-8)
(modify-coding-system-alist 'process "\\*shell\\*\\'" 'utf-8-unix)

Following your advice, I ran

(set-selection-coding-system 'ctext-with-extensions)

and then did the same copy-and-paste again.  This got more of the 
characters correct, but not all of them.  So we're a lot closer....  Got 
another suggestion?





  reply	other threads:[~2012-02-02  2:39 UTC|newest]

Thread overview: 9+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2012-02-01 20:41 garbage chars when pasting French chars into emacs ken
2012-02-01 21:23 ` Eli Zaretskii
2012-02-02  2:39   ` ken [this message]
2012-02-02  3:55     ` Eli Zaretskii
2012-02-02 20:00       ` ken
2012-02-03  7:31         ` Eli Zaretskii
2012-02-06 18:01           ` different distro [was: Re: garbage chars when pasting French chars into emacs] ken
2012-02-06 20:15             ` Peter Dyballa
2012-02-01 21:29 ` garbage chars when pasting French chars into emacs Philipp Haselwarter

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: https://www.gnu.org/software/emacs/

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4F29F75A.9070301@mousecar.com \
    --to=gebser@mousecar.com \
    --cc=help-gnu-emacs@gnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).