From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Ralf Angeli Newsgroups: gmane.emacs.help Subject: Re: Convert unibyte to multibyte on input Date: Wed, 06 Jul 2005 09:35:07 +0200 Organization: nil Message-ID: <42cb89ab$0$18648$14726298@news.sunsite.dk> References: <42ca65d2$0$18647$14726298@news.sunsite.dk> Reply-To: angeli@iwi.uni-sb.de NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit X-Trace: sea.gmane.org 1120635558 17875 80.91.229.2 (6 Jul 2005 07:39:18 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Wed, 6 Jul 2005 07:39:18 +0000 (UTC) Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Wed Jul 06 09:39:10 2005 Return-path: Original-Received: from lists.gnu.org ([199.232.76.165]) by ciao.gmane.org with esmtp (Exim 4.43) id 1Dq4Uk-0002x5-Uz for geh-help-gnu-emacs@m.gmane.org; Wed, 06 Jul 2005 09:38:59 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Dq4W3-000280-H1 for geh-help-gnu-emacs@m.gmane.org; Wed, 06 Jul 2005 03:40:19 -0400 Original-Path: shelby.stanford.edu!newsfeed.stanford.edu!postnews.google.com!news3.google.com!news.glorb.com!dotsrc.org!news.dotsrc.org!not-for-mail Original-Newsgroups: gnu.emacs.help Mail-Copies-To: nobody User-Agent: Gnus/5.110004 (No Gnus v0.4) Emacs/22.0.50 (gnu/linux) Cancel-Lock: sha1:HxRrM29MuqfboR3bo9Axey/uZws= Original-Lines: 27 Original-NNTP-Posting-Host: 134.96.72.190 Original-X-Trace: news.sunsite.dk DXC=gRJTm8UJAIm5kL0JZ2 List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:27811 X-Report-Spam: http://spam.gmane.org/gmane.emacs.help:27811 * Ralf Angeli (2005-07-05) writes: > If I understand (info "(elisp)Converting Representations") correctly, > Emacs will convert unibyte text to multibyte if it is inserted into a > multibyte buffer. However, on Windows I could observe that text, > guillemets in particular, copied from the character table and pasted > into Emacs will remain in its unibyte representation. When typing > `C-u C-x =' on a « character one gets the following result with a CVS > Emacs checked out and compiled a few days ago: > > ,---- > | character: « (0253, 171, 0xab) > | charset: eight-bit-graphic (8-bit graphic char (0xA0..0xFF)) > | code point: 171 > | syntax: which means: whitespace > | buffer code: 0xAB > | file code: 0xAB (encoded by coding system raw-text-dos) I think I identified the cause for this. The problem shows up particularly in LaTeX files. Those are opened with a raw-text-dos file coding system which prevents character code conversion. The raw-text-dos file coding system being picked is likely the result of missing autoloads for latexenc.el in the Windows build. I sent a bug report to emacs-pretest-bugs. -- Ralf