From mboxrd@z Thu Jan  1 00:00:00 1970
Path: news.gmane.org!not-for-mail
From: Peter Dyballa <Peter_Dyballa@Web.DE>
Newsgroups: gmane.emacs.help
Subject: Re: searching for non ascii characters
Date: Wed, 3 Aug 2005 16:09:13 +0200
Message-ID: <2aa9db9858e6c4eb17440a5941cff4f0@Web.DE>
References: <u3bpsulsx.fsf@cwazy.co.uk>
	<mailman.2370.1123016502.20277.help-gnu-emacs@gnu.org>
	<uacjz40b4.fsf@cwazy.co.uk>
NNTP-Posting-Host: main.gmane.org
Mime-Version: 1.0 (Apple Message framework v622)
Content-Type: text/plain; charset=WINDOWS-1252; format=flowed
Content-Transfer-Encoding: quoted-printable
X-Trace: sea.gmane.org 1123078843 22219 80.91.229.2 (3 Aug 2005 14:20:43 GMT)
X-Complaints-To: usenet@sea.gmane.org
NNTP-Posting-Date: Wed, 3 Aug 2005 14:20:43 +0000 (UTC)
Cc: help-gnu-emacs@gnu.org
Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Wed Aug 03 16:20:40 2005
Return-path: <help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org>
Original-Received: from lists.gnu.org ([199.232.76.165])
	by ciao.gmane.org with esmtp (Exim 4.43)
	id 1E0K4H-0002p9-Sk
	for geh-help-gnu-emacs@m.gmane.org; Wed, 03 Aug 2005 16:18:02 +0200
Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org)
	by lists.gnu.org with esmtp (Exim 4.43)
	id 1E0K72-0007lY-KD
	for geh-help-gnu-emacs@m.gmane.org; Wed, 03 Aug 2005 10:20:52 -0400
Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43)
	id 1E0K5W-0006tR-VZ
	for help-gnu-emacs@gnu.org; Wed, 03 Aug 2005 10:19:19 -0400
Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43)
	id 1E0K5M-0006nR-N5
	for help-gnu-emacs@gnu.org; Wed, 03 Aug 2005 10:19:12 -0400
Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org)
	by lists.gnu.org with esmtp (Exim 4.43) id 1E0K5L-0006jG-LL
	for help-gnu-emacs@gnu.org; Wed, 03 Aug 2005 10:19:07 -0400
Original-Received: from [217.72.192.226] (helo=smtp08.web.de)
	by monty-python.gnu.org with esmtp
	(TLS-1.0:DHE_RSA_3DES_EDE_CBC_SHA:24) (Exim 4.34) id 1E0K8O-0007i9-PW
	for help-gnu-emacs@gnu.org; Wed, 03 Aug 2005 10:22:17 -0400
Original-Received: from [84.245.189.163] (helo=[192.168.1.2])
	by smtp08.web.de with asmtp (TLSv1:RC4-SHA:128) (WEB.DE 4.105 #314)
	id 1E0Jvn-00065n-00; Wed, 03 Aug 2005 16:09:15 +0200
In-Reply-To: <uacjz40b4.fsf@cwazy.co.uk>
X-Image-Url: http://homepage.mac.com/sparifankal/.cv/thumbs/me.thumbnail
Original-To: rahed@cwazy.co.uk
X-Mailer: Apple Mail (2.622)
X-Sender: Peter_Dyballa@web.de
X-BeenThere: help-gnu-emacs@gnu.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: Users list for the GNU Emacs text editor <help-gnu-emacs.gnu.org>
List-Unsubscribe: <http://lists.gnu.org/mailman/listinfo/help-gnu-emacs>,
	<mailto:help-gnu-emacs-request@gnu.org?subject=unsubscribe>
List-Archive: <http://lists.gnu.org/pipermail/help-gnu-emacs>
List-Post: <mailto:help-gnu-emacs@gnu.org>
List-Help: <mailto:help-gnu-emacs-request@gnu.org?subject=help>
List-Subscribe: <http://lists.gnu.org/mailman/listinfo/help-gnu-emacs>,
	<mailto:help-gnu-emacs-request@gnu.org?subject=subscribe>
Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org
Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org
Xref: news.gmane.org gmane.emacs.help:28391
X-Report-Spam: http://spam.gmane.org/gmane.emacs.help:28391


Am 03.08.2005 um 15:28 schrieb rahed@cwazy.co.uk:

>   character: =9A (01210241, 331937, 0x510a1)
>     charset: mule-unicode-0100-24ff (Unicode characters of the range=20=

> U+0100..U+24FF.)
>  code point: 33 33
>      syntax: word
>    category: l:Latin
> buffer code: 0x9C 0xF4 0xA1 0xA1
>   file code: B9 (encoded by coding system iso-latin-2-unix)
>        font: -outline-Courier=20
> New-normal-r-normal-normal-13-97-96-96-c-80-iso10646-1
>

My own test file with ISO 8859-2 encoding has this in GNU Emacs 23:

         character: =9A (0541, 353, 0x161)
preferred charset: iso-8859-2 (ISO/IEC 8859/2)
        code point: 0xB9
            syntax: w 	which means: word
          category: j:Japanese   l:Latin
       buffer code: 0xC5 0xA1
         file code: 0xB9 (encoded by coding system iso-latin-2-unix)
           display: by this font (glyph code)
     =20
-B&H-LucidaTypewriter-Medium-R-Normal-Sans-10-100-75-75-M-60-ISO10646-1=20=

(0x161)

and this in GNU Emacs 22 and 21.3:

   character: =9A (04471, 2361, 0x939, U+0161)
     charset: [latin-iso8859-2]
	     (Right-Hand Part of Latin Alphabet 2 (ISO/IEC 8859-2):=20
ISO-IR-101.)
  code point: [57]
      syntax: w 	which means: word
    category: l:Latin
buffer code: 0x82 0xB9
   file code: 0xB9 (encoded by coding system iso-latin-2-unix)
     display: by this font (glyph code)
     =20
-B&H-LucidaTypewriter-Medium-R-Normal-Sans-10-100-75-75-M-60-ISO8859-2=20=

(0xB9)

Both use the right charset and encoding. If you close and open again=20
that file and it has that '-*- coding: iso-8859-2; -*-' in its header,=20=

among the first six or nine lines, Emacs should switch to that coding=20
-- except you have at the file's end a block of local or file variables=20=

that say something different. Or it has a fixation to a specific=20
coding-system. Did you launch your Emacs after changing .emacs? Can you=20=

check the variable's state (C-h v on this variable in .emacs in newly=20
launched Emacs)? If it's something different than set then you either=20
have this statement not executed or it exists more than once and gets=20
reset some time after this line ... What does your file's tail look=20
like?

The last thing I think of is the use of fontsets instead of fonts. What=20=

is your status?

Your file has at LATIN SMALL LETTER S WITH CARON's position the correct=20=

byte, 0xB9. So it is presumingly still correctly encoded. To see it in=20=

ISO/IEC 8859-2 you can revert-buffer-with-coding-system, C-x RET r=20
CODING-SYSTEM. Use M-x list-coding-systems to see what your system has.

--
Greetings

   Pete

Windows, c'est un peu comme le beaujolais nouveau: =E0 chaque nouvelle=20=

cuv=E9e on sait que ce sera d=E9gueulasse, mais on en prend quand m=EAme, =
par=20
masochisme.