From mboxrd@z Thu Jan 1 00:00:00 1970
Path: news.gmane.org!not-for-mail
From: Peter Dyballa
Newsgroups: gmane.emacs.help
Subject: Re: searching for non ascii characters
Date: Wed, 3 Aug 2005 16:09:13 +0200
Message-ID: <2aa9db9858e6c4eb17440a5941cff4f0@Web.DE>
References:
NNTP-Posting-Host: main.gmane.org
Mime-Version: 1.0 (Apple Message framework v622)
Content-Type: text/plain; charset=WINDOWS-1252; format=flowed
Content-Transfer-Encoding: quoted-printable
X-Trace: sea.gmane.org 1123078843 22219 80.91.229.2 (3 Aug 2005 14:20:43 GMT)
X-Complaints-To: usenet@sea.gmane.org
NNTP-Posting-Date: Wed, 3 Aug 2005 14:20:43 +0000 (UTC)
Cc: help-gnu-emacs@gnu.org
Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Wed Aug 03 16:20:40 2005
Return-path:
Original-Received: from lists.gnu.org ([199.232.76.165]) by ciao.gmane.org with esmtp (Exim 4.43) id 1E0K4H-0002p9-Sk for geh-help-gnu-emacs@m.gmane.org; Wed, 03 Aug 2005 16:18:02 +0200
Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1E0K72-0007lY-KD for geh-help-gnu-emacs@m.gmane.org; Wed, 03 Aug 2005 10:20:52 -0400
Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1E0K5W-0006tR-VZ for help-gnu-emacs@gnu.org; Wed, 03 Aug 2005 10:19:19 -0400
Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1E0K5M-0006nR-N5 for help-gnu-emacs@gnu.org; Wed, 03 Aug 2005 10:19:12 -0400
Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1E0K5L-0006jG-LL for help-gnu-emacs@gnu.org; Wed, 03 Aug 2005 10:19:07 -0400
Original-Received: from [217.72.192.226] (helo=smtp08.web.de) by monty-python.gnu.org with esmtp (TLS-1.0:DHE_RSA_3DES_EDE_CBC_SHA:24) (Exim 4.34) id 1E0K8O-0007i9-PW for help-gnu-emacs@gnu.org; Wed, 03 Aug 2005 10:22:17 -0400
Original-Received: from [84.245.189.163] (helo=[192.168.1.2]) by smtp08.web.de with asmtp (TLSv1:RC4-SHA:128)
	(WEB.DE 4.105 #314) id 1E0Jvn-00065n-00; Wed, 03 Aug 2005 16:09:15 +0200
In-Reply-To:
X-Image-Url: http://homepage.mac.com/sparifankal/.cv/thumbs/me.thumbnail
Original-To: rahed@cwazy.co.uk
X-Mailer: Apple Mail (2.622)
X-Sender: Peter_Dyballa@web.de
X-BeenThere: help-gnu-emacs@gnu.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: Users list for the GNU Emacs text editor
List-Unsubscribe: ,
List-Archive:
List-Post:
List-Help:
List-Subscribe: ,
Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org
Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org
Xref: news.gmane.org gmane.emacs.help:28391
X-Report-Spam: http://spam.gmane.org/gmane.emacs.help:28391

On 03.08.2005 at 15:28, rahed@cwazy.co.uk wrote:

>   character: š (01210241, 331937, 0x510a1)
>     charset: mule-unicode-0100-24ff (Unicode characters of the range U+0100..U+24FF.)
>  code point: 33 33
>      syntax: word
>    category: l:Latin
> buffer code: 0x9C 0xF4 0xA1 0xA1
>   file code: B9 (encoded by coding system iso-latin-2-unix)
>        font: -outline-Courier New-normal-r-normal-normal-13-97-96-96-c-80-iso10646-1
>

My own test file with ISO 8859-2 encoding has this in GNU Emacs 23:

        character: š (0541, 353, 0x161)
preferred charset: iso-8859-2 (ISO/IEC 8859/2)
       code point: 0xB9
           syntax: w    which means: word
         category: j:Japanese l:Latin
      buffer code: 0xC5 0xA1
        file code: 0xB9 (encoded by coding system iso-latin-2-unix)
          display: by this font (glyph code)
     -B&H-LucidaTypewriter-Medium-R-Normal-Sans-10-100-75-75-M-60-ISO10646-1 (0x161)

and this in GNU Emacs 22 and 21.3:

  character: š (04471, 2361, 0x939, U+0161)
    charset: [latin-iso8859-2] (Right-Hand Part of Latin Alphabet 2 (ISO/IEC 8859-2): ISO-IR-101.)
 code point: [57]
     syntax: w    which means: word
   category: l:Latin
buffer code: 0x82 0xB9
  file code: 0xB9 (encoded by coding system iso-latin-2-unix)
    display: by this font (glyph code)
     -B&H-LucidaTypewriter-Medium-R-Normal-Sans-10-100-75-75-M-60-ISO8859-2 (0xB9)

Both use the right charset and encoding. If you close and reopen that
file and it has '-*- coding: iso-8859-2; -*-' in its first line (or in
the second, if the first is a #! line), Emacs should switch to that
coding, unless a block of local (file) variables at the end of the file
says something different, or Emacs has been pinned to a specific
coding system.

Did you launch a fresh Emacs after changing .emacs? Can you check the
variable's state (C-h v on the variable you set in .emacs, in the newly
launched Emacs)? If its value differs from what you set, then either
that statement is not executed at all, or the variable is set more than
once and gets reset some time after this line ... What does the tail of
your file look like?

The last thing I can think of is the use of fontsets instead of fonts.
What is your situation there?

Your file has the correct byte, 0xB9, at the position of LATIN SMALL
LETTER S WITH CARON, so it is presumably still correctly encoded. To
see it in ISO/IEC 8859-2 you can use revert-buffer-with-coding-system,
C-x RET r CODING-SYSTEM. Use M-x list-coding-systems to see which
coding systems your Emacs offers.

--
Greetings

   Pete

Windows is a bit like Beaujolais nouveau: with every new vintage you
know it will be disgusting, but you have some anyway, out of
masochism.
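
The byte values in the describe-char output above can be cross-checked
outside Emacs. What follows is only a minimal sketch in Python, assuming
nothing beyond its standard codecs; the names FILE_BYTE and misread_as
are made up for this illustration and do not come from Emacs.

```python
# Sketch: cross-check the "file code" and "buffer code" values with
# Python's standard codecs. FILE_BYTE and misread_as are illustrative names.

FILE_BYTE = b"\xb9"  # file code reported for the character (iso-latin-2)

# Decode the byte the way iso-latin-2 (ISO 8859-2) defines it.
char = FILE_BYTE.decode("iso8859_2")
codepoint = ord(char)

# UTF-8 form of the same character, for comparison with the
# Emacs 23 "buffer code" line.
utf8_bytes = char.encode("utf-8")


def misread_as(encoding: str) -> str:
    """Decode the same byte under a different, wrong assumption."""
    return FILE_BYTE.decode(encoding)


print(f"character: {char} (octal {codepoint:o}, decimal {codepoint}, U+{codepoint:04X})")
print("UTF-8 bytes:", " ".join(f"0x{b:02X}" for b in utf8_bytes))
print("same byte read as ISO 8859-1:", misread_as("iso8859_1"))
```

If the byte decodes to LATIN SMALL LETTER S WITH CARON under ISO 8859-2,
as it does here, the file itself is intact and only the coding system
chosen when visiting it (or the font) needs attention.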