From: Peter Dyballa <Peter_Dyballa@Web.DE>
Cc: help-gnu-emacs@gnu.org
Subject: Re: searching for non ascii characters
Date: Wed, 3 Aug 2005 16:09:13 +0200
Message-ID: <2aa9db9858e6c4eb17440a5941cff4f0@Web.DE>
In-Reply-To: <uacjz40b4.fsf@cwazy.co.uk>
On 03.08.2005 at 15:28, rahed@cwazy.co.uk wrote:
> character: š (01210241, 331937, 0x510a1)
> charset: mule-unicode-0100-24ff (Unicode characters of the range
> U+0100..U+24FF.)
> code point: 33 33
> syntax: word
> category: l:Latin
> buffer code: 0x9C 0xF4 0xA1 0xA1
> file code: B9 (encoded by coding system iso-latin-2-unix)
> font: -outline-Courier New-normal-r-normal-normal-13-97-96-96-c-80-iso10646-1
>
My own test file with ISO 8859-2 encoding has this in GNU Emacs 23:
character: š (0541, 353, 0x161)
preferred charset: iso-8859-2 (ISO/IEC 8859/2)
code point: 0xB9
syntax: w which means: word
category: j:Japanese l:Latin
buffer code: 0xC5 0xA1
file code: 0xB9 (encoded by coding system iso-latin-2-unix)
display: by this font (glyph code)
-B&H-LucidaTypewriter-Medium-R-Normal-Sans-10-100-75-75-M-60-ISO10646-1
(0x161)
and this in GNU Emacs 22 and 21.3:
character: š (04471, 2361, 0x939, U+0161)
charset: [latin-iso8859-2]
(Right-Hand Part of Latin Alphabet 2 (ISO/IEC 8859-2):
ISO-IR-101.)
code point: [57]
syntax: w which means: word
category: l:Latin
buffer code: 0x82 0xB9
file code: 0xB9 (encoded by coding system iso-latin-2-unix)
display: by this font (glyph code)
-B&H-LucidaTypewriter-Medium-R-Normal-Sans-10-100-75-75-M-60-ISO8859-2
(0xB9)
Both use the right charset and encoding. If you close and reopen that
file, and it has that '-*- coding: iso-8859-2; -*-' cookie on its first
line (or on the second, if the first is a #! line), Emacs should switch
to that coding system -- unless a block of local (file) variables at the
file's end says something different, or Emacs is fixed on one specific
coding system. Did you restart Emacs after changing .emacs? Can you
check the variable's state (C-h v on the variable you set in .emacs, in
a freshly launched Emacs)? If its value differs from what you set, then
either the statement is not executed at all, or the variable is set more
than once and gets reset some time after this line ... What does the
tail of your file look like?
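To check a file for both places by hand, something like the following
Python sketch might help. It is not Emacs's own logic: the helper names
find_coding_cookie and find_local_variables_coding are made up for this
example, and the limits (first two lines for the cookie, last 3000
characters for the local variables block) are assumptions about where
Emacs looks.

```python
import re

# Hypothetical pattern: a "-*- ... coding: NAME ... -*-" cookie on one line.
COOKIE = re.compile(r"-\*-.*?\bcoding:\s*([A-Za-z0-9._-]+).*?-\*-")


def find_coding_cookie(text, max_lines=2):
    """Return the coding name from a -*- cookie in the first lines, or None."""
    for line in text.splitlines()[:max_lines]:
        match = COOKIE.search(line)
        if match:
            return match.group(1)
    return None


def find_local_variables_coding(text, tail_chars=3000):
    """Return the coding name from a 'Local Variables:' block near the end, or None."""
    tail = text[-tail_chars:]
    start = tail.rfind("Local Variables:")
    if start == -1:
        return None
    end = tail.find("End:", start)
    block = tail[start:] if end == -1 else tail[start:end]
    match = re.search(r"\bcoding:\s*([A-Za-z0-9._-]+)", block)
    return match.group(1) if match else None


if __name__ == "__main__":
    sample = (
        "% -*- coding: iso-8859-2; -*-\n"
        "some text\n"
        "% Local Variables:\n"
        "% coding: utf-8\n"
        "% End:\n"
    )
    head = find_coding_cookie(sample)
    tail = find_local_variables_coding(sample)
    print("cookie:", head, "local variables:", tail)
    if head and tail and head != tail:
        print("the two declarations disagree")
```

If the two results differ, that disagreement is the first thing to
clear up before looking any further.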
The last thing I can think of is the use of fontsets instead of fonts.
What is your setup there?
Your file has the correct byte, 0xB9, at the position of LATIN SMALL
LETTER S WITH CARON, so it is presumably still correctly encoded. To see
it as ISO/IEC 8859-2 you can use revert-buffer-with-coding-system, C-x
RET r CODING-SYSTEM. Use M-x list-coding-systems to see what your system
has.
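The byte values can be cross-checked outside Emacs. A minimal Python
sketch, using only the standard codecs (the function name encodings is
made up for this example):

```python
# U+0161 LATIN SMALL LETTER S WITH CARON
S_CARON = "\u0161"


def encodings(ch):
    """Return the ISO 8859-2 and UTF-8 byte sequences of one character."""
    return ch.encode("iso8859_2"), ch.encode("utf-8")


if __name__ == "__main__":
    file_bytes, utf8_bytes = encodings(S_CARON)
    print(ord(S_CARON), oct(ord(S_CARON)), hex(ord(S_CARON)))  # 353 0o541 0x161
    print(file_bytes.hex())  # b9
    print(utf8_bytes.hex())  # c5a1

    # The same byte read as ISO 8859-1 is a different character (U+00B9),
    # which is what a wrongly chosen coding system would display.
    wrong = file_bytes.decode("iso8859_1")
    print(hex(ord(wrong)))  # 0xb9
```

The single byte b9 agrees with the "file code" in all three outputs
above, and c5 a1 agrees with the "buffer code" in the Emacs 23 output.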
--
Greetings
Pete
Windows is a bit like Beaujolais nouveau: with each new vintage you
know it will be disgusting, but you have some anyway, out of
masochism.