From: Sam Steingold <sds@gnu.org>
Subject: Re: Cyrillic, utf-8 and windows
Date: Tue, 09 Dec 2003 13:25:00 -0500
Message-ID: <ufzftq26b.fsf@gnu.org>
In-Reply-To: ubrqjqhln.fsf@gnu.org

> * Sam Steingold <fqf@tah.bet> [2003-12-08 13:39:32 -0500]:
>
> GNU Emacs 21.3.50.1 (i386-msvc-nt5.0.2195)
>  of 2003-11-20 on WINSTEINGOLDLAP
> --with-msvc (12.00)
>
> I can open in Emacs a utf-8 file with Cyrillic characters in it and it
> is displayed just fine - with correct glyphs &c.
> I set `default-input-method' to "cyrillic-yawerty" in .emacs,
> so when I try C-\ `toggle-input-method', I get 2 "character outline
> boxes" in the modeline and when I type, I see these "character outline
> boxes" in the buffer instead of the characters I just typed.
> When I save the buffer, kill it, and re-visit the file,
> I see what I just typed displayed correctly as Cyrillic!
> So, why does Emacs display the characters that I type as boxes
> (rectangles) but shows them correctly when loaded from a file on disk?
>
> I use:
>
>   (setq default-input-method "cyrillic-yawerty")
>   (prefer-coding-system 'utf-8)
>   (when (fboundp 'utf-translate-cjk-mode) (utf-translate-cjk-mode 1))

When I type using cyrillic-yawerty, the inserted character is described like this:

  character: а (07120, 3664, 0xe50, U+0430)
    charset: cyrillic-iso8859-5
             (Right-Hand Part of Latin/Cyrillic Alphabet (ISO/IEC 8859-5): ISO-IR-144.)
 code point: 80
     syntax: w 	which means: word
   category: y:Cyrillic  
buffer code: 0x8C 0xD0
  file code: 0xD0 0xB0 (encoded by coding system mule-utf-8-unix)
    display: no font available
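
As a cross-check of the numbers above, here is a small Python sketch (not Emacs code; the helper name `describe_iso8859_5` is made up for illustration). It assumes that `code point` is the ISO 8859-5 byte with its high bit cleared, which is what the values shown suggest:

```python
def describe_iso8859_5(ch):
    """Return (code_point, file_code) for a character held in ISO 8859-5.

    Assumption: "code point" is the 8-bit byte with the high bit cleared.
    """
    byte = ch.encode("iso8859_5")[0]   # single byte in the 8-bit charset
    code_point = byte & 0x7F           # position within the right-hand part
    file_code = ch.encode("utf-8")     # bytes written by a UTF-8 coding system
    return code_point, file_code


code_point, file_code = describe_iso8859_5("\u0430")
print(code_point, " ".join(f"0x{b:02X}" for b in file_code))
# prints: 80 0xD0 0xB0
```

So the typed character is the right letter (U+0430) and would be saved as the right bytes; only the charset it lives in while in the buffer differs.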

When I save the file, kill the buffer, and visit the file again, that
character becomes:

  character: а (01212120, 332880, 0x51450, U+0430)
    charset: mule-unicode-0100-24ff
             (Unicode characters of the range U+0100..U+24FF.)
 code point: 40 80
     syntax: w 	which means: word
   category: y:Cyrillic  
buffer code: 0x9C 0xF4 0xA8 0xD0
  file code: 0xD0 0xB0 (encoded by coding system mule-utf-8-unix)
    display: by this font (glyph code)
     -outline-Courier New-normal-r-normal-normal-13-97-96-96-c-80-iso10646-1 (0x430)
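
The `code point: 40 80` line is consistent with U+0430 being stored as a two-part position in a 96x96 charset that starts at U+0100. A Python sketch of that arithmetic follows; the layout is an assumption inferred from the values shown here, not taken from the Emacs sources, and `mule_unicode_position` is a hypothetical name:

```python
def mule_unicode_position(codepoint, base=0x100):
    """Split a Unicode code point into (row, column) position codes.

    Assumption: 96 columns per row, with both codes offset by 32.
    """
    if not 0x100 <= codepoint <= 0x24FF:
        raise ValueError("outside U+0100..U+24FF")
    row, col = divmod(codepoint - base, 96)
    return row + 32, col + 32


pos = mule_unicode_position(0x0430)
print(pos)                            # prints: (40, 80)
print([hex(p | 0x80) for p in pos])   # prints: ['0xa8', '0xd0']
```

With the high bit set, the two position codes match the last two bytes of the `buffer code` line (0xA8 0xD0).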

So, how do I tell cyrillic-yawerty to insert UTF-8?!


-- 
Sam Steingold (http://www.podval.org/~sds) running w2k
<http://www.camera.org> <http://www.iris.org.il> <http://www.memri.org/>
<http://www.mideasttruth.com/> <http://www.honestreporting.com>
When you talk to God, it's prayer; when He talks to you, it's schizophrenia.
