From: Simon Josefsson <jas@extundo.com>
Subject: Cyrillic vs UTF-8
Date: Fri, 25 Apr 2003 18:12:17 +0200 [thread overview]
Message-ID: <iluadee4kv2.fsf@latte.josefsson.org> (raw)
$ emacs -q --no-site-file
C-h H (view HELLO file)
Mark the line with Russian text with mouse
q (quit HELLO file)
C-x C-f ff RET (open a new file)
C-y (yank the text, looks fine in the new buffer)
C-x C-s (save file, it complains that iso-latin-1 cannot
encode the data, and suggests utf-8)
RET (go with the default utf-8)
C-x C-k (kill buffer)
C-x C-f ff RET (open file again)
(emacs fail to recognize it as utf-8 and displays gibberish)
C-x C-k (kill buffer)
C-x RET c utf-8 C-x C-f ff RET (open fail as utf-8)
(emacs recognize the file as utf-8 but display empty boxes)
Pressing C-u C-x = on the first empty box (first non-ascii character)
shows:
character: Р (01212100, 332864, 0x51440)
charset: mule-unicode-0100-24ff
(Unicode characters of the range U+0100..U+24FF.)
code point: 40 64
syntax: w which means: word
category: y:Cyrillic
buffer code: 0x9C 0xF4 0xA8 0xC0
file code: 0xD0 0xA0 (encoded by coding system mule-utf-8-unix)
Unicode: 0420
font: -Adobe-Courier-Medium-R-Normal--17-120-100-100-M-100-ISO10646-1
I think there are two problems. Opening the file the first time
should guess it is a utf-8 file. Secondly, emacs should be able to
find a font that contains the characters -- I have all font packages
from Debian installed. The following works fine:
-Misc-Fixed-Medium-R-Normal--18-120-100-100-C-90-ISO10646-1
In GNU Emacs 21.3.50.12 (i686-pc-linux-gnu)
of 2003-04-25 on latte.josefsson.org
configured using `configure '--with-gtk''
Important settings:
value of $LC_ALL: nil
value of $LC_COLLATE: nil
value of $LC_CTYPE: nil
value of $LC_MESSAGES: en_US.UTF-8
value of $LC_MONETARY: nil
value of $LC_NUMERIC: nil
value of $LC_TIME: en_US.UTF-8
value of $LANG: nil
locale-coding-system: nil
default-enable-multibyte-characters: t
Recent input:
M-x r e p o r <tab> <return>
Recent messages:
(emacs -q)
Loading tool-bar...done
Loading image...done
Loading tooltip...done
For information about the GNU Project and its goals, type C-h C-p.
Loading emacsbug...done
next reply other threads:[~2003-04-25 16:12 UTC|newest]
Thread overview: 55+ messages / expand[flat|nested] mbox.gz Atom feed top
2003-04-25 16:12 Simon Josefsson [this message]
2003-04-25 16:40 ` Cyrillic vs UTF-8 Eli Zaretskii
2003-04-25 17:09 ` Simon Josefsson
2003-04-25 22:39 ` Eli Zaretskii
2003-04-26 8:11 ` Kenichi Handa
2003-04-26 12:25 ` Simon Josefsson
2003-04-28 9:18 ` Kenichi Handa
2003-04-28 11:11 ` Simon Josefsson
2003-04-26 16:21 ` Benjamin Riefenstahl
2003-04-26 16:27 ` Benjamin Riefenstahl
2003-04-28 4:38 ` Richard Stallman
2003-05-01 8:27 ` Kenichi Handa
2003-05-02 7:06 ` Richard Stallman
2003-05-02 21:51 ` Eli Zaretskii
2003-05-03 13:37 ` Juanma Barranquero
2003-05-03 19:04 ` Eli Zaretskii
2003-05-04 13:03 ` Richard Stallman
2003-05-04 11:04 ` Dave Love
2003-05-04 12:01 ` Simon Josefsson
2003-05-04 17:13 ` Dave Love
2003-05-04 18:03 ` Simon Josefsson
2003-05-05 8:47 ` Kenichi Handa
2003-04-26 13:44 ` Richard Stallman
2003-04-26 14:10 ` Simon Josefsson
2003-04-28 21:49 ` Stefan Monnier
2003-04-28 22:29 ` Simon Josefsson
2003-04-29 13:49 ` Stefan Monnier
2003-04-29 14:27 ` Simon Josefsson
2003-04-30 4:42 ` Stephen J. Turnbull
2003-04-30 5:43 ` Richard Stallman
2003-05-19 0:40 ` Kenichi Handa
2003-05-19 0:52 ` Stefan Monnier
2003-05-19 2:31 ` Kenichi Handa
2003-05-19 13:28 ` Stefan Monnier
2003-05-19 13:49 ` Stefan Monnier
2003-04-25 16:54 ` Simon Josefsson
2003-04-26 3:55 ` Implementing charset-aware X font names [was: Cyrillic vs UTF-8] Stephen J. Turnbull
2003-04-28 11:09 ` Kenichi Handa
2003-04-28 12:27 ` Implementing charset-aware X font names Stephen J. Turnbull
2003-05-01 11:13 ` Kenichi Handa
2003-05-01 14:14 ` Alex Schroeder
2003-05-01 23:16 ` Kenichi Handa
2003-04-26 7:59 ` Cyrillic vs UTF-8 Kenichi Handa
2003-04-26 12:14 ` Simon Josefsson
2003-05-01 7:20 ` Kenichi Handa
2003-05-01 14:06 ` Alex Schroeder
2003-05-01 18:03 ` Customizing fontsets (was: Cyrillic vs UTF-8) Oliver Scholz
2003-05-02 5:17 ` Customizing fontsets Alex Schroeder
2003-05-02 6:32 ` Kenichi Handa
2003-05-02 13:25 ` Stefan Monnier
2003-05-03 0:40 ` Oliver Scholz
2003-05-03 1:50 ` Kenichi Handa
2003-05-03 12:08 ` Oliver Scholz
2003-05-07 1:22 ` Kenichi Handa
2003-05-03 0:33 ` Oliver Scholz
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=iluadee4kv2.fsf@latte.josefsson.org \
--to=jas@extundo.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
Code repositories for project(s) associated with this external index
https://git.savannah.gnu.org/cgit/emacs.git
https://git.savannah.gnu.org/cgit/emacs/org-mode.git
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.