emacs thinks UTF-8 can't encode Japanese text?

unofficial mirror of help-gnu-emacs@gnu.org

From: James Ralston <ralston@pobox.com>
Subject: emacs thinks UTF-8 can't encode Japanese text?
Date: Wed, 12 Jan 2005 01:32:10 -0500
Message-ID: <200501120632.j0C6WAcl002699@shieldbreaker.l33tskillz.org>

I'm trying to use Emacs 21.3 on Fedora Core 3 to edit files containing
Japanese text encoded with UTF-8.

I've used the same version of Emacs on Fedora Core 2 with no problems.
Everything just works.  My locale is the same on both systems:
en_US.UTF-8.

But on my FC3 system, if I visit a UTF-8 encoded file, the Japanese
characters display as empty boxes.  Also, if I paste Japanese text
into an Emacs window, and try to save the buffer, I receive this
message:

> These default coding systems were tried:
>   utf-8-unix
> However, none of them safely encodes the target text.

This message makes no sense to me, because UTF-8 can encode every
Unicode character, Japanese text included.

On my FC2 system, here's what "C-u C-x =" says:

>   character: い (0151044, 53796, 0xd224)
>     charset: japanese-jisx0208 (JISX0208.1983/1990 Japanese Kanji: ISO-IR-87)
>  code point: 36 36
>      syntax: word
>    category: H:Japanese Hiragana characters of 2-byte character sets  
>              j:Japanese  
>              |:While filling, we can break a line at this character.  
> buffer code: 0x92 0xA4 0xA4
>   file code: 0xE3 0x81 0x84 (encoded by coding system utf-8-unix)
>        font: -mplus-gothic-medium-R-normal--12-120-75-75-C-120-jisx0208.1990-0
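
The fields in this output can be cross-checked outside Emacs.  The
sketch below is in Python and is not part of the original report.  It
assumes that the two trailing bytes of the "buffer code" are the
JIS X 0208 code point bytes with the high bit set, which is the same
form EUC-JP uses; the leading 0x92 is taken to be an Emacs-internal
tag for the charset and is not examined.  The helper name
`jisx0208_to_euc` is made up for illustration.

```python
# Values copied from the "C-u C-x =" output above.
CODE_POINT = (36, 36)                    # "code point: 36 36" (decimal)
BUFFER_CODE = bytes([0x92, 0xA4, 0xA4])  # "buffer code: 0x92 0xA4 0xA4"


def jisx0208_to_euc(first: int, second: int) -> bytes:
    """Set the high bit on each 7-bit JIS X 0208 byte (the EUC-JP form)."""
    return bytes([first | 0x80, second | 0x80])


euc = jisx0208_to_euc(*CODE_POINT)

# Assumption: bytes 2 and 3 of the buffer code are the EUC-style bytes.
matches_buffer_code = euc == BUFFER_CODE[1:]

# Python's standard euc_jp codec maps the pair to a Unicode character.
char = euc.decode("euc_jp")

print(euc.hex(), matches_buffer_code, char, f"U+{ord(char):04X}")
```

If the assumption holds, the character Emacs reports as
japanese-jisx0208 36 36 is the same character that Unicode calls
U+3044, so a UTF-8 encoding for it does exist.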

On my FC3 system, here's what "C-u C-x =" on the same character says:

>   character: い (0151044, 53796, 0xd224)
>     charset: japanese-jisx0208 (JISX0208.1983/1990 Japanese Kanji: ISO-IR-87)
>  code point: 36 36
>      syntax: word
>    category: H:Japanese Hiragana characters of 2-byte character sets  
>              j:Japanese  
>              |:While filling, we can break a line at this character.  
> buffer code: 0x92 0xA4 0xA4
>   file code: not encodable by coding system utf-8-unix
>        font: -mplus-gothic-medium-R-normal--12-120-75-75-C-120-jisx0208.1990-0

The only difference is the "file code:" line.  But I don't understand
why Emacs 21.3 on FC3 doesn't think that UTF-8 encodes that character,
because it absolutely does.
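
For reference, the expected bytes can be derived by hand.  い is
U+3044, which falls in the range U+0800 to U+FFFF that UTF-8 encodes
as three bytes.  The following Python sketch is an illustration only
(it is not what Emacs runs, and `utf8_three_byte` is a hypothetical
helper): it packs the 16 bits into the 1110xxxx 10xxxxxx 10xxxxxx
layout and compares the result with the "file code" reported on FC2.

```python
def utf8_three_byte(code_point: int) -> bytes:
    """Encode a code point in U+0800..U+FFFF as three UTF-8 bytes."""
    if not 0x0800 <= code_point <= 0xFFFF:
        raise ValueError("not in the three-byte UTF-8 range")
    if 0xD800 <= code_point <= 0xDFFF:
        raise ValueError("surrogates are not encodable in UTF-8")
    return bytes([
        0xE0 | (code_point >> 12),          # 1110xxxx: top 4 bits
        0x80 | ((code_point >> 6) & 0x3F),  # 10xxxxxx: middle 6 bits
        0x80 | (code_point & 0x3F),         # 10xxxxxx: low 6 bits
    ])


FC2_FILE_CODE = bytes([0xE3, 0x81, 0x84])  # from the FC2 output

encoded = utf8_three_byte(0x3044)
print(encoded.hex(" "))
print(encoded == FC2_FILE_CODE)
print(encoded == "\u3044".encode("utf-8"))  # agrees with Python's codec
```

This only confirms that the encoding itself is well defined; it says
nothing about why the FC3 build declines to produce it.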

The FC3 packager claims that he has no problems:

https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=144707

Does anyone have any ideas?

