all messages for Emacs-related lists mirrored at yhetil.org
 help / color / mirror / code / Atom feed
* Problem with non-bmp unicode
@ 2006-11-08  8:26 Jérôme Marant
  2006-11-10 16:46 ` Chong Yidong
  0 siblings, 1 reply; 2+ messages in thread
From: Jérôme Marant @ 2006-11-08  8:26 UTC (permalink / raw)


[-- Attachment #1: Type: text/plain, Size: 842 bytes --]

Hi,

We received a bug report from Taneli Vahakangas <taneli.vahakangas@welho.com>
against emacs21 related to unicode and non-bmp characters.
(http://bugs.debian.org/397159)

--------------

An UTF-8 file (attached) with these three characters:
U+0022 U+00010380 U+0022
shows with "emacs -nw":
"\360\220\216\200"
which is not usable at all. The file displays correctly if I cat it.

I tried a bunch of other characters outside the BMP, all of which
fail in the same way. Characters in the BMP work nicely.

--------------

Apparently, emacs 22 shows a question mark instead of "\360\220\216\200"
but trying to delete the question mark character with backspace turn it into
"\360\220\216".

Since I'm no unicode expert, I don't know what is supposed to happen in that
case.

Thanks in advance.

-- 
Jérôme Marant

[-- Attachment #2: ftest --]
[-- Type: text/plain, Size: 6 bytes --]

"𐎀"

[-- Attachment #3: Type: text/plain, Size: 142 bytes --]

_______________________________________________
Emacs-devel mailing list
Emacs-devel@gnu.org
http://lists.gnu.org/mailman/listinfo/emacs-devel

^ permalink raw reply	[flat|nested] 2+ messages in thread

* Re: Problem with non-bmp unicode
  2006-11-08  8:26 Problem with non-bmp unicode Jérôme Marant
@ 2006-11-10 16:46 ` Chong Yidong
  0 siblings, 0 replies; 2+ messages in thread
From: Chong Yidong @ 2006-11-10 16:46 UTC (permalink / raw)


> An UTF-8 file (attached) with these three characters:
> U+0022 U+00010380 U+0022
> shows with "emacs -nw":
> "\360\220\216\200"
> which is not usable at all. The file displays correctly if I cat it.
>
> I tried a bunch of other characters outside the BMP, all of which
> fail in the same way. Characters in the BMP work nicely.
>
> Apparently, emacs 22 shows a question mark instead of "\360\220\216\200"
> but trying to delete the question mark character with backspace turn it into
> "\360\220\216".

Do we support display/editing of characters outside the BMP?

^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2006-11-10 16:46 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2006-11-08  8:26 Problem with non-bmp unicode Jérôme Marant
2006-11-10 16:46 ` Chong Yidong

Code repositories for project(s) associated with this external index

	https://git.savannah.gnu.org/cgit/emacs.git
	https://git.savannah.gnu.org/cgit/emacs/org-mode.git

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.