unofficial mirror of bug-gnu-emacs@gnu.org 
 help / color / mirror / code / Atom feed
* Problems with Cyrillic encoding
@ 2003-02-17 12:59 Claus Tondering
  2003-02-18 12:27 ` Kenichi Handa
  0 siblings, 1 reply; 2+ messages in thread
From: Claus Tondering @ 2003-02-17 12:59 UTC (permalink / raw)


This bug report will be sent to the Free Software Foundation,
not to your local site managers!
Please write in English, because the Emacs maintainers do not have
translators to read other languages for them.

Your bug report will be posted to the bug-gnu-emacs@gnu.org mailing list,
and to the gnu.emacs.bug news group.

In GNU Emacs 21.2.1 (i686-pc-linux-gnu, X toolkit, Xaw3d scroll bars)
 of 2002-08-29 on astest
configured using `configure  --host=i686-pc-linux-gnu --build=i686-pc-linux-gnu --target=i386-redhat-linux-gnu --program-prefix= --prefix=/usr --exec-prefix=/usr --bindir=/usr/bin --sbindir=/usr/sbin --sysconfdir=/etc --datadir=/usr/share --includedir=/usr/include --libdir=/usr/lib --libexecdir=/usr/libexec --localstatedir=/var --sharedstatedir=/usr/com --mandir=/usr/share/man --infodir=/usr/share/info --with-gcc --with-pop --with-sound'
Important settings:
  value of $LC_ALL: nil
  value of $LC_COLLATE: nil
  value of $LC_CTYPE: nil
  value of $LC_MESSAGES: nil
  value of $LC_MONETARY: nil
  value of $LC_NUMERIC: nil
  value of $LC_TIME: nil
  value of $LANG: en_US.UTF-8
  locale-coding-system: utf-8
  default-enable-multibyte-characters: t

Please describe exactly what actions triggered the bug
and the precise symptoms of the bug:

I can type Russian (Cyrillic) letters in two ways:

* If I select a Russian keyboard and type Russian letters, they get
  into the Emacs buffer with character values in the range
  0x51450-0x5146f (for the lower case letters).

  Calling the the built-in Lisp function
  find-coding-systems-region-internal on these characters returns
  mule-utf-8 as a possible encoding.

* If I use a Latin keyboard and set the input method to
  cyrillic-jcuken, the Russian letters are inserted into the Emacs
  buffer as characts with values in the range 0xe50-0xe6f (for the
  lower case letters).

  Calling find-coding-systems-region-internal on these characters does
  *not* return mule-utf-8 as a possible encoding.

This has bad consequences when I try to use Russian characters in
email and Gnus, because UTF-8 is not chosen as a valid encoding in the
last case.

Best wishes,
---------------------------------------------------------------
Claus Tondering               E-mail: ct@filanet.dk
Senior Software Engineer
Filanet Europe ApS            Web: http://www.filanet.com

Address: Herlev Hovedgade 82c 1, DK-2730 Herlev, Denmark

Phone (direct): +45 44 50 37 81      Fax: +45 44 50 37 99
---------------------------------------------------------------



Recent input:
C-s C-s C-s C-s C-s C-s C-s C-s C-s C-s C-s C-s C-s 
C-s C-s C-s C-s M-x a p r o s p o s <return> <backspace> 
<backspace> <backspace> <backspace> <tab> <return> 
m u l e <return> <f6> <next> <next> <next> <down> <down> 
<down> <down> <down> <down> <next> <f6> <prior> <f10> 
<f6> <switch-frame> C-h b <f6> C-s g n u <C-home> C-s 
e m a c s - b u f <up> M-x a p r o <tab> <return> e 
m a c s . * b u g <return> M-x r e p o r t <tab> <
return>

Recent messages:
eval: 
Entering debugger...
 [2 times]
Back to top level.
Mark saved where search started
Mark set
Mark saved where search started [2 times]
Mark set
Mark saved where search started
Loading emacsbug...done

^ permalink raw reply	[flat|nested] 2+ messages in thread

* Re: Problems with Cyrillic encoding
  2003-02-17 12:59 Problems with Cyrillic encoding Claus Tondering
@ 2003-02-18 12:27 ` Kenichi Handa
  0 siblings, 0 replies; 2+ messages in thread
From: Kenichi Handa @ 2003-02-18 12:27 UTC (permalink / raw)
  Cc: handa

In article <20030217125917.1F5337C018@mail.filanet.dk>, Claus Tondering <ct@filanet.dk> writes:
> In GNU Emacs 21.2.1 (i686-pc-linux-gnu, X toolkit, Xaw3d scroll bars)
[...]
> I can type Russian (Cyrillic) letters in two ways:

> * If I select a Russian keyboard and type Russian letters, they get
>   into the Emacs buffer with character values in the range
>   0x51450-0x5146f (for the lower case letters).

>   Calling the the built-in Lisp function
>   find-coding-systems-region-internal on these characters returns
>   mule-utf-8 as a possible encoding.

> * If I use a Latin keyboard and set the input method to
>   cyrillic-jcuken, the Russian letters are inserted into the Emacs
>   buffer as characts with values in the range 0xe50-0xe6f (for the
>   lower case letters).

>   Calling find-coding-systems-region-internal on these characters does
>   *not* return mule-utf-8 as a possible encoding.

> This has bad consequences when I try to use Russian characters in
> email and Gnus, because UTF-8 is not chosen as a valid encoding in the
> last case.

Please try the latest pretest version 21.2.95 available at
<ftp://alpha.gnu.org/gnu/emacs/pretest/>.

With this version, both kind of Russian characters
(0x51450-0x5146f and 0xe50-0xe6f) can be encoded by both
utf-8 and iso-8859-5.

But, I don't know why characters of the range
0x51450-0x5146f is input by a Russian keyboard.

As something about X's keysym handling is changed, the
behaviour of the pretest version may be different from 21.2.

---
Ken'ichi HANDA
handa@m17n.org

^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2003-02-18 12:27 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2003-02-17 12:59 Problems with Cyrillic encoding Claus Tondering
2003-02-18 12:27 ` Kenichi Handa

Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).