unofficial mirror of help-gnu-emacs@gnu.org
 help / color / mirror / Atom feed
* Emacs in xterm and Cyrillic?
@ 2005-04-03 23:34 vedm
  2005-04-04 17:43 ` Peter Dyballa
                   ` (2 more replies)
  0 siblings, 3 replies; 15+ messages in thread
From: vedm @ 2005-04-03 23:34 UTC (permalink / raw)




When I start emacs in xterm (emacs -nw) I can not read Cyrillic text: I
see only question marks.

I can see the same text without a problem in VIM and even with the
"more" command. Any idea? (My version of Emacs is 21.3.1)


-- 
vedm

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: Emacs in xterm and Cyrillic?
  2005-04-03 23:34 Emacs in xterm and Cyrillic? vedm
@ 2005-04-04 17:43 ` Peter Dyballa
  2005-04-05 20:49 ` Stefan Monnier
       [not found] ` <mailman.78.1112634873.2895.help-gnu-emacs@gnu.org>
  2 siblings, 0 replies; 15+ messages in thread
From: Peter Dyballa @ 2005-04-04 17:43 UTC (permalink / raw)
  Cc: help-gnu-emacs


Am 04.04.2005 um 01:34 schrieb vedm:

> When I start emacs in xterm (emacs -nw) I can not read Cyrillic text: I
> see only question marks.

First you should set in GNU Emacs a language environment: Cyrillic-ALT, 
Cyrillic-ISO, Cyrillic-KOI8, Ukrainian ... This should already prepare 
most things.

If this doesn't help yet, then there are environment variables in 
xterm's shell, LANG and LC_CTYPE. GNU Emacs 22 will reverse this 
sequence.

If this still is not sufficient you have in GNU Emacs a few variables 
and functions with 'coding-system' you can set, for example:

	(set-terminal-coding-system		'XXX)
	(prefer-coding-system			'XXX)
	(setq default-buffer-file-coding-system 'XXX)

--
Greetings

   Pete

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: Emacs in xterm and Cyrillic?
  2005-04-03 23:34 Emacs in xterm and Cyrillic? vedm
  2005-04-04 17:43 ` Peter Dyballa
@ 2005-04-05 20:49 ` Stefan Monnier
       [not found] ` <mailman.78.1112634873.2895.help-gnu-emacs@gnu.org>
  2 siblings, 0 replies; 15+ messages in thread
From: Stefan Monnier @ 2005-04-05 20:49 UTC (permalink / raw)


> When I start emacs in xterm (emacs -nw) I can not read Cyrillic text: I
> see only question marks.

> I can see the same text without a problem in VIM and even with the
> "more" command. Any idea? (My version of Emacs is 21.3.1)

What's your locale setting?


        Stefan

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: Emacs in xterm and Cyrillic?
       [not found] ` <mailman.78.1112634873.2895.help-gnu-emacs@gnu.org>
@ 2005-04-06  1:29   ` vedm
  2005-04-06  8:18     ` Peter Dyballa
                       ` (2 more replies)
  0 siblings, 3 replies; 15+ messages in thread
From: vedm @ 2005-04-06  1:29 UTC (permalink / raw)


Peter Dyballa <Peter_Dyballa@Web.DE> writes:

> Am 04.04.2005 um 01:34 schrieb vedm:
> 
> > When I start emacs in xterm (emacs -nw) I can not read Cyrillic text: I
> > see only question marks.
> 
> First you should set in GNU Emacs a language environment: Cyrillic-ALT,
> Cyrillic-ISO, Cyrillic-KOI8, Ukrainian ... This should already prepare
> most things.
> 
> If this doesn't help yet, then there are environment variables in
> xterm's shell, LANG and LC_CTYPE. GNU Emacs 22 will reverse this
> sequence.
> 
> If this still is not sufficient you have in GNU Emacs a few variables
> and functions with 'coding-system' you can set, for example:
> 
> 	(set-terminal-coding-system		'XXX)
> 	(prefer-coding-system			'XXX)
> 	(setq default-buffer-file-coding-system 'XXX)
> 

Thanks for the help, it turned out that the following solves the
problem:

(set-terminal-coding-system 'iso-8859-5)

That's because in my .Xdefaults I have this:

==
Rxvt*font: -misc-fixed-medium-r-normal-*-20-200-*-75-c-100-iso8859-5
xterm*font: -misc-fixed-medium-r-normal-*-18-*-*-*-*-*-iso8859-5
==

(iso-8859-5 is the standard cyrillic encoding, although it is much less
used than the Microsoft non-standard cp1251 aka windows-1251)

-- 
vedm

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: Emacs in xterm and Cyrillic?
  2005-04-06  1:29   ` vedm
@ 2005-04-06  8:18     ` Peter Dyballa
  2005-04-06 15:40     ` Stefan Monnier
       [not found]     ` <mailman.354.1112774023.2895.help-gnu-emacs@gnu.org>
  2 siblings, 0 replies; 15+ messages in thread
From: Peter Dyballa @ 2005-04-06  8:18 UTC (permalink / raw)
  Cc: help-gnu-emacs


Am 06.04.2005 um 03:29 schrieb vedm:

> That's because in my .Xdefaults I have this:
>
> ==
> Rxvt*font: -misc-fixed-medium-r-normal-*-20-200-*-75-c-100-iso8859-5
> xterm*font: -misc-fixed-medium-r-normal-*-18-*-*-*-*-*-iso8859-5

How was then vim able to display Cyrillic in that same xterm?

(Unicode encoded fonts, *-iso10646-1, could do a better job since they 
would allow Latin diacritics too. Here are some: 
http://openlab.jp/efont, also, once for GNU Emacs 20 developed, 
http://www.gnu.org/directory/intlfonts.html)

--
Greetings
                                  <]
    Pete      o        __o         |__    o           recumbo
     ___o    /I       -\<,         |o \  -\),-%       ergo sum!
___/\ /\___./ \___...O/ O____.....`-O-'-()--o_________________

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: Emacs in xterm and Cyrillic?
  2005-04-06  1:29   ` vedm
  2005-04-06  8:18     ` Peter Dyballa
@ 2005-04-06 15:40     ` Stefan Monnier
  2005-04-10  0:26       ` vedm
       [not found]     ` <mailman.354.1112774023.2895.help-gnu-emacs@gnu.org>
  2 siblings, 1 reply; 15+ messages in thread
From: Stefan Monnier @ 2005-04-06 15:40 UTC (permalink / raw)


> (set-terminal-coding-system 'iso-8859-5)

If you properly set your locale this won't be necessary.


        Stefan

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: Emacs in xterm and Cyrillic?
  2005-04-06 15:40     ` Stefan Monnier
@ 2005-04-10  0:26       ` vedm
  0 siblings, 0 replies; 15+ messages in thread
From: vedm @ 2005-04-10  0:26 UTC (permalink / raw)


Stefan Monnier <monnier@iro.umontreal.ca> writes:

> > (set-terminal-coding-system 'iso-8859-5)
> 
> If you properly set your locale this won't be necessary.
> 
> 
>         Stefan

You are right. 

Running the command "locale -a" I see that the locale for Bulgarian is
bg_BG. I set the following variables:

LC_ALL=bg_BG
LANG=bg_BG
LANGUAGE=bg_BG


This didn't help. The other locale for Bulgarian is bg_BG.cp1251. However
my files are encoded in iso8859-5. I noticed that there is a Russian
locale ru_RU.iso88595: I tried it and it worked: I didn't need the
"set-terminal-coding-system" call with this locale.

So in the end, the problem can be fixed either with
set-terminal-coding-system or with setting the appropriate locale.

But I wonder why there is no Bulgarian  locale with iso-8859-5
encoding? After all, that is supposed to be the standard.


-- 
vedm

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: Emacs in xterm and Cyrillic?
       [not found]     ` <mailman.354.1112774023.2895.help-gnu-emacs@gnu.org>
@ 2005-04-10  0:58       ` vedm
  2005-04-10 12:54         ` Recommendations: emacs files containing multiple languages ken
                           ` (3 more replies)
  0 siblings, 4 replies; 15+ messages in thread
From: vedm @ 2005-04-10  0:58 UTC (permalink / raw)


Peter Dyballa <Peter_Dyballa@Web.DE> writes:

> Am 06.04.2005 um 03:29 schrieb vedm:
> 
> > That's because in my .Xdefaults I have this:
> >
> > ==
> > Rxvt*font: -misc-fixed-medium-r-normal-*-20-200-*-75-c-100-iso8859-5
> > xterm*font: -misc-fixed-medium-r-normal-*-18-*-*-*-*-*-iso8859-5
> 
> How was then vim able to display Cyrillic in that same xterm?
> 

Probably VIM just lets the terminal display the font. And once you set
xterm/rxvt (I use rxvt) with a cyrillic font like iso8859-5 it can
display it: I can see the files with the "more" command.

> (Unicode encoded fonts, *-iso10646-1, could do a better job since they
> would allow Latin diacritics too. Here are some:
> http://openlab.jp/efont, also, once for GNU Emacs 20 developed,
> http://www.gnu.org/directory/intlfonts.html)
>  

My cyrillic files are encoded in iso8859-5, just because that encoding
is within the ASCII set and is enough for the cyrilic script. Yes, I
agree that UTF is better for handling all sorts of languages, but I
still haven't tried to use it in emacs and Xterm. (One disadvantage of
UTF is that the UTF files (at least cyrillic files) are almost two times
bigger compared to ASCII encoded files).


-- 
vedm

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Recommendations: emacs files containing multiple languages
  2005-04-10  0:58       ` vedm
@ 2005-04-10 12:54         ` ken
  2005-04-10 13:33           ` Peter Dyballa
  2005-04-10 19:06           ` Eli Zaretskii
  2005-04-10 13:20         ` Emacs in xterm and Cyrillic? Peter Dyballa
                           ` (2 subsequent siblings)
  3 siblings, 2 replies; 15+ messages in thread
From: ken @ 2005-04-10 12:54 UTC (permalink / raw)


vedm wrote:

> [....]
>
>
>My cyrillic files are encoded in iso8859-5, just because that encoding
>is within the ASCII set and is enough for the cyrilic script. Yes, I
>agree that UTF is better for handling all sorts of languages, but I
>still haven't tried to use it in emacs and Xterm. (One disadvantage of
>UTF is that the UTF files (at least cyrillic files) are almost two times
>bigger compared to ASCII encoded files).
>  
>

In these days of extreme code bloat, large document files don't bother
me too much.  Getting the functionality I need, though, is.

This week I've been wondering how to compose and edit text files
containing multiple languages... like English, German, Russian, French,
and ancient Greek... all in the same file.  Is this even possible in
emacs?  If so, any tips?

I've heard of an editor called "sam", developed by Rob Pike, that can do
this.  But in my many years of using emacs and reading lists like this,
I've never heard of emacs being capable of such a wild thing.


tia,
ken

-- 
A lot of us are working harder than we want, at things we don't like to
do.  Why? ...In order to afford the sort of existence we don't care to live.
	-- Bradford Angier

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: Emacs in xterm and Cyrillic?
  2005-04-10  0:58       ` vedm
  2005-04-10 12:54         ` Recommendations: emacs files containing multiple languages ken
@ 2005-04-10 13:20         ` Peter Dyballa
  2005-04-11 17:48         ` Kevin Rodgers
       [not found]         ` <mailman.1082.1113240243.2895.help-gnu-emacs@gnu.org>
  3 siblings, 0 replies; 15+ messages in thread
From: Peter Dyballa @ 2005-04-10 13:20 UTC (permalink / raw)
  Cc: help-gnu-emacs


Am 10.04.2005 um 02:58 schrieb vedm:

> (One disadvantage of UTF is that the UTF files (at least cyrillic 
> files) are almost two times bigger compared to ASCII encoded files).
>

That's true -- I actually tried to recommend using ISO 10646-1 fonts 
for general display use. Using them or having them installed you could 
easily switch to another locale ... which are an easy start into 
fontsets too with GNU Emacs running freely in X11.

--
Greetings

   Pete

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: Recommendations: emacs files containing multiple languages
  2005-04-10 12:54         ` Recommendations: emacs files containing multiple languages ken
@ 2005-04-10 13:33           ` Peter Dyballa
  2005-04-10 19:06           ` Eli Zaretskii
  1 sibling, 0 replies; 15+ messages in thread
From: Peter Dyballa @ 2005-04-10 13:33 UTC (permalink / raw)
  Cc: GNU Emacs List


Am 10.04.2005 um 14:54 schrieb ken:

> This week I've been wondering how to compose and edit text files
> containing multiple languages... like English, German, Russian, French,
> and ancient Greek... all in the same file.  Is this even possible in
> emacs?  If so, any tips?

Start the file with this header:

	;;; -*- mode: Text; coding: utf-8; -*-

and create fontsets like these:

(message "Neue fontsets für X11")
(if (fboundp 'new-fontset)
   (progn
;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
;;;;;;;;; Monotype Courier New - Unicode encoded TrueType font, version  
2.82, 1318 glyphs
;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
     (create-fontset-from-fontset-spec "-monotype-courier  
new-medium-r-*-*-9-*-*-*-*-*-fontset-09pt_monotype_courier" t 'noerror)
	(set-fontset-font "fontset-09pt_monotype_courier"	'latin-iso8859-1   
'("courier new" . "iso8859-1"))
	(set-fontset-font "fontset-09pt_monotype_courier"	'latin-iso8859-2   
'("courier new" . "iso8859-2"))
	(set-fontset-font "fontset-09pt_monotype_courier"	'latin-iso8859-3   
'("courier new" . "iso8859-3"))
	(set-fontset-font "fontset-09pt_monotype_courier"	'latin-iso8859-4   
'("courier new" . "iso8859-4"))
	(set-fontset-font "fontset-09pt_monotype_courier" 'cyrillic-iso8859-5   
'("courier new" . "iso8859-5"))
	(set-fontset-font "fontset-09pt_monotype_courier" 'arabic-iso8859-6   
'("courier new" . "iso8859-6"))
	(set-fontset-font "fontset-09pt_monotype_courier"  'greek-iso8859-7   
'("courier new" . "iso8859-7"))
	(set-fontset-font "fontset-09pt_monotype_courier" 'hebrew-iso8859-8   
'("courier new" . "iso8859-8"))
	(set-fontset-font "fontset-09pt_monotype_courier"	'latin-iso8859-9   
'("courier new" . "iso8859-9"))
	(set-fontset-font "fontset-09pt_monotype_courier"  'latin-iso8859-15  
'("courier new" . "iso8859-15"))
	(set-fontset-font "fontset-09pt_monotype_courier"  
'mule-unicode-0100-24ff '("courier new" . "iso10646-1"))
	(set-fontset-font "fontset-09pt_monotype_courier"  
'mule-unicode-2500-33ff '("courier new" . "iso10646-1"))
	(set-fontset-font "fontset-09pt_monotype_courier"  
'mule-unicode-e000-ffff '("courier new" . "iso10646-1"))
;
     (create-fontset-from-fontset-spec "-monotype-courier  
new-medium-r-*-*-10-*-*-*-*-*-fontset-10pt_monotype_courier" t  
'noerror)
	(set-fontset-font "fontset-10pt_monotype_courier"	'latin-iso8859-1   
'("courier new" . "iso8859-1"))
	(set-fontset-font "fontset-10pt_monotype_courier"	'latin-iso8859-2   
'("courier new" . "iso8859-2"))
	(set-fontset-font "fontset-10pt_monotype_courier"	'latin-iso8859-3   
'("courier new" . "iso8859-3"))
	(set-fontset-font "fontset-10pt_monotype_courier"	'latin-iso8859-4   
'("courier new" . "iso8859-4"))
	(set-fontset-font "fontset-10pt_monotype_courier" 'cyrillic-iso8859-5   
'("courier new" . "iso8859-5"))
	(set-fontset-font "fontset-10pt_monotype_courier" 'arabic-iso8859-6   
'("courier new" . "iso8859-6"))
	(set-fontset-font "fontset-10pt_monotype_courier"	'greek-iso8859-7   
'("courier new" . "iso8859-7"))
	(set-fontset-font "fontset-10pt_monotype_courier" 'hebrew-iso8859-8   
'("courier new" . "iso8859-8"))
	(set-fontset-font "fontset-10pt_monotype_courier"	'latin-iso8859-9   
'("courier new" . "iso8859-9"))
	(set-fontset-font "fontset-10pt_monotype_courier"	'latin-iso8859-15  
'("courier new" . "iso8859-15"))
	(set-fontset-font "fontset-10pt_monotype_courier"  
'mule-unicode-0100-24ff '("courier new" . "iso10646-1"))
	(set-fontset-font "fontset-10pt_monotype_courier"  
'mule-unicode-2500-33ff '("courier new" . "iso10646-1"))
	(set-fontset-font "fontset-10pt_monotype_courier"  
'mule-unicode-e000-ffff '("courier new" . "iso10646-1"))
;
     (create-fontset-from-fontset-spec "-monotype-courier  
new-medium-r-*-*-11-*-*-*-*-*-fontset-11pt_monotype_courier" t  
'noerror)
	(set-fontset-font "fontset-11pt_monotype_courier"	'latin-iso8859-1   
'("courier new" . "iso8859-1"))
	(set-fontset-font "fontset-11pt_monotype_courier"	'latin-iso8859-2   
'("courier new" . "iso8859-2"))
	(set-fontset-font "fontset-11pt_monotype_courier"	'latin-iso8859-3   
'("courier new" . "iso8859-3"))
	(set-fontset-font "fontset-11pt_monotype_courier"	'latin-iso8859-4   
'("courier new" . "iso8859-4"))
	(set-fontset-font "fontset-11pt_monotype_courier" 'cyrillic-iso8859-5   
'("courier new" . "iso8859-5"))
	(set-fontset-font "fontset-11pt_monotype_courier" 'arabic-iso8859-6   
'("courier new" . "iso8859-6"))
	(set-fontset-font "fontset-11pt_monotype_courier"	'greek-iso8859-7   
'("courier new" . "iso8859-7"))
	(set-fontset-font "fontset-11pt_monotype_courier"  'hebrew-iso8859-8   
'("courier new" . "iso8859-8"))
	(set-fontset-font "fontset-11pt_monotype_courier"	'latin-iso8859-9   
'("courier new" . "iso8859-9"))
	(set-fontset-font "fontset-11pt_monotype_courier"	'latin-iso8859-15  
'("courier new" . "iso8859-15"))
	(set-fontset-font "fontset-11pt_monotype_courier"  
'mule-unicode-0100-24ff '("courier new" . "iso10646-1"))
	(set-fontset-font "fontset-11pt_monotype_courier"  
'mule-unicode-2500-33ff '("courier new" . "iso10646-1"))
	(set-fontset-font "fontset-11pt_monotype_courier"  
'mule-unicode-e000-ffff '("courier new" . "iso10646-1"))
; and more sizes of the same font
;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
;;;;;;;;; Adobe Courier - Unicode encoded TrueType font
;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
     (create-fontset-from-fontset-spec  
"-adobe-courier-medium-r-*-*-9-*-*-*-*-*-fontset-09pt_adobe_courier" t  
'noerror)
	(set-fontset-font "fontset-09pt_adobe_courier"       'latin-iso8859-1   
'("adobe-courier" . "iso8859-1"))
	(set-fontset-font "fontset-09pt_adobe_courier"       'latin-iso8859-2   
'("adobe-courier" . "iso8859-2"))
	(set-fontset-font "fontset-09pt_adobe_courier"       'latin-iso8859-3   
'("adobe-courier" . "iso8859-3"))
	(set-fontset-font "fontset-09pt_adobe_courier"       'latin-iso8859-4   
'("adobe-courier" . "iso8859-4"))
	(set-fontset-font "fontset-09pt_adobe_courier"       'latin-iso8859-9   
'("adobe-courier" . "iso8859-9"))
	(set-fontset-font "fontset-09pt_adobe_courier"       'latin-iso8859-14  
'("adobe-courier" . "iso8859-14"))
	(set-fontset-font "fontset-09pt_adobe_courier"       'latin-iso8859-15  
'("adobe-courier" . "iso8859-15"))
	(set-fontset-font "fontset-09pt_adobe_courier" 'mule-unicode-0100-24ff  
'("adobe-courier" . "iso10646-1"))
	(set-fontset-font "fontset-09pt_adobe_courier" 'mule-unicode-2500-33ff  
'("adobe-courier" . "iso10646-1"))
	(set-fontset-font "fontset-09pt_adobe_courier" 'mule-unicode-e000-ffff  
'("adobe-courier" . "iso10646-1"))
;and more sizes of the same font
;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
;;;;;;;;; B&H Lucida Sans Typewriter - Unicode encoded TrueType font,  
version 18 July, 1999; 1.10 (JAVA), 1376 glyphs
;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
     (create-fontset-from-fontset-spec "-b&h-lucida sans  
typewriter-medium-r-*-*-9-*-*-*-*-*-fontset 
-09pt_lucida_sans_typewriter" t 'noerror)
	(set-fontset-font "fontset-09pt_lucida_sans_typewriter"        
'latin-iso8859-1  '("lucida sans typewriter" . "iso8859-1"))
	(set-fontset-font "fontset-09pt_lucida_sans_typewriter"        
'latin-iso8859-2  '("lucida sans typewriter" . "iso8859-2"))
	(set-fontset-font "fontset-09pt_lucida_sans_typewriter"        
'latin-iso8859-3  '("lucida sans typewriter" . "iso8859-3"))
	(set-fontset-font "fontset-09pt_lucida_sans_typewriter"        
'latin-iso8859-4  '("lucida sans typewriter" . "iso8859-4"))
	(set-fontset-font "fontset-09pt_lucida_sans_typewriter"     
'cyrillic-iso8859-5  '("lucida sans typewriter" . "iso8859-5"))
	(set-fontset-font "fontset-09pt_lucida_sans_typewriter"       
'hebrew-iso8859-8  '("lucida sans typewriter" . "iso8859-8"))
	(set-fontset-font "fontset-09pt_lucida_sans_typewriter"        
'latin-iso8859-9  '("lucida sans typewriter" . "iso8859-9"))
	(set-fontset-font "fontset-09pt_lucida_sans_typewriter"        
'latin-iso8859-15 '("lucida sans typewriter" . "iso8859-15"))
	(set-fontset-font "fontset-09pt_lucida_sans_typewriter              
'thai-tis620 '("lucida sans typewriter" . "iso10646-1"))
	(set-fontset-font "fontset-09pt_lucida_sans_typewriter"  
'mule-unicode-0100-24ff '("lucida sans typewriter" . "iso10646-1"))
	(set-fontset-font "fontset-09pt_lucida_sans_typewriter"  
'mule-unicode-2500-33ff '("lucida sans typewriter" . "iso10646-1"))
	(set-fontset-font "fontset-09pt_lucida_sans_typewriter"  
'mule-unicode-e000-ffff '("lucida sans typewriter" . "iso10646-1"))
;
;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
     (create-fontset-from-fontset-spec  
"-b&h-lucidatypewriter-medium-r-*-*-8-*-*-*-*-*-fontset 
-08pt_lucidatypewriter" t 'noerror)
       (set-fontset-font "fontset-08pt_lucidatypewriter"        
'latin-iso8859-1  '("lucidatypewriter" . "iso8859-1"))
       (set-fontset-font "fontset-08pt_lucidatypewriter"        
'latin-iso8859-2  '("lucidatypewriter" . "iso8859-2"))
       (set-fontset-font "fontset-08pt_lucidatypewriter"        
'latin-iso8859-3  '("lucidatypewriter" . "iso8859-3"))
       (set-fontset-font "fontset-08pt_lucidatypewriter"        
'latin-iso8859-4  '("lucidatypewriter" . "iso8859-4"))
       (set-fontset-font "fontset-08pt_lucidatypewriter"        
'latin-iso8859-9  '("lucidatypewriter" . "iso8859-9"))
       (set-fontset-font "fontset-08pt_lucidatypewriter"        
'latin-iso8859-14 '("lucidatypewriter" . "iso8859-14"))
       (set-fontset-font "fontset-08pt_lucidatypewriter"        
'latin-iso8859-15 '("lucidatypewriter" . "iso8859-15"))
       (set-fontset-font "fontset-08pt_lucidatypewriter"  
'mule-unicode-0100-24ff '("lucidatypewriter" . "iso10646-1"))
       (set-fontset-font "fontset-08pt_lucidatypewriter"  
'mule-unicode-2500-33ff '("lucidatypewriter" . "iso10646-1"))
       (set-fontset-font "fontset-08pt_lucidatypewriter"  
'mule-unicode-e000-ffff '("lucidatypewriter" . "iso10646-1"))
;
;;;;;;;;; B&H Luxi Mono Regular - Unicode encoded TrueType font,  
version 1.2 : October 12, 2001,  395 glyphs
;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
     (create-fontset-from-fontset-spec "-b&h-luxi  
mono-medium-r-*-*-9-*-*-*-*-*-fontset-09pt_luxi mono" t 'noerror)
       (set-fontset-font "fontset-09pt_luxi mono"       'latin-iso8859-1  
  '("luxi mono" . "iso8859-1"))
       (set-fontset-font "fontset-09pt_luxi mono"       'latin-iso8859-2  
  '("luxi mono" . "iso8859-2"))
       (set-fontset-font "fontset-09pt_luxi mono"       'latin-iso8859-3  
  '("luxi mono" . "iso8859-3"))
       (set-fontset-font "fontset-09pt_luxi mono"       'latin-iso8859-4  
  '("luxi mono" . "iso8859-4"))
       (set-fontset-font "fontset-09pt_luxi mono"       'latin-iso8859-9  
  '("luxi mono" . "iso8859-9"))
       (set-fontset-font "fontset-09pt_luxi mono"        
'latin-iso8859-15 '("luxi mono" . "iso8859-15"))
       (set-fontset-font "fontset-09pt_luxi mono"  
'mule-unicode-0100-24ff '("luxi mono" . "iso10646-1"))
       (set-fontset-font "fontset-09pt_luxi mono"  
'mule-unicode-2500-33ff '("luxi mono" . "iso10646-1"))
       (set-fontset-font "fontset-09pt_luxi mono"  
'mule-unicode-e000-ffff '("luxi mono" . "iso10646-1"))
;
;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
;;;;;;;;; Bitstream Vera Sans Mono - Unicode encoded TrueType font,  
version Release 1.10,  269 glyphs
;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
     (create-fontset-from-fontset-spec "-bitsteam-bitstream vera sans  
mono-medium-r-*-*-9-*-*-*-*-*-fontset-09pt_vera_sans_mono" t 'noerror)
	(set-fontset-font "fontset-09pt_vera_sans_mono"       'latin-iso8859-1  
  '("bitstream vera sans mono" . "iso8859-1"))
	(set-fontset-font "fontset-09pt_vera_sans_mono"       'latin-iso8859-2  
  '("bitstream vera sans mono" . "iso10646-1"))
	(set-fontset-font "fontset-09pt_vera_sans_mono"       'latin-iso8859-3  
  '("bitstream vera sans mono" . "iso10646-1"))
	(set-fontset-font "fontset-09pt_vera_sans_mono"       'latin-iso8859-4  
  '("bitstream vera sans mono" . "iso10646-1"))
	(set-fontset-font "fontset-09pt_vera_sans_mono"       'latin-iso8859-9  
  '("bitstream vera sans mono" . "iso8859-9"))
	(set-fontset-font "fontset-09pt_vera_sans_mono"        
'latin-iso8859-15 '("bitstream vera sans mono" . "iso8859-15"))
	(set-fontset-font "fontset-09pt_vera_sans_mono"  
'mule-unicode-0100-24ff '("bitstream vera sans mono" . "iso10646-1"))
	(set-fontset-font "fontset-09pt_vera_sans_mono"  
'mule-unicode-2500-33ff '("bitstream vera sans mono" . "iso10646-1"))
	(set-fontset-font "fontset-09pt_vera_sans_mono"  
'mule-unicode-e000-ffff '("bitstream vera sans mono" . "iso10646-1"))
;
;;;;;;;;; ProFontWindows - Unicode encoded TrueType font, version  
2.3,	278 glyphs
;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
     (create-fontset-from-fontset-spec  
"-misc-profontwindows-medium-r-*-*-9-*-*-*-*-*-fontset-09pt_profont" t  
'noerror)
	(set-fontset-font "fontset-09pt_profont"       'latin-iso8859-1   
'("profontwindows" . "iso8859-1"))	(set-fontset-font  
"fontset-09pt_profont"       'latin-iso8859-2  '("profontwindows" .  
"iso10646-1"))	(set-fontset-font "fontset-09pt_profont"        
'latin-iso8859-3  '("profontwindows" . "iso10646-1"))
	(set-fontset-font "fontset-09pt_profont"       'latin-iso8859-4   
'("profontwindows" . "iso10646-1"))
	(set-fontset-font "fontset-09pt_profont"       'latin-iso8859-9   
'("profontwindows" . "iso10646-1"))
	(set-fontset-font "fontset-09pt_profont"       'latin-iso8859-15  
'("profontwindows" . "iso8859-15"))
	(set-fontset-font "fontset-09pt_profont" 'mule-unicode-0100-24ff  
'("profontwindows" . "iso10646-1"))
	(set-fontset-font "fontset-09pt_profont" 'mule-unicode-2500-33ff  
'("profontwindows" . "iso10646-1"))
	(set-fontset-font "fontset-09pt_profont" 'mule-unicode-e000-ffff  
'("profontwindows" . "iso10646-1"))
;
;;;;;;;;; FreeMono - Unicode encoded TrueType font, version Version  
1.10, 2.570 glyphs
;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
     (create-fontset-from-fontset-spec  
"-*-freemono-medium-r-*-*-9-*-*-*-*-*-fontset-09pt_free_mono" t  
'noerror)
	(set-fontset-font "fontset-09pt_free_mono"	 'latin-iso8859-1   
'("freemono" . "iso8859-1"))
	(set-fontset-font "fontset-09pt_free_mono"	 'latin-iso8859-2   
'("freemono" . "iso8859-2"))
	(set-fontset-font "fontset-09pt_free_mono"	 'latin-iso8859-3   
'("freemono" . "iso8859-3"))
	(set-fontset-font "fontset-09pt_free_mono"	 'latin-iso8859-4   
'("freemono" . "iso8859-4"))
	(set-fontset-font "fontset-09pt_free_mono"   'cyrillic-iso8859-5   
'("freemono" . "iso8859-5"))
	(set-fontset-font "fontset-09pt_free_mono"	 'latin-iso8859-9   
'("freemono" . "iso8859-9"))
	(set-fontset-font "fontset-09pt_free_mono"	 'latin-iso8859-15  
'("freemono" . "iso8859-15"))
	(set-fontset-font "fontset-09pt_free_mono" 'mule-unicode-0100-24ff  
'("freemono" . "iso10646-1"))
	(set-fontset-font "fontset-09pt_free_mono" 'mule-unicode-2500-33ff  
'("freemono" . "iso10646-1"))
	(set-fontset-font "fontset-09pt_free_mono" 'mule-unicode-e000-ffff  
'("freemono" . "iso10646-1"))
;
;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
;;;;;;;;; Everson Mono Unicode - Unicode encoded TrueType font, version  
Version 3.2b4, 4.899 glyphs -- simply the best!
;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
     (create-fontset-from-fontset-spec "-*-everson mono  
unicode-medium-r-*-*-9-*-*-*-*-*-fontset-09pt_everson_mono" t 'noerror)
	(set-fontset-font "fontset-09pt_everson_mono"	    'latin-iso8859-1   
'("everson mono unicode" . "iso8859-1"))
	(set-fontset-font "fontset-09pt_everson_mono"	    'latin-iso8859-2   
'("everson mono unicode" . "iso8859-2"))
	(set-fontset-font "fontset-09pt_everson_mono"	    'latin-iso8859-3   
'("everson mono unicode" . "iso8859-3"))
	(set-fontset-font "fontset-09pt_everson_mono"	    'latin-iso8859-4   
'("everson mono unicode" . "iso8859-4"))
	(set-fontset-font "fontset-09pt_everson_mono"	 'cyrillic-iso8859-5   
'("everson mono unicode" . "iso8859-5"))
	(set-fontset-font "fontset-09pt_everson_mono"	   'hebrew-iso8859-8   
'("everson mono unicode" . "iso8859-8"))
	(set-fontset-font "fontset-09pt_everson_mono"	    'latin-iso8859-9   
'("everson mono unicode" . "iso8859-9"))
	(set-fontset-font "fontset-09pt_everson_mono"	    'latin-iso8859-15  
'("everson mono unicode" . "iso8859-15"))
	(set-fontset-font "fontset-09pt_everson_mono" 'mule-unicode-0100-24ff  
'("everson mono unicode" . "iso10646-1"))
	(set-fontset-font "fontset-09pt_everson_mono" 'mule-unicode-2500-33ff  
'("everson mono unicode" . "iso10646-1"))
	(set-fontset-font "fontset-09pt_everson_mono" 'mule-unicode-e000-ffff  
'("everson mono unicode" . "iso10646-1"))
;
     (let ((fontset (query-fontset (frame-parameter nil 'font))))
       (when (not fontset)
	(setq fontset "fontset-default")
	(set-default-font fontset))
	(set-fontset-font fontset
		(cons (decode-char 'ucs #x0D00) (decode-char 'ucs #x0D7f)) (cons  
"misc-malayalam" "iso10646-1")); Malayam
     )
	(set-fontset-font "fontset-default" (cons (decode-char 'ucs #x05b0)  
(decode-char 'ucs #x06ff)) "-m17n-*--13-*-iso10646-1"); Hebrew
	(set-fontset-font "fontset-default" (cons (decode-char 'ucs #x10a0)  
(decode-char 'ucs #x10ff)) "-*-bpg glaho arial--13-*-iso10646-1");  
Georgian
;	(set-fontset-font "fontset-default" (cons (decode-char 'ucs #x10a0)  
(decode-char 'ucs #x10ff)) "-*-titus cyberbit basic--13-*-iso10646-1");  
Georgian
;	(set-fontset-font "fontset-default" (cons (decode-char 'ucs #x10a0)  
(decode-char 'ucs #x10ff)) "-*-code2000--13-*-iso10646-1"); Georgian
	(set-fontset-font "fontset-default" (cons (decode-char 'ucs #x200c)  
(decode-char 'ucs #x200f)) "-m17n-*--13-*-iso10646-1"); 0 width
	(set-fontset-font "fontset-default" (cons (decode-char 'ucs #xfb2a)  
(decode-char 'ucs #xfbff)) "-m17n-*--13-*-iso10646-1"); presentation  
forms
	(set-fontset-font "fontset-default" (cons (decode-char 'ucs #xfe70)  
(decode-char 'ucs #xfefc)) "-m17n-*--13-*-iso10646-1"); Arabic

(provide 'site-fontsets-x11)

--
Greetings

   Pete

"I wouldn't recommend sex, drugs or insanity for everyone, but they've
always worked for me."
		-- Hunter S. Thompson

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: Recommendations: emacs files containing multiple languages
  2005-04-10 12:54         ` Recommendations: emacs files containing multiple languages ken
  2005-04-10 13:33           ` Peter Dyballa
@ 2005-04-10 19:06           ` Eli Zaretskii
  1 sibling, 0 replies; 15+ messages in thread
From: Eli Zaretskii @ 2005-04-10 19:06 UTC (permalink / raw)


> Date: Sun, 10 Apr 2005 08:54:48 -0400
> From: ken <gebser@speakeasy.net>
> 
> This week I've been wondering how to compose and edit text files
> containing multiple languages... like English, German, Russian, French,
> and ancient Greek... all in the same file.  Is this even possible in
> emacs?

Yes, Emacs can do this since v20.5 (actually, 20.1, but there were
bugs in early 20.x versions).

> If so, any tips?

Just do it.  You will need corresponding fonts to display the
characters, of course.  And, if your system doesn't support input in
those languages, you will have to make sure you have Leim (library of
Emacs input methods) installed and use Emacs input methods for typing
foreign characters.

> I've heard of an editor called "sam", developed by Rob Pike, that can do
> this.  But in my many years of using emacs and reading lists like this,
> I've never heard of emacs being capable of such a wild thing.

It's strange you never heard of it, since the capability exists for
quite a few years.

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: Emacs in xterm and Cyrillic?
  2005-04-10  0:58       ` vedm
  2005-04-10 12:54         ` Recommendations: emacs files containing multiple languages ken
  2005-04-10 13:20         ` Emacs in xterm and Cyrillic? Peter Dyballa
@ 2005-04-11 17:48         ` Kevin Rodgers
       [not found]         ` <mailman.1082.1113240243.2895.help-gnu-emacs@gnu.org>
  3 siblings, 0 replies; 15+ messages in thread
From: Kevin Rodgers @ 2005-04-11 17:48 UTC (permalink / raw)


vedm wrote:
 > My cyrillic files are encoded in iso8859-5, just because that encoding
 > is within the ASCII set and is enough for the cyrilic script. Yes, I
 > agree that UTF is better for handling all sorts of languages, but I
 > still haven't tried to use it in emacs and Xterm. (One disadvantage of
 > UTF is that the UTF files (at least cyrillic files) are almost two times
 > bigger compared to ASCII encoded files).

That can't be the case, because Cyrillic characters can't even be
represented in ASCII.  It is true that your Cyrillic files will be
encoded in ISO-8859-5 with just 1 byte per character, whereas the
Cyrillic characters require 2 bytes in UTF-8 (I don't know about
UTF-16).  But the actual size of the UTF-8 files will depend on how many
Cyrillic vs. ASCII characters are present, since the ASCII characters
are still represented as a single byte.

-- 
Kevin Rodgers

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: Emacs in xterm and Cyrillic?
       [not found]         ` <mailman.1082.1113240243.2895.help-gnu-emacs@gnu.org>
@ 2005-04-14  1:47           ` vedm
  2005-04-14  2:03             ` vedm
  0 siblings, 1 reply; 15+ messages in thread
From: vedm @ 2005-04-14  1:47 UTC (permalink / raw)


Kevin Rodgers <ihs_4664@yahoo.com> writes:

> vedm wrote:
>  > My cyrillic files are encoded in iso8859-5, just because that encoding
>  > is within the ASCII set and is enough for the cyrilic script. Yes, I
>  > agree that UTF is better for handling all sorts of languages, but I
>  > still haven't tried to use it in emacs and Xterm. (One disadvantage of
>  > UTF is that the UTF files (at least cyrillic files) are almost two times
>  > bigger compared to ASCII encoded files).
> 
> That can't be the case, because Cyrillic characters can't even be
> represented in ASCII.  

The first half of this statement is true, the second one is not. When
you say "That can't be the case" I assume you mean that iso-8859-5 is
not ASCII, and that is true. Until now I thought of the ISO-8859 family
of encodings as "8-bit ASCII". But now I did my research an found this
good review of character codes:
http://www.cs.tut.fi/~jkorpela/chars.html. 
As it says:

<quote>

The misnomer "8-bit ASCII"

...ASCII is strictly and unambiguously a 7-bit code in the sense that
all code positions are in the range 0 - 127.

It (the term "8-bit ASCII") is a misnomer used to refer to various
character codes which are extensions of ASCII in the following sense:
the character repertoire contains ASCII as a subset, the code numbers
are in the range 0 - 255, and the code numbers of ASCII characters equal
their ASCII codes."

</quote>

Now, the second part of your statement is that "Cyrillic characters
can't even be represented in ASCII". But the Cyrillic alphabet consists
of about 30 letters (Bulgarian - 30, Russian - 33), and the 7-bit ASCII
code has 128 positions, which is clearly more than enough to encode 30
letters (or 60, for upper and lower case)

In fact, the first Cyrillic encodings used a 7-bit char-set. A good
discussion of the Cyrillic character sets can be found here:
http://czyborra.com/charsets/cyrillic.html

> It is true that your Cyrillic files will be
> encoded in ISO-8859-5 with just 1 byte per character, whereas the
> Cyrillic characters require 2 bytes in UTF-8 (I don't know about
> UTF-16).  But the actual size of the UTF-8 files will depend on how many
> Cyrillic vs. ASCII characters are present, since the ASCII characters
> are still represented as a single byte.

My Cyrillic files consist entirely of Cyrillic characters (excluding
special characters like spaces, new lines, dots etc), so they are
invariably almost two times bigger when encoded in UTF-8. And these
files are meant to be on my web server: so if they are UTF my server
would have to pass double the data for each page...unless there is some
compression trick.


-- 
vedm

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: Emacs in xterm and Cyrillic?
  2005-04-14  1:47           ` vedm
@ 2005-04-14  2:03             ` vedm
  0 siblings, 0 replies; 15+ messages in thread
From: vedm @ 2005-04-14  2:03 UTC (permalink / raw)


vedm <ns@nospam.com> writes:

> Kevin Rodgers <ihs_4664@yahoo.com> writes:
> 
> > That can't be the case, because Cyrillic characters can't even be
> > represented in ASCII.  

...
 
> Now, the second part of your statement is that "Cyrillic characters
> can't even be represented in ASCII". But the Cyrillic alphabet consists
> of about 30 letters (Bulgarian - 30, Russian - 33), and the 7-bit ASCII
> code has 128 positions, which is clearly more than enough to encode 30
> letters (or 60, for upper and lower case)

Well, what I mean is that one could redefine some of the ASCII codes to
correspond to Cyrillic letters...but in that case it won't be ASCII code
any more, although still 7 bit. So you win :).

-- 
vedm

^ permalink raw reply	[flat|nested] 15+ messages in thread

end of thread, other threads:[~2005-04-14  2:03 UTC | newest]

Thread overview: 15+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2005-04-03 23:34 Emacs in xterm and Cyrillic? vedm
2005-04-04 17:43 ` Peter Dyballa
2005-04-05 20:49 ` Stefan Monnier
     [not found] ` <mailman.78.1112634873.2895.help-gnu-emacs@gnu.org>
2005-04-06  1:29   ` vedm
2005-04-06  8:18     ` Peter Dyballa
2005-04-06 15:40     ` Stefan Monnier
2005-04-10  0:26       ` vedm
     [not found]     ` <mailman.354.1112774023.2895.help-gnu-emacs@gnu.org>
2005-04-10  0:58       ` vedm
2005-04-10 12:54         ` Recommendations: emacs files containing multiple languages ken
2005-04-10 13:33           ` Peter Dyballa
2005-04-10 19:06           ` Eli Zaretskii
2005-04-10 13:20         ` Emacs in xterm and Cyrillic? Peter Dyballa
2005-04-11 17:48         ` Kevin Rodgers
     [not found]         ` <mailman.1082.1113240243.2895.help-gnu-emacs@gnu.org>
2005-04-14  1:47           ` vedm
2005-04-14  2:03             ` vedm

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).