unofficial mirror of help-gnu-emacs@gnu.org
 help / color / mirror / Atom feed
* Polish characters in emacs
@ 2007-10-11 21:09 Wojtek
  2007-10-12  9:03 ` Eli Zaretskii
                   ` (2 more replies)
  0 siblings, 3 replies; 7+ messages in thread
From: Wojtek @ 2007-10-11 21:09 UTC (permalink / raw)
  To: help-gnu-emacs

Could someone point me to an explanation of settings so that Polish
characters are displayed correctly in an emacs buffer and whether this
has to do with the environment outside of emacs.
When I open up a connection to my account and run emacs from Fedora 7
the characters do not show up when viewing a mail message encoded as
utf-8.  However I can toggle the input method to polish-slash and
enter polish characters and they do show up.  When connecting to the
same account from a Windows machine using Cygwin-X, the characters in
the mail message show up without a problem. Since the emacs I am
running is starting up with the same parameters, what controlsl the
display of characters?

Wojciech Komornicki

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Polish characters in emacs
  2007-10-11 21:09 Polish characters in emacs Wojtek
@ 2007-10-12  9:03 ` Eli Zaretskii
  2007-10-12 10:20 ` Peter Dyballa
       [not found] ` <mailman.1976.1192184434.18990.help-gnu-emacs@gnu.org>
  2 siblings, 0 replies; 7+ messages in thread
From: Eli Zaretskii @ 2007-10-12  9:03 UTC (permalink / raw)
  To: help-gnu-emacs

> From: Wojtek <wnkltd@gmail.com>
> Date: Thu, 11 Oct 2007 14:09:57 -0700
> 
> When I open up a connection to my account and run emacs from Fedora 7
> the characters do not show up when viewing a mail message encoded as
> utf-8.  However I can toggle the input method to polish-slash and
> enter polish characters and they do show up.  When connecting to the
> same account from a Windows machine using Cygwin-X, the characters in
> the mail message show up without a problem. Since the emacs I am
> running is starting up with the same parameters, what controlsl the
> display of characters?

Please tell more.  For starters, when you connect from Fedora 7, do
you start Emacs in the GUI mode (i.e. under X) or in text terminal
mode?

Also, what do you mean by ``characters do not show up'', exactly?  Do
you see empty boxes or some other characters?

Finally, what version of Emacs do you run?

Oh, and try "emacs -q --no-site-file" in both cases to see if the
behavior changes.

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Polish characters in emacs
  2007-10-11 21:09 Polish characters in emacs Wojtek
  2007-10-12  9:03 ` Eli Zaretskii
@ 2007-10-12 10:20 ` Peter Dyballa
       [not found] ` <mailman.1976.1192184434.18990.help-gnu-emacs@gnu.org>
  2 siblings, 0 replies; 7+ messages in thread
From: Peter Dyballa @ 2007-10-12 10:20 UTC (permalink / raw)
  To: Wojtek; +Cc: help-gnu-emacs


Am 11.10.2007 um 23:09 schrieb Wojtek:

> Could someone point me to an explanation of settings so that Polish
> characters are displayed correctly in an emacs buffer and whether this
> has to do with the environment outside of emacs.

There are three 8 bit ISO Latin encodings that support Polish: ISO  
8859-2, ISO 8859-13, and ISO 8859-16, the 8 bit MS encoding Code Page  
1250, and finally UTF-8. The all have  
ÓóĄąĆćĘꣳŃńŚśŹźżż.

> When I open up a connection to my account and run emacs from Fedora 7
> the characters do not show up when viewing a mail message encoded as
> utf-8.

*How* does this happen? Does it happen that you only see empty boxes?  
Then you're using a font that does not have the Polish characters.  
Change, for example, to Lucida Sans Typewriter from Java SDK!

> However I can toggle the input method to polish-slash and
> enter polish characters and they do show up.

This can be something completely different. (I never use an "input  
method." At least not by conscience.) And it's no proof, except that  
this GNU Emacs can display the chosen item properly. So their might  
be some mis-understanding come from the eMail client used to retrieve  
the input data for that buffer.

> When connecting to the same account from a Windows machine using  
> Cygwin-X, the characters in
> the mail message show up without a problem.

Ahh! So you are writing the whole time about eMails and their textual  
presentation? Which eMail client do you use to read the eMails? Can  
you make some of the header lines of the eMails appear in your eMail  
client, particularly those that describe the way the message was  
encoded for the transport through the Internet? The eMail client can  
have its own ideas of representing an eMail's contents ...

> Since the emacs I am running is starting up with the same  
> parameters, what controlsl the display of characters?

It's definitely the encoding used in the buffer. It's indicated at  
the beginning of the mode-line (left-most characters).

	-*: for MS CP1250 or CP1252
	-2: for ISO 8859-2   (Latin  2)
	-l: for ISO 8859-13  (Latin  7)
	-r: for ISO 8859-16  (Latin 10)
	-u: for UTF-8

BTW, with the mouse cursor you can select that character and a *Help*  
buffer with explanation opens.


A good method to check where the error can come from is to use a  
"neutral" simple and pure text file like this one:

;;; -*- mode: Text; coding: iso-8859-2; -*-
;
;	Time-stamp: <2005-05-11 23:52:49 pete>
;
;   Central and Eastern European Glyphs (Latin 2)
;
;   oct   dec   hex    UCS2    UTF-8
;=====================================
   = 240 = 160 = A0 = U+00A0 =    C2 A0 : NO-BREAK SPACE
Ą = 241 = 161 = A1 = U+0104 =    C4 84 : LATIN CAPITAL LETTER A WITH  
OGONEK
˘ = 242 = 162 = A2 = U+02D8 =    CB 98 : BREVE
Ł = 243 = 163 = A3 = U+0141 =    C5 81 : LATIN CAPITAL LETTER L WITH  
STROKE
¤ = 244 = 164 = A4 = U+00A4 =    C2 A4 : CURRENCY SIGN
Ľ = 245 = 165 = A5 = U+013D =    C4 BD : LATIN CAPITAL LETTER L WITH  
CARON
Ś = 246 = 166 = A6 = U+015A =    C5 9A : LATIN CAPITAL LETTER S WITH  
ACUTE
§ = 247 = 167 = A7 = U+00A7 =    C2 A7 : SECTION SIGN
¨ = 250 = 168 = A8 = U+00A8 =    C2 A8 : DIAERESIS
Š = 251 = 169 = A9 = U+0160 =    C5 A0 : LATIN CAPITAL LETTER S WITH  
CARON
Ş = 252 = 170 = AA = U+015E =    C5 9E : LATIN CAPITAL LETTER S WITH  
CEDILLA
Ť = 253 = 171 = AB = U+0164 =    C5 A4 : LATIN CAPITAL LETTER T WITH  
CARON
Ź = 254 = 172 = AC = U+0179 =    C5 B9 : LATIN CAPITAL LETTER Z WITH  
ACUTE
- = 255 = 173 = AD = U+00AD =    C2 AD : HYPHEN-MINUS
Ž = 256 = 174 = AE = U+017D =    C5 BD : LATIN CAPITAL LETTER Z WITH  
CARON
Ż = 257 = 175 = AF = U+017B =    C5 BB : LATIN CAPITAL LETTER Z WITH  
DOT ABOVE
° = 260 = 176 = B0 = U+00B0 =    C2 B0 : DEGREE SIGN
ą = 261 = 177 = B1 = U+0105 =    C4 85 : LATIN SMALL LETTER A WITH  
OGONEK
˛ = 262 = 178 = B2 = U+02DB =    CB 9B : OGONEK
ł = 263 = 179 = B3 = U+0142 =    C5 82 : LATIN SMALL LETTER L WITH  
STROKE
´ = 264 = 180 = B4 = U+00B4 =    C2 B4 : ACUTE ACCENT
ľ = 265 = 181 = B5 = U+013E =    C4 BE : LATIN SMALL LETTER L WITH  
CARON
ś = 266 = 182 = B6 = U+015B =    C5 9B : LATIN SMALL LETTER S WITH  
ACUTE
ˇ = 267 = 183 = B7 = U+02C7 =    CB 87 : CARON
¸ = 270 = 184 = B8 = U+00B8 =    C2 B8 : CEDILLA
š = 271 = 185 = B9 = U+0161 =    C5 A1 : LATIN SMALL LETTER S WITH  
CARON
ş = 272 = 186 = BA = U+015F =    C5 9F : LATIN SMALL LETTER S WITH  
CEDILLA
ť = 273 = 187 = BB = U+0165 =    C5 A5 : LATIN SMALL LETTER T WITH  
CARON
ź = 274 = 188 = BC = U+017A =    C5 BA : LATIN SMALL LETTER Z WITH  
ACUTE
˝ = 275 = 189 = BD = U+02DD =    CB 9D : DOUBLE ACUTE ACCENT
ž = 276 = 190 = BE = U+017E =    C5 BE : LATIN SMALL LETTER Z WITH  
CARON
ż = 277 = 191 = BF = U+017C =    C5 BC : LATIN SMALL LETTER Z WITH  
DOT ABOVE
Ŕ = 300 = 192 = C0 = U+0154 =    C5 94 : LATIN CAPITAL LETTER R WITH  
ACUTE
Á = 301 = 193 = C1 = U+00C1 =    C3 81 : LATIN CAPITAL LETTER A WITH  
ACUTE
 = 302 = 194 = C2 = U+00C2 =    C3 82 : LATIN CAPITAL LETTER A WITH  
CIRCUMFLEX
Ă = 303 = 195 = C3 = U+0102 =    C4 82 : LATIN CAPITAL LETTER A WITH  
BREVE
Ä = 304 = 196 = C4 = U+00C4 =    C3 84 : LATIN CAPITAL LETTER A WITH  
DIAERESIS
Ĺ = 305 = 197 = C5 = U+0139 =    C4 B9 : LATIN CAPITAL LETTER L WITH  
ACUTE
Ć = 306 = 198 = C6 = U+0106 =    C4 86 : LATIN CAPITAL LETTER C WITH  
ACUTE
Ç = 307 = 199 = C7 = U+00C7 =    C3 87 : LATIN CAPITAL LETTER C WITH  
CEDILLA
Č = 310 = 200 = C8 = U+010C =    C4 8C : LATIN CAPITAL LETTER C WITH  
CARON
É = 311 = 201 = C9 = U+00C9 =    C3 89 : LATIN CAPITAL LETTER E WITH  
ACUTE
Ę = 312 = 202 = CA = U+0118 =    C4 98 : LATIN CAPITAL LETTER E WITH  
OGONEK
Ë = 313 = 203 = CB = U+00CB =    C3 8B : LATIN CAPITAL LETTER E WITH  
DIAERESIS
Ě = 314 = 204 = CC = U+011A =    C4 9A : LATIN CAPITAL LETTER E WITH  
CARON
Í = 315 = 205 = CD = U+00CD =    C3 8D : LATIN CAPITAL LETTER I WITH  
ACUTE
Î = 316 = 206 = CE = U+00CE =    C3 8E : LATIN CAPITAL LETTER I WITH  
CIRCUMFLEX
Ď = 317 = 207 = CF = U+010E =    C4 8E : LATIN CAPITAL LETTER D WITH  
CARON
Đ = 320 = 208 = D0 = U+0110 =    C4 90 : LATIN CAPITAL LETTER D WITH  
STROKE
Ń = 321 = 209 = D1 = U+0143 =    C5 83 : LATIN CAPITAL LETTER N WITH  
ACUTE
Ň = 322 = 210 = D2 = U+0147 =    C5 87 : LATIN CAPITAL LETTER N WITH  
CARON
Ó = 323 = 211 = D3 = U+00D3 =    C3 93 : LATIN CAPITAL LETTER O WITH  
ACUTE
Ô = 324 = 212 = D4 = U+00D4 =    C3 94 : LATIN CAPITAL LETTER O WITH  
CIRCUMFLEX
Ő = 325 = 213 = D5 = U+0150 =    C5 90 : LATIN CAPITAL LETTER O WITH  
DOUBLE ACUTE
Ö = 326 = 214 = D6 = U+00D6 =    C3 96 : LATIN CAPITAL LETTER O WITH  
DIAERESIS
× = 327 = 215 = D7 = U+00D7 =    C3 97 : MULTIPLICATION SIGN
Ř = 330 = 216 = D8 = U+0158 =    C5 98 : LATIN CAPITAL LETTER R WITH  
CARON
Ů = 331 = 217 = D9 = U+016E =    C5 AE : LATIN CAPITAL LETTER U WITH  
RING ABOVE
Ú = 332 = 218 = DA = U+00DA =    C3 9A : LATIN CAPITAL LETTER U WITH  
ACUTE
Ű = 333 = 219 = DB = U+0170 =    C5 B0 : LATIN CAPITAL LETTER U WITH  
DOUBLE ACUTE
Ü = 334 = 220 = DC = U+00DC =    C3 9C : LATIN CAPITAL LETTER U WITH  
DIAERESIS
Ý = 335 = 221 = DD = U+00DD =    C3 9D : LATIN CAPITAL LETTER Y WITH  
ACUTE
Ţ = 336 = 222 = DE = U+0162 =    C5 A2 : LATIN CAPITAL LETTER T WITH  
CEDILLA
ß = 337 = 223 = DF = U+00DF =    C3 9F : LATIN SMALL LETTER SHARP S
ŕ = 340 = 224 = E0 = U+0155 =    C5 95 : LATIN SMALL LETTER R WITH  
ACUTE
á = 341 = 225 = E1 = U+00E1 =    C3 A1 : LATIN SMALL LETTER A WITH  
ACUTE
â = 342 = 226 = E2 = U+00E2 =    C3 A2 : LATIN SMALL LETTER A WITH  
CIRCUMFLEX
ă = 343 = 227 = E3 = U+0103 =    C4 83 : LATIN SMALL LETTER A WITH  
BREVE
ä = 344 = 228 = E4 = U+00E4 =    C3 A4 : LATIN SMALL LETTER A WITH  
DIAERESIS
ĺ = 345 = 229 = E5 = U+013A =    C4 BA : LATIN SMALL LETTER L WITH  
ACUTE
ć = 346 = 230 = E6 = U+0107 =    C4 87 : LATIN SMALL LETTER C WITH  
ACUTE
ç = 347 = 231 = E7 = U+00E7 =    C3 A7 : LATIN SMALL LETTER C WITH  
CEDILLA
č = 350 = 232 = E8 = U+010D =    C4 8D : LATIN SMALL LETTER C WITH  
CARON
é = 351 = 233 = E9 = U+00E9 =    C3 A9 : LATIN SMALL LETTER E WITH  
ACUTE
ę = 352 = 234 = EA = U+0119 =    C4 99 : LATIN SMALL LETTER E WITH  
OGONEK
ë = 353 = 235 = EB = U+00EB =    C3 AB : LATIN SMALL LETTER E WITH  
DIAERESIS
ě = 354 = 236 = EC = U+011B =    C4 9B : LATIN SMALL LETTER E WITH  
CARON
í = 355 = 237 = ED = U+00ED =    C3 AD : LATIN SMALL LETTER I WITH  
ACUTE
î = 356 = 238 = EE = U+00EE =    C3 AE : LATIN SMALL LETTER I WITH  
CIRCUMFLEX
ď = 357 = 239 = EF = U+010F =    C4 8F : LATIN SMALL LETTER D WITH  
CARON
đ = 360 = 240 = F0 = U+0111 =    C4 91 : LATIN SMALL LETTER D WITH  
STROKE
ń = 361 = 241 = F1 = U+0144 =    C5 84 : LATIN SMALL LETTER N WITH  
ACUTE
ň = 362 = 242 = F2 = U+0148 =    C5 88 : LATIN SMALL LETTER N WITH  
CARON
ó = 363 = 243 = F3 = U+00F3 =    C3 B3 : LATIN SMALL LETTER O WITH  
ACUTE
ô = 364 = 244 = F4 = U+00F4 =    C3 B4 : LATIN SMALL LETTER O WITH  
CIRCUMFLEX
ő = 365 = 245 = F5 = U+0151 =    C5 91 : LATIN SMALL LETTER O WITH  
DOUBLE ACUTE
ö = 366 = 246 = F6 = U+00F6 =    C3 B6 : LATIN SMALL LETTER O WITH  
DIAERESIS
÷ = 367 = 247 = F7 = U+00F7 =    C3 B7 : DIVISION SIGN
ř = 370 = 248 = F8 = U+0159 =    C5 99 : LATIN SMALL LETTER R WITH  
CARON
ů = 371 = 249 = F9 = U+016F =    C5 AF : LATIN SMALL LETTER U WITH  
RING ABOVE
ú = 372 = 250 = FA = U+00FA =    C3 BA : LATIN SMALL LETTER U WITH  
ACUTE
ű = 373 = 251 = FB = U+0171 =    C5 B1 : LATIN SMALL LETTER U WITH  
DOUBLE ACUTE
ü = 374 = 252 = FC = U+00FC =    C3 BC : LATIN SMALL LETTER U WITH  
DIAERESIS
ý = 375 = 253 = FD = U+00FD =    C3 BD : LATIN SMALL LETTER Y WITH  
ACUTE
ţ = 376 = 254 = FE = U+0163 =    C5 A3 : LATIN SMALL LETTER T WITH  
CEDILLA
˙ = 377 = 255 = FF = U+02D9 =    CB 99 : DOT ABOVE

and it in both Emacsen. Run them at the same time and compare mode- 
lines and other details (encodings, fonts used: C-u C-x = on a  
glyph, ...). In your user init file you can prepare sections  for  
emacs-major-version or window-system variables.


If you want to have some fun, then change this files first line from  
iso-8859-2 to, let's say, iso-8859-16 *outside* of GNU Emacs, by for  
example, cat <file> | sed -e s/iso-8859-2/iso-8859-16/ > <other  
file>. This will only change exactly *one* byte (the 2 will become  
16), but the first column will be totally different in GNU Emacs –  
and the descriptional text will become untrue for most characters.  
Just to learn that there is contents somewhere below and you only get  
some *presentation* of this contents. (As in real life you can't see  
the reality outside your head.)

--
Greetings

   Pete

Time flies like an error -- but fruit flies like a banana!
                              (almost Groucho Marx)

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Polish characters in emacs
       [not found] ` <mailman.1976.1192184434.18990.help-gnu-emacs@gnu.org>
@ 2007-10-12 19:03   ` Wojtek
  2007-10-12 20:44     ` Peter Dyballa
       [not found]     ` <mailman.1993.1192221868.18990.help-gnu-emacs@gnu.org>
  0 siblings, 2 replies; 7+ messages in thread
From: Wojtek @ 2007-10-12 19:03 UTC (permalink / raw)
  To: help-gnu-emacs

Thank you Pete for all of the explanation.
Here is a bit more info and some progress I have made.
    1) The machine to which my monitor and keyboard are attached is
running Fedora 7
    2) I connect via ssh tunneling to a machine that is running Fedora
5 and open up a gui emacs
        The command is
            ssh ... "/usr/bin/emacs -fn lucidasanstypewriter-12"
    3) In older versions of emacs (when running on various flavors of
UNIX or in Windows) I used
        the mule package which has input modes and the polish-slash
mode was such that
        /a produced ą, /e produced ę etc
     4) On the Federa 7 box, I have enabled the polish keyboard and
thus produce "ogonki" in
         any application
     5) When I open up firefox on the Fedora 5 box via ssh tuneling I
can produce "ogonki" in the
         same way
      6) I cannot produce polish characters in emacs on the Fedora 5
machine.

My IT people are going to take away the Fedora 5 machine so this may
be a mute point but I would
like to learn what is going on.

Wojtek

On Oct 12, 5:20 am, Peter Dyballa <Peter_Dyba...@Web.DE> wrote:
> Am 11.10.2007 um 23:09 schrieb Wojtek:
>
> > Could someone point me to an explanation of settings so that Polish
> > characters are displayed correctly in an emacs buffer and whether this
> > has to do with the environment outside of emacs.
>
> There are three 8 bit ISO Latin encodings that support Polish: ISO
> 8859-2, ISO 8859-13, and ISO 8859-16, the 8 bit MS encoding Code Page
> 1250, and finally UTF-8. The all have
> ÓóĄąĆćĘꣳŃńŚśŹźżż.
[..snip..]  (for the fulll text see the previous message in this
thread)

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Polish characters in emacs
  2007-10-12 19:03   ` Wojtek
@ 2007-10-12 20:44     ` Peter Dyballa
       [not found]     ` <mailman.1993.1192221868.18990.help-gnu-emacs@gnu.org>
  1 sibling, 0 replies; 7+ messages in thread
From: Peter Dyballa @ 2007-10-12 20:44 UTC (permalink / raw)
  To: Wojtek; +Cc: help-gnu-emacs


Am 12.10.2007 um 21:03 schrieb Wojtek:

> My IT people are going to take away the Fedora 5 machine so this may
> be a mute point but I would
> like to learn what is going on.

What is the version of GNU Emacs on Fedora Core 5? Probably some  
early version 21.

What is the phenomenon by which you judge that GNU Emacs can't  
display ÓóĄąĆćĘꣳŃńŚśŹźżż? I don't have more  
guesses ...

Have you checked lucidasanstypewriter-12 font with xfd? You can use a  
bigger size to see the glyphs more clearly. (I was writing of the  
TrueType fonts, which are not restricted to some ISO Latin encoding.)  
Have you checked with xfontsel or xlsfonts which fonts exist on  
*your* local Fedora Core 7 system that support Polish characters? Can  
you pass such a font's name to GNU Emacs? (Since the DISPLAY variable  
on the Fedora Core 5 system is set to your local Fedora Core 7  
system, remote GNU Emacs uses your local fonts. So the chances are  
excellent that Unicode fonts exist – although they might not be  
installed. And from these, ISO Latin or MS sub-encodings are  
automatically derived, so you can make GNU Emacs' memory use small.)  
Check X11's Font Path (xset -q) and correct it if necessary, i.e.  
there are more directories with fonts than the X server knows.

You can do the same in Cygwin ...

Which encoding is displayed in mode-line?


If you want some help you have to give the answers we ask for.  
They're not meant to waste your (leisure) time.

--
Greetings
                                  <]
    Pete      o        __o         |__    o           recumbo
     ___o    /I       -\<,         |o \  -\),-%       ergo sum!
___/\ /\___./ \___...O/ O____.....`-O-'-()--o_________________

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Polish characters in emacs
       [not found]     ` <mailman.1993.1192221868.18990.help-gnu-emacs@gnu.org>
@ 2007-10-19 16:36       ` Wojtek
  2007-10-19 19:38         ` Peter Dyballa
  0 siblings, 1 reply; 7+ messages in thread
From: Wojtek @ 2007-10-19 16:36 UTC (permalink / raw)
  To: help-gnu-emacs

[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1: Type: text/plain; charset="us-ascii", Size: 2726 bytes --]

On Oct 12, 3:44 pm, Peter Dyballa <Peter_Dyba...@Web.DE> wrote:
> Am 12.10.2007 um 21:03 schrieb Wojtek:
>
> > My IT people are going to take away the Fedora 5 machine so this may
> > be a mute point but I would
> > like to learn what is going on.
>
> What is the version of GNU Emacs on Fedora Core 5? Probably some
> early version 21.
>
> What is the phenomenon by which you judge that GNU Emacs can't
> display Óó                ? I don't have more
> guesses ...

When I enter letters with diacritic marks on the Fedora 5 emacs, I
just see black
boxes instead of the letters.  When I enter   on the Fedora 5 emacs
and then use
C-x = to examine the character, it is character 331813 (0x51025).
However on the
Fedora 7 machine it is character 2353 (0x931).  Clearly(?) the same
keyboard message
is not being transmitted to the Fedora 5 emacs.

> Have you checked lucidasanstypewriter-12 font with xfd? You can use a
> bigger size to see the glyphs more clearly. (I was writing of the
> TrueType fonts, which are not restricted to some ISO Latin encoding.)
> Have you checked with xfontsel or xlsfonts which fonts exist on
> *your* local Fedora Core 7 system that support Polish characters? Can
> you pass such a font's name to GNU Emacs? (Since the DISPLAY variable
> on the Fedora Core 5 system is set to your local Fedora Core 7
> system, remote GNU Emacs uses your local fonts. So the chances are
> excellent that Unicode fonts exist - although they might not be
> installed. And from these, ISO Latin or MS sub-encodings are
> automatically derived, so you can make GNU Emacs' memory use small.)
> Check X11's Font Path (xset -q) and correct it if necessary, i.e.
> there are more directories with fonts than the X server knows.

I have tried various fonts when launching emacs with no change.

> You can do the same in Cygwin ...

When I connect to each machine from Cygwin I get the same behaviour.

> Which encoding is displayed in mode-line?

I have two dashes at the beginning of my mode line in both emacses

> If you want some help you have to give the answers we ask for.
> They're not meant to waste your (leisure) time.

I should add that I am running emacs 21.4 on Fedora 5 and emacs 22.1
on Fedora 7.  The reason that the behaviour puzzles me is that I
understand that both of the emacses are getting their fonts from the
same place and hence it is (based on my understanding) a matter of
emacs displaying the font.
When I ask emacs to describe-coding-system I get that it is utf-8 for
terminal output in both emacses.

I have used xfd to look at the font and there I can see the required
diacritical marks.

Thank you for your comments.
Wojtek

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Polish characters in emacs
  2007-10-19 16:36       ` Wojtek
@ 2007-10-19 19:38         ` Peter Dyballa
  0 siblings, 0 replies; 7+ messages in thread
From: Peter Dyballa @ 2007-10-19 19:38 UTC (permalink / raw)
  To: Wojtek; +Cc: help-gnu-emacs


Am 19.10.2007 um 18:36 schrieb Wojtek:

> I should add that I am running emacs 21.4 on Fedora 5 and emacs 22.1
> on Fedora 7.

This explains why C-u C-x = returns different "numbers." They really  
don't mean a thing for us users. They are some kind of representation  
of the way GNU Emacs stores these characters in the buffer you're  
visiting. Of importance is the last value, U+<four hex digits>. This  
corresponds to the position of the character in Unicode. So it's not  
true that different "key codes" are transmitted. See below!

Have you found the lines in which the font is named from which the  
glyph is taken to display the character?


>   The reason that the behaviour puzzles me is that I
> understand that both of the emacses are getting their fonts from the
> same place and hence it is (based on my understanding) a matter of
> emacs displaying the font.

Right. IMO much progress happened between 21.4 and 22.1, GNU Emacs  
learned some encodings, what-cursor-position gives much more and  
valuable information. You can press the mode-line in many places and  
you get information instead of errors. Many tool-tips appear ...

> When I ask emacs to describe-coding-system I get that it is utf-8 for
> terminal output in both emacses.

OK, but then your tests are incorrect. To test what is input to GNU  
Emacs just type in *scratch* buffer or such:

	C-q <some key press event on the keyboard>

(This is quite nice when you press a function or cursor key!) This  
*input* problem is one area, which probably works without fault.

You won't believe this because you don't *see* it. So your problem is  
merely an *output* problem. When you write that the mode-line starts  
with ``--:´´ then you could have already found that this stands for  
an undecided encoding (by pressing the second - in mode-line). Or:  
what you see is not what you've got. So make precise tests. For  
example with the test file I sent to the list. Its first line *will*  
set the buffer's encoding to the proper value to see the characters  
as described. Then it's up to you and your system to find the  
adequate fonts or fontsets. For this latter purpose you might write  
an ELisp file that lets you choose another font (or fontset) from  
those (monospaced) fonts (S-mouse-1) (you can, of course, leave out  
completely uninteresting font encodings):

(if (fboundp 'new-fontset)
   (progn
     (create-fontset-from-fontset-spec "-b&h-lucidatypewriter-medium- 
r-*-*-10-*-*-*-*-*-fontset-10pt_lucidatypewriter" t 'noerror)
       (set-fontset-font "fontset-10pt_lucidatypewriter"	      'latin- 
iso8859-1  '("lucidatypewriter" . "iso8859-1"))
       (set-fontset-font "fontset-10pt_lucidatypewriter"	      'latin- 
iso8859-2  '("lucidatypewriter" . "iso8859-2"))
       (set-fontset-font "fontset-10pt_lucidatypewriter"	      'latin- 
iso8859-3  '("lucidatypewriter" . "iso8859-3"))
       (set-fontset-font "fontset-10pt_lucidatypewriter"	      'latin- 
iso8859-4  '("lucidatypewriter" . "iso8859-4"))
       (set-fontset-font "fontset-10pt_lucidatypewriter"	   'cyrillic- 
iso8859-5  '("lucidatypewriter" . "iso8859-5"))
       (set-fontset-font "fontset-10pt_lucidatypewriter"	     'arabic- 
iso8859-6  '("lucidatypewriter" . "iso8859-6"))
       (set-fontset-font "fontset-10pt_lucidatypewriter"	      'greek- 
iso8859-7  '("lucidatypewriter" . "iso8859-7"))
       (set-fontset-font "fontset-10pt_lucidatypewriter"	     'hebrew- 
iso8859-8  '("lucidatypewriter" . "iso8859-8"))
       (set-fontset-font "fontset-10pt_lucidatypewriter"	      'latin- 
iso8859-9  '("lucidatypewriter" . "iso8859-9"))
       (set-fontset-font "fontset-10pt_lucidatypewriter"	      'latin- 
iso8859-10 '("lucidatypewriter" . "iso8859-10"))
       (set-fontset-font "fontset-10pt_lucidatypewriter"	      'latin- 
iso8859-13 '("lucidatypewriter" . "iso8859-13"))
       (set-fontset-font "fontset-10pt_lucidatypewriter"	      'latin- 
iso8859-15 '("lucidatypewriter" . "iso8859-15"))
       (set-fontset-font "fontset-10pt_lucidatypewriter" 'mule- 
unicode-0100-24ff '("lucidatypewriter" . "iso10646-1"))
       (set-fontset-font "fontset-10pt_lucidatypewriter" 'mule- 
unicode-2500-33ff '("lucidatypewriter" . "iso10646-1"))
       (set-fontset-font "fontset-10pt_lucidatypewriter" 'mule- 
unicode-e000-ffff '("lucidatypewriter" . "iso10646-1"))
       (set-fontset-font  
"fontset-10pt_lucidatypewriter"                  'ascii  
'("lucidatypewriter" . "iso10646-1"))
       (set-fontset-font "fontset-10pt_lucidatypewriter"              
'iso10646-1 '("lucidatypewriter" . "iso10646-1"))
))
(provide 'site-fontsets-x11)

or (again, you can leave out variants and sizes and concentrate on  
those encodings that have ÓóĄąĆćĘꣳŃńŚśŹźŻż):

(setq x-fixed-font-alist
    '("X11 Font Menu"
       ("Test fixed"
       ("Courier @ 75 DPI"
;; For these, we specify the point height and the DPIs.
        ( "8" 	      	"-adobe-courier-medium-r-normal--*-80-75-75-m-*- 
iso10646-1"   "8")
        ("10" 	      	"-adobe-courier-medium-r-normal--*-100-75-75-m- 
*-iso10646-1" "10")
        ("12" 	      	"-adobe-courier-medium-r-normal--*-120-75-75-m- 
*-iso10646-1" "12")
        ( "8 bold"       "-adobe-courier-bold-r-normal--*-80-75-75-m- 
*-iso10646-1"   "8 bold")
        ("10 bold"       "-adobe-courier-bold-r-normal--*-100-75-75-m- 
*-iso10646-1" "10 bold")
        ("12 bold"       "-adobe-courier-bold-r-normal--*-120-75-75-m- 
*-iso10646-1" "12 bold")
        ( "8 slant"      "-adobe-courier-medium-o-normal--*-80-75-75- 
m-*-iso10646-1"   "8 slant")
        ("10 slant"      "-adobe-courier-medium-o-normal--*-100-75-75- 
m-*-iso10646-1" "10 slant")
        ("12 slant"      "-adobe-courier-medium-o-normal--*-120-75-75- 
m-*-iso10646-1" "12 slant")
        ( "8 bold slant" "-adobe-courier-bold-o-normal--*-80-75-75-m- 
*-iso10646-1"   "8 bold slant")
        ("10 bold slant" "-adobe-courier-bold-o-normal--*-100-75-75-m- 
*-iso10646-1" "10 bold slant")
        ("12 bold slant" "-adobe-courier-bold-o-normal--*-120-75-75-m- 
*-iso10646-1" "12 bold slant")
       )
       ("ISO 8859-15 medium"
        ("B&H LucidaSans Typewriter med" "-*-lucida sans typewriter- 
medium-r-*--0-0-0-0-m-0-iso8859-15")
        ("B&H Luxi Mono med"             "-*-luxi mono-medium-r- 
*--0-0-0-0-m-0-iso8859-15")
        ("Bitstream Vera Sans Mono med"  "-*-bitstream vera sans mono- 
medium-r-*--0-0-0-0-m-0-iso8859-15")
        ("ProFont med"                   "-misc-profontwindows-medium- 
r-*--0-0-0-0-m-0-iso8859-15")
        ("Monotype Andale Mono med" 	"-monotype-andale mono-medium-r- 
*--0-0-0-0-m-0-iso8859-15")
        ("Monotype Courier New med" 	"-monotype-courier new-medium-r- 
*--0-0-0-0-m-0-iso8859-15")
        ("urw++ Courier med"         	"-urw-courier-medium-r- 
*--0-0-0-0-m-0-iso8859-15")
        ("urw++ Nimbus Mono l med"   	"-urw-nimbus mono l-medium-r- 
*--0-0-0-0-m-0-iso8859-15")
        )
))
(provide 'site-fonts-x11)

In an init file you can load these extra font definitions via:

	(require 'site-fontsets-x11)

or

	(require 'site-fonts-x11)

and use them for the tests: in one window my test file is open and  
you switch through all the fonts to see whether at some time you see  
the Polish characters. Of course you can sort the 18 or 20 lines to  
become one Polish character block.

Once you've accomplished to see the the right shapes from the test  
file, you can start to enter yourself Polish characters into this  
test file (and kill the buffer without saving your changes).  
Remember: only in the right encoding you can see the right characters  
because one byte value represents dozens of different characters in  
dozens of different encodings.


BTW, what diacritic did you enter on Fedora 5? I can't see anything.

--
Greetings
   Pete

One-Shot Case Study, n.:
         The scientific equivalent of the four-leaf clover, from
         which it is concluded all clovers possess four leaves
         and are sometimes green.

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2007-10-19 19:38 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2007-10-11 21:09 Polish characters in emacs Wojtek
2007-10-12  9:03 ` Eli Zaretskii
2007-10-12 10:20 ` Peter Dyballa
     [not found] ` <mailman.1976.1192184434.18990.help-gnu-emacs@gnu.org>
2007-10-12 19:03   ` Wojtek
2007-10-12 20:44     ` Peter Dyballa
     [not found]     ` <mailman.1993.1192221868.18990.help-gnu-emacs@gnu.org>
2007-10-19 16:36       ` Wojtek
2007-10-19 19:38         ` Peter Dyballa

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).