From: Peter Dyballa <Peter_Dyballa@Freenet.DE>
To: Stefan Monnier <monnier@iro.umontreal.ca>
Cc: 4157@emacsbugs.donarmstrong.com
Subject: bug#4157: 23.1.50; faulty character characterisation for ä
Date: Sun, 23 Aug 2009 11:57:37 +0200 [thread overview]
Message-ID: <FADA1BCB-8714-4FFD-827F-7F3FD7A22D19@Freenet.DE> (raw)
In-Reply-To: <jwvfxbjb8t1.fsf-monnier+emacsbugreports@gnu.org>
Am 23.08.2009 um 03:49 schrieb Stefan Monnier:
>> In both locales the *file names* are correct and also detected as
>> containing
>
> "correct" doesn't really tell me what you see, but I see what you
> mean.
"Correct" meant that I was seeing what I had typed before in Finder...
>
>> "composed characters," it's a problem with the file's month date.
>> In the
>
> So my guess was right: ls's output uses utf-8 for the filenames, but
> latin-1 for the date, which is why it's difficult for dired to do the
> right thing (it's not impossible, of course, but it's more work and
> dired is currently not setup for that).
>
Here is a little test from a shell (actually *shell* buffer in NS
Emacs.app with UTF-8 locales):
pete 252 /\ gls -lN zo*
-rw-r--r-- 1 pete admin 281829 20. Mär 1998 zoä€.au
pete 253 /\ ls -lw zo*
-rw-r--r-- 1 pete admin 281829 20 Mär 1998 zoä€.au
pete 254 /\ gls -lN zo* | od -j 32 -t a
0000040 0 . sp M \303 \244 r sp 1 9 9 8 sp
sp z o
0000060 a \314 88 \342 82 \254 . a u nl
0000072
pete 255 /\ env LC_CTYPE=de_DE.ISO8859-15 LANG=de_DE.ISO8859-15 gls -
lN zo* | od -j 32 -t a
0000040 0 . sp M \344 r sp 1 9 9 8 sp sp z
o a
0000060 \314 88 \342 82 \254 . a u nl
0000071
pete 256 /\ ls -lw zo* | od -j 32 -t a
0000040 2 9 sp 2 0 sp M \303 \244 r sp 1 9
9 8
0000060 sp z o a \314 88 \342 82 \254 . a u nl
0000075
pete 257 /\ env LC_CTYPE=de_DE.ISO8859-15 LANG=de_DE.ISO8859-15 ls -
lw zo* | od -j 32 -t a
0000040 2 9 sp 2 0 sp M \344 r sp sp 1 9 9
8 sp
0000060 z o a \314 88 \342 82 \254 . a u nl
0000074
So the *ls commands deliver the month date in their locale composed
while the file name is always *de*composed UTF-8:
\303 \244 = C3 A4 = LATIN SMALL LETTER A WITH DIAERESIS ä at U
+00E4
\314 88 = CC 88 = COMBINING DIAERESIS ¨ at U
+0308
\342 82 \254 = E2 88 AC = EURO SIGN € at U
+20AC
--
Greetings
Pete
Bake pizza not war!
next prev parent reply other threads:[~2009-08-23 9:57 UTC|newest]
Thread overview: 47+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-08-16 2:19 bug#4157: 23.1.50; faulty character characterisation for ä Peter Dyballa
2009-08-18 1:09 ` Kenichi Handa
2009-08-18 13:40 ` Peter Dyballa
2009-08-19 0:23 ` bug#4157: " Kenichi Handa
2009-08-19 22:47 ` Peter Dyballa
2009-08-24 11:30 ` Peter Dyballa
2009-08-24 12:22 ` bug#4157: " Kenichi Handa
2009-08-24 15:21 ` Peter Dyballa
2009-08-25 0:46 ` bug#4157: " Kenichi Handa
2009-08-25 7:51 ` Peter Dyballa
2009-08-25 22:19 ` Peter Dyballa
2009-08-27 6:52 ` bug#4157: " Kenichi Handa
2009-08-27 8:50 ` Peter Dyballa
2009-08-27 11:33 ` bug#4157: " Kenichi Handa
2009-08-27 12:38 ` Peter Dyballa
2009-08-28 19:27 ` Peter Dyballa
2009-08-31 21:11 ` Peter Dyballa
2009-09-01 0:04 ` Stefan Monnier
2009-09-04 0:58 ` Kenichi Handa
2009-08-22 4:09 ` Stefan Monnier
2009-08-22 8:50 ` Peter Dyballa
2009-08-23 1:49 ` Stefan Monnier
2009-08-23 9:57 ` Peter Dyballa [this message]
2019-10-09 14:29 ` Stefan Kangas
2019-10-09 18:48 ` Eli Zaretskii
2019-10-09 19:47 ` Stefan Monnier
2019-10-09 22:42 ` Peter Dyballa
2019-11-11 1:49 ` Stefan Kangas
2019-11-11 16:36 ` Peter Dyballa
2019-10-10 0:10 ` Stefan Kangas
2019-10-10 7:20 ` Eli Zaretskii
2019-10-10 10:36 ` Stefan Kangas
2019-10-10 11:20 ` Eli Zaretskii
2019-10-10 11:52 ` Stefan Kangas
2019-10-10 12:39 ` Stefan Kangas
2019-10-10 12:41 ` Stefan Kangas
2019-10-10 18:33 ` Peter Dyballa
2019-10-10 18:57 ` Eli Zaretskii
2019-10-10 21:07 ` Stefan Monnier
2019-10-11 13:33 ` Stefan Kangas
2019-10-11 7:10 ` Andreas Schwab
2019-10-11 7:23 ` Peter Dyballa
2019-10-10 8:15 ` Andreas Schwab
2019-10-10 12:54 ` Stefan Monnier
2019-10-10 13:12 ` Stefan Kangas
2019-11-17 20:58 ` Stefan Kangas
-- strict thread matches above, loose matches on Subject: below --
2009-09-04 5:51 川幡太一
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=FADA1BCB-8714-4FFD-827F-7F3FD7A22D19@Freenet.DE \
--to=peter_dyballa@freenet.de \
--cc=4157@emacsbugs.donarmstrong.com \
--cc=monnier@iro.umontreal.ca \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
Code repositories for project(s) associated with this external index
https://git.savannah.gnu.org/cgit/emacs.git
https://git.savannah.gnu.org/cgit/emacs/org-mode.git
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.