* bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8
@ 2010-08-16 11:09 jidanni
2010-08-16 12:22 ` Eli Zaretskii
` (4 more replies)
0 siblings, 5 replies; 11+ messages in thread
From: jidanni @ 2010-08-16 11:09 UTC (permalink / raw)
To: 6866
I demand an explanation.
$ zgrep TW mule-cmds.el
("zh_TW" . "Chinese-Big5")
You guys just *assume* that all TW people still use Big5.
One can do LC_ALL=zh_TW.UTF-8 until he is blue in the face, but still
current-language-environment is a variable defined in `mule-cmds.el'.
Its value is "Chinese-BIG5"
emacs-version "24.0.50.1"
^ permalink raw reply [flat|nested] 11+ messages in thread
* bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8
2010-08-16 11:09 bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8 jidanni
@ 2010-08-16 12:22 ` Eli Zaretskii
2010-08-16 12:23 ` Jason Rumney
` (3 subsequent siblings)
4 siblings, 0 replies; 11+ messages in thread
From: Eli Zaretskii @ 2010-08-16 12:22 UTC (permalink / raw)
To: jidanni; +Cc: 6866
> From: jidanni@jidanni.org
> Date: Mon, 16 Aug 2010 19:09:48 +0800
> Cc:
>
> I demand an explanation.
> $ zgrep TW mule-cmds.el
> ("zh_TW" . "Chinese-Big5")
> You guys just *assume* that all TW people still use Big5.
That's because they do. Case closed.
> One can do LC_ALL=zh_TW.UTF-8 until he is blue in the face
How dare you??!!!
^ permalink raw reply [flat|nested] 11+ messages in thread
* bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8
2010-08-16 11:09 bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8 jidanni
2010-08-16 12:22 ` Eli Zaretskii
@ 2010-08-16 12:23 ` Jason Rumney
2010-08-16 12:58 ` jidanni
` (2 subsequent siblings)
4 siblings, 0 replies; 11+ messages in thread
From: Jason Rumney @ 2010-08-16 12:23 UTC (permalink / raw)
To: jidanni; +Cc: 6866
On 16/8/2010 7:09 PM, jidanni@jidanni.org wrote:
> I demand an explanation.
> $ zgrep TW mule-cmds.el
> ("zh_TW" . "Chinese-Big5")
> You guys just *assume* that all TW people still use Big5.
> One can do LC_ALL=zh_TW.UTF-8 until he is blue in the face, but still
> current-language-environment is a variable defined in `mule-cmds.el'.
> Its value is "Chinese-BIG5"
> emacs-version "24.0.50.1"
Please explain what bug you think this caused.
^ permalink raw reply [flat|nested] 11+ messages in thread
* bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8
2010-08-16 11:09 bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8 jidanni
2010-08-16 12:22 ` Eli Zaretskii
2010-08-16 12:23 ` Jason Rumney
@ 2010-08-16 12:58 ` jidanni
2010-08-16 13:17 ` Jason Rumney
2010-08-16 13:39 ` jidanni
2010-08-16 15:27 ` jidanni
4 siblings, 1 reply; 11+ messages in thread
From: jidanni @ 2010-08-16 12:58 UTC (permalink / raw)
To: jasonr; +Cc: 6866
>>>>> "JR" == Jason Rumney <jasonr@gnu.org> writes:
JR> Please explain what bug you think this caused.
http://news.gmane.org/group/gmane.emacs.w3m/thread=8661
^ permalink raw reply [flat|nested] 11+ messages in thread
* bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8
2010-08-16 12:58 ` jidanni
@ 2010-08-16 13:17 ` Jason Rumney
0 siblings, 0 replies; 11+ messages in thread
From: Jason Rumney @ 2010-08-16 13:17 UTC (permalink / raw)
To: jidanni; +Cc: 6866
On 16/8/2010 8:58 PM, jidanni@jidanni.org wrote:
>>>>>> "JR" == Jason Rumney<jasonr@gnu.org> writes:
> JR> Please explain what bug you think this caused.
> http://news.gmane.org/group/gmane.emacs.w3m/thread=8661
It appears from that thread that there is a bug in w3m. It is not
apparent that it is related to your report here though, as the expected
behavior is that a Japanese search engine should only be chosen if the
current-language matches "Japanese", which this clearly does not.
^ permalink raw reply [flat|nested] 11+ messages in thread
* bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8
2010-08-16 11:09 bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8 jidanni
` (2 preceding siblings ...)
2010-08-16 12:58 ` jidanni
@ 2010-08-16 13:39 ` jidanni
2010-08-16 14:07 ` Jason Rumney
2010-08-16 17:37 ` Eli Zaretskii
2010-08-16 15:27 ` jidanni
4 siblings, 2 replies; 11+ messages in thread
From: jidanni @ 2010-08-16 13:39 UTC (permalink / raw)
To: jasonr; +Cc: 6866
Well anyway, for our locale me and my friends all use zh_TW.UTF-8 and
stopped using zh_TW.big5 years ago. So at least it looks very dumb there
in mule-cmds.el that the zh_CN people can use UTF-8, but the HK and TW
are locked in the dark ages:
("zh_HK" . "Chinese-Big5")
("zh_TW" . "Chinese-Big5")
("zh_CN.UTF-8" . "Chinese-GBK")
("zh_CN" . "Chinese-GB")
The only big5 thing I apparently sometimes still use is
$ GET http://jidanni.org/comp/configuration/.emacs | grep -i b5
(setq default-input-method 'chinese-py-punct-b5))));no 'utf' ones
^ permalink raw reply [flat|nested] 11+ messages in thread
* bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8
2010-08-16 13:39 ` jidanni
@ 2010-08-16 14:07 ` Jason Rumney
2010-08-16 16:26 ` Werner LEMBERG
2010-08-16 17:37 ` Eli Zaretskii
1 sibling, 1 reply; 11+ messages in thread
From: Jason Rumney @ 2010-08-16 14:07 UTC (permalink / raw)
To: jidanni; +Cc: 6866
On 16/8/2010 9:39 PM, jidanni@jidanni.org wrote:
> Well anyway, for our locale me and my friends all use zh_TW.UTF-8 and
> stopped using zh_TW.big5 years ago. So at least it looks very dumb there
> in mule-cmds.el that the zh_CN people can use UTF-8, but the HK and TW
> are locked in the dark ages:
>
> ("zh_HK" . "Chinese-Big5")
> ("zh_TW" . "Chinese-Big5")
> ("zh_CN.UTF-8" . "Chinese-GBK")
> ("zh_CN" . "Chinese-GB")
GBK is a backwards compatible extension of GB with more characters. I'm
not sure that Big5 has an equivalent. In all these cases, the character
set is used to select preferences for fonts, input methods and other
language sensitive things, and has nothing to do with UTF-8 (which is
used as a preference for file encoding when specified).
^ permalink raw reply [flat|nested] 11+ messages in thread
* bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8
2010-08-16 11:09 bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8 jidanni
` (3 preceding siblings ...)
2010-08-16 13:39 ` jidanni
@ 2010-08-16 15:27 ` jidanni
2010-08-16 17:41 ` Eli Zaretskii
4 siblings, 1 reply; 11+ messages in thread
From: jidanni @ 2010-08-16 15:27 UTC (permalink / raw)
To: jasonr; +Cc: 6866
>>>>> "JR" == Jason Rumney <jasonr@gnu.org> writes:
JR> In all these cases, the character set is used to select preferences
JR> for fonts, input methods and other language sensitive things, and
JR> has nothing to do with UTF-8 (which is used as a preference for file
JR> encoding when specified).
Then it is a sad choice of the name of a character set being used for
other purposes. Many users will say: didn't I make a big effort years
ago to totally convert my environment? Why do I still have traces of
big5 hanging around?
Perhaps there should be a more neutral name used. Since it seems what
you are calling Chinese-Big5 does not have much to do with
http://en.wikipedia.org/wiki/Traditional_Chinese#Computer_encoding
after all.
^ permalink raw reply [flat|nested] 11+ messages in thread
* bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8
2010-08-16 14:07 ` Jason Rumney
@ 2010-08-16 16:26 ` Werner LEMBERG
0 siblings, 0 replies; 11+ messages in thread
From: Werner LEMBERG @ 2010-08-16 16:26 UTC (permalink / raw)
To: jasonr; +Cc: 6866, jidanni
> GBK is a backwards compatible extension of GB with more characters.
> I'm not sure that Big5 has an equivalent.
Such an extension exists; it is called Big5-plus. However, AFAIK,
nobody has ever used it, and today it is obsolete since Unicode is
much better.
Werner
^ permalink raw reply [flat|nested] 11+ messages in thread
* bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8
2010-08-16 13:39 ` jidanni
2010-08-16 14:07 ` Jason Rumney
@ 2010-08-16 17:37 ` Eli Zaretskii
1 sibling, 0 replies; 11+ messages in thread
From: Eli Zaretskii @ 2010-08-16 17:37 UTC (permalink / raw)
To: jidanni; +Cc: 6866-done
> From: jidanni@jidanni.org
> Date: Mon, 16 Aug 2010 21:39:10 +0800
> Cc: 6866@debbugs.gnu.org
>
> Well anyway, for our locale me and my friends all use zh_TW.UTF-8 and
> stopped using zh_TW.big5 years ago. So at least it looks very dumb there
> in mule-cmds.el that the zh_CN people can use UTF-8, but the HK and TW
> are locked in the dark ages:
>
> ("zh_HK" . "Chinese-Big5")
> ("zh_TW" . "Chinese-Big5")
> ("zh_CN.UTF-8" . "Chinese-GBK")
> ("zh_CN" . "Chinese-GB")
Are you sure you understand what this data base is used for in Emacs?
The function within mule-cmds.el which uses this data has this
comment:
;; locale-language-names specify both lang-env and coding.
;; But, what specified in locale-preferred-coding-systems
;; has higher priority.
Thus, if you specify UTF-8 as the preferred encoding (e.g., via
LC_ALL), it overrules the Big5 default.
> The only big5 thing I apparently sometimes still use is
> $ GET http://jidanni.org/comp/configuration/.emacs | grep -i b5
> (setq default-input-method 'chinese-py-punct-b5))));no 'utf' ones
You are confused: an input method can produce Big5 characters, but
that won't prevent Emacs from encoding them in UTF-8 if that's your
preference.
I'm closing this bug.
^ permalink raw reply [flat|nested] 11+ messages in thread
* bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8
2010-08-16 15:27 ` jidanni
@ 2010-08-16 17:41 ` Eli Zaretskii
0 siblings, 0 replies; 11+ messages in thread
From: Eli Zaretskii @ 2010-08-16 17:41 UTC (permalink / raw)
To: jidanni; +Cc: 6866
> From: jidanni@jidanni.org
> Date: Mon, 16 Aug 2010 23:27:51 +0800
> Cc: 6866@debbugs.gnu.org
>
> Many users will say: didn't I make a big effort years ago to totally
> convert my environment? Why do I still have traces of big5 hanging
> around?
Users should not look into the code unless they actually read it (as
opposed to grep them with some random string) and understand what the
code does.
> Perhaps there should be a more neutral name used. Since it seems what
> you are calling Chinese-Big5 does not have much to do with
> http://en.wikipedia.org/wiki/Traditional_Chinese#Computer_encoding
> after all.
It _is_ a name of an encoding, just the Emacs name. It just isn't
used in the way you thought.
^ permalink raw reply [flat|nested] 11+ messages in thread
end of thread, other threads:[~2010-08-16 17:41 UTC | newest]
Thread overview: 11+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-08-16 11:09 bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8 jidanni
2010-08-16 12:22 ` Eli Zaretskii
2010-08-16 12:23 ` Jason Rumney
2010-08-16 12:58 ` jidanni
2010-08-16 13:17 ` Jason Rumney
2010-08-16 13:39 ` jidanni
2010-08-16 14:07 ` Jason Rumney
2010-08-16 16:26 ` Werner LEMBERG
2010-08-16 17:37 ` Eli Zaretskii
2010-08-16 15:27 ` jidanni
2010-08-16 17:41 ` Eli Zaretskii
Code repositories for project(s) associated with this public inbox
https://git.savannah.gnu.org/cgit/emacs.git
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).