unofficial mirror of bug-gnu-emacs@gnu.org 
 help / color / mirror / code / Atom feed
* bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8
@ 2010-08-16 11:09 jidanni
  2010-08-16 12:22 ` Eli Zaretskii
                   ` (4 more replies)
  0 siblings, 5 replies; 11+ messages in thread
From: jidanni @ 2010-08-16 11:09 UTC (permalink / raw)
  To: 6866

I demand an explanation.
$ zgrep TW mule-cmds.el
    ("zh_TW" . "Chinese-Big5")
You guys just *assume* that all TW people still use Big5.
One can do LC_ALL=zh_TW.UTF-8 until he is blue in the face, but still
  current-language-environment is a variable defined in `mule-cmds.el'.
  Its value is "Chinese-BIG5"
emacs-version "24.0.50.1"





^ permalink raw reply	[flat|nested] 11+ messages in thread

* bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8
  2010-08-16 11:09 bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8 jidanni
@ 2010-08-16 12:22 ` Eli Zaretskii
  2010-08-16 12:23 ` Jason Rumney
                   ` (3 subsequent siblings)
  4 siblings, 0 replies; 11+ messages in thread
From: Eli Zaretskii @ 2010-08-16 12:22 UTC (permalink / raw)
  To: jidanni; +Cc: 6866

> From: jidanni@jidanni.org
> Date: Mon, 16 Aug 2010 19:09:48 +0800
> Cc: 
> 
> I demand an explanation.
> $ zgrep TW mule-cmds.el
>     ("zh_TW" . "Chinese-Big5")
> You guys just *assume* that all TW people still use Big5.

That's because they do.  Case closed.

> One can do LC_ALL=zh_TW.UTF-8 until he is blue in the face

How dare you??!!!





^ permalink raw reply	[flat|nested] 11+ messages in thread

* bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8
  2010-08-16 11:09 bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8 jidanni
  2010-08-16 12:22 ` Eli Zaretskii
@ 2010-08-16 12:23 ` Jason Rumney
  2010-08-16 12:58 ` jidanni
                   ` (2 subsequent siblings)
  4 siblings, 0 replies; 11+ messages in thread
From: Jason Rumney @ 2010-08-16 12:23 UTC (permalink / raw)
  To: jidanni; +Cc: 6866

  On 16/8/2010 7:09 PM, jidanni@jidanni.org wrote:
> I demand an explanation.
> $ zgrep TW mule-cmds.el
>      ("zh_TW" . "Chinese-Big5")
> You guys just *assume* that all TW people still use Big5.
> One can do LC_ALL=zh_TW.UTF-8 until he is blue in the face, but still
>    current-language-environment is a variable defined in `mule-cmds.el'.
>    Its value is "Chinese-BIG5"
> emacs-version "24.0.50.1"

Please explain what bug you think this caused.







^ permalink raw reply	[flat|nested] 11+ messages in thread

* bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8
  2010-08-16 11:09 bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8 jidanni
  2010-08-16 12:22 ` Eli Zaretskii
  2010-08-16 12:23 ` Jason Rumney
@ 2010-08-16 12:58 ` jidanni
  2010-08-16 13:17   ` Jason Rumney
  2010-08-16 13:39 ` jidanni
  2010-08-16 15:27 ` jidanni
  4 siblings, 1 reply; 11+ messages in thread
From: jidanni @ 2010-08-16 12:58 UTC (permalink / raw)
  To: jasonr; +Cc: 6866

>>>>> "JR" == Jason Rumney <jasonr@gnu.org> writes:
JR> Please explain what bug you think this caused.
http://news.gmane.org/group/gmane.emacs.w3m/thread=8661






^ permalink raw reply	[flat|nested] 11+ messages in thread

* bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8
  2010-08-16 12:58 ` jidanni
@ 2010-08-16 13:17   ` Jason Rumney
  0 siblings, 0 replies; 11+ messages in thread
From: Jason Rumney @ 2010-08-16 13:17 UTC (permalink / raw)
  To: jidanni; +Cc: 6866

  On 16/8/2010 8:58 PM, jidanni@jidanni.org wrote:
>>>>>> "JR" == Jason Rumney<jasonr@gnu.org>  writes:
> JR>  Please explain what bug you think this caused.
> http://news.gmane.org/group/gmane.emacs.w3m/thread=8661

It appears from that thread that there is a bug in w3m. It is not 
apparent that it is related to your report here though, as the expected 
behavior is that a Japanese search engine should only be chosen if the 
current-language matches "Japanese", which this clearly does not.






^ permalink raw reply	[flat|nested] 11+ messages in thread

* bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8
  2010-08-16 11:09 bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8 jidanni
                   ` (2 preceding siblings ...)
  2010-08-16 12:58 ` jidanni
@ 2010-08-16 13:39 ` jidanni
  2010-08-16 14:07   ` Jason Rumney
  2010-08-16 17:37   ` Eli Zaretskii
  2010-08-16 15:27 ` jidanni
  4 siblings, 2 replies; 11+ messages in thread
From: jidanni @ 2010-08-16 13:39 UTC (permalink / raw)
  To: jasonr; +Cc: 6866

Well anyway, for our locale me and my friends all use zh_TW.UTF-8 and
stopped using zh_TW.big5 years ago. So at least it looks very dumb there
in mule-cmds.el that the zh_CN people can use UTF-8, but the HK and TW
are locked in the dark ages:

    ("zh_HK" . "Chinese-Big5")
    ("zh_TW" . "Chinese-Big5")
    ("zh_CN.UTF-8" . "Chinese-GBK")
    ("zh_CN" . "Chinese-GB")

The only big5 thing I apparently sometimes still use is
$ GET http://jidanni.org/comp/configuration/.emacs | grep -i b5
    (setq default-input-method 'chinese-py-punct-b5))));no 'utf' ones





^ permalink raw reply	[flat|nested] 11+ messages in thread

* bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8
  2010-08-16 13:39 ` jidanni
@ 2010-08-16 14:07   ` Jason Rumney
  2010-08-16 16:26     ` Werner LEMBERG
  2010-08-16 17:37   ` Eli Zaretskii
  1 sibling, 1 reply; 11+ messages in thread
From: Jason Rumney @ 2010-08-16 14:07 UTC (permalink / raw)
  To: jidanni; +Cc: 6866

  On 16/8/2010 9:39 PM, jidanni@jidanni.org wrote:
> Well anyway, for our locale me and my friends all use zh_TW.UTF-8 and
> stopped using zh_TW.big5 years ago. So at least it looks very dumb there
> in mule-cmds.el that the zh_CN people can use UTF-8, but the HK and TW
> are locked in the dark ages:
>
>      ("zh_HK" . "Chinese-Big5")
>      ("zh_TW" . "Chinese-Big5")
>      ("zh_CN.UTF-8" . "Chinese-GBK")
>      ("zh_CN" . "Chinese-GB")

GBK is a backwards compatible extension of GB with more characters.  I'm 
not sure that Big5 has an equivalent.  In all these cases, the character 
set is used to select preferences for fonts, input methods and other 
language sensitive things, and has nothing to do with UTF-8 (which is 
used as a preference for file encoding when specified).






^ permalink raw reply	[flat|nested] 11+ messages in thread

* bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8
  2010-08-16 11:09 bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8 jidanni
                   ` (3 preceding siblings ...)
  2010-08-16 13:39 ` jidanni
@ 2010-08-16 15:27 ` jidanni
  2010-08-16 17:41   ` Eli Zaretskii
  4 siblings, 1 reply; 11+ messages in thread
From: jidanni @ 2010-08-16 15:27 UTC (permalink / raw)
  To: jasonr; +Cc: 6866

>>>>> "JR" == Jason Rumney <jasonr@gnu.org> writes:

JR> In all these cases, the character set is used to select preferences
JR> for fonts, input methods and other language sensitive things, and
JR> has nothing to do with UTF-8 (which is used as a preference for file
JR> encoding when specified).

Then it is a sad choice of the name of a character set being used for
other purposes. Many users will say: didn't I make a big effort years
ago to totally convert my environment? Why do I still have traces of
big5 hanging around?

Perhaps there should be a more neutral name used. Since it seems what
you are calling Chinese-Big5 does not have much to do with
http://en.wikipedia.org/wiki/Traditional_Chinese#Computer_encoding
after all.





^ permalink raw reply	[flat|nested] 11+ messages in thread

* bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8
  2010-08-16 14:07   ` Jason Rumney
@ 2010-08-16 16:26     ` Werner LEMBERG
  0 siblings, 0 replies; 11+ messages in thread
From: Werner LEMBERG @ 2010-08-16 16:26 UTC (permalink / raw)
  To: jasonr; +Cc: 6866, jidanni


> GBK is a backwards compatible extension of GB with more characters.
> I'm not sure that Big5 has an equivalent.

Such an extension exists; it is called Big5-plus.  However, AFAIK,
nobody has ever used it, and today it is obsolete since Unicode is
much better.


    Werner





^ permalink raw reply	[flat|nested] 11+ messages in thread

* bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8
  2010-08-16 13:39 ` jidanni
  2010-08-16 14:07   ` Jason Rumney
@ 2010-08-16 17:37   ` Eli Zaretskii
  1 sibling, 0 replies; 11+ messages in thread
From: Eli Zaretskii @ 2010-08-16 17:37 UTC (permalink / raw)
  To: jidanni; +Cc: 6866-done

> From: jidanni@jidanni.org
> Date: Mon, 16 Aug 2010 21:39:10 +0800
> Cc: 6866@debbugs.gnu.org
> 
> Well anyway, for our locale me and my friends all use zh_TW.UTF-8 and
> stopped using zh_TW.big5 years ago. So at least it looks very dumb there
> in mule-cmds.el that the zh_CN people can use UTF-8, but the HK and TW
> are locked in the dark ages:
> 
>     ("zh_HK" . "Chinese-Big5")
>     ("zh_TW" . "Chinese-Big5")
>     ("zh_CN.UTF-8" . "Chinese-GBK")
>     ("zh_CN" . "Chinese-GB")

Are you sure you understand what this data base is used for in Emacs?

The function within mule-cmds.el which uses this data has this
comment:

    ;; locale-language-names specify both lang-env and coding.
    ;; But, what specified in locale-preferred-coding-systems
    ;; has higher priority.

Thus, if you specify UTF-8 as the preferred encoding (e.g., via
LC_ALL), it overrules the Big5 default.

> The only big5 thing I apparently sometimes still use is
> $ GET http://jidanni.org/comp/configuration/.emacs | grep -i b5
>     (setq default-input-method 'chinese-py-punct-b5))));no 'utf' ones

You are confused: an input method can produce Big5 characters, but
that won't prevent Emacs from encoding them in UTF-8 if that's your
preference.

I'm closing this bug.





^ permalink raw reply	[flat|nested] 11+ messages in thread

* bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8
  2010-08-16 15:27 ` jidanni
@ 2010-08-16 17:41   ` Eli Zaretskii
  0 siblings, 0 replies; 11+ messages in thread
From: Eli Zaretskii @ 2010-08-16 17:41 UTC (permalink / raw)
  To: jidanni; +Cc: 6866

> From: jidanni@jidanni.org
> Date: Mon, 16 Aug 2010 23:27:51 +0800
> Cc: 6866@debbugs.gnu.org
> 
> Many users will say: didn't I make a big effort years ago to totally
> convert my environment? Why do I still have traces of big5 hanging
> around?

Users should not look into the code unless they actually read it (as
opposed to grep them with some random string) and understand what the
code does.

> Perhaps there should be a more neutral name used. Since it seems what
> you are calling Chinese-Big5 does not have much to do with
> http://en.wikipedia.org/wiki/Traditional_Chinese#Computer_encoding
> after all.

It _is_ a name of an encoding, just the Emacs name.  It just isn't
used in the way you thought.





^ permalink raw reply	[flat|nested] 11+ messages in thread

end of thread, other threads:[~2010-08-16 17:41 UTC | newest]

Thread overview: 11+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-08-16 11:09 bug#6866: mule-cmds.el just _assumes_ all of Taiwan uses Big5 and not UTF-8 jidanni
2010-08-16 12:22 ` Eli Zaretskii
2010-08-16 12:23 ` Jason Rumney
2010-08-16 12:58 ` jidanni
2010-08-16 13:17   ` Jason Rumney
2010-08-16 13:39 ` jidanni
2010-08-16 14:07   ` Jason Rumney
2010-08-16 16:26     ` Werner LEMBERG
2010-08-16 17:37   ` Eli Zaretskii
2010-08-16 15:27 ` jidanni
2010-08-16 17:41   ` Eli Zaretskii

Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).