new Emacs HELLO file??

all messages for Emacs-related lists mirrored at yhetil.org
 help / color / mirror / code / Atom feed

* new Emacs HELLO file??
@ 2009-08-28 21:27 kawabata.taichi
  2009-08-28 22:19 ` Juri Linkov
  0 siblings, 1 reply; 40+ messages in thread
From: kawabata.taichi @ 2009-08-28 21:27 UTC (permalink / raw)
  To: emacs-devel

[-- Attachment #1: Type: text/plain, Size: 1626 bytes --]


Dear sirs,

This is just a quick thought and proposal to replace Emacs etc/HELLO file.

As Emacs 23 fully supports UCS/Unicode now, etc/HELLO file may be
extended, so that it covers the languages and scripts that had not been
covered by the previous Emacs versions.

Also, HELLO file may contain symbols and rare or ancient scripts, so
that it contains at least one character from each UCS/Unicode blocks.
That would help Emacs users to quick-check which fonts their Emacsen are
missing.

I've collected the "Hello" entries from various on-line dictionaries
over the Internet (especially omniglot.com is useful).  Languages are
selected so that at most two or three languages are shown for each
script (except Latin script, which has several significant languages).
I gathered scripts that I could not find "Hello" entries to separate
entries, and classified them by writing directions.

The proposed etc/HELLO file attached below may contain some
inappropriateness or mistakes, which I appreciate if someone could point
out or fix.  Also, most of script samples merely line up the characters.
Refining them to meaningful text is also appreciated.


=====================

The quickest way to view all characters in this HELLO.txt file is to
install "Code2000" font, Mr. George Douros's Unicode Symbol font, "MPH
Damase 2D" font and Cyrillic font by BukyVede.  That would cover most of
Unicode 5.1 characters.  Beside them, specific Tibetan, Tagalog, Arabic
Supplement, Sinhalese and Sundanese font fill the gap.  Still, Font for
"New Tai Lue", "Balinease", "Lepcha" and few other script seems not
available freely for now.


[-- Attachment #2: HELLO.txt --]
[-- Type: text/plain, Size: 11422 bytes --]

This is a list of ways to say hello in various languages.
(For symbols and some scripts, only sample texts or characters are shown.)

Languages and scripts are classified by writing directions.


1. Scripts written from left to right.

LANGUAGE (NATIVE NAME)	HELLO
----------------------	-----
Ainu (アイヌ イタㇰ)	イランカラㇷ゚テ
Amharic (አማርኛ)	ሠላም
Armenian (Հայերէն)	բարև
Bengali (বাংলা)	নমস্কার
Braille	⠓⠑⠇⠇⠕
Burmese (မ္ရန္‌မာ)	မင္‍ဂလာပာ
C	printf ("Hello, world!\n");
Cantonese (粵語 / 廣東話)	早晨 / 你好 / 喂
Cherokee (ᏣᎳᎩ ᎧᏬᏂᎯᏍᏗ)	ᎣᏏᏲ
Chinese (中文 / 普通话 / 汉语 / 漢語)	你好, 您好
Comanche (Nu̶mu̶ tekwapu̶̲)	Marú̶awe!
Cree (ᓀᐦᐃᔭᐍᐏᐣ / ᓀᐦᐃᔭᐤ)	ᐊᑕᒥᐢᑳᑐᐃᐧᐣ
Czech (čeština)	Dobrý den
Danish (dansk)	Hej / Goddag / Halløj
Deseret (𐐼𐐯𐑅𐐨𐑉𐐯𐐻 𐐰𐑊𐑁𐐩𐐺𐐯𐐻)	𐐸𐑩𐑊𐐬
Dutch (Nederlands)	Hallo / Dag
Dzongkha (རྫོང་ཁ)	སྐུ་གཟུགས་བཟང་པོ
Emacs	emacs --no-splash -f view-hello-file
English /ˈɪŋɡlɪʃ/	Hello
French (français)	Bonjour / Salut
Georgian (ქართველი)	გამარჯობა
German (Deutsch)	Guten Tag / Grüß Gott
Greek (ελληνικά)	Γειά σας
Greek, Polytonic	Ἐμπρός! (on phone)
Gujarati (ગુજરાતી)	નમસ્તે
Hindi (हिंदी)	नमस्ते ।
Inuktitut (ᐃᓄᒃᑎᑐᑦ)	ᐊᐃ / ᐊᐃᓐᖓᐃ
Italian (italiano)	Ciao / Buon giorno
Japanese (日本語)	今日は。 / ｺﾝﾆﾁﾊ
Javanese (Jawa)	System.out.println("Sugeng siang!");
Kannada (ಕನ್ನಡ)	ನಮಸ್ಕಾರ
Kanza (Kaáⁿze)	ho / hawé
Khmer (ភាសាខ្មែរ)	ជំរាបសួរ / សួស្ដី
Koasati (Kowassá:tit)	Cikáʔnó!
Korean (한글 / 韓國語)	안녕하세요 / 안녕하십니까 / 안녕
Lao (ພາສາລາວ)	ສະບາຍດີ / ຂໍໃຫ້ໂຊກດີ
Malayalam (മലയാളം)	നമസ്കാരം
Marathi (मराठी)	नमस्कार ।
Mathematics	∀ p ∈ world • hello p  □
Oriya (ଓଡ଼ିଆ)	ଶୁଣିବେ
Punjabi (ਪੰਜਾਬੀ)	ਸਤ ਸ੍ਰੀ ਅਕਾਲ.
Russian (русский)	Здра́вствуйте!
Shavian (𐑖𐑭𐑝𐑾𐑯)	𐑣𐑩𐑤𐑴
Sinhala (සිංහල)	ආයුබෝවන්
Spanish (español)	¡Hola!
Swedish (på svenska)	Hej / Goddag / Hallå
Tagalog (ᜊᜌ᜔ᜊᜌᜒᜈ᜔)	ᜋᜊᜓᜑᜌ᜔
Tamil (தமிழ்)	வணக்கம்
Telugu (తెలుగు)	నమస్కారం
Thai (ภาษาไทย)	สวัสดีครับ / สวัสดีค่ะ
Tibetan (བོད་སྐད་)	བཀྲ་ཤིས་བདེ་ལེགས༎
Tigrigna (ትግርኛ)	ሰላማት
Vietnamese (tiếng Việt)	Chào bạn
Yoruba (Yorùbá)	Ẹ n lẹ


SCRIPT NAME	SAMPLES
----------------------	-----
Balinese (ᬩᬲ ᬩᬮᬶ)	ᬓᬔᬕᬖᬗᬘᬙᬚᬛᬜᬝᬞᬟ 
Buginese (ᨅᨔ ᨕᨘᨁᨗ)	ᨕᨗᨕᨊᨕᨙ ᨔᨛᨄᨒᨚ 
Buhid (ᝊᝓᝑᝒᝇ)	ᝀᝁᝂᝃᝄᝅ 
Carian	𐊠𐊥𐊣𐊹𐊮𐊸 𐊲𐊥𐊰𐊴𐊣𐊺𐊸 𐊽𐊹𐊾𐊩𐊰𐊹𐊸 
Carrier (ᑐᑊᘁᗕᑋᗸ)	ᗺᗹᗵᗷᗶ 
Cham	ꨁꨗꨩꨈꨮ 
CJK Radicals	⼀⼁⼂⼃⼄⼅⼆⼇⼈⼉ / ⺀⺁⺂⺃⺄⺅⺆⺇⺈⺉
CJK Unified Ideograph Extension-A	㐀㐁㐂㐃㐄㐅㐆㐇㐈㐉㐊㐋㐌㐍㐎㐏
CJK Unified Ideograph Extension-B	𠀀𠀁𠀂𠀃𠀄𠀅𠀆𠀇𠀈𠀉𠀊𠀋𠀌𠀍𠀎𠀏
CJK Unified Ideograph Extension-C	𪜀𪜁𪜂𪜃𪜄𪜅𪜆𪜇𪜈𪜉𪜊𪜋𪜌𪜍𪜎𪜏
Coptic	ⲘⲒⲞ.Ⲕ 
Cuneiform	𒀀𒀁𒀂𒀃𒀄𒀅𒀆𒀇𒀈𒀉𒀊𒀋𒀌𒀍𒀎𒀏 
Cyrillic Supplement	ԀԁԂԃԄԅԆԇԈԉԊԋԌԍԎԏ
Cyrillic Extended-A	ⷠⷡⷢⷣⷤⷥⷦⷧⷨⷩⷪⷫⷬⷭⷮⷯ
Cyrillic Extended-B	ꙀꙁꙂꙃꙄꙅꙆꙇꙈꙉꙊꙋꙌꙍꙎꙏ
Ethiopic Extended	ⶀⶁⶂⶃⶄⶅⶆⶇⶈⶉⶊⶋⶌⶍⶎⶏ
Ethiopic Supplement	ᎀᎁᎂᎃᎄᎅᎆᎇᎈᎉᎊᎋᎌᎍᎎᎏ
Georgian Supplement	ⴀⴁⴂⴃⴄⴅⴆⴇⴈⴉⴊⴋⴌⴍⴎⴏ
Glagolitic	ⰙⰂⰍⰌⰇⰟⰘ 
Gothic	𐌰𐍄𐍄𐌰 𐌿𐌽𐍃𐌰𐍂 𐌸𐌿 𐌹𐌽 
Hanunoo (ᜱᜨᜳᜨᜳᜢ)	ᜣᜫᜨᜳᜰᜲ 
Kayah Li	꤁꤂꤃꤄꤅ 
Latin Extended-C	ⱠⱡⱢⱣⱤⱥⱦⱧⱨⱩⱪⱫⱬⱭⱮⱯ
Latin Extended-D	꜠꜡ꜢꜣꜤꜥꜦꜧꜨꜩꜪꜫꜬꜭꜮꜯ
Lepcha	ᰣᰕᰧᰅ 
Limbu	ᤀᤁᤂᤃᤄᤅᤆᤇᤈ 
Lycian	𐊀𐊁𐊂𐊃𐊄𐊅𐊆𐊇 
Lydian	𐤠𐤡𐤢𐤣𐤤
New Tai Lue	ᦀᦁᦂᦃᦄᦅᦆᦇ
Ogham	᚛ᚁᚂᚃᚄᚅᚆᚇᚈᚉᚊᚋᚌᚍᚎᚏᚐᚑᚒᚒᚔ᚜ 
Old Persian	𐎠𐎡𐎢𐎣𐎤𐎥𐎦𐎧𐎨𐎩𐎪𐎫𐎬𐎭𐎮𐎯 
Osmanya	𐒀𐒁𐒂𐒃𐒄𐒅𐒆𐒇𐒈𐒉𐒊𐒋𐒌𐒍𐒎𐒏 
Phaistos Disc	𐇑𐇛𐇜𐇐𐇡 
Phonetic Extensions	ᴀᴁᴂᴃᴄᴅᴆᴇᴈᴉᴊᴋᴌᴍᴎᴏ
Phonetic Extension Supplement	ᶀᶁᶂᶃᶄᶅᶆᶇᶈᶉᶊᶋᶌᶍᶎᶏ
Rejang	ꤰꤱꤲꤴꤵ　
Runic	ᛒᛁᛏᚱᛅᛁᛋ᛬ᛚᛅᛁᚠᛅ᛬ᚠᚢᛋᛏᚱᛅ᛬ᚴᚢᚦᛅᚾ᛬ᚦᛅᚾ᛬ᛋᚭᚾ᛬ᛁᛚᛅᚾ᛭ 
Santali (Ol Chiki)	ᱟᱲ.ᱟ. 
Saurashtra	ꢂꢒꢂꢬꢣꢶ 
Sundanese	ᮀᮁᮂᮃᮄᮅᮆᮇᮈᮉᮊᮋᮌᮍᮎᮏ 
Syloti Nagri	ꠀꠇꠣꠌꠤꠐꠥꠔꠦ 
Tagbanwa (ᝤᝪᝨᝯ)	ᝠᝡᝢᝣᝤᝥᝦᝧᝨᝩᝪᝫᝬ 
Tifinagh (ⵜⵉⴼⵉⵏⴰⵖ)	ⴰⴱⴲⴳⴴⴵⴶⴷⴸⴹⴺⴻⴼⴽⴾⴿ
Tai Le (ᥖᥭᥰᥖᥬᥳᥑᥨᥒᥰ)	ᥐᥑᥒᥓᥔᥕᥖᥗᥘᥙᥚᥛᥜᥝᥞᥟ
Ugaritic	𐎀𐎁𐎂𐎃𐎄𐎅𐎆𐎇𐎈𐎉𐎊𐎋𐎌𐎍𐎎𐎏 
Vai	ꔀꔁꔂꔃꔄꔅꔆꔇꔈꔉꔊꔋꔌꔍꔎꔏ 
Yi (ꆇꉙ)	ꉷꆀꅇꌫꏦ


\f

2. Scripts written from Right to Left.

LANGUAGE (NATIVE NAME)	HELLO
----------------------	-----
Arabic (ةّيبرعلا)	مكيلع مالّسلا
Aramaic, Syriac (ܠܫܢܐ ܤܘܪܝܝܐ)	ܐܵܝ! / ܐܳܝ!
Dhivehi (ހިވެދި)	ކިހިނެތް؟ / ހާލު ކިހިނެތް؟ 
Hebrew (תירבע)	שלום
Persian (فارسى)	سلام / درود
Yiddish (ײִדיש / מאַמע לשון)	אַ גוטן טאָג


SCRIPT NAME	SAMPLES
----------------------	-----
Arabic Supplement	ݐݑݒݓݔݕݖݗݘݙݚݛݜݝݞݟ
Cypriot Syllabary	𐠀𐠁𐠂𐠃𐠄𐠅𐠈𐠊𐠋𐠌𐠍𐠎𐠏
Kharoshthi	𐨠𐨡𐨢𐨣𐨤𐨥𐨦𐨧𐨨𐨩𐨪𐨫𐨬𐨭𐨮𐨯
Linear B	𐀀𐀁𐀂𐀃𐀄𐀅𐀆𐀇 / 𐂀𐂁𐂂𐂃𐂄𐂅𐂆𐂇
N'Ko (ߒߞߏ)	߀߁߂߃߄߅߆߇߈߉ߊߋߌߍߎߏ
Old Italic	𐌀𐌁𐌂𐌃𐌄𐌅𐌆𐌇𐌈𐌉𐌊𐌋𐌌𐌍𐌎𐌏
Phoenician	𐤀𐤁𐤂𐤃𐤄𐤅𐤆𐤇𐤈𐤉𐤊𐤋𐤌𐤍𐤎𐤏


\f

3. Scripts written from Top to Bottom

LANGUAGE (NATIVE NAME)	HELLO
----------------------	-----
Japanese (日本語)	もし〳〵 (Vertical Repeat Mark)
Mongolian (ᠮᠣᠨᠭᠣᠯ ᠪᠢᠴᠢᠭ)	ᠰᠠᠢ᠋ᠨ ᠪᠠᠢ᠋ᠨ ᠤᠦ


SCRIPT NAME	SAMPLES
----------------------	-----
Kanbun (漢文)	使㆟籍誠不㆚以㆘蓄㆓妻子㆒憂㆗飢寒㆖乱㆙㆑心、
	有㆑銭以済㆞医薬㆝。
Manchurian (ᠮᠠᠨᠵᡠ)	ᡶᡠᡯᡳ ᡥᡝᠨᡩᡠᠮᡝ᠈ ᡨᠠᠴᡳᠮᠪᡳᠮᡝ ᡝᡵᡳᠨᡩᡝᡵᡳ
	ᡠᡵᡳᠪᡠᠴᡳ᠈ ᡳᠨᡠ ᡠᡵᡤᡠᠨ ᠸᠠᡴᠠ᠉
Phags-pa	ꡏꡟ ꡋꡞ ꡏꡟ ꡋꡞ ᠂ ꡏ ꡜꡖ ꡏꡟ ꡋꡞ ᠂ ꡓꡞ ꡏꡟ ᠁ 

\f

4. Numbers and Symbols

SYMBOL NAMES	EXAMPLES
----------------------	-----
Aegean Numbers	𐄀𐄁𐄂𐄇 𐄈𐄉𐄊𐄋𐄌𐄍𐄎𐄏
Ancient Greek Musical Notation	𝈀𝈁𝈂𝈃𝈄𝈅𝈆𝈈𝈉𝈊𝈋𝈌𝈍𝈎𝈏
Ancient Greek Numbers	𐅀𐅁𐅂𐅃𐅄𐅅𐅆𐅇𐅈𐅉𐅊𐅋𐅌𐅍𐅎𐅏
Ancient Symbols	𐆐𐆑𐆒𐆓𐆔𐆕𐆖𐆗𐆘𐆙𐆚𐆛
Arrows	←↑↠↡↰↱⇀⇁⟰⟱⟲⟳⤀⤁⤐⤑⤠⤡⤰⤱⥀⥁
Block Elements	▀▁▂▃▄▅▆▇▐░▒▓▔▕▖▗
Box Drawing	┌┐└┘├┤┬┴
Byzantine Musical Symbols	𝀰𝀱𝀲𝀳𝀴𝀵𝀶𝀷
Combining Diacritical Marks	à á â ã ā a̅ ă ȧ ä ả å a̋ ǎ a̍ a̎ ȁ
    (For Symbols)	a⃐ a⃑ a⃒ a⃓ a⃔ a⃕ a⃖ a⃗
    (Marks Supplement)	a᷀ a᷁ a᷂ a᷃ a᷄ a᷅ a᷆ a᷇ a᷈ a᷉ a᷊ a᷋ a᷌ a᷍ a᷎ a᷏
Control Pictures	␁␂␃␄␅␆␇␈␉␊␋␌␍␎␏
Counting Rod Numerals	𝍠𝍡𝍢𝍣𝍤𝍥𝍦𝍧𝍨𝍩𝍪𝍫𝍬𝍭𝍮𝍯
Currency Symbols	$¢£¤¥₠₡₢₣₤₥₦₧₨₩₪₫₭₮₯
Dingbats	✁✆✇✈✉✌✍✐✒✓✟✠
Domino Tiles	🀰🀲🁂🁒🁛🁢🁤🁴🂄🂍
Enclosed Alphanumerics	①②③④⑴⑵⑶⑷⒈⒉⒊⒋⒜⒝⒞⒟ⒶⒷⒸⒹⓐⓑⓒⓓ⓵⓶⓷⓸
Geometric Shapes	■□▢▣▤▥▦▧▰▱▲△▴▵▶▷◀◁◂◃◄◅◆◇◐◑◒◓◔◕◖◗
Ideographic Description Characters	字=⿱宀子
Khmer Symbols	᧠᧡᧢᧣᧤᧥᧦᧧᧨᧩᧪᧫᧬᧭᧮᧯
Letterlike Symbols	℀℁ℂ℃℄℅℆ℇ℈℉ℊℋℌℍℎℏ
Mahjong Tiles	🀀🀁🀂🀃🀆🀅🀄🀇🀏🀐🀘🀙🀡🀢🀦🀪🀫
Mathematical Alphanumeric Symbols	𝐀𝐁𝐂𝐃𝐄𝐅𝐆𝐇𝐈𝐉𝐊𝐋𝐌𝐍𝐎𝐏
Mathematical Operators	∀∂∃∄∈∊∌∑∓√⨀⨁⨂⨃⨐⨠⨡⨢⨰⨱⨲⩀⩁⩂
Mathematical Symbols	⟀⟁⟂⟃⟐⟑⟒⟓⟠⟡⟢⟣⦀⦁⦂⦃⦄
Misellaneous Symbols	☀☁☂☃☄★☆☐☑☒☓☔♀♁♂♃♠♡♢♣♨♰♲♳♴⚀⚁⚐⚒⚓⚠⚡⚢
Modifier Tone Letters	꜀꜁꜂꜃꜄꜅꜆꜇꜈꜉꜊꜋꜌꜍꜎꜏
Musical Symbols	𝆰𝆱𝆲𝆳𝆴𝆵𝆶𝆷𝆸𝆹𝆺𝆹𝅥𝆺𝅥𝆹𝅥𝅮𝆺𝅥𝅮𝆹𝅥𝅯
Number Forms	⅓⅔⅕⅖ⅠⅡⅢⅣⅰⅱⅲⅳↀↁↂↃↄ
Optical Character Recognition	⑀⑁⑂⑃⑄⑅⑆⑇⑈⑉⑊
Superscripts and Subscripts	⁰¹²³²⁴⁵⁶⁷⁸⁹ ₀₁₂₃₄₅₆₇₈₉
Supplemental Punctuation	⸀⸁⸂⸃⸄⸅⸆⸇⸈⸉⸊⸋⸌⸍⸎
Tai Xuan Jing Symbols	𝌰𝌱𝌲𝌳𝌴𝌵𝌶𝌷𝌸𝌹𝌺𝌻𝌼𝌽𝌾𝌿
Technical Symbols	⌀⌁⌂⌃⌐⌑⌒⌠⌡⌰⌱⌲⌳⍀⍁⍂⍐⍑⍒
Yijing Hexagram Symbols	☰☱☲☳☴☵☶☷

\f

5. Special Characters

NAME	SAMPLES
----------------------	-----
LANGUAGE TAGS	󠁈󠁅󠁌󠁌󠁏
VARIATION SELECTORS	邊 vs. 邊󠄀, 邊󠄁, 邊󠄂, 邊󠄃, 邊󠄄, 邊󠄅, 邊󠄆, 邊󠄇

\f



Copyright (C) 2001, 2002, 2003, 2004, 2005, 2006, 2007, 2008, 2009
  Free Software Foundation, Inc.

This file is part of GNU Emacs.

GNU Emacs is free software: you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation, either version 3 of the License, or
(at your option) any later version.

GNU Emacs is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
GNU General Public License for more details.

You should have received a copy of the GNU General Public License
along with GNU Emacs.  If not, see <http://www.gnu.org/licenses/>.

;;; Local Variables:
;;; tab-width: 38
;;; End:

[-- Attachment #3: Type: text/plain, Size: 118 bytes --]


-- 
-----------------------------------------------------------
  (kawabata.taichi@gmail.com)       KAWABATA, Taichi

^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-08-28 21:27 new Emacs HELLO file?? kawabata.taichi
@ 2009-08-28 22:19 ` Juri Linkov
  2009-08-29  2:16   ` 川幡 太一
  0 siblings, 1 reply; 40+ messages in thread
From: Juri Linkov @ 2009-08-28 22:19 UTC (permalink / raw)
  To: kawabata.taichi; +Cc: emacs-devel

> This is just a quick thought and proposal to replace Emacs etc/HELLO file.

Very nice!  And the last part even can be used as a Unicode character palette.

However, you removed Esperanto, Estonian, Finnish, Hungarian, Maltese,
Nederlands, Norwegian, Polish, Slovak, Slovenian, Turkish, Ukrainian.
This is too bad.

-- 
Juri Linkov
http://www.jurta.org/emacs/

^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-08-28 22:19 ` Juri Linkov
@ 2009-08-29  2:16   ` 川幡 太一
  2009-08-29  3:31     ` Stefan Monnier
                       ` (2 more replies)
  0 siblings, 3 replies; 40+ messages in thread
From: 川幡 太一 @ 2009-08-29  2:16 UTC (permalink / raw)
  To: Juri Linkov; +Cc: emacs-devel

川幡です。

>> In <87hbvrk2df.fsf@mail.jurta.org>, 
>> Mr. Linkov wrote:

> Very nice!  And the last part even can be used as a Unicode character palette.

> However, you removed Esperanto, Estonian, Finnish, Hungarian, Maltese,
> Nederlands, Norwegian, Polish, Slovak, Slovenian, Turkish, Ukrainian.
> This is too bad.

Whoops.  Sorry, they are as follows.

Esperanto	Saluton (Eĥoŝanĝo ĉiuĵaŭde)
Estonian (eesti keel)	Tere päevast / Tere õhtust
Finnish (suomi)	Hei / Hyvää päivää
Maltese (il-Malti)	Bonġu / Saħħa / merħba
Nederlands, Vlaams	Hallo / Dag
Norwegian (norsk)	Hei / God dag
Polish  (język polski)	Dzień dobry! / Cześć!
Slovak (slovenčina)	Ahoj / Ahojte / Nazdar / Servus
Slovenian (slovenščina)	Pozdravljeni!
Turkish (Türkçe)	Merhaba
Ukrainian (українська)	Вітаю

FYI: I've also collected the followings, but I thought putting all of
them would make the file too much Latin-script centric, too much
difficult for verification.

=========================================
Abenaki (Alnôba)	Kwai Kwai!
Abkhaz (аҧсуа бызшәа)	мыш бзи!
Afar (`Afa'r af)	Mahisse
Afrikaans	Hallo, Goeie dag
Alabama (Albaamo innaaɬiilka)	Chíkmàa
Albanian (shqip / gjuha shqipe)	C'kemi, Tungjatjeta
Alsatian	Hallo, Güete Tag
Aragonese	Ola
Arapaho (Hinonoéitiit)	Héébee, Tous
Aromanian / Vlach (Armãneashti)	bunã dzua
Arrernte	Werte
Assamese (অসমীয়া)	নমস্কার
Asturian	Hola, Bonos díes
Avar (магIарул мацI)	ВорчIами! / ЙорчIами!
Balkar (малкъар/балкъар)	Кюнюгюз ашхы болсун!
Basque (euskara)	Kaixo
Belarusian (Беларуская мова)	Вітаю, Дзень добры
Bhojpuri (भोजपुरी)	प्रणाम / परणाम 
Bosnian (Bosanski)	Dobar dan Zdravo / Merhaba
Bulgarian (български)	Здравейте
Caddo (Hasí:nay)	Káhaʔahat!
Catalan (català)	Salud, Mat an traoù ganeoc'h?
Catawba	tɑnakɛ!
Chamorro	Håfa ådai / Buenas
Chechen (Нохчийн мотт)	Салам
Cheyenne (Etsėhesenestse)	Haáhe!
Chichewa (Chicheŵa)	Moni bambo!
Chickasaw (Chikasha)	Chokma!
Chinese, Hakka (客话 / 𠊎话)	爾好
Chiricahua	Hadínyaa?
Choctaw (Chahta Anumpa)	Halito
Cimbrian	Guuten takh
Congo	mambo
Cornish (Kernewek / Kernowek)	Dydh da, Hou, You, Ha, Hou sos
Corsican (corsu)	Bonghjornu
Croatian (Hrvatski)	Bok / Čao / Dobar Dan
Dalecarian	Hej / Høj / Góðdag
Dinka (Thuɔŋjäŋ)	Ci yi bak
Faroese (Føroyska)	Hallo / Hey
Fijian (Vakaviti)	Bula / Drau bula / Dou bula / Nibula
Frisian, North (Noordfreesk)	Moin / Guundach
Frisian, West (Frysk)	A goeie/ Hoi / Goeie / Goedei
Friulan (furlan / marilenghe)	Bundì / Mandi
Gaelic, Scottish (Gàidhlig)	Halò, Ciamar a tha thu / sibh?
Galician (Galego)	Ola
German, Low (Plattdüütsch)	Moin / Goden Dag
German, Swiss (Schwyzerdütsch)	hallo
Greenlandic (kalaallisut)	Inuugujoq, kutaa / Haluu
Haitian Creole (Kreyòl ayisyen)	Bonjou
Hawaiian (ʻōlelo Hawaiʻi)	Aloha
Herero (Otjiherero)	Tjike
Himba	Mono
Hungarian (magyar)	Szia! / Szép jó napot!
Icelandic (Íslenska)	Halló / Góðan dag
Ido	Saluto
Igbo	nde-ewo
Indonesian (Bahasa Indonesia)	Selamat siang
Interlingua	Salute
Irish (Gaeilge)	Dia duit
Iñupiaq (Inupiatun)	Halauġikpiñ
Jersey Norman (Jèrriais)	Salut / Bouônjour
Jutish	Godaw
Karelian	Terveh!
Kashubian (kaszëbsczi jãzëk)	Witôjze
Kazakh (Қазақ тілі)	Сәлем! / Сәлеметсіз бе?
Khanty	Вўҫя вуӆа!
Kickapoo	Aho
Kiowa	Hā́chò
Klingon	nuqneH?
Komi (коми кыв)	чолöм / Видза олан!
Kurdish (к’öрди)	Sillaw / Ew kata bash / Em kata bash
Kwanyama (kuanyama)	wa uhala po / meme
Latin (Lingua Latina)	Heus / Ave / Salve / Salvete
Latvian (latviešu valoda)	Sveiki
Lenape (Lënapei)	Hè
Limburgish (Lèmburgs)	Hallo
Lingala (lingála)	mbote
Lithuanian (lietuvių kalba)	Labas / Sveikas / Sveiki
Livonian (Līvõ kēļ)	Tēriņtš!
Lojban	coi
Luxembourgish (Lëtzebuergesch)	Moïen / Salut
Macedonian (македонски)	Здраво
Malay (Bahasa melayu)	Selamat pagi / Selamat petang
Mansi (Маньси)	Пася о̄лэн!
Manx (Gaelg / Gailck)	Dy bannee diu, Ta shiu/oo cheet!
Maori (te Reo Māori)	Kia ora / Kia ora rā kōrua
Mari, Hill	Шӓлӓ!
Mari, Meadow (марий йылме)	Салам!
Mohawk (Kanien'kéha)	kwe kwe
Moksha (мокшень кяль)	Шумбра́т! / Шумбра́тада!
Mongolian (монгол)	Сайн уу?
Nahuatl (nāhuatl)	niltze, hao
Nama (Namagowab)	!gai / oas
Navajo (Diné Bizaad)	Y´'át'ééh
Neapolitan	cia, cha
Nenets, Tundra	Торова!
Nepal Bhasa (नेपाल भाषा)	ज्वजलपा
Nepali (नेपाली)	नमस्ते
Niuean (ko e vagahau Niuē)	faka lofa lahi atu
Occitan	Bonjorn!
Old English	Wes hāl / Wesaþ hāl
Pig Latin	eyhay
Pitjantjatjara	Wai, Wai palya
Plautdietsch	Goondach
Portuguese (português)	Oi! E aí? Tudo bem? Tudo certo? Opa!
Rajasthani (राजस्थानी)	राम् राम्.
Romanian (limba română)	Salut
Sami, Inari (säämegiella)	Tiervâ!
Sami, Northern (sámi / sámegiella)	Bures!
Sami, Southern (saemien giele)	Tiõrv!
Samoan (Gagana Samoa)	Talofa
Sardinian (Limba Sarda)	Bona die
Senegal	salamaleikum
Serbian (српски)	Здраво
Shona (chiShona)	Mhoro / Mhoroi
Sicilian (sicilianu)	Ciau
Sotho, Southern (seSotho)	Lumela / Dumela / Dumelang
Stellingwarfs	Hoj
Sulka	marot
Swahili (Kiswahili)	Habari / Hujambo
Tetum (tetun)	bondia (morning), botarde (evening)
Tok Pisin (Tok Pisin)	Gude
Tongan (Faka-Tonga)	Malo e lelei / Malo e tau lava
Tsez (цез мец)	АсаламугIалейкум!
Tshiluba	moyo
Tsonga (xiTsonga)	minjhani
Udmurt (удмурт кыл)	Умой!
Uzbek (o'zbek tili)	Assalomu Alaykum!
Venda (tshiVenḓa)	I nhlikanhi
Veps	Tervhen!
Volapük	Glidis
Walloon (walon)	Bondjoû
Warlpiri	Ngurrju mayinpa
Welsh (Cymraeg / Y Gymraeg)	Helô / Hylô / Shwmae
Xhosa (isiXhosa)	Molo / Molweni
Yappese	Mogethin
Zulu (isiZulu)	Sawubona / Sanibonani
Urdu (اردو)	السلام علیکم

-- 
---------------------------------------------------------------------
  川幡 太一  (kawabata.taichi@gmail.com)       KAWABATA, Taichi

^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-08-29  2:16   ` 川幡 太一
@ 2009-08-29  3:31     ` Stefan Monnier
  2009-08-29  5:46     ` Daniel Clemente
  2009-08-29 22:43     ` Juri Linkov
  2 siblings, 0 replies; 40+ messages in thread
From: Stefan Monnier @ 2009-08-29  3:31 UTC (permalink / raw)
  To: 川幡 太一; +Cc: Juri Linkov, emacs-devel

>> Very nice!  And the last part even can be used as a Unicode
>> character palette.

>> However, you removed Esperanto, Estonian, Finnish, Hungarian, Maltese,
>> Nederlands, Norwegian, Polish, Slovak, Slovenian, Turkish, Ukrainian.
>> This is too bad.

> Whoops.  Sorry, they are as follows.

> Esperanto	Saluton (Eĥoŝanĝo ĉiuĵaŭde)
> Estonian (eesti keel)	Tere päevast / Tere õhtust
> Finnish (suomi)	Hei / Hyvää päivää
> Maltese (il-Malti)	Bonġu / Saħħa / merħba
> Nederlands, Vlaams	Hallo / Dag
> Norwegian (norsk)	Hei / God dag
> Polish  (język polski)	Dzień dobry! / Cześć!
> Slovak (slovenčina)	Ahoj / Ahojte / Nazdar / Servus
> Slovenian (slovenščina)	Pozdravljeni!
> Turkish (Türkçe)	Merhaba
> Ukrainian (українська)	Вітаю

This looks like a good improvement.  Can someone install it, please?


        Stefan




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-08-29  2:16   ` 川幡 太一
  2009-08-29  3:31     ` Stefan Monnier
@ 2009-08-29  5:46     ` Daniel Clemente
  2009-08-29 22:43     ` Juri Linkov
  2 siblings, 0 replies; 40+ messages in thread
From: Daniel Clemente @ 2009-08-29  5:46 UTC (permalink / raw)
  To: emacs-devel

El sáb, ago 29 2009 a les 04:16, 川幡 太一 va escriure:
>
> FYI: I've also collected the followings, but I thought putting all of
> them would make the file too much Latin-script centric, too much
> difficult for verification.
>

  Not every language you listed uses Latin script. Urdu, for instance, could be interesting to include.

  In addition there are some errors in your listing. For instance what you listed as Catalan is Breton. In Catalan it would be „Hola, Bon dia“ (Hi, good day“).

Bon dia

^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-08-29  2:16   ` 川幡 太一
  2009-08-29  3:31     ` Stefan Monnier
  2009-08-29  5:46     ` Daniel Clemente
@ 2009-08-29 22:43     ` Juri Linkov
  2009-08-30  1:16       ` kawabata.taichi
  2 siblings, 1 reply; 40+ messages in thread
From: Juri Linkov @ 2009-08-29 22:43 UTC (permalink / raw)
  To: 川幡 太一; +Cc: emacs-devel

> FYI: I've also collected the followings, but I thought putting all of
> them would make the file too much Latin-script centric, too much
> difficult for verification.

Omitting part of languages might cause some Emacs users to think that
we consider their languages insignificant.  OTOH, adding more languages =
more loyal Emacs users.  I mean the following reasoning of a novice
looking at the HELLO file: "Emacs supports my language - good! I'll use it."

If it is difficult to verify a group of scripts, we could group
languages by Unicode script names ordered by Unicode ranges like in
http://www.unicode.org/charts/  You can use language-script mapping in
http://www.unicode.org/cldr/data/charts/supplemental/languages_and_scripts.html

-- 
Juri Linkov
http://www.jurta.org/emacs/




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-08-29 22:43     ` Juri Linkov
@ 2009-08-30  1:16       ` kawabata.taichi
  2009-08-30  3:12         ` Eli Zaretskii
  2009-08-30 20:48         ` Juri Linkov
  0 siblings, 2 replies; 40+ messages in thread
From: kawabata.taichi @ 2009-08-30  1:16 UTC (permalink / raw)
  To: Juri Linkov; +Cc: emacs-devel

>> In <87fxba1e92.fsf@mail.jurta.org>, Mr. Linkov wrote:

> Omitting part of languages might cause some Emacs users to think that
> we consider their languages insignificant.  OTOH, adding more languages =
> more loyal Emacs users.  I mean the following reasoning of a novice
> looking at the HELLO file: "Emacs supports my language - good! I'll
> use it."

How about adding new language per request basis??  If I do not
misunderstand, main purposes of `HELLO' is to show and check
multilingual (multi-script) capability of Emacs and font availability in
user's installation.

`Removing' existing languages in HELLO file may not certainly be
appropriate.  As there are thousands of languages in the world,
attempting to add every languae unconditionaly may shadow the purpose of
HELLO file.

> If it is difficult to verify a group of scripts, we could group
> languages by Unicode script names ordered by Unicode ranges like in
> http://www.unicode.org/charts/

Difficulty comes from verifying if languages collected from the Internet
is really correct, or did I properly copy them to the text file.  Thanks
for pointing me of an error on Breton and Catalan...  That's my own
mistake.

-- 
---------------------------------------------------------------------
  (kawabata.taichi@gmail.com)       KAWABATA, Taichi

^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-08-30  1:16       ` kawabata.taichi
@ 2009-08-30  3:12         ` Eli Zaretskii
  2009-08-31  3:33           ` Richard Stallman
  2009-08-30 20:48         ` Juri Linkov
  1 sibling, 1 reply; 40+ messages in thread
From: Eli Zaretskii @ 2009-08-30  3:12 UTC (permalink / raw)
  To: kawabata.taichi; +Cc: juri, emacs-devel

> From: kawabata.taichi@gmail.com
> Accept-Language: ja, en;q=0.6, zh;q=0.3, fr;q=0.1, la;q=0.01, sa;q=0.001
> Date: Sun, 30 Aug 2009 10:16:40 +0900
> Cc: emacs-devel@gnu.org
> 
> As there are thousands of languages in the world, attempting to add
> every languae unconditionaly may shadow the purpose of HELLO file.

How can extra languages in HELLO shadow its purpose?  I don't think we
should care about the size of HELLO too much, certainly not as a
reason for excluding some languages.  People are justifiably proud of
their languages, and we don't want to offend anyone, I think.




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-08-30  3:12         ` Eli Zaretskii
@ 2009-08-31  3:33           ` Richard Stallman
  2009-08-31 16:10             ` kawabata.taichi
  2009-08-31 16:16             ` Juri Linkov
  0 siblings, 2 replies; 40+ messages in thread
From: Richard Stallman @ 2009-08-31  3:33 UTC (permalink / raw)
  To: Eli Zaretskii; +Cc: juri, kawabata.taichi, emacs-devel

    > As there are thousands of languages in the world, attempting to add
    > every languae unconditionaly may shadow the purpose of HELLO file.

    How can extra languages in HELLO shadow its purpose?

If we put the most important things at the top, there is no harm
having lots of languages after that.  I think there are some 3,000
living languages.  If each one takes 10 bytes, that's 30,000 bytes,
which is no big deal.




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-08-31  3:33           ` Richard Stallman
@ 2009-08-31 16:10             ` kawabata.taichi
  2009-08-31 17:07               ` Rupert Swarbrick
  2009-08-31 16:16             ` Juri Linkov
  1 sibling, 1 reply; 40+ messages in thread
From: kawabata.taichi @ 2009-08-31 16:10 UTC (permalink / raw)
  To: rms; +Cc: juri, Eli Zaretskii, emacs-devel

> If we put the most important things at the top, there is no harm
> having lots of languages after that.  I think there are some 3,000
> living languages.  If each one takes 10 bytes, that's 30,000 bytes,
> which is no big deal.

Anther thing I'm worring is a quality.  There may be mistakes during the
compilation process of HELLO file.  (I tried to do double-check by
putting all the phrases into the search engine and see if they hit to
multiple sources, but not all of languages such attempts are
successful.)  There may also be informal greetings that some people may
feel uncomforatble, but I couldn't know if such thing really occured.

If we could hope that people who found inappropriate entry in HELLO.txt
would kindly point it out rather than just merely get angered, and if we
are not asking an absolute quality on correctness of HELLO file, perhaps
we could put all the `Hello' phrases found the Internet.  There might be
other options, such as to put some statements in top of HELLO file
asking people to polish the quality and extend the coverage of languages...

-- 
---------------------------------------------------------
  (kawabata.taichi@gmail.com)       KAWABATA, Taichi

^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-08-31 16:10             ` kawabata.taichi
@ 2009-08-31 17:07               ` Rupert Swarbrick
  0 siblings, 0 replies; 40+ messages in thread
From: Rupert Swarbrick @ 2009-08-31 17:07 UTC (permalink / raw)
  To: emacs-devel

[-- Attachment #1: Type: text/plain, Size: 805 bytes --]

kawabata.taichi@gmail.com writes:

>> If we put the most important things at the top, there is no harm
>> having lots of languages after that.  I think there are some 3,000
>> living languages.  If each one takes 10 bytes, that's 30,000 bytes,
>> which is no big deal.
>
> Anther thing I'm worring is a quality.  There may be mistakes during the
> compilation process of HELLO file.  (I tried to do double-check by
> putting all the phrases into the search engine and see if they hit to
> multiple sources, but not all of languages such attempts are
> successful.)  There may also be informal greetings that some people may
> feel uncomforatble, but I couldn't know if such thing really occured.

I presume you've seen the following page before?

http://users.elite.net/runner/jennifers/hello.htm

Rupert

[-- Attachment #2: Type: application/pgp-signature, Size: 314 bytes --]

^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-08-31  3:33           ` Richard Stallman
  2009-08-31 16:10             ` kawabata.taichi
@ 2009-08-31 16:16             ` Juri Linkov
  1 sibling, 0 replies; 40+ messages in thread
From: Juri Linkov @ 2009-08-31 16:16 UTC (permalink / raw)
  To: rms; +Cc: Eli Zaretskii, kawabata.taichi, emacs-devel

>     > As there are thousands of languages in the world, attempting to add
>     > every languae unconditionaly may shadow the purpose of HELLO file.
>
>     How can extra languages in HELLO shadow its purpose?
>
> If we put the most important things at the top, there is no harm
> having lots of languages after that.  I think there are some 3,000
> living languages.  If each one takes 10 bytes, that's 30,000 bytes,
> which is no big deal.

We could put just one character from every script at the top
(like can be seen on the logo of Wikipedia).

-- 
Juri Linkov
http://www.jurta.org/emacs/




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-08-30  1:16       ` kawabata.taichi
  2009-08-30  3:12         ` Eli Zaretskii
@ 2009-08-30 20:48         ` Juri Linkov
  2009-08-31 15:12           ` Chong Yidong
  1 sibling, 1 reply; 40+ messages in thread
From: Juri Linkov @ 2009-08-30 20:48 UTC (permalink / raw)
  To: kawabata.taichi; +Cc: emacs-devel

> How about adding new language per request basis??  If I do not
> misunderstand, main purposes of `HELLO' is to show and check
> multilingual (multi-script) capability of Emacs and font availability
> in user's installation.

Yes, the main purpose of HELLO is to check multi-script capability,
and it would be simpler if HELLO displayed just a few symbols
from every script.  But since HELLO contains greetings in various
languages now, we risk offending some users by not including their
languages in this list.

> Difficulty comes from verifying if languages collected from the Internet
> is really correct, or did I properly copy them to the text file.

This is not a problem since we can easily fix errors when users notice
and report them.

BTW, I think we should convert HELLO to UTF-8.  Do you see any symbol
in HELLO not supported by UTF-8?

-- 
Juri Linkov
http://www.jurta.org/emacs/

^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-08-30 20:48         ` Juri Linkov
@ 2009-08-31 15:12           ` Chong Yidong
  2009-08-31 16:14             ` Juri Linkov
  2009-08-31 16:15             ` Andreas Schwab
  0 siblings, 2 replies; 40+ messages in thread
From: Chong Yidong @ 2009-08-31 15:12 UTC (permalink / raw)
  To: Juri Linkov; +Cc: kawabata.taichi, emacs-devel

Juri Linkov <juri@jurta.org> writes:

> BTW, I think we should convert HELLO to UTF-8.  Do you see any symbol
> in HELLO not supported by UTF-8?

Isn't it currently UTF-8?




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-08-31 15:12           ` Chong Yidong
@ 2009-08-31 16:14             ` Juri Linkov
  2009-08-31 16:15             ` Andreas Schwab
  1 sibling, 0 replies; 40+ messages in thread
From: Juri Linkov @ 2009-08-31 16:14 UTC (permalink / raw)
  To: Chong Yidong; +Cc: kawabata.taichi, emacs-devel

>> BTW, I think we should convert HELLO to UTF-8.  Do you see any symbol
>> in HELLO not supported by UTF-8?
>
> Isn't it currently UTF-8?

I see it's currently iso-2022-7bit-unix.  And `view-hello-file'
sets its coding system to iso-2022-7bit explicitly.

-- 
Juri Linkov
http://www.jurta.org/emacs/




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-08-31 15:12           ` Chong Yidong
  2009-08-31 16:14             ` Juri Linkov
@ 2009-08-31 16:15             ` Andreas Schwab
  2009-08-31 16:32               ` David Kastrup
  2009-09-08  0:12               ` kawabata.taichi
  1 sibling, 2 replies; 40+ messages in thread
From: Andreas Schwab @ 2009-08-31 16:15 UTC (permalink / raw)
  To: Chong Yidong; +Cc: Juri Linkov, kawabata.taichi, emacs-devel

Chong Yidong <cyd@stupidchicken.com> writes:

> Juri Linkov <juri@jurta.org> writes:
>
>> BTW, I think we should convert HELLO to UTF-8.  Do you see any symbol
>> in HELLO not supported by UTF-8?
>
> Isn't it currently UTF-8?

It's iso-2022-7bit, and converting it to utf-8 would lose the cjk
variants.

Andreas.

-- 
Andreas Schwab, schwab@linux-m68k.org
GPG Key fingerprint = 58CA 54C7 6D53 942B 1756  01D3 44D5 214B 8276 4ED5
"And now for something completely different."




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-08-31 16:15             ` Andreas Schwab
@ 2009-08-31 16:32               ` David Kastrup
  2009-08-31 17:02                 ` Andreas Schwab
  2009-08-31 17:55                 ` Eli Zaretskii
  2009-09-08  0:12               ` kawabata.taichi
  1 sibling, 2 replies; 40+ messages in thread
From: David Kastrup @ 2009-08-31 16:32 UTC (permalink / raw)
  To: emacs-devel

Andreas Schwab <schwab@linux-m68k.org> writes:

> Chong Yidong <cyd@stupidchicken.com> writes:
>
>> Juri Linkov <juri@jurta.org> writes:
>>
>>> BTW, I think we should convert HELLO to UTF-8.  Do you see any symbol
>>> in HELLO not supported by UTF-8?
>>
>> Isn't it currently UTF-8?
>
> It's iso-2022-7bit, and converting it to utf-8 would lose the cjk
> variants.

Since Emacs-23 has utf-8 as its internal encoding, I have my problems to
see how they are not lost anyway.

-- 
David Kastrup





^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-08-31 16:32               ` David Kastrup
@ 2009-08-31 17:02                 ` Andreas Schwab
  2009-09-01 11:41                   ` Kenichi Handa
  2009-08-31 17:55                 ` Eli Zaretskii
  1 sibling, 1 reply; 40+ messages in thread
From: Andreas Schwab @ 2009-08-31 17:02 UTC (permalink / raw)
  To: David Kastrup; +Cc: emacs-devel

David Kastrup <dak@gnu.org> writes:

> Andreas Schwab <schwab@linux-m68k.org> writes:
>
>> Chong Yidong <cyd@stupidchicken.com> writes:
>>
>>> Juri Linkov <juri@jurta.org> writes:
>>>
>>>> BTW, I think we should convert HELLO to UTF-8.  Do you see any symbol
>>>> in HELLO not supported by UTF-8?
>>>
>>> Isn't it currently UTF-8?
>>
>> It's iso-2022-7bit, and converting it to utf-8 would lose the cjk
>> variants.
>
> Since Emacs-23 has utf-8 as its internal encoding, I have my problems to
> see how they are not lost anyway.

See the charset properties.

Andreas.

-- 
Andreas Schwab, schwab@linux-m68k.org
GPG Key fingerprint = 58CA 54C7 6D53 942B 1756  01D3 44D5 214B 8276 4ED5
"And now for something completely different."




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-08-31 17:02                 ` Andreas Schwab
@ 2009-09-01 11:41                   ` Kenichi Handa
  2009-09-01 21:27                     ` Juri Linkov
  0 siblings, 1 reply; 40+ messages in thread
From: Kenichi Handa @ 2009-09-01 11:41 UTC (permalink / raw)
  To: Andreas Schwab; +Cc: dak, emacs-devel

In article <m38wgz7w77.fsf@hase.home>, Andreas Schwab <schwab@linux-m68k.org> writes:

> David Kastrup <dak@gnu.org> writes:
> > Andreas Schwab <schwab@linux-m68k.org> writes:
> >
>>> Chong Yidong <cyd@stupidchicken.com> writes:
>>> 
>>>> Juri Linkov <juri@jurta.org> writes:
>>>> 
>>>>> BTW, I think we should convert HELLO to UTF-8.  Do you see any symbol
>>>>> in HELLO not supported by UTF-8?
>>>> 
>>>> Isn't it currently UTF-8?
>>> 
>>> It's iso-2022-7bit, and converting it to utf-8 would lose the cjk
>>> variants.
> >
> > Since Emacs-23 has utf-8 as its internal encoding, I have my problems to
> > see how they are not lost anyway.

> See the charset properties.

As iso-2022-7bit has designation sequences for each
character set, Emacs can use that information to add proper
`charset' property, and that property infulence the font
selection.

---
Kenichi Handa
handa@m17n.org




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-09-01 11:41                   ` Kenichi Handa
@ 2009-09-01 21:27                     ` Juri Linkov
  2009-09-03 13:07                       ` Kenichi Handa
  0 siblings, 1 reply; 40+ messages in thread
From: Juri Linkov @ 2009-09-01 21:27 UTC (permalink / raw)
  To: Kenichi Handa; +Cc: dak, Andreas Schwab, emacs-devel

>>>>>> BTW, I think we should convert HELLO to UTF-8.  Do you see any symbol
>>>>>> in HELLO not supported by UTF-8?
>>>>>
>>>>> Isn't it currently UTF-8?
>>>>
>>>> It's iso-2022-7bit, and converting it to utf-8 would lose the cjk
>>>> variants.
>> >
>> > Since Emacs-23 has utf-8 as its internal encoding, I have my
>> > problems to see how they are not lost anyway.
>
>> See the charset properties.
>
> As iso-2022-7bit has designation sequences for each
> character set, Emacs can use that information to add proper
> `charset' property, and that property infulence the font
> selection.

The purpose of the HELLO file is to demonstrate Emacs multilingual
capabilities.  It currently does this using iso-2022-7bit.  Does this
mean that UTF-8 is not the best coding system for multilingual texts
in Emacs, because UTF-8 has some limitations (doesn't provide the
proper font selection, loses cjk variants), so iso-2022-7bit is the
preferable coding system for multilingual texts in Emacs?  Is it possible
to improve the UTF-8 support in Emacs since most files use it nowadays?

-- 
Juri Linkov
http://www.jurta.org/emacs/

^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-09-01 21:27                     ` Juri Linkov
@ 2009-09-03 13:07                       ` Kenichi Handa
  2009-09-06 18:20                         ` 牛粥
  0 siblings, 1 reply; 40+ messages in thread
From: Kenichi Handa @ 2009-09-03 13:07 UTC (permalink / raw)
  To: Juri Linkov; +Cc: dak, schwab, emacs-devel

In article <873a762ylx.fsf@mail.jurta.org>, Juri Linkov <juri@jurta.org> writes:

> > As iso-2022-7bit has designation sequences for each
> > character set, Emacs can use that information to add proper
> > `charset' property, and that property infulence the font
> > selection.

> The purpose of the HELLO file is to demonstrate Emacs multilingual
> capabilities.  It currently does this using iso-2022-7bit.  Does this
> mean that UTF-8 is not the best coding system for multilingual texts
> in Emacs, because UTF-8 has some limitations (doesn't provide the
> proper font selection, loses cjk variants), so iso-2022-7bit is the
> preferable coding system for multilingual texts in Emacs?

At least for CJK characters included in legacy character
sets (e.g. JISX0208, GB2312, etc), iso-2022-7bit is still
better than utf-8 if we concern font selection.

> Is it possible
> to improve the UTF-8 support in Emacs since most files use it nowadays?

For that, we must use language tags, but they are strongly
discouraged by Unicode.  Unicode doesn't concern font
selection problem.  It says that it is a task of higher
level information, for instance, that is provided by XML's
language tag.

---
Kenichi Handa
handa@m17n.org




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-09-03 13:07                       ` Kenichi Handa
@ 2009-09-06 18:20                         ` 牛粥
  2009-09-06 20:01                           ` Eli Zaretskii
  2009-09-07  1:01                           ` Kenichi Handa
  0 siblings, 2 replies; 40+ messages in thread
From: 牛粥 @ 2009-09-06 18:20 UTC (permalink / raw)
  To: Kenichi Handa; +Cc: emacs-devel

At Thu, 03 Sep 2009 22:07:50 +0900,
Kenichi Handa wrote:
> 
> In article <873a762ylx.fsf@mail.jurta.org>, Juri Linkov <juri@jurta.org> writes:
> 
> > > As iso-2022-7bit has designation sequences for each
> > > character set, Emacs can use that information to add proper
> > > `charset' property, and that property infulence the font
> > > selection.
> 
> > The purpose of the HELLO file is to demonstrate Emacs multilingual
> > capabilities.  It currently does this using iso-2022-7bit.  Does this
> > mean that UTF-8 is not the best coding system for multilingual texts
> > in Emacs, because UTF-8 has some limitations (doesn't provide the
> > proper font selection, loses cjk variants), so iso-2022-7bit is the
> > preferable coding system for multilingual texts in Emacs?
> 
> At least for CJK characters included in legacy character
> sets (e.g. JISX0208, GB2312, etc), iso-2022-7bit is still
> better than utf-8 if we concern font selection.
> 
> > Is it possible
> > to improve the UTF-8 support in Emacs since most files use it nowadays?
> 
> For that, we must use language tags, but they are strongly
> discouraged by Unicode.  Unicode doesn't concern font
> selection problem.  It says that it is a task of higher
> level information, for instance, that is provided by XML's
> language tag.

According to Handa-san's mentions,
if we do not use iso-2022-7bit, we cannot do input/output of non-ascii
characters. Is that right? Still i'm wondering what it means ...

Sincerely,

-- 
Byung-Hee HWANG
∑ WWW: http://izb.knu.ac.kr/~bh/




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-09-06 18:20                         ` 牛粥
@ 2009-09-06 20:01                           ` Eli Zaretskii
  2009-09-06 21:08                             ` 牛粥
  2009-09-07  1:02                             ` Kenichi Handa
  2009-09-07  1:01                           ` Kenichi Handa
  1 sibling, 2 replies; 40+ messages in thread
From: Eli Zaretskii @ 2009-09-06 20:01 UTC (permalink / raw)
  To: 牛粥; +Cc: emacs-devel, handa

> Date: Mon, 07 Sep 2009 03:20:16 +0900
> From: bh@izb.knu.ac.kr (=?UTF-8?B?54mb57Kl?=)
> Cc: emacs-devel@gnu.org
> 
> According to Handa-san's mentions,
> if we do not use iso-2022-7bit, we cannot do input/output of non-ascii
> characters. Is that right?

No, it isn't right.  What Handa-san says is that for _some_ scripts,
selection of font can be more according to user expectations if the
information about the language is preserved.  iso-2022 gives us an
opportunity to identify the language, while utf-8 does not.




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-09-06 20:01                           ` Eli Zaretskii
@ 2009-09-06 21:08                             ` 牛粥
  2009-09-07  1:02                             ` Kenichi Handa
  1 sibling, 0 replies; 40+ messages in thread
From: 牛粥 @ 2009-09-06 21:08 UTC (permalink / raw)
  To: Eli Zaretskii; +Cc: emacs-devel

At Sun, 06 Sep 2009 23:01:25 +0300,
Eli Zaretskii wrote:
> 
> > Date: Mon, 07 Sep 2009 03:20:16 +0900
> > From: bh@izb.knu.ac.kr (=?UTF-8?B?54mb57Kl?=)
> > Cc: emacs-devel@gnu.org
> > 
> > According to Handa-san's mentions,
> > if we do not use iso-2022-7bit, we cannot do input/output of non-ascii
> > characters. Is that right?
> 
> No, it isn't right.  What Handa-san says is that for _some_ scripts,
> selection of font can be more according to user expectations if the
> information about the language is preserved.  iso-2022 gives us an
> opportunity to identify the language, while utf-8 does not.

That makes sense to me, thanks for explanation!

Sincerely,

-- 
Byung-Hee HWANG
∑ WWW: http://izb.knu.ac.kr/~bh/




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-09-06 20:01                           ` Eli Zaretskii
  2009-09-06 21:08                             ` 牛粥
@ 2009-09-07  1:02                             ` Kenichi Handa
  1 sibling, 0 replies; 40+ messages in thread
From: Kenichi Handa @ 2009-09-07  1:02 UTC (permalink / raw)
  To: Eli Zaretskii; +Cc: bh, emacs-devel

In article <831vmj4zca.fsf@gnu.org>, Eli Zaretskii <eliz@gnu.org> writes:

> No, it isn't right.  What Handa-san says is that for _some_ scripts,
> selection of font can be more according to user expectations if the
> information about the language is preserved.  iso-2022 gives us an
> opportunity to identify the language, while utf-8 does not.

Yes.  Thank you for explaining it in clear words.

---
Kenichi Handa
handa@m17n.org




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-09-06 18:20                         ` 牛粥
  2009-09-06 20:01                           ` Eli Zaretskii
@ 2009-09-07  1:01                           ` Kenichi Handa
  1 sibling, 0 replies; 40+ messages in thread
From: Kenichi Handa @ 2009-09-07  1:01 UTC (permalink / raw)
  To: =?UTF-8?B?54mb57Kl?=; +Cc: emacs-devel

In article <86ab18orz3.wl%bh@izb.knu.ac.kr>, bh@izb.knu.ac.kr (=?UTF-8?B?54mb57Kl?=) writes:

> According to Handa-san's mentions,
> if we do not use iso-2022-7bit, we cannot do input/output of non-ascii
> characters. Is that right?

No.  I didn't say such a thing.

> Still i'm wondering what it means ...

I don't understand from which statement of mine you get such
a conclusion.

---
Kenichi Handa
handa@m17n.org




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-08-31 16:32               ` David Kastrup
  2009-08-31 17:02                 ` Andreas Schwab
@ 2009-08-31 17:55                 ` Eli Zaretskii
  2009-08-31 22:03                   ` Andreas Schwab
  1 sibling, 1 reply; 40+ messages in thread
From: Eli Zaretskii @ 2009-08-31 17:55 UTC (permalink / raw)
  To: David Kastrup; +Cc: emacs-devel

> From: David Kastrup <dak@gnu.org>
> Date: Mon, 31 Aug 2009 18:32:25 +0200
> 
> > It's iso-2022-7bit, and converting it to utf-8 would lose the cjk
> > variants.
> 
> Since Emacs-23 has utf-8 as its internal encoding, I have my problems to
> see how they are not lost anyway.

That's because the internal encoding extends Unicode, see the ELisp
manual for the details.




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-08-31 17:55                 ` Eli Zaretskii
@ 2009-08-31 22:03                   ` Andreas Schwab
  0 siblings, 0 replies; 40+ messages in thread
From: Andreas Schwab @ 2009-08-31 22:03 UTC (permalink / raw)
  To: Eli Zaretskii; +Cc: David Kastrup, emacs-devel

Eli Zaretskii <eliz@gnu.org> writes:

>> From: David Kastrup <dak@gnu.org>
>> Date: Mon, 31 Aug 2009 18:32:25 +0200
>> 
>> > It's iso-2022-7bit, and converting it to utf-8 would lose the cjk
>> > variants.
>> 
>> Since Emacs-23 has utf-8 as its internal encoding, I have my problems to
>> see how they are not lost anyway.
>
> That's because the internal encoding extends Unicode, see the ELisp
> manual for the details.

Though these extensions are not used for CJK variants.

Andreas.

-- 
Andreas Schwab, schwab@linux-m68k.org
GPG Key fingerprint = 58CA 54C7 6D53 942B 1756  01D3 44D5 214B 8276 4ED5
"And now for something completely different."




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-08-31 16:15             ` Andreas Schwab
  2009-08-31 16:32               ` David Kastrup
@ 2009-09-08  0:12               ` kawabata.taichi
  2009-09-08  2:53                 ` Stephen J. Turnbull
                                   ` (2 more replies)
  1 sibling, 3 replies; 40+ messages in thread
From: kawabata.taichi @ 2009-09-08  0:12 UTC (permalink / raw)
  To: Andreas Schwab; +Cc: Juri Linkov, Chong Yidong, emacs-devel

[-- Attachment #1: Type: text/plain, Size: 1505 bytes --]


Sorry I was late only reply.

>> In <m3fxb86jtb.fsf@hase.home>, Andreas Schwab wrote:

> >> BTW, I think we should convert HELLO to UTF-8.  Do you see any symbol
> >> in HELLO not supported by UTF-8?
> >
> > Isn't it currently UTF-8?

> It's iso-2022-7bit, and converting it to utf-8 would lose the cjk
> variants.

`iso-2022-7bit' has been adopted for HELLO file, as it could contain CJK
variants differently, but if we are to go into depth of various scripts
and symbols that ISO-2022 does not support, we could only use UTF.

ISO/IEC 10646 Annex T provides the way to describe the language in plain
text by using Language Tags, such as follows:

󠀁󠁺󠁨开发󠀁󠁿、󠀁󠁺󠁨󠀭󠁴󠁷開發󠀁󠁿、󠀁󠁪󠁡開発󠀁󠁿

Language Tags are not recommended by the Unicode (as long as there is
other ways to describe language, such as XML lang attributes), but it is
the only way to describe language in plain text.  Or switch we switch
HELLO file from plain text to some XML format?

# Emacs still doesn't handle Language Tags, but I think it wouldn't be
# difficult in some future time...

I'll attach updated HELLO examples as follows, that incorporates all the
languages I could find "HELLO" word, plus Language Tag usage examples
(someday Emacs may support it?) and fixed the logical order of Arabic
language.

-- 
---------------------------------------------------------------------
  kawabata.taichi@gmail.com       KAWABATA, Taichi


[-- Attachment #2: HELLO.txt --]
[-- Type: text/plain, Size: 17687 bytes --]

This is a list of ways to say hello in various languages.
(For symbols and some scripts, only sample texts or characters are shown.)

Languages and scripts are classified by writing directions.


1. Scripts written from left to right.

LANGUAGE (NATIVE NAME)	HELLO
----------------------	-----
xAbenaki (Alnôba)	Kwai Kwai!
Abkhaz (аҧсуа бызшәа)	мыш бзи!
Afar (`Afa'r af)	Mahisse
Afrikaans	Hallo, Goeie dag
Ainu (アイヌ イタㇰ)	イランカラㇷ゚テ
Alabama (Albaamo innaaɬiilka)	Chíkmàa
Albanian (shqip / gjuha shqipe)	C'kemi, Tungjatjeta
Alsatian	Hallo, Güete Tag
Amharic (አማርኛ)	ሠላም
Aragonese	Ola
Arapaho (Hinonoéitiit)	Héébee, Tous
Armenian (Հայերէն)	բարև
Aromanian / Vlach (Armãneashti)	bunã dzua
Arrernte	Werte
Assamese (অসমীয়া)	নমস্কার
Asturian	Hola, Bonos díes
Avar (магIарул мацI)	ВорчIами! / ЙорчIами!
Balkar (малкъар/балкъар)	Кюнюгюз ашхы болсун!
Basque (euskara)	Kaixo
Belarusian (Беларуская мова)	Вітаю, Дзень добры
Bengali (বাংলা)	নমস্কার
Bhojpuri (भोजपुरी)	प्रणाम / परणाम 
Bosnian (Bosanski)	Dobar dan Zdravo / Merhaba
Braille	⠓⠑⠇⠇⠕
Breton	Salud, Mat an traoù ganeoc'h
Bulgarian (български)	Здравейте
Burmese (မ္ရန္‌မာ)	မင္‍ဂလာပာ
C	printf ("Hello, world!\n");
Caddo (Hasí:nay)	Káhaʔahat!
Cantonese (粵語 / 廣東話)	早晨 / 你好 / 喂
Catalan (català)	Hola, Bon dia
Catawba	tɑnakɛ!
Cayuga (Goyogo̱hó:nǫ’)	Sgę́nǫ’
Chamorro	Håfa ådai / Buenas
Chechen (Нохчийн мотт)	Салам
Cherokee (ᏣᎳᎩ ᎧᏬᏂᎯᏍᏗ)	ᎣᏏᏲ
Cheyenne (Etsėhesenestse)	Haáhe!
Chichewa (Chicheŵa)	Moni bambo!
Chickasaw (Chikasha)	Chokma!
Chinese ( 普通话 / 汉语 / 漢語)	你好, 您好
Chinese, Hakka (客话 / 𠊎话)	爾好
Chinese, Shanghainese (上海话)	儂好
Chiricahua	Hadínyaa?
Choctaw (Chahta Anumpa)	Halito
Cimbrian	Guuten takh
Congo	mambo
Comanche (Nu̶mu̶ tekwapu̶̲)	Marú̶awe!
Cornish (Kernewek / Kernowek)	Dydh da, Hou, You, Ha, Hou sos
Corsican (corsu)	Bonghjornu
Cree (ᓀᐦᐃᔭᐍᐏᐣ / ᓀᐦᐃᔭᐤ)	ᐊᑕᒥᐢᑳᑐᐃᐧᐣ
Croatian (Hrvatski)	Bok / Čao / Dobar Dan
Czech (čeština)	Dobrý den
Dalecarian	Hej / Høj / Góðdag
Danish (dansk)	Hej / Goddag / Halløj
Deseret (𐐼𐐯𐑅𐐨𐑉𐐯𐐻 𐐰𐑊𐑁𐐩𐐺𐐯𐐻)	𐐸𐑩𐑊𐐬
Dinka (Thuɔŋjäŋ)	Ci yi bak
Dutch (Nederlands)	Hallo / Dag
Dzongkha (རྫོང་ཁ)	སྐུ་གཟུགས་བཟང་པོ
Emacs	emacs --no-splash -f view-hello-file
English /ˈɪŋɡlɪʃ/	Hello
Erzya (эрзянь кель)	Шумбрат!
Esperanto	Saluton 
Estonian (eesti keel)	Tere päevast / Tere õhtust
Faroese (Føroyska)	Hallo / Hey
Fijian (Vakaviti)	Bula / Drau bula / Dou bula / Nibula
Finnish (suomi)	Hei / Hyvää päivää
French (français)	Bonjour / Salut
Frisian, North (Noordfreesk)	Moin / Guundach
Frisian, West (Frysk)	A goeie/ Hoi / Goeie / Goedei
Friulan (furlan / marilenghe)	Bundì / Mandi
Gaelic, Scottish (Gàidhlig)	Halò, Ciamar a tha thu / sibh?
Galician (Galego)	Ola
Georgian (ქართველი)	გამარჯობა
German (Deutsch)	Guten Tag / Grüß Gott
German, Low (Plattdüütsch)	Moin / Goden Dag
German, Swiss (Schwyzerdütsch)	hallo
Greek (ελληνικά)	Γειά σας
Greek, Polytonic	Ἐμπρός! 
Greenlandic (kalaallisut)	Inuugujoq, kutaa / Haluu
Gujarati (ગુજરાતી)	નમસ્તે
Haitian Creole (Kreyòl ayisyen)	Bonjou
Hawaiian (ʻōlelo Hawaiʻi)	Aloha
Herero (Otjiherero)	Tjike
Himba	Mono
Hindi (हिंदी)	नमस्ते ।
Hungarian (magyar)	Szia! / Szép jó napot!
Icelandic (Íslenska)	Halló / Góðan dag
Ido	Saluto
Igbo	nde-ewo
Indonesian (Bahasa Indonesia)	Selamat siang
Interlingua	Salute
Inuktitut (ᐃᓄᒃᑎᑐᑦ)	ᐊᐃ / ᐊᐃᓐᖓᐃ
Irish (Gaeilge)	Dia duit
Italian (italiano)	Ciao / Buon giorno
Iñupiaq (Inupiatun)	Halauġikpiñ
Japanese (日本語)	今日は。 / ｺﾝﾆﾁﾊ
Javanese (Jawa)	System.out.println("Sugeng siang!");
Jersey Norman (Jèrriais)	Salut / Bouônjour
Jutish	Godaw
Kannada (ಕನ್ನಡ)	ನಮಸ್ಕಾರ
Kanza (Kaáⁿze)	ho / hawé
Karelian	Terveh!
Kashubian (kaszëbsczi jãzëk)	Witôjze
Kazakh (Қазақ тілі)	Сәлем! / Сәлеметсіз бе?
Khanty	Вўҫя вуӆа!
Khmer (ភាសាខ្មែរ)	ជំរាបសួរ / សួស្ដី
Kickapoo	Aho
Kiowa	Hā́chò
Klingon	nuqneH?
Koasati (Kowassá:tit)	Cikáʔnó!
Komi (коми кыв)	чолöм / Видза олан!
Korean (한글 / 韓國語)	안녕하세요 / 안녕하십니까 / 안녕
Kurdish (к’öрди)	Sillaw / Ew kata bash / Em kata bash
Kwanyama (kuanyama)	wa uhala po / meme
LOLcat ;->	Ohai!
Lao (ພາສາລາວ)	ສະບາຍດີ / ຂໍໃຫ້ໂຊກດີ
Latin (Lingua Latina)	Heus / Ave / Salve / Salvete
Latvian (latviešu valoda)	Sveiki
Lenape (Lënapei)	Hè
Limburgish (Lèmburgs)	Hallo
Lingala (lingála)	mbote
Lithuanian (lietuvių kalba)	Labas / Sveikas / Sveiki
Livonian (Līvõ kēļ)	Tēriņtš!
Lojban	coi
Luxembourgish (Lëtzebuergesch)	Moïen / Salut
Macedonian (македонски)	Здраво
Malay (Bahasa melayu)	Selamat pagi / Selamat petang
Malayalam (മലയാളം)	നമസ്കാരം
Maltese (il-Malti)	Bonġu / Saħħa / merħba
Mansi (Маньси)	Пася о̄лэн!
Manx (Gaelg / Gailck)	Dy bannee diu, Ta shiu/oo cheet!
Maori (te Reo Māori)	Kia ora / Kia ora rā kōrua
Marathi (मराठी)	नमस्कार ।
Mari, Hill	Шӓлӓ!
Mari, Meadow (марий йылме)	Салам!
Mathematics	∀ p ∈ world • hello p  □
Mohawk (Kanien'kéha)	kwe kwe
Moksha (мокшень кяль)	Шумбра́т! / Шумбра́тада!
Mongolian (монгол)	Сайн уу?
Nahuatl (nāhuatl)	niltze, hao
Nama (Namagowab)	!gai / oas
Navajo (Diné Bizaad)	Y´'át'ééh
Neapolitan	cia, cha
Nederlands, Vlaams	Hallo / Dag
Nenets, Tundra	Торова!
Nepal Bhasa (नेपाल भाषा)	ज्वजलपा
Nepali (नेपाली)	नमस्ते
Niuean (ko e vagahau Niuē)	faka lofa lahi atu
Norwegian (norsk)	Hei / God dag
Occitan	Bonjorn!
Old English	Wes hāl / Wesaþ hāl
Oriya (ଓଡ଼ିଆ)	ଶୁଣିବେ
Pig Latin	eyhay
Pitjantjatjara	Wai, Wai palya
Plautdietsch	Goondach
Polish  (język polski)	Dzień dobry! / Cześć!
Portuguese (português)	Oi! E aí? Tudo bem? Tudo certo? Opa!
Punjabi (ਪੰਜਾਬੀ)	ਸਤ ਸ੍ਰੀ ਅਕਾਲ.
Rajasthani (राजस्थानी)	राम् राम्.
Romanian (limba română)	Salut
Russian (русский)	Здра́вствуйте!
Sami, Inari (säämegiella)	Tiervâ!
Sami, Northern (sámi / sámegiella)	Bures!
Sami, Southern (saemien giele)	Tiõrv!
Samoan (Gagana Samoa)	Talofa
Sardinian (Limba Sarda)	Bona die
Senegal	salamaleikum
Serbian (српски)	Здраво
Shavian (𐑖𐑭𐑝𐑾𐑯)	𐑣𐑩𐑤𐑴
Shona (chiShona)	Mhoro / Mhoroi
Sicilian (sicilianu)	Ciau
Sinhala (සිංහල)	ආයුබෝවන්
Slovak (slovenčina)	Ahoj / Ahojte / Nazdar / Servus
Slovenian (slovenščina)	Pozdravljeni!
Sotho, Southern (seSotho)	Lumela / Dumela / Dumelang
Spanish (español)	¡Hola!
Stellingwarfs	Hoj
Sulka	marot
Swahili (Kiswahili)	Habari / Hujambo
Swedish (på svenska)	Hej / Goddag / Hallå
Tagalog (ᜊᜌ᜔ᜊᜌᜒᜈ᜔)	ᜋᜊᜓᜑᜌ᜔
Tamil (தமிழ்)	வணக்கம்
Telugu (తెలుగు)	నమస్కారం
Tetum (tetun)	bondia 
Thai (ภาษาไทย)	สวัสดีครับ / สวัสดีค่ะ
Tibetan (བོད་སྐད་)	བཀྲ་ཤིས་བདེ་ལེགས༎
Tigrigna (ትግርኛ)	ሰላማት
Tok Pisin (Tok Pisin)	Gude
Tongan (Faka-Tonga)	Malo e lelei / Malo e tau lava
Tsez (цез мец)	АсаламугIалейкум!
Tshiluba	moyo
Tsonga (xiTsonga)	minjhani
Turkish (Türkçe)	Merhaba / Selam / İyi günler Alo / Efendim
Udmurt (удмурт кыл)	Умой!
Ukrainian (українська)	Вітаю
Uzbek (o'zbek tili)	Assalomu Alaykum!
Venda (tshiVenḓa)	I nhlikanhi
Veps	Tervhen!
Vietnamese (tiếng Việt)	Chào bạn
Volapük	Glidis
Walloon (walon)	Bondjoû
Warlpiri	Ngurrju mayinpa
Welsh (Cymraeg / Y Gymraeg)	Helô / Hylô / Shwmae
Xhosa (isiXhosa)	Molo / Molweni
Yappese	Mogethin
Yoruba (Yorùbá)	Ẹ n lẹ
Zulu (isiZulu)	Sawubona / Sanibonani


SCRIPT NAME	SAMPLES
----------------------	-----
Balinese (ᬩᬲ ᬩᬮᬶ)	ᬓᬔᬕᬖᬗᬘᬙᬚᬛᬜᬝᬞᬟ 
Blackfoot (ᑯᖾᖹ / ᓱᖽᐧᖿ)	ᖳᖰᖱᖲ 
Buginese (ᨅᨔ ᨕᨘᨁᨗ)	ᨕᨗᨕᨊᨕᨙ ᨔᨛᨄᨒᨚ 
Buhid (ᝊᝓᝑᝒᝇ)	ᝀᝁᝂᝃᝄᝅ 
Carian	𐊠𐊥𐊣𐊹𐊮𐊸 𐊲𐊥𐊰𐊴𐊣𐊺𐊸 𐊽𐊹𐊾𐊩𐊰𐊹𐊸 
Carrier (ᑐᑊᘁᗕᑋᗸ)	ᗺᗹᗵᗷᗶ 
Cham	ꨁꨗꨩꨈꨮ 
CJK Radicals	⼀⼁⼂⼃⼄⼅⼆⼇⼈⼉ / ⺀⺁⺂⺃⺄⺅⺆⺇⺈⺉
CJK Unified Ideograph Extension-A	㐀㐁㐂㐃㐄㐅㐆㐇㐈㐉㐊㐋㐌㐍㐎㐏
CJK Unified Ideograph Extension-B	𠀀𠀁𠀂𠀃𠀄𠀅𠀆𠀇𠀈𠀉𠀊𠀋𠀌𠀍𠀎𠀏
CJK Unified Ideograph Extension-C	𪜀𪜁𪜂𪜃𪜄𪜅𪜆𪜇𪜈𪜉𪜊𪜋𪜌𪜍𪜎𪜏
Coptic	ⲘⲒⲞ.Ⲕ 
Cuneiform	𒀀𒀁𒀂𒀃𒀄𒀅𒀆𒀇𒀈𒀉𒀊𒀋𒀌𒀍𒀎𒀏 
Cyrillic Supplement	ԀԁԂԃԄԅԆԇԈԉԊԋԌԍԎԏ
Cyrillic Extended-A	ⷠⷡⷢⷣⷤⷥⷦⷧⷨⷩⷪⷫⷬⷭⷮⷯ
Cyrillic Extended-B	ꙀꙁꙂꙃꙄꙅꙆꙇꙈꙉꙊꙋꙌꙍꙎꙏ
Ethiopic Extended	ⶀⶁⶂⶃⶄⶅⶆⶇⶈⶉⶊⶋⶌⶍⶎⶏ
Ethiopic Supplement	ᎀᎁᎂᎃᎄᎅᎆᎇᎈᎉᎊᎋᎌᎍᎎᎏ
Georgian Supplement	ⴀⴁⴂⴃⴄⴅⴆⴇⴈⴉⴊⴋⴌⴍⴎⴏ
Glagolitic	ⰙⰂⰍⰌⰇⰟⰘ 
Gothic	𐌰𐍄𐍄𐌰 𐌿𐌽𐍃𐌰𐍂 𐌸𐌿 𐌹𐌽 
Hanunoo (ᜱᜨᜳᜨᜳᜢ)	ᜣᜫᜨᜳᜰᜲ 
Kayah Li	꤁꤂꤃꤄꤅ 
Latin Extended-C	ⱠⱡⱢⱣⱤⱥⱦⱧⱨⱩⱪⱫⱬⱭⱮⱯ
Latin Extended-D	꜠꜡ꜢꜣꜤꜥꜦꜧꜨꜩꜪꜫꜬꜭꜮꜯ
Lepcha	ᰣᰕᰧᰅ 
Limbu	ᤀᤁᤂᤃᤄᤅᤆᤇᤈ 
Lycian	𐊀𐊁𐊂𐊃𐊄𐊅𐊆𐊇 
Lydian	𐤠𐤡𐤢𐤣𐤤𐤥𐤦𐤧 
New Tai Lue	ᦀᦁᦂᦃᦄᦅᦆᦇ 
Ogham	᚛ᚁᚂᚃᚄᚅᚆᚇᚈᚉᚊᚋᚌᚍᚎᚏᚐᚑᚒᚒᚔ᚜ 
Santali (Ol Chiki)	ᱟᱲ.ᱟ. 
Old Persian	𐎠𐎡𐎢𐎣𐎤𐎥𐎦𐎧𐎨𐎩𐎪𐎫𐎬𐎭𐎮𐎯 
Osmanya	𐒀𐒁𐒂𐒃𐒄𐒅𐒆𐒇𐒈𐒉𐒊𐒋𐒌𐒍𐒎𐒏 
Phaistos Disc	𐇑𐇛𐇜𐇐𐇡 
Phonetic Extensions	ᴀᴁᴂᴃᴄᴅᴆᴇᴈᴉᴊᴋᴌᴍᴎᴏ
Phonetic Extension Supplement	ᶀᶁᶂᶃᶄᶅᶆᶇᶈᶉᶊᶋᶌᶍᶎᶏ
Rejang	ꤰꤱꤲꤴꤵ　
Runic	ᛒᛁᛏᚱᛅᛁᛋ᛬ᛚᛅᛁᚠᛅ᛬ᚠᚢᛋᛏᚱᛅ᛬ᚴᚢᚦᛅᚾ᛬ᚦᛅᚾ᛬ᛋᚭᚾ᛬ᛁᛚᛅᚾ᛭ 
Saurashtra	ꢂꢒꢂꢬꢣꢶ 
Sundanese	ᮀᮁᮂᮃᮄᮅᮆᮇᮈᮉᮊᮋᮌᮍᮎᮏ 
Syloti Nagri	ꠀꠇꠣꠌꠤꠐꠥꠔꠦ 
Tagbanwa (ᝤᝪᝨᝯ)	ᝠᝡᝢᝣᝤᝥᝦᝧᝨᝩᝪᝫᝬ 
Tifinagh (ⵜⵉⴼⵉⵏⴰⵖ)	ⴰⴱⴲⴳⴴⴵⴶⴷⴸⴹⴺⴻⴼⴽⴾⴿ
Tai Le (ᥖᥭᥰᥖᥬᥳᥑᥨᥒᥰ)	ᥐᥑᥒᥓᥔᥕᥖᥗᥘᥙᥚᥛᥜᥝᥞᥟ
New Tai Lue	ᦀᦁᦂᦃᦄᦅᦆᦇᦈᦉᦊᦋᦌᦍᦎᦏ
Ugaritic	𐎀𐎁𐎂𐎃𐎄𐎅𐎆𐎇𐎈𐎉𐎊𐎋𐎌𐎍𐎎𐎏 
Vai	ꔀꔁꔂꔃꔄꔅꔆꔇꔈꔉꔊꔋꔌꔍꔎꔏ 
Yi (ꆇꉙ)	ꉷꆀꅇꌫꏦ


\f

2. Scripts written from Right to Left.

LANGUAGE (NATIVE NAME)	HELLO
----------------------	-----
Arabic (ةّيبرعلا)	السلام عليكم 
Aramaic, Syriac (ܠܫܢܐ ܤܘܪܝܝܐ)	ܐܵܝ! / ܐܳܝ!
Dhivehi (ހިވެދި)	ކިހިނެތް؟ / ހާލު ކިހިނެތް؟ 
Hebrew (תירבע)	שלום
Persian (فارسى)	سلام / درود
Urdu (اردو)	السلام علیکم
Yiddish (ײִדיש / מאַמע לשון)	אַ גוטן טאָג


SCRIPT NAME	SAMPLES
----------------------	-----
Arabic Supplement	ݐݑݒݓݔݕݖݗݘݙݚݛݜݝݞݟ
Cypriot Syllabary	𐠀𐠁𐠂𐠃𐠄𐠅𐠈𐠊𐠋𐠌𐠍𐠎𐠏
Kharoshthi	𐨠𐨡𐨢𐨣𐨤𐨥𐨦𐨧𐨨𐨩𐨪𐨫𐨬𐨭𐨮𐨯
Linear B	𐀀𐀁𐀂𐀃𐀄𐀅𐀆𐀇 / 𐂀𐂁𐂂𐂃𐂄𐂅𐂆𐂇
N'Ko (ߒߞߏ)	߀߁߂߃߄߅߆߇߈߉ߊߋߌߍߎߏ
Old Italic	𐌀𐌁𐌂𐌃𐌄𐌅𐌆𐌇𐌈𐌉𐌊𐌋𐌌𐌍𐌎𐌏
Phoenician	𐤀𐤁𐤂𐤃𐤄𐤅𐤆𐤇𐤈𐤉𐤊𐤋𐤌𐤍𐤎𐤏


\f

3. Scripts written from Top to Bottom

LANGUAGE (NATIVE NAME)	HELLO
----------------------	-----
Mongolian (ᠮᠣᠨᠭᠣᠯ ᠪᠢᠴᠢᠭ)	ᠰᠠᠢ᠋ᠨ ᠪᠠᠢ᠋ᠨ ᠤᠦ
Japanese (日本語)	もし〳〵 (Vertical Repetition Marks)


SCRIPT NAME	SAMPLES
----------------------	-----
Manchurian (ᠮᠠᠨᠵᡠ)	ᡶᡠᡯᡳ ᡥᡝᠨᡩᡠᠮᡝ᠈ ᡨᠠᠴᡳᠮᠪᡳᠮᡝ ᡝᡵᡳᠨᡩᡝᡵᡳ
	ᡠᡵᡳᠪᡠᠴᡳ᠈ ᡳᠨᡠ ᡠᡵᡤᡠᠨ ᠸᠠᡴᠠ᠉
Kanbun (漢文)	使㆟籍誠不㆚以㆘蓄㆓妻子㆒憂㆗飢寒㆖乱㆙㆑心、
	有㆑銭以済㆞医薬㆝。
Phags-pa	ꡏꡟ ꡋꡞ ꡏꡟ ꡋꡞ ᠂ ꡏ ꡜꡖ ꡏꡟ ꡋꡞ ᠂ ꡓꡞ ꡏꡟ ᠁ 

\f

4. Symbols

SYMBOL NAMES	EXAMPLES
----------------------	-----
Aegean Numbers	𐄀𐄁𐄂𐄇 𐄈𐄉𐄊𐄋𐄌𐄍𐄎𐄏
Ancient Greek Numbers	𐅀𐅁𐅂𐅃𐅄𐅅𐅆𐅇𐅈𐅉𐅊𐅋𐅌𐅍𐅎𐅏
Ancient Symbols	𐆐𐆑𐆒𐆓𐆔𐆕𐆖𐆗𐆘𐆙𐆚𐆛
Ancient Greek Musical Notation	𝈀𝈁𝈂𝈃𝈄𝈅𝈆𝈈𝈉𝈊𝈋𝈌𝈍𝈎𝈏
Arrows	←↑↠↡↰↱⇀⇁⟰⟱⟲⟳⤀⤁⤐⤑⤠⤡⤰⤱⥀⥁
Block Elements	▀▁▂▃▄▅▆▇▐░▒▓▔▕▖▗
Box Drawing	┌┐└┘├┤┬┴
Byzantine Musical Symbols	𝀰𝀱𝀲𝀳𝀴𝀵𝀶𝀷
Combining Diacritical Marks	à á â ã ā a̅ ă ȧ ä ả å a̋ ǎ a̍ a̎ ȁ
Combining Diacritical Marks For Symbols	a⃐ a⃑ a⃒ a⃓ a⃔ a⃕ a⃖ a⃗
Combining Diacritical Marks Supplement	a᷀ a᷁ a᷂ a᷃ a᷄ a᷅ a᷆ a᷇ a᷈ a᷉ a᷊ a᷋ a᷌ a᷍ a᷎ a᷏
Control Pictures	␁␂␃␄␅␆␇␈␉␊␋␌␍␎␏
Counting Rod Numerals	𝍠𝍡𝍢𝍣𝍤𝍥𝍦𝍧𝍨𝍩𝍪𝍫𝍬𝍭𝍮𝍯
Currency Symbols	$¢£¤¥₠₡₢₣₤₥₦₧₨₩₪₫₭₮₯
Dingbats	✁✆✇✈✉✌✍✐✒✓✟✠
Domino Tiles	🀰🀲🁂🁒🁛🁢🁤🁴🂄🂍
Enclosed Alphanumerics	①②③④⑴⑵⑶⑷⒈⒉⒊⒋⒜⒝⒞⒟ⒶⒷⒸⒹⓐⓑⓒⓓ⓵⓶⓷⓸
Geometric Shapes	■□▢▣▤▥▦▧▰▱▲△▴▵▶▷◀◁◂◃◄◅◆◇◐◑◒◓◔◕◖◗
Ideographic Description Characters	漢=⿰氵𦰩, 字=⿱宀子
Khmer Symbols	᧠᧡᧢᧣᧤᧥᧦᧧᧨᧩᧪᧫᧬᧭᧮᧯
Letterlike Symbols	℀℁ℂ℃℄℅℆ℇ℈℉ℊℋℌℍℎℏ
Misellaneous Symbols	☄★☆☐☑☒☓♀♁♂♃♨♰♲♳♴⚀⚁⚐⚒⚓⚠⚡⚢
Number Forms	⅓⅔⅕⅖ⅠⅡⅢⅣⅰⅱⅲⅳↀↁↂↃↄ
Mahjong Tiles	🀀🀁🀂🀃🀆🀅🀄🀇🀏🀐🀘🀙🀡🀢🀦🀪🀫
Mathematical Operators	∀∂∃∄∈∊∌∑∓√⨀⨁⨂⨃⨐⨠⨡⨢⨰⨱⨲⩀⩁⩂
Mathematical Symbols	⟀⟁⟂⟃⟐⟑⟒⟓⟠⟡⟢⟣⦀⦁⦂⦃⦄
Mathematical Alphanumeric Symbols	𝐀𝐁𝐂𝐃𝐄𝐅𝐆𝐇𝐈𝐉𝐊𝐋𝐌𝐍𝐎𝐏
Modifier Tone Letters	꜀꜁꜂꜃꜄꜅꜆꜇꜈꜉꜊꜋꜌꜍꜎꜏
Musical Symbols	𝆰𝆱𝆲𝆳𝆴𝆵𝆶𝆷𝆸𝆹𝆺𝆹𝅥𝆺𝅥𝆹𝅥𝅮𝆺𝅥𝅮𝆹𝅥𝅯
Optical Character Recognition	⑀⑁⑂⑃⑄⑅⑆⑇⑈⑉⑊
Planet Symbols	☿♀♁♂♃♄♅♆♇ / ⚳⚴⚵⚶⚷
Recycling Symbols	♲♳♴♵♶♷♸♹♺♻♼♽
Superscripts	⁰¹²³²⁴⁵⁶⁷⁸⁹⁺⁻⁼⁽⁾ⁿʰʱʲʳʴʵʶʷʸˠˡˢˣˤჼ
Subscripts	₀₁₂₃₄₅₆₇₈₉₊₋₌₍₎ₐₑᵢₒᵤⱼᵥₓᵣₔₓ
Supplemental Punctuation	⸀⸁⸂⸃⸄⸅⸆⸇⸈⸉⸊⸋⸌⸍⸎
Technical Symbols	⌀⌁⌂⌃⌐⌑⌒⌠⌡⌰⌱⌲⌳⍀⍁⍂⍐⍑⍒
Tai Xuan Jing Symbols	𝌰𝌱𝌲𝌳𝌴𝌵𝌶𝌷𝌸𝌹𝌺𝌻𝌼𝌽𝌾𝌿
Tramp Suites	♠♡♢♣♤♥♦♧
Yijing Hexagram Symbols	☰☱☲☳☴☵☶☷
Weather Symbols	☀☁☂☃☔

\f

5. Special Characters

NAME	SAMPLES
----------------------	-----
LANGUAGE TAGS	󠀁󠁺󠁨开发󠀁󠁿、󠀁󠁺󠁨󠀭󠁴󠁷開發󠀁󠁿、󠀁󠁪󠁡開発󠀁󠁿
VARIATION SELECTORS	邊 vs. 邊󠄀, 邊󠄁, 邊󠄂, 邊󠄃, 邊󠄄, 邊󠄅, 邊󠄆, 邊󠄇

\f



Copyright (C) 2001, 2002, 2003, 2004, 2005, 2006, 2007, 2008, 2009
  Free Software Foundation, Inc.

This file is part of GNU Emacs.

GNU Emacs is free software: you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation, either version 3 of the License, or
(at your option) any later version.

GNU Emacs is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
GNU General Public License for more details.

You should have received a copy of the GNU General Public License
along with GNU Emacs.  If not, see <http://www.gnu.org/licenses/>.

;;; Local Variables:
;;; tab-width: 40
;;; End:

^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-09-08  0:12               ` kawabata.taichi
@ 2009-09-08  2:53                 ` Stephen J. Turnbull
  2009-09-12 13:52                   ` Per Starbäck
  2009-09-08  3:11                 ` Eli Zaretskii
  2009-09-09  0:47                 ` Juri Linkov
  2 siblings, 1 reply; 40+ messages in thread
From: Stephen J. Turnbull @ 2009-09-08  2:53 UTC (permalink / raw)
  To: kawabata.taichi; +Cc: Juri Linkov, Chong Yidong, Andreas Schwab, emacs-devel

kawabata.taichi@gmail.com writes:

 > `iso-2022-7bit' has been adopted for HELLO file, as it could contain CJK
 > variants differently, but if we are to go into depth of various scripts
 > and symbols that ISO-2022 does not support, we could only use UTF.

ISO 2022 supports Unicode via the "incompatible encodings" control
sequence (I forget the exact name and escape sequence).  I don't know
if Emacs's iso-2022-* coding systems support it, but it's not hard to
add in the unlikely case that you don't have it already.

IMO, a better way to do this would be to change the HELLO file to be a
Lisp library, and encode it in UTF-8.  The table of greetings can
simply be wrapped into a string, and add a table of "languages we have
preferred fonts for" in Lisp.

^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-09-08  2:53                 ` Stephen J. Turnbull
@ 2009-09-12 13:52                   ` Per Starbäck
  2009-09-12 17:51                     ` 川幡 太一
  0 siblings, 1 reply; 40+ messages in thread
From: Per Starbäck @ 2009-09-12 13:52 UTC (permalink / raw)
  To: Stephen J. Turnbull
  Cc: Juri Linkov, kawabata.taichi, Andreas Schwab, Chong Yidong,
	emacs-devel

2009/9/8 Stephen J. Turnbull <stephen@xemacs.org>:
> IMO, a better way to do this would be to change the HELLO file to be a
> Lisp library, and encode it in UTF-8.  The table of greetings can
> simply be wrapped into a string, and add a table of "languages we have
> preferred fonts for" in Lisp.

Also if there's a hello command that creates a buffer ("*Hello*"?) dynamically,
it could put a greeting for the appropriate locale at the top.




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-09-12 13:52                   ` Per Starbäck
@ 2009-09-12 17:51                     ` 川幡 太一
  0 siblings, 0 replies; 40+ messages in thread
From: 川幡 太一 @ 2009-09-12 17:51 UTC (permalink / raw)
  To: Per Starbäck
  Cc: Juri Linkov, Stephen J. Turnbull, Andreas Schwab, Chong Yidong,
	emacs-devel

>> In <912155b0909120652s1649303o8468150c79c53f9b@mail.gmail.com>, 
>> Per Starbäck wrote:

> 2009/9/8 Stephen J. Turnbull <stephen@xemacs.org>:
> > IMO, a better way to do this would be to change the HELLO file to be a
> > Lisp library, and encode it in UTF-8.  The table of greetings can
> > simply be wrapped into a string, and add a table of "languages we have
> > preferred fonts for" in Lisp.

> Also if there's a hello command that creates a buffer ("*Hello*"?) dynamically,
> it could put a greeting for the appropriate locale at the top.

In some way, it is already realized by `sample-text' attribute of
`set-language-info-alist' function.

`HELLO' file may be useful for symbols and scripts not yet (or cannot
be) supported by Emacs as language, but are already encoded by the
UCS/Unicode.

Regards,

-- 
----------------------------------------------------
  kawabata.taichi@gmail.com       KAWABATA, Taichi




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-09-08  0:12               ` kawabata.taichi
  2009-09-08  2:53                 ` Stephen J. Turnbull
@ 2009-09-08  3:11                 ` Eli Zaretskii
  2009-09-12 17:45                   `  Taichi KAWABATA 
  2009-09-09  0:47                 ` Juri Linkov
  2 siblings, 1 reply; 40+ messages in thread
From: Eli Zaretskii @ 2009-09-08  3:11 UTC (permalink / raw)
  To: kawabata.taichi; +Cc: juri, cyd, schwab, emacs-devel

> From: kawabata.taichi@gmail.com
> Accept-Language: ja, en;q=0.6, zh;q=0.3, fr;q=0.1, la;q=0.01, sa;q=0.001
> Date: Tue, 08 Sep 2009 09:12:30 +0900
> Cc: Juri Linkov <juri@jurta.org>, Chong Yidong <cyd@stupidchicken.com>,
> 	emacs-devel@gnu.org
> 
> fixed the logical order of Arabic language.

Fixed how?  (Sorry, don't have time to look.)  The name of the
language (not the greeting) should be in visual order, because Emacs
does not yet support bidirectional display.




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-09-08  3:11                 ` Eli Zaretskii
@ 2009-09-12 17:45                   `  Taichi KAWABATA 
  2009-09-12 19:39                     ` Eli Zaretskii
  0 siblings, 1 reply; 40+ messages in thread
From:  Taichi KAWABATA  @ 2009-09-12 17:45 UTC (permalink / raw)
  To: Eli Zaretskii; +Cc: juri, cyd, schwab, emacs-devel

>> In <83pra22krm.fsf@gnu.org>, Eli Zaretskii wrote:

> > fixed the logical order of Arabic language.

> Fixed how?  (Sorry, don't have time to look.)  The name of the
> language (not the greeting) should be in visual order, because Emacs
> does not yet support bidirectional display.

Fixed to follow reading order.  They should be rendered correctly when
copied to other programs that can handle BIDI correctly, which may
include future version of Emacs.

>> In <87zl946g9k.fsf@mail.jurta.org>, Juri Linkov wrote:

> Or maybe switch to a simpler markup like in emacs/etc/enriched.doc
> (and add the language tag support to this format).

> 3. You could replace the old Euro currency U+20A0 ₠ with the new one
> U+20AC €.

That's right.  They should be replaced....

> 4. Klingon uses the Private Use Area U+F8D0 - U+F8FF
> http://www.wazu.jp/gallery/Test_Klingon.html

I think non-standardized characters should be avoided.

> 5. You've removed charsets jisx0201 and jisx0208 from Japanese
> greetings.  Is it because they are not needed for font selection, or
> just because the transfer charset in your message was utf-8?

`Japanese' entry of HELLO contains characters from both JIS X 0208
charsets and JIS X 0201 charsets (halfwidth katakana chars.)

Regards,

-- 
----------------------------------------------------
  kawabata.taichi@gmail.com       KAWABATA, Taichi




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-09-12 17:45                   `  Taichi KAWABATA 
@ 2009-09-12 19:39                     ` Eli Zaretskii
  2009-09-15  5:20                       ` Kenichi Handa
  0 siblings, 1 reply; 40+ messages in thread
From: Eli Zaretskii @ 2009-09-12 19:39 UTC (permalink / raw)
  To:  Taichi KAWABATA ; +Cc: emacs-devel

> From: kawabata.taichi@gmail.com ( Taichi KAWABATA )
> Cc: schwab@linux-m68k.org, juri@jurta.org, cyd@stupidchicken.com,
> 	emacs-devel@gnu.org
> Date: Sun, 13 Sep 2009 02:45:24 +0900
> 
> >> In <83pra22krm.fsf@gnu.org>, Eli Zaretskii wrote:
> 
> > > fixed the logical order of Arabic language.
> 
> > Fixed how?  (Sorry, don't have time to look.)  The name of the
> > language (not the greeting) should be in visual order, because Emacs
> > does not yet support bidirectional display.
> 
> Fixed to follow reading order.  They should be rendered correctly when
> copied to other programs that can handle BIDI correctly, which may
> include future version of Emacs.

No, we deliberately have the language in visual order to make it
readable.  Being readable is more important than being
copy-paste-able.  When Emacs supports bidirectional editing, we will
reorder that again.




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-09-12 19:39                     ` Eli Zaretskii
@ 2009-09-15  5:20                       ` Kenichi Handa
  2009-09-15  8:08                         ` David Kastrup
  2009-09-15 17:12                         ` Eli Zaretskii
  0 siblings, 2 replies; 40+ messages in thread
From: Kenichi Handa @ 2009-09-15  5:20 UTC (permalink / raw)
  To: Eli Zaretskii; +Cc: kawabata.taichi, emacs-devel

In article <83ocpgymug.fsf@gnu.org>, Eli Zaretskii <eliz@gnu.org> writes:

> No, we deliberately have the language in visual order to make it
> readable.  Being readable is more important than being
> copy-paste-able.  When Emacs supports bidirectional editing, we will
> reorder that again.

I now think it's more important to tell whether or not Emacs
supports it.  When one sees that Arabic is correctly
displayed in HELLO, he may misunderstand that Arabic is
supported.

---
Kenichi Handa
handa@m17n.org




^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-09-15  5:20                       ` Kenichi Handa
@ 2009-09-15  8:08                         ` David Kastrup
  2009-09-15 17:20                           ` Eli Zaretskii
  2009-09-15 17:12                         ` Eli Zaretskii
  1 sibling, 1 reply; 40+ messages in thread
From: David Kastrup @ 2009-09-15  8:08 UTC (permalink / raw)
  To: emacs-devel

Kenichi Handa <handa@m17n.org> writes:

> In article <83ocpgymug.fsf@gnu.org>, Eli Zaretskii <eliz@gnu.org> writes:
>
>> No, we deliberately have the language in visual order to make it
>> readable.  Being readable is more important than being
>> copy-paste-able.  When Emacs supports bidirectional editing, we will
>> reorder that again.
>
> I now think it's more important to tell whether or not Emacs
> supports it.  When one sees that Arabic is correctly
> displayed in HELLO, he may misunderstand that Arabic is
> supported.

IIRC, R-L display is the principal remaining heavy-lifting task on the
roadmap.  I think that rearranging the HELLO file manually into reading
order gives the wrong message not just to the users, but also to the
developers.

-- 
David Kastrup





^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-09-15  8:08                         ` David Kastrup
@ 2009-09-15 17:20                           ` Eli Zaretskii
  0 siblings, 0 replies; 40+ messages in thread
From: Eli Zaretskii @ 2009-09-15 17:20 UTC (permalink / raw)
  To: David Kastrup; +Cc: emacs-devel

> From: David Kastrup <dak@gnu.org>
> Date: Tue, 15 Sep 2009 10:08:20 +0200
> 
> I think that rearranging the HELLO file manually into reading order
> gives the wrong message not just to the users, but also to the
> developers.

Well, seeing the language name and the greeting in reverse for the
last 10 years didn't prompt any developer into doing something about
that, and the only developer working on this now doesn't really need
that as motivation.  So I wouldn't exaggerate the importance of this
nit.

(By ``reading order'', I presume that you mean visual order, not
logical order.  Apologies if I misunderstood.)

^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-09-15  5:20                       ` Kenichi Handa
  2009-09-15  8:08                         ` David Kastrup
@ 2009-09-15 17:12                         ` Eli Zaretskii
  1 sibling, 0 replies; 40+ messages in thread
From: Eli Zaretskii @ 2009-09-15 17:12 UTC (permalink / raw)
  To: Kenichi Handa; +Cc: kawabata.taichi, emacs-devel

> From: Kenichi Handa <handa@m17n.org>
> Cc: kawabata.taichi@gmail.com, emacs-devel@gnu.org
> Date: Tue, 15 Sep 2009 14:20:29 +0900
> 
> When one sees that Arabic is correctly displayed in HELLO, he may
> misunderstand that Arabic is supported.

That's why we left the greeting itself in the logical (a.k.a. reading)
order.  Only the name of the language is in visual order.  I think
this is a good compromise, and given that it was just changed in Emacs
23.1 (from the logical order for both the language name and the
greeting), I suggest that we leave it at that for the moment.

^ permalink raw reply	[flat|nested] 40+ messages in thread

* Re: new Emacs HELLO file??
  2009-09-08  0:12               ` kawabata.taichi
  2009-09-08  2:53                 ` Stephen J. Turnbull
  2009-09-08  3:11                 ` Eli Zaretskii
@ 2009-09-09  0:47                 ` Juri Linkov
  2 siblings, 0 replies; 40+ messages in thread
From: Juri Linkov @ 2009-09-09  0:47 UTC (permalink / raw)
  To: kawabata.taichi; +Cc: Chong Yidong, Andreas Schwab, emacs-devel

> Language Tags are not recommended by the Unicode (as long as there is
> other ways to describe language, such as XML lang attributes), but it is
> the only way to describe language in plain text.  Or switch we switch
> HELLO file from plain text to some XML format?

Or maybe switch to a simpler markup like in emacs/etc/enriched.doc
(and add the language tag support to this format).

> I'll attach updated HELLO examples as follows, that incorporates all the
> languages I could find "HELLO" word, plus Language Tag usage examples
> (someday Emacs may support it?) and fixed the logical order of Arabic
> language.

Thanks, your new list is surprisingly compact.  I have only a few comments.

1. In the current HELLO file the greeting in Slovak is correct
(see http://en.wiktionary.org/wiki/dobr%C3%BD_de%C5%88 )
and displays non-ASCII characters.  I think it's better to keep
the current greeting for Slovak.

2. Esperanto used to display a nice diacritical pangram "Eĥoŝanĝo ĉiuĵaŭde".
Even thought it's not a greeting, I think we should keep it.

3. You could replace the old Euro currency U+20A0 ₠ with the new one
U+20AC €.

4. Klingon uses the Private Use Area U+F8D0 - U+F8FF
http://www.wazu.jp/gallery/Test_Klingon.html

5. You've removed charsets jisx0201 and jisx0208 from Japanese greetings.
Is it because they are not needed for font selection, or just because
the transfer charset in your message was utf-8?

-- 
Juri Linkov
http://www.jurta.org/emacs/

^ permalink raw reply	[flat|nested] 40+ messages in thread

end of thread, other threads:[~2009-09-15 17:20 UTC | newest]

Thread overview: 40+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2009-08-28 21:27 new Emacs HELLO file?? kawabata.taichi
2009-08-28 22:19 ` Juri Linkov
2009-08-29  2:16   ` 川幡 太一
2009-08-29  3:31     ` Stefan Monnier
2009-08-29  5:46     ` Daniel Clemente
2009-08-29 22:43     ` Juri Linkov
2009-08-30  1:16       ` kawabata.taichi
2009-08-30  3:12         ` Eli Zaretskii
2009-08-31  3:33           ` Richard Stallman
2009-08-31 16:10             ` kawabata.taichi
2009-08-31 17:07               ` Rupert Swarbrick
2009-08-31 16:16             ` Juri Linkov
2009-08-30 20:48         ` Juri Linkov
2009-08-31 15:12           ` Chong Yidong
2009-08-31 16:14             ` Juri Linkov
2009-08-31 16:15             ` Andreas Schwab
2009-08-31 16:32               ` David Kastrup
2009-08-31 17:02                 ` Andreas Schwab
2009-09-01 11:41                   ` Kenichi Handa
2009-09-01 21:27                     ` Juri Linkov
2009-09-03 13:07                       ` Kenichi Handa
2009-09-06 18:20                         ` 牛粥
2009-09-06 20:01                           ` Eli Zaretskii
2009-09-06 21:08                             ` 牛粥
2009-09-07  1:02                             ` Kenichi Handa
2009-09-07  1:01                           ` Kenichi Handa
2009-08-31 17:55                 ` Eli Zaretskii
2009-08-31 22:03                   ` Andreas Schwab
2009-09-08  0:12               ` kawabata.taichi
2009-09-08  2:53                 ` Stephen J. Turnbull
2009-09-12 13:52                   ` Per Starbäck
2009-09-12 17:51                     ` 川幡 太一
2009-09-08  3:11                 ` Eli Zaretskii
2009-09-12 17:45                   `  Taichi KAWABATA 
2009-09-12 19:39                     ` Eli Zaretskii
2009-09-15  5:20                       ` Kenichi Handa
2009-09-15  8:08                         ` David Kastrup
2009-09-15 17:20                           ` Eli Zaretskii
2009-09-15 17:12                         ` Eli Zaretskii
2009-09-09  0:47                 ` Juri Linkov

Code repositories for project(s) associated with this external index

	https://git.savannah.gnu.org/cgit/emacs.git
	https://git.savannah.gnu.org/cgit/emacs/org-mode.git

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.