From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED!not-for-mail From: Stefan Monnier Newsgroups: gmane.emacs.devel Subject: Re: Encoding of etc/HELLO Date: Mon, 23 Apr 2018 11:23:39 -0400 Message-ID: References: <83sh7qxb5j.fsf@gnu.org> <87po2t6gdm.fsf@gmx.de> <83muxxyijl.fsf@gnu.org> <87efj96aly.fsf@gmx.de> <83in8lxceo.fsf@gnu.org> <83o9iavu5e.fsf@gnu.org> NNTP-Posting-Host: blaine.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Trace: blaine.gmane.org 1524496915 26863 195.159.176.226 (23 Apr 2018 15:21:55 GMT) X-Complaints-To: usenet@blaine.gmane.org NNTP-Posting-Date: Mon, 23 Apr 2018 15:21:55 +0000 (UTC) User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.0.50 (gnu/linux) To: emacs-devel@gnu.org Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Mon Apr 23 17:21:51 2018 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by blaine.gmane.org with esmtp (Exim 4.84_2) (envelope-from ) id 1fAdHr-0006r2-7c for ged-emacs-devel@m.gmane.org; Mon, 23 Apr 2018 17:21:51 +0200 Original-Received: from localhost ([::1]:54009 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1fAdJw-0001Ft-7w for ged-emacs-devel@m.gmane.org; Mon, 23 Apr 2018 11:24:00 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:52798) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1fAdJq-0001Fd-2F for emacs-devel@gnu.org; Mon, 23 Apr 2018 11:23:54 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1fAdJm-0002Vd-4O for emacs-devel@gnu.org; Mon, 23 Apr 2018 11:23:54 -0400 Original-Received: from [195.159.176.226] (port=57966 helo=blaine.gmane.org) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1fAdJl-0002V0-TM for emacs-devel@gnu.org; Mon, 23 Apr 2018 11:23:50 -0400 Original-Received: from list by blaine.gmane.org with local (Exim 4.84_2) (envelope-from ) id 1fAdHb-0006ab-BE for emacs-devel@gnu.org; Mon, 23 Apr 2018 17:21:35 +0200 X-Injected-Via-Gmane: http://gmane.org/ Original-Lines: 21 Original-X-Complaints-To: usenet@blaine.gmane.org Cancel-Lock: sha1:eC/ClSgkK0M9UK2MDZZaiK3mWv8= X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] [fuzzy] X-Received-From: 195.159.176.226 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.21 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: "Emacs-devel" Xref: news.gmane.org gmane.emacs.devel:224811 Archived-At: >> But along the way they discovered that it's sometimes difficult to >> decide whether two "things" should be consider as one and the same >> character or not. They ended up with a set of "rules" to make those >> decisions, but it's not nearly as simple as "each character has one and >> only one encoding". > Not sure what you allude to here. For example the fact that some CJK characters should be displayed differently depending on whether they're part of a C text, or a J text, or a K text, so are they really "one and the same character"? Of course, there are other related choices: which versions of β should be one and the same and which shouldn't (e.g. I currently see in Unicode a greek and a latin version plus some variants of a math version (tho none in "roman" shape))? There are murky areas, with no "one right answer", although Unicode has had to choose somehow, i.e. doing the best it can with a messy situation. Stefan