From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.devel Subject: Re: ucs-normalize and diacritics Date: Thu, 26 Jul 2018 21:41:23 +0300 Message-ID: <83bmatu9f0.fsf@gnu.org> References: <8736w88pnn.fsf@gmail.com> <83lga0v4ff.fsf@gnu.org> <83in54v3sp.fsf@gnu.org> <87y3e07425.fsf@gmail.com> <83h8koujs9.fsf@gnu.org> <87pnzb7ogd.fsf@gmail.com> <874lgn74sw.fsf@gmail.com> <83va93tlb4.fsf@gnu.org> <877eli5r02.fsf@gmail.com> NNTP-Posting-Host: blaine.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Trace: blaine.gmane.org 1532630369 17138 195.159.176.226 (26 Jul 2018 18:39:29 GMT) X-Complaints-To: usenet@blaine.gmane.org NNTP-Posting-Date: Thu, 26 Jul 2018 18:39:29 +0000 (UTC) Cc: emacs-devel@gnu.org To: Robert Pluim Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Thu Jul 26 20:39:25 2018 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by blaine.gmane.org with esmtp (Exim 4.84_2) (envelope-from ) id 1filAb-0004MU-3B for ged-emacs-devel@m.gmane.org; Thu, 26 Jul 2018 20:39:25 +0200 Original-Received: from localhost ([::1]:37703 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1filCi-00050e-0X for ged-emacs-devel@m.gmane.org; Thu, 26 Jul 2018 14:41:36 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:34795) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1filCb-00050Y-IC for emacs-devel@gnu.org; Thu, 26 Jul 2018 14:41:30 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1filCY-00008e-6y for emacs-devel@gnu.org; Thu, 26 Jul 2018 14:41:29 -0400 Original-Received: from fencepost.gnu.org ([2001:4830:134:3::e]:59537) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1filCY-00008O-28; Thu, 26 Jul 2018 14:41:26 -0400 Original-Received: from [176.228.60.248] (port=4845 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1filCW-0006lT-5m; Thu, 26 Jul 2018 14:41:25 -0400 In-reply-to: <877eli5r02.fsf@gmail.com> (message from Robert Pluim on Thu, 26 Jul 2018 10:40:45 +0200) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 2001:4830:134:3::e X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.21 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: "Emacs-devel" Xref: news.gmane.org gmane.emacs.devel:227838 Archived-At: > From: Robert Pluim > Cc: emacs-devel@gnu.org > Date: Thu, 26 Jul 2018 10:40:45 +0200 > > How about something like: > > As a special case, if the character lies in the range #x3fff80 > through #x3fff9a (128 through 159 decimal, with prefix #x3fff), it > stands for a raw byte that does not correspond to any specific > displayable character. Such a character lies within the > @code{eight-bit-control} character set, and is displayed as an escaped > octal character code (0200 through 0237), or as an escaped hex > character code (x80 through x9a) if @code{display-raw-bytes-as-hex} is > non-@code{nil}. Thanks, but the original text was wrong in more than one sense, and needed a more thorough fix. I pushed a fix, please see if the new text is clear and accurate. > Iʼm not sure the 'eight-bit-control' part is true, given the reference > to 'tis620-2533' in the what-cursor-position output. Nowadays we have a single character set 'eight-bit' that covers the whole range #80 to #ff; the old partial charsets eight-bit-control and eight-bit-graphic should not be mentioned in the user documentation.