From mboxrd@z Thu Jan  1 00:00:00 1970
From: Paul Eggert
Newsgroups: gmane.emacs.devel
Subject: Re: [Emacs-diffs] master db828f6: Don't rely on defaults in decoding UTF-8 encoded Lisp files
Date: Sun, 27 Sep 2015 13:21:51 -0700
Organization: UCLA Computer Science Department
Message-ID: <56084FDF.704@cs.ucla.edu>
References: <20150921165211.20434.28114@vcs.savannah.gnu.org>
 <83fv27mt7r.fsf@gnu.org> <83wpvfix7i.fsf@gnu.org> <83fv23hr0z.fsf@gnu.org>
 <5605CB6B.4000102@cs.ucla.edu> <83twqhhf0g.fsf@gnu.org>
 <5606AC48.7090801@cs.ucla.edu> <83zj09fbzp.fsf@gnu.org>
 <5606C140.6090309@cs.ucla.edu> <878u7trwlb.fsf@fencepost.gnu.org>
 <5606E995.2000102@cs.ucla.edu> <83si61ezxd.fsf@gnu.org>
 <560700E1.4010403@cs.ucla.edu> <83pp14fhj5.fsf@gnu.org>
 <87io6wqpf5.fsf@fencepost.gnu.org> <83bncof9w2.fsf@gnu.org>
In-Reply-To: <83bncof9w2.fsf@gnu.org>
Mime-Version: 1.0
Content-Type: text/plain; charset=utf-8; format=flowed
To: Eli Zaretskii, Rustom Mody
Cc: emacs-devel@gnu.org
List-Id: "Emacs development discussions."
Xref: news.gmane.org gmane.emacs.devel:190417

Eli Zaretskii wrote:
> This is unrelated: it specifies which character sequences should be
> composed and displayed as a single grapheme cluster.

Yes.  It might be reasonable to replace some of those \u instances for
readability, e.g.:

-  ("V" . "[\u0904-\u0914\u0960-\u0961\u0972]") ; independent vowel
+  ("V" . "[ऄ-औॠ-ॡॲ]") ; independent vowel

But replacements would not be such a good idea for some of this code, e.g.:

-  ("H" . "\u094D") ; HALANT
+  ("H" . "्") ; HALANT

as standalone combining characters are problematic on display, and here:

-  ("J" . "\u200D") ; ZWJ
+  ("J" . "‍") ; ZWJ

where one can't easily see a zero width joiner when editing the source file.  I
expect that whoever wrote that code felt more comfortable sticking with \u
escapes uniformly, rather than using \u sometimes and not other times.
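
The distinction drawn above (letters that read well as literals versus combining or invisible characters that are better left as escapes) can be checked mechanically. What follows is only a rough sketch in Python rather than Emacs Lisp, and it is not taken from the code under discussion. The helper names `safe_to_write_literally` and `preferred_form` are hypothetical, and the choice of Unicode general categories (Mn, Mc, Me for combining marks, Cf for format characters) is an assumed heuristic, not a rule anyone in the thread proposed.

```python
import unicodedata

# Code points copied from the \u escapes quoted in the message.
INDEPENDENT_VOWEL = "\u0904"
HALANT = "\u094D"
ZWJ = "\u200D"

# Assumed heuristic: combining marks (Mn, Mc, Me) and format
# characters (Cf) are hard to see or edit as standalone literals.
HARD_TO_SEE = {"Mn", "Mc", "Me", "Cf"}


def safe_to_write_literally(ch):
    """Return True if CH is likely readable as a literal in source."""
    return unicodedata.category(ch) not in HARD_TO_SEE


def preferred_form(ch):
    """Return CH itself, or a \\uXXXX escape when a literal would be obscure."""
    if safe_to_write_literally(ch):
        return ch
    return "\\u%04X" % ord(ch)


if __name__ == "__main__":
    for ch in (INDEPENDENT_VOWEL, HALANT, ZWJ):
        print("U+%04X" % ord(ch), unicodedata.category(ch), preferred_form(ch))
```

Under this heuristic the independent vowel is kept as a literal, while the halant and the zero width joiner stay as escapes. A file converted this way would mix the two styles, which is the non-uniformity the message suggests the original author may have wanted to avoid.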