From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.bugs Subject: bug#24603: [RFC 02/18] Generate upcase and downcase tables from Unicode data Date: Tue, 04 Oct 2016 18:06:55 +0300 Message-ID: <83oa30m9v4.fsf@gnu.org> References: <1475543441-10493-1-git-send-email-mina86@mina86.com> <1475543441-10493-2-git-send-email-mina86@mina86.com> <83a8eko9pl.fsf@gnu.org> Reply-To: Eli Zaretskii NNTP-Posting-Host: blaine.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Trace: blaine.gmane.org 1475593736 2834 195.159.176.226 (4 Oct 2016 15:08:56 GMT) X-Complaints-To: usenet@blaine.gmane.org NNTP-Posting-Date: Tue, 4 Oct 2016 15:08:56 +0000 (UTC) Cc: 24603@debbugs.gnu.org To: Michal Nazarewicz Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Tue Oct 04 17:08:46 2016 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by blaine.gmane.org with esmtp (Exim 4.84_2) (envelope-from ) id 1brRKQ-0005Gw-Ux for geb-bug-gnu-emacs@m.gmane.org; Tue, 04 Oct 2016 17:08:23 +0200 Original-Received: from localhost ([::1]:43393 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1brRKP-0003iy-BM for geb-bug-gnu-emacs@m.gmane.org; Tue, 04 Oct 2016 11:08:21 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:43475) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1brRKA-0003dV-RQ for bug-gnu-emacs@gnu.org; Tue, 04 Oct 2016 11:08:12 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1brRK5-00057d-Rn for bug-gnu-emacs@gnu.org; Tue, 04 Oct 2016 11:08:05 -0400 Original-Received: from debbugs.gnu.org ([208.118.235.43]:38447) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1brRK5-00057T-Oo for bug-gnu-emacs@gnu.org; Tue, 04 Oct 2016 11:08:01 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1brRK5-0000vS-Md for bug-gnu-emacs@gnu.org; Tue, 04 Oct 2016 11:08:01 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Eli Zaretskii Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Tue, 04 Oct 2016 15:08:01 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 24603 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: Original-Received: via spool by 24603-submit@debbugs.gnu.org id=B24603.14755936403512 (code B ref 24603); Tue, 04 Oct 2016 15:08:01 +0000 Original-Received: (at 24603) by debbugs.gnu.org; 4 Oct 2016 15:07:20 +0000 Original-Received: from localhost ([127.0.0.1]:44637 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1brRJM-0000uU-BZ for submit@debbugs.gnu.org; Tue, 04 Oct 2016 11:07:20 -0400 Original-Received: from eggs.gnu.org ([208.118.235.92]:51983) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1brRJH-0000u8-Np for 24603@debbugs.gnu.org; Tue, 04 Oct 2016 11:07:15 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1brRJ7-0003LS-Mb for 24603@debbugs.gnu.org; Tue, 04 Oct 2016 11:07:06 -0400 Original-Received: from fencepost.gnu.org ([2001:4830:134:3::e]:49043) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1brRJ7-0003F2-IU; Tue, 04 Oct 2016 11:07:01 -0400 Original-Received: from 84.94.185.246.cable.012.net.il ([84.94.185.246]:3513 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_128_CBC_SHA1:128) (Exim 4.82) (envelope-from ) id 1brRJ5-0006Nw-3V; Tue, 04 Oct 2016 11:07:00 -0400 In-reply-to: (message from Michal Nazarewicz on Tue, 04 Oct 2016 16:54:31 +0200) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 208.118.235.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.org gmane.emacs.bugs:124041 Archived-At: > From: Michal Nazarewicz > Cc: 24603@debbugs.gnu.org > Date: Tue, 04 Oct 2016 16:54:31 +0200 > > + ;; Ⅰ through Ⅻ had word syntax in the past so set it here as well. > + ;; General category of those characers is Number, Letter. > + (modify-syntax-entry '(#x2160 . #x216b) "w " syn-tab) > + > + ;; ⓐ thourgh ⓩ are symbols, other according to Unicode but Emacs set > + ;; their syntax to word in the past so keep backwards compatibility. > + (modify-syntax-entry '(#x24D0 . #x24E9) "w " syn-tab)) I think we should document all the changes. If the list of changes is too long, and cannot be made short enough by summarizing (instead of showing each individual character), then it probably should go into some separate file in admin/unidata/. If it can be short enough, then a comment in characters.el is a better place, I think. > I get the following (annotated) differences: Can you add the name of each character (just one, the leftmost one) to its line and post the result? It's hard to read the report when it only shows codepoints. Thanks.