From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.bugs Subject: bug#24603: [RFC 02/18] Generate upcase and downcase tables from Unicode data Date: Tue, 04 Oct 2016 10:27:18 +0300 Message-ID: <83a8eko9pl.fsf@gnu.org> References: <1475543441-10493-1-git-send-email-mina86@mina86.com> <1475543441-10493-2-git-send-email-mina86@mina86.com> Reply-To: Eli Zaretskii NNTP-Posting-Host: blaine.gmane.org X-Trace: blaine.gmane.org 1475566114 16402 195.159.176.226 (4 Oct 2016 07:28:34 GMT) X-Complaints-To: usenet@blaine.gmane.org NNTP-Posting-Date: Tue, 4 Oct 2016 07:28:34 +0000 (UTC) Cc: 24603@debbugs.gnu.org To: Michal Nazarewicz Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Tue Oct 04 09:28:30 2016 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by blaine.gmane.org with esmtp (Exim 4.84_2) (envelope-from ) id 1brK96-0001w3-VB for geb-bug-gnu-emacs@m.gmane.org; Tue, 04 Oct 2016 09:28:13 +0200 Original-Received: from localhost ([::1]:40618 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1brK95-0000uf-Fl for geb-bug-gnu-emacs@m.gmane.org; Tue, 04 Oct 2016 03:28:11 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:42537) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1brK8z-0000uZ-TV for bug-gnu-emacs@gnu.org; Tue, 04 Oct 2016 03:28:06 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1brK8w-0003qY-HA for bug-gnu-emacs@gnu.org; Tue, 04 Oct 2016 03:28:05 -0400 Original-Received: from debbugs.gnu.org ([208.118.235.43]:37519) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1brK8w-0003qC-Dl for bug-gnu-emacs@gnu.org; Tue, 04 Oct 2016 03:28:02 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1brK8w-0001lS-7o for bug-gnu-emacs@gnu.org; Tue, 04 Oct 2016 03:28:02 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Eli Zaretskii Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Tue, 04 Oct 2016 07:28:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 24603 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: Original-Received: via spool by 24603-submit@debbugs.gnu.org id=B24603.14755660506731 (code B ref 24603); Tue, 04 Oct 2016 07:28:02 +0000 Original-Received: (at 24603) by debbugs.gnu.org; 4 Oct 2016 07:27:30 +0000 Original-Received: from localhost ([127.0.0.1]:43709 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1brK8Q-0001kV-Fa for submit@debbugs.gnu.org; Tue, 04 Oct 2016 03:27:30 -0400 Original-Received: from eggs.gnu.org ([208.118.235.92]:51424) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1brK8P-0001kH-Ah for 24603@debbugs.gnu.org; Tue, 04 Oct 2016 03:27:29 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1brK8H-0003L3-4Z for 24603@debbugs.gnu.org; Tue, 04 Oct 2016 03:27:24 -0400 Original-Received: from fencepost.gnu.org ([2001:4830:134:3::e]:43744) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1brK8H-0003KG-1r; Tue, 04 Oct 2016 03:27:21 -0400 Original-Received: from 84.94.185.246.cable.012.net.il ([84.94.185.246]:2503 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_128_CBC_SHA1:128) (Exim 4.82) (envelope-from ) id 1brK8F-0002tr-6e; Tue, 04 Oct 2016 03:27:19 -0400 In-reply-to: <1475543441-10493-2-git-send-email-mina86@mina86.com> (message from Michal Nazarewicz on Tue, 4 Oct 2016 03:10:25 +0200) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 208.118.235.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.org gmane.emacs.bugs:124022 Archived-At: > From: Michal Nazarewicz > Date: Tue, 4 Oct 2016 03:10:25 +0200 > > + ;; Set all Letter, uppercase; Letter, lowercase and Letter, titlecase syntax > + ;; to word. FIXME: Should this also be done for Letter, modifier and Letter, > + ;; other? What about other alphabetic characters? > + (let ((syn-tab (standard-syntax-table))) > + (map-char-table > + (lambda (ch cat) > + (when (memq cat '(Lu Ll Lt)) > + (modify-syntax-entry ch "w " syn-tab))) > + (unicode-property-table-internal 'general-category))) The answer to these questions is "as required by backward compatibility", i.e. compare with the manual setup we had until now. If that criterion doesn't provide the full answer, I would go by Unicode guidance, i.e. support all the case conversions specified in the Unicode character database (UCD). Thanks.