From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.bugs Subject: bug#19878: 24.4; Syntax class [:alpha:] wrongly matches the Indian digits =?UTF-8?Q?=DB=B1=DB=B2=DB=B3=DB=B4=DB=B5=DB=B6=DB=B7=DB=B8=DB=B9=DB=B0?= as letter Date: Sat, 28 Feb 2015 14:29:52 +0200 Message-ID: <83bnkete0v.fsf@gnu.org> References: <87k2zjj5gy.fsf@hochschule-trier.de> <838ufw7bzi.fsf@gnu.org> Reply-To: Eli Zaretskii NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Trace: ger.gmane.org 1425126683 29728 80.91.229.3 (28 Feb 2015 12:31:23 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Sat, 28 Feb 2015 12:31:23 +0000 (UTC) Cc: 19878-done@debbugs.gnu.org To: politza@hochschule-trier.de, mohammad.mahmoudi@gmail.com Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Sat Feb 28 13:31:11 2015 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1YRgY3-0005lD-B1 for geb-bug-gnu-emacs@m.gmane.org; Sat, 28 Feb 2015 13:31:11 +0100 Original-Received: from localhost ([::1]:40946 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YRgY2-0008Rn-KT for geb-bug-gnu-emacs@m.gmane.org; Sat, 28 Feb 2015 07:31:10 -0500 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:40713) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YRgXy-0008Lu-H2 for bug-gnu-emacs@gnu.org; Sat, 28 Feb 2015 07:31:07 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1YRgXv-0005CY-Ag for bug-gnu-emacs@gnu.org; Sat, 28 Feb 2015 07:31:06 -0500 Original-Received: from debbugs.gnu.org ([140.186.70.43]:56830) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YRgXv-0005CC-82 for bug-gnu-emacs@gnu.org; Sat, 28 Feb 2015 07:31:03 -0500 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.80) (envelope-from ) id 1YRgXv-0004Qk-1H for bug-gnu-emacs@gnu.org; Sat, 28 Feb 2015 07:31:03 -0500 Resent-From: Eli Zaretskii Original-Sender: "Debbugs-submit" Resent-To: bug-gnu-emacs@gnu.org Resent-Date: Sat, 28 Feb 2015 12:31:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: cc-closed 19878 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: Mail-Followup-To: 19878@debbugs.gnu.org, eliz@gnu.org, mohammad.mahmoudi@gmail.com Original-Received: via spool by 19878-done@debbugs.gnu.org id=D19878.142512660813129 (code D ref 19878); Sat, 28 Feb 2015 12:31:02 +0000 Original-Received: (at 19878-done) by debbugs.gnu.org; 28 Feb 2015 12:30:08 +0000 Original-Received: from localhost ([127.0.0.1]:60427 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1YRgX1-0003P3-2M for submit@debbugs.gnu.org; Sat, 28 Feb 2015 07:30:07 -0500 Original-Received: from mtaout27.012.net.il ([80.179.55.183]:53335) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1YRgWw-0003MV-ER for 19878-done@debbugs.gnu.org; Sat, 28 Feb 2015 07:30:04 -0500 Original-Received: from conversion-daemon.mtaout27.012.net.il by mtaout27.012.net.il (HyperSendmail v2007.08) id <0NKH00600DK78Z00@mtaout27.012.net.il> for 19878-done@debbugs.gnu.org; Sat, 28 Feb 2015 14:24:27 +0200 (IST) Original-Received: from HOME-C4E4A596F7 ([87.69.4.28]) by mtaout27.012.net.il (HyperSendmail v2007.08) with ESMTPA id <0NKH00OWNEGRFS80@mtaout27.012.net.il>; Sat, 28 Feb 2015 14:24:27 +0200 (IST) In-reply-to: <838ufw7bzi.fsf@gnu.org> X-012-Sender: halo1@inter.net.il X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 3.x X-Received-From: 140.186.70.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.bugs:99897 Archived-At: > Date: Tue, 17 Feb 2015 18:13:05 +0200 > From: Eli Zaretskii > Cc: mohammad.mahmoudi@gmail.com, 19878@debbugs.gnu.org > > > From: Andreas Politz > > Date: Sun, 15 Feb 2015 21:16:13 +0100 > > Cc: 19878@debbugs.gnu.org > > > > > > I think this is supposed to be: > > > > ,----[ (info "(elisp) Char Classes") ] > > | `[:alpha:]' > > | This matches any letter. (At present, for multibyte characters, it > > | matches anything that has word syntax.) > > `---- > > Indeed, which doesn't sound very nice. > > Does someone object to the changes below (to be installed on master)? > They make [:alpha:] and [:alnum:] closer to the Unicode > recommendations in UTS #18, although we are still very far from > supporting even Level 1 of conformance. But these two seem like > low-hanging fruit to me. > > The modified definitions of these two sets are not 100% compatible > with the old ones for the multibyte characters. However, if it turns > out that some code used these to get word-constituent characters, > those places should simply be changed to use \sw instead. No further comments, so I pushed the changes as commit 1a50945 on the master branch, and I'm marking this bug closed. > Also, does someone see any potential problem to make [:digit:] be a > superset of the current ASCII-only set, to match UTS #18 as well? The > comment in regex.c says it is "only used for single-byte characters", > but it isn't clear to me whether this is a requirement, i.e. there's > some code in Emacs that relies on that, or just a statement of facts. I'd still like to hear an answer and/or opinions about this. If I hear no comments, I will look into making a similar change to [:digit:] soon.