From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED!not-for-mail From: Michal Nazarewicz Newsgroups: gmane.emacs.bugs Subject: bug#24603: [RFC 13/18] Add some tricky Unicode characters to regex test Date: Tue, 4 Oct 2016 03:10:36 +0200 Message-ID: <1475543441-10493-13-git-send-email-mina86@mina86.com> References: <1475543441-10493-1-git-send-email-mina86@mina86.com> NNTP-Posting-Host: blaine.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit X-Trace: blaine.gmane.org 1475544043 24973 195.159.176.226 (4 Oct 2016 01:20:43 GMT) X-Complaints-To: usenet@blaine.gmane.org NNTP-Posting-Date: Tue, 4 Oct 2016 01:20:43 +0000 (UTC) To: 24603@debbugs.gnu.org Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Tue Oct 04 03:20:40 2016 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by blaine.gmane.org with esmtp (Exim 4.84_2) (envelope-from ) id 1brEP9-0003nf-SC for geb-bug-gnu-emacs@m.gmane.org; Tue, 04 Oct 2016 03:20:24 +0200 Original-Received: from localhost ([::1]:39763 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1brEP8-00036p-FL for geb-bug-gnu-emacs@m.gmane.org; Mon, 03 Oct 2016 21:20:22 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:56535) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1brEHC-0006oK-Tl for bug-gnu-emacs@gnu.org; Mon, 03 Oct 2016 21:12:12 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1brEH6-0002T3-Mf for bug-gnu-emacs@gnu.org; Mon, 03 Oct 2016 21:12:09 -0400 Original-Received: from debbugs.gnu.org ([208.118.235.43]:37366) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1brEH6-0002SY-J3 for bug-gnu-emacs@gnu.org; Mon, 03 Oct 2016 21:12:04 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1brEH6-0006jd-FJ for bug-gnu-emacs@gnu.org; Mon, 03 Oct 2016 21:12:04 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Michal Nazarewicz Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Tue, 04 Oct 2016 01:12:04 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 24603 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: Original-Received: via spool by 24603-submit@debbugs.gnu.org id=B24603.147554346825662 (code B ref 24603); Tue, 04 Oct 2016 01:12:04 +0000 Original-Received: (at 24603) by debbugs.gnu.org; 4 Oct 2016 01:11:08 +0000 Original-Received: from localhost ([127.0.0.1]:43531 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1brEGC-0006fe-63 for submit@debbugs.gnu.org; Mon, 03 Oct 2016 21:11:08 -0400 Original-Received: from mail-wm0-f46.google.com ([74.125.82.46]:38608) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1brEG5-0006cY-Fx for 24603@debbugs.gnu.org; Mon, 03 Oct 2016 21:11:02 -0400 Original-Received: by mail-wm0-f46.google.com with SMTP id p138so182361229wmb.1 for <24603@debbugs.gnu.org>; Mon, 03 Oct 2016 18:11:01 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=sender:from:to:subject:date:message-id:in-reply-to:references :mime-version:content-transfer-encoding; bh=s/0cLEqVK0hGvWxz9xdNR3Mix0eoFcWZTxYCdhwJ8a4=; b=CiDtMV1qS/VQMQUyU0YPtxp/9Th9XdTkmp2BX3AvTj9mz/+/yaQ79HsMaZPJTLNQME MSUdHd98XfLnK4+h/BFpOPAhYD1pdandDIfbLfWXzzQehz46x5/gBQE/SXDI0m0fi4yw o0g7Cq7FfjVQR0xUdnSMBEsdO0BGrTP96wb8aM2PsqilzVVt7T7SXWcnv6dxiJTwNEbi tennnh5asusqfHG3E69VQ6VE8S7Gb36omqhJi2AAqEtRng6tm/ADmPWu2qICfZfiHJxH iNogFnj9JZwaVSCnh8rewMR/2a1PSOeCByaHwIhCaF/LNZqW1/pcU3Cd2L+dFHgY1FYP BD4Q== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:sender:from:to:subject:date:message-id :in-reply-to:references:mime-version:content-transfer-encoding; bh=s/0cLEqVK0hGvWxz9xdNR3Mix0eoFcWZTxYCdhwJ8a4=; b=eM+KB6ph/WUualivo3LOKAaEhAaq4JrjcsNpNS3HxMI+0yQerqzfitUONw4Z0JX9C8 cCZJh0qc8Mez0zDJ/35JQU2Vkr/E5zYYAeoeaqJQ5RWPfJfChRs61EebdLJKbNuBWdeY vvkqa+C3HwgcfaY4bOYKDQZe3Y10N+YDoNAL0BdHY4Hzoal1IU8laXK/tyt1olvVXt/I 0Nrw+QgmrdxuY6PK83LYV9y19DqKYDms0NKhi1wGuNlAEZaAT9PZHccoLuk/BojucqQl Thpp7K7d4+E9E7BjOfDjp0ym7OqkDryy7DhO2X2Dm9tDRohCoDWlcgXpNZBpaJBNCUlO g2FQ== X-Gm-Message-State: AA6/9RnJSxYvttK/onNnDb+x9cYJQUU6RHyUKGzKjPxP6SF0ZqtXK99PeilEtIRq13veMKLs X-Received: by 10.28.137.18 with SMTP id l18mr1062711wmd.70.1475543455591; Mon, 03 Oct 2016 18:10:55 -0700 (PDT) Original-Received: from mpn.zrh.corp.google.com ([172.16.113.135]) by smtp.gmail.com with ESMTPSA id 137sm21513631wmi.16.2016.10.03.18.10.51 for <24603@debbugs.gnu.org> (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Mon, 03 Oct 2016 18:10:53 -0700 (PDT) Original-Received: by mpn.zrh.corp.google.com (Postfix, from userid 126942) id 0D2C81E029B; Tue, 4 Oct 2016 03:10:48 +0200 (CEST) X-Mailer: git-send-email 2.8.0.rc3.226.g39d4020 In-Reply-To: <1475543441-10493-1-git-send-email-mina86@mina86.com> X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 208.118.235.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.org gmane.emacs.bugs:124002 Archived-At: * test/src/regex-tests.el: Include capital ‘DZ’ dygraph, sharp ‘s’, capital ligature ‘IJ’, small ligature ‘fi’, title-case dygraph ‘Dz’, all three forms of Greek sigma and and IPA ɕ symbol in the regex tests. --- test/src/regex-tests.el | 25 ++++++++++++++----------- 1 file changed, 14 insertions(+), 11 deletions(-) diff --git a/test/src/regex-tests.el b/test/src/regex-tests.el index c4844c7..fa66ff1 100644 --- a/test/src/regex-tests.el +++ b/test/src/regex-tests.el @@ -65,27 +65,30 @@ regex--test-cc (skip-chars-forward (concat "[:" name ":]\u2622")) (should (or (equal (point) p) (equal (point) (1+ p)))))))) -(dolist (test '(("alnum" "abcABC012łąka" "-, \t\n") - ("alpha" "abcABCłąka" "-,012 \t\n") +(dolist (test '(("alnum" "abcABC012łąkaDZßIJfiDzΣσςɕ" "-, \t\n") + ("alpha" "abcABCłąkaDZßIJfiDzΣσςɕ" "-,012 \t\n") ("digit" "012" "abcABCłąka-, \t\n") ("xdigit" "0123aBc" "łąk-, \t\n") - ("upper" "ABCŁĄKA" "abc012-, \t\n") - ("lower" "abcłąka" "ABC012-, \t\n") + ("upper" "ABCŁĄKADZIJΣ" "abcß0fiσςɕ12-, \t\n") + ;; FIXME: ßfiɕ are all lower case (even though they don’t have + ;; (single-character) upper-case form). + ("lower" "abcłąkaσς" "ABC012DZIJΣ-, \t\n") - ("word" "abcABC012\u2620" "-, \t\n") + ("word" "abcABC012\u2620DZßIJfiDzΣσςɕ" "-, \t\n") ("punct" ".,-" "abcABC012\u2620 \t\n") ("cntrl" "\1\2\t\n" ".,-abcABC012\u2620 ") - ("graph" "abcłąka\u2620-," " \t\n\1") - ("print" "abcłąka\u2620-, " "\t\n\1") + ("graph" "abcłąka\u2620-,DZßIJfiDzΣσςɕ" " \t\n\1") + ("print" "abcłąka\u2620-,DZßIJfiDzΣσςɕ " "\t\n\1") ("space" " \t\n\u2001" "abcABCł0123") ("blank" " \t" "\n\u2001") - ("ascii" "abcABC012 \t\n\1" "łą\u2620") - ("nonascii" "łą\u2622" "abcABC012 \t\n\1") - ("unibyte" "abcABC012 \t\n\1" "łą\u2622") - ("multibyte" "łą\u2622" "abcABC012 \t\n\1"))) + ("ascii" "abcABC012 \t\n\1" "łą\u2620DZßIJfiDzΣσςɕ") + ("nonascii" "łą\u2622DZßIJfiDzΣσςɕ" "abcABC012 \t\n\1") + ;; Note: sharp s is unibyte since it’s code point is below 256. + ("unibyte" "abcABC012ß \t\n\1" "łą\u2622DZIJfiDzΣσςɕ") + ("multibyte" "łą\u2622DZIJfiDzΣσςɕ" "abcABC012ß \t\n\1"))) (let ((name (intern (concat "regex-tests-" (car test) "-character-class"))) (doc (concat "Perform sanity test of regexes using " (car test) " character class. -- 2.8.0.rc3.226.g39d4020