From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Sebastian Urban Newsgroups: gmane.emacs.bugs Subject: bug#42602: Wrong (not-)casechars value for "polish" in ispell-dictionary-base-alist Date: Wed, 29 Jul 2020 18:12:02 +0200 Message-ID: <2f58556a-8f0f-f923-2716-5366d66fa44d@gmail.com> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="4320"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Mozilla/5.0 (Windows NT 6.1; Win64; x64; rv:68.0) Gecko/20100101 Thunderbird/68.10.0 To: 42602@debbugs.gnu.org Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Wed Jul 29 18:13:11 2020 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1k0ohb-0000y5-OA for geb-bug-gnu-emacs@m.gmane-mx.org; Wed, 29 Jul 2020 18:13:11 +0200 Original-Received: from localhost ([::1]:47704 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1k0oha-0005me-Ob for geb-bug-gnu-emacs@m.gmane-mx.org; Wed, 29 Jul 2020 12:13:10 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:58624) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1k0ohS-0005kk-5Y for bug-gnu-emacs@gnu.org; Wed, 29 Jul 2020 12:13:02 -0400 Original-Received: from debbugs.gnu.org ([209.51.188.43]:50226) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1k0ohR-0002RN-RY for bug-gnu-emacs@gnu.org; Wed, 29 Jul 2020 12:13:01 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1k0ohR-0000iL-NO for bug-gnu-emacs@gnu.org; Wed, 29 Jul 2020 12:13:01 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Sebastian Urban Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Wed, 29 Jul 2020 16:13:01 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: report 42602 X-GNU-PR-Package: emacs X-Debbugs-Original-To: Bug GNU Emacs Original-Received: via spool by submit@debbugs.gnu.org id=B.15960391302677 (code B ref -1); Wed, 29 Jul 2020 16:13:01 +0000 Original-Received: (at submit) by debbugs.gnu.org; 29 Jul 2020 16:12:10 +0000 Original-Received: from localhost ([127.0.0.1]:33538 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1k0ogc-0000h6-2I for submit@debbugs.gnu.org; Wed, 29 Jul 2020 12:12:10 -0400 Original-Received: from lists.gnu.org ([209.51.188.17]:47822) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1k0ogZ-0000gy-GQ for submit@debbugs.gnu.org; Wed, 29 Jul 2020 12:12:08 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:58328) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1k0ogZ-0004m6-AM for bug-gnu-emacs@gnu.org; Wed, 29 Jul 2020 12:12:07 -0400 Original-Received: from mail-ej1-x62e.google.com ([2a00:1450:4864:20::62e]:36990) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1k0ogU-0002Fi-Dg for bug-gnu-emacs@gnu.org; Wed, 29 Jul 2020 12:12:06 -0400 Original-Received: by mail-ej1-x62e.google.com with SMTP id qc22so10146223ejb.4 for ; Wed, 29 Jul 2020 09:12:01 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:subject:to:message-id:date:user-agent:mime-version :content-language:content-transfer-encoding; bh=v0X/1WNo+KnIWuodDVqYYu/vE8+2OafT/zqRJReAYXI=; b=cvjQiXpeXZhF4q/kLaJeQ73C0KhygabkYTfVVV7HAm/lSnw+SQ+/Wq8INUl4Vfv1mm D4/uWWqLSOFSMrf1kNrKT174biY3OqaPEOQ0BOKSqKGHQKY2FWgbh13/QQbEMYSgg2OE JYrbJHTCKmX13URBlZf21aJ9vkh7/VBsXsWPGHUJ7comUlBuqmTc1UMfbJ4/evAwFLsh zuCZG7dWgbbszUP7VLkKjjnBGKSynmORuN637dCVPGCNWtXm/yN+otQaW5aLPK+I15dU IdOLUJbcFTbMXRkrnuEE7FaxRzomz0yXXyuzBo760sSgcdblxbBQTcPCrPLA/K6kqpOj MgHA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:subject:to:message-id:date:user-agent :mime-version:content-language:content-transfer-encoding; bh=v0X/1WNo+KnIWuodDVqYYu/vE8+2OafT/zqRJReAYXI=; b=ZYGnN/kmrDpf3bDDKVI+rRnFc0dcTDxX+ojgk5JKA15GXdcQOD/wZhR8x2ke1lRJNH 9tshzGjIPCoOUAatJ2tYBiUMu6V84tFah21GjzHVL/wS7ScraEGECpthuVmuGl6u8S9y Mm4z8kOM1BVvVQqJN6Hnil/sJa+2DsfpREM5J6QU6aHpbbmxxDBc8ZqqLKI2ATT7DusK fu6fR05HcE5sIglfTTP08vftVr7atOWBLJ8eLQepDYq3osDwxFzZeZSy8g6CSEE077IH ACVeEgD0nNDDtt7x3p7Atdr6ZJb7ZcRMSiQcinveD54c9xs5YfK0CTGwiTNqtCKWHBHv 2pRQ== X-Gm-Message-State: AOAM5328UOc9LEqmNkIbon8iUhn1Petn/qFVEhX2vP0nRlgJ3U36AY/B 0fgl5EOvyFC6NAUahgDM9hvkRfzR X-Google-Smtp-Source: ABdhPJzl/kqe4gHK7b6XHVJ0uyIAsZEvCK+mWXzGRqEs0FRjDPliHp49lQ04dUW5VNtTJl2FqDGoDg== X-Received: by 2002:a17:906:37d2:: with SMTP id o18mr15966828ejc.162.1596039119874; Wed, 29 Jul 2020 09:11:59 -0700 (PDT) Original-Received: from ?IPv6:2a00:f41:184b:2a09:2d6f:1042:9358:c5b? ([2a00:f41:184b:2a09:2d6f:1042:9358:c5b]) by smtp.gmail.com with ESMTPSA id cn16sm2131924edb.86.2020.07.29.09.11.59 for (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Wed, 29 Jul 2020 09:11:59 -0700 (PDT) Content-Language: en-GB Received-SPF: pass client-ip=2a00:1450:4864:20::62e; envelope-from=mrsebastianurban@gmail.com; helo=mail-ej1-x62e.google.com X-detected-operating-system: by eggs.gnu.org: No matching host in p0f cache. That's all we know. X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.io gmane.emacs.bugs:183666 Archived-At: Hello, for words like: męski miód klątwa ślad łuk żaba źrebak grzać bańka ispell.el sends to Aspell only part of the word, e.g. "lad" instead of "ślad", or "kl"/"twa" (depending on the cursor position) instead of "klątwa". I think this is because wrong value of (NOT-)CASECHARS, which is ASCII A-z letters and a few chars of which only ó/Ó is valid for Polish. Although, for some reason, it doesn't recognize "ó" in word "miód", sending "mi" or "d". It is on the list of CASECHARS under \363, so it should work. Moreover, if I type in regexp-builder "[\363\323]" it won't recognize ó/Ó, but it doesn't have a problem with other Polish chars, like "ł" ("[\502]") or "ż" ("[\574]"). If I put in my init.el: --8<---------------cut here---------------start------------->8--- (setq ispell-program-name "C:/cygwin64/bin/aspell") (add-hook 'ispell-initialize-spellchecker-hook (lambda () (add-to-list 'ispell-local-dictionary-alist '("pl" ;; "[[:alpha:]]" ;; "[^[:alpha:]]" ;; ęóąśłżźćńĘÓĄŚŁŻŹĆŃ "[A-Za-z\431\363\405\533\502\574\572\407\504\430\323\404\532\501\573\571\406\503]" "[^A-Za-z\431\363\405\533\502\574\572\407\504\430\323\404\532\501\573\571\406\503]" "[.]" nil nil nil iso-8859-2)))) (setq ispell-dictionary "pl") --8<---------------cut here---------------start------------->8--- everything seems to work, even ó/Ó are recognised. "[[:alpha:]]" works as well, so I leaved it as an alternative. Changing from iso-8859-2 to utf-8 doesn't break anything. Tested on: - GNU Emacs 26.3 (build 1, x86_64-w64-mingw32) of 2019-08-29, - GNU Emacs 28.0.50 (build 1, x86_64-w64-mingw32) of 2020-07-05, with Aspell from Cygwin installation. S. U.