From mboxrd@z Thu Jan 1 00:00:00 1970
From: "Paul W. Rankin"
Newsgroups: gmane.emacs.bugs
Subject: bug#39483: 27.0.60; ispell ignores syntax/category tables word boundaries
Date: Sat, 08 Feb 2020 01:44:52 +1000
User-Agent: mu4e 1.2.0; emacs 27.0.60
To: 39483@debbugs.gnu.org

Hello,

It appears that the function `ispell-get-word' makes its own judgements
on word boundaries, ignoring the buffer's syntax tables and character
categories.

This becomes a problem when using `electric-quote-mode' together with
ispell, because contractions are parsed as separate words. E.g., calling
`ispell-word' on "doesn’t" returns:

    T is correct

To reproduce:

1. emacs -Q
2. (in *scratch*) M-x text-mode RET
3. enter text "doesn’t" (i.e. "doesn" C-x 8 ] "t")
4. M-: (modify-syntax-entry ?’ "w")
5. M-: (modify-category-entry ?’ ?^)
6. M-$ | ispell-word

Expected results:

Given the above syntax and category tables, M-f | forward-word and
M-b | backward-word now consider "doesn’t" a single word, so the whole
word should be passed to the `ispell-program-name' and produce the same
result as when checked on the command line:

    % echo "doesn’t" | aspell -a
    @(#) International Ispell Version 3.1.20 (but really Aspell 0.60.8)
    *

    % echo "doesn’t" | enchant-2 -a
    @(#) International Ispell Version 3.1.20 (but really Enchant 2.2.7)
    *

Actual results:

The word "doesn’t" is parsed as "t":

    T is correct

Attempts at workarounds:

I've tried altering slot 3 of the corresponding
`ispell-dictionary-base-alist' entries from "[']" to "['’]" to no
avail.

Setup:

GNU Emacs 27.0.60 (build 2, x86_64-apple-darwin19.3.0, NS
appkit-1894.30 Version 10.15.3 (Build 19D76)) of 2020-02-05

-- 
Paul W. Rankin
https://www.paulwrankin.com
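
The syntax and category steps of the recipe (2 to 5) can also be run non-interactively. The following is only a minimal sketch of that part: the names `my/apostrophe' and `my/single-word-p' are hypothetical, and the tables are copied first (an addition not in the recipe) so that the shared text-mode syntax table and the standard category table are left untouched. It checks word motion only; the `ispell-word' step still has to be run by hand, since it needs an installed spell checker.

```elisp
;; Hypothetical helper names; not part of Emacs or ispell.el.
(defconst my/apostrophe ?\u2019
  "RIGHT SINGLE QUOTATION MARK, the character typed with C-x 8 ].")

(defun my/single-word-p ()
  "Return non-nil if `forward-word' crosses the contraction in one step."
  (with-temp-buffer
    (text-mode)                                  ; step 2
    ;; Assumption: work on private copies so the change stays local
    ;; to this temporary buffer.
    (set-syntax-table (copy-syntax-table (syntax-table)))
    (set-category-table (copy-category-table (category-table)))
    (insert "doesn" my/apostrophe "t")           ; step 3
    (modify-syntax-entry my/apostrophe "w")      ; step 4
    (modify-category-entry my/apostrophe ?^)     ; step 5
    (goto-char (point-min))
    (forward-word 1)
    ;; If the whole contraction is one word, point is now at the end.
    (eobp)))
```

Evaluating (my/single-word-p) with M-: should return non-nil when the motion commands honour the modified tables, which is the behaviour the report expects `ispell-get-word' to match.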