From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.bugs Subject: bug#7781: 23.2.91; ispell problem with hunspell and UTF-8 file Date: Fri, 28 Aug 2020 15:36:01 +0300 Message-ID: <83k0xjymku.fsf@gnu.org> References: <87sjx9fula.fsf@sc3d.org> Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="24414"; mail-complaints-to="usenet@ciao.gmane.io" Cc: 7781@debbugs.gnu.org, rrt@sc3d.org To: Stefan Kangas Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Fri Aug 28 14:37:15 2020 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1kBdd5-0006Fx-Cp for geb-bug-gnu-emacs@m.gmane-mx.org; Fri, 28 Aug 2020 14:37:15 +0200 Original-Received: from localhost ([::1]:38166 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1kBdd4-0004Ps-Fg for geb-bug-gnu-emacs@m.gmane-mx.org; Fri, 28 Aug 2020 08:37:14 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:37360) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1kBdcs-0004Pc-Pj for bug-gnu-emacs@gnu.org; Fri, 28 Aug 2020 08:37:02 -0400 Original-Received: from debbugs.gnu.org ([209.51.188.43]:33900) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1kBdcs-0004ub-Gm for bug-gnu-emacs@gnu.org; Fri, 28 Aug 2020 08:37:02 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1kBdcs-0004A1-Ef for bug-gnu-emacs@gnu.org; Fri, 28 Aug 2020 08:37:02 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Eli Zaretskii Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Fri, 28 Aug 2020 12:37:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 7781 X-GNU-PR-Package: emacs Original-Received: via spool by 7781-submit@debbugs.gnu.org id=B7781.159861817915934 (code B ref 7781); Fri, 28 Aug 2020 12:37:02 +0000 Original-Received: (at 7781) by debbugs.gnu.org; 28 Aug 2020 12:36:19 +0000 Original-Received: from localhost ([127.0.0.1]:45445 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kBdcA-00048w-Vz for submit@debbugs.gnu.org; Fri, 28 Aug 2020 08:36:19 -0400 Original-Received: from eggs.gnu.org ([209.51.188.92]:55136) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kBdcA-00048l-0b for 7781@debbugs.gnu.org; Fri, 28 Aug 2020 08:36:18 -0400 Original-Received: from fencepost.gnu.org ([2001:470:142:3::e]:56851) by eggs.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1kBdc4-0004ow-IR; Fri, 28 Aug 2020 08:36:12 -0400 Original-Received: from [176.228.60.248] (port=2536 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1kBdc2-0002Bu-Ml; Fri, 28 Aug 2020 08:36:12 -0400 In-Reply-To: (message from Stefan Kangas on Fri, 28 Aug 2020 05:00:11 -0700) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.io gmane.emacs.bugs:186600 Archived-At: > From: Stefan Kangas > Date: Fri, 28 Aug 2020 05:00:11 -0700 > Cc: 7781@debbugs.gnu.org > > Reuben Thomas writes: > > > With the following text, and using emacs -Q, I get the errors you can > > see in the messages log below when using hunspell to spell-check a UTF-8 > > buffer with some extended characters in it. > > > > I did test this with emacs -Q, but the current session, in which I > > reproduced the problem and am now composing this bug report, was not > > started with -Q (this is so submitting the bug report works properly!). > > > > I am running a freshly bzr-pulled build of the emacs-23 branch. > > > > Text follows > > I tried this but couldn't reproduce the bug using current master and > Hunspell 1.7.0. Having read the bug report, IIUC, this was a bug in > Hunspell and not in Emacs? > > Are you still able to reproduce this using a recent Emacs and Hunspell? Some (old) versions of Hunspell had a bug, whereby the mis-spelled words were reported with offsets in bytes, not in characters. When this happens, ispell.el reports "misalignment" errors. I don't remember when (or even if) Hunspell fixed that problem (in the version I use I fixed it myself), but if 1.7.0 has that problem fixed, you will not see the problem.