From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.bugs Subject: bug#19653: ispell misalignment with hunspell when Unicode apostrophe is used Date: Fri, 21 Oct 2016 17:52:09 +0300 Message-ID: <838tthsqjq.fsf@gnu.org> References: <8660om1en7.fsf@phe.ftfl.ca> <86wph2z405.fsf@phe.ftfl.ca> <83mvhyrwax.fsf@gnu.org> <86shrpzwky.fsf@phe.ftfl.ca> Reply-To: Eli Zaretskii NNTP-Posting-Host: blaine.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Trace: blaine.gmane.org 1477061761 23899 195.159.176.226 (21 Oct 2016 14:56:01 GMT) X-Complaints-To: usenet@blaine.gmane.org NNTP-Posting-Date: Fri, 21 Oct 2016 14:56:01 +0000 (UTC) Cc: 19653@debbugs.gnu.org To: Joseph Mingrone Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Fri Oct 21 16:55:57 2016 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by blaine.gmane.org with esmtp (Exim 4.84_2) (envelope-from ) id 1bxbES-0003d9-K3 for geb-bug-gnu-emacs@m.gmane.org; Fri, 21 Oct 2016 16:55:40 +0200 Original-Received: from localhost ([::1]:32882 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1bxbEU-0004kS-Vf for geb-bug-gnu-emacs@m.gmane.org; Fri, 21 Oct 2016 10:55:42 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:40915) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1bxbBx-0003C6-Lv for bug-gnu-emacs@gnu.org; Fri, 21 Oct 2016 10:53:06 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1bxbBu-000470-Ij for bug-gnu-emacs@gnu.org; Fri, 21 Oct 2016 10:53:05 -0400 Original-Received: from debbugs.gnu.org ([208.118.235.43]:57855) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1bxbBu-00046s-FR for bug-gnu-emacs@gnu.org; Fri, 21 Oct 2016 10:53:02 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1bxbBu-0005jH-84 for bug-gnu-emacs@gnu.org; Fri, 21 Oct 2016 10:53:02 -0400 X-Loop: help-debbugs@gnu.org Resent-From: Eli Zaretskii Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Fri, 21 Oct 2016 14:53:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 19653 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: moreinfo Original-Received: via spool by 19653-submit@debbugs.gnu.org id=B19653.147706154821967 (code B ref 19653); Fri, 21 Oct 2016 14:53:02 +0000 Original-Received: (at 19653) by debbugs.gnu.org; 21 Oct 2016 14:52:28 +0000 Original-Received: from localhost ([127.0.0.1]:45017 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1bxbBM-0005iE-9K for submit@debbugs.gnu.org; Fri, 21 Oct 2016 10:52:28 -0400 Original-Received: from eggs.gnu.org ([208.118.235.92]:49806) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1bxbBK-0005hv-Ma for 19653@debbugs.gnu.org; Fri, 21 Oct 2016 10:52:26 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1bxbBC-0003tD-Di for 19653@debbugs.gnu.org; Fri, 21 Oct 2016 10:52:21 -0400 Original-Received: from fencepost.gnu.org ([2001:4830:134:3::e]:51115) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1bxbBC-0003t9-AO; Fri, 21 Oct 2016 10:52:18 -0400 Original-Received: from 84.94.185.246.cable.012.net.il ([84.94.185.246]:3263 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_128_CBC_SHA1:128) (Exim 4.82) (envelope-from ) id 1bxbBB-0000Gz-Ie; Fri, 21 Oct 2016 10:52:17 -0400 In-reply-to: <86shrpzwky.fsf@phe.ftfl.ca> (message from Joseph Mingrone on Fri, 21 Oct 2016 09:59:57 -0300) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 208.118.235.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: "bug-gnu-emacs" Xref: news.gmane.org gmane.emacs.bugs:124776 Archived-At: > From: Joseph Mingrone > Cc: 19653@debbugs.gnu.org > Date: Fri, 21 Oct 2016 09:59:57 -0300 > > > @(#) International Ispell Version 3.2.06 (but really Hunspell 1.3.2) > > & alsdk 3 0: Alaska, elastic, Alston > > & sdfkjdsf 2 8: artefact's, postfix > > & sldksdfkjsfd 2 17: justification, staphylococcus > > > The second number after each misspelled word is the offset of that > > word's beginning, measured in characters, from the start of the line. > > Hunspell used to report this in bytes instead of characters; if it > > still does, you will have to patch it to fix that bug. AFAIR, the > > Hunspell issue tracker includes several patches for this bug. Or > > maybe the latest Hunspell 1.4.1 already fixes this, in which case > > please upgrade. > > It's still a problem with hunspell. > > % echo "é startingCharTwo" | hunspell -a -d en_CA -i UTF-8 > @(#) International Ispell Version 3.2.06 (but really Hunspell 1.3.3) > & é 15 0: e, s, i, a, n, r, t, o, l, c, d, u, g, m, p > & startingCharTwo 1 3: nonparticipating > > https://github.com/hunspell/hunspell/issues/418 Thanks for checking.