From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Agustin Martin Newsgroups: gmane.emacs.bugs Subject: bug#13639: [emacs] ispell.el: hunspell dicts autodetection under Emacs. Date: Thu, 28 Feb 2013 20:23:45 +0100 Message-ID: <20130228192345.GB9113@agmartin.aq.upm.es> References: <20130116122509.GA2209@omega.in.herr-schmitt.de> <20130220175045.GA20958@agmartin.aq.upm.es> <83mwuyolnq.fsf@gnu.org> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: ger.gmane.org 1362079449 9772 80.91.229.3 (28 Feb 2013 19:24:09 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Thu, 28 Feb 2013 19:24:09 +0000 (UTC) To: 13639@debbugs.gnu.org Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Thu Feb 28 20:24:32 2013 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1UB95j-0006tj-VV for geb-bug-gnu-emacs@m.gmane.org; Thu, 28 Feb 2013 20:24:32 +0100 Original-Received: from localhost ([::1]:59186 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1UB95O-00044U-UQ for geb-bug-gnu-emacs@m.gmane.org; Thu, 28 Feb 2013 14:24:10 -0500 Original-Received: from eggs.gnu.org ([208.118.235.92]:53809) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1UB95K-00041n-JJ for bug-gnu-emacs@gnu.org; Thu, 28 Feb 2013 14:24:07 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1UB95J-0005UC-DF for bug-gnu-emacs@gnu.org; Thu, 28 Feb 2013 14:24:06 -0500 Original-Received: from debbugs.gnu.org ([140.186.70.43]:51132) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1UB95J-0005U3-4v for bug-gnu-emacs@gnu.org; Thu, 28 Feb 2013 14:24:05 -0500 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.72) (envelope-from ) id 1UB97C-0006mN-7u for bug-gnu-emacs@gnu.org; Thu, 28 Feb 2013 14:26:02 -0500 X-Loop: help-debbugs@gnu.org Resent-From: Agustin Martin Original-Sender: debbugs-submit-bounces@debbugs.gnu.org Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Thu, 28 Feb 2013 19:26:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 13639 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: pending Original-Received: via spool by 13639-submit@debbugs.gnu.org id=B13639.136207954826036 (code B ref 13639); Thu, 28 Feb 2013 19:26:02 +0000 Original-Received: (at 13639) by debbugs.gnu.org; 28 Feb 2013 19:25:48 +0000 Original-Received: from localhost ([127.0.0.1]:56596 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1UB96x-0006ls-M0 for submit@debbugs.gnu.org; Thu, 28 Feb 2013 14:25:48 -0500 Original-Received: from fibonacci.ccupm.upm.es ([138.100.198.70]:43662 helo=smtp.upm.es) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1UB96u-0006lh-GZ for 13639@debbugs.gnu.org; Thu, 28 Feb 2013 14:25:45 -0500 Original-Received: from agmartin.aq.upm.es (Agmartin.aq.upm.es [138.100.41.131]) by smtp.upm.es (8.14.3/8.14.3/fibonacci-001) with ESMTP id r1SJNj6I006522; Thu, 28 Feb 2013 20:23:45 +0100 Original-Received: by agmartin.aq.upm.es (Postfix, from userid 1000) id 70865935; Thu, 28 Feb 2013 20:23:45 +0100 (CET) Content-Disposition: inline In-Reply-To: <83mwuyolnq.fsf@gnu.org> User-Agent: Mutt/1.5.21 (2010-09-15) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-Received-From: 140.186.70.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.bugs:71962 Archived-At: On Wed, Feb 20, 2013 at 09:00:41PM +0200, Eli Zaretskii wrote: > I didn't just read the man page, I also looked into several *.aff > files that install with Hunspell dictionaries. It is clear to me that > WORDCHARS is at least unreliable, even if your interpretation is > correct (of which I'm still unconvinced): some *.aff files don't have > that entry at all (e.g., en_GB.aff, whose OTHERCHARS should include > the ' character, and also ru_RU.aff); others, like he_IL.aff, have > that entry mention all the CASECHARS, in addition to OTHERCHARS. I > wouldn't bet my money on what that entry gives us. IMHO those dictionaries are buggy (this may include some of the dicts I package for Debian, have to look). As an example, I tried Debian en_AU, not having WORDCHARS ' $ echo "ber's" | hunspell -a -d /usr/share/hunspell/en_AU @(#) International Ispell Version 3.2.06 (but really Hunspell 1.3.2) & ber 15 0: bee, bet, be, beer, bier, bear, berg, berm, bar, bed, bur, beg, per, her, be r * while if I add the WORDCHARS ' entry I get, as expected $ echo "ber's" | hunspell -a -d ./en_AU @(#) International Ispell Version 3.2.06 (but really Hunspell 1.3.2) & ber's 15 0: bee's, bet's, beer's, bier's, berg's, berm's, bar's, bed's, bur's, bergs, berms, mer's, Ser's, Berber's, Berger's with ' properly handled. > > The good news are that I started working on hunspell dicts autodetection. > > Good news, indeed! Thanks! Just commited a first cut for hunspell dicts autodetection. I have tested it only in my GNU/Debian box and seems to work well, so time is come for real life check to notice how many things went unnoticed. -- Agustin