From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Eli Zaretskii Newsgroups: gmane.emacs.bugs Subject: bug#13639: [emacs] ispell.el: hunspell dicts autodetection under Emacs. Date: Wed, 20 Feb 2013 21:00:41 +0200 Message-ID: <83mwuyolnq.fsf@gnu.org> References: <20130116122509.GA2209@omega.in.herr-schmitt.de> <20130220175045.GA20958@agmartin.aq.upm.es> Reply-To: Eli Zaretskii NNTP-Posting-Host: plane.gmane.org X-Trace: ger.gmane.org 1361386857 26293 80.91.229.3 (20 Feb 2013 19:00:57 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Wed, 20 Feb 2013 19:00:57 +0000 (UTC) Cc: 13639@debbugs.gnu.org To: Agustin Martin Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Wed Feb 20 20:01:19 2013 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1U8Euq-0003qq-8i for geb-bug-gnu-emacs@m.gmane.org; Wed, 20 Feb 2013 20:01:16 +0100 Original-Received: from localhost ([::1]:47828 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1U8EuV-0007UW-GF for geb-bug-gnu-emacs@m.gmane.org; Wed, 20 Feb 2013 14:00:55 -0500 Original-Received: from eggs.gnu.org ([208.118.235.92]:60999) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1U8EuR-0007UJ-LU for bug-gnu-emacs@gnu.org; Wed, 20 Feb 2013 14:00:54 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1U8EuQ-0005WZ-BS for bug-gnu-emacs@gnu.org; Wed, 20 Feb 2013 14:00:51 -0500 Original-Received: from debbugs.gnu.org ([140.186.70.43]:34990) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1U8EuQ-0005WT-7l for bug-gnu-emacs@gnu.org; Wed, 20 Feb 2013 14:00:50 -0500 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.72) (envelope-from ) id 1U8EvZ-0006p1-OH; Wed, 20 Feb 2013 14:02:01 -0500 X-Loop: help-debbugs@gnu.org Resent-From: Eli Zaretskii Original-Sender: debbugs-submit-bounces@debbugs.gnu.org Resent-CC: bug-gnu-emacs@gnu.org, Agustin Martin Resent-Date: Wed, 20 Feb 2013 19:02:01 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 13639 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: pending Original-Received: via spool by 13639-submit@debbugs.gnu.org id=B13639.136138690326199 (code B ref 13639); Wed, 20 Feb 2013 19:02:01 +0000 Original-Received: (at 13639) by debbugs.gnu.org; 20 Feb 2013 19:01:43 +0000 Original-Received: from localhost ([127.0.0.1]:40453 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1U8EvG-0006oW-UC for submit@debbugs.gnu.org; Wed, 20 Feb 2013 14:01:43 -0500 Original-Received: from mtaout22.012.net.il ([80.179.55.172]:51706) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1U8EvF-0006oO-1W for 13639@debbugs.gnu.org; Wed, 20 Feb 2013 14:01:42 -0500 Original-Received: from conversion-daemon.a-mtaout22.012.net.il by a-mtaout22.012.net.il (HyperSendmail v2007.08) id <0MIJ006008QY3U00@a-mtaout22.012.net.il> for 13639@debbugs.gnu.org; Wed, 20 Feb 2013 21:00:27 +0200 (IST) Original-Received: from HOME-C4E4A596F7 ([87.69.4.28]) by a-mtaout22.012.net.il (HyperSendmail v2007.08) with ESMTPA id <0MIJ005KB8SR7EH0@a-mtaout22.012.net.il>; Wed, 20 Feb 2013 21:00:27 +0200 (IST) In-reply-to: <20130220175045.GA20958@agmartin.aq.upm.es> X-012-Sender: halo1@inter.net.il X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-Received-From: 140.186.70.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.bugs:71564 Archived-At: > Date: Wed, 20 Feb 2013 18:50:45 +0100 > From: Agustin Martin > > > > > > Sorry, I should have written WORDCHARS. > > > > > > > > Why do we need that? > > > > > > This is what ispell.el calls otherchars. Parsing WORDCHARS ensures that > > > both > > > hunspell and ispell.el think about the same characters in that category. > > > > I think you are mistaken, that's not my reading of hunspell(4). > > Sorry for the late reply, > > (Opening a new thread specifically about hunspell dicts autodetection and > using new cloned bugreport #13639 specific about this) > > Although WORDCHARS description in hunspell(4) > > WORDCHARS characters > WORDCHARS extends tokenizer of Hunspell command line interface > with additional word character. For example, dot, dash, n-dash, numbers, > percent sign are word character in Hungarian. > > is too hungarian biassed and does not mention usual apostrophe AFAIK it > mostly refers to the same as 'otherchars', although hunspell may accept > that in locations not in the middle of a word. I didn't just read the man page, I also looked into several *.aff files that install with Hunspell dictionaries. It is clear to me that WORDCHARS is at least unreliable, even if your interpretation is correct (of which I'm still unconvinced): some *.aff files don't have that entry at all (e.g., en_GB.aff, whose OTHERCHARS should include the ' character, and also ru_RU.aff); others, like he_IL.aff, have that entry mention all the CASECHARS, in addition to OTHERCHARS. I wouldn't bet my money on what that entry gives us. > The good news are that I started working on hunspell dicts autodetection. Good news, indeed! Thanks!