From: Kenichi Handa
To: Stefan Monnier
Cc: k.stevens@ieee.org, 130397@bugs.debian.org, agustin.martin@hispalinux.es, lionel@mamane.lu, emacs-devel@gnu.org, kstevens@ichips.intel.com, ispell-bugs@itcorp.com
Newsgroups: gmane.emacs.devel
Subject: Re: Bug 130397
Date: Fri, 7 Jan 2005 09:39:36 +0900 (JST)
Message-ID: <200501070039.JAA18584@etlken.m17n.org>
References: <28878.1105029010@ichips.intel.com>
In-reply-to: (message from Stefan Monnier on Thu, 06 Jan 2005 12:33:11 -0500)

Stefan Monnier writes:
> Now encoding the whole text can't be realistically done, so we need to first
> recognize words, then encode them, then use those vars.
> I.e. the word-recognition code shouldn't use CASECHARS, NOT-CASECHARS,
> OTHERCHARS, MANY-OTHERCHARS-P, EXTENDED-CHARACTER-MODE, and CHARACTER-SET.

It seems that this doesn't work. The documentation of
ispell-dictionary-alist says:

OTHERCHARS is a regexp of characters in the NOT-CASECHARS set but which can be
used to construct words in some special way.  If OTHERCHARS characters follow
and precede characters from CASECHARS, they are parsed as part of a word,
otherwise they become word-breaks.  As an example in English, assume the
regular expression "[']" for OTHERCHARS.
Then "they're" and "Steven's" are parsed as single words including the "'"
character, but "Stevens'" does not include the quote character as part of
the word.
If you want OTHERCHARS to be empty, use the empty string.
Hint: regexp syntax requires the hyphen to be declared first here.

MANY-OTHERCHARS-P is non-nil when multiple OTHERCHARS are allowed in a word.
Otherwise only a single OTHERCHARS character is allowed to be part of any
single word.

---
Ken'ichi HANDA
handa@m17n.org
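
The OTHERCHARS rule quoted from the documentation can be illustrated with a
small sketch. This is not code from ispell.el: it is a Python approximation,
the function name split_words and its parameters are hypothetical, and it
assumes CASECHARS is "[A-Za-z]" and OTHERCHARS is "[']" as in the English
example. Python and Emacs regexp syntax differ, so it only shows the idea
that word boundaries depend on both variables.

```python
import re


def split_words(text, casechars="[A-Za-z]", otherchars="[']",
                many_otherchars_p=False):
    """Split TEXT into words, approximating the documented OTHERCHARS rule.

    An OTHERCHARS character is kept only when it sits between two
    CASECHARS characters; otherwise it acts as a word break.
    All names here are illustrative, not part of ispell.el.
    """
    if not otherchars:
        # Empty OTHERCHARS: words consist of CASECHARS only.
        pattern = f"{casechars}+"
    else:
        # One OTHERCHARS character followed by more CASECHARS.
        joined = f"(?:{otherchars}{casechars}+)"
        # MANY-OTHERCHARS-P non-nil allows this to repeat within a word;
        # otherwise at most one OTHERCHARS character is accepted.
        repeat = "*" if many_otherchars_p else "?"
        pattern = f"{casechars}+{joined}{repeat}"
    return re.findall(pattern, text)


if __name__ == "__main__":
    print(split_words("they're Steven's Stevens'"))
    print(split_words("rock'n'roll"))
    print(split_words("rock'n'roll", many_otherchars_p=True))
```

In this sketch the trailing quote of "Stevens'" is dropped because no
CASECHARS character follows it, which matches the behaviour the
documentation describes. Recognizing the word requires OTHERCHARS and
MANY-OTHERCHARS-P, so word recognition cannot ignore those variables.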