From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Yuri D'Elia Newsgroups: gmane.emacs.help Subject: Re: correcting word groups (general spelling question) Date: Wed, 8 Jul 2015 12:19:21 +0200 Message-ID: References: <87zj3gwxhb.fsf@nl106-137-147.student.uu.se> <20150701044655.63e6e432@JRWUBU2> <87egkra3x3.fsf@nl106-137-147.student.uu.se> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Trace: ger.gmane.org 1436350817 22440 80.91.229.3 (8 Jul 2015 10:20:17 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Wed, 8 Jul 2015 10:20:17 +0000 (UTC) To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Wed Jul 08 12:20:06 2015 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1ZCmSK-0003jv-Ez for geh-help-gnu-emacs@m.gmane.org; Wed, 08 Jul 2015 12:19:56 +0200 Original-Received: from localhost ([::1]:34207 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZCmSJ-0007tZ-OS for geh-help-gnu-emacs@m.gmane.org; Wed, 08 Jul 2015 06:19:55 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:39612) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZCmRz-0007ow-Az for help-gnu-emacs@gnu.org; Wed, 08 Jul 2015 06:19:40 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1ZCmRu-0001I4-6q for help-gnu-emacs@gnu.org; Wed, 08 Jul 2015 06:19:35 -0400 Original-Received: from plane.gmane.org ([80.91.229.3]:33430) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZCmRt-0001Hn-SY for help-gnu-emacs@gnu.org; Wed, 08 Jul 2015 06:19:30 -0400 Original-Received: from list by plane.gmane.org with local (Exim 4.69) (envelope-from ) id 1ZCmRr-0003T2-PD for help-gnu-emacs@gnu.org; Wed, 08 Jul 2015 12:19:27 +0200 Original-Received: from 193.106.183.18 ([193.106.183.18]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Wed, 08 Jul 2015 12:19:27 +0200 Original-Received: from wavexx by 193.106.183.18 with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Wed, 08 Jul 2015 12:19:27 +0200 X-Injected-Via-Gmane: http://gmane.org/ Original-Lines: 47 Original-X-Complaints-To: usenet@ger.gmane.org X-Gmane-NNTP-Posting-Host: 193.106.183.18 X-Enigmail-Draft-Status: N1110 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:38.0) Gecko/20100101 Icedove/38.0 In-Reply-To: <87egkra3x3.fsf@nl106-137-147.student.uu.se> X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-Received-From: 80.91.229.3 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:105516 Archived-At: On 07/02/2015 12:50 AM, Emanuel Berg wrote: >> I fear that adding this as grammar would indeed be >> overengineering. Also because I would be too lazy to >> add a grammar rule and lose all the benefits. >> This might be perfectly dumb (that is: select words >> -> add to dictionary). > > No, no, I'm sorry I said that. What other examples did > you come across, if it isn't a secret? Perhaps testing > will be more fun and natural if you provide us with > those as well. It's hard to make examples without a lot of context unfortunately. But one thing that came up while I was drafting a paper a couple of weeks ago was about the names of the various studies and consorzia. These are usually (very bad) acronyms, which are often spelled in full with bad capitalization on purpose. One example is the "SarniNIA study", which I would like to learn as a group, since Sarninia is also often used in the paper (the name of the isle itself): SardiNIA by itself would be ambiguous, but "SardiNIA study" instead would be always correct. Now this is a borderline example, since you have three capitalized letters that are easy to spot, but it's not always so easy. Technical papers often mix technical jargon and words in very specific contextes. Often they contain localized words, which are just 1-2 edits away from a regular one. Again, these are words I never put in the dictionary, because they're just to easy to mistype. I'd rather have them marked as spelling errors to inspect them. However, they can very often be made unique using one or two words of context around them, which would avoid the continuous hassle of seeing 5-6% of the document marked with a spelling error. They would also be unique enough to be consistent between documents, which would save me even further time. Using ispell-skip-region-alist is a "nice" hack. It just needs some polish to read/write to a simple text file, and maybe a support function to add the current region to it. I can do that myself ;). But I cannot stop thinking that I cannot be the only guy with this "problem", and the solution doesn't strike me as particularly complicated for the benefit that you have. I would have expected "word" or "libreoffice" to have something similar for the sake of the user, but it doesn't.