From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Richard Stallman Newsgroups: gmane.emacs.devel Subject: Re: [yazicivo@ttnet.net.tr: Locale Dependent Downcasing in smtpmail] Date: Sat, 07 Apr 2007 13:31:31 -0400 Message-ID: References: <87wt0uhee5.fsf@ttnet.net.tr> <873b3hssyt.fsf@mocca.josefsson.org> <87y7l9608n.fsf@ttnet.net.tr> <87ircdh13m.fsf@stupidchicken.com> <46174883.4090107@gmx.at> Reply-To: rms@gnu.org NNTP-Posting-Host: lo.gmane.org Content-Type: text/plain; charset=ISO-8859-15 X-Trace: sea.gmane.org 1175968423 9101 80.91.229.12 (7 Apr 2007 17:53:43 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Sat, 7 Apr 2007 17:53:43 +0000 (UTC) Cc: simon@josefsson.org, handa@m17n.org, schwab@suse.de, cyd@stupidchicken.com, emacs-devel@gnu.org, yazicivo@ttnet.net.tr, eliz@gnu.org, kevin@atkinson.dhs.org To: martin rudalics Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Sat Apr 07 19:53:15 2007 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1HaEr2-0007RS-Rw for ged-emacs-devel@m.gmane.org; Sat, 07 Apr 2007 19:37:37 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1HaEue-0000Id-Pm for ged-emacs-devel@m.gmane.org; Sat, 07 Apr 2007 13:41:20 -0400 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1HaEr5-0000q5-Tf for emacs-devel@gnu.org; Sat, 07 Apr 2007 13:37:39 -0400 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1HaEr5-0000or-6e for emacs-devel@gnu.org; Sat, 07 Apr 2007 13:37:39 -0400 Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1HaEr5-0000oF-0H for emacs-devel@gnu.org; Sat, 07 Apr 2007 13:37:39 -0400 Original-Received: from fencepost.gnu.org ([199.232.76.164]) by monty-python.gnu.org with esmtp (Exim 4.60) (envelope-from ) id 1HaEnS-0001lJ-4z for emacs-devel@gnu.org; Sat, 07 Apr 2007 13:33:54 -0400 Original-Received: from rms by fencepost.gnu.org with local (Exim 4.60) (envelope-from ) id 1HaEl9-0007TD-W6; Sat, 07 Apr 2007 13:31:32 -0400 In-reply-to: <46174883.4090107@gmx.at> (message from martin rudalics on Sat, 07 Apr 2007 09:30:11 +0200) X-detected-kernel: Linux 2.6, seldom 2.4 (older, 4) X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:69179 Archived-At: > After the release, it would be good to design a new case conversion > system which can handle the cases where one letter converts to more > than one. It would be nice if it could even handle German. > This could be done thru the spell checker. Compare the following excerpt from the Aspell manual (appendix C.4): The German Sharp S or Eszett does not have an uppercase equivalent. Instead when `ß' is converted to `SS'. The conversion of `ß' to `SS' requires a special rule, and increases the length of a word, thus disallowing inplace case conversion. Furthermore, my general rule of converting all words to lowercase before looking them up in the dictionary won't work because the conversion of `SS' to lowercase is ambiguous; it can be `ss' or `ß'. I do plan on dealing with this eventually. That is not a problem for the method I have in mind. Emacs can generate all the possible downcasings of a word containing SS, then send each one to Aspell to see if it is the right one. Aspell can handle lower-case words, so this will work. Meanwhile, this suggests a way that Aspell could handle the upper case German words: generate the various possible downcasings of it. (If there are N occurrences of SS, there will be 2**N possible downcasings.) Then see if any of them is in the dictionary. If so, the upper case word is valid. Otherwise, construct the union of the suggestion-lists from the various possible downcasings. I cc'd the Aspell maintainer so that he will see this idea.