From: "Herbert Euler"
Newsgroups: gmane.emacs.devel
Subject: Re: Problem of auto-fill-mode for wide character
Date: Fri, 30 Dec 2005 10:43:17 +0800
To: handa@m17n.org
Cc: emacs-devel@gnu.org

>From: Kenichi Handa
>To: "Herbert Euler"
>CC: emacs-devel@gnu.org
>Subject: Re: Problem of auto-fill-mode for wide character
>Date: Wed, 28 Dec 2005 16:46:27 +0900
>
>Thank you for reminding of this unsolved problem. The
>reason why Chinese characters invoke auto-fill is that they
>are not yet registered in the char-table auto-fill-chars.
>And why I have not yet done in Unicode 2 branch is that I
>don't know any "authorized" information about that.
>
>I've just registered these apparent characters:
>  U+3041..U+30FF, U+3400..U+4DB5, U+4e00..U+9fbb, U+F900..U+FAFF,
>  U+FF00..U+FF9F, U+20000..U+2FFFF
>So, now auto-fill should work for most Han characters.
>
>But, there are many more questionable characters, for instance:
>  U+3000..U+303F, U+3200..U+33FF, ...

In my opinion, this solution is not a practical one. Registering
most Chinese, Japanese and Korean characters in auto-fill-chars
would waste a lot of memory, and some characters would probably be
forgotten. For example, in Japanese, Hiragana and Katakana would
probably be covered, but most Kanji might not be.
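
To illustrate the concern about forgotten characters, the ranges quoted
above can be modelled as a simple lookup. This is only a Python sketch
and not Emacs code: the real auto-fill-chars is an Emacs char-table, and
the names REGISTERED_RANGES and is_registered are made up for this
example.

```python
# Inclusive code point ranges copied from the quoted message.
REGISTERED_RANGES = [
    (0x3041, 0x30FF),
    (0x3400, 0x4DB5),
    (0x4E00, 0x9FBB),
    (0xF900, 0xFAFF),
    (0xFF00, 0xFF9F),
    (0x20000, 0x2FFFF),
]


def is_registered(ch: str) -> bool:
    """Return True if ch falls inside one of the quoted ranges."""
    cp = ord(ch)
    return any(lo <= cp <= hi for lo, hi in REGISTERED_RANGES)
```

With these ranges, is_registered("中") (U+4E2D) and is_registered("あ")
(U+3042) are true, while is_registered("、") (U+3001, inside the
"questionable" block U+3000..U+303F) and is_registered("한") (U+D55C, a
Hangul syllable) are false.
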
Besides, the policy for filling punctuation differs between English
and Chinese. In English, if a punctuation mark is the last character
of a line but exceeds the fill-column, it is usually moved to the next
line together with the word it follows. In Chinese, it is left where
it is, and the characters that follow it are moved to the next line.
I don't know whether this can be supported by registering characters
in auto-fill-chars.

>Do you have some idea about exactly which set of characters
>to register in auto-fill-chars?

I don't know the details of how Emacs distinguishes, for auto-fill,
between languages in which words are separated by blanks and those in
which they are not. But if the original design of auto-fill ignored
the latter, a possibly better solution would be to modify the
auto-fill mechanism so that it supports the different concept of
filling in such languages.

If the words in a language are not separated by blanks, then any
character except punctuation should be moved to the next line when it
exceeds the fill-column. Some punctuation marks should be left where
they are even if they exceed the fill-column. The others should be
handled like ordinary characters.

Regards,
Guanpeng Xu
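
The filling rule proposed in this message can be sketched roughly as
follows. This is a Python illustration under stated assumptions, not
how Emacs implements auto-fill: the set HANGING_PUNCTUATION is a
hypothetical choice (the message does not say which marks should stay
on the line), wide characters are assumed to occupy two columns, and
the names char_width and fill_no_blanks are invented for the example.

```python
import unicodedata

# Hypothetical set of marks that stay at the end of a line even when
# they cross the fill column; the message does not name such a set.
HANGING_PUNCTUATION = frozenset("，。、；：！？")


def char_width(ch: str) -> int:
    """Return 2 for wide or fullwidth characters, 1 otherwise."""
    return 2 if unicodedata.east_asian_width(ch) in ("W", "F") else 1


def fill_no_blanks(text: str, fill_column: int) -> list[str]:
    """Break text without word-separating blanks into lines."""
    lines: list[str] = []
    current = ""
    width = 0
    for ch in text:
        w = char_width(ch)
        if current and width + w > fill_column:
            if ch in HANGING_PUNCTUATION and width <= fill_column:
                # Leave the mark on this line, past the fill column.
                current += ch
                width += w
                continue
            # Any other character starts the next line.
            lines.append(current)
            current, width = "", 0
        current += ch
        width += w
    if current:
        lines.append(current)
    return lines
```

For example, fill_no_blanks("今天天气很好，我们去公园。", 12) keeps the
comma at the end of the first line although it crosses column 12, and
starts the second line with the character after it. At most one mark
is allowed to hang per line; a second consecutive mark is treated like
an ordinary character.
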