From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: Kenichi Handa Newsgroups: gmane.emacs.devel Subject: Re: command fill-paragraph deletes leading Umlauts if line begins with space Date: Thu, 6 Jan 2005 16:41:53 +0900 (JST) Message-ID: <200501060741.QAA16427@etlken.m17n.org> References: <200412280014.JAA21431@etlken.m17n.org> <200412290117.KAA24724@etlken.m17n.org> NNTP-Posting-Host: deer.gmane.org Mime-Version: 1.0 (generated by SEMI 1.14.3 - "Ushinoya") Content-Type: text/plain; charset=US-ASCII X-Trace: sea.gmane.org 1104997398 2097 80.91.229.6 (6 Jan 2005 07:43:18 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Thu, 6 Jan 2005 07:43:18 +0000 (UTC) Cc: angeli@iwi.uni-sb.de, emacs-devel@gnu.org Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Thu Jan 06 08:43:05 2005 Return-path: Original-Received: from lists.gnu.org ([199.232.76.165]) by deer.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 1CmSIT-0005Mo-00 for ; Thu, 06 Jan 2005 08:43:05 +0100 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.33) id 1CmSTl-0002z8-Ah for ged-emacs-devel@m.gmane.org; Thu, 06 Jan 2005 02:54:45 -0500 Original-Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.33) id 1CmSTe-0002yo-0Y for emacs-devel@gnu.org; Thu, 06 Jan 2005 02:54:38 -0500 Original-Received: from exim by lists.gnu.org with spam-scanned (Exim 4.33) id 1CmSTd-0002yV-Bb for emacs-devel@gnu.org; Thu, 06 Jan 2005 02:54:37 -0500 Original-Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.33) id 1CmSTd-0002yL-8Z for emacs-devel@gnu.org; Thu, 06 Jan 2005 02:54:37 -0500 Original-Received: from [192.47.44.130] (helo=tsukuba.m17n.org) by monty-python.gnu.org with esmtp (TLSv1:DES-CBC3-SHA:168) (Exim 4.34) id 1CmSHQ-00060i-9B; Thu, 06 Jan 2005 02:42:01 -0500 Original-Received: from fs.m17n.org (fs.m17n.org [192.47.44.2]) by tsukuba.m17n.org (8.12.3/8.12.3/Debian-7.1) with ESMTP id j067fsY7014987; Thu, 6 Jan 2005 16:41:54 +0900 Original-Received: from etlken.m17n.org (etlken.m17n.org [192.47.44.125]) by fs.m17n.org (8.11.6p2/8.11.6) with ESMTP id j067fr900548; Thu, 6 Jan 2005 16:41:54 +0900 (JST) Original-Received: (from handa@localhost) by etlken.m17n.org (8.8.8+Sun/3.7W-2001040620) id QAA16427; Thu, 6 Jan 2005 16:41:53 +0900 (JST) Original-To: rms@gnu.org In-reply-to: (message from Richard Stallman on Wed, 29 Dec 2004 15:46:53 -0500) User-Agent: SEMI/1.14.3 (Ushinoya) FLIM/1.14.2 (Yagi-Nishiguchi) APEL/10.2 Emacs/21.3.50 (sparc-sun-solaris2.6) MULE/5.0 (SAKAKI) X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: main.gmane.org gmane.emacs.devel:31938 X-Report-Spam: http://spam.gmane.org/gmane.emacs.devel:31938 In article , Richard Stallman writes: > But the current Emacs keeps them in syntax table and updates > them when a language environment is changed in unibyte-mode. > I've just confirmed that 0334 (U-umlaut in Latin-1) has > syntax word-constituent in unibyte-mode in Latin-1 > lang. env. > Maybe he didn't set the language environment. > What is the situation in the CVS Emacs if you never set the > language environment? If LANG is not set or is "C", Emacs starts in English lang. env., and in that case, all 8-bit characters has whitespace syntax. In this situation, 0334 is displayed as \334 (not as U-umlaut). So, I think it shouldn't have wordconstituent syntax. > Conversion to multibyte uses Latin-1 by default. Yes. But that conversion is mainly for a user using multibyte mode. In unibyte mode (i.e. default-enable-multibyte-characters is nil), to-multibyte conversion won't happen usually. >>> making the case-conversion commands convert each character to >>> multibyte and check its syntax. > Why does case-conversion have to check syntax? > M-c detects word boundaries with syntax checking. Ah, I see. By the way, in unibyte English lang. env., case-table is also reset to the default, i.e., not set for latin-1. --- Ken'ichi HANDA handa@m17n.org