From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Stefan Monnier Newsgroups: gmane.emacs.bugs Subject: bug#7962: 23.2; capitalize letters ISO-8859-1 with diacritic signs in emacs 23.2.1 Date: Fri, 04 Feb 2011 16:34:44 -0500 Message-ID: References: <4D493388.2070801@ens2m.fr> <4D4AAEE7.9040509@ens2m.fr> <4D4AF8B2.6090108@ens2m.fr> <83oc6sbyum.fsf@gnu.org> <87hbckn4ph.fsf@turtle.gmx.de> <834o8jc0k7.fsf@gnu.org> <4D4C32A3.3030906@ens2m.fr> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain X-Trace: dough.gmane.org 1296855365 6157 80.91.229.12 (4 Feb 2011 21:36:05 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Fri, 4 Feb 2011 21:36:05 +0000 (UTC) Cc: schwab@linux-m68k.org, 7962@debbugs.gnu.org, svenjoac@gmx.de To: Emmanuel Bigler Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Fri Feb 04 22:36:00 2011 Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1PlTJt-0003ky-Tv for geb-bug-gnu-emacs@m.gmane.org; Fri, 04 Feb 2011 22:35:58 +0100 Original-Received: from localhost ([127.0.0.1]:45797 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1PlTJt-00088V-Gu for geb-bug-gnu-emacs@m.gmane.org; Fri, 04 Feb 2011 16:35:57 -0500 Original-Received: from [140.186.70.92] (port=48907 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1PlTJp-00088Q-12 for bug-gnu-emacs@gnu.org; Fri, 04 Feb 2011 16:35:53 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1PlTJn-0002pq-RM for bug-gnu-emacs@gnu.org; Fri, 04 Feb 2011 16:35:52 -0500 Original-Received: from debbugs.gnu.org ([140.186.70.43]:34403) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1PlTJn-0002pm-Ob for bug-gnu-emacs@gnu.org; Fri, 04 Feb 2011 16:35:51 -0500 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.69) (envelope-from ) id 1PlTBH-0006oN-Ed; Fri, 04 Feb 2011 16:27:03 -0500 X-Loop: help-debbugs@gnu.org Resent-From: Stefan Monnier Original-Sender: debbugs-submit-bounces@debbugs.gnu.org Resent-To: owner@debbugs.gnu.org Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Fri, 04 Feb 2011 21:27:03 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 7962 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: Original-Received: via spool by 7962-submit@debbugs.gnu.org id=B7962.129685477626127 (code B ref 7962); Fri, 04 Feb 2011 21:27:03 +0000 Original-Received: (at 7962) by debbugs.gnu.org; 4 Feb 2011 21:26:16 +0000 Original-Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1PlTAV-0006nM-JA for submit@debbugs.gnu.org; Fri, 04 Feb 2011 16:26:15 -0500 Original-Received: from ironport2-out.teksavvy.com ([206.248.154.183] helo=ironport2-out.pppoe.ca) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1PlTAT-0006n7-OF for 7962@debbugs.gnu.org; Fri, 04 Feb 2011 16:26:14 -0500 X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: AvsEAJf/S01MCqAf/2dsb2JhbAClLHS5PYVaBIR6jx0 X-IronPort-AV: E=Sophos;i="4.60,428,1291611600"; d="scan'208";a="90621459" Original-Received: from 76-10-160-31.dsl.teksavvy.com (HELO ceviche.home) ([76.10.160.31]) by ironport2-out.pppoe.ca with ESMTP/TLS/ADH-AES256-SHA; 04 Feb 2011 16:34:45 -0500 Original-Received: by ceviche.home (Postfix, from userid 20848) id EFC40660DC; Fri, 4 Feb 2011 16:34:44 -0500 (EST) In-Reply-To: <4D4C32A3.3030906@ens2m.fr> (Emmanuel Bigler's message of "Fri, 04 Feb 2011 18:08:51 +0100") User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.0.50 (gnu/linux) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list Resent-Date: Fri, 04 Feb 2011 16:27:03 -0500 X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 3) X-Received-From: 140.186.70.43 X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.bugs:43949 Archived-At: > I think I'm starting to understand what is going on. I think you're still confused, tho. For some reason, you haven't replied to any of my email, even tho it's blatantly obvious that your "default enable-multibyte-characters" is the main culprit (and the "default" part is important here, it means that it comes from something you've explicitly changed in your Emacs config). > I had created a long time ago an unibyte file containing the 1-byte > characters I want to test within emacs. You mean an iso-8859-1 file, then. A unibyte file only contains bytes, no chars. > I started /usr/local/bin/emacs -Q mytestchars-224-255-iso-8859.txt > under emacs 23.2.93.1 (i686-pc-linux-gnu) > The file displays perfectly correctly. (describe-char (point)) gives me > exactly what I want, i.e. an extended asci decimal code between 224 and 255. The code is not very helpful here, since depending on whether the current buffer is unibyte or multibyte, the 224 or 255 doesn't mean the same thing. So the second line "preferred charset:" is more important, since it should either say "eight-bit" (i.e. a raw byte with no associated meaning of it representing some kind of character) or "iso-8859-1". > Almost all operations (except capitalize, see below) work exactly as I wish So that leads me to think the buffer is in unibyte mode. If you started with "emacs -Q", the only explanation is that you have EMACS_UNIBYTE set in your environment variables. If that's the case, then please get rid of it. > At the beginning of this discussion, Sven explained that capitalize would > only work on 2-byte characters. unibyte-vs-multibyte is not the same as "1-byte char"-vs-"2-byte chars". It's an issue that's internal to Emacs and that's largely irrelevant to how Emacs stores chars (e.g. as an array of 32bit integers, or as a sequence of bytes, with escape sequences to represent more than 256 different values). > Which I tested of course, and of course it works, but I simply wish > I could continue to capitalize M-c unibyte words like in the good > old iso-8859 days !! Most likely you won't tell the difference: the multibyte mode works just as well for iso-8859 files. "multibyte-mode" means "we're manipulating chars", whereas unibyte mode means "we're manipulating bytes", where bytes are simply numbers between 0 and 255. Now you tell me: what does it mean to capitalize the number 224? Stefan