From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: Kenichi Handa Newsgroups: gmane.emacs.devel Subject: Re: unibyte<->multibyte conversion [Re: Emacs-diffs Digest, Vol 2, Issue 28] Date: Tue, 21 Jan 2003 09:20:49 +0900 (JST) Sender: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Message-ID: <200301210020.JAA17567@etlken.m17n.org> References: <3405-Sat18Jan2003154003+0200-eliz@is.elta.co.il> <200301200229.LAA16287@etlken.m17n.org> <6480-Mon20Jan2003214849+0200-eliz@is.elta.co.il> <200301202055.h0KKtun11691@rum.cs.yale.edu> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 (generated by SEMI 1.14.3 - "Ushinoya") Content-Type: text/plain; charset=US-ASCII X-Trace: main.gmane.org 1043108756 29715 80.91.224.249 (21 Jan 2003 00:25:56 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Tue, 21 Jan 2003 00:25:56 +0000 (UTC) Cc: emacs-devel@gnu.org Return-path: Original-Received: from quimby.gnus.org ([80.91.224.244]) by main.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 18amEj-0007iu-00 for ; Tue, 21 Jan 2003 01:25:53 +0100 Original-Received: from monty-python.gnu.org ([199.232.76.173]) by quimby.gnus.org with esmtp (Exim 3.12 #1 (Debian)) id 18amPq-0003JS-00 for ; Tue, 21 Jan 2003 01:37:22 +0100 Original-Received: from localhost ([127.0.0.1] helo=monty-python.gnu.org) by monty-python.gnu.org with esmtp (Exim 4.10.13) id 18amBh-0008P2-00 for emacs-devel@quimby.gnus.org; Mon, 20 Jan 2003 19:22:45 -0500 Original-Received: from list by monty-python.gnu.org with tmda-scanned (Exim 4.10.13) id 18amBF-0008JP-00 for emacs-devel@gnu.org; Mon, 20 Jan 2003 19:22:17 -0500 Original-Received: from mail by monty-python.gnu.org with spam-scanned (Exim 4.10.13) id 18amAy-00086A-00 for emacs-devel@gnu.org; Mon, 20 Jan 2003 19:22:01 -0500 Original-Received: from tsukuba.m17n.org ([192.47.44.130]) by monty-python.gnu.org with esmtp (Exim 4.10.13) id 18am9w-00072X-00; Mon, 20 Jan 2003 19:20:57 -0500 Original-Received: from fs.m17n.org (fs.m17n.org [192.47.44.2])h0L0Kok03137; Tue, 21 Jan 2003 09:20:50 +0900 (JST) (envelope-from handa@m17n.org) Original-Received: from etlken.m17n.org (etlken.m17n.org [192.47.44.125]) h0L0KoR07034; Tue, 21 Jan 2003 09:20:50 +0900 (JST) Original-Received: (from handa@localhost) by etlken.m17n.org (8.8.8+Sun/3.7W-2001040620) id JAA17567; Tue, 21 Jan 2003 09:20:49 +0900 (JST) Original-To: monnier+gnu/emacs@rum.cs.yale.edu In-reply-to: <200301202055.h0KKtun11691@rum.cs.yale.edu> (monnier+gnu/emacs@rum.cs.yale.edu) User-Agent: SEMI/1.14.3 (Ushinoya) FLIM/1.14.2 (Yagi-Nishiguchi) APEL/10.2 Emacs/21.2.92 (sparc-sun-solaris2.6) MULE/5.0 (SAKAKI) Original-cc: eliz@is.elta.co.il Original-cc: rms@gnu.org X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1b5 Precedence: list List-Id: Emacs development discussions. List-Help: List-Post: List-Subscribe: , List-Archive: List-Unsubscribe: , Errors-To: emacs-devel-bounces+emacs-devel=quimby.gnus.org@gnu.org Xref: main.gmane.org gmane.emacs.devel:10920 X-Report-Spam: http://spam.gmane.org/gmane.emacs.devel:10920 In article <200301202055.h0KKtun11691@rum.cs.yale.edu>, "Stefan Monnier" writes: > While we're at it, how about making string-as-multibyte obsolete ? > It's not used much and it has been abused many times in the past. It is useful for instance in this scenario. I don't remember a concrete example, but some package is doing this kind of thing. Read a file of weird encoding in a unibyte buffer. Parse the contents and do decode-coding-region one bunch by one with different coding systems. Extract some part from that unibyte buffer and insert it in a mulitbyte buffer. The last step is: (let ((str (buffer-substring FROM TO))) (save-excursion (set-buffer MULTIBYTE-BUF) (insert (string-as-multibyte str)))) > Also, I believe it's more or less equivalent to > (decode-coding-string str 'emacs-mule) Yes, for the moment. But, in emacs-unicode, we must change it to: (decode-coding-string str 'utf-8-emacs) On the other hand, we don't have to change a code using string-as-multibyte. --- Ken'ichi HANDA handa@m17n.org