From: Stefan Monnier
Newsgroups: gmane.emacs.devel,gmane.emacs.pretest.bugs
Subject: Re: Emacs puts binary junk into the clipboard, marking it as text
Date: Tue, 19 Sep 2006 12:15:58 -0400
To: Kenichi Handa
Cc: christopher.ian.moore@gmail.com, emacs-pretest-bug@gnu.org,
 ihs_4664@yahoo.com, richard.stallman@gnu.org, emacs-devel@gnu.org
References: <1158280855.14121.69.camel@chrislap.madeupdomain.com>
 <450A514E.6020205@swipnet.se> <450BE084.10905@swipnet.se>
 <450C3380.2050008@swipnet.se>
In-Reply-To: (Kenichi Handa's message of "Tue, 19 Sep 2006 20:14:03 +0900")

>> Also IIRC a perfectly valid utf-8 buffer may contain eight-bit-* chars, used
>> to keep track of valid unicode chars that have no corresponding character in
>> emacs-mule.  So the presence of eight-bit-* chars does not imply that the
>> utf-8 encoded form of the text will contain an invalid utf-8 byte sequence.

> Yes, but such eight-bit-* chars can be detected by checking
> `untranslated-utf-8' property.

Sure, but the current code doesn't do that.

>> > And, if Emacs owns a unibyte string, perhaps the right thing
>> > is to make it multibyte according to the current
>> > lang. env. (by string-make-multibyte) at first, then encode
>> > it by utf-8.

>> That sounds terribly fragile/buggy.

> Then, what do you think Emacs should do in such a case?

I think we can't know what should be done, so we should strive for
simplicity and try to avoid losing information.  I.e. just return the
unibyte string as-is.


        Stefan
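
The trade-off in the last paragraph can be sketched outside Emacs. The following is only an analogy in Python, not Emacs code: `raw` stands in for a unibyte string, the helper names are hypothetical, and the two charsets are arbitrary stand-ins for "whatever the current language environment happens to be". It shows that decoding by a guessed charset and then encoding as UTF-8 yields different bytes for different guesses, while passing the bytes through unchanged loses nothing.

```python
# Analogy only: "raw" plays the role of a unibyte string whose
# intended charset is unknown to the program that owns it.
raw = bytes([0xE9, 0x74, 0xE9])  # not a valid UTF-8 sequence


def reencode_via_guess(data: bytes, guessed_charset: str) -> bytes:
    """Decode with a guessed charset, then encode as UTF-8.

    Hypothetical helper mirroring the "make it multibyte according to
    the current language environment, then encode as utf-8" proposal.
    """
    return data.decode(guessed_charset).encode("utf-8")


def pass_through(data: bytes) -> bytes:
    """Return the bytes unchanged (the "as-is" proposal)."""
    return data


def is_valid_utf8(data: bytes) -> bool:
    """Report whether the bytes decode cleanly as UTF-8."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# Two plausible environments give two different "UTF-8" results.
latin1_result = reencode_via_guess(raw, "latin-1")
cyrillic_result = reencode_via_guess(raw, "iso-8859-5")
untouched = pass_through(raw)
```

Both re-encoded results are well-formed UTF-8, but they disagree with each other, and neither lets the receiver tell which guess was made. The pass-through result is not valid UTF-8, yet it is the only one from which the original bytes can still be read back.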