From mboxrd@z Thu Jan 1 00:00:00 1970
From: josh buhl
Newsgroups: gmane.emacs.bugs
Subject: gtk2, iso14755, pasting non-ascii characters, and the x-windows clipboard
Date: Tue, 16 Dec 2003 12:54:22 +0100
Message-ID: <3FDEF26E.3060103@uni-bonn.de>
Mime-Version: 1.0
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 8bit
User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.4; MultiZilla v1.4.0.4A) Gecko/20031107 Debian/1.5-3
Original-Newsgroups: gnu.emacs.bug
Original-To: bug-gnu-emacs@gnu.org
List-Id: Bug reports for GNU Emacs, the Swiss army knife of text editors
Xref: main.gmane.org gmane.emacs.bugs:6290

In GNU Emacs 21.3.1 (i386-pc-linux-gnu, X toolkit)
 of 2003-10-31 on raven, modified by Debian
configured using `configure i386-linux --prefix=/usr
 --sharedstatedir=/var/lib --libexecdir=/usr/lib --localstatedir=/var/lib
 --infodir=/usr/share/info --mandir=/usr/share/man --with-pop=yes
 --without-gif --with-x=yes --with-x-toolkit=athena
 --without-toolkit-scroll-bars'

Important settings:
  value of $LC_ALL: nil
  value of $LC_COLLATE: nil
  value of $LC_CTYPE: nil
  value of $LC_MESSAGES: nil
  value of $LC_MONETARY: nil
  value of $LC_NUMERIC: nil
  value of $LC_TIME: nil
  value of $LANG: nil
  locale-coding-system: nil
  default-enable-multibyte-characters: t

I'm running Emacs 21.3.1 on Debian testing with the GNOME 2.4 desktop.

Emacs has a problem pasting text that contains non-ASCII characters from any application compiled with gtk2 (marking the text with the mouse and inserting it with a mouse-2 click).

Here's an example. I mark this text on a German web page displayed in Mozilla 1.5 compiled with gtk2:

  "Soße wird in einer extra Soßenschüssel..."

I paste it into my Emacs buffer and get this:

  "So\x{00DF}e wird in einer extra So\x{00DF}ensch\x{00FC}ssel..."

Emacs inserts the text correctly when it has been marked in KWord, Kate, xedit, OpenOffice.org Writer, or any other non-gtk2 app, and garbles it as shown above if the same text has been marked in Mozilla, gedit, or *any gtk+ 2* dialog, such as any of the GNOME 2.4 dialogs.

So I can mark a text in Mozilla, paste it into xedit, _re-mark_ it there and paste it into Emacs, and it works. If I don't re-mark it, Emacs garbles it.

If I mark the text in Emacs, I can paste it correctly into any non-gtk2 app, but if I try to paste it into a gtk2 app, *nothing* gets pasted in.

However, the gtk2 apps and the non-gtk2 apps other than Emacs all seem to be able to paste this text from each other properly. Only Emacs has this problem.

This behaviour is independent of what I've set LC_ALL to before starting Emacs, but if I log out and log in again with the default session language set to German, all the pasting functions work properly.

I'm sure this is related to the following: ISO 14755 specifies using Ctrl+Shift+hex-digit to input Unicode, and gtk2 implements the ISO 14755 input method. The garbled text corresponds exactly to the Unicode hex encodings of the characters. For example, the Unicode hex encoding of ß is 00DF, and Emacs displays the pasted ß as \x{00DF}. This certainly isn't a coincidence.
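
To make that correspondence concrete, here is a minimal Python sketch that turns the \x{....} escapes from the example above back into the characters they name. It only illustrates that the hex digits are Unicode code points; it is not what Emacs or gtk2 does internally, and the helper name decode_hex_escapes is made up for this illustration.

```python
import re

# Matches escapes of the form \x{HHHH}, as seen in the garbled paste.
HEX_ESCAPE = re.compile(r"\\x\{([0-9A-Fa-f]+)\}")


def decode_hex_escapes(text: str) -> str:
    """Replace each \\x{HHHH} escape with the character at that code point.

    Hypothetical helper for illustration only.
    """
    return HEX_ESCAPE.sub(lambda m: chr(int(m.group(1), 16)), text)


# The garbled text exactly as it appeared in the Emacs buffer.
garbled = r"So\x{00DF}e wird in einer extra So\x{00DF}ensch\x{00FC}ssel..."

print(decode_hex_escapes(garbled))
```

Running it on the garbled string gives back the text that was originally marked in Mozilla, which supports the reading that the escapes are plain code points (00DF for ß, 00FC for ü) rather than random corruption.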