From: Jason Rumney
Newsgroups: gmane.emacs.bugs
Subject: Re: Face problems when yanking cyrillic text from other X applications
Date: Mon, 05 Nov 2007 21:21:02 +0000
Message-ID: <472F893E.1080909@gnu.org>
In-Reply-To: <1194258557.962041.108510@50g2000hsm.googlegroups.com>
To: "cmr.Pent@gmail.com"
Cc: bug-gnu-emacs@gnu.org

cmr.Pent@gmail.com wrote:
> Yes, looks like it is an encoding problem:
>
>   character: а (54225, #o151721, #xd3d1, U+0430)
>     charset: japanese-jisx0208 (JISX0208.1983/1990 Japanese Kanji: ISO-IR-87.)
>  code point: #x27 #x51
>      syntax: w which means: word
>    category: Y:Cyrillic characters of 2-byte character sets j:Japanese
>              |:While filling, we can break a line at this character.
> buffer code: #x92 #xA7 #xD1
>   file code: #xD0 #xB0 (encoded by coding system mule-utf-8-unix)
>     display: by this font (glyph code)
>      -JIS-Fixed-Medium-R-Normal--16-150-75-75-C-160-JISX0208.1983-0 (#x2751)
>
> There are text properties here:
>   fontified t
>

Have a look at the variable x-select-request-type. I think setting it to
'UTF8_STRING should give better results. I thought this was the default,
but reading the doc string, it appears that the default is to use some
sort of heuristic.

I'm not sure why the heuristic prefers iso-2022-jp encoded Cyrillic over
UTF-8 encoding in a ru_RU.UTF-8 locale.
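
For reference, a minimal sketch of the setting suggested above, as it might
go in an init file such as ~/.emacs. The `boundp` guard is my own addition,
on the assumption that the variable may not be defined in builds without X
support; I have not checked how this interacts with other selection-related
settings.

```elisp
;; Ask other X applications for selection data as UTF8_STRING only,
;; instead of leaving the choice to Emacs's default heuristic.
;; The guard is an assumption: it skips the setting when the variable
;; is not defined (for example, in a build without X support).
(when (boundp 'x-select-request-type)
  (setq x-select-request-type 'UTF8_STRING))
```

To try it in a running session first, evaluate the `setq` form with
M-: and then yank the Cyrillic text again; C-u C-x = on a yanked character
shows which charset it ended up in.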