From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Rusi Newsgroups: gmane.emacs.help Subject: Re: Unicode bugs?? Date: Fri, 28 Mar 2014 06:45:36 -0700 (PDT) Message-ID: <74f2688e-0688-4e92-ae1f-c3d6ff3f3ba6@googlegroups.com> References: <819f4741-7a99-4c56-bdae-3b32ce40d55d@googlegroups.com> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Trace: ger.gmane.org 1396014617 12393 80.91.229.3 (28 Mar 2014 13:50:17 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Fri, 28 Mar 2014 13:50:17 +0000 (UTC) To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Fri Mar 28 14:50:25 2014 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1WTXAu-0007d6-Nf for geh-help-gnu-emacs@m.gmane.org; Fri, 28 Mar 2014 14:50:25 +0100 Original-Received: from localhost ([::1]:33720 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1WTXAu-0007mD-2J for geh-help-gnu-emacs@m.gmane.org; Fri, 28 Mar 2014 09:50:24 -0400 X-Received: by 10.66.145.105 with SMTP id st9mr3501649pab.23.1396014337102; Fri, 28 Mar 2014 06:45:37 -0700 (PDT) X-Received: by 10.182.176.99 with SMTP id ch3mr6927obc.38.1396014336979; Fri, 28 Mar 2014 06:45:36 -0700 (PDT) Original-Path: usenet.stanford.edu!ur14no15124979igb.0!news-out.google.com!gi6ni490igc.0!nntp.google.com!ur14no15124978igb.0!postnews.google.com!glegroupsg2000goo.googlegroups.com!not-for-mail Original-Newsgroups: gnu.emacs.help In-Reply-To: Complaints-To: groups-abuse@google.com Injection-Info: glegroupsg2000goo.googlegroups.com; posting-host=59.95.25.152; posting-account=mBpa7woAAAAGLEWUUKpmbxm-Quu5D8ui Original-NNTP-Posting-Host: 59.95.25.152 User-Agent: G2/1.0 Injection-Date: Fri, 28 Mar 2014 13:45:36 +0000 Original-Xref: usenet.stanford.edu gnu.emacs.help:204537 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:96807 Archived-At: On Friday, March 28, 2014 6:49:55 PM UTC+5:30, Kevin Rodgers wrote: > On 3/28/14 6:18 AM, Rusi wrote: > > There are some characters eg =E2=9F=A8 =E2=9F=A9 > > (27E8, 27E9) > > which when emacs handles makes it use (visually) many lines to store it= . > > IOW paste these into emacs on say line 2 > > I find line 1 and line 3 some some 3-4 lines apart > > Also most strikingly the cursor becomes 4 times its size when point is = put on these chars > Move point before each character and do `C-u C-x =3D', then note the font > used to display it. Ok Currently all these are misbehaving: =E2=9F=AE =E2=9F=AF =E2=9F=AA =E2=9F=AB =F0=9D=97=AE=F0=9D=97=AF=F0=9D=97= =B0=F0=9D=97=B1=F0=9D=97=B2=F0=9D=97=B3=F0=9D=97=B4=F0=9D=97=B5=F0=9D=97=B6= =F0=9D=97=B7=F0=9D=97=B8=F0=9D=97=B9=F0=9D=97=BA=F0=9D=97=BB=F0=9D=97=BC=F0= =9D=97=BD=F0=9D=97=BE=F0=9D=97=BF=F0=9D=98=80=F0=9D=98=81=F0=9D=98=82=F0=9D= =98=83=F0=9D=98=84=F0=9D=98=85=F0=9D=98=86=F0=9D=98=87 xft:-unknown-Latin Modern Math-normal-normal-normal-*-12-*-*-*-*-0-iso1064= 6-1 (#x84D) In general what is shown is like this (name etc is of course different for = each case) position: 574 of 687 (83%), column: 8 character: =F0=9D=97=AE (displayed as =F0=9D=97=AE) (codepoint = 120302, #o352756, #x1d5ee) preferred charset: unicode (Unicode (ISO10646)) code point in charset: 0x1D5EE syntax: w which means: word category: .:Base, L:Left-to-right (strong) to input: type "C-x 8 RET HEX-CODEPOINT" or "C-x 8 RET NAME" buffer code: #xF0 #x9D #x97 #xAE file code: #xF0 #x9D #x97 #xAE (encoded by coding system utf-8-= unix) display: by this font (glyph code) xft:-unknown-Latin Modern Math-normal-normal-normal-*-12-*-*-*-*-0-iso1= 0646-1 (#xBDE) Character code properties: customize what to show name: MATHEMATICAL SANS-SERIF BOLD SMALL A general-category: Ll (Letter, Lowercase) decomposition: (font 97) (font 'a')