From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: "Boylan, Ross" Newsgroups: gmane.emacs.help Subject: Unable to match octal character Date: Wed, 13 Apr 2016 20:22:08 +0000 Message-ID: NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable X-Trace: ger.gmane.org 1460578975 23487 80.91.229.3 (13 Apr 2016 20:22:55 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Wed, 13 Apr 2016 20:22:55 +0000 (UTC) To: "help-gnu-emacs@gnu.org" Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Wed Apr 13 22:22:41 2016 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1aqRJA-0005Ij-Ng for geh-help-gnu-emacs@m.gmane.org; Wed, 13 Apr 2016 22:22:40 +0200 Original-Received: from localhost ([::1]:51470 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1aqRJA-0005g2-1s for geh-help-gnu-emacs@m.gmane.org; Wed, 13 Apr 2016 16:22:40 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:58424) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1aqRIt-0005dx-Ae for help-gnu-emacs@gnu.org; Wed, 13 Apr 2016 16:22:24 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1aqRIp-0005D9-VO for help-gnu-emacs@gnu.org; Wed, 13 Apr 2016 16:22:23 -0400 Original-Received: from esa2.ucsf.iphmx.com ([68.232.143.34]:50565) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1aqRIo-00058d-Ug for help-gnu-emacs@gnu.org; Wed, 13 Apr 2016 16:22:19 -0400 Original-Received: from unknown (HELO bcuda2.ucsf.edu) ([64.54.157.33]) by esa2.ucsf.iphmx.com with ESMTP/TLS/AES256-SHA; 13 Apr 2016 13:22:10 -0700 X-ASG-Debug-ID: 1460578929-0a8b7b32b71011d0001-2yy5ZX Original-Received: from exht05.net.ucsf.edu (mx.ucsf.edu [64.54.247.193]) by bcuda2.ucsf.edu with ESMTP id HXgHXpwzUPN2Z7VM (version=TLSv1 cipher=ECDHE-RSA-AES256-SHA bits=256 verify=NO) for ; Wed, 13 Apr 2016 13:22:09 -0700 (PDT) X-Barracuda-Envelope-From: Ross.Boylan@ucsf.edu X-Barracuda-Effective-Source-IP: mx.ucsf.edu[64.54.247.193] X-Barracuda-Apparent-Source-IP: 64.54.247.193 Original-Received: from EX08.net.ucsf.edu ([64.54.247.161]) by exht05.net.ucsf.edu ([64.54.247.222]) with mapi id 14.03.0224.002; Wed, 13 Apr 2016 13:22:09 -0700 Thread-Topic: Unable to match octal character X-ASG-Orig-Subj: Unable to match octal character Thread-Index: AdGVv6ZDiSVBFBWPRpy7XefYrg+SHA== Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [66.181.128.6] X-Barracuda-Connect: mx.ucsf.edu[64.54.247.193] X-Barracuda-Start-Time: 1460578929 X-Barracuda-Encrypted: ECDHE-RSA-AES256-SHA X-Barracuda-URL: https://bcuda2.ucsf.edu:443/cgi-mod/mark.cgi X-Barracuda-Scan-Msg-Size: 1869 X-Virus-Scanned: by bsmtpd at ucsf.edu X-Barracuda-BRTS-Status: 1 X-Barracuda-Spam-Score: 0.00 X-Barracuda-Spam-Status: No, SCORE=0.00 using per-user scores of TAG_LEVEL=1000.0 QUARANTINE_LEVEL=1000.0 KILL_LEVEL=9.0 tests= X-Barracuda-Spam-Report: Code version 3.2, rules version 3.2.3.28707 Rule breakdown below pts rule name description ---- ---------------------- -------------------------------------------------- X-CFilter-Loop: Reflected X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-Received-From: 68.232.143.34 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.21 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Original-Sender: "help-gnu-emacs" Xref: news.gmane.org gmane.emacs.help:109754 Archived-At: I have a file with some characters that display in the emacs buffer as \203= . Yet when I search for that, using=0A= C-s C-q 203 =0A= I can't match it. Likewise if I use search and replace.=0A= =0A= I have verified that \203 is the display of a single character both by usin= g the arrow key (it's a single step to move over it) and by searching on th= e literal string \203 or 203.=0A= =0A= What's going on?=0A= =0A= When I put the cursor on one of these characters and do describe-char I get= =0A= position: 474 of 48736 (1%), column: 25=0A= character: \203 (displayed as \203) (codepoint 4194179, #o17777= 603, #x3fff83)=0A= preferred charset: tis620-2533 (TIS620.2533)=0A= code point in charset: 0x83=0A= syntax: w which means: word=0A= category: L:Left-to-right (strong)=0A= to input: type "C-x 8 RET HEX-CODEPOINT" or "C-x 8 RET NAME"= =0A= buffer code: #x83=0A= file code: #x83 (encoded by coding system raw-text-unix)=0A= display: not encodable for terminal=0A= =0A= Character code properties: customize what to show=0A= general-category: Cn (Other, Not Assigned)=0A= decomposition: (4194179) ('?')=0A= =0A= Is the fact that the codepoint is not 203 significant?=0A= =0A= History of the file:=0A= SAS running on MS Windows produced rtf output. SAS has its own font it lik= es to use,=0A= and the horizontal bar used in tables does not travel well (even on Windows= ). That's the character that's causing trouble.=0A= Opened in Wordpad on Windows and exported text.=0A= Move the file to Debian GNU/Linux (UTF-8 environment) and opened the text f= ile in emacs 24.4.1.=0A= =0A= Using a slightly different procedure I was able to replace octal characters= :=0A= Started with same rtf output.=0A= On linux ran unrtf -- text to convert to text.=0A= Open text file in emacs. In this case the character was \220.=0A= I tried wordpad because unrtf did not preserve the column alignment=0A= =0A= Thanks.=0A= Ross Boylan.=