From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: Dan Jacobson Newsgroups: gmane.emacs.bugs Subject: Re: user sees \xxx but is thwarted from searching for them Date: 18 Apr 2002 10:15:29 +0800 Sender: bug-gnu-emacs-admin@gnu.org Message-ID: References: <6048-Tue16Apr2002152118+0300-eliz@is.elta.co.il> Reply-To: jidanni@yam.com.tw NNTP-Posting-Host: localhost.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable X-Trace: main.gmane.org 1019118063 24587 127.0.0.1 (18 Apr 2002 08:21:03 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Thu, 18 Apr 2002 08:21:03 +0000 (UTC) Return-path: Original-Received: from fencepost.gnu.org ([199.232.76.164]) by main.gmane.org with esmtp (Exim 3.33 #1 (Debian)) id 16y7A6-0006OS-00 for ; Thu, 18 Apr 2002 10:21:03 +0200 Original-Received: from localhost ([127.0.0.1] helo=fencepost.gnu.org) by fencepost.gnu.org with esmtp (Exim 3.34 #1 (Debian)) id 16y7AG-0007j9-00; Thu, 18 Apr 2002 04:21:12 -0400 Original-Received: from mailbox3.ucsd.edu ([132.239.1.55]) by fencepost.gnu.org with esmtp (Exim 3.34 #1 (Debian)) id 16y77Q-0007Xd-00 for ; Thu, 18 Apr 2002 04:18:16 -0400 Original-Received: from mail.fu-berlin.de (mail.fu-berlin.de [160.45.11.165]) by mailbox3.ucsd.edu (8.12.1/8.12.1) with ESMTP id g3I8I3xS015491 for ; Thu, 18 Apr 2002 01:18:14 -0700 (PDT) Original-Received: by mail.fu-berlin.de (Smail3.2.0.98) from Curry.ZEDAT.FU-Berlin.DE (160.45.10.36) with esmtp id ; Thu, 18 Apr 2002 10:18:03 +0200 (MEST) Original-Received: by Curry.ZEDAT.FU-Berlin.DE (Smail3.2.0.98) from news.fu-berlin.de with bsmtp id ; Thu, 18 Apr 2002 10:18:02 +0200 (MEST) Original-To: gnu-emacs-bug@moderators.isc.org Original-Path: 61-227-44-125.hinet-ip.hinet.NET!not-for-mail Original-Newsgroups: gnu.emacs.bug Original-Lines: 48 X-Orig-NNTP-Posting-Host: 61-227-44-125.hinet-ip.hinet.net (61.227.44.125) X-Orig-X-Trace: fu-berlin.de 1019117881 4593429 61.227.44.125 (16 [99749]) X-Orig-Path: localhost.localdomain!nobody User-Agent: Gnus/5.09 (Gnus v5.9.0) Emacs/21.1 X-MIME-Autoconverted: from 8bit to quoted-printable by mailbox3.ucsd.edu id g3I8I3xS015491 Errors-To: bug-gnu-emacs-admin@gnu.org X-BeenThere: bug-gnu-emacs@gnu.org X-Mailman-Version: 2.0.9 Precedence: bulk List-Help: List-Post: List-Subscribe: , List-Id: Bug reports for GNU Emacs, the Swiss army knife of text editors List-Unsubscribe: , List-Archive: Xref: main.gmane.org gmane.emacs.bugs:763 X-Report-Spam: http://spam.gmane.org/gmane.emacs.bugs:763 >>>>> "K" =3D=3D Kai Gro=DFjohann wr= ites: K> Then it is clear that Dan wants to search for buffer parts that K> aren't in (representable) in the foo coding. Right? K> Dan? [sniff] they referred to me by name. it's almost like I exist [sniff]. Sorry, I've been chasing the wild pig hunters of my land. OK, my file would be a well behaved big5 chinese file except for a few scattered characters that the author was using to represent some IPA symbols. My mission: no hunt them down and deal with them so that the file can the be used with emacs. what I probably should do is find a perl script that will replace any characters outside the intended coding system of the file [which I could tell it explicitly], "with ***\343\433 was here***" [ASCII] which I could then deal with later in emacs. Hmmm, this seems hard in perl, given big5's definition of /[\x80-\xFE][\x40-\x7E\xA1-\xFE]/ also one should ignore any [0x00-0x7F]. Indeed, how do the \xxx's get on my screen in the first place? well C-x C-f is just going to make the whole file \xxx, so I do M-! cat file, at least there I can see most of the chinese, and the \xxx's stick out like a sore thumb. but, what a drag it is that one can see the \xxx's but cant search for them. it almost makes one want to wrap this emacs inside another emacs to be able to search for them [but a screen at a time]. Anyway, I would just be searching in *Shell Command Output*, and still have to navigate the now 100% \xxx source file. So, my perl script idea seems better. By the way, apparently gnus asked me if I wanted to save Kai's name to BBDB and I hit "y" or something. Well, as Kai has that big B in his name [=3Dss I think], and as I already had some big5 in my BBDB, well, when it came time to save I was given the Spanish inquisition about coding sets or something ... who knows, one false step here and your file will become Coptic Egyptian or something. So I want back and switched the B for ss before saving. By they way, does emacs require on to see octal codes on their screen or can it live in a hex world yet? --=20 http://jidanni.org/ Taiwan(04)25854780