From mboxrd@z Thu Jan 1 00:00:00 1970 Path: main.gmane.org!not-for-mail From: sebyte Newsgroups: gmane.emacs.help Subject: Re: When is a text file not a text file? Date: Fri, 09 Jan 2004 18:17:23 +0000 Organization: Customer of PlusNet Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Message-ID: References: NNTP-Posting-Host: deer.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii; format=flowed Content-Transfer-Encoding: 7bit X-Trace: sea.gmane.org 1073674868 27049 80.91.224.253 (9 Jan 2004 19:01:08 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Fri, 9 Jan 2004 19:01:08 +0000 (UTC) Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Fri Jan 09 20:00:59 2004 Return-path: Original-Received: from monty-python.gnu.org ([199.232.76.173]) by deer.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 1Af1sQ-0000Jm-00 for ; Fri, 09 Jan 2004 20:00:59 +0100 Original-Received: from localhost ([127.0.0.1] helo=monty-python.gnu.org) by monty-python.gnu.org with esmtp (Exim 4.24) id 1Af2LI-0003oU-QH for geh-help-gnu-emacs@m.gmane.org; Fri, 09 Jan 2004 14:30:48 -0500 Original-Path: shelby.stanford.edu!newsfeed.stanford.edu!news.tele.dk!news.tele.dk!small.news.tele.dk!peer1.news.newnet.co.uk!peer1.news.newnet.co.uk!212.159.2.27.MISMATCH!landlord!wards.force9.net.POSTED!not-for-mail User-Agent: Mozilla/5.0 (Macintosh; U; PPC Mac OS X Mach-O; en-US; rv:1.4) Gecko/20030624 Netscape/7.1 X-Accept-Language: en-us, en Original-Newsgroups: gnu.emacs.help In-Reply-To: Original-Lines: 57 Original-NNTP-Posting-Host: 81.174.136.8 Original-X-Complaints-To: abuse@plus.net.uk Original-X-Trace: wards.force9.net 1073672369 81.174.136.8 (Fri, 09 Jan 2004 18:19:29 GMT) Original-NNTP-Posting-Date: Fri, 09 Jan 2004 18:19:29 GMT Original-Xref: shelby.stanford.edu gnu.emacs.help:119935 Original-To: help-gnu-emacs@gnu.org X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.2 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: main.gmane.org gmane.emacs.help:15864 X-Report-Spam: http://spam.gmane.org/gmane.emacs.help:15864 > What "tags" are these? I don't know the actual program you are > using. But I seem to recall that I once had a program "html2text" or > "htmltotxt" or whatever that procuced a text file as output > *containing ANSI escape sequences for colours*. > > Could that be the case? It would explain why dumping such a file on > the tty---unlike visiting it with a text editor---would display it > correctly. > > If this *is* the case, then you probably could do something with > ansi-color.el, though I don't know offhand how exactly. > > Oliver Hi Oliver, Thanks for your time. Here's an example of html2text's output, displayed in an Emacs buffer: C^HCo^Hop^Hpy^Hyr^Hri^Hig^Hgh^Hht^Ht n^Hno^Hot^Hti^Hic^Hce^He:^H: All reader-contributed material on freshmeat.net is the property and responsibility of its author; for reprint rights, please contact the author directly. ----------------------------------------------------------------------------- Let me repeat that: OS X is not Unix. Consider the following: all of Apple.com's _^Hm_^Ha_^Hr_^Hk_^He_^Ht_^Hi_^Hn_^Hg_^H _^Hp_^Ha_^Hg_^He_^Hs on the subject of their darling new operating system are extremely careful to note that OS X is "_^HU_^HN_^HI_^HX_^H-_^Hb_^Ha_^Hs_^He_^Hd". Here is how it looks on a tty or in an Emacs *shell* buffer: Copyright notice: All reader-contributed material on freshmeat.net is the property and responsibility of its author; for reprint rights, please contact the author directly. ----------------------------------------------------------------------------- Let me repeat that: OS X is not Unix. Consider the following: all of Apple.com's marketing pages on the subject of their darling new operating system are extremely careful to note that OS X is "UNIX-based". I had thought that they might be remnants of HTML tags, (I must admit I didn't look very closely), but I have found out since they are actually ANSI 'backspace control sequences', used to preserve things like underlining and boldface. The html2text option '-nobs' gets rid of them. (After days spent looking for information, I discovered that html2text comes with a manpage and all was revealed. DOH!) sebyte