From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: "Buchs, Kevin J." Newsgroups: gmane.emacs.help Subject: reading .msg files from MS Outlook Date: Fri, 21 Mar 2014 14:06:49 -0500 Message-ID: <3dfcdc$fvt0tp@ironport9.mayo.edu> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Trace: ger.gmane.org 1395428831 8074 80.91.229.3 (21 Mar 2014 19:07:11 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Fri, 21 Mar 2014 19:07:11 +0000 (UTC) To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Fri Mar 21 20:07:20 2014 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1WR4mh-0006j3-P1 for geh-help-gnu-emacs@m.gmane.org; Fri, 21 Mar 2014 20:07:15 +0100 Original-Received: from localhost ([::1]:54303 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1WR4mg-0003zK-NL for geh-help-gnu-emacs@m.gmane.org; Fri, 21 Mar 2014 15:07:14 -0400 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:59160) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1WR4mP-0003yi-Fd for help-gnu-emacs@gnu.org; Fri, 21 Mar 2014 15:07:03 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1WR4mJ-0003ZQ-Hp for help-gnu-emacs@gnu.org; Fri, 21 Mar 2014 15:06:57 -0400 Original-Received: from mail10.mayo.edu ([129.176.114.198]:41011) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1WR4mJ-0003Z9-Cu for help-gnu-emacs@gnu.org; Fri, 21 Mar 2014 15:06:51 -0400 Original-Received: from unknown (HELO mail9.mayo.edu) ([10.146.65.139]) by ironport10-dlp.mayo.edu with ESMTP; 21 Mar 2014 14:06:53 -0500 X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: Ap4EAC6NLFMKgNEM/2dsb2JhbABZxnCBOHSCZAQBZIEjCAEBh3rJMYYzjx6CZ4E7BIlSlWuPCA Original-Received: from unknown (HELO msgoms03.mayo.edu) ([10.128.209.12]) by ironport9.mayo.edu with ESMTP; 21 Mar 2014 14:05:04 -0500 User-Agent: Mozilla/5.0 (X11; Linux i686 on x86_64; rv:24.0) Gecko/20100101 Thunderbird/24.4.0 X-CFilter-Loop: Reflected X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-Received-From: 129.176.114.198 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:96682 Archived-At: From time to time I want to open a .msg file which has been created from Outlook in emacs on GNU/Linux. The file is a mixed binary/text compound document. I'm interested in the text and in the sections that contain that, apparently there is a 16-bit encoding where the upper byte is null (for English text). But, double nulls mean something different. I just gave my 'ol college try at employing a variety of character encodings when reading the file to no avail. In the past I did some complex replacements to get rid of nulls and still keep word separation. I have to wonder, is there a better way? -- Kevin Buchs