From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!.POSTED!not-for-mail From: Tomas Nordin Newsgroups: gmane.emacs.help Subject: Re: viewing docx files Date: Mon, 30 Jan 2017 23:03:56 +0100 Message-ID: <87k29c19yr.fsf@flaptop.tomnor.org> References: <159e5b8b2e0.2800.9343beaceee5adfd5722805b7ce72987@gmail.com> <87wpddijo4.fsf@flaptop.tomnor.org> <87k29d4h0r.fsf@fastmail.fm> NNTP-Posting-Host: blaine.gmane.org Mime-Version: 1.0 Content-Type: text/plain X-Trace: blaine.gmane.org 1485813898 5194 195.159.176.226 (30 Jan 2017 22:04:58 GMT) X-Complaints-To: usenet@blaine.gmane.org NNTP-Posting-Date: Mon, 30 Jan 2017 22:04:58 +0000 (UTC) Cc: Joost Kremers , Jude DaShiell , "help-gnu-emacs@gnu.org" , Devin Prater To: Yuri Khan Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Mon Jan 30 23:04:54 2017 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by blaine.gmane.org with esmtp (Exim 4.84_2) (envelope-from ) id 1cYK4D-00018z-NW for geh-help-gnu-emacs@m.gmane.org; Mon, 30 Jan 2017 23:04:53 +0100 Original-Received: from localhost ([::1]:35228 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1cYK4F-0007JR-LP for geh-help-gnu-emacs@m.gmane.org; Mon, 30 Jan 2017 17:04:55 -0500 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:43706) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1cYK3p-0007JB-3h for help-gnu-emacs@gnu.org; Mon, 30 Jan 2017 17:04:29 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1cYK3k-0005lG-9c for help-gnu-emacs@gnu.org; Mon, 30 Jan 2017 17:04:29 -0500 Original-Received: from mout02.posteo.de ([185.67.36.66]:54683) by eggs.gnu.org with esmtps (TLS1.0:DHE_RSA_AES_256_CBC_SHA1:32) (Exim 4.71) (envelope-from ) id 1cYK3k-0005l8-36 for help-gnu-emacs@gnu.org; Mon, 30 Jan 2017 17:04:24 -0500 Original-Received: from submission (posteo.de [89.146.220.130]) by mout02.posteo.de (Postfix) with ESMTPS id BEC7820BDD for ; Mon, 30 Jan 2017 23:04:21 +0100 (CET) Original-Received: from customer (localhost [127.0.0.1]) by submission (posteo.de) with ESMTPSA id 3vC3MR4rpWzytZ; Mon, 30 Jan 2017 23:04:19 +0100 (CET) Original-Received: from tomas by flaptop with local (Exim 4.84_2) (envelope-from ) id 1cYK3I-00028Q-9f; Mon, 30 Jan 2017 23:03:56 +0100 In-Reply-To: X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] [fuzzy] X-Received-From: 185.67.36.66 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.21 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Original-Sender: "help-gnu-emacs" Xref: news.gmane.org gmane.emacs.help:112209 Archived-At: Yuri Khan writes: > Instead of iconv, use xmllint --encode utf-8. It will extract the > source encoding from the XML declaration at the top of the file, and > reencode from that to UTF-8. Trigger it unconditionally, for each > *.xml file in the archive. > > Consider also trying to persuade Pandoc developers to support > non-UTF-8-encoded XML data. Seem like good ideas to keep in the pocket. I have been surfing around a bit and it seems the docx xml files should be utf-8 encoded by default.