From: Drew Adams
Newsgroups: gmane.emacs.devel
Subject: RE: Emacs as word processor / Text Properties
Date: Wed, 27 Nov 2013 15:01:21 -0800 (PST)
Message-ID: <35e892b1-73b8-4ca2-9317-7eb83e7223e5@default>
To: "T.V. Raman", "Pascal J. Bourguignon", emacs-devel@gnu.org

> All that said, *every* known WYSIWYG word-processor also degrades
> to using a dump of its internal data structures as its file
> format.

Not sure just what you mean by that. But if the internal data
structures faithfully represent XML data, and the output file format
is a serialization of that XML data, then this is hardly messy or
lossy.

And that is the case for more and more "WYSIWYG" editors, at least
the "high-end" ones. The internal data structures are XML, and the
file format is XML. XHTML, PDF, or whatever is then generated from
the internal XML or from the output XML file.

Internally, an XML representation might use a DOM or binary XML or
any number of other implementation means. But the only important
question is whether the output (serialized) form and the internal
form mirror each other properly. (For that, there can be
considerations of whether document fidelity is needed or just DOM
fidelity, i.e., whether or not insignificant whitespace needs to be
preserved.)

Sometimes an editor ("word processor") has additional output file
formats, even if it is capable of saving as XML. But that's another
story. More and more, the "real" output file format is XML. (Yes,
for MS Word, most people still see *.doc files, not XML files.)
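
To make the distinction between document fidelity and DOM fidelity concrete, here is a minimal sketch in Python using the standard library's `xml.etree.ElementTree`. The helper names (`dom_signature`, `same_dom`, `same_document`) and the sample markup are hypothetical, made up for illustration. The sketch also assumes that whitespace-only text between elements is insignificant, which in practice depends on the schema or document type.

```python
import xml.etree.ElementTree as ET


def _significant(text):
    """Return text unchanged unless it is missing or whitespace-only."""
    return text if text and text.strip() else ""


def dom_signature(elem):
    """Describe an element as a nested tuple, ignoring whitespace-only text.

    Assumption: whitespace-only text nodes carry no meaning.
    """
    children = tuple(dom_signature(child) for child in elem)
    return (
        elem.tag,
        tuple(sorted(elem.attrib.items())),
        _significant(elem.text),
        _significant(elem.tail),
        children,
    )


def same_dom(xml_a, xml_b):
    """DOM fidelity: the two serializations describe the same tree."""
    return dom_signature(ET.fromstring(xml_a)) == dom_signature(ET.fromstring(xml_b))


def same_document(xml_a, xml_b):
    """Document fidelity: the two serializations are character-for-character equal."""
    return xml_a == xml_b


# Hypothetical sample data: the same content, with and without indentation.
compact = '<doc><para style="body">Hello</para></doc>'
pretty = '<doc>\n  <para style="body">Hello</para>\n</doc>'

print(same_dom(compact, pretty))       # same tree once indentation is ignored
print(same_document(compact, pretty))  # the raw text differs
```

An editor that only promises DOM fidelity may re-indent a file on save and still pass the first check; one that promises document fidelity has to pass the second as well.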