From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: "Stephen J. Turnbull" Newsgroups: gmane.emacs.devel,gmane.comp.tex.texinfo.bugs Subject: Re: texi2html output validity Date: Fri, 26 Dec 2014 00:58:24 +0900 Message-ID: <87sig34t7j.fsf@uwakimon.sk.tsukuba.ac.jp> References: <87k31kga2y.fsf@fencepost.gnu.org> <87r3vsdps7.fsf@fencepost.gnu.org> <87a92ehctk.fsf_-_@violet.siamics.net> <20141223164911.GD5623@free.fr> <871tnp682q.fsf@uwakimon.sk.tsukuba.ac.jp> <20141225140528.GB16721@gnu.org> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 X-Trace: ger.gmane.org 1419523132 4729 80.91.229.3 (25 Dec 2014 15:58:52 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Thu, 25 Dec 2014 15:58:52 +0000 (UTC) Cc: Yuri Khan , bug-texinfo@gnu.org, Emacs developers To: Ineiev Original-X-From: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Thu Dec 25 16:58:46 2014 Return-path: Envelope-to: ged-emacs-devel@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1Y4AoH-0003SD-CX for ged-emacs-devel@m.gmane.org; Thu, 25 Dec 2014 16:58:45 +0100 Original-Received: from localhost ([::1]:51208 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Y4AoG-00084r-Sh for ged-emacs-devel@m.gmane.org; Thu, 25 Dec 2014 10:58:44 -0500 Original-Received: from eggs.gnu.org ([2001:4830:134:3::10]:60601) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Y4Ao4-00084d-T0 for emacs-devel@gnu.org; Thu, 25 Dec 2014 10:58:33 -0500 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Y4Ao4-0005nY-23 for emacs-devel@gnu.org; Thu, 25 Dec 2014 10:58:32 -0500 Original-Received: from shako.sk.tsukuba.ac.jp ([130.158.97.161]:54986) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Y4Any-0005kp-0F; Thu, 25 Dec 2014 10:58:26 -0500 Original-Received: from uwakimon.sk.tsukuba.ac.jp (uwakimon.sk.tsukuba.ac.jp [130.158.99.156]) (using TLSv1 with cipher AES256-SHA (256/256 bits)) (No client certificate requested) by shako.sk.tsukuba.ac.jp (Postfix) with ESMTPS id 714411C38AC; Fri, 26 Dec 2014 00:58:24 +0900 (JST) Original-Received: by uwakimon.sk.tsukuba.ac.jp (Postfix, from userid 1000) id 4E2E51A2CFC; Fri, 26 Dec 2014 00:58:24 +0900 (JST) In-Reply-To: <20141225140528.GB16721@gnu.org> X-Mailer: VM undefined under 21.5 (beta34) "kale" acf1c26e3019 XEmacs Lucid (x86_64-unknown-linux) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 3.x X-Received-From: 130.158.97.161 X-BeenThere: emacs-devel@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: "Emacs development discussions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Original-Sender: emacs-devel-bounces+ged-emacs-devel=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.devel:180660 gmane.comp.tex.texinfo.bugs:7007 Archived-At: Ineiev writes: > On Wed, Dec 24, 2014 at 12:27:25PM +0900, Stephen J. Turnbull wrote: > > AFAIK the encoding declaration is optional, defaulting to UTF-8. In > > that case, we can (and IMHO *should*, but I am no longer an expert on > > current encoding practice) require that our software generate UTF-8 > > and omit the declaration. Non-UTF-8 should be invalid in Info-HTML. > > The fact is that some users have ASCII-incompatible default > encodings (like UTF-16). if we add the declaration, it costs little, > but the pages just work for them. AFAIK, default encodings are not a problem. If Info-HTML is specified to be served as XML (which has its own issues, but that's one way to do it) then conformant browsers RFC2119-MUST assume Unicode as the coded character set, and will automatically determine the transformation format (UTF-8, UTF-16, UTF-16-little-endian) by checking the first two octets. I believe HTML5 also specifies UTF-8 as the default. Alternatively, for such systems it's trivial to generate UTF-16 from UTF-8.