From: Juri Linkov
Newsgroups: gmane.emacs.devel
Subject: Re: Handling invalid HTML
Date: Wed, 19 Oct 2005 18:59:14 +0300
Organization: JURTA
Message-ID: <877jc9h5hi.fsf@jurta.org>
References: <200510181105.56063.jyavner@member.fsf.org>
Cc: emacs-devel@gnu.org
Original-To: Jonathan Yavner
In-Reply-To: <200510181105.56063.jyavner@member.fsf.org> (Jonathan Yavner's message of "Tue, 18 Oct 2005 11:05:55 -0400")
User-Agent: Gnus/5.110004 (No Gnus v0.4) Emacs/22.0.50 (gnu/linux)
Xref: news.gmane.org gmane.emacs.devel:44330

> http://www.w3.org/TR/html4/intro/sgmltut.html#h-3.2.2
> "By default, SGML requires that all attribute values be delimited
> using either double quotation marks (ASCII decimal 34) or single
> quotation marks (ASCII decimal 39). ... In certain cases, authors
> may specify the value of an attribute without any quotation marks."
>
> In XHTML the no-marks case was eliminated, but the use of 'apostrophes'
> is still valid. There are many complaints one can make about
> Microsoft, but this isn't one of them.

I still see no reason for them to generate HTML files with such an
uncommon syntax, if not for making the life of users harder.

Anyway, the following patch will allow Emacs to recognize encoding
with either quotation marks (and for the attribute `content-type'
quotation marks are optional):

Index: lisp/international/mule.el
===================================================================
RCS file: /cvsroot/emacs/emacs/lisp/international/mule.el,v
retrieving revision 1.226
diff -c -r1.226 mule.el
*** lisp/international/mule.el 24 Sep 2005 13:43:59 -0000 1.226
--- lisp/international/mule.el 19 Oct 2005 15:57:28 -0000
***************
*** 2229,2242 ****
  (save-excursion
  (forward-line 10)
  (point))))
! (when (and (search-forward "