From: Lennart Borgman
Newsgroups: gmane.emacs.devel
Subject: Re: Converting a string to valid XHTML id?
Date: Thu, 2 Dec 2010 00:12:31 +0100
To: herring@lanl.gov
Cc: rm@tuxteam.de, Ralf Mattes, emacs-devel@gnu.org

On Wed, Dec 1, 2010 at 11:32 PM, Davis Herring wrote:
>> That sounds tempting but is wrong :-/ Percent-encoding doesn't produce
>> valid ID values. From the html 4 specs:
>>
>>  6.2 SGML basic types
>>
>>   ....
>>
>>  ID and NAME tokens must begin with a letter ([A-Za-z]) and may be
>>  followed by any number of letters, digits ([0-9]), hyphens ("-"),
>>  underscores ("_"), colons (":"), and periods (".").
>
> If you're referring to the leading letter, you're right -- I forgot about
> it.  Easy enough to fix: also use Lennart's "ANON-" prefix when the string
> begins with a non-letter or with the string "ANON-".
>
> Or is there something more fundamental that I'm missing?

Yes, % is not allowed. And the names should be unique.
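
To make the two constraints concrete (only the characters listed in the HTML 4 excerpt are allowed, and every ID must be unique), here is a minimal sketch in Python. It is an illustration only, not code from this thread or from Emacs: the function name `to_html4_id`, the `used` set, the "_" replacement character and the "-N" suffix scheme are all assumptions. Only the "ANON-" prefix comes from the discussion above.

```python
import re

# Characters NOT permitted in an HTML 4 ID/NAME token (per the quoted
# section 6.2): anything other than ASCII letters, digits, "-", "_",
# ":" and ".". Note that "%" falls in this disallowed set.
_DISALLOWED = re.compile(r"[^A-Za-z0-9_:.-]")


def to_html4_id(text, used, prefix="ANON-"):
    """Return an HTML 4 ID derived from `text` that is not yet in `used`.

    `used` is a set of IDs already handed out; it is updated in place.
    Hypothetical helper: the name and the suffix scheme are assumptions.
    """
    # Replace every disallowed character (including "%") with "_".
    candidate = _DISALLOWED.sub("_", text)

    # An ID must begin with an ASCII letter; otherwise add the prefix.
    if not re.match(r"[A-Za-z]", candidate):
        candidate = prefix + candidate

    # Replacement is lossy, so different inputs can collide. Append a
    # numeric suffix until the result is unique.
    unique = candidate
    n = 1
    while unique in used:
        n += 1
        unique = f"{candidate}-{n}"

    used.add(unique)
    return unique


used_ids = set()
print(to_html4_id("50% off", used_ids))  # ANON-50__off
print(to_html4_id("50% off", used_ids))  # ANON-50__off-2
print(to_html4_id("Intro", used_ids))    # Intro
```

Because the replacement step throws information away, the result cannot be mapped back to the original string; the uniqueness bookkeeping is what keeps two different headings from ending up with the same ID.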