From: Juanma Barranquero
Newsgroups: gmane.emacs.devel
Subject: Re: Interactive ucs-insert fails in HEAD
Date: Sat, 10 Oct 2009 13:44:02 +0200
References: <83hbu8ls52.fsf@gnu.org> <83d44wleqw.fsf@gnu.org> <83y6njk7ob.fsf@gnu.org>
Cc: emacs-devel@gnu.org
To: Eli Zaretskii
In-Reply-To: <83y6njk7ob.fsf@gnu.org>
Xref: news.gmane.org gmane.emacs.devel:116060

On Sat, Oct 10, 2009 at 09:52, Eli Zaretskii wrote:

> If you mean entries like this one:
>
>    20000;;Lo;0;L;;;;;N;;;;;
>    2A6D6;;Lo;0;L;;;;;N;;;;;
>
> then I agree.  Still, I don't understand how including them could
> screw completion in ucs-insert.  Can you explain?

It's not that they screw up completion; it's that they provide no name
to complete on.  You can still use ucs-insert to insert them by
codepoint number.

> What I meant is to have something that detects in UnicodeData.txt the
> pattern of entries which need to be excluded.
> For example, if entries
> like this are the only ones:
>
>    4DB5;;Lo;0;L;;;;;N;;;;;
>
> then detecting bracketed text where a character name should be would
> suffice.  We could then run such a script as part of what
> admin/Makefile does for the unidata.txt file it creates, and display a
> proper message.

unidata-setup-list already detects these kinds of ranges and deals
with them.

> Anyway, we now skip entries like this as well:
>
>    1F210;SQUARED CJK UNIFIED IDEOGRAPH-624B
>
> which I think we don't need to skip.

As I said before, I agree: we *shouldn't* need to skip them.  But
somehow, the process in unidata-gen.el that creates the chartables
with codepoint information is generating bad data exclusively for that
range: U+1F200..U+1F2FF (which are not the only characters added in
5.2.0, BTW).  That's what triggered the ucs-insert error: getting the
name of one of those characters returns a number instead of a symbol.

I'm trying to determine why the bad data is generated in the first
place.

    Juanma
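
For illustration only, the kind of check described in the quoted text (flagging UnicodeData.txt entries whose name field is bracketed or empty) could be sketched in Python roughly as follows. This is not what admin/Makefile, unidata-gen.el or unidata-setup-list actually do. The function name find_unnamed_entries and the SAMPLE lines are made up for the example, and it assumes the usual semicolon-separated layout with the code point in the first field and the name in the second.

```python
def find_unnamed_entries(lines):
    """Return the code points whose name field is empty or bracketed.

    Hypothetical helper: assumes UnicodeData.txt-style lines, i.e.
    semicolon-separated fields with the hexadecimal code point first
    and the character name second.
    """
    flagged = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        fields = line.split(";")
        if len(fields) < 2:
            continue  # not a data line
        codepoint, name = fields[0], fields[1]
        # A name such as "<..., First>" or "<..., Last>" marks a range
        # boundary rather than a real character name.
        if name == "" or (name.startswith("<") and name.endswith(">")):
            flagged.append(int(codepoint, 16))
    return flagged


# Illustrative sample lines written for this sketch, not read from a file.
SAMPLE = [
    "0041;LATIN CAPITAL LETTER A;Lu;0;L;;;;;N;;;;0061;",
    "4DB5;<CJK Ideograph Extension A, Last>;Lo;0;L;;;;;N;;;;;",
    "20000;;Lo;0;L;;;;;N;;;;;",
    "1F210;SQUARED CJK UNIFIED IDEOGRAPH-624B;So;0;L;<square> 624B;;;;N;;;;;",
]

if __name__ == "__main__":
    for cp in find_unnamed_entries(SAMPLE):
        print(f"U+{cp:04X} has no usable name")
```

Note that a check this simple would also flag entries named <control>, so a real script would probably want to treat those separately. The 1F210 line is not flagged, because its bracketed text is in the decomposition field, not in the name field.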