From: Stefan Kangas
Newsgroups: gmane.emacs.bugs
Subject: bug#52263: Stale comment in xsd-regexp.el about Emacs not supporting Unicode
Date: Sat, 4 Dec 2021 14:07:46 +0100
References: <83r1at7als.fsf@gnu.org>
In-Reply-To: <83r1at7als.fsf@gnu.org>
To: Eli Zaretskii
Cc: 52263@debbugs.gnu.org

Eli Zaretskii writes:

>> I believe this comment in lisp/nxml/xsd-regexp.el can be removed as
>> Emacs supports Unicode now:
>>
>> ;; The semantics of XSD regexps are defined in terms of Unicode.
>> ;; Non-Unicode characters are not allowed in regular expressions and
>> ;; will not match against the generated regular expressions. A
>> ;; Unicode character means a character in one of the Mule charsets
>> ;; ascii, latin-iso8859-1, mule-unicode-0100-24ff,
>> ;; mule-unicode-2500-33ff, mule-unicode-e000-ffff, eight-bit-control
>> ;; or a character translatable to such a character (i.e a character
>> ;; for which `encode-char' will return non-nil).
>> ;;
>> ;; Unfortunately, this means that this package is currently useless
>> ;; for CJK characters, since there's no mule-unicode charset for the
>> ;; CJK ranges of Unicode. We should devise a workaround for this
>> ;; until the fabled Unicode version of Emacs makes an appearance.
>>
>> Is that correct?
>
> Probably. The mule-Unicode-* stuff is definitely obsolete. The only
> thing that bothers me is what happens with eight-bit characters in the
> XSD regexps -- are they allowed? Emacs in general does allow them.
> If xsd-regexp.el doesn't, that should be stated there.

Hmm, so probably more work is needed here than just removing the above
comment. There is a lot of non-trivial mule and conversion stuff going
on in that library that might need a proper look by someone that knows
this stuff well.

Perhaps this bug should also be retitled accordingly.