From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: MON KEY Newsgroups: gmane.emacs.bugs Subject: bug#6283: doc/lispref/searching.texi reference to octal code `0377' correct? Date: Mon, 31 May 2010 20:24:00 -0400 Message-ID: References: <83vda9md09.fsf@gnu.org> <83sk5cmr8k.fsf@gnu.org> <83sk5btdcu.fsf@gnu.org> <836323ucry.fsf@gnu.org> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Trace: dough.gmane.org 1275352987 22375 80.91.229.12 (1 Jun 2010 00:43:07 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Tue, 1 Jun 2010 00:43:07 +0000 (UTC) Cc: 6283@debbugs.gnu.org To: Eli Zaretskii Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Tue Jun 01 02:43:05 2010 connect(): No such file or directory Return-path: Envelope-to: geb-bug-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1OJFZR-0005Un-2q for geb-bug-gnu-emacs@m.gmane.org; Tue, 01 Jun 2010 02:43:05 +0200 Original-Received: from localhost ([127.0.0.1]:39891 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1OJFZP-0004xF-T2 for geb-bug-gnu-emacs@m.gmane.org; Mon, 31 May 2010 20:43:04 -0400 Original-Received: from [140.186.70.92] (port=54012 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1OJFZI-0004it-6w for bug-gnu-emacs@gnu.org; Mon, 31 May 2010 20:42:57 -0400 Original-Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.69) (envelope-from ) id 1OJFKW-0001sG-Re for bug-gnu-emacs@gnu.org; Mon, 31 May 2010 20:27:41 -0400 Original-Received: from debbugs.gnu.org ([140.186.70.43]:43252) by eggs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1OJFKW-0001sA-PS for bug-gnu-emacs@gnu.org; Mon, 31 May 2010 20:27:40 -0400 Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.69) (envelope-from ) id 1OJFHy-0001B2-1F; Mon, 31 May 2010 20:25:02 -0400 X-Loop: help-debbugs@gnu.org Resent-From: MON KEY Original-Sender: debbugs-submit-bounces@debbugs.gnu.org Resent-To: owner@debbugs.gnu.org Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Tue, 01 Jun 2010 00:25:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 6283 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: Original-Received: via spool by 6283-submit@debbugs.gnu.org id=B6283.12753518444515 (code B ref 6283); Tue, 01 Jun 2010 00:25:02 +0000 Original-Received: (at 6283) by debbugs.gnu.org; 1 Jun 2010 00:24:04 +0000 Original-Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1OJFH2-0001Al-18 for submit@debbugs.gnu.org; Mon, 31 May 2010 20:24:04 -0400 Original-Received: from mail-gy0-f172.google.com ([209.85.160.172]) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1OJFH0-0001AP-A6 for 6283@debbugs.gnu.org; Mon, 31 May 2010 20:24:02 -0400 Original-Received: by gyh4 with SMTP id 4so3007425gyh.3 for <6283@debbugs.gnu.org>; Mon, 31 May 2010 17:24:01 -0700 (PDT) Original-Received: by 10.150.183.11 with SMTP id g11mr5659360ybf.66.1275351840340; Mon, 31 May 2010 17:24:00 -0700 (PDT) Original-Received: by 10.151.143.21 with HTTP; Mon, 31 May 2010 17:24:00 -0700 (PDT) In-Reply-To: <836323ucry.fsf@gnu.org> X-Google-Sender-Auth: -xSZxazc3uNEEe_O63oU6qNbnDE X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list Resent-Date: Mon, 31 May 2010 20:25:02 -0400 X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 3) X-BeenThere: bug-gnu-emacs@gnu.org List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.bugs:37466 Archived-At: On Mon, May 31, 2010 at 2:49 PM, Eli Zaretskii wrote: >> > In Unicode, it's a codepoint of LATIN SMALL LETTER Y WITH DIAERESIS. >> >> I don't understand this. > > I don't know how to express this more clearly. Perhaps you could ask > specific questions. > If you step through the Emacs Lisp example I sent along previously you may notice that the search doesn't match either of the `=C3=BF's. It does however match the character with numeric notations: 4194303, #o17777777, #x3fffff 4194221, #o17777655, #x3fffad E.g. These rawbytes as presented by Emacs as characters: (insert-byte (multibyte-char-to-unibyte 4194221) 1) (insert-byte (multibyte-char-to-unibyte 4194303) 1) This is what I don't understand. If I evauate the following: (progn (save-excursion (insert-byte (multibyte-char-to-unibyte 4194221) 1) (insert-byte (multibyte-char-to-unibyte 4194303) 1)) (search-forward-regexp "=C3=BF" nil t)) I don't match. Whereas if I evaluate: (progn (save-excursion (insert 10 #o377)) (search-forward-regexp "=C3=BF" nil t)) I get a match. Likewise, if I evaluate (progn (save-excursion (insert 10 4194303)) (search-forward-regexp "\377" nil t)) I get a match. Which is to say, given the example regexp from the manual, i.e: ,---- | You cannot always match all non-ASCII characters with the regular | expression `"[\200-\377]"' `---- I am unable to locate the character: =C3=BF (255, #o377, #xff) e.g. LATIN SMALL LETTER Y WITH DIAERESIS To be clear, my issue isn't that I am not able to match `=C3=BF' but rather that I am able to match the raw-byte character representation with a visual appearance which coincides with the octal value for the `=C3=BF' character code i.e. #o377 this being otherwise widely understood as `octal 0377'. I hope this is more clear than the previous mail. I apologize if it is not. -- /s_P]