From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Hongyi Zhao Newsgroups: gmane.emacs.help Subject: Re: [External] : Re: Strange whitespaces. Date: Fri, 1 Oct 2021 15:26:51 +0800 Message-ID: References: <8735pmml4r.fsf@zoho.eu> <83v92if3ew.fsf@gnu.org> <835yuhfe2a.fsf@gnu.org> Mime-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="2398"; mail-complaints-to="usenet@ciao.gmane.io" Cc: help-gnu-emacs To: Eli Zaretskii Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane-mx.org@gnu.org Fri Oct 01 09:35:59 2021 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1mWD5L-0000PQ-6Y for geh-help-gnu-emacs@m.gmane-mx.org; Fri, 01 Oct 2021 09:35:59 +0200 Original-Received: from localhost ([::1]:50376 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1mWD5J-0003pX-DM for geh-help-gnu-emacs@m.gmane-mx.org; Fri, 01 Oct 2021 03:35:57 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:39072) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1mWCwk-00008A-Rs for help-gnu-emacs@gnu.org; Fri, 01 Oct 2021 03:27:06 -0400 Original-Received: from mail-ua1-x935.google.com ([2607:f8b0:4864:20::935]:35830) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1mWCwi-0000Ag-Tz; Fri, 01 Oct 2021 03:27:06 -0400 Original-Received: by mail-ua1-x935.google.com with SMTP id k32so6036012uae.2; Fri, 01 Oct 2021 00:27:04 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=HdSfZUMZn0gvdJa6HqEWJV6FT1DiQFBMLk6L4IHZh5k=; b=b+mU0zpm28azXDRt+cSlmmgH4t4mCFeEZISXqD67TQ458JHJwk08RihXV9YDQdDWwD vAsAcIjN4HwKykeH+qbUyZaNpZHZbkOIM8PjxFD+jxRu7RLFSDwvilT/uUhDorebdj3r avLTS9wmQCnBZE5D1+nnzeQ5+j2oYrJMId0xc37yhNmCuGKGDhUu2gvHzU5FW85Kvb85 PU5414it4i8IbpdEeSBAxBM2cRGSo4AdJ4kG46j7nmViSU0Hdgmy6MlAkrksoSXGMpPM PLpT2w9ECTQDBRdDJa9MsNJ4jUyOSKg74yEk5vQVQzzkt5+cswzm8eWG4P0Et50aKuy8 DXdg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=HdSfZUMZn0gvdJa6HqEWJV6FT1DiQFBMLk6L4IHZh5k=; b=AMqiY5RHEnODoX7ZLXL2vvhAPRxkUuNvc6yTrKfpL6AQGao8lJCtOQNbtDRq5WbRfx juoEyvozlnjEkEPpbbDnjYODeL264BQNLn3u7bAEXaYQVPyuCr+Fmvg7zYnJ2amFpinR zrbuGGWBkofTYWmFyz9sotJk70dZz6wywHI0D7CUUSwTqRQ7K8QV0O/S0Ghwx9TNfJVW I4sBHB1oACVwAqbx2k8CYE1TyhBsCxxujHMp4JQQ4lJa+NPh6ouhg5M9TkAzzYOJMlXx 41qD3Uu9r3WuAgYtLRy06B2jY7TTjwsXJkOxt6qXr34mtVaABPWUfl4h2VHoKvzsvDyW RqSQ== X-Gm-Message-State: AOAM530XqmvfV6qTsaOX9d7VzLb2lGZBla4zjQGdaarXfJdBXeTxNEVy 5xkPiY6SJAwO8nKI17utID4xxwVST9QgnPMKpC3+82cTJc5FNvDk X-Google-Smtp-Source: ABdhPJx4QLI5XOH1GR6kp/PVCfFOKPS1dIDnPoypOGjdOb1o/FSLCwhznFHCCkVr9/+PDwKoz/JUUWHs5I6SGDOZc/M= X-Received: by 2002:ab0:1001:: with SMTP id f1mr8658984uab.115.1633073223292; Fri, 01 Oct 2021 00:27:03 -0700 (PDT) In-Reply-To: <835yuhfe2a.fsf@gnu.org> Received-SPF: pass client-ip=2607:f8b0:4864:20::935; envelope-from=hongyi.zhao@gmail.com; helo=mail-ua1-x935.google.com X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane-mx.org@gnu.org Original-Sender: "help-gnu-emacs" Xref: news.gmane.io gmane.emacs.help:133473 Archived-At: On Fri, Oct 1, 2021 at 2:36 PM Eli Zaretskii wrote: > > > From: Hongyi Zhao > > Date: Fri, 1 Oct 2021 09:51:22 +0800 > > Cc: help-gnu-emacs > > > > > We now highlight any non-ASCII character whose Unicode > > > general category is "Space Separator" (or Zs for short). > > > > I fail to see the connection between the abbreviation and the original > > representation it stands for. > > You mean, Zs vs "Space Separator"? Yes. > Please complain to the Unicode Consortium about any of that, Emacs just uses the names and > nomenclature they invented. See > > https://www.unicode.org/reports/tr44/#General_Category_Values I presumably basically figured out the logic behinds the nomenclature: Based on the Description given on the above URL: a space character (of various non-zero widths) So, the Z <---> non-zero, and s <---> space. This is like the naming rules used in regular expression metacharacters, say, in python [1]: \s For Unicode (str) patterns: Matches Unicode whitespace characters (which includes [ \t\n\r\f\v], and also many other characters, for example the non-breaking spaces mandated by typography rules in many languages). If the ASCII flag is used, only [ \t\n\r\f\v] is matched. For 8-bit (bytes) patterns: Matches characters considered whitespace in the ASCII character set; this is equivalent to [ \t\n\r\f\v]. \S Matches any character which is not a whitespace character. This is the opposite of \s. If the ASCII flag is used this becomes the equivalent of [^ \t\n\r\f\v]. [1] https://docs.python.org/3/library/re.html HZ