From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: pjb@informatimago.com (Pascal J. Bourguignon) Newsgroups: gmane.emacs.help Subject: Re: search for any two consecutive uppercase characters Date: Thu, 13 Aug 2009 15:09:31 +0200 Organization: Anevia SAS Message-ID: <7c63cramic.fsf@pbourguignon.anevia.com> References: <87fxbw3bfh.fsf@galatea.local> <4a84058c$0$303$14726298@news.sunsite.dk> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit X-Trace: ger.gmane.org 1250171295 32541 80.91.229.12 (13 Aug 2009 13:48:15 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Thu, 13 Aug 2009 13:48:15 +0000 (UTC) To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Thu Aug 13 15:48:08 2009 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1Mbaez-0003YI-R4 for geh-help-gnu-emacs@m.gmane.org; Thu, 13 Aug 2009 15:48:06 +0200 Original-Received: from localhost ([127.0.0.1]:57019 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Mbaez-0004wM-9E for geh-help-gnu-emacs@m.gmane.org; Thu, 13 Aug 2009 09:48:05 -0400 Original-Path: news.stanford.edu!headwall.stanford.edu!news.glorb.com!news2.glorb.com!proxad.net!feeder1-2.proxad.net!cleanfeed3-b.proxad.net!nnrp11-2.free.fr!not-for-mail Original-Newsgroups: gnu.emacs.help Face: iVBORw0KGgoAAAANSUhEUgAAADAAAAAwAQMAAABtzGvEAAAABlBMVEUAAAD///+l2Z/dAAAA oElEQVR4nK3OsRHCMAwF0O8YQufUNIQRGIAja9CxSA55AxZgFO4coMgYrEDDQZWPIlNAjwq9 033pbOBPtbXuB6PKNBn5gZkhGa86Z4x2wE67O+06WxGD/HCOGR0deY3f9Ijwwt7rNGNf6Oac l/GuZTF1wFGKiYYHKSFAkjIo1b6sCYS1sVmFhhhahKQssRjRT90ITWUk6vvK3RsPGs+M1RuR mV+hO/VvFAAAAABJRU5ErkJggg== X-Accept-Language: fr, es, en X-Disabled: X-No-Archive: no User-Agent: Gnus/5.101 (Gnus v5.10.10) Emacs/22.2 (gnu/linux) Cancel-Lock: sha1:c8DWInIrRP/7UjgC4J4Uwc9li5g= Original-Lines: 39 Original-NNTP-Posting-Date: 13 Aug 2009 15:09:31 MEST Original-NNTP-Posting-Host: 88.170.236.224 Original-X-Trace: 1250168971 news-2.free.fr 9364 88.170.236.224:41644 Original-X-Complaints-To: abuse@proxad.net Original-Xref: news.stanford.edu gnu.emacs.help:171945 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:67121 Archived-At: "Colin S. Miller" writes: > Pascal J. Bourguignon wrote: >> Horacio Suarez writes: >> >>> Hello all: >>> >>> Is there a way to search for any two consecutive uppercase characters? In example "PÉREZ" or >>> "GONZÁLEZ" >> >> Yes, this is difficult, because of the accented letters. There is no >> [:upper:] in emacs regular expressions. It might be possible to build >> a syntax table or something to identify uppercase letters including >> accented ones, but AFAIK, there's nothing built in. The simpliest >> would be to prepare a regular expression explicitely listing all the >> characters you'd want, something like: >> >> "\\<[A-ZÀÁÂÃÄÅÆÇÈÉÊËÌÍÎÏÐÑÒÓÔÕÖØÙÚÛÜÝÞ]+\\>" >> > Isn't > C-u C-s (aka isearch-forward-regexp) > [A-ZÀÁÂÃÄÅÆÇÈÉÊËÌÍÎÏÐÑÒÓÔÕÖØÙÚÛÜÝÞ]\{2,\} > better? FSVO "better". > here \{2,\} > means two (or more) of the preceding expression. They don't mean the same. My expression means: words containing only uppercase letters. Your expression means: any occurence of two or more consecutive uppercase letters. Is 0x42AB a word? (I'd say no, it's a number in C syntax for hexadecimal). Is NeXTstep a word? (Yes, but it's not all uppercase). -- __Pascal Bourguignon__