From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Rustom Mody Newsgroups: gmane.emacs.help Subject: Re: Regexp: match any character including newline Date: Wed, 16 Oct 2013 08:58:25 -0700 (PDT) Message-ID: <3c91f48a-a042-4513-a058-8639870fc550@googlegroups.com> References: NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable X-Trace: ger.gmane.org 1381939254 21287 80.91.229.3 (16 Oct 2013 16:00:54 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Wed, 16 Oct 2013 16:00:54 +0000 (UTC) To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Wed Oct 16 18:00:59 2013 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1VWTWt-0005Jz-Fv for geh-help-gnu-emacs@m.gmane.org; Wed, 16 Oct 2013 18:00:59 +0200 Original-Received: from localhost ([::1]:48082 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1VWTWt-000784-5F for geh-help-gnu-emacs@m.gmane.org; Wed, 16 Oct 2013 12:00:59 -0400 X-Received: by 10.58.12.67 with SMTP id w3mr1263627veb.0.1381939106063; Wed, 16 Oct 2013 08:58:26 -0700 (PDT) X-Received: by 10.50.152.105 with SMTP id ux9mr642030igb.13.1381939105982; Wed, 16 Oct 2013 08:58:25 -0700 (PDT) Original-Path: usenet.stanford.edu!i2no15835457qav.0!news-out.google.com!9ni46435qaf.0!nntp.google.com!o2no8288813qas.0!postnews.google.com!glegroupsg2000goo.googlegroups.com!not-for-mail Original-Newsgroups: gnu.emacs.help In-Reply-To: Complaints-To: groups-abuse@google.com Injection-Info: glegroupsg2000goo.googlegroups.com; posting-host=59.95.38.174; posting-account=mBpa7woAAAAGLEWUUKpmbxm-Quu5D8ui Original-NNTP-Posting-Host: 59.95.38.174 User-Agent: G2/1.0 Injection-Date: Wed, 16 Oct 2013 15:58:26 +0000 Original-Xref: usenet.stanford.edu gnu.emacs.help:201772 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:94041 Archived-At: On Wednesday, October 16, 2013 8:12:54 PM UTC+5:30, Yuri Khan wrote: > Hello All, >=20 >=20 > I=92m doing regexp replacements on a hard-wrapped XHTML-alike. Here=92s a= n > original fragment: Regexp handling of xml is commonly a source of grief. It is usually better to use a dedicated tool like this http://www.crummy.com/software/BeautifulSoup/ or (more xmlish than htmlish) http://lxml.de/ These are python solutions. Im sure there are equivalent ones in other scri= pting languages of your choice