From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Uwe Ziegenhagen Newsgroups: gmane.emacs.help Subject: Re: Desperately need help for html to LaTeX conversion Date: Mon, 15 Oct 2007 20:52:31 +0200 Message-ID: <5nhr7fFih2b3U1@mid.uni-berlin.de> References: <1192471300.380394.300040@v23g2000prn.googlegroups.com> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Trace: ger.gmane.org 1192542913 17277 80.91.229.12 (16 Oct 2007 13:55:13 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Tue, 16 Oct 2007 13:55:13 +0000 (UTC) To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Tue Oct 16 15:55:13 2007 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1Ihmsy-0004kd-JA for geh-help-gnu-emacs@m.gmane.org; Tue, 16 Oct 2007 15:55:04 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Ihmsr-0000XZ-Ms for geh-help-gnu-emacs@m.gmane.org; Tue, 16 Oct 2007 09:54:57 -0400 Original-Path: shelby.stanford.edu!headwall.stanford.edu!fu-berlin.de!uni-berlin.de!not-for-mail Original-Newsgroups: comp.text.tex, comp.lang.postscript, gnu.emacs.help, comp.lang.scheme, comp.lang.perl.misc Original-Lines: 54 Original-X-Trace: news.uni-berlin.de 7WrCK+kKputtr3Z47M8WXQym0owLyCrcatiu5RhayuHU6/2aX6wr8w User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; de; rv:1.8.1.6) Gecko/20070728 Thunderbird/2.0.0.6 Mnenhy/0.7.5.0 In-Reply-To: <1192471300.380394.300040@v23g2000prn.googlegroups.com> Original-Xref: shelby.stanford.edu comp.text.tex:360064 comp.lang.postscript:85930 gnu.emacs.help:152964 comp.lang.scheme:74480 comp.lang.perl.misc:617722 X-Mailman-Approved-At: Tue, 16 Oct 2007 09:52:11 -0400 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:48485 Archived-At: vasan999@hotmail.com schrieb: > Because my previous thread had no reply that could help > me, I start it again. > > I am very depressed now with the following problem. > > I have a set of html files, must be human readable (as some of the > output tools produce human readable LaTeX) > > that I want to convert to human readable latex. > > The program by > http://in.geocities.com/ad_rab/ > > produces human unreadable tex. and i could not see the compiled > resulting dvi. > > The gnu program produces human readable latex, but does not handle all > the tags. > > the C program by Faase ??? produces human readable latex but misses a > few tags. > > The C program is unreadable by me so I cant modify it. > > On sourceforge there are two programs one in Perl and the other in > Java. > > The perl one does not run due to path problems. > > The java one produces complicated latex, possibly human readable, but > that does not compile so I could not see its quality. > > Does someone have a program? I prefer a well documented and fully > commented program that one can modify or one that can do the job > without the need for understanding its operation. > > Please help. The situation is very desperate. > > Vasantha > > Is there a html-merging tool that can merge a set of html files into a > single html file and resolve the links and cross-references with > minimal changes to the rest of the portions of the html, ie not > scramble the rest of the file. > > I know some awk/perl/sed/python/scheme/lisp will do it. > XSLT is your friend, if the other tools don't work. Using XSLT you can easily write your own converter, given the HTML is not too messy and complicated. Uwe