From mboxrd@z Thu Jan 1 00:00:00 1970 From: Marvin Doyley Subject: Extracting pdf metadata Date: Thu, 24 Mar 2011 21:21:43 -0400 Message-ID: Mime-Version: 1.0 Content-Type: multipart/alternative; boundary=20cf3054ac83b6e5b4049f446ad4 Return-path: Received: from [140.186.70.92] (port=52747 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Q2vil-0005C8-Db for emacs-orgmode@gnu.org; Thu, 24 Mar 2011 21:21:48 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q2vij-0007br-UQ for emacs-orgmode@gnu.org; Thu, 24 Mar 2011 21:21:47 -0400 Received: from mail-fx0-f52.google.com ([209.85.161.52]:56798) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q2vij-0007bX-Q5 for emacs-orgmode@gnu.org; Thu, 24 Mar 2011 21:21:45 -0400 Received: by fxm6 with SMTP id 6so864771fxm.39 for ; Thu, 24 Mar 2011 18:21:44 -0700 (PDT) List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: emacs-orgmode@gnu.org --20cf3054ac83b6e5b4049f446ad4 Content-Type: text/plain; charset=ISO-8859-1 Hi there, Does anybody have a lisp code that can extract metadata from pdf. There is an interesting program called sciplpore ( http://www.sciplore.org/software/sciplore_mindmapping/ that does this for freemind), it might be useful if were able to do the same with org (i.e., important pdf meta data, bookmark and stickies directly into org). Cheers M PS I think one of my goals this summer will be to learn lisp :) --20cf3054ac83b6e5b4049f446ad4 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Hi there,

Does anybody have a lisp code that can extract metadata fr= om pdf. There is an interesting program called sciplpore (http://www.sciplore.org/s= oftware/sciplore_mindmapping/ that does this for freemind), it might be= useful if were able to do the same with org (i.e., important pdf meta data= , bookmark and stickies directly into org).


Cheers

M

PS I think one of my goals this summer will = be to learn lisp :)
--20cf3054ac83b6e5b4049f446ad4-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: John Hendy Subject: Re: Extracting pdf metadata Date: Thu, 24 Mar 2011 21:08:27 -0500 Message-ID: References: Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Return-path: Received: from [140.186.70.92] (port=57147 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Q2wRy-0007gW-QA for emacs-orgmode@gnu.org; Thu, 24 Mar 2011 22:08:31 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q2wRw-0006bh-Gy for emacs-orgmode@gnu.org; Thu, 24 Mar 2011 22:08:30 -0400 Received: from mail-bw0-f41.google.com ([209.85.214.41]:38387) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q2wRw-0006bQ-88 for emacs-orgmode@gnu.org; Thu, 24 Mar 2011 22:08:28 -0400 Received: by bwz17 with SMTP id 17so718163bwz.0 for ; Thu, 24 Mar 2011 19:08:27 -0700 (PDT) In-Reply-To: List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Marvin Doyley Cc: emacs-orgmode@gnu.org On Thu, Mar 24, 2011 at 8:21 PM, Marvin Doyley wrote: > Hi there, > > Does anybody have a lisp code that can extract metadata from pdf. There is > an interesting program called sciplpore > (http://www.sciplore.org/software/sciplore_mindmapping/ that does this for > freemind), it might be useful if were able to do the same with org (i.e., > important pdf meta data, bookmark and stickies directly into org). > Not that this what you asked for, but there's a small python application called stapler that can extract metadata. At the least, maybe somehow it could be useful to look at the code? Then again, it's built on a python library... so maybe there's nothing that will really translate to elisp. I just ran into it as an alternative to pdftk and thus it was fresh in my mind. At github: https://github.com/fwenzel/stapler Pertinent output from help: ,--- | $ stapler --help | ... | info: ... (no output needed) | Display PDF metadata | ... `--- Best regards, John > > Cheers > > M > > PS I think one of my goals this summer will be to learn lisp :) > From mboxrd@z Thu Jan 1 00:00:00 1970 From: Rainer M Krug Subject: Re: Extracting pdf metadata Date: Fri, 25 Mar 2011 09:34:10 +0100 Message-ID: <4D8C5382.3070809@gmail.com> References: Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit Return-path: Received: from [140.186.70.92] (port=39410 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Q32TL-0000ix-8w for emacs-orgmode@gnu.org; Fri, 25 Mar 2011 04:34:20 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q32TG-0003Dl-6C for emacs-orgmode@gnu.org; Fri, 25 Mar 2011 04:34:15 -0400 Received: from mail-ww0-f49.google.com ([74.125.82.49]:51649) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q32TG-0003Dc-1k for emacs-orgmode@gnu.org; Fri, 25 Mar 2011 04:34:14 -0400 Received: by wwb39 with SMTP id 39so357842wwb.30 for ; Fri, 25 Mar 2011 01:34:13 -0700 (PDT) In-Reply-To: List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Marvin Doyley Cc: emacs-orgmode@gnu.org -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On 25/03/11 02:21, Marvin Doyley wrote: > Hi there, > > Does anybody have a lisp code that can extract metadata from pdf. There > is an interesting program called sciplpore > (http://www.sciplore.org/software/sciplore_mindmapping/ that does this > for freemind), it might be useful if were able to do the same with org > (i.e., important pdf meta data, bookmark and stickies directly into org). As far as I remember, sciplore is not only extracting metadata embedded in th pdf, but also from the text - they submit it to a server which uses the academic article and compares it to layouts from different publishers and uses those to extract bibliographic information from the text. If that is what you want, then it might be considerably more difficult then just extracting embedded metadata. Cheers, Rainer > > > Cheers > > M > > PS I think one of my goals this summer will be to learn lisp :) - -- Rainer M. Krug, PhD (Conservation Ecology, SUN), MSc (Conservation Biology, UCT), Dipl. Phys. (Germany) Centre of Excellence for Invasion Biology Natural Sciences Building Office Suite 2039 Stellenbosch University Main Campus, Merriman Avenue Stellenbosch South Africa Tel: +33 - (0)9 53 10 27 44 Cell: +27 - (0)8 39 47 90 42 Fax (SA): +27 - (0)8 65 16 27 82 Fax (D) : +49 - (0)3 21 21 25 22 44 Fax (FR): +33 - (0)9 58 10 27 44 email: Rainer@krugs.de Skype: RMkrug -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.10 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/ iEYEARECAAYFAk2MU4IACgkQoYgNqgF2egqKNgCdH5J+8IOb8Sz5jjultIDXI/yU noUAnA++JSXpB7zMaY/bdNOWG8PppXGF =Fl62 -----END PGP SIGNATURE-----