all messages for Emacs-related lists mirrored at yhetil.org
 help / color / mirror / code / Atom feed
From: Martin Rubey <axiomize@yahoo.de>
To: help-gnu-emacs@gnu.org
Subject: Re: emacs metadata editor for (mostly) scientific pdf's
Date: Wed, 16 Jan 2013 09:22:42 +0100	[thread overview]
Message-ID: <d94nihy1t9.fsf@ada0.ifam.uni-hannover.de> (raw)
In-Reply-To: 874niiywrj.fsf@casa.home

Daimrod <daimrod@gmail.com> writes:

> Martin Rubey <axiomize@yahoo.de> writes:
>
>> Hi there!
>>
>> I wonder whether anybody has tried or would like to try to adapt dired
>> to manage pdf's metadata.
>>
>> Namely, I have a collection of files, organized in a simple directory
>> structure by topic (as "algebra", "combinatorics", ...), which mostly
>> works for me.
>>
>> I have a few tools (pdfmeat http://code.google.com/p/pdfmeat/, pdftk)
>> that I may want to use on the files.
>>
>> The main problem is: instead of mode, owner, size, date I would rather
>> like to see (and possibly edit) some fields from the file's metadata
>> (eg. author, title) in addition to the filename.
>>
>> There is no way I could write this, but I'd be happy to fiddle around a
>> little...
>
> Do you know any tools usable from the command line to extract this
> information?

as I wrote above: pdfmeat from http://code.google.com/p/pdfmeat/

    pdfmeat.py --alone --inject myfile.pdf

extracts some text from myfile.pdf, searches google scholar to find a
match, injects it into the metadata section of myfile.pdf.  Of course it
makes mistakes sometimes.

    pdftk myfile.pdf dump_data

lists just its info fields, while

    pdfinfo -meta myfile.pdf

lists info fields (Title, Subject, ... PDF version) and XMP stream
(Metadata).  Example below, where pdfmeat was used to inject the info
fields and the XMP stream.

> Though I don't know how easy it is to customize the attributes show by
> dired, I think it wouldn't be difficult to add a shortcut to display
> some information about specific files in another buffer or via
> `message'.

Well, the main point is being able to edit at least Author, Title, Year
easily.  (Because pdfmeat makes mistakes) I guess the reason that
pdfmeat really writes into the XMP stream is that the infofields are
somewhat restricted.  Therefore, it might be best to be able to connect
with the bibtex-mode...

Martin

pdfinfo -meta Hu\,Yang\;\ 2004\;\ Some\ irreducible\ representations\ of\ Brauer\'s\ centralizer\ algebras.pdf 
Title:          Some irreducible representations of Brauer's centralizer
algebras
Subject:        Glasgow Mathematical Journal, 2004
Keywords:       article: hu2004some
Author:         Hu, J.; Yang, Y.
Creator:        PDFMeat's bibtex2pdfmeta
Producer:       PDFMeat's bibtex2pdfmeta
CreationDate:   Wed Sep 15 16:28:29 2004
ModDate:        Wed Nov 10 06:48:47 2010
Tagged:         no
Pages:          15
Encrypted:      no
Page size:      493 x 700 pts
File size:      175296 bytes
Optimized:      no
PDF version:    1.3
Metadata:
<?xml version='1.0' encoding='UTF-8'?>
<rdf:Description xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:bibtex="http://jabref.sourceforge.net/bibteXMP/"
    bibtex:bibtexkey="hu2004some"
    bibtex:owner=""
    bibtex:timestamp="2013-01-11 10:44:35">
       <bibtex:title>Some irreducible representations of Brauer's
       centralizer algebras</bibtex:title>
<bibtex:author>Hu, J. and Yang, Y.</bibtex:author>
<bibtex:journal>Glasgow Mathematical Journal</bibtex:journal>
<bibtex:volume>46</bibtex:volume>
<bibtex:number>03</bibtex:number>
<bibtex:pages>499--513</bibtex:pages>
<bibtex:year>2004</bibtex:year>
<bibtex:publisher>Cambridge Univ Press</bibtex:publisher>
<bibtex:file>file:///home/rubey/Books+Papers/algebra/Hu Yang Some
Irreducible Representations of Brauer's Centralizer
Algebras.pdf:pdf</bibtex:file>
<bibtex:md5sum>f296ecff7b3e2b6b78ca6eb57f1458eb</bibtex:md5sum>
<bibtex:url>http://journals.cambridge.org/abstract_S001708950400196X</bibtex:url>
<bibtex:citations>4</bibtex:citations>
<bibtex:citedbyid>13306009431956969271</bibtex:citedbyid>
<bibtex:abstract>Let m, n ∈ ,ގV be a 2m-dimensional complex vector
space. The irreducible representations of the Brauer's centralizer
algebra Bn (-2m) appearing in V (x)n are in 1-1 correspondence to the
set of pairs ( f, λ), where f ∈ ޚwith 0 &lt;= f &lt;= [n/2], and λ n -
2f satisfying λ1 &lt;= m. In this paper, we first show that each of
these representations has a basis consists of eigenvectors for the
subalgebra of Bn (-2m) generated by all the Jucys-Murphy operators, and
we determine the corresponding eigenvalues. Then we identify these
representations with the irreducible representations constructed from a
cellular basis of Bn (-2m). Finally, an explicit description of the
action of each generator of Bn (-2m) on such a basis is also given,
which generalizes earlier work of [15] for Brauer's centralizer algebra
Bn (m). 2000 Mathematics Subject
Classification. 16G99.</bibtex:abstract>
<bibtex:mailhosts>mathematik.uni-stuttgart.de;
yahoo.com.cn</bibtex:mailhosts>
<bibtex:pdfmeat>timestamp: 2013-01-11 10:44:35; queries: 1; inode:
2505172</bibtex:pdfmeat>

        <bibtex:author>
            <rdf:Seq><rdf:li>J. Hu</rdf:li><rdf:li>Y. Yang</rdf:li></rdf:Seq>
        </bibtex:author>
    <bibtex:entrytype>article</bibtex:entrytype>
</rdf:Description>


  reply	other threads:[~2013-01-16  8:22 UTC|newest]

Thread overview: 6+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-01-15  7:55 emacs metadata editor for (mostly) scientific pdf's Martin Rubey
2013-01-15 21:14 ` Daimrod
2013-01-16  8:22   ` Martin Rubey [this message]
2013-01-16 10:38     ` Thien-Thi Nguyen
     [not found]     ` <mailman.17600.1358332613.855.help-gnu-emacs@gnu.org>
2013-01-16 10:46       ` Martin Rubey
2013-01-16 11:10         ` Rainer M Krug

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=d94nihy1t9.fsf@ada0.ifam.uni-hannover.de \
    --to=axiomize@yahoo.de \
    --cc=help-gnu-emacs@gnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this external index

	https://git.savannah.gnu.org/cgit/emacs.git
	https://git.savannah.gnu.org/cgit/emacs/org-mode.git

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.