From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.org!not-for-mail From: Martin Rubey Newsgroups: gmane.emacs.help Subject: Re: emacs metadata editor for (mostly) scientific pdf's Date: Wed, 16 Jan 2013 09:22:42 +0100 Organization: Linux Private Site Message-ID: References: <874niiywrj.fsf@casa.home> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Trace: ger.gmane.org 1358324722 24381 80.91.229.3 (16 Jan 2013 08:25:22 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Wed, 16 Jan 2013 08:25:22 +0000 (UTC) To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Wed Jan 16 09:25:40 2013 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane.org Original-Received: from lists.gnu.org ([208.118.235.17]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1TvOJX-0005et-PY for geh-help-gnu-emacs@m.gmane.org; Wed, 16 Jan 2013 09:25:40 +0100 Original-Received: from localhost ([::1]:46581 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1TvOJH-0003K5-8J for geh-help-gnu-emacs@m.gmane.org; Wed, 16 Jan 2013 03:25:23 -0500 Original-Path: usenet.stanford.edu!news.tele.dk!news.tele.dk!small.news.tele.dk!news-2.dfn.de!news.dfn.de!newsserver.rrzn.uni-hannover.de!.POSTED!not-for-mail Original-Newsgroups: gnu.emacs.help Original-Lines: 121 Original-NNTP-Posting-Host: ada0.ifam.uni-hannover.de Original-X-Trace: newsserver.rrzn.uni-hannover.de 1358324563 20722 130.75.17.184 (16 Jan 2013 08:22:43 GMT) Original-X-Complaints-To: usenet@newsserver.rrzn.uni-hannover.de Original-NNTP-Posting-Date: Wed, 16 Jan 2013 08:22:43 +0000 (UTC) User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.2 (gnu/linux) Cancel-Lock: sha1:63dJJIkv0kizotsD1avZOYQue0c= Original-Xref: usenet.stanford.edu gnu.emacs.help:196347 X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.help:88643 Archived-At: Daimrod writes: > Martin Rubey writes: > >> Hi there! >> >> I wonder whether anybody has tried or would like to try to adapt dired >> to manage pdf's metadata. >> >> Namely, I have a collection of files, organized in a simple directory >> structure by topic (as "algebra", "combinatorics", ...), which mostly >> works for me. >> >> I have a few tools (pdfmeat http://code.google.com/p/pdfmeat/, pdftk) >> that I may want to use on the files. >> >> The main problem is: instead of mode, owner, size, date I would rather >> like to see (and possibly edit) some fields from the file's metadata >> (eg. author, title) in addition to the filename. >> >> There is no way I could write this, but I'd be happy to fiddle around a >> little... > > Do you know any tools usable from the command line to extract this > information? as I wrote above: pdfmeat from http://code.google.com/p/pdfmeat/ pdfmeat.py --alone --inject myfile.pdf extracts some text from myfile.pdf, searches google scholar to find a match, injects it into the metadata section of myfile.pdf. Of course it makes mistakes sometimes. pdftk myfile.pdf dump_data lists just its info fields, while pdfinfo -meta myfile.pdf lists info fields (Title, Subject, ... PDF version) and XMP stream (Metadata). Example below, where pdfmeat was used to inject the info fields and the XMP stream. > Though I don't know how easy it is to customize the attributes show by > dired, I think it wouldn't be difficult to add a shortcut to display > some information about specific files in another buffer or via > `message'. Well, the main point is being able to edit at least Author, Title, Year easily. (Because pdfmeat makes mistakes) I guess the reason that pdfmeat really writes into the XMP stream is that the infofields are somewhat restricted. Therefore, it might be best to be able to connect with the bibtex-mode... Martin pdfinfo -meta Hu\,Yang\;\ 2004\;\ Some\ irreducible\ representations\ of\ Brauer\'s\ centralizer\ algebras.pdf Title: Some irreducible representations of Brauer's centralizer algebras Subject: Glasgow Mathematical Journal, 2004 Keywords: article: hu2004some Author: Hu, J.; Yang, Y. Creator: PDFMeat's bibtex2pdfmeta Producer: PDFMeat's bibtex2pdfmeta CreationDate: Wed Sep 15 16:28:29 2004 ModDate: Wed Nov 10 06:48:47 2010 Tagged: no Pages: 15 Encrypted: no Page size: 493 x 700 pts File size: 175296 bytes Optimized: no PDF version: 1.3 Metadata: Some irreducible representations of Brauer's centralizer algebras Hu, J. and Yang, Y. Glasgow Mathematical Journal 46 03 499--513 2004 Cambridge Univ Press file:///home/rubey/Books+Papers/algebra/Hu Yang Some Irreducible Representations of Brauer's Centralizer Algebras.pdf:pdf f296ecff7b3e2b6b78ca6eb57f1458eb http://journals.cambridge.org/abstract_S001708950400196X 4 13306009431956969271 Let m, n ∈ ,ގV be a 2m-dimensional complex vector space. The irreducible representations of the Brauer's centralizer algebra Bn (-2m) appearing in V (x)n are in 1-1 correspondence to the set of pairs ( f, λ), where f ∈ ޚwith 0 <= f <= [n/2], and λ n - 2f satisfying λ1 <= m. In this paper, we first show that each of these representations has a basis consists of eigenvectors for the subalgebra of Bn (-2m) generated by all the Jucys-Murphy operators, and we determine the corresponding eigenvalues. Then we identify these representations with the irreducible representations constructed from a cellular basis of Bn (-2m). Finally, an explicit description of the action of each generator of Bn (-2m) on such a basis is also given, which generalizes earlier work of [15] for Brauer's centralizer algebra Bn (m). 2000 Mathematics Subject Classification. 16G99. mathematik.uni-stuttgart.de; yahoo.com.cn timestamp: 2013-01-11 10:44:35; queries: 1; inode: 2505172 J. HuY. Yang article