From mboxrd@z Thu Jan 1 00:00:00 1970
Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail
From: dalanicolai
Newsgroups: gmane.emacs.devel
Subject: Re: Multi image PDF continuous mode
Date: Sat, 8 Jan 2022 08:18:27 +0100
References: <86lf0tq5zy.fsf@mail.linkov.net> <87ilvxinbn.fsf@logand.com> <86pmq5lf2o.fsf@mail.linkov.net>
Mime-Version: 1.0
Content-Type: multipart/alternative; boundary="000000000000946da305d50ce90f"
Cc: Tomas Hlavaty, Emacs Devel
To: Juri Linkov
List-Id: "Emacs development discussions."
Xref: news.gmane.io gmane.emacs.devel:284448

--000000000000946da305d50ce90f
Content-Type: text/plain; charset="UTF-8"

So the idea is that the bookroll can also be used for doc-view (and possibly
djvu.el). The code for the new server is also 'crude' code, but it works fine
here (I am not a professional programmer, so I appreciate feedback on
anything).

On Sat, 8 Jan 2022 at 08:11, dalanicolai wrote:

> After reading this thread, and especially after reading this message by Eli
> (if the links don't work, I have also attached them at the end), I have
> looked into this again. I would like to share my perspective on this, but
> before I do that, let me first ask whether you have any idea why pdf-tools
> is not part of Emacs. It is a very high-quality package that already loads
> the images lazily and provides many sophisticated features.
>
> You might already be familiar with my 2-buffer continuous scroll hack for it
> (which can be easily ported to doc-view mode, but 'developing' a 'real'
> continuous scroll mode solution makes much more sense, and, if possible, to
> me it would make even more sense to try to get pdf-tools into Emacs).
>
> Some interesting related announcements:
>
> - I have written an alternative server for pdf-tools, which uses the pymupdf
>   library. You can read more about it in the PR. A nice thing about its
>   design is that it uses the Python interpreter/REPL directly as a server
>   (it reads messages and prints to standard output). It already
>   extends/complements the features of the original epdfinfo server by
>   supporting line/arrow annotations and supporting the EPUB format very
>   nicely.
>   Additionally, it enables non-C programmers to extend its features (e.g.
>   support for freetext annotations should take only a little work).
>
> - I have created a very crude, but already nicely working, 'real' continuous
>   scroll proof of concept; if you are interested, you can find the code in
>   this commit. It currently uses a trick where it only uses two images at a
>   time, but as I will describe now, I think it will be better to create a
>   'bookroll' package for it.
>
> The 'continuous' rendering is only part of implementing continuous scroll in
> pdf-tools, because it would also be nice if it could be made compatible with
> all of pdf-tools' features. After investigating how to achieve that, I have
> come to the conclusion that we need a 'bookroll' alternative to 'image-mode',
> because the main obstacle is that pdf-tools now uses image-mode functions
> that are written for only a single overlay per buffer.
>
> I think the bookroll should not be so difficult to implement (I first started
> to think about a general 'image-roll', but I think continuous scrolling is
> generally not what you want for viewing/scrolling images, so it can be just a
> dedicated bookroll). My current idea for how to implement it is to
> immediately create overlays for all pages in a single buffer and fill them
> with 'empty' SVG images of the correct size (after testing this with a
> thousand 'placeholders', it seems that the 'empty' images use almost no
> memory). Scrolling can then be implemented by changing the display properties
> (from empty SVG to real image, and back) and jumping to the correct positions
> using `set-window-vscroll`. I have started writing bookroll.el; of course,
> your joint efforts or feedback would be very much appreciated. Otherwise,
> this short message serves just to inform you about these activities.
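
As a rough illustration of the bookroll idea described above (not code from
bookroll.el), the following Python sketch models only the bookkeeping: given
the page heights, it computes which pages intersect the window at a given
vertical scroll offset, and which pages should switch between a placeholder
and a real image. All function names are hypothetical; an actual
implementation would be written in Emacs Lisp with overlays, their display
properties, and `set-window-vscroll`.

```python
from bisect import bisect_right
from itertools import accumulate


def page_offsets(page_heights):
    """Return the top pixel offset of each page when pages are stacked vertically."""
    return list(accumulate(page_heights, initial=0))[:-1]


def visible_pages(page_heights, vscroll, window_height):
    """Return the indices of pages that intersect the window at offset vscroll.

    Scrolling past the end of the document is treated as showing the last page.
    """
    offsets = page_offsets(page_heights)
    if not offsets:
        return []
    # Last page whose top edge is at or above the top of the window.
    first = max(bisect_right(offsets, vscroll) - 1, 0)
    bottom = vscroll + window_height
    last = first
    # Extend downwards while the next page starts above the window bottom.
    while last + 1 < len(offsets) and offsets[last + 1] < bottom:
        last += 1
    return list(range(first, last + 1))


def update_display(displayed, page_heights, vscroll, window_height):
    """Return (to_render, to_clear) for the new scroll position.

    to_render: pages whose placeholder should be replaced by a real image.
    to_clear:  pages whose real image should be swapped back to a placeholder.
    """
    wanted = set(visible_pages(page_heights, vscroll, window_height))
    to_render = sorted(wanted - set(displayed))
    to_clear = sorted(set(displayed) - wanted)
    return to_render, to_clear
```

For example, with three pages of 1000 pixels each and a window 800 pixels
high, scrolling to offset 900 makes pages 0 and 1 visible, so only those two
pages would need real images; every other page keeps its placeholder.
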
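
The server design mentioned in the first announcement above (a Python process
that reads messages and prints replies to standard output) could look roughly
like the sketch below. The JSON-lines message format and the `open` and
`current-file` commands are assumptions made purely for illustration; the
actual protocol is not described in this message, and the sketch does not use
pymupdf.

```python
import json
import sys


def handle(request, state):
    """Dispatch one request (hypothetical format) and return a response dict."""
    if not isinstance(request, dict):
        return {"ok": False, "error": "request must be a JSON object"}
    command = request.get("cmd")
    if command == "open":
        # Only remember the file name; a real server would open the document.
        state["file"] = request["file"]
        return {"ok": True}
    if command == "current-file":
        return {"ok": True, "file": state.get("file")}
    return {"ok": False, "error": f"unknown command: {command}"}


def serve(infile=sys.stdin, outfile=sys.stdout):
    """Read one JSON message per line and print one JSON response per line."""
    state = {}
    for line in infile:
        line = line.strip()
        if not line:
            continue
        try:
            response = handle(json.loads(line), state)
        except (ValueError, KeyError) as err:
            # Malformed JSON or a missing field: report it instead of exiting.
            response = {"ok": False, "error": str(err)}
        # Flush so that the client (e.g. Emacs) sees each reply immediately.
        print(json.dumps(response), file=outfile, flush=True)
```

Calling `serve()` with no arguments would run the loop on standard input and
output, so a client only needs to start the process, write one line per
request, and read one line per reply.
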
>
> Finally, it would be great if you could share your 'knowledge' about why
> pdf-tools is not in Emacs. Its main developer has stepped down from the
> project, and 'vedang' has taken the role of new maintainer. Anyway, I guess
> it would be best to ask the former and current maintainers as soon as
> possible for their 'opinions' and about signing the necessary papers.
>
> https://lists.gnu.org/archive/html/emacs-devel/2018-04/msg00087.html
> https://github.com/dalanicolai/pdf-continuous-scroll-mode.el
> https://pymupdf.readthedocs.io/en/latest/
> https://github.com/vedang/pdf-tools/pull/61
> https://github.com/dalanicolai/pdf-tools/commit/b76a6337c39f114aa668e9f1985bfdfd87bd857d
>
> On Thu, 9 Dec 2021 at 21:06, Juri Linkov wrote:
>
>> >> After doc-view generates a gallery of PDF images, image-dired could be
>> >> invoked on the output directory of PNG images, and indeed in this case
>> >> the window layout of image-dired looks like what most PDF viewers do:
>> >> on the left side there is a narrow window with thumbnails of PDF
>> >> pages, and on the right side a larger window with PDF pages.
>> >
>> > A scalable document viewer should generate the images lazily.
>>
>> Indeed, for long documents we will need more optimization:
>> not to load all images at once, but only after going to the next page.
>> At the beginning each page could have a placeholder, either just
>> newlines or an empty image. Then, on navigating to a page,
>> it will attach the image file with the display property.
>> Also, only visited pages should update their images when
>> the user zooms or changes other parameters, etc.

--000000000000946da305d50ce90f
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
--000000000000946da305d50ce90f--