From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Bob Newell Newsgroups: gmane.emacs.help Subject: Re: editing a PDF [Re: emacs 30.5.0 editing epub] Date: Wed, 22 Mar 2023 08:48:01 -1000 Organization: Avi Gobbler Publishing Message-ID: <87sfdwr5b2.nntm@hhbnwe.viijngy.net> References: <877cvhqo9p.fsf@web.de> <4a7a0baf-677b-118c-fa6c-e50d054800e7@posteo.de> <87o7osp4ck.fsf@web.de> <704a63ef-c56d-f892-1e3f-9ee0f884b038@mousecar.com> <87y1nsmgwz.fsf@web.de> <0e6c9b8a-b70f-d244-c031-68c0c58dca86@mousecar.com> <87v8iur4lp.fsf@dataswamp.org> Mime-Version: 1.0 Content-Type: text/plain Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="34913"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Safari/5.5 To: help-gnu-emacs@gnu.org Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane-mx.org@gnu.org Wed Mar 22 19:48:52 2023 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1pf3W0-0008of-LE for geh-help-gnu-emacs@m.gmane-mx.org; Wed, 22 Mar 2023 19:48:52 +0100 Original-Received: from localhost ([::1] helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1pf3VJ-0008JK-OF; Wed, 22 Mar 2023 14:48:09 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1pf3VH-0008Iq-AL for help-gnu-emacs@gnu.org; Wed, 22 Mar 2023 14:48:07 -0400 Original-Received: from mail-pl1-x62b.google.com ([2607:f8b0:4864:20::62b]) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1pf3VF-0004Bj-61 for help-gnu-emacs@gnu.org; Wed, 22 Mar 2023 14:48:07 -0400 Original-Received: by mail-pl1-x62b.google.com with SMTP id kq3so7734939plb.13 for ; Wed, 22 Mar 2023 11:48:04 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=bobnewell-net.20210112.gappssmtp.com; s=20210112; t=1679510883; h=mime-version:user-agent:message-id:in-reply-to:date:references :organization:subject:to:from:from:to:cc:subject:date:message-id :reply-to; bh=PzwP6Rxu8vh6w87MvSeBi2OtaWVKcbvR7raO6CT+6T4=; b=p4bfPRRHUgVYFjIijD3wx3v2VDPOxK/gJPpXPaYTQazVENy9+okx70YFZYtMuzc4jZ INlMGLTWZSdq3qessNv5eP3yN/I8+HEwqj7OUPc+kIdCnqYgoC9UvwsaMqnIcdju+PwO yNi3pQFfhyw7plvgvkR6P6RTkWVs6tHpjhIXTnS9UE36hDKr1a3jMbXyGqJvpsrEiQ/4 aGA74ssV5TPpled2+dIx5QuvHZG/gRjRh9kJym/4SX/FHEcKtIG5LEIY0rxfjIRuz8gs j09Z2OjEyx1LNwD3fM13Srnact5O8CpaScqF4iTMNtAfjXjjiuEyYywEqJUiqhmWc7gs lqLw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; t=1679510883; h=mime-version:user-agent:message-id:in-reply-to:date:references :organization:subject:to:from:x-gm-message-state:from:to:cc:subject :date:message-id:reply-to; bh=PzwP6Rxu8vh6w87MvSeBi2OtaWVKcbvR7raO6CT+6T4=; b=0L8/fwkyWuqLOVkCVa+ipqNT8Ph743S/j6MAsAq4YsjHOwKtsto7CCo0Y24wleXVuK NuJV6V8JBjf4z7IxhHAXi97L61RgUQzxAhr95dYYKH7c4Ju/lTIyOFU36/qVodnio2aO Cz+q2T7YI2XIWBzQk+TR0BUZZ4elbS1X4/DTlsWx+pytuBnnCHlWpxSthmPXLBgEE7Yy nCo8KUJBWAmU1vZge7H+s+EtPUlsS2mB3bx6hRR8PdGqAVHQ4+8mjBtKLHVdssG2gZ7x tIurLRNn2v8ZB8Bw4DeQVkY8ASjyqnIOwIHdwsDdTW3/7ace6zacSi7JImK3K3vf0/n9 iOLQ== X-Gm-Message-State: AO0yUKW9z2ZBGuElZCcJgCLHTCg+nkLNVKdnI46rlHjGBr7jmqrKPqW3 1+dAdPHarqgbG6lEZDqQJ5lfuq1xj/EacHGEnKw= X-Google-Smtp-Source: AK7set9IOq48dh9E4fFYGUpeU6rS2VPFVATVLx57yi0Lx2BRF3OXu+AqE17olSf9ak8qHM4kenn3KQ== X-Received: by 2002:a17:903:1208:b0:1a0:6a47:184e with SMTP id l8-20020a170903120800b001a06a47184emr4047518plh.42.1679510882919; Wed, 22 Mar 2023 11:48:02 -0700 (PDT) Original-Received: from localhost (dhcp-141-239-253-106.hawaiiantel.net. [141.239.253.106]) by smtp.gmail.com with ESMTPSA id a5-20020a1709027d8500b001a1c2ee06e0sm7623833plm.15.2023.03.22.11.48.02 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 22 Mar 2023 11:48:02 -0700 (PDT) In-Reply-To: (Yuri Khan's message of "Wed, 22 Mar 2023 23:32:26 +0700") Received-SPF: none client-ip=2607:f8b0:4864:20::62b; envelope-from=bobnewell@bobnewell.net; helo=mail-pl1-x62b.google.com X-Spam_score_int: -18 X-Spam_score: -1.9 X-Spam_bar: - X-Spam_report: (-1.9 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_NONE=0.001 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane-mx.org@gnu.org Original-Sender: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane-mx.org@gnu.org Xref: news.gmane.io gmane.emacs.help:143087 Archived-At: I get sent PDFs all the time and it's often frustrating. However most of the time I need the text, not necessarily the formatting. For that, there are various PDF to text converters which work to some degree (not so great with multi-column text for the most part). I also found a PDF to spreadsheet converter which isn't especially great but better than nothing. Well, at least a little better. I realize this doesn't at all address the original question about editing PDFs in the true sense of editing. But in a lot of cases text extraction may be good enough, depending on your purpose. Copy/paste works in many cases as well. For instance I just got sent a PDF of a budget spreadsheet, and with not so much effort was able to make it into an org-mode table. All bets are off if the PDF is an /image/ of some text. Then you've got to get out your friendly OCR software. In such cases I write back to the sender and politely ask for something easier to use. -- Bob Newell Honolulu, Hawai`i - Via GNU/Linux/Emacs/Gnus/BBDB