From: Nikolay Kudryavtsev
Newsgroups: gmane.emacs.help
Subject: Re: the «inverse» function of join-lines
Date: Tue, 24 Oct 2023 15:50:19 +0300
To: help-gnu-emacs@gnu.org
In-Reply-To: <87sf602uwq.fsf@vagabond.tim-landscheidt.de>

Incidentally, I have a somewhat related question that I've been wondering about recently. I have a massive txt ebook that I've been slowly re-editing into an org file, for easier navigation and textual analysis. It is also fill-paragraphed.

I've been considering removing all the superfluous line breaks. Mechanically it's quite doable: you check for sentence-ending punctuation at each line ending, and if there isn't any, you remove the line break. There would of course be some false positives...

This seems like a fairly common editing task, so I've been wondering whether someone already has working code for it; maybe there are some gotchas that I haven't thought about.
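
The heuristic described above can be sketched roughly as follows. This is only an illustration in Python, not code from the thread and not an Emacs command: the function name unwrap_lines, the set of sentence-ending characters, and the rule that blank lines and org headings are never joined are all assumptions made for the example.

```python
import re

# Assumption: these characters mark the end of a sentence.
SENTENCE_END = (".", "!", "?", "…")
# Assumption: trailing whitespace and closing quotes/brackets are ignored
# when looking for the sentence-ending character.
CLOSERS = " \t\"')]»"
# Assumption: an org heading is one or more stars followed by a space.
HEADING = re.compile(r"\*+ ")


def ends_sentence(line):
    """True if the line ends with sentence-ending punctuation."""
    return line.rstrip(CLOSERS).endswith(SENTENCE_END)


def unwrap_lines(text):
    """Remove line breaks that do not follow sentence-ending punctuation.

    Blank lines (paragraph separators) and org headings are left alone.
    """
    out = []
    for line in text.split("\n"):
        prev = out[-1] if out else ""
        joinable = (
            prev.strip() and line.strip()          # neither line is blank
            and not ends_sentence(prev)            # previous line stops mid-sentence
            and not HEADING.match(prev)            # never join a heading...
            and not HEADING.match(line)            # ...or join into one
        )
        if joinable:
            out[-1] = prev.rstrip() + " " + line.strip()
        else:
            out.append(line)
    return "\n".join(out)


sample = "* Chapter 1\nThis is a long\nsentence that wraps.\n\nNext paragraph\ncontinues here."
print(unwrap_lines(sample))
```

Two gotchas show up even in this small version: a line that happens to end with an abbreviation such as "Mr." keeps its break, and a break after a complete sentence in the middle of a paragraph is kept as well, so paragraphs are only partly unwrapped.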