From mboxrd@z Thu Jan 1 00:00:00 1970
Subject: [bug#61851] [PATCH] gnu: tesseract-ocr-tessdata-fast: Install tesseract config files.
From: Simon South
To: jlicht@fsfe.org
Cc: 61851@debbugs.gnu.org
Date: Mon, 27 Feb 2023 17:43:43 -0500
In-Reply-To: (jlicht@fsfe.org's message of "Mon, 27 Feb 2023 21:55:16 +0100")
Message-ID: <878rgik9uo.fsf@simonsouth.net>
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.2 (gnu/linux)
MIME-Version: 1.0
Content-Type: text/plain

Jelle,

Respectfully, and speaking only as an interested observer, I think this
may not be the right fix.

Guix's Tesseract is indeed missing its config files, causing (among
other things) the examples in the online documentation[0] to not work,
e.g.:

  ssouth@hamlet ~/tesseract-ocr-test [env]$ tesseract images/eurotext.png - -l eng hocr
  read_params_file: Can't open hocr
  The (quick) [brown] {fox} jumps!
  Over the $43,456.78 #90 dog
  (...)

But the root issue appears to be a misconfiguration of the
TESSDATA_PREFIX search path in the tessdata-ocr package, which causes
Tesseract's own config files to be installed in a folder other than the
one it's configured to search.

Fixing this places Tesseract's config files and the trained-data files
together beneath /usr/share/tessdata, allowing Tesseract to work as
expected:

  ssouth@hamlet ~/tesseract-ocr-test [env]$ tesseract images/eurotext.png - -l eng hocr