From: Lynn Winebarger
Newsgroups: gmane.emacs.devel
Subject: Re: Grammar checking
Date: Sun, 23 Apr 2023 10:14:36 -0400
To: rms@gnu.org
Cc: philipk@posteo.net, emacs-devel@gnu.org

On Fri, Apr 21, 2023 at 10:22 PM Richard Stallman wrote:
> > There's this directory in languagetool's source code that appears to
> > provide infrastructure for using some RPC protocol for "MLServer"s:
> > https://github.com/languagetool-org/languagetool/tree/master/languagetool-core/src/main/java/org/languagetool/rules/ml.
>
> If we want to use the free version of Languagetool, this raises
> important questions. First of all, does the program as currently
> released try by default to use that remote protocol?
>
[From subsequent amending email]
> > Secondarily, how much of the usefulness of that program as
> > currently released is due to using that dis-service? And how
> > would we modify the release to turn that off?
>
> If the program as currently released does NOT try by default to use
> that protocol, is there something that they invite users to do so as
> to start using it? We might want to delete that feature from the version
> we distribute, so we don't end up distributing SaaSS unknowingly.
>

The only concrete description I recall seeing (from an Emacs user) is
from Ihor Radchenko in
https://lists.gnu.org/archive/html/emacs-devel/2023-03/msg01044.html

The design of the software reflects its intended use in a multi-user
SaaSS server.
If Emacs maintainer time is going to be spent ensuring that an
acceptable version and configuration is installed on user machines, I
hope that maintainer would also consider whether the bulk of the
functionality could be implemented incrementally in a simpler,
single-user form, say by an Emacs Lisp program running in a batch
Emacs process.

https://dev.languagetool.org/languages classifies the rules enforced
by LanguageTool into XML, Java-based, spell-checking, and confusion
pairs. There's a separate 8GB of n-gram data, covering the languages
supported by the languagetool organization.

So, how much work would be required to make meaningful use of the
confusion pairs, and what would that provide? Or to interpret and
apply the XML rules? Or to make more efficient use of the 8GB of
n-gram data
(https://dev.languagetool.org/finding-errors-using-n-gram-data)?

Maybe the effort wouldn't be worth it, but maybe it would. I don't
think it's obvious either way.

Lynn
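
To make the confusion-pair question a bit more concrete, here is a
rough sketch of the general idea, written in Python only for brevity.
Everything in it is an assumption for illustration: the pair table,
the trigram counts, the name `suggest` and the `factor` threshold are
all made up, and this is not LanguageTool's actual algorithm or data
format. It just shows a word being flagged when its confusable
alternative is much more frequent in the same three-word context.

```python
from collections import Counter

# Hypothetical toy data. A real checker would load confusion pairs and
# n-gram counts from data files; their formats are not modelled here.
CONFUSION_PAIRS = {"their": "there", "there": "their"}
TRIGRAM_COUNTS = Counter({
    ("over", "there", "is"): 120,
    ("over", "their", "is"): 1,
    ("in", "their", "house"): 90,
    ("in", "there", "house"): 2,
})


def suggest(tokens, factor=10):
    """Return (index, word, alternative) for each word whose confusable
    alternative is at least `factor` times likelier in its trigram."""
    hits = []
    # Only words with a neighbour on both sides are checked.
    for i in range(1, len(tokens) - 1):
        word = tokens[i]
        alt = CONFUSION_PAIRS.get(word)
        if alt is None:
            continue
        seen = TRIGRAM_COUNTS[(tokens[i - 1], word, tokens[i + 1])]
        other = TRIGRAM_COUNTS[(tokens[i - 1], alt, tokens[i + 1])]
        # Add-one smoothing keeps unseen trigrams from dividing by zero.
        if (other + 1) / (seen + 1) >= factor:
            hits.append((i, word, alt))
    return hits


if __name__ == "__main__":
    print(suggest("it is over their is".split()))
    print(suggest("they live in their house now".split()))
```

With the toy counts above, the first sentence gets "their" flagged in
favour of "there" and the second is left alone. The lookup itself is
trivial; the open question is how to store and query the real n-gram
data efficiently from a single-user process.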