From mboxrd@z Thu Jan 1 00:00:00 1970 Path: news.gmane.io!.POSTED.ciao.gmane.io!not-for-mail From: Jean-Christophe Helary Newsgroups: gmane.emacs.help Subject: Re: Emacs as a translator's tool Date: Wed, 10 Jun 2020 22:39:37 +0900 Message-ID: <9F4D5274-B074-4A3F-91F1-ED4835F687BE@traduction-libre.org> References: <871rn35lqc.fsf@mbork.pl> <87h7vz2m5g.fsf@ebih.ebihd> <87d06k4rmg.fsf@mbork.pl> <87eeqzmanl.fsf@ebih.ebihd> <877dwmoboq.fsf@mbork.pl> <87bllypckg.fsf@ebih.ebihd> <87tuzpmnuo.fsf@mbork.pl> <87bllu4lx0.fsf@ebih.ebihd> <87blluxfcq.fsf@mbork.pl> <1rmqrrvn.fsf@ebih.ebihd> <87o8ptydil.fsf@ebih.ebihd> <87ftb5ycrz.fsf@ebih.ebihd> <878sgwkkn8.fsf@ebih.ebihd> <0631E9A3-8E0C-4DF6-B8F3-84A1109C1D59@traduction-libre.org> <87imfzn1g2.fsf@ebih.ebihd> <15BD6D26-87A5-4E5F-9271-273495A647EC@traduction-libre.org> <87mu5bmviz.fsf@ebih.ebihd> <4F5ED686-7444-441C-B547-49DB18175F17@traduction-libre.org> <87imfzmucg.fsf@ebih.ebihd> <2F52BF05-8427-4228-B691-6C3926B4A40D@traduction-libre.org> <87pna73zuf.fsf@ebih.ebihd> Mime-Version: 1.0 (Mac OS X Mail 13.4 \(3608.80.23.2.2\)) Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Injection-Info: ciao.gmane.io; posting-host="ciao.gmane.io:159.69.161.202"; logging-data="67515"; mail-complaints-to="usenet@ciao.gmane.io" Cc: help-gnu-emacs@gnu.org To: Emanuel Berg Original-X-From: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane-mx.org@gnu.org Wed Jun 10 15:50:21 2020 Return-path: Envelope-to: geh-help-gnu-emacs@m.gmane-mx.org Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1jj17S-000HPf-Gm for geh-help-gnu-emacs@m.gmane-mx.org; Wed, 10 Jun 2020 15:50:18 +0200 Original-Received: from localhost ([::1]:36430 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1jj17R-0003SC-CS for geh-help-gnu-emacs@m.gmane-mx.org; Wed, 10 Jun 2020 09:50:17 -0400 Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:34188) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1jj0xJ-0007KF-Mc for help-gnu-emacs@gnu.org; Wed, 10 Jun 2020 09:39:49 -0400 Original-Received: from relay12.mail.gandi.net ([217.70.178.232]:52669) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1jj0xH-0002cO-HR for help-gnu-emacs@gnu.org; Wed, 10 Jun 2020 09:39:49 -0400 Original-Received: from [10.0.1.13] (pl22363.ag1313.nttpc.ne.jp [111.89.103.91]) (Authenticated sender: jean.christophe.helary@traduction-libre.org) by relay12.mail.gandi.net (Postfix) with ESMTPSA id 5995C200003; Wed, 10 Jun 2020 13:39:42 +0000 (UTC) In-Reply-To: <87pna73zuf.fsf@ebih.ebihd> X-Mailer: Apple Mail (2.3608.80.23.2.2) Received-SPF: pass client-ip=217.70.178.232; envelope-from=jean.christophe.helary@traduction-libre.org; helo=relay12.mail.gandi.net X-detected-operating-system: by eggs.gnu.org: First seen = 2020/06/10 08:51:53 X-ACL-Warn: Detected OS = Linux 3.11 and newer X-Spam_score_int: -25 X-Spam_score: -2.6 X-Spam_bar: -- X-Spam_report: (-2.6 / 5.0 requ) BAYES_00=-1.9, RCVD_IN_DNSWL_LOW=-0.7, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=_AUTOLEARN X-Spam_action: no action X-BeenThere: help-gnu-emacs@gnu.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: Users list for the GNU Emacs text editor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: help-gnu-emacs-bounces+geh-help-gnu-emacs=m.gmane-mx.org@gnu.org Original-Sender: "help-gnu-emacs" Xref: news.gmane.io gmane.emacs.help:123313 Archived-At: > On Jun 10, 2020, at 20:49, Emanuel Berg via Users list for the GNU = Emacs text editor wrote: >=20 > Jean-Christophe Helary wrote: >=20 >> Emacs default sentence definition can be expressed >> as a SRX rules set. SRX is useful for eXchanging >> information. But the format you use internally >> is irrelevant. >=20 > The Emacs sentence format and associated functions > certainly makes sense to use but with tweaked > settings to get in particular shorter segments > I think would benefit both searching the DB and > getting better results. Sure, but it's not trivial to find "natural" subsegments with the tools = at hand. > For example this is one sentence by the default > rules: >=20 > Both of these are library functions that do a lot > under the hood, but if they don't meet your needs > (or you just want to experiment and learn) you can > also use system calls directly. >=20 > But, would you get a good translation suggestion out > of all of that? It depends on what's been translated already, I guess. > Or is it better to simplify in terms of the computer, > and increase the interactivity/human checking by > feeding >=20 > 1. Both of these are library functions > 2. that do a lot under the hood > 3. but if they don't meet your needs > 4. or you just want to experiment and learn > 5. you can also use system calls directly >=20 > ? I honestly have no idea how complex matching algorithms work to produce = subsegment matches. > Anyway, if there isn't anything let's drop this, what > remains is the DB of translation suggestions, and the > algorithm to search and quantify, so e.g., for (1), > one would get, if one translates into Swedish >=20 > segment: Both of these are library functions hit > ---------------------------------------------------------- > suggestion 1 [a]: B=C3=A5da tv=C3=A5 =C3=A4r biblioteksfunktioner = 90% > suggestion 2 [s]: B=C3=A5da tv=C3=A5 =C3=A4r bibliotekets funktioner = 7% > ... ... ... > suggestion n [~]: Den h=C3=A4r =C3=B6vers=C3=A4ttningen suger = 1% >=20 > Then one would hit [a] to insert suggestion 1! >=20 > So yes, where do you get the database? In most of the cases, it's something the translator build from her own = translations. And then, there is the matching algorithm that produces the most = relevant match. --=20 Jean-Christophe Helary @brandelune http://mac4translators.blogspot.com