From: Emanuel Berg via Users list for the GNU Emacs text editor
Newsgroups: gmane.emacs.help
Subject: Re: Emacs as a translator's tool
Date: Mon, 01 Jun 2020 10:51:26 +0200
Message-ID: <87eeqzmanl.fsf@ebih.ebihd>
References: <871rn35lqc.fsf@mbork.pl> <87zh9r45ad.fsf@mbork.pl> <87h7vz2m5g.fsf@ebih.ebihd> <87d06k4rmg.fsf@mbork.pl>
To: help-gnu-emacs@gnu.org

Marcin Borkowski wrote:

>> Can't you do ecat-highlight-next-sentence and
>> ecat-highlight-previous-sentence by just moving
>> point to the next sentence and then do
>> ecat-highlight-this-sentence? Feels more
>> natural...
>
> That would completely defeat the purpose. All CAT
> stuff (as many people already told) is about
> efficiency. One of the main points of my (very
> simple) code is that I do not have to move
> point anywhere.

Because it's inefficient? ... you are a fast
translator :) but ... OK. Only it still looks weird
with the same code two extra times.

> Well, you already got your answers, but let me
> stress that one of the important points is
> extracting text from some strange formats and
> putting the translation back into it. Think Word
> documents with complex formatting, or HTML with
> many tags/attributes. If you are to translate
> things like
>
> Click the class="dancing-elephants">big red button to
> launch the nuke
>
> and all the markup has to be there in the
> translation, you really don't want to type it by
> hand - it's time-consuming and error-prone.

Well, that's rather a task for a parser, to convert
from one format to another... very mechanical and
easy.

>> Some idiomatic phrases are pitfalls tho.
>> For example the English "more or less" looks like
>> the Swedish "mer eller mindre" (which means
>> "correct but with room for fine details") but the
>> way native speakers use it seems to be more (?)
>> "både och" which means discussion can go both
>> (disparate) ways and BOTH are correct! So perhaps
>> one could have a list of these "trap phrases" so
>> when they turn up in the text, they are
>> highlighted to indicate "watch out! we are not
>> just piling words here!" Who'd compile that list
>> is another matter...
>
> I guess this is a very minor problem...

It depends on how many there are... it should be
systemized, just as everything unusual is in any
trade. An easy thing to do, with huge gain, and
possible to build on and extend...

-- 
underground experts united
http://user.it.uu.se/~embe8573
https://dataswamp.org/~incal
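
The "trap phrases" list suggested above could be sketched
roughly like this. It is only an illustration, written in
Python rather than Elisp, and nothing in it comes from the
ecat code: the names TRAP_PHRASES, find_trap_phrases and
mark are made up, the >>> <<< markers stand in for real
highlighting, and the one entry is the example from this
message. A real list would have to be compiled by someone
who knows both languages.

```python
import re

# Hypothetical trap-phrase list: source phrase -> note for the translator.
# The single entry is the example from the message above.
TRAP_PHRASES = {
    "more or less": 'looks like Swedish "mer eller mindre", check the sense',
}


def find_trap_phrases(text, phrases=TRAP_PHRASES):
    """Return sorted (start, end, matched_text, note) tuples for each hit."""
    hits = []
    for phrase, note in phrases.items():
        # Whole words, case-insensitive, any whitespace between the words
        # (so a phrase broken across a line still matches).
        pattern = r"\b" + r"\s+".join(map(re.escape, phrase.split())) + r"\b"
        for m in re.finditer(pattern, text, flags=re.IGNORECASE):
            hits.append((m.start(), m.end(), m.group(0), note))
    return sorted(hits)


def mark(text, phrases=TRAP_PHRASES):
    """Wrap every hit in >>> <<< as a plain-text stand-in for highlighting."""
    out, pos = [], 0
    for start, end, found, _note in find_trap_phrases(text, phrases):
        if start < pos:
            continue  # skip a hit that overlaps one already marked
        out.append(text[pos:start])
        out.append(">>>" + found + "<<<")
        pos = end
    out.append(text[pos:])
    return "".join(out)


if __name__ == "__main__":
    print(mark("The draft is more or less finished."))
```

In Emacs the same list could presumably drive the
highlighting directly instead of inserting markers, but
that part is left out here.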