From mboxrd@z Thu Jan 1 00:00:00 1970
From: Torsten Hilbrich
Newsgroups: gmane.emacs.devel
Subject: Re: Integration of dictionary package
Date: Fri, 20 Nov 2020 07:44:07 +0100
Message-ID: <63f152c2-5c56-91a5-35a0-b98f327d5c03@hilbrich.tk>
References: <87o8k7zxoj.fsf@gmail.com> <494a350b-e2ac-5497-3336-61358517a102@hilbrich.tk>
To: Jean Louis
Cc: emacs-devel@gnu.org
Content-Type: text/plain; charset=utf-8

On 20.11.20 06:36, Jean Louis wrote:
> * Torsten Hilbrich [2020-11-20 08:29]:
>> On 20.11.20 03:32, Jean Louis wrote:
>>> ?boobytrapped?
>>>
>>> Some words appear with question marks. Do you know why is it so?
>>>
>>
>> I found examples of this in the jargon file. It seems to be some kind of
>> emphasis of the text. I compared with pure html versions and it seems
>> that there single quotes are used, like in
>> http://www.catb.org/jargon/html/F/foo.html
>>
>> dict: ?Fucked Up Beyond All Repair?
>> html: ‘Fucked Up Beyond All Repair’
>>
>> This seems to be caused by the conversion process of the jargon file
>> into the dictionary format (maybe by limitations of ASCII).
>>
>> We could do some post-processing on the text returned by the server
>> (which is already done to mark links). But the pattern would be quite
>> complex and most likely specific to the dictionary used.
>
> If that is not a bug in your package, then maybe it is in how dictd
> outputs those chars. If it outputs question marks, then it is better
> to leave it, as the future may bring an enhancement from the server side.

Here is an excerpt from asking the server directly:

torsten@t460:~$ telnet dict.org 2628
Trying 199.48.130.6...
Connected to dict.org.
Escape character is '^]'.
220 dict.dict.org dictd 1.12.1/rf on Linux 4.19.0-10-amd64 <269222.19371.1605854343@dict.dict.org>
define jargon foo
150 1 definitions retrieved
151 "foo" jargon "The Jargon File (version 4.4.7, 29 Dec 2003)"
foo
 /foo/

    1. interj. Term of disgust.

    2. [very common] Used very generally as a sample name for absolutely
    anything, esp. programs and files (esp. scratch files).

    3. First on the standard list of {metasyntactic variable}s used in
    syntax examples. See also {bar}, {baz}, {qux}, {quux}, {garply},
    {waldo}, {fred}, {plugh}, {xyzzy}, {thud}.

    When ?foo? is used in connection with ?bar? it has generally traced to
    the WWII-era Army slang acronym {FUBAR} (?Fucked Up Beyond All Repair?
    or ?Fucked Up Beyond All Recognition?), later modified to {foobar}.
    Early versions of the Jargon File interpreted this change as a post-war
    bowdlerization, but it it now seems more likely that FUBAR was itself a
    derivative of ?foo? perhaps influenced by German furchtbar (terrible) ?
    ?foobar? may actually have been the original form.
...

So the question marks originate in the server, and sending "OPTION MIME"
does not change the behaviour.

Torsten
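
For illustration only, here is a minimal Python sketch of what the post-processing mentioned above could look like. It is not what the dictionary package does (the package is Emacs Lisp), and the function name `restore_quotes`, the `AFFECTED_DATABASES` set, the pattern and the 60-character phrase limit are all assumptions made for this example. It also shows why the pattern is fragile: it has to tell a quoting "?" apart from a genuine question mark and from the lone "?" that replaced a dash in the excerpt.

```python
import re

# Assumption: only databases known to show the "?...?" artefact are touched.
# "jargon" is the database name used in the telnet session above.
AFFECTED_DATABASES = {"jargon"}

# Heuristic pattern (hypothetical, not taken from any real package):
_QUOTED = re.compile(
    r"(?<![^\s(\[])"           # opening "?" at text start or after space, "(" or "["
    r"\?"
    r"(\w(?:[^?]{0,60}?\w)?)"  # short phrase starting and ending with a word character
    r"\?"
    r"(?!\w)"                  # closing "?" must not be followed by a word character
)


def restore_quotes(text: str, database: str) -> str:
    """Replace ?phrase? with ‘phrase’ for databases known to be affected."""
    if database not in AFFECTED_DATABASES:
        return text
    return _QUOTED.sub(lambda m: "\u2018" + m.group(1) + "\u2019", text)


if __name__ == "__main__":
    sample = "When ?foo? is used in connection with ?bar? it has generally traced to"
    print(restore_quotes(sample, "jargon"))
```

A sentence-final question mark is left alone because it directly follows a word character, and the stray "?" that stands for a dash is left alone because it is followed by a space. A sentence such as "Is it ?x or y?" would still be rewritten wrongly, which is the kind of dictionary-specific guesswork that makes leaving the text as the server sends it a reasonable choice.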