From: Tetsuo Tsukamoto
Newsgroups: gmane.emacs.devel
Subject: Compounds in SKK-JISYO.L may be included?
Date: Sat, 26 Nov 2011 21:54:21 +0100
To: emacs-devel@gnu.org

Hello,

Emacs includes part of SKK-JISYO.L, a Kana-Kanji dictionary for a
Japanese input method called SKK. SKK-JISYO.L is more than 4 MB, but
leim/ja-dic.el is much smaller because it excludes words that can be
generated as compounds.

For example, leim/ja-dic.el doesn't contain the word 大丈夫 (dai-jo-bu),
which means "fine", "OK", "all right", etc. 大丈夫 can be generated
from 大 (dai) and 丈夫 (jo-bu), but this process also generates many
meaningless words, which may annoy users.

I would propose including the compounds for better efficacy, if the
size permits.

-- 
Tetsuo Tsukamoto
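
The compound generation described in the message can be illustrated with a small sketch. This is not the code Emacs uses for conversion; the dictionary entries, the name TOY_DIC, and the function compound_candidates are all hypothetical, chosen only to show why joining a prefix entry with a suffix entry yields 大丈夫 along with several candidates that are not words.

```python
# Toy dictionary: reading -> kanji candidates.
# Hypothetical entries for illustration, not copied from leim/ja-dic.el.
TOY_DIC = {
    "だい": ["大", "代", "台"],
    "じょうぶ": ["丈夫", "上部"],
}


def compound_candidates(reading, dic):
    """Join every prefix entry with every suffix entry for each split point."""
    results = []
    for i in range(1, len(reading)):
        prefix, suffix = reading[:i], reading[i:]
        # A split contributes only if both halves are dictionary readings.
        for head in dic.get(prefix, []):
            for tail in dic.get(suffix, []):
                results.append(head + tail)
    return results


candidates = compound_candidates("だいじょうぶ", TOY_DIC)
for word in candidates:
    print(word)
```

With this toy data the only productive split is だい + じょうぶ, which gives six candidates. 大丈夫 is the first, and the other five (such as 代丈夫 or 台上部) are the kind of meaningless compounds the message mentions. Storing 大丈夫 as its own entry would let it be offered directly.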