all messages for Emacs-related lists mirrored at yhetil.org
 help / color / mirror / code / Atom feed
From: Lars Ingebrigtsen <larsi@gnus.org>
To: Stefan Kangas <stefan@marxist.se>
Cc: Emacs developers <emacs-devel@gnu.org>
Subject: Re: OKURI-NASI
Date: Mon, 30 May 2022 11:53:28 +0200	[thread overview]
Message-ID: <87ee0bxs6f.fsf@gnus.org> (raw)
In-Reply-To: <CADwFkmmryY9Zy7OOy4X1oscC0Xdv+KCLgzeTqOUa6rkqCOfAVw@mail.gmail.com> (Stefan Kangas's message of "Mon, 30 May 2022 09:31:06 +0200")

Stefan Kangas <stefan@marxist.se> writes:

> I currently synch SKK-JISYO.L against upstream once a month [but there
> has been no changes to this file since December last year].  It should
> be easy to include a step recreating the generated file and committing
> that too, if that's what we want.

It is extra work, though, and any work that can be avoided is nice.  😀

Doing some very light profiling here, a lot of the time is taken up by
skkdic-get-entry, which is just lookup-nested-alist.  My guess is that
if somebody took a look ja-dic-cnv.el, this algorithm could be made
substantially more efficient by using other data structures than an
extremely long nested alist.

But I have really no idea what it's really doing, so it's a bit daunting
to start poking at the code.  And my guess is that's why nobody else
has, either, since not many people currently hacking at Emacs has the
required domain knowledge.

;;; Commentary:

;; SKK is a Japanese input method running on Mule created by Masahiko
;; Sato <masahiko@sato.riec.tohoku.ac.jp>.  Here we provide utilities
;; to handle a dictionary distributed with SKK so that a different
;; input method (e.g. quail-japanese) can utilize the dictionary.

;; The format of SKK dictionary is quite simple.  Each line has the
;; form "KANASTRING /CONV1/CONV2/.../" which means KANASTRING (仮名文
;; 字列) can be converted to one of CONVi.  CONVi is a Kanji (漢字)
;; and Kana (仮名) mixed string.
;;
;; KANASTRING may have a trailing ASCII letter for Okurigana (送り仮名)
;; information.  For instance, the trailing letter `k' means that one
;; of the following Okurigana is allowed: かきくけこ.  So, in that
;; case, the string "KANASTRINGく" can be converted to one of "CONV1く",
;; CONV2く, ...

Well, that doesn't sound all that complicated, eh?

(I'm hoping to entice somebody to see this as a fun challenge.  😀)

-- 
(domestic pets only, the antidote for overdose, milk.)
   bloggy blog: http://lars.ingebrigtsen.no



  reply	other threads:[~2022-05-30  9:53 UTC|newest]

Thread overview: 14+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-05-29 20:18 OKURI-NASI Lars Ingebrigtsen
2022-05-29 20:22 ` OKURI-NASI Lars Ingebrigtsen
2022-05-30  2:28   ` OKURI-NASI Eli Zaretskii
2022-05-30  9:47     ` OKURI-NASI Lars Ingebrigtsen
2022-05-30 11:39       ` OKURI-NASI Eli Zaretskii
2022-05-30 11:44         ` OKURI-NASI Lars Ingebrigtsen
2022-05-30 11:49           ` OKURI-NASI Lars Ingebrigtsen
2022-05-30 12:08           ` OKURI-NASI Eli Zaretskii
2022-05-30 15:22             ` OKURI-NASI Lars Ingebrigtsen
2022-06-01  2:54               ` OKURI-NASI Lars Ingebrigtsen
2022-05-30  7:31 ` OKURI-NASI Stefan Kangas
2022-05-30  9:53   ` Lars Ingebrigtsen [this message]
2022-05-30 13:06     ` OKURI-NASI Stefan Monnier
2022-05-30 13:19       ` OKURI-NASI Lars Ingebrigtsen

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=87ee0bxs6f.fsf@gnus.org \
    --to=larsi@gnus.org \
    --cc=emacs-devel@gnu.org \
    --cc=stefan@marxist.se \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this external index

	https://git.savannah.gnu.org/cgit/emacs.git
	https://git.savannah.gnu.org/cgit/emacs/org-mode.git

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.