* bug#5990: 23.1; Cannot type the word 買います @ 2010-04-20 18:48 Erik Mekhsian 2016-08-07 4:05 ` Andrew Hyatt 0 siblings, 1 reply; 9+ messages in thread From: Erik Mekhsian @ 2010-04-20 18:48 UTC (permalink / raw) To: 5990 Setting the input method to Japanese and typing the word "kaimasu", then hitting <space> to cycle through the list of replacement kanji shows only 改マス and 開マス. In particular, I expected to see 買います, although 飼います, 交います, and 支います should be there as well. This is the only word I have had this problem with so far. In GNU Emacs 23.1.1 (i486-pc-linux-gnu, GTK+ Version 2.18.3) of 2010-03-25 on palmer, modified by Debian Windowing system distributor `The X.Org Foundation', version 11.0.10604000 configured using `configure '--build=i486-linux-gnu' '--host=i486-linux-gnu' '--prefix=/usr' '--sharedstatedir=/var/lib' '--libexecdir=/usr/lib' '--localstatedir=/var/lib' '--infodir=/usr/share/info' '--mandir=/usr/share/man' '--with-pop=yes' '--enable-locallisppath=/etc/emacs23:/etc/emacs:/usr/local/share/emacs/23.1/site-lisp:/usr/local/share/emacs/site-lisp:/usr/share/emacs/23.1/site-lisp:/usr/share/emacs/site-lisp:/usr/share/emacs/23.1/leim' '--with-x=yes' '--with-x-toolkit=gtk' '--with-toolkit-scroll-bars' 'build_alias=i486-linux-gnu' 'host_alias=i486-linux-gnu' 'CFLAGS=-DDEBIAN -g -O2' 'LDFLAGS=-g' 'CPPFLAGS='' Important settings: value of $LC_ALL: nil value of $LC_COLLATE: nil value of $LC_CTYPE: nil value of $LC_MESSAGES: nil value of $LC_MONETARY: nil value of $LC_NUMERIC: nil value of $LC_TIME: nil value of $LANG: en_US.UTF-8 value of $XMODIFIERS: @im=ibus locale-coding-system: utf-8-unix default-enable-multibyte-characters: t Major mode: Text Minor modes in effect: tooltip-mode: t tool-bar-mode: t mouse-wheel-mode: t menu-bar-mode: t file-name-shadow-mode: t global-font-lock-mode: t font-lock-mode: t blink-cursor-mode: t global-auto-composition-mode: t auto-composition-mode: t auto-encryption-mode: t auto-compression-mode: t line-number-mode: t transient-mark-mode: t Recent input: C-x C-f j p t s t . t x t <return> C-\ j a p a n e s e <return> k a i m a s u SPC SPC SPC SPC SPC SPC SPC <return> M-x r e p o r t - e m <tab> <return> Recent messages: For information about GNU Emacs and the GNU system, type C-h C-a. (New file) Loading quail/japanese...done Loading /home/akira/.kkcrc...done /usr/bin/mail is not an executable. Setting mail-interactive to t. Loading ja-dic/ja-dic...done ^ permalink raw reply [flat|nested] 9+ messages in thread
* bug#5990: 23.1; Cannot type the word 買います 2010-04-20 18:48 bug#5990: 23.1; Cannot type the word 買います Erik Mekhsian @ 2016-08-07 4:05 ` Andrew Hyatt 2016-08-07 13:59 ` Eli Zaretskii 0 siblings, 1 reply; 9+ messages in thread From: Andrew Hyatt @ 2016-08-07 4:05 UTC (permalink / raw) To: Erik Mekhsian; +Cc: 5990 Trying this out Emacs 25, I get completely different text. My choices are: かいます 垣間す 加居間す 過居間す 可居間す I don't read Japanese, so I can't comment on how appropriate any of the matches are. From what I can tell, the Japanese word list is build off of http://openlab.ring.gr.jp/skk/skk/dic/SKK-JISYO.L. I don't know about the process of changing that. Erik Mekhsian <akirashinigami@gmail.com> writes: > Setting the input method to Japanese and typing the word "kaimasu", then > hitting <space> to cycle through the list of replacement kanji shows > only 改マス and 開マス. In particular, I expected to see 買います, > although 飼います, 交います, and 支います should be there as well. This > is the only word I have had this problem with so far. > > In GNU Emacs 23.1.1 (i486-pc-linux-gnu, GTK+ Version 2.18.3) > of 2010-03-25 on palmer, modified by Debian > Windowing system distributor `The X.Org Foundation', version 11.0.10604000 > configured using `configure '--build=i486-linux-gnu' > '--host=i486-linux-gnu' '--prefix=/usr' '--sharedstatedir=/var/lib' > '--libexecdir=/usr/lib' '--localstatedir=/var/lib' > '--infodir=/usr/share/info' '--mandir=/usr/share/man' '--with-pop=yes' > '--enable-locallisppath=/etc/emacs23:/etc/emacs:/usr/local/share/emacs/23.1/site-lisp:/usr/local/share/emacs/site-lisp:/usr/share/emacs/23.1/site-lisp:/usr/share/emacs/site-lisp:/usr/share/emacs/23.1/leim' > '--with-x=yes' '--with-x-toolkit=gtk' '--with-toolkit-scroll-bars' > 'build_alias=i486-linux-gnu' 'host_alias=i486-linux-gnu' > 'CFLAGS=-DDEBIAN -g -O2' 'LDFLAGS=-g' 'CPPFLAGS='' > > Important settings: > value of $LC_ALL: nil > value of $LC_COLLATE: nil > value of $LC_CTYPE: nil > value of $LC_MESSAGES: nil > value of $LC_MONETARY: nil > value of $LC_NUMERIC: nil > value of $LC_TIME: nil > value of $LANG: en_US.UTF-8 > value of $XMODIFIERS: @im=ibus > locale-coding-system: utf-8-unix > default-enable-multibyte-characters: t > > Major mode: Text > > Minor modes in effect: > tooltip-mode: t > tool-bar-mode: t > mouse-wheel-mode: t > menu-bar-mode: t > file-name-shadow-mode: t > global-font-lock-mode: t > font-lock-mode: t > blink-cursor-mode: t > global-auto-composition-mode: t > auto-composition-mode: t > auto-encryption-mode: t > auto-compression-mode: t > line-number-mode: t > transient-mark-mode: t > > Recent input: > C-x C-f j p t s t . t x t <return> C-\ j a p a n e > s e <return> k a i m a s u SPC SPC SPC SPC SPC SPC > SPC <return> M-x r e p o r t - e m <tab> <return> > > Recent messages: > For information about GNU Emacs and the GNU system, type C-h C-a. > (New file) > Loading quail/japanese...done > Loading /home/akira/.kkcrc...done > /usr/bin/mail is not an executable. Setting mail-interactive to t. > Loading ja-dic/ja-dic...done ^ permalink raw reply [flat|nested] 9+ messages in thread
* bug#5990: 23.1; Cannot type the word 買います 2016-08-07 4:05 ` Andrew Hyatt @ 2016-08-07 13:59 ` Eli Zaretskii 2016-08-08 17:01 ` Alex 0 siblings, 1 reply; 9+ messages in thread From: Eli Zaretskii @ 2016-08-07 13:59 UTC (permalink / raw) To: Andrew Hyatt; +Cc: akirashinigami, 5990 > From: Andrew Hyatt <ahyatt@gmail.com> > Date: Sun, 07 Aug 2016 00:05:45 -0400 > Cc: 5990@debbugs.gnu.org > > Trying this out Emacs 25, I get completely different text. My choices > are: > > かいます > 垣間す > 加居間す > 過居間す > 可居間す > > I don't read Japanese, so I can't comment on how appropriate any of the > matches are. I tried the same in Emacs 23, and I couldn't reproduce the issue there, either. Maybe the OP had some customized dictionary, or maybe even didn't use the Emacs IM? E.g., this might be relevant: > value of $XMODIFIERS: @im=ibus > >From what I can tell, the Japanese word list is build off of > http://openlab.ring.gr.jp/skk/skk/dic/SKK-JISYO.L. I don't know about > the process of changing that. If there are significant updates, we can import a new version for the next release, I think. ^ permalink raw reply [flat|nested] 9+ messages in thread
* bug#5990: 23.1; Cannot type the word 買います 2016-08-07 13:59 ` Eli Zaretskii @ 2016-08-08 17:01 ` Alex 2021-08-26 19:21 ` Lars Ingebrigtsen 0 siblings, 1 reply; 9+ messages in thread From: Alex @ 2016-08-08 17:01 UTC (permalink / raw) To: Eli Zaretskii; +Cc: Andrew Hyatt, akirashinigami, 5990 Eli Zaretskii <eliz@gnu.org> writes: > E.g., this might be relevant: > >> value of $XMODIFIERS: @im=ibus FWIW, I tried it on a system with that value and I got the same results as Andrew. Maybe he did use a different IM. > If there are significant updates, we can import a new version for the > next release, I think. I can't say anything on that front, but I tried updating the Japanese dictionary to the latest version using skkdic-convert in lisp/international/ja-dic-cnv.el and it still gave the "wrong" results for 買います. I can also reproduce this with inflected versions of some common words like はなします and よみます. Neither produce the expected results of 話 します and 読みます. The above examples work as expected with the mozc IM (https://github.com/google/mozc). ^ permalink raw reply [flat|nested] 9+ messages in thread
* bug#5990: 23.1; Cannot type the word 買います 2016-08-08 17:01 ` Alex @ 2021-08-26 19:21 ` Lars Ingebrigtsen 2021-08-31 6:13 ` Naohiro Aota 0 siblings, 1 reply; 9+ messages in thread From: Lars Ingebrigtsen @ 2021-08-26 19:21 UTC (permalink / raw) To: Alex; +Cc: Andrew Hyatt, akirashinigami, 5990 Alex <agrambot@gmail.com> writes: >> If there are significant updates, we can import a new version for the >> next release, I think. > > I can't say anything on that front, but I tried updating the Japanese > dictionary to the latest version using skkdic-convert in > lisp/international/ja-dic-cnv.el and it still gave the "wrong" results for > 買います. In Emacs 28, I'm getting: 魔居間す 加居間す 過居間す 可居間す かいます So I'm not getting 買います here, either. But I guess we're just using whatever is in: http://openlab.ring.gr.jp/skk/skk/dic/SKK-JISYO.L (Our version was updated earlier this year.) So is this something that we can fix on our side, or is it just what this dictionary says? -- (domestic pets only, the antidote for overdose, milk.) bloggy blog: http://lars.ingebrigtsen.no ^ permalink raw reply [flat|nested] 9+ messages in thread
* bug#5990: 23.1; Cannot type the word 買います 2021-08-26 19:21 ` Lars Ingebrigtsen @ 2021-08-31 6:13 ` Naohiro Aota 2021-08-31 8:34 ` Lars Ingebrigtsen 0 siblings, 1 reply; 9+ messages in thread From: Naohiro Aota @ 2021-08-31 6:13 UTC (permalink / raw) To: Lars Ingebrigtsen; +Cc: Andrew Hyatt, akirashinigami, Alex, 5990 On Thu, Aug 26, 2021 at 09:21:12PM +0200, Lars Ingebrigtsen wrote: > Alex <agrambot@gmail.com> writes: > > >> If there are significant updates, we can import a new version for the > >> next release, I think. > > > > I can't say anything on that front, but I tried updating the Japanese > > dictionary to the latest version using skkdic-convert in > > lisp/international/ja-dic-cnv.el and it still gave the "wrong" results for > > 買います. Background: I'm Japanese. > In Emacs 28, I'm getting: > These are still all valid (possible) kanji conversion for "kaimasu". > 魔居間す (This looks strange. I guess there are some mistakes for this one.) > 加居間す > 過居間す > 可居間す All these can be read as "ka" (加, 過, 可) + "ima" (居間) + "su" (す). > かいます The implementation of leim's kanji conversion is so simple as follow. In short, it just find the longest match from the dictionary even if it is strange (for Japanese) or not. (setq kkc-current-key (string-to-vector kkc-original-kana)) (setq kkc-length-head (length kkc-current-key)) (unwind-protect ... (while (not (kkc-lookup-key kkc-length-head nil first)) (setq kkc-length-head (1- kkc-length-head) first nil)) So, we get the longest conversion of "kaimasu" as "kaima" as above. To get "買います", we need explicitly set the conversion length to 2 with C-o/C-i. It can be reproduced in a code like this: (let ((kkc-current-key "かいます")) (kkc-lookup-key 2) kkc-current-conversions) (1 "買い" "書い" "描い" "飼い" "画い" "欠い" "掻い" "嗅い" "交い" "畫い" "缺い" ...) ;; `-- Here, we have the "買います" result (let ((kkc-current-key "かいます")) (kkc-lookup-key 3) kkc-current-conversions) (1 "垣間" "加居間" "過居間" "可居間") > So I'm not getting 買います here, either. > > But I guess we're just using whatever is in: > > http://openlab.ring.gr.jp/skk/skk/dic/SKK-JISYO.L > > (Our version was updated earlier this year.) So is this something that > we can fix on our side, or is it just what this dictionary says? Well, other IM like mozc is much more intelligent to prefer "買います" than the above three conversions because "買います" is plausible. IMHO, implementing such complex algorithm is out of scope for leim. It still can be a "rescue" tool even with the simple algorithm if you can set a proper conversion length. And, we anyway use other IM like mozc, SKK, tc.el. Thanks, > -- > (domestic pets only, the antidote for overdose, milk.) > bloggy blog: http://lars.ingebrigtsen.no > > > ^ permalink raw reply [flat|nested] 9+ messages in thread
* bug#5990: 23.1; Cannot type the word 買います 2021-08-31 6:13 ` Naohiro Aota @ 2021-08-31 8:34 ` Lars Ingebrigtsen 2021-08-31 13:03 ` Naohiro Aota 0 siblings, 1 reply; 9+ messages in thread From: Lars Ingebrigtsen @ 2021-08-31 8:34 UTC (permalink / raw) To: Naohiro Aota; +Cc: Andrew Hyatt, akirashinigami, Alex, 5990 Naohiro Aota <naota@elisp.net> writes: > To get "買います", we need explicitly set the conversion length to 2 > with C-o/C-i. It can be reproduced in a code like this: > > (let ((kkc-current-key "かいます")) > (kkc-lookup-key 2) > kkc-current-conversions) > (1 "買い" "書い" "描い" "飼い" "画い" "欠い" "掻い" "嗅い" "交い" "畫い" "缺い" ...) > ;; `-- Here, we have the "買います" result Ah, I see. > IMHO, implementing such complex algorithm is out of scope for leim. It > still can be a "rescue" tool even with the simple algorithm if you can > set a proper conversion length. And, we anyway use other IM like mozc, > SKK, tc.el. So it this basically working as designed, and the bug report should be closed? -- (domestic pets only, the antidote for overdose, milk.) bloggy blog: http://lars.ingebrigtsen.no ^ permalink raw reply [flat|nested] 9+ messages in thread
* bug#5990: 23.1; Cannot type the word 買います 2021-08-31 8:34 ` Lars Ingebrigtsen @ 2021-08-31 13:03 ` Naohiro Aota 2021-09-01 8:07 ` Lars Ingebrigtsen 0 siblings, 1 reply; 9+ messages in thread From: Naohiro Aota @ 2021-08-31 13:03 UTC (permalink / raw) To: Lars Ingebrigtsen; +Cc: Andrew Hyatt, akirashinigami, Alex, 5990 On Tue, Aug 31, 2021 at 10:34:11AM +0200, Lars Ingebrigtsen wrote: > Naohiro Aota <naota@elisp.net> writes: > > > To get "買います", we need explicitly set the conversion length to 2 > > with C-o/C-i. It can be reproduced in a code like this: > > > > (let ((kkc-current-key "かいます")) > > (kkc-lookup-key 2) > > kkc-current-conversions) > > (1 "買い" "書い" "描い" "飼い" "画い" "欠い" "掻い" "嗅い" "交い" "畫い" "缺い" ...) > > ;; `-- Here, we have the "買います" result > > Ah, I see. > > > IMHO, implementing such complex algorithm is out of scope for leim. It > > still can be a "rescue" tool even with the simple algorithm if you can > > set a proper conversion length. And, we anyway use other IM like mozc, > > SKK, tc.el. > > So it this basically working as designed, and the bug report should be > closed? Yes, I think so. > -- > (domestic pets only, the antidote for overdose, milk.) > bloggy blog: http://lars.ingebrigtsen.no ^ permalink raw reply [flat|nested] 9+ messages in thread
* bug#5990: 23.1; Cannot type the word 買います 2021-08-31 13:03 ` Naohiro Aota @ 2021-09-01 8:07 ` Lars Ingebrigtsen 0 siblings, 0 replies; 9+ messages in thread From: Lars Ingebrigtsen @ 2021-09-01 8:07 UTC (permalink / raw) To: Naohiro Aota; +Cc: Andrew Hyatt, akirashinigami, Alex, 5990 Naohiro Aota <naota@elisp.net> writes: >> So it this basically working as designed, and the bug report should be >> closed? > > Yes, I think so. Thanks for the explanation; I'm closing this bug report, then. -- (domestic pets only, the antidote for overdose, milk.) bloggy blog: http://lars.ingebrigtsen.no ^ permalink raw reply [flat|nested] 9+ messages in thread
end of thread, other threads:[~2021-09-01 8:07 UTC | newest] Thread overview: 9+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2010-04-20 18:48 bug#5990: 23.1; Cannot type the word 買います Erik Mekhsian 2016-08-07 4:05 ` Andrew Hyatt 2016-08-07 13:59 ` Eli Zaretskii 2016-08-08 17:01 ` Alex 2021-08-26 19:21 ` Lars Ingebrigtsen 2021-08-31 6:13 ` Naohiro Aota 2021-08-31 8:34 ` Lars Ingebrigtsen 2021-08-31 13:03 ` Naohiro Aota 2021-09-01 8:07 ` Lars Ingebrigtsen
Code repositories for project(s) associated with this public inbox https://git.savannah.gnu.org/cgit/emacs.git This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).