unofficial mirror of bug-gnu-emacs@gnu.org 
 help / color / mirror / code / Atom feed
* bug#41488: 26.3; Cyrillic chars in output of find-dired and find-name-dired looks like '\234\188\123'
@ 2020-05-23 14:40 Viktor Larkin
  2020-05-23 16:03 ` Eli Zaretskii
  0 siblings, 1 reply; 3+ messages in thread
From: Viktor Larkin @ 2020-05-23 14:40 UTC (permalink / raw)
  To: 41488


Hello.

I've no idea what causes this bug, but... When I do recursive search
with find-dired or find-name-dired with cyrillic chars as search pattern
(e.g. документ) I always get something like '\234\221\123' instead of
correct cyrillic chars in file names and file paths. To reproduce:

1. emacs -Q

2. C-x d

3. M-x find-name-dired

4. RET

5. документ*.txt

6. RET

Output of locale in terminal and in eshell is the same. ru_RU.UTF-8 is a
chosen encoding for LANG and LC_CTYPE. Emacs version is 26.3. GNU findutils
version is 4.7.0.

I've found this topic
https://www.reddit.com/r/emacs/comments/bvuv9q/cyrillic_letters_dont_work_with_find_in_emacs/eptkhsr/
. Seems like bug is still in the wild.



In GNU Emacs 26.3 (build 1, x86_64-slackware-linux-gnu, GTK+ Version 3.24.12)
 of 2019-11-25 built on hive64.slackware.lan
Windowing system distributor 'The X.Org Foundation', version 11.0.12008000
Recent messages:
Undo!
Change in Dired buffer undone.
Actual changes in files cannot be undone by Emacs.
find-dired *Find* finished.
Quit
Making completion list...
You can run the command ‘describe-coding-system’ with C-h C
mwheel-scroll: Beginning of buffer [2 times]
Mark set
Making completion list...

Configured using:
 'configure --prefix=/usr --sysconfdir=/etc --localstatedir=/var
 --program-prefix= --program-suffix= --mandir=/usr/man
 --infodir=/usr/info --without-gconf --without-gsettings --with-modules
 --with-x --with-x-toolkit=gtk3 --build=x86_64-slackware-linux
 'CFLAGS=-O2 -fPIC''

Configured features:
XPM JPEG TIFF GIF PNG RSVG IMAGEMAGICK SOUND GPM DBUS GLIB NOTIFY ACL
GNUTLS LIBXML2 FREETYPE XFT ZLIB TOOLKIT_SCROLL_BARS GTK3 X11 XDBE XIM
MODULES THREADS LCMS2

Important settings:
  value of $LC_COLLATE: C
  value of $LC_CTYPE: ru_RU.UTF-8
  value of $LC_MESSAGES: ru_RU.UTF-8
  value of $LC_MONETARY: ru_RU.UTF-8
  value of $LC_NUMERIC: ru_RU.UTF-8
  value of $LC_TIME: en_GB.UTF-8
  value of $LANG: ru_RU.UTF-8
  locale-coding-system: utf-8-unix

Major mode: Eshell

Minor modes in effect:
  shell-dirtrack-mode: t
  global-undo-tree-mode: t
  undo-tree-mode: t
  icomplete-mode: t
  electric-pair-mode: t
  display-time-mode: t
  global-hl-line-mode: t
  override-global-mode: t
  tooltip-mode: t
  global-eldoc-mode: t
  electric-indent-mode: t
  mouse-wheel-mode: t
  file-name-shadow-mode: t
  global-font-lock-mode: t
  font-lock-mode: t
  auto-composition-mode: t
  auto-encryption-mode: t
  auto-compression-mode: t
  line-number-mode: t
  transient-mark-mode: t

Load-path shadows:
None found.

Features:
(shadow sort bbdb-message mail-extr emacsbug mule-diag misearch
multi-isearch org-rmail org-mhe org-irc org-info org-gnus nnir
org-docview doc-view jka-compr image-mode org-bibtex bibtex org-bbdb
org-w3m bookmark pp eieio-opt speedbar sb-image ezimage dframe help-fns
radix-tree thingatpt etags xref project pcmpl-unix pcmpl-gnu novice
conf-mode em-unix em-term term disp-table ehelp em-script em-prompt
em-ls em-hist em-pred em-glob em-dirs em-cmpl em-basic em-banner
em-alias esh-var esh-io esh-cmd esh-opt esh-ext esh-proc esh-arg
esh-groups eshell esh-module esh-mode esh-util gnus-delay gnus-draft
gnus-agent gnus-srvr gnus-score score-mode nnvirtual gnus-cache gnus-msg
gnus-art mm-uu mml2015 mm-view mml-smime smime dig mailcap gnus-sum
nndraft nnmh shell find-dired quail dired-aux gnus-dired ox-odt rng-loc
rng-uri rng-parse rng-match rng-dt rng-util rng-pttrn nxml-parse nxml-ns
nxml-enc xmltok nxml-util ox-latex ox-icalendar ox-html table ox-ascii
ox-publish ox undo-tree diff bbdb-mua bbdb-com crm bbdb bbdb-site
timezone org-bullets yasnippet icomplete smtpmail sendmail gnus-topic
gnus-demon nntp gnus-group gnus-undo gnus-start gnus-cloud nnimap nnmail
mail-source tls gnutls utf7 netrc parse-time gnus-spec gnus-win nnoo
gnus-int gnus-range message rmc puny dired dired-loaddefs rfc822 mml
mml-sec mm-decode mm-bodies mm-encode mail-parse rfc2231 mailabbrev
gmm-utils mailheader gnus nnheader gnus-util rmail rmail-loaddefs
rfc2047 rfc2045 ietf-drums mail-utils mm-util mail-prsvr wid-edit
re-builder ibuffer ibuffer-loaddefs epa-file epa derived epg edmacro
kmacro elec-pair cl-extra help-mode time hl-line org-element avl-tree
generator org advice org-macro org-footnote org-pcomplete pcomplete
org-list org-faces org-entities noutline outline org-version
ob-emacs-lisp ob ob-tangle org-src ob-ref ob-lob ob-table ob-keys ob-exp
ob-comint comint ansi-color ring ob-core ob-eval org-compat org-macs
org-loaddefs format-spec find-func cal-menu calendar cal-loaddefs
use-package use-package-ensure use-package-delight use-package-diminish
use-package-bind-key bind-key easy-mmode use-package-core finder-inf
tex-site info package easymenu epg-config url-handlers url-parse
auth-source cl-seq eieio eieio-core cl-macs eieio-loaddefs
password-cache url-vars seq byte-opt gv bytecomp byte-compile cconv
cl-loaddefs cl-lib time-date mule-util tooltip eldoc electric uniquify
ediff-hook vc-hooks lisp-float-type mwheel term/x-win x-win
term/common-win x-dnd tool-bar dnd fontset image regexp-opt fringe
tabulated-list replace newcomment text-mode elisp-mode lisp-mode
prog-mode register page menu-bar rfn-eshadow isearch timer select
scroll-bar mouse jit-lock font-lock syntax facemenu font-core
term/tty-colors frame cl-generic cham georgian utf-8-lang misc-lang
vietnamese tibetan thai tai-viet lao korean japanese eucjp-ms cp51932
hebrew greek romanian slovak czech european ethiopic indian cyrillic
chinese composite charscript charprop case-table epa-hook jka-cmpr-hook
help simple abbrev obarray minibuffer cl-preloaded nadvice loaddefs
button faces cus-face macroexp files text-properties overlay sha1 md5
base64 format env code-pages mule custom widget hashtable-print-readable
backquote threads dbusbind inotify lcms2 dynamic-setting
font-render-setting move-toolbar gtk x-toolkit x multi-tty
make-network-process emacs)

Memory information:
((conses 16 480598 45587)
 (symbols 48 47686 1)
 (miscs 40 752 933)
 (strings 32 150163 6542)
 (string-bytes 1 4624749)
 (vectors 16 58053)
 (vector-slots 8 1708284 169674)
 (floats 8 314 520)
 (intervals 56 6746 1261)
 (buffers 992 47))






^ permalink raw reply	[flat|nested] 3+ messages in thread

* bug#41488: 26.3; Cyrillic chars in output of find-dired and find-name-dired looks like '\234\188\123'
  2020-05-23 14:40 bug#41488: 26.3; Cyrillic chars in output of find-dired and find-name-dired looks like '\234\188\123' Viktor Larkin
@ 2020-05-23 16:03 ` Eli Zaretskii
  2020-05-24  3:34   ` Dmitry Alexandrov
  0 siblings, 1 reply; 3+ messages in thread
From: Eli Zaretskii @ 2020-05-23 16:03 UTC (permalink / raw)
  To: Viktor Larkin; +Cc: 41488

merge 41488 36085
thanks

> From: Viktor Larkin <com.larkin@gmail.com>
> Date: Sat, 23 May 2020 17:40:51 +0300
> 
> I've no idea what causes this bug, but... When I do recursive search
> with find-dired or find-name-dired with cyrillic chars as search pattern
> (e.g. документ) I always get something like '\234\221\123' instead of
> correct cyrillic chars in file names and file paths. To reproduce:
> 
> 1. emacs -Q
> 
> 2. C-x d
> 
> 3. M-x find-name-dired
> 
> 4. RET
> 
> 5. документ*.txt
> 
> 6. RET
> 
> Output of locale in terminal and in eshell is the same. ru_RU.UTF-8 is a
> chosen encoding for LANG and LC_CTYPE. Emacs version is 26.3. GNU findutils
> version is 4.7.0.
> 
> I've found this topic
> https://www.reddit.com/r/emacs/comments/bvuv9q/cyrillic_letters_dont_work_with_find_in_emacs/eptkhsr/
> . Seems like bug is still in the wild.

This is a duplicate of bug#36085, which was found not to be an Emacs
bug.  See the discussion there:

  https://debbugs.gnu.org/cgi/bugreport.cgi?bug=36085





^ permalink raw reply	[flat|nested] 3+ messages in thread

* bug#41488: 26.3; Cyrillic chars in output of find-dired and find-name-dired looks like '\234\188\123'
  2020-05-23 16:03 ` Eli Zaretskii
@ 2020-05-24  3:34   ` Dmitry Alexandrov
  0 siblings, 0 replies; 3+ messages in thread
From: Dmitry Alexandrov @ 2020-05-24  3:34 UTC (permalink / raw)
  To: Eli Zaretskii; +Cc: Viktor Larkin, 41488


[-- Attachment #1.1: Type: text/plain, Size: 330 bytes --]

Eli Zaretskii <eliz@gnu.org> wrote:
>> From: Viktor Larkin <com.larkin@gmail.com>
>>
>> I've found this topic https://www.reddit.com/r/emacs/comments/bvuv9q/cyrillic_letters_dont_work_with_find_in_emacs/eptkhsr/
>
> This is a duplicate of bug#36085

Neither Reddit thread nor that bug mention the most obvious workaround though:


[-- Attachment #1.2: Type: application/emacs-lisp, Size: 188 bytes --]

;; ‘-ls’ switch is broken with regards to non-ASCII
(setq find-ls-option '("-printf '%M %n %u %g %s %TY-%Tm-%Td %TH:%TM %p\n'" .
                       "-l --time-style=long-iso"))

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 247 bytes --]

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2020-05-24  3:34 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2020-05-23 14:40 bug#41488: 26.3; Cyrillic chars in output of find-dired and find-name-dired looks like '\234\188\123' Viktor Larkin
2020-05-23 16:03 ` Eli Zaretskii
2020-05-24  3:34   ` Dmitry Alexandrov

Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).