unofficial mirror of bug-gnu-emacs@gnu.org 
* bug#50718: 28.0.50; `split-string` fails on certain unicode strings
@ 2021-09-21  9:27 dalanicolai
  2021-09-21  9:44 ` Eli Zaretskii
  2021-09-21  9:51 ` Andreas Schwab
  0 siblings, 2 replies; 5+ messages in thread
From: dalanicolai @ 2021-09-21  9:27 UTC (permalink / raw)
  To: 50718

[-- Attachment #1: Type: text/plain, Size: 3443 bytes --]

Evaluate: (split-string "१०.३" ".")
It wrongly returns a list containing only empty strings.
Of course, it should return a list with the individual Devanagari numbers.


In GNU Emacs 28.0.50 (build 3, x86_64-pc-linux-gnu, GTK+ Version 3.24.30,
cairo version 1.17.4)
 of 2021-09-06 built on daniel-fedora
Repository revision: c4724add006e62b81f847937db56335a81bdcc74
Repository branch: master
Windowing system distributor 'The X.Org Foundation', version 11.0.12011000
System Description: Fedora 34 (Workstation Edition)

Configured using:
 'configure --with-mailutils --with-cairo --with-modules --with-pgtk
 --with-native-compilation'

Configured features:
ACL CAIRO DBUS FREETYPE GIF GLIB GMP GNUTLS GPM GSETTINGS HARFBUZZ JPEG
JSON LCMS2 LIBOTF LIBSELINUX LIBSYSTEMD LIBXML2 M17N_FLT MODULES
NATIVE_COMP NOTIFY INOTIFY PDUMPER PNG RSVG SECCOMP SOUND THREADS TIFF
TOOLKIT_SCROLL_BARS X11 XDBE XIM XPM GTK3 ZLIB

Important settings:
  value of $LANG: en_US.UTF-8
  value of $XMODIFIERS: @im=none
  locale-coding-system: utf-8-unix

Major mode: Lisp Interaction

Minor modes in effect:
  tooltip-mode: t
  global-eldoc-mode: t
  eldoc-mode: t
  electric-indent-mode: t
  mouse-wheel-mode: t
  tool-bar-mode: t
  menu-bar-mode: t
  file-name-shadow-mode: t
  global-font-lock-mode: t
  font-lock-mode: t
  blink-cursor-mode: t
  auto-composition-mode: t
  auto-encryption-mode: t
  auto-compression-mode: t
  line-number-mode: t
  indent-tabs-mode: t
  transient-mark-mode: t

Load-path shadows:
None found.

Features:
(shadow sort mail-extr emacsbug comp comp-cstr warnings rx message rmc
puny dired dired-loaddefs rfc822 mml mml-sec epa derived epg rfc6068
epg-config gnus-util rmail rmail-loaddefs auth-source cl-seq eieio
eieio-core cl-macs eieio-loaddefs password-cache json map mm-decode
mm-bodies mm-encode mail-parse rfc2231 mailabbrev gmm-utils mailheader
sendmail rfc2047 rfc2045 ietf-drums mm-util mail-prsvr mail-utils
time-date subr-x cl-extra shortdoc text-property-search seq byte-opt gv
bytecomp byte-compile cconv help-fns radix-tree help-mode cl-loaddefs
cl-lib iso-transl tooltip eldoc electric uniquify ediff-hook vc-hooks
lisp-float-type mwheel term/x-win x-win term/common-win x-dnd tool-bar
dnd fontset image regexp-opt fringe tabulated-list replace newcomment
text-mode elisp-mode lisp-mode prog-mode register page tab-bar menu-bar
rfn-eshadow isearch easymenu timer select scroll-bar mouse jit-lock
font-lock syntax font-core term/tty-colors frame minibuffer cl-generic
cham georgian utf-8-lang misc-lang vietnamese tibetan thai tai-viet lao
korean japanese eucjp-ms cp51932 hebrew greek romanian slovak czech
european ethiopic indian cyrillic chinese composite charscript charprop
case-table epa-hook jka-cmpr-hook help simple abbrev obarray
cl-preloaded nadvice button loaddefs faces cus-face macroexp files
window text-properties overlay sha1 md5 base64 format env code-pages
mule custom widget hashtable-print-readable backquote threads dbusbind
inotify lcms2 dynamic-setting system-font-setting font-render-setting
cairo move-toolbar gtk x-toolkit x multi-tty make-network-process
native-compile emacs)

Memory information:
((conses 16 94870 10759)
 (symbols 48 7970 1)
 (strings 32 23722 1760)
 (string-bytes 1 872683)
 (vectors 16 16528)
 (vector-slots 8 305866 17210)
 (floats 8 71 35)
 (intervals 56 444 0)
 (buffers 992 14))



* bug#50718: 28.0.50; `split-string` fails on certain unicode strings
  2021-09-21  9:27 bug#50718: 28.0.50; `split-string` fails on certain unicode strings dalanicolai
@ 2021-09-21  9:44 ` Eli Zaretskii
  2021-09-21 15:29   ` Stefan Kangas
  2021-09-21  9:51 ` Andreas Schwab
  1 sibling, 1 reply; 5+ messages in thread
From: Eli Zaretskii @ 2021-09-21  9:44 UTC (permalink / raw)
  To: dalanicolai; +Cc: 50718

tags 50718 notabug
thanks

> From: dalanicolai <dalanicolai@gmail.com>
> Date: Tue, 21 Sep 2021 11:27:48 +0200
> 
> Evaluate: (split-string "१०.३" ".")
> It wrongly returns a list containing only empty strings.
> Of course, it should return a list with the individual Devanagari numbers.

That's a cockpit error: the SEPARATORS argument should be a regular
expression, so you should use "\\." instead.
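
A minimal sketch of the difference, using only the string from the report and the built-in functions `split-string` and `regexp-quote`. The expected values in the comments assume the default behaviour, where OMIT-NULLS is nil:

```elisp
;; SEPARATORS is a regexp, and the regexp "." matches any character
;; except newline.  Every character of the string is therefore treated
;; as a separator, and only empty strings remain in the result.
(split-string "१०.३" ".")

;; Escaping the period makes the regexp match a literal "." only.
(split-string "१०.३" "\\.")
;; ⇒ ("१०" "३")

;; `regexp-quote' builds the escaped regexp from a literal string,
;; which helps when the separator is not known in advance.
(split-string "१०.३" (regexp-quote "."))
;; ⇒ ("१०" "३")
```

The backslash is doubled because the Lisp reader consumes one level of escaping before the regexp engine sees the string.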






* bug#50718: 28.0.50; `split-string` fails on certain unicode strings
  2021-09-21  9:27 bug#50718: 28.0.50; `split-string` fails on certain unicode strings dalanicolai
  2021-09-21  9:44 ` Eli Zaretskii
@ 2021-09-21  9:51 ` Andreas Schwab
  2021-09-22  7:59   ` dalanicolai
  1 sibling, 1 reply; 5+ messages in thread
From: Andreas Schwab @ 2021-09-21  9:51 UTC (permalink / raw)
  To: dalanicolai; +Cc: 50718

On Sep 21 2021, dalanicolai wrote:

> Evaluate: (split-string "१०.३" ".")
> It wrongly returns a list containing only empty strings.

You have specified all characters as separators, since "." matches any
character.  If you want to match only the period you need to use "\\."
as the regexp.

Andreas.

-- 
Andreas Schwab, schwab@linux-m68k.org
GPG Key fingerprint = 7578 EB47 D4E5 4D69 2510  2552 DF73 E780 A9DA AEC1
"And now for something completely different."






* bug#50718: 28.0.50; `split-string` fails on certain unicode strings
  2021-09-21  9:44 ` Eli Zaretskii
@ 2021-09-21 15:29   ` Stefan Kangas
  0 siblings, 0 replies; 5+ messages in thread
From: Stefan Kangas @ 2021-09-21 15:29 UTC (permalink / raw)
  To: Eli Zaretskii; +Cc: 50718-done, dalanicolai

Eli Zaretskii <eliz@gnu.org> writes:

> tags 50718 notabug
> thanks
>
>> From: dalanicolai <dalanicolai@gmail.com>
>> Date: Tue, 21 Sep 2021 11:27:48 +0200
>>
>> Evaluate: (split-string "१०.३" ".")
>> It wrongly returns a list containing only empty strings.
>> Of course, it should return a list with the individual Devanagari numbers.
>
> That's a cockpit error: the SEPARATORS argument should be a regular
> expression, so you should use "\\." instead.

I'm therefore closing this bug report.






* bug#50718: 28.0.50; `split-string` fails on certain unicode strings
  2021-09-21  9:51 ` Andreas Schwab
@ 2021-09-22  7:59   ` dalanicolai
  0 siblings, 0 replies; 5+ messages in thread
From: dalanicolai @ 2021-09-22  7:59 UTC (permalink / raw)
  To: Andreas Schwab; +Cc: 50718

[-- Attachment #1: Type: text/plain, Size: 733 bytes --]

Haha, okay, that was an inexperienced (or not fully awake) mistake.
Anyway, I will not forget about that again, I guess. Thanks for the reply!

On Tue, 21 Sept 2021 at 11:51, Andreas Schwab <schwab@linux-m68k.org> wrote:

> On Sep 21 2021, dalanicolai wrote:
>
> > Evaluate: (split-string "१०.३" ".")
> > It wrongly returns a list containing only empty strings.
>
> You have specified all characters as separators, since "." matches any
> character.  If you want to match only the period you need to use "\\."
> as the regexp.
>
> Andreas.
>
> --
> Andreas Schwab, schwab@linux-m68k.org
> GPG Key fingerprint = 7578 EB47 D4E5 4D69 2510  2552 DF73 E780 A9DA AEC1
> "And now for something completely different."
>



end of thread, other threads:[~2021-09-22  7:59 UTC | newest]
