From mboxrd@z Thu Jan 1 00:00:00 1970
Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail
From: dalanicolai
Newsgroups: gmane.emacs.bugs
Subject: bug#50718: 28.0.50; `split-string` fails on certain unicode strings
Date: Tue, 21 Sep 2021 11:27:48 +0200
Message-ID:
Mime-Version: 1.0
Content-Type: multipart/alternative; boundary="00000000000070ba7005cc7e03d1"
Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="23710"; mail-complaints-to="usenet@ciao.gmane.io"
To: 50718@debbugs.gnu.org
Original-X-From: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org Tue Sep 21 11:29:19 2021
Return-path:
Envelope-to: geb-bug-gnu-emacs@m.gmane-mx.org
Original-Received: from lists.gnu.org ([209.51.188.17]) by ciao.gmane.io with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1mSc5W-0005zX-BU for geb-bug-gnu-emacs@m.gmane-mx.org; Tue, 21 Sep 2021 11:29:18 +0200
Original-Received: from localhost ([::1]:59238 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1mSc5V-000177-8o for geb-bug-gnu-emacs@m.gmane-mx.org; Tue, 21 Sep 2021 05:29:17 -0400
Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:52112) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1mSc5I-00015i-1X for bug-gnu-emacs@gnu.org; Tue, 21 Sep 2021 05:29:04 -0400
Original-Received: from debbugs.gnu.org ([209.51.188.43]:33386) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1mSc5H-0000JR-Oh for bug-gnu-emacs@gnu.org; Tue, 21 Sep 2021 05:29:03 -0400
Original-Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1mSc5G-0002Is-HP for bug-gnu-emacs@gnu.org; Tue, 21 Sep 2021 05:29:02 -0400
X-Loop: help-debbugs@gnu.org
Resent-From: dalanicolai
Original-Sender: "Debbugs-submit"
Resent-CC: bug-gnu-emacs@gnu.org
Resent-Date: Tue, 21 Sep 2021 09:29:02 +0000
Resent-Message-ID:
Resent-Sender: help-debbugs@gnu.org
X-GNU-PR-Message: report 50718
X-GNU-PR-Package: emacs
X-Debbugs-Original-To: bug-gnu-emacs@gnu.org
Original-Received: via spool by submit@debbugs.gnu.org id=B.16322164898779 (code B ref -1); Tue, 21 Sep 2021 09:29:02 +0000
Original-Received: (at submit) by debbugs.gnu.org; 21 Sep 2021 09:28:09 +0000
Original-Received: from localhost ([127.0.0.1]:44932 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mSc4O-0002HX-Vk for submit@debbugs.gnu.org; Tue, 21 Sep 2021 05:28:09 -0400
Original-Received: from lists.gnu.org ([209.51.188.17]:34398) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mSc4J-0002HI-13 for submit@debbugs.gnu.org; Tue, 21 Sep 2021 05:28:07 -0400
Original-Received: from eggs.gnu.org ([2001:470:142:3::10]:52024) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1mSc4I-000112-RB for bug-gnu-emacs@gnu.org; Tue, 21 Sep 2021 05:28:02 -0400
Original-Received: from mail-vk1-xa29.google.com ([2607:f8b0:4864:20::a29]:39729) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1mSc4G-0007sU-Pu for bug-gnu-emacs@gnu.org; Tue, 21 Sep 2021 05:28:02 -0400
Original-Received: by mail-vk1-xa29.google.com with SMTP id f73so5904046vkf.6 for ; Tue, 21 Sep 2021 02:28:00 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=mime-version:from:date:message-id:subject:to; bh=CeKw8G9cdmMUMch3Lh1tNNu/ItbE0FqXxN3Yn0TmsDk=; b=Ujvvw5FIk7Ab46mbA59ul6lRUZK0msO1P7TauPB1KhjCRa0l/VyTgxQ5oas5qCLed2 kN0qdGE2xB949aZTRmcGS86EUv9MXPETB9zkDp7EKEJBuOBOZAfwdAUECjVFkzmsc89Q cvvWPdw83kupYFBN+Y1YOZHNr3Rwx5DxZuHXP65Bbay1DKrtQ3OGZO4h8Lao25KM6gUb vLzL0bG0THimGF/l5Sw0tLBlnt740cFJgBou2BW9kb809CeUBnRmQ89g4OiuCJxf7IVz Syf3QvnZykGfk4ys2b1H1twQ4ZMrLKROASrwDltlPFHM4cc2Z3kg37ZjjEVSEWo2e1t/ KfJw==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
 d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:from:date:message-id:subject:to; bh=CeKw8G9cdmMUMch3Lh1tNNu/ItbE0FqXxN3Yn0TmsDk=; b=dLhVJpF1cvw0j7FonYra/+oIa0EQvRJX+Ucnu1uYaGgZ2hTuvCpHjsobsF9BPF5AMg OqEboqLM9c3xAldiLj/WydIClN3zcQoWeRMZCISKQyPNx+pAFSeG+0QvUEDjJ7qEeBEU HDZrnK3kVYNLym0AunJ9wjIVOmEwCgeDdKOZrY9FM72JUp5T8os059RDBIoKUbw5L1Wy 035Svh7mbwVJsGw9IZ4f6qnasIqHhm+AVHGnD+KWOPAMi0UzvyZ0marAXdWPTzBI98ru Rsu7hCAvLL0VoDlhwcBxuqORqf+hWhV0aF4Jee3gpSU3PjTUd447FupbRbXlqxeh7ZOH GwoA==
X-Gm-Message-State: AOAM531Mzpu+OAadszg5AyhF3zLovWWVZyBzdeWhOr7StSO+8eu5kN/g wXFZUBmxWD7Nel2PRXq+fZs6AFWCRVenNLd5T8218fJEeWE=
X-Google-Smtp-Source: ABdhPJxuTIW16cXK9D8AFcV+zv7L2t+u7s5JYXgJNPk6HOk8XenY3AMNj9nE9UHfYNKUlrU7N1PgeoS7VX19zSN21uM=
X-Received: by 2002:a1f:9f10:: with SMTP id i16mr10434342vke.0.1632216479217; Tue, 21 Sep 2021 02:27:59 -0700 (PDT)
Received-SPF: pass client-ip=2607:f8b0:4864:20::a29; envelope-from=dalanicolai@gmail.com; helo=mail-vk1-xa29.google.com
X-Spam_score_int: -20
X-Spam_score: -2.1
X-Spam_bar: --
X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no
X-Spam_action: no action
X-BeenThere: debbugs-submit@debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
X-BeenThere: bug-gnu-emacs@gnu.org
List-Id: "Bug reports for GNU Emacs, the Swiss army knife of text editors"
List-Unsubscribe: ,
List-Archive:
List-Post:
List-Help:
List-Subscribe: ,
Errors-To: bug-gnu-emacs-bounces+geb-bug-gnu-emacs=m.gmane-mx.org@gnu.org
Original-Sender: "bug-gnu-emacs"
Xref: news.gmane.io gmane.emacs.bugs:214938
Archived-At:

--00000000000070ba7005cc7e03d1
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

Evaluate: (split-string "१०.३" ".")
It wrongly returns a list containing only empty strings.
Of course it should return a list with the individual Devanagari numbers.

In GNU Emacs 28.0.50 (build 3, x86_64-pc-linux-gnu, GTK+ Version 3.24.30, cairo version 1.17.4)
 of 2021-09-06 built on daniel-fedora
Repository revision: c4724add006e62b81f847937db56335a81bdcc74
Repository branch: master
Windowing system distributor 'The X.Org Foundation', version 11.0.12011000
System Description: Fedora 34 (Workstation Edition)

Configured using:
 'configure --with-mailutils --with-cairo --with-modules --with-pgtk
 --with-native-compilation'

Configured features:
ACL CAIRO DBUS FREETYPE GIF GLIB GMP GNUTLS GPM GSETTINGS HARFBUZZ JPEG
JSON LCMS2 LIBOTF LIBSELINUX LIBSYSTEMD LIBXML2 M17N_FLT MODULES
NATIVE_COMP NOTIFY INOTIFY PDUMPER PNG RSVG SECCOMP SOUND THREADS TIFF
TOOLKIT_SCROLL_BARS X11 XDBE XIM XPM GTK3 ZLIB

Important settings:
  value of $LANG: en_US.UTF-8
  value of $XMODIFIERS: @im=none
  locale-coding-system: utf-8-unix

Major mode: Lisp Interaction

Minor modes in effect:
  tooltip-mode: t
  global-eldoc-mode: t
  eldoc-mode: t
  electric-indent-mode: t
  mouse-wheel-mode: t
  tool-bar-mode: t
  menu-bar-mode: t
  file-name-shadow-mode: t
  global-font-lock-mode: t
  font-lock-mode: t
  blink-cursor-mode: t
  auto-composition-mode: t
  auto-encryption-mode: t
  auto-compression-mode: t
  line-number-mode: t
  indent-tabs-mode: t
  transient-mark-mode: t

Load-path shadows:
None found.

Features:
(shadow sort mail-extr emacsbug comp comp-cstr warnings rx message rmc
puny dired dired-loaddefs rfc822 mml mml-sec epa derived epg rfc6068
epg-config gnus-util rmail rmail-loaddefs auth-source cl-seq eieio
eieio-core cl-macs eieio-loaddefs password-cache json map mm-decode
mm-bodies mm-encode mail-parse rfc2231 mailabbrev gmm-utils mailheader
sendmail rfc2047 rfc2045 ietf-drums mm-util mail-prsvr mail-utils
time-date subr-x cl-extra shortdoc text-property-search seq byte-opt gv
bytecomp byte-compile cconv help-fns radix-tree help-mode cl-loaddefs
cl-lib iso-transl tooltip eldoc electric uniquify ediff-hook vc-hooks
lisp-float-type mwheel term/x-win x-win term/common-win x-dnd tool-bar
dnd fontset image regexp-opt fringe tabulated-list replace newcomment
text-mode elisp-mode lisp-mode prog-mode register page tab-bar menu-bar
rfn-eshadow isearch easymenu timer select scroll-bar mouse jit-lock
font-lock syntax font-core term/tty-colors frame minibuffer cl-generic
cham georgian utf-8-lang misc-lang vietnamese tibetan thai tai-viet lao
korean japanese eucjp-ms cp51932 hebrew greek romanian slovak czech
european ethiopic indian cyrillic chinese composite charscript charprop
case-table epa-hook jka-cmpr-hook help simple abbrev obarray
cl-preloaded nadvice button loaddefs faces cus-face macroexp files
window text-properties overlay sha1 md5 base64 format env code-pages
mule custom widget hashtable-print-readable backquote threads dbusbind
inotify lcms2 dynamic-setting system-font-setting font-render-setting
cairo move-toolbar gtk x-toolkit x multi-tty make-network-process
native-compile emacs)

Memory information:
((conses 16 94870 10759)
 (symbols 48 7970 1)
 (strings 32 23722 1760)
 (string-bytes 1 872683)
 (vectors 16 16528)
 (vector-slots 8 305866 17210)
 (floats 8 71 35)
 (intervals 56 444 0)
 (buffers 992 14))

--00000000000070ba7005cc7e03d1
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
--00000000000070ba7005cc7e03d1--
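
A note on the evaluation in the report above, offered as a sketch rather than a diagnosis: the SEPARATORS argument of `split-string` is documented as a regular expression, and in Emacs regexps an unescaped "." matches any single character except a newline. Assuming that is what happens here, every character of the input, the Devanagari digits included, would be consumed as a separator, which would leave only empty strings. The helper name `my-split-on-literal` below is hypothetical and exists only for illustration; `split-string` and `regexp-quote` are the standard functions.

```emacs-lisp
;; Hypothetical helper for illustration; not part of Emacs.
(defun my-split-on-literal (string separator)
  "Split STRING at each literal occurrence of SEPARATOR.
SEPARATOR is passed through `regexp-quote', so characters such as
\".\" lose their special regexp meaning."
  (split-string string (regexp-quote separator)))

;; Unquoted: "." is a regexp that matches any character, so each
;; character of the input is treated as a separator.
(split-string "१०.३" ".")

;; Quoted: only the literal period separates the fields.
(my-split-on-literal "१०.३" ".")   ; => ("१०" "३")
```

If the quoted form returns the two expected fields on the affected build, the behaviour is probably down to the regexp interpretation of the separator and not to the non-ASCII input.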