unofficial mirror of emacs-devel@gnu.org 
From: "Herman, Géza" <geza.herman@gmail.com>
To: Eli Zaretskii <eliz@gnu.org>
Cc: "Géza Herman" <geza.herman@gmail.com>, emacs-devel@gnu.org
Subject: Re: [PATCH] Implement fast verisons of json-parse functions
Date: Sat, 30 Mar 2024 11:50:19 +0100
Message-ID: <871q7snffr.fsf@gmail.com>
In-Reply-To: <865xx4dv0g.fsf@gnu.org>


Eli Zaretskii <eliz@gnu.org> writes:

> Thanks.  I installed this on the master branch, after adding the
> required commit log messages and some cleanup of unused functions.

Thanks!

> However:
>
>   . 3 tests in test/src/json-tests.el are now failing, where they
>     succeeded before; see the log at the end
>   . the times of the relevant tests don't seem to be faster than
>     the libjansson version, perhaps because this is an unoptimized
>     build

The 3 test failures:
1. Handling of UTF-8 decode errors: the new parser emits
json-utf8-decode-error, while the test expects json-parse-error.
I can fix this by modifying the test.
2. Handling of a single \0 byte: the new parser emits
json-end-of-file.  I think this is not the best error kind for
this case, so I'll modify the parser to emit json-parse-error
instead.  This still differs from what the test expects
(wrong-type-argument), but I think there is no reason to treat
zero bytes specially.  As far as the JSON spec is concerned, it's
the same error as any other unexpected byte value.
3. Handling of objects with duplicate keys.  That's an interesting
one.  With alist/plist objects, the old parser removed duplicate
members, but the new parser doesn't; it keeps them all.  The JSON
spec doesn't really say anything about this case, so I think we're
free to do anything we like.  Mattias Engdegård had an interesting
idea: what if we put alist/plist members in reversed order?  This
way, if one uses assq/plist-get to get values by keys, the behavior
will be consistent with the hash table representation (which keeps
the last value of duplicate keys).  I like the idea of consistency,
but I don't like that the elements will be reversed after parsing.
My alternative idea: if the hash table kept the first value of
duplicate keys instead, we'd also have consistency, without
reversing anything.  What do you think?
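
To make the two options for duplicate keys concrete, here is a small
model of the lookups involved.  It is written in Python rather than
Emacs Lisp or C, purely as a language-neutral sketch: first_match
stands in for assq/plist-get (the first matching key wins), to_table
stands in for filling a hash table, and every name as well as the
sample input is made up for illustration.  It is not the parser code.

```python
import json

# Hypothetical input: the key "a" appears twice.
TEXT = '{"a": 1, "b": 2, "a": 3}'


def parse_pairs(text):
    """Return the object's members as (key, value) pairs in source order,
    keeping duplicates (models an alist that keeps all members)."""
    return json.loads(text, object_pairs_hook=list)


def first_match(pairs, key):
    """Model of assq/plist-get: value of the first pair with this key."""
    for k, v in pairs:
        if k == key:
            return v
    return None


def to_table(pairs, keep="last"):
    """Model of a hash table; on duplicates either the last or the
    first value is kept, depending on `keep`."""
    table = {}
    for k, v in pairs:
        if keep == "first" and k in table:
            continue  # ignore later duplicates
        table[k] = v
    return table


pairs = parse_pairs(TEXT)

# Option A: reverse the list, keep the hash table as "last wins".
reversed_lookup = first_match(pairs[::-1], "a")
last_wins = to_table(pairs, keep="last")["a"]

# Option B: keep source order, make the hash table "first wins".
source_lookup = first_match(pairs, "a")
first_wins = to_table(pairs, keep="first")["a"]

print(reversed_lookup == last_wins, source_lookup == first_wins)
```

In both options the list lookup and the table lookup agree; the
difference is only whether the list comes back reversed (option A)
or the table changes which duplicate it keeps (option B).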

Regarding performance: the new parser only becomes significantly
faster on larger JSONs.  And yes, an unoptimized build also has
an impact on performance.


Thread overview: 21+ messages
2024-03-19 18:23 [PATCH] Implement fast verisons of json-parse functions Herman, Géza
2024-03-19 18:47 ` Eli Zaretskii
2024-03-19 18:50   ` Herman, Géza
2024-03-19 19:28     ` Eli Zaretskii
2024-03-19 19:50       ` Herman, Géza
2024-03-20  3:37         ` Eli Zaretskii
2024-03-23 10:40           ` Herman, Géza
2024-03-23 11:11             ` Eli Zaretskii
2024-03-30  7:43             ` Eli Zaretskii
2024-03-30 10:50               ` Herman, Géza [this message]
2024-03-30 13:00                 ` Eli Zaretskii
2024-03-30 18:36                   ` Herman, Géza
2024-03-30 20:18                     ` [External] : " Drew Adams
2024-03-30 20:51                       ` Herman, Géza
2024-03-30 23:28                         ` Drew Adams
2024-03-31  5:08                     ` Eli Zaretskii
2024-03-31 14:51                       ` Mattias Engdegård
2024-03-31 15:44                         ` Eli Zaretskii
2024-04-01  8:48                           ` Mattias Engdegård
2024-04-01 16:54                             ` Herman, Géza
2024-03-31 17:48                       ` Adam Porter