From: "Herman, Géza" <geza.herman@gmail.com>
To: Eli Zaretskii <eliz@gnu.org>
Cc: "Géza Herman" <geza.herman@gmail.com>, emacs-devel@gnu.org
Subject: Re: [PATCH] Implement fast verisons of json-parse functions
Date: Sat, 30 Mar 2024 11:50:19 +0100
Message-ID: <871q7snffr.fsf@gmail.com>
In-Reply-To: <865xx4dv0g.fsf@gnu.org>
Eli Zaretskii <eliz@gnu.org> writes:
> Thanks. I installed this on the master branch, after adding the
> required commit log messages and some cleanup of unused
> functions.
Thanks!
> However:
>
> . 3 tests in test/src/json-tests.el are now failing, where they
>   succeeded before; see the log at the end
> . the times of the relevant tests don't seem to be faster than
>   the libjansson version, perhaps because this is an unoptimized
>   build
3 test failures:
1. Handling of utf-8 decode errors: the new parser emits
json-utf8-decode-error, whereas the test expects
json-parse-error. I can fix this by modifying the test.
2. Handling of a single \0 byte: the new parser emits
json-end-of-file. I think this is not the best error kind for
this case, so I'll modify the parser to emit json-parse-error
instead. This is still different from what the test expects
(wrong-type-argument), but I think there is no reason to treat
zero bytes specially. Considering the JSON spec, it's the same
error as any other unexpected byte value.
3. Handling objects with duplicate keys. That's an interesting
one. With alist/plist objects, the old parser removed duplicate
members, but the new parser keeps them all. The JSON spec doesn't
really say anything about this
case, so I think we're free to do anything we like. Mattias
Engdegård had an interesting idea: what if we put alist/plist
members in reversed order? This way, if one uses assq/plist-get to
get values by keys, the behavior will be consistent with the hash
table representation (which keeps the last value of duplicate
keys). I like the idea of consistency, but I don't like that the
elements will become reversed after parsing. I had the idea that
if the hash table kept the first value of duplicate keys, then
we'd also have consistency. What do you think?
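To make the options concrete, here is a small Python model of the
lookups involved. It is only a sketch of the semantics, not the Emacs
code: parse_pairs, assq, hash_last_wins and hash_first_wins are
made-up helper names, and the Python json module stands in for the
parser.

```python
import json

def parse_pairs(text):
    """Return the members of a JSON object as (key, value) pairs,
    in source order, with duplicate keys kept (like an alist)."""
    return json.loads(text, object_pairs_hook=list)

def assq(key, alist):
    """Model of an alist lookup: the first matching pair wins."""
    for k, v in alist:
        if k == key:
            return (k, v)
    return None

def hash_last_wins(pairs):
    """Hash table where a later duplicate overwrites an earlier one."""
    table = {}
    for k, v in pairs:
        table[k] = v
    return table

def hash_first_wins(pairs):
    """Hash table where the first value of a duplicate key is kept."""
    table = {}
    for k, v in pairs:
        table.setdefault(k, v)
    return table

pairs = parse_pairs('{"a": 1, "b": 2, "a": 3}')

# Source-order alist vs. last-wins hash table: the two disagree on "a".
print(assq("a", pairs), hash_last_wins(pairs)["a"])

# Reversed alist vs. last-wins hash table: consistent.
print(assq("a", list(reversed(pairs))), hash_last_wins(pairs)["a"])

# Source-order alist vs. first-wins hash table: also consistent.
print(assq("a", pairs), hash_first_wins(pairs)["a"])
```

In this model both the second and third combinations give consistent
answers; the difference is only whether the alist keeps the order of
the JSON text.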
Regarding performance: the new parser becomes significantly
faster only on larger JSON inputs. And yes, an unoptimized build
also has an impact on performance.