From: Emanuel Berg via Users list for the GNU Emacs text editor <help-gnu-emacs@gnu.org>
To: help-gnu-emacs@gnu.org
Subject: Re: Any faster way to find frequency of words?
Date: Sun, 09 May 2021 20:00:30 +0200 [thread overview]
Message-ID: <87v97rzt1d.fsf@zoho.eu> (raw)
In-Reply-To: YJgZuQLiYg6pxKt3@protected.localdomain
Jean Louis wrote:
> I think that your (4) is not necessary, as counting is
> not necessary.
Some counting is if you are to learn the frequency.
How about `forward-word' the whole buffer and for every word
feed it to a data structure, which keeps a record and a digit
and increase that by 1?
Then the challenge would be to pick a data structure where
searching is fast and in particular where search time doesn't
_grow_ fast with respect to it's overall size growing (size =
the number of unique words)
BTW the theoretical worst-case would be a buffer where all
words are unique. Buffer cost is almost 1, ultimately n.
With the theoretical worst-case, data structure would be, if
linear, like this
if we denote buffer cost : data structure cost
1: 0 <-- first word
1: 1
1: 2
1: 3
..
1: n + 1 <-- last word
linear!
But probably data structure cost is less than linear, say
logarithmic, then we would have
linear(n) + n * logarithmic(n)
linear(n) will grow the faster, so linear!
Whatever you do with the data structure, it'll be fast enough!
--
underground experts united
https://dataswamp.org/~incal
next prev parent reply other threads:[~2021-05-09 18:00 UTC|newest]
Thread overview: 15+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-05-09 14:38 Any faster way to find frequency of words? Jean Louis
2021-05-09 14:56 ` Eric Abrahamsen
2021-05-09 15:05 ` Emanuel Berg via Users list for the GNU Emacs text editor
2021-05-09 17:16 ` Jean Louis
2021-05-10 3:37 ` Eric Abrahamsen
2021-05-10 7:14 ` Jean Louis
2021-05-10 14:02 ` [External] : " Drew Adams
2021-05-10 16:26 ` Jean Louis
2021-05-10 16:34 ` Drew Adams
2021-05-10 17:05 ` Jean Louis
2021-05-09 15:02 ` Emanuel Berg via Users list for the GNU Emacs text editor
2021-05-09 17:19 ` Jean Louis
2021-05-09 18:00 ` Emanuel Berg via Users list for the GNU Emacs text editor [this message]
2021-05-09 19:03 ` Jean Louis
2021-05-09 23:33 ` Emanuel Berg via Users list for the GNU Emacs text editor
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
List information: https://www.gnu.org/software/emacs/
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=87v97rzt1d.fsf@zoho.eu \
--to=help-gnu-emacs@gnu.org \
--cc=moasenwood@zoho.eu \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).